Configuration#
3LC has several configuration options, which control data discovery, logging, and network configuration.
All the options are documented in the tlcconfig.options
module. The documentation for each option details the
configuration file key, environment variable and command-line argument to the 3LC Object Service that can be
used to set the option.
3LC tries to have as good defaults as possible, however some times you may want to persist a machine global
configuration. This can be achieved by using the default configuration file. The configuration file is located in
$HOME/.config/3LC/config.yaml
on Linux and in %LOCALAPPDATA%\3LC\3LC\config.yaml
on Windows. The location can be
overridden by the TLC_CONFIG_FILE
environment variable.
To create an initial configuration file, you can run the following command:
$ 3lc config --write #add --quiet to produce less verbose output
It is also easy to inspect the output at the command line by running:
$ 3lc config --to-yaml
The config command will try and read the current configuration, including the environment variables and command line
options given to the command.
This means that when writing the configuration file with config --write
, it will first read the current
configuration and write the loaded configuration to file.
This makes it also possible to update the configuration file by passing new options and environment variables to the
config
command and supply --write --force
to enforce overwriting the existing configuration file.
# Setting the root index location in the configuration file
$ 3lc --project-root-url "/some/location" config --write --force
# Setting alias with name CIFAR10_LOCATION in the configuration file
$ TLC_ALIAS_CIFAR10_LOCATION="/some/location" 3lc config --write --force
An example configuration file can be seen below.
Example config.yaml
## Configuration file for `tlc`.
##
## This YAML file contains setting for the `tlc` Python Package.
## Created at 2025-01-08 02:33:20
## Documentation starts with ##,
## commented out default values start with #
##
## Service Settings
service:
## Port for the server.
# port: 5015
## Host for the server.
# host: 127.0.0.1
## Specify license or license file
##
## The option can either be the license key or point to a local file containing the license key.
# license:
## Specify the amount of memory to use for caching, in bytes.
##
## Setting the value to 0 will disable in-memory caching.
##
## Default: 1073741824 (1 GB)
# cache_in_memory_size: 1073741824
## Specify the cache item time out, in seconds.
##
## Setting the value to 0 will disable cache eviction based on time.
##
## Default: 3600 (1 hour)
# cache_time_out: 3600
##
## Indexing Settings
indexing:
## Location for reading and writing 3LC data.
##
## This option is mandatory and must point to a location (e.g. directory on disk or object store bucket) with write
## access. The location will be created if it does not exist.
##
## If the option value contains an environment variable, it will be expanded.
# project-root-url: /home/build/.local/share/3LC/projects
## Locations to scan for 3LC objects (runs and tables) following a standard 3LC project layout.
##
## The option value should be a list of URLs to scan for 3LC objects. Items in the list can be represented either by a
## string value (e.g., "s3://bucket") or as a dictionary with the required field 'url' and optional extra fields
## (e.g., {"url": "s3://bucket", "static": True}).
##
## Example values:
## - "C:\Users\user\Documents\3LC\projects"
## - "s3://my-bucket/3LC/projects"
## - {"url": "s3://my-bucket/3LC/read-only-projects", "static": True}
##
## The only public optional field is 'static', which, when set to 'True', prevents the indexer from re-scanning
## the location.
##
## Each URL in this list is assumed to contain sub-projects. Default (sub-)directories will be created if they do
## not exist.
# project-scan-urls:
##
## Logging Settings
logging:
## Log file for the 3LC logger.
##
## The directory will be created if it does not exist.
##
## If the option value contains an environment variable, it will be expanded.
# logfile: /home/build/.local/state/3LC/log/3LC.log
## Log level for the 3LC logger.
##
## The `tlc` Python package adheres to the standard Python logging levels:
##
## - DEBUG: Detailed information, typically of interest only when diagnosing problems.
## - INFO: Confirmation that things are working as expected.
## - WARNING: An indication that something unexpected happened, or indicative of some problem in the near future
## (e.g. "disk space low"). The software is still working as expected.
## - ERROR: Due to a more serious problem, the software has not been able to perform some function.
## - CRITICAL: A serious error, indicating that the program itself may be unable to continue running.
# loglevel: WARNING
##
## Tlc Settings
tlc:
## Whether to display progress bars or not.
##
## The option can be either 0 or 1, where 0 means no progress bars and 1 means progress bars.
# display-progress: 1
## List of registered aliases
##
## The option value should be a dictionary where each key is an alias name and its value is the alias target (i.e. a
## path).
##
## **Anchored alias syntax**
##
## When aliases are defined in configuration files, their values can reference paths relative to the file's location.
## Use the `$.` prefix (and subsequent dots) to move up directories relative to the configuration file:
##
## - `$./data` => the `data` directory next to the configuration file.
## - `$../data` => the `data` directory one level above the configuration file.
## - `$.../data` => the `data` directory two levels above the configuration file.
##
## **Default aliases**
##
## Default aliases come from special configuration files named `default_aliases.3lc.yaml` in indexed locations,
## providing baseline values without local configuration. They have lower precedence than locally defined aliases.
##
## The system searches for default aliases inside each indexed project, hence the name project aliases is often used:
## - `<PROJECT_SCAN_URL>/<project_xyz>/default_aliases.3lc.yaml` # project aliases
##
## For completeness, the system also searches for default aliases in the following scan locations:
## - `<PROJECT_SCAN_URL>/default_aliases.3lc.yaml`
## - `<EXTRA_TABLE_SCAN_URL>/default_aliases.3lc.yaml`
## - `<EXTRA_RUN_SCAN_URL>/default_aliases.3lc.yaml`
##
##
## **Examples**
##
## ```yaml
## aliases:
## PROJECT_XYZ_REMOTE_ALIAS: s3://my-bucket/3LC/projects/project-xyz
## PROJECT_XYZ_LOCAL_ALIAS: /home/user/projects/project-xyz
## ```
##
## Special anchored syntax examples:
## ```yaml
## aliases:
## PROJECT_XYZ_DATA_ALIAS: $./data # expands to the data directory relative to the configuration file itself
## PROJECT_XYZ_PARENT_DATA_ALIAS: $../data # relative to parent dir
## PROJECT_XYZ_PARENT_PARENT_DATA_ALIAS: $.../data # relative to parent's parent dir
## ```
# aliases:
Configuration files#
Resolution#
Primary configuration
The system attempts to load configuration from:The file specified by
--config-file
The file referenced by the
TLC_CONFIG_FILE
environment variableA default, platform-dependent configuration file
Project root configuration
After the primary configuration, ifconfig.3lc.yaml
exists in the project root (set by the PROJECT_ROOT_URL_OPTION), it will be loaded and merged with the current configuration.
Disabling configuration file reading
If
--no-config-file
is specified on the command line orTLC_CONFIG_FILE
is set to an empty string, no configuration files are considered.
Default/project aliases#
To make sharing of projects and results easier the system supports a special type of partial configuration file that can be used to persist default values for project aliases. These files must adhere to the following rules:
Naming: The files must be named
default_aliases.3lc.yaml
.Content: The files may only contain only the
aliases
option. For further details, refer to themodule documentation
.Location: The files must be placed inside a project or scan folder:
<PROJECT_SCAN_URL>/project-a/default_aliases.3lc.yaml
# project aliases<EXTRA_XXX_SCAN_URL>/default_aliases.3lc.yaml
# default aliases for single folder
Project alias configuration files are read by the indexing system and will be reloaded when changed.
Overriding configuration options#
3LC follows the following precedence rules for configuration:
api-call > command-line argument > environment variable > configuration file
The rules above should be straightforward when launching the Object Service from the command, but it is more complicated
when importing the tlc package in a Jupyter notebook (e.g. import tlc
). Importing tlc
causes the default
configuration file to be read, and it is too late to modify these options after the module has been imported.
For this reason, there exists a tlcconfig
-module that allows for creating and modifying an
OptionLoader
-instance prior to importing the main tlc
-module. See the
module documentation
for an example of how to use this module.