Configuring a Butler

A Gen 3 butler is configured using YAML files. Each main section of the YAML file is handled by its own sub configuration. Each config specialization (registry, schema, storageClass, composites, and dimensions) knows the name of the key for its section of the configuration and the names of the files that provide defaults and overrides for it. Additionally, if the sub configuration contains a cls key, that class is imported and can supply an additional configuration file name through its defaultConfigFile property. All the keys within a sub configuration are processed by the class constructed from cls.

The primary source of default values is $DAF_BUTLER_DIR/config. This directory contains YAML files matching the names specified in the sub config classes and can also include files named by the corresponding component class (for example, PosixDatastore specifies that its configuration should be found in datastores/posixDatastore.yaml).

There are additional search paths that can be included when a config object is constructed:

An explicit list of directory paths to search, passed into the constructor.
Paths defined in the DAF_BUTLER_CONFIG_PATH environment variable.
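As a rough illustration, the search locations could be gathered as in the sketch below. This is not the daf_butler implementation: build_search_path is a hypothetical helper, and splitting DAF_BUTLER_CONFIG_PATH on the platform path separator is an assumption.

```python
import os


def build_search_path(explicit_paths=None, environ=None):
    """Hypothetical helper: list config directories, highest priority first."""
    environ = os.environ if environ is None else environ

    # 1. Directories passed explicitly to the constructor.
    paths = list(explicit_paths or [])

    # 2. Directories from the environment variable (assumed to be
    #    separated by os.pathsep, like PATH).
    env_value = environ.get("DAF_BUTLER_CONFIG_PATH", "")
    paths.extend(p for p in env_value.split(os.pathsep) if p)

    # 3. The package defaults in $DAF_BUTLER_DIR/config, if the variable is set.
    if "DAF_BUTLER_DIR" in environ:
        paths.append(os.path.join(environ["DAF_BUTLER_DIR"], "config"))

    return paths
```

Passing a plain dictionary as environ makes the helper easy to exercise without touching the real environment.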

To construct a Butler config object from a file, the following happens:

The supplied config is read in.
Each sub configuration class is constructed by supplying the relevant subset of the global config to the component Config constructor.
A search path is constructed by concatenating the supplied search path, the environment variable path, and the daf_butler config directory.
Defaults are first read from the config class default file name (e.g., registry.yaml) and merged in the priority order given by the search path.
Then any cls-specific config files are read, overriding the current defaults.
Finally, any child configurations are read as specified in cls.containerKey (assumed to be a list of configurations compatible with the current config class). This allows, for example, a ChainedDatastore to expand its child posixDatastore configurations using the same rules.
Values specified in butler.yaml always take priority, since values explicitly defined in the butler YAML file are assumed to be more important than values read from defaults.
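The priority rules in the steps above can be sketched with plain nested dictionaries. This is a minimal sketch, not the daf_butler code: merge and resolve are hypothetical names, the database strings and class name are made-up values, and real config objects are richer than dicts.

```python
import copy


def merge(base, override):
    """Return a new dict in which values from override win, recursively."""
    result = copy.deepcopy(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(result.get(key), dict):
            result[key] = merge(result[key], value)
        else:
            result[key] = copy.deepcopy(value)
    return result


def resolve(supplied, defaults_by_priority):
    """Combine defaults (highest priority first) with the supplied config.

    Defaults are applied from lowest to highest priority, and the supplied
    config is applied last so that it always wins.
    """
    merged = {}
    for defaults in reversed(defaults_by_priority):
        merged = merge(merged, defaults)
    return merge(merged, supplied)


# Hypothetical values, for illustration only.
package_defaults = {"registry": {"db": "sqlite:///:memory:", "cls": "SqlRegistry"}}
search_path_defaults = {"registry": {"db": "sqlite:///user.db"}}
butler_yaml = {"registry": {"db": "sqlite:///repo.db"}}

config = resolve(butler_yaml, [search_path_defaults, package_defaults])
```

Here config["registry"]["db"] comes from butler_yaml, while config["registry"]["cls"] falls through to the package defaults because nothing with higher priority sets it.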

We also have a YAML parser extension, !include, that can be used to pull in other YAML files before the butler-specific config parsing happens. This is very useful for reusing YAML snippets, but be aware that the path specified is relative to the file that contains the directive.

Michelle Gower's use case changes things a little.

For this use case they have a butler.yaml file that cannot be edited. Furthermore, to access the registry, some user-specific overrides must be given in the registry.db part of the configuration, overriding the value specified in the butler.yaml configuration. The current system has no way to deal with this short of copying the butler.yaml file (which would also make any butlerRoot directives incorrect, so they would have to be replaced).

One way to fix this is to change the behavior of search paths so that explicit search paths given to the constructor and environment variable paths can overwrite the supplied configuration, and are treated as distinct from the $DAF_BUTLER_DIR/config defaults, which cannot overwrite it.
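A sketch of how the proposed layering might behave, again using plain dictionaries. Nothing here is existing daf_butler API: resolve_proposed and deep_update are hypothetical names, and the connection strings and class name are invented for illustration.

```python
import copy


def deep_update(target, source):
    """Update target in place with source, recursing into nested dicts."""
    for key, value in source.items():
        if isinstance(value, dict) and isinstance(target.get(key), dict):
            deep_update(target[key], value)
        else:
            target[key] = copy.deepcopy(value)
    return target


def resolve_proposed(butler_yaml, override_layers, package_defaults):
    """Proposed order: package defaults < butler.yaml < search-path overrides.

    override_layers holds configs found via the explicit and environment
    search paths, highest priority first.
    """
    config = {}
    deep_update(config, package_defaults)   # can never overwrite butler.yaml
    deep_update(config, butler_yaml)        # the read-only shared file
    for layer in reversed(override_layers):
        deep_update(config, layer)          # user overrides win
    return config


# Hypothetical values, for illustration only.
shared = {"registry": {"db": "sqlite:///shared.db"}, "datastore": {"root": "<butlerRoot>"}}
user = {"registry": {"db": "postgresql://user@host/db"}}
defaults = {"registry": {"db": "sqlite:///:memory:", "cls": "SqlRegistry"}}

config = resolve_proposed(shared, [user], defaults)
```

With this ordering the user-specific registry.db replaces the value from the shared butler.yaml without copying or editing that file, and the rest of the shared file, including its butlerRoot-based entries, is left as it was.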
