...
- Run ci_hsc with large set of Gen3 changes. (
- Waiting for
to merge to masterJira server JIRA columns key,summary,type,created,updated,due,assignee,reporter,priority,status,resolution serverId 9da94fb6-5771-303d-a785-1b6c5ab0f2d2 key DM-17023
- Waiting for
- Run RC2 patch through as much of DRP as applicable
- needs RC2 bootstrap
- Run RC2
- DB folks will now have larger set of data flow information for analysis (along with updated schemas)
Planning
- When OODS includes Gen3 (Tucson and Mountain)
- Raw Ingest (AuxTel and ComCam)
- 2019-09-19 obs_lsst package ready, ingest API ready (see ci_hsc_gen3 for example code). Request to not have to specify instrument class with every ingest. Discussed saving class in registry when registering instrument.
- Purge (to keep size of registry small)
- 2019-09-19 Butler delete functionality ready (but not duration related). Perhaps could code a Datastore with expiration feature instead of an external driver.
- Multi-registry
- Staff schemas (db admin procedures, etc)
- Unique Site IDs?
- 2019-09-19 could be implemented in Jim's next big ticket.
- Get calibs from any production runs
- 2019-09-19 Have export and import when current big ticket merges to master. Not clear if import is enough or will need a merge.
- Raw Ingest (AuxTel and ComCam)
- When NCSA DBB endpoint includes Gen3
- Raw Ingest (AuxTel and ComCam)
- Ingestion of certain LFA files
- 2019-09-19 Wavefronts should be treated like raw, calibration spectrograph images and CBP images should be new instruments. Need example files to determine whether any work needed.
- Multi-registry
- Staff schemas (db admin procedures, etc)
- Unique Site IDs
- EFD ETL (Quick Look & DBB copy)
- No additonal Gen3 work because joins would happen on EXPID?
- 2019-09-13
- Need to figure out how this is going to work
- Will Butler.get read header information from the EFD tables instead of from disk?
- (If yes, then a job scratch repo/subset will need to also copy these EFD entries)
- A third source of exposure information? CAOM-compliant exposure table
- 2019-09-13
- Will EFD ETL values be used in QuantumGraph generation?
- No additonal Gen3 work because joins would happen on EXPID?
- Gen3 "production" pipeline runs
- HSC RC2 DRP running in Gen3 at some cadence (like we currently do with Gen2)
- AuxTel pipelines (calibration? drp?)
- BPS
- HTCondor version of BPS
- BPS job activator
- Next OPS rehearsal (Feb-ish)
- Not clear that it will use Gen3, but can assume that is the hope.
- Could include everything from image generation through accessing it via LSP (multi-registry)
Technical Discussions
- Discussion of Tim's review of ticket
Jira server JIRA columns key,summary,type,created,updated,due,assignee,reporter,priority,status,resolution serverId 9da94fb6-5771-303d-a785-1b6c5ab0f2d2 key DM-17023 - Repo import/export work done to make ci_hsc_gen3 less fragile.
- Export is pretty much the subset functionality
- Import does not replace merge. Data is inserted into repo as is (with no id modification).
- Support re-run of pipetask on the same output collection (
)Jira server JIRA columns key,summary,type,created,updated,due,assignee,reporter,priority,status,resolution serverId 9da94fb6-5771-303d-a785-1b6c5ab0f2d2 key DM-20845 - Discussion of how using collection in filenames and using collections to look up run. Run ids are not human friendly. And collections can be more than 1 run. Instead switch to using a run name in the filename and to identify runs. Jim Boschto write up in ticket.
- obs_lsst gen3 support now merged (
)Jira server JIRA columns key,summary,type,created,updated,due,assignee,reporter,priority,status,resolution serverId 9da94fb6-5771-303d-a785-1b6c5ab0f2d2 key DM-16297 - Need to sort out some things in data model
...
- :
- discussion started because needing to use OBSID in the OCS Batch Service
...
- e.g. OBSID vs EXPID, sequence number, etc.
- Tim Jenness to add as needed.
- Provenance
- Near-term work Butler could start adding quantum records at execution (i.e., saving provenance tree).
- Longer-term understanding what pieces are responsible for saving information that can be duplicated information (e.g., environment when running a quantum)