Note that the “Dinky” — the train from Princeton Junction to Princeton — will be suspended during the DMLT meeting. A substitute bus service will be provided by NJ Transit. Please allow extra time for your journey.
We discussed whether some DM work can be pushed into the future as effectively a “no cost extension” to construction; the practicalities of this are not clear:
In terms of what's acceptable to the agencies.
And in terms of whether there will be staffing in the operations era to work on this.
If there are people in your team/institute who would be interested in moving to AURA positions, please let Wil O'Mullane know.
There are practical issues here regarding the states in which people may reside, and whether office space is available for them.
And of course the downsides to breaking up groups.
DM10 alert filters interface may actually represent a scope increase in 02C.03 in terms of providing a mini-broker interface.
We believe that the current portal is sufficient for commissioning data releases (after finishing the closeout plan).
We discussed the increased staffing load resulting from making commissioning data widely available: this is expected to be significant.
“As discussed at the PST F2F meeting in September, DM should review the names of people going into commissioning” (requested by Leanne Guy).
It is not immediately clear whether this means permanent reassignment of staff, or DM folks temporarily assigned to assist the Commissioning Team with specific activities.
The commissioning support activities listed are validation of the DM system, but verification of the LSST system.
None of the commissioning activities listed cover aspects of the system outside Science Pipelines.
Concern expressed about the impacts of people being reassigned to commissioning on the rest of the DM team.
Often the people who might be most effective with commissioning are also those most necessary for facilitating efforts within DM; concern expressed that this will have a disproportionate impact on the DM schedule.
Detailed definition of the contents of the commissioning tests is with Leanne Guy and Keith Bechtol. Expectation that they will often involve repeating tests that have been performed within DM.
Early Ops funding is now available, and during FY19 (i.e., this year) the ADs for both Data Facility and Science Operations (Margaret Gelman and Wil O'Mullane) are funded at 0.25 FTE each. That's half an FTE coming out of DM management. How are we handling that?
Similarly there's a total of ~15 FTEs funded across Data Facility and Science Ops during FY20.
Please review:
The plans and schedule for transitioning staff;
The activities which the Operations Team will be carrying out with this effort, and how they relate to ongoing DM construction and the commissioning effort.
Some milestones (e.g. ops rehearsals) are effectively duplicated between DM and pre-operations funding; where possible, they will migrate from DM to pre-ops.
There's a worry that commissioning data may become available to the public more quickly than we currently plan; DM should be ready to scale up to address this.
And a feeling that simply making the data available for download will not adequately address this need.
But we acknowledge the cost and schedule impacts of this.
There's also a concern about what level of end-user support is implied by this.
How do we handle folks being required in DM and commissioning and pre-ops?
This is a matter of ongoing planning. There may be some overlap/double counting.
Discussion of how open the access to commissioning data & facilities should be, balancing getting input on commissioning from members of the community with the support load and “chaos” of wide access.
Wil O'Mullane — coordinate the writing of a memo describing what level of community access DM can support during commissioning.
The proposal is to develop a policy for handling packages which have been developed externally and which their authors offer up for inclusion in pipeline processing, the Science Platform environment, or elsewhere in the DM system.
Obvious examples might be scientific algorithms contributed by the community.
There is a history of external users refusing to make contributions like this due to the demands of DM engineering (code quality, review, tests, etc.).
Based on the modeling work done before this year's review, review the proposed product tree, the characterization of its components, and their relation to the document tree.
“Inside pipelines, there are more or less five products” — not every Git repository or software package is a product.
“SW Products can depend on other SW products without containing them” — so there is a SW product that contains e.g. the Butler, which can be depended upon by other products.
SW Products are the unit of release.
Dependency relationships happen between SW products, rather than between SW packages.
All DMLT: Review product tree in LDM-294 and provide feedback/corrections to gcomoretto.
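The product/package distinction above — products contain packages, but dependencies run between products — can be modelled as simple metadata. A minimal sketch; the product and package names here are invented for illustration and are not the actual LDM-294 product tree:

```python
from dataclasses import dataclass, field

@dataclass
class SWProduct:
    """A software product: the unit of release (per the discussion above)."""
    name: str
    packages: list = field(default_factory=list)    # Git repos contained in this product
    depends_on: list = field(default_factory=list)  # other SWProducts, never bare packages

# Hypothetical example: a "middleware" product containing the Butler package,
# depended upon by a "pipelines" product (names invented for this sketch).
middleware = SWProduct("middleware", packages=["daf_butler"])
pipelines = SWProduct("pipelines", packages=["pipe_base", "pipe_tasks"],
                      depends_on=[middleware])

def release_order(product, seen=None):
    """Return products in dependency order: dependencies before dependents."""
    seen = [] if seen is None else seen
    for dep in product.depends_on:
        release_order(dep, seen)
    if product not in seen:
        seen.append(product)
    return seen

print([p.name for p in release_order(pipelines)])  # dependencies come first
```

The key design point, matching the notes: the dependency edge lands on the containing product (`middleware`), not on the individual package (`daf_butler`), so releases stay coarse-grained.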
The Data Facility can execute based on whatever version of the middleware the developers are using.
There's no requirement for pipeline developers to support Gen2 from the LDF point of view.
Most Data Facility work going forward is based on Pegasus; there is minimal ongoing support for DESDM.
Assuming availability of pipeline code, the LDF predicts that they could run pipelines in a “sustained processing” mode based on Butler G3 and Pegasus in mid-2019.
Two possible priorities for Butler Gen3 (BG3) work:
Support for obs_lsst (to make RHL & Merlin's life easier)
Or to convert code to PipelineTask to enable execution at scale of e.g. HSC on the Data Facility.
The consensus is that the latter is the priority; agreed to prioritise the conversion of ci_hsc Tasks to PipelineTasks until end of January 2019 per Fritz Mueller's recommendation.
This would also meet Frossie Economou's immediate validate_drp use case.
We agreed that temporarily abandoning the shared-nothing model for execution might enable faster development.
We should also prepare for “plan B” by assembling a “mini-working-group” to consider wholesale technological change (mini-WG to consist of at least Kian-Tat Lim & Simon Krughoff; not to involve folks who are busy with the ci_hsc conversion).
Kian-Tat Lim, Simon Krughoff — convene a mini-WG to create an alternative plan to continuing with the existing approach to PipelineTask/ButlerG3 middleware.
John Swinbank — designate and/or start a recruitment process for a “systems programmer” to act as long-term middleware owner.
Are there ways of providing greater uptime and lower latency using rolling upgrades, schema evolution with backfill, planned Observatory maintenance windows and daytime maintenance, etc.?
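The “schema evolution with backfill” pattern mentioned above can be illustrated concretely: add the new column as a nullable/defaulted addition (so existing readers never break), then fill it in afterwards. A minimal sketch using SQLite and a hypothetical `visit` table:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE visit (id INTEGER PRIMARY KEY, band TEXT)")
conn.executemany("INSERT INTO visit (band) VALUES (?)", [("g",), ("r",)])

# Step 1: additive schema change. Old readers keep working because the new
# column arrives with a NULL default, so no downtime window is needed.
conn.execute("ALTER TABLE visit ADD COLUMN quality_flag INTEGER DEFAULT NULL")

# Step 2: backfill the new column (in production this would run in small
# batches); readers meanwhile see either NULL or the backfilled value.
conn.execute("UPDATE visit SET quality_flag = 0 WHERE quality_flag IS NULL")

rows = conn.execute(
    "SELECT id, band, quality_flag FROM visit ORDER BY id").fetchall()
print(rows)
```

Combined with rolling upgrades of the services reading the table, this is one way the uptime/latency question above could be answered without a full maintenance outage.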
We're lacking L3 milestones in PMCS which describe the availabilities of services and capabilities which we know are coming, particularly later in construction.
Obvious examples include:
Butler Gen 3 / PipelineTask as the regular production environment;
Roll-out of the Pegasus WMS (or some other WMS);
Data Backbone capabilities;
There are no DAX milestones beyond the end of November 2018;
There are five SQuaRE milestones total, all of which refer to notebooks (and which are not necessarily SQuaRE deliverables — see below).
In addition, some existing milestones seem unclear about who is delivering what. For example:
“DM-SUIT-16: Commissioning DAC” — is that really a SUIT deliverable?
“DM-SQRE-5: Notebook service ready for general science” — is that really a SQuaRE deliverable?
I assert:
We should have milestones describing the delivery of effectively everything described in LDM-148.
Are there other things which are not in LDM-148 which we need to track?
“Standing up a service” milestones are all for the Data Facility, and should be dependent on prerequisite milestones for software delivery.
We should have a brief (~few sentence) description for every milestone.
We might consider having a more formal verification procedure, which would relate successful completion of these milestones to test execution.
I suspect that such a procedure would lose us more in time & overheads than it gains us, but I am open to being convinced otherwise.
Can we produce a revised set of milestones? When is an appropriate due date — January 2019?
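The assertion above that service stand-up milestones should depend on prerequisite software-delivery milestones could be recorded as explicit dependency metadata and checked mechanically. A hypothetical sketch (the milestone names below are invented, not real PMCS entries):

```python
from graphlib import TopologicalSorter  # stdlib, Python >= 3.9

# Hypothetical milestones, each mapped to the milestones it depends on.
# Service stand-ups (Data Facility) depend on software deliveries.
milestones = {
    "LDF-stand-up-notebook-service": {"SQRE-deliver-notebook-software"},
    "SQRE-deliver-notebook-software": set(),
    "LDF-stand-up-pegasus-wms": {"MW-deliver-pipelinetask"},
    "MW-deliver-pipelinetask": set(),
}

# A valid schedule orders every milestone after its prerequisites;
# TopologicalSorter raises CycleError if the dependencies are inconsistent.
order = list(TopologicalSorter(milestones).static_order())

assert order.index("SQRE-deliver-notebook-software") < order.index(
    "LDF-stand-up-notebook-service")
print(order)
```

Even without a formal verification procedure, a lightweight check like this would catch a revised milestone set that schedules a service before the software it depends on.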
Work has been ongoing in the DM-SST to develop a formal DM data model and to refine the way in which LSE-163 (the DPDD), the cat package, and LDM-153 (the baseline schema) are generated and managed.
This should be presented to and agreed by the DMLT.
The aim is to share some preliminary considerations on the release process, taking into account the current approach and looking at how it can evolve.