ran a pipeline with no processing (just loaded images and saved them again) and profiled it
Jim and K-T did not agree about which parts of the processing needed worrying about
code spent time in four places:
package imports (variable). Jim thinks can reduce healpy impact on daf.butler; K-T thinks it will only happen on first run
in pre-execution step, gathers conda package info–takes 8 seconds! Jim & Tim discussed caching, K-T thinks it only happens on once
Ian: is this dealt with on startup? KF: no: on first pipeline run (per distinct)
overheads on reading raws specifically
we call lsstcam.getCamera()–very slow generation of Python object). Again, Jim has an idea for improved amplifier handling; K-T again says it's a one-time penalty
Eric: could imagine a warm-up task during preload (ditto the conda info above)
IERS validation, takes several seconds of time per raw, could happen on every run of the pipeline
Ian: is this still the case where astropy is trying to download the table?
we should figure out what can go into preload, and generate tickets for middleware to implement
potential "preload" sprint in the summer?
explore in-memory datastore for the butler?
"ap_verify in Prompt Processing": there is a successor to Squash planned for operations ("Sasquatch", still influxDB/Chronograf). Let's discuss what it would take to push metrics from PP.
(deferred for two weeks)
Eric Bellm discuss "ap_verify in Prompt Processing"
Meredith Rawls : starting reprocessing with decorrelation bug fix, aiming for next week results
Kenneth Herner : Processed tract 4431 with weekly 16 and 17, two patches (42 and 50) and two bands (g and r)
Just running step 4 and step 5
Variance plane currently looks exactly the same. Ian Sullivan : This is unexpected, since the variance plane should have changed
g-band shows ~4% reduction in sources, r-band ~90% increase in sources. We expected a slight to moderate reduction in sources in all cases, so this is unexpected.
Plot of the source distribution shows many sources outside of the patch boundary.
Image differencing algorithms, DCR ( Ian Sullivan ):