Goals
A description of the aims and organization of this project is available here.
The input dataset: HSC Strategic Survey Program (SSP) Public Data Release 1 (PDR1)
Transferred to /datasets by - DM-9683Getting issue details... STATUS
It includes 5654 visits in 7 bands: HSC-G, HSC-R, HSC-I, HSC-Y, HSC-Z, NB0816, NB0921. Their visit IDs are visitId-SSPPDR1.txt. They are grouped in 8 fields:
HSC-G | HSC-R | HSC-I | HSC-Z | HSC-Y | NB0921 | NB0816 | ||
---|---|---|---|---|---|---|---|---|
Field Name ("OBJECT") | Number of visits | Tract IDs (from https://hsc-release.mtk.nao.ac.jp/doc/index.php/database/ ) | ||||||
SSP_DEEP_ELAIS_N1 | 32 | 24 | 28 | 51 | 24 | 20 | 0 | 16984, 16985, 17129, 17130, 17131, 17270, 17271, 17272, 17406, 17407 |
SSP_DEEP_DEEP2_3 | 32 | 31 | 32 | 44 | 32 | 23 | 17 | 9220, 9221, 9462, 9463, 9464, 9465, 9706, 9707, 9708 |
SSP_DEEP_XMM_LSS, SSP_DEEP_XMMS_LSS | 25 | 27 | 18 | 21 | 25 | 0 | 0 | 8282, 8283, 8284, 8523, 8524, 8525, 8765, 8766, 8767 |
SSP_DEEP_COSMOS | 20 | 20 | 40 | 48 | 16 | 18 | 0 | 9569, 9570, 9571, |
SSP_UDEEP_SXDS | 18 | 18 | 31 | 43 | 46 | 21 | 19 | 8523, 8524, 8765, 8766 |
SSP_UDEEP_COSMOS | 19 | 19 | 35 | 33 | 55 | 29 | 0 | 9570, 9571, 9812, 9813, 9814, 10054, 10055 |
SSP_AEGIS | 8 | 5 | 7 | 7 | 7 | 0 | 0 | 16821,16822, 16972, 16973 |
SSP_WIDE | 913 | 818 | 916 | 991 | 928 | 0 | 0 | XMM: 8279-8285, 8520-8526, 8762-8768 GAMA09H: 9314-9318, 9557-9562, 9800-9805 WIDE12H: 9346-9349, 9589-9592 GAMA15H: 9370-9375, 9613-9618 HECTOMAP: 15830-15833, 16008-16011 VVDS: 9450-9456, 9693-9699, 9935-9941 |
Plots of tracts/patches: https://hsc-release.mtk.nao.ac.jp/doc/index.php/data/
The survey has three layers:
- UDEEP: SSP_UDEEP_SXDS, SSP_UDEEP_COSMOS
- DEEP: SSP_DEEP_ELAIS_N1, SSP_DEEP_DEEP2_3, SSP_DEEP_XMM(S)_LSS, SSP_DEEP_COSMOS
- WIDE: SSP_WIDE, SSP_AEGIS
Release Candidate ("RC") dataset
Originally from https://hsc-jira.astro.princeton.edu/jira/browse/HSC-1361 for hscPipe 3.9.0
All public and available at /datasets. 62 visits of them were not included in PDR1 ( - DM-10128Getting issue details... STATUS ): two of SSP_WIDE and 60 of SSP_UDEEP_COSMOS; their visit IDs are 274 276 278 280 282 284 286 288 290 292 294 296 298 300 302 306 308 310 312 314 316 320 334 342 364 366 368 370 1236 1858 1860 1862 1878 9864 9890 11742 28354 28356 28358 28360 28362 28364 28366 28368 28370 28372 28374 28376 28378 28380 28382 28384 28386 28388 28390 28392 28394 28396 28398 28400 28402 29352 (also see here).
The RC dataset includes (a) 237 visits of SSP_UDEEP_COSMOS and (b) 83 visits of SSP_WIDE, in 6 bands:
(a) Cosmos to full depth: (part of SSP_UDEEP_COSMOS)
- HSC-G 11690..11712:2^29324^29326^29336^29340^29350^29352
- HSC-R 1202..1220:2^23692^23694^23704^23706^23716^23718
- HSC-I 1228..1232:2^1236..1248:2^19658^19660^19662^19680^19682^19684^19694^19696^19698^19708^19710^19712^30482..30504:2
- HSC-Y 274..302:2^306..334:2^342..370:2^1858..1862:2^1868..1882:2^11718..11742:2^22602..22608:2^22626..22632:2^22642..22648:2^22658..22664:2
- HSC-Z 1166..1194:2^17900..17908:2^17926..17934:2^17944..17952:2^17962^28354..28402:2
- NB0921 23038..23056:2^23594..23606:2^24298..24310:2^25810..25816:2
(b) Two tracts of wide: (part of SSP_WIDE)
- HSC-G 9852^9856^9860^9864^9868^9870^9888^9890^9898^9900^9904^9906^9912^11568^11572^11576^11582^11588^11590^11596^11598
- HSC-R 11442^11446^11450^11470^11476^11478^11506^11508^11532^11534
- HSC-I 7300^7304^7308^7318^7322^7338^7340^7344^7348^7358^7360^7374^7384^7386^19468^19470^19482^19484^19486
- HSC-Y 6478^6482^6486^6496^6498^6522^6524^6528^6532^6544^6546^6568^13152^13154
- HSC-Z 9708^9712^9716^9724^9726^9730^9732^9736^9740^9750^9752^9764^9772^9774^17738^17740^17750^17752^17754
Pipeline Steps / Config / Software Stack Version
Stack version: w_2017_17 (published on 26-Apr-2017) + master meas_mosaic/obs_subaru/ctrl_pool of 7-May-2017 built with w_2017_17 (i.e. w_2017_17 + DM-10315 + DM-10449 + DM-10430).
That implies the PS1 reference catalog "ps1_pv3_3pi_20170110" in the LSST format (HTM indexed) is used (/datasets/refcats/htm/ps1_pv3_3pi_20170110/).
The calibration dataset is the 20170105 version from Paul Price; the calibration repo is located at /datasets/hsc/calib/20170105 from - DM-9978Getting issue details... STATUS .
The externally provided bright object masks (butler type "brightObjectMask") of version "Arcturus" ( - DM-10436Getting issue details... STATUS ) are added to the repo and applied in coaddDriver.assembleCoadd.
- makeSkyMap.py
- singleFrameDriver.py Ignore ccd=9 which has bad amps and results not trustworthy even if processCcd passes
- mosaic.py
- coaddDriver.py Make config.assembleCoadd.subregionSize small enough so a full stack of images can fit into memory at once; a trade-off between memory and i/o but doesn't matter scientifically, as the pixels are independent.
- multiBandDriver.py
Operational configurations, such as logging configurations in ctrl_pool, different from the tagged stack may be used (e.g. DM-10430).
General hints and tips for processing
coaddDriver:
- To lower the memory when there's a LOT of inputs, decrease the subregion size in making coadds e.g. config.assembleCoadd.subregionSize = [10000, 50]
- To avoid trashing a cluster filesystem, do not use more than 20 or so cores and do not use multiple nodes.
multiband: typically want to use one core per patch; so the upper limit of usefulness is the number of patches multiplied by the number of filters
Units of independent execution
These pipelines will be run no smaller than these units:
- makeSkyMap.py One SkyMap for everything
- singleFrameDriver.py ccd (typically run per visit)
- mosaic.py tract x filter, including all visits overlapping that tract in that filter.
- coaddDriver.py patch x filter, including all visits overlapping that patch in that filter. (typically run per tract)
- multiBandDriver.py patch, including all filters. (typically run per tract)
Data of different "fields" (e.g. DEEP data and UDEEP data) will be processed separately.
Processing the RC dataset
Using the w_2017_17 stack: - DM-10129Getting issue details... STATUS and meas_mosaic ecfbc9d built with w_2017_17
singleFrameDriver: Reproducible failures in 46 ccds from 23 visits. The failed visit/ccds are the same as those in the w_2017_14 stack ( - DM-10084Getting issue details... STATUS ). Their data IDs are:
--id visit=278 ccd=95 --id visit=280 ccd=22^69 --id visit=284 ccd=61 --id visit=1206 ccd=77 --id visit=6478 ccd=99 --id visit=6528 ccd=24^67 --id visit=7344 ccd=67 --id visit=9736 ccd=67 --id visit=9868 ccd=76 --id visit=17738 ccd=69 --id visit=17750 ccd=58 --id visit=19468 ccd=69 --id visit=24308 ccd=29 --id visit=28376 ccd=69 --id visit=28380 ccd=0 --id visit=28382 ccd=101 --id visit=28392 ccd=102 --id visit=28394 ccd=93 --id visit=28396 ccd=102 --id visit=28398 ccd=95^101 --id visit=28400 ccd=5^10^15^23^26^40^53^55^61^68^77^84^89^92^93^94^95^99^100^101^102 --id visit=29324 ccd=99 --id visit=29326 ccd=47
WIDE: The coadd products have all 81 patches in both tracts (8766, 8767) in 5 filters, except that there is no coadd in tract 8767 patch 1,8 in HSC-R (nothing passed the PSF quality selection there); the multiband products of all 162 patches are generated.
COSMOS: The coadd products have 77 patches in tract 9813 in HSC-G, 74 in HSC-R, 79 in HSC-I, 79 in HSC-Y, 79 in HSC-Z, and 76 in NB0921; the multiband products of 79 patches are generated.
"brightObjectMask" were not applied; but they should not affect.
Processing the SSP PDR1
- DM-10404Getting issue details... STATUS
All processing were done with the same stack setup (i.e. without DM-10451). Data of the three layers (UDEEP, DEEP, WIDE) were processed separately.
The output repositories are at:
/datasets/hsc/repo/rerun/DM-10404/UDEEP/
/datasets/hsc/repo/rerun/DM-10404/DEEP/
/datasets/hsc/repo/rerun/DM-10404/WIDE/
All logs are at /datasets/hsc/repo/rerun/DM-10404/logs/
While unnecessary, some edge tracts outside of the PDR1 coverage were attempted in the processing. Those data outputs are kept in the repos as well.
The processing tasks/drivers were run as:
- 514 singleFrame slurm jobs (slurm job IDs: jobids_sfm.txt)
- 1555 mosaic slurm jobs (slurm job IDs: jobids_mosaic.txt)
- 1555 coadd slurm jobs (slurm job IDs: jobids_coadd.txt)
- 362 multiband slurm jobs (slurm job IDs: jobids_multiband_deepUdeep.txt jobids_multiband_wide.txt)
Reproducible Failures
In singleFrameDriver/processCcd, there were reproducible failures in 78 CCDs from 74 visits. Their data IDs are:
--id visit=1206 ccd=77 --id visit=6342 ccd=11 --id visit=6478 ccd=99 --id visit=6528 ccd=24^67 --id visit=6542 ccd=96 --id visit=7344 ccd=67 --id visit=7356 ccd=96 --id visit=7372 ccd=29 --id visit=9736 ccd=67 --id visit=9748 ccd=96 --id visit=9838 ccd=101 --id visit=9868 ccd=76 --id visit=11414 ccd=66 --id visit=13166 ccd=20 --id visit=13178 ccd=91 --id visit=13198 ccd=84 --id visit=13288 ccd=84 --id visit=15096 ccd=47^54 --id visit=15206 ccd=100 --id visit=16064 ccd=101 --id visit=17670 ccd=24 --id visit=17672 ccd=24 --id visit=17692 ccd=8 --id visit=17736 ccd=63 --id visit=17738 ccd=69 --id visit=17750 ccd=58 --id visit=19468 ccd=69 --id visit=23680 ccd=77 --id visit=23798 ccd=76 --id visit=24308 ccd=29 --id visit=25894 ccd=68 --id visit=29324 ccd=99 --id visit=29326 ccd=47 --id visit=29936 ccd=66 --id visit=29942 ccd=96 --id visit=29966 ccd=103 --id visit=30004 ccd=95 --id visit=30704 ccd=101 --id visit=32506 ccd=8 --id visit=33862 ccd=8 --id visit=33890 ccd=61 --id visit=33934 ccd=95 --id visit=33964 ccd=101 --id visit=34332 ccd=61 --id visit=34334 ccd=61 --id visit=34412 ccd=78 --id visit=34634 ccd=61 --id visit=34636 ccd=61 --id visit=34928 ccd=61 --id visit=34930 ccd=61 --id visit=34934 ccd=101 --id visit=34936 ccd=50 --id visit=34938 ccd=95 --id visit=35852 ccd=8 --id visit=35862 ccd=61 --id visit=35916 ccd=50 --id visit=35932 ccd=95 --id visit=36640 ccd=68 --id visit=37342 ccd=78 --id visit=37538 ccd=100 --id visit=37590 ccd=85 --id visit=37988 ccd=33 --id visit=38316 ccd=11 --id visit=38328 ccd=91 --id visit=38494 ccd=6^54 --id visit=42454 ccd=24 --id visit=42510 ccd=77 --id visit=42546 ccd=93 --id visit=44060 ccd=31 --id visit=44090 ccd=27^103 --id visit=44094 ccd=101 --id visit=44162 ccd=61 --id visit=46892 ccd=64 --id visit=47004 ccd=101
Out of the 78 failures:
- 36 failed with: "Unable to match sources"
- 13 failed with: "No objects passed our cuts for consideration as psf stars"
- 7 failed with: "No sources remaining in match list after magnitude limit cuts"
- 3 failed with: "No input matches"
- 3 failed with: "Unable to measure aperture correction for required algorithm 'modelfit_CModel_exp': only 1 sources, but require at least 2."
- 1 failed with: "All matches rejected in iteration 2"
- 15 failed with: "PSF star selector found [123] candidates"
A rerun log of these failures is attached as singleFrameFailures.log.
In multiBandDriver, two patches of WIDE (tract=9934 patch=0,0 and tract=9938 patch=0,0) failed with AssertionError as reported in - DM-10574Getting issue details... STATUS . I excluded the failed patches from the multiBandDriver commands, and then jobs were able to complete and process all other patches.
The multiBandDriver job of WIDE tract=9457 could not finish unless patch=1,8 is excluded. However tract 9457 is actually outside of the PDR1 coverage.