Date

Attendees

Local: Kenny LoBrian Van KlaverenUnknown User (npease)Igor GaponenkoAndy SalnikovVaikunth ThukralAndy Hanushevsky

Remote: John Gates

Discussion items

General:

  • Team has started attending CS245 (MW 1:50pm to 2:30pm). Although Vaikunth emailed prof Nate spoke with him after class and he was ok with our presence
    • Slides are usually put up after class, prof recommends reading DB book for lectures (Fritz has it currently)
  • RFC 107: Team consensus is that since we don't have a lot of python code this change will be ok especially if there is a tool that will wrap the docstrings in older code
    • Some concern over who will actually do the work for older code
  • RFC 245: Right now we use debug mostly everywhere, what to do for production? ok to do WARN for now, INFO ok for production?
    • Other comments already on RFC reason well with our requirements
    • AndyS may have some thoughts he will leave on the RFC
    • Need to internally decide which messages need to go on which level
    • The RFC issue may not be sufficiently well defined yet to answer these questions
  • LSST cybersecurity course expired over the break for Vaikunth and Igor, AndyS didn't get it at all - email NCSA?


John:

  • DM-8226: Still having docker pull issues, Vaikunth will help debug it, need to test this on NCSA to be able to merge

  • DM-8426: ORDER BY needs to match output column in the result table, will work on that, major thoughts listed out on JIRA ticket


Kenny:

  • DM-8286, DM-8234: making branches for both and starting work on it, still some machine/helpdesk issues

    • Planning meetings with John etc about these Imgserv tickets

  • Need Parallels license (and VMWare license for John)
  • Thinking about butler, have high level thoughts for future in agreement with current understanding


Brian:

  • DM-8665: Can now extract column/tables with queries. Still some bugs in parser, can't parse some types of queries
    • Want to fix this before finishing or probably make "if clause" for "can parse OK else NOT", can probably handle most SUIT stuff
    • Proof of concept is sufficiently complete


Nate:

  • DM-8125: Working on it and trying to maintain backward compatibility, not sure if it is possible and probably needs further discussion


Igor:

  • Finishing up on LSST20 dataset for KPMs - 82 partitions around the equator have discrepancies but can probably drop this issue since it no longer affects new data

  • After loading is finished will test a bit then replicate on John's side of the cluster. Vaikunth is certain he won't restart the container in the middle of loading now

  • Secondary index might take a while to populate on this data (~4billion rows and InnoDB)

  • Finishing up the documentation for DM-8406

  • Gregory wants us to generate even bigger dataset from WISE (DM-8271 in next sprint)


AndyS:

  • Supertask is on the backburner for now

  • Running L1DB-proto tests on the super machine at in2p3 with different configs, takes time because tests need large amount of data

  • Baseline config is likely not enough for large scale operation, considering splitting it into different aspects


AndyH:

  • Transitioning to xrdssi v2, documentation is a pain

  • AndyH wants to limit size of a request (set to unlimited right now), wants to discuss again with team

  • Cannot optimize if request size isn't limited, MIN should be 2GB can probably do MAX up to 16GB

  • Currently queries cannot be repeated after 16giga queries due to counter, will wrap beyond that
    • Need it for new czar to work with old worker xrdssi - can have switch for old and new type during transition

  • Limited availability in Jan - Gone to CERN next week for 1 week, then San Diego after that, then jury duty last week of Jan


Vaikunth:

  • General issues at in2p3 like QTP index not working anymore
  • Locked memory also coming from mmfsd(gpfsd, ~2GB) and collectd (~32KB) itself, hence non-zero when not running qserv locked tables

    • Can we take away the gpfsd locked memory?
    • Does it work with the containers on all the machines. is the 2GB recorded across all containers or across the machine
  • KPM measurements will be looked at after LSST20 dataset is done loading
  • Updates from Jeff Tseng @ Oxford:
    • Making progress on Edinburgh people, collecting info from colleagues about use cases
    • Checking SDSS help pages, old examples of science queries
    • Talking to people to get more manpower for this work

    • Pan-STARRS made data public - Brian says DESC is aware of this so nothing to do about that yet