Date

Attendees

Notes from the previous meeting

Discussion items

DiscussedItemNotes
(tick)Project news

Fritz Mueller :

  • summit shut down in late May or early June planned for an electric upgrade
  • leaving this Tue, be back by mid-May.
  • need to set up the planning exercise for Qserv after getting back from Chile: remaining bugs, features that were planned to be done and which havn'e, new features, and what else are we going to do next. It's also a subject for the budgetary situation. Will begin looking at JIRA tickets to make a list of what's left. GPFD, Frossie, Colin and the DAX team to be a part of the discussion. Stuff assignments are going to be a topic. We might have a chance to make a case to get another Qserv developer abroad. Not guaranteed though. First, we need to set up a plan for the "end game".
  • Colin Slater and Yusra are at Fermilab 
(tick)Qserv at USDF

Igor Gaponenko:

  • qserv6prod  expanded from 6 to 15 nodes in the same way it was done earlier for qserv6dev. The instance is back to production.
  • Fritz Mueller there was an issue with TAP upstream (missing schema for DP02). It's been resolved. Verified that Notebooks work with new Qserv in both sync and async query submission modes. So, functionally it works
  • a surprising latency has been observed in query turnaround compared with IDF
  • we should also have the MOnkey running against that instance. Need to check.

Fritz Mueller, Fabrice Jammes: any updates on the status of the Kubernetes-based deployment mode?

  • Fritz Mueller will talk to Yee informally to see if any formal steps are needed to move it forward.
(tick)Upgrade the XROOTD version to ssi-5.3.x

Done. This is in the main  branch of Qserv. Anything else we need to discuss here?

Fritz Mueller will move to the next release branch after 

(tick)Git submodules have been updated in Qserv to the latest state
(tick)Slow (or "zombi") worker queries at USDF

Fritz Mueller: there was a plan to modify Qserv's RelationGraph to generate faster queries where the RefMatch tables are involved. The idea is not to materialize sub-chunks and to put a restrictor on the sub-chunks for the "mother" tables. In this case, MySQL would benefit from indexes on the latter.

  • no progress on this yet
  • will work on it later
(tick)File-based result delivery, Qserv lockups

Igor Gaponenko :

  • the EPIC branch has been cleaned up, commits rearranged and rebased against the latest state of the main  branch.
  • DM-38069 - Getting issue details... STATUS
  • Qserv locks were detected when testing the branch using a full-sky NN query against the 150k chunks catalog GAIA_DR2 at USDF. Further tests indicate the same problem exists in the current state of the main branch. The official release  2023.4.1-rc1 looks fine though. It's possible the problem was introduced in one of the following branches:
  • this needs to be further investigated

Fritz Mueller proposed a few ideas on what to investigate next:

  • test Qserv at the intermediate state in between the above-mentioned merges
  • increase the large result limit to allow queries to complete
  • run the COUNT_STAR version of the queries to see if we would have the same problem

John Gates :

  • add more log messages

Andy Hanushevsky :

  • enable SSI tracing to see if "Finish" get called
(tick)Status of the new Parquet  to CSV translator (partitioner)

Fabrice Jammes:

  • still having trouble with the UNIX account/password at SLAC

Igor Gaponenko:

  • got lots of e-mail messages from Sabine in response to my code review on the PR
  • should look through responses and follow up if needed
  • it would be handy to have a standalone tool to convert the Parquet file into a single CSV  file w/o partitioning the intermediate product.

Fritz Mueller proposed adding a separate container for "tools" which pull a lot of dependencies (like Arrow library). Otherwise, the Qserv container will grow too much in size.

(tick)Qserv at UkDF

Fabrice Jammes:

  • Greg's cluster at UkDF is working now
(tick)DP03

Context:

Fritz Mueller:

  • V-cluster has been setup
  • Tested the environment (Kubernetes authentication at USDF, making connections to PostgreSQL)
  • Started playing with PostgreSQL, ingesting data into the tables.
  • Discovered that Mario used https://ossc-db.github.io/pg_bulkload/pg_bulkload.html for high-performance ingest and which also has many nice features as it allows more control over what's ingested from the NumPy dumps
  • Collin and GPFS are looking at Felis schema
  • Action items for Igor Gaponenko:
    • learn how to authenticate yourself to use the v-cluster and PostgreSQL
    • get familiarized with the tooling for ingesting DP03 into the database
    • an idea is to provide a backup for Fritz when he's away

Action items

  •