May need some downtime in a few weeks for a water line repair; still unclear whether it will affect our hardware. Unknown User (jalt) will look into it and provide advance notice if there will be an outage.
Beginning work with an NCSA engineer on configuring monitoring. Have an initial set of ideas for what should be monitored and will share these with the PDAC group for feedback.
Planning to use Nagios and will look into Ganglia. Open to other monitoring frameworks if there is a good reason.
Fritz Mueller will provide input today based on the monitoring configuration currently used by the Qserv group at IN2P3.