
Upcoming Scheduled Maintenance

Start | End | What is happening? | What will be affected?
2017-03-23 (1000) | 2017-03-23 (1500) | NCSA Nebula Outage | Nebula will take an outage to balance the file system and build a more stable setup for it. This will require pausing all instances, and Horizon will be unavailable.
2017-03-28 (0000) | 2017-03-29 (1600) | Blue Waters maintenance | Due to maintenance of cooling infrastructure at NPCF, Blue Waters will be down during this period. Cray will also use this maintenance window to perform some system updates. We do not anticipate any impact to LSST services. The NPCF will be precooled prior to the maintenance shutdown, and the facility has sufficient capacity to ride out the loss of main cooling. UPDATE:

Systems that will be down

  • Slurm cluster compute nodes will be powered down for the duration of the outage.

Systems that will remain up

Qserv nodes (lsst-qserv-*), SUI nodes (lsst-sui-*), and the bastion node (lsst-bastion01) should remain online during the outage.

However, if temperatures in the NPCF rise too high, we will be forced to shut these down as well. I've been told that this is a low-probability scenario and we will be given time to do graceful shutdowns. In the unlikely event that this happens, it will be communicated through the DM Slack channel and also posted here.


Previous Outages

Start | End | What happened? | What was affected? | Outcome
2017-03-16 (0630) | 2017-03-16 (1130) | LSST monthly maintenance | GPFS filesystems will go offline for the entire duration of the outage. Some systems may be rebooted, especially those that mount one or more of the GPFS filesystems. |
2017-02-22 (1615) | 2017-02-22 (1815) | Nebula Gluster issues | All Nebula instances paused while Gluster was repaired | Nebula is available.