Difference between revisions of "Euler power outage (24 July 2018)"

From ScientificComputing
Jump to: navigation, search
Line 11: Line 11:
  
 
13:15 —  Information about outage from the electricity provider (in Italian): https://www.ail.ch/meta-navigation/media/news-comunicati/Ripristino-interruzione-di-servizio.html
 
13:15 —  Information about outage from the electricity provider (in Italian): https://www.ail.ch/meta-navigation/media/news-comunicati/Ripristino-interruzione-di-servizio.html
 +
 +
13:30 —  The cooling infrastructure at CSCS is operational again. We can now progressively restart the network and storage systems of Euler. This will take a few hours.

Revision as of 13:33, 24 July 2018

Due to a massive power outage in Lugano (and apparently a big part of Ticino), most compute nodes of Euler went down at 10:42 this morning, causing the loss of all running jobs.

Since our colleagues at CSCS do not know when the power will be restored, we have initiated an emergency shutdown of all systems connected to UPS, including storage systems, login nodes and admin nodes.

We will update this page as the situation evolves.


11:30 — More information about the outage (in Italian): https://www.ticinonews.ch/ticino/468791/e-luce-fu-elettricita-tornata

12:30 — The power in Lugano is back on-line. CSCS is now restarting the data centre's cooling infrastructure

13:15 — Information about outage from the electricity provider (in Italian): https://www.ail.ch/meta-navigation/media/news-comunicati/Ripristino-interruzione-di-servizio.html

13:30 — The cooling infrastructure at CSCS is operational again. We can now progressively restart the network and storage systems of Euler. This will take a few hours.