Short service interruption 2025-04-14

This morning, an internal upgrade caused a short service interruption on Euler. Login was unavailable for a few minutes, and the Slurm batch system is also affected.

We are sorry for the inconvenience.

Updates

2025-04-14 09:30
Login to Euler is working again. We are still working on fixing the issues with the Slurm batch system.
2025-04-14 10:05
The short service interruption resulted in all running jobs being terminated. We are very sorry about this. Please do not open support tickets about failed jobs. You can already resubmit them. The queues are currently inactive but still accept new jobs. Submitted jobs will wait in the queues until we reactivate them, which will happen as soon as we are sure that Slurm is running properly again.
2025-04-14 10:40
The cluster is back to normal operation. All queues are open again and jobs are being processed.