Network problem 15 Feb 2017

From ScientificComputing

Latest revision as of 10:44, 12 October 2018

Due to a misconfiguration of the cluster's internal network, the compute and login nodes lost network connectivity. As a result, all running jobs failed before they could write back their results.

The network has been repaired and all nodes have been brought back into operation.

We have reopened the login nodes, but the LSF queues will remain closed until we can ensure that the cluster is running well.

For updates on the status of the problem, please check the system status and this page.


We are sorry for the inconvenience.

Updates

16.02.2017
LSF queues are progressively being opened to bring the system back into production. MDCS and CLC Genomics server are fully operational.