Difference between revisions of "Infiniband problems on Euler VII nodes (November 2021)"
From ScientificComputing
Line 1: | Line 1: | ||
We are currently experiencing a problem with the '''Infiniband network on Euler VII nodes'''. We are in close contact with the hardware vendors and are investigating the problem. We will make some changes in the scheduling of jobs to avoid that multi-node MPI jobs are starting on Euler VII nodes. | We are currently experiencing a problem with the '''Infiniband network on Euler VII nodes'''. We are in close contact with the hardware vendors and are investigating the problem. We will make some changes in the scheduling of jobs to avoid that multi-node MPI jobs are starting on Euler VII nodes. | ||
− | If you encounter problems with stuck jobs on nodes whose hostname does not start with eu-a2p, then please report those cases to {{cluster_support}} | + | If you encounter problems with stuck jobs on nodes whose hostname does not start with '''eu-a2p''', then please report those cases to {{cluster_support}} |
Revision as of 14:35, 16 November 2021
We are currently experiencing a problem with the Infiniband network on Euler VII nodes. We are in close contact with the hardware vendors and are investigating the problem. We will make some changes in the scheduling of jobs to avoid that multi-node MPI jobs are starting on Euler VII nodes.
If you encounter problems with stuck jobs on nodes whose hostname does not start with eu-a2p, then please report those cases to cluster support