Difference between revisions of "Euler III Beta Testing"

From ScientificComputing
Jump to: navigation, search
(Updates for MVAPICH2 and Intel.)
(Simplified Open MPI module.)
Line 15: Line 15:
 
=== Submitting parallel jobs ===
 
=== Submitting parallel jobs ===
  
While the Euler III nodes are targeted to serial and shared-memory parallel jobs, multi-node parallel jobs are still accepted. You need to tell the system that Infiniband is not available,
+
While the Euler III nodes are targeted to serial and shared-memory parallel jobs, multi-node parallel jobs are still accepted. You need to request at most four cores per node:
 +
bsub -R beta -R "span[ptile=4]" [other bsub options] ./my_command
 +
 
 +
For MVAPICH2 you need to tell the system that Infiniband is not available,
 
  module load interconnect/ethernet
 
  module load interconnect/ethernet
''before'' loading any MPI module. Also you need to request at most four cores per node:
+
''before'' loading the MPI module.  
bsub -R beta -R "span[ptile=4]" [other bsub options] ./my_command
+
 
 
;Open MPI
 
;Open MPI
 
:Open MPI 1.6.5;has been tested to work with acceptable performance.
 
:Open MPI 1.6.5;has been tested to work with acceptable performance.
 
;MVAPICH2
 
;MVAPICH2
:MVAPICH2 2.1 works but preliminary results show a low scalability.
+
:MVAPICH2 2.1 works but preliminary results show low scalability. You need to load the interconnect/ethernet module.
 
;Intel MPI
 
;Intel MPI
 
:Intel MPI 5.1.3 has been tested.
 
:Intel MPI 5.1.3 has been tested.

Revision as of 09:54, 30 January 2017

The Euler III extension to the Euler cluster is available for beta testers.

If you are interested in running test jobs on the new nodes and are willing to work with us to diagnose and resolve problems, then contact us to be granted access as a beta tester for these nodes.

Serial jobs or single-node parallel jobs using from 1 to 4 cores, use up to 30 GB of total memory and request up to 24 hours are good candidates to run on these nodes.

The new nodes run CentOS 7 exclusively. All other production nodes in Euler currently run CentOS 6. Jobs that rely on Infiniband will not run on these nodes. Single-node Open MPI jobs will run but MVAPICH2 jobs will not run.

Submitting beta jobs

To submit a job to run on the beta Euler III nodes, you must request the beta resource, e.g.,

bsub -R beta [other bsub options] ./my_command

Only jobs from approved beta testers will run as long as the system remains in this stage of the beta test.

Submitting parallel jobs

While the Euler III nodes are targeted to serial and shared-memory parallel jobs, multi-node parallel jobs are still accepted. You need to request at most four cores per node:

bsub -R beta -R "span[ptile=4]" [other bsub options] ./my_command

For MVAPICH2 you need to tell the system that Infiniband is not available,

module load interconnect/ethernet

before loading the MPI module.

Open MPI
Open MPI 1.6.5;has been tested to work with acceptable performance.
MVAPICH2
MVAPICH2 2.1 works but preliminary results show low scalability. You need to load the interconnect/ethernet module.
Intel MPI
Intel MPI 5.1.3 has been tested.