Difference between revisions of "Parallel job submission"

From ScientificComputing
Jump to: navigation, search
(Example)
Line 1: Line 1:
 
__NOTOC__
 
__NOTOC__
 +
<table style="width: 100%;">
 +
<tr valign=top>
 +
<td style="width: 30%; text-align:left">
 +
< [[Job submission | Submit a job]]
 +
</td>
 +
<td style="width: 35%; text-align:center">
 +
[[Sandbox Home | Home]]
 +
</td>
 +
<td style="width: 35%; text-align:right">
 +
[[Job monitoring | Monitor a job]] >
 +
</td>
 +
</tr>
 +
</table>
 +
 
== Shared memory job (OpenMP) ==
 
== Shared memory job (OpenMP) ==
 
* Runs on a single compute node
 
* Runs on a single compute node

Revision as of 14:37, 12 May 2021

< Submit a job

Home

Monitor a job >

Shared memory job (OpenMP)

  • Runs on a single compute node
  • Can use up to 24 processors
  • Number of processors must be defined in $OMP_NUM_THREADS
export OMP_NUM_THREADS=8
bsub -n 8 ./program

Distributed memory job (MPI)

  • Runs on multiple compute nodes
  • Can use tens or even hundreds of processors
  • Program must be launched using mpirun
module load compiler
module load mpi_library
bsub -n 240 mpirun ./program

Example

$ export OMP_NUM_THREADS=8
$ bsub -n 8 ./hello_omp
Generic job.
Job <8147290> is submitted to queue <normal.4h>.
$ unset OMP_NUM_THREADS
$ bsub -n 240 mpirun ./hello_mpi
MPI job.
Your environment is not configured for MPI.
Please load the module(s) needed by your job before executing 'bsub'.
Request aborted by esub. Job not submitted.
$ module load intel open_mpi
$ bsub -n 240 mpirun ./hello_mpi
MPI job.
Job <8147303> is submitted to queue <normal.4h>.


< Submit a job

Home

Monitor a job >