Difference between revisions of "Job management with LSF"

 
(15 intermediate revisions by the same user not shown)
== What are the steps? ==
* Log in to a login node on the cluster
* Transfer your data to the cluster
* Load the necessary modules
* Prepare a bsub command with LSF options that request the computing resources you need
* Submit the job with a bsub command line or a job script
* Wait for your job to start
* Your job runs on the compute nodes
* Retrieve your job results and output
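
The submission steps listed above can be sketched as a short shell session. This is a minimal illustration, not the cluster's official template: the job name, module name, program, input file, and all resource values are hypothetical, and the unit of the memory request depends on how LSF is configured at your site. The script writes a job file with embedded <code>#BSUB</code> options and submits it only if the <code>bsub</code> client is found.

```shell
#!/bin/bash
# Sketch: write a job script, then submit it if the LSF client is available.
# All names and resource values below are illustrative assumptions.

cat > job.sh <<'EOF'
#!/bin/bash
# Job name (hypothetical)
#BSUB -J example_job
# Number of cores requested
#BSUB -n 4
# Run-time limit in hh:mm
#BSUB -W 01:00
# Memory request (unit depends on the site configuration)
#BSUB -R "rusage[mem=2048]"
# Output file; %J expands to the job ID
#BSUB -o example_job.%J.out

# Hypothetical module, program, and input file
module load gcc
./my_program input.dat
EOF

# Submit only where bsub exists, i.e. on a cluster login node.
if command -v bsub >/dev/null 2>&1; then
    # LSF reads the #BSUB lines when the script arrives on standard input
    bsub < job.sh
    # List your pending and running jobs
    bjobs
else
    echo "bsub not found: run this on a cluster login node"
fi
```

The same options can be passed directly on the command line instead, for example <code>bsub -n 4 -W 01:00 ./my_program input.dat</code>. A job script is easier to keep and rerun.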

Latest revision as of 09:24, 1 October 2021

< Modules and applications

Home

Submit a job >


Batch system.png

     1. Submit a job

       1.1 Submit a parallel job

       1.2 Submit a GPU job

     2. Monitor a job

     3. Job output


Why use the batch system?

Users can access the computing resources on the cluster solely through the batch system. On the ETH HPC clusters, we use the LSF batch system to manage computing jobs.

