Neural network training with TensorFlow on CPU

From ScientificComputing
Revision as of 13:18, 25 November 2020 by Jarunanp (talk | contribs) (tensorflow example)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Load Python with TensorFlow module

We will use the new software stack in this tutorial:

 $ env2lmod  

Load the Python module which contains TensorFlow 2.0.0 package

 $ module load gcc/6.3.0 python/3.7.4 hdf5
 The following have been reloaded with a version change:
   1) gcc/4.8.5 => gcc/6.3.0

Check if we could import TensorFlow package

 $ python -c "import tensorflow as tf; print(tf.__version__)"
 2.0.0

Create a CNN model

We used the CNN model from TensorFlow tutorials and created a python script called train_cnn_cifar10.py.

Submit a batch job

Submit a job to the compute node

 $ bsub -n 18 -W 01:00 python train_cnn_cifar10.py
 Generic job.
 Job <153279665> is submitted to queue <normal.4h>.

Check the job status

 $ bjobs
 JOBID      USER    STAT  QUEUE      FROM_HOST   EXEC_HOST   JOB_NAME   SUBMIT_TIME
 153279665  jarunan PEND  normal.4h  eu-login-02             *ifar10.py Nov 25 14:12