Blast++/Example

From ScientificComputing
Jump to: navigation, search

As an example for a blast++ job, we are doing a simple query of the sequence test.fasta

[sfux@eu-login-06 ~]$ cat test.fasta 
>sequence1
MIKKIGVLTSGGDAPGMNAAIRGVVRSALTEGLEVMGIYDGYLGLYEDRMVQLDRYSVSD
MINRGGTFLGSARFPEFRDENIRAVAIENLKKRGIDALVVIGGDGSYMGAMRLTEMGFPC
IGLPGTIDNDIKGTDYTIGFFTALSTVVEAIDRLRDTSSSHQRISVVEVMGRYCGDLTLA
AAIAGGCEFVVVPEVEFSREDLVNEIKAGIAKGKKHAIVAITEHMCDVDELAHFIEKETG
RETRATVLGHIQRGGSPVPYDRILASRMGAYAIDLLLAGYGGRCVGIQNEQLVHHDIIDA
IENMKRPFKGDWLDCAKKLY

and compare it against the nt database.

[sfux@eu-login-06 ~]$ module load gcc/4.8.2 blast/2.2.30
[sfux@eu-login-06 ~]$ bsub -n 1 -W 4:00 -R "rusage[mem=2048]" "blastp -query test.fasta -out output.blast.txt -db nr"
#Generic job.
#Job <33641518> is submitted to queue <normal.4h>.
[sfux@eu-login-06 ~]$ bjobs
JOBID      USER      STAT  QUEUE      FROM_HOST   EXEC_HOST   JOB_NAME   SUBMIT_TIME
33641518   leonhard  PEND  normal.4h  euler06
[sfux@eu-login-06 ~]$ bjobs
JOBID      USER      STAT  QUEUE      FROM_HOST   EXEC_HOST   JOB_NAME   SUBMIT_TIME
33641518   leonhard  RUN   normal.4h  euler06     e1057       *xt -db nr Dec  6 10:02
[sfux@eu-login-06 ~]$ bjobs
No unfinished job found

The result is then written to the output file output.blast.txt:

[sfux@eu-login-06 ~]$ sed -n '25,40p' output.blast.txt 
Query= sequence1

Length=320
                                                                     Score     E
Sequences producing significant alignments:                          (Bits)  Value

ref|WP_000591795.1|  MULTISPECIES: ATP-dependent 6-phosphofructok...    650   0.0   
gb|EFJ85506.1|  6-phosphofructokinase [Escherichia coli MS 84-1]        651   0.0   
gb|ABF05609.1|  6-phosphofructokinase I [Shigella flexneri 5 str....    651   0.0   
ref|WP_024228092.1|  ATP-dependent 6-phosphofructokinase [Escheri...    650   0.0   
gb|ABE09911.1|  6-phosphofructokinase isozyme I [Escherichia coli...    651   0.0   
gb|ADX52955.1|  6-phosphofructokinase [Escherichia coli KO11FL]         651   0.0   
ref|WP_000967668.1|  ATP-dependent 6-phosphofructokinase [Escheri...    651   0.0   
ref|WP_032279226.1|  MULTISPECIES: ATP-dependent 6-phosphofructok...    649   0.0   
gb|EEJ48186.1|  6-phosphofructokinase [Escherichia coli 83972]          650   0.0   
ref|WP_039061908.1|  MULTISPECIES: ATP-dependent 6-phosphofructok...    649   0.0   

You can find the resource usage summary of the job in the corresponding LSF log file.