[Bioperl-l] New Blast parser

Sendu Bala bix at sendu.me.uk
Tue May 15 08:23:52 UTC 2007


Back in August of last year I introduced Bio::PullParserI, a module that 
aids in the creation of fast SearchIO and Search modules. I've finally 
gotten around to implementing a Blast parser using the interface, which 
I've called Bio::SearchIO::blast_pull.

To use it you say:

my $sio = Bio::SearchIO->new(-format => "blast_pull", -file => "file");

or in the near future (when I've committed StandAloneBlast changes):

my $sab = Bio::Tools::Run::StandAloneBlast->new(-_READMETHOD => 
"blast_pull");


Currently the parser is incomplete: I've only tested it with NCBI BLASTN 
and BLASTP. However, results are promising. In one particular real-world 
usage-case involving running and parsing multiple Blast jobs via 
StandAloneBlast (amongst other things), changing only the _READMETHOD 
from 'blast' to 'blast_pull' in the code dropped run time from 20223s to 
951s (~20x faster) and memory usage from over 8GB to less than 5GB (~40% 
less).

Please try it out and feed-back any bugs you discover.


Cheers,
Sendu.



More information about the Bioperl-l mailing list