[Bioperl-l] dealing with large files

Sendu Bala bix at sendu.me.uk
Thu Dec 20 18:48:29 UTC 2007


Amir Karger wrote:
>> It would be nice to code up a lazy sequence object and related  
>> parsers; maybe for the next dev release.
> 
> Yes!
> 
> Also, BLAST parsing. Blasting the proteome against the genome makes for
> rather large result files.

This has already been done. Use Bio::SearchIO::blast_pull. In a 
situation like yours I dropped run time from 20223s to
951s (~20x faster) and memory usage from over 8GB to less than 5GB (~40%
less).



More information about the Bioperl-l mailing list