[Bioperl-l] Speed ups checked in

Ewan Birney birney@ebi.ac.uk
Sun, 29 Sep 2002 19:18:01 +0100 (BST)


i have just checked in speedups for the fasta parsing:

[localhost:~/data] birney% time perl -I ~/src/bioperl-1.1.0/
fasta2fasta.pl < rod.fa > /dev/null
38.750u 2.420s 0:49.67 82.8%    0+0k 22+0io 0pf+0w


vs


[localhost:~/data] birney% time perl -I ~/src/bioperl-live/ fasta2fasta.pl
< rod.fa > /dev/null
15.240u 1.440s 0:19.14 87.1%    0+0k 21+0io 0pf+0w




to do this speed up (factor of 2) i have had to take a pretty direct route
in creating the correct hashes, blessing them and then populating them
correctly. Rather than this being put in the main system, I have taken
advantage of the Factory system - there is now a new Seq Factory called

SeqFastaSpeedFactory

which, unsurprisingly makes sequences very fast.



I have tried to make generic object creating faster.



Next I will work on GenBank and EMBL parsing.




e.