[Biopython-dev] Uniprot XML parser on TrEmbl

Andrea Pierleoni andrea at biocomp.unibo.it
Fri Nov 12 10:24:07 UTC 2010


WIth the submitted patch the parser was able to correctly parse 12.347.303
entries in
the 62Gb XML file in 2h 13m.
it looks like a reasonable performance to me, since you are going to spend
more time
in downloading the 8Gb gzipped file and decompressing it.

Andrea





More information about the Biopython-dev mailing list