[Biopython-dev] Bio.Entrez.parse
Michiel de Hoon
mjldehoon at yahoo.com
Sat Sep 5 04:17:00 EDT 2009
Hi everybody,
Recently I was trying to parse a huge Entrez XML file containing Entrez gene records. Because of the size of the file, Entrez.read failed with a memory error since it could not keep the entire information in the XML file in memory. I decided to add a parse() function to Bio.Entrez that can iterate of such large files. This function is useful if the XML file essentially contains a list of records; the parse() function is a generator function that returns these records one by one.
--Michiel.
More information about the Biopython-dev
mailing list