[Biopython-dev] Bio.Entrez.parse

Michiel de Hoon mjldehoon at yahoo.com
Sat Sep 5 08:17:00 UTC 2009


Hi everybody,
Recently I was trying to parse a huge Entrez XML file containing Entrez gene records. Because of the size of the file, Entrez.read failed with a memory error since it could not keep the entire information in the XML file in memory. I decided to add a parse() function to Bio.Entrez that can iterate of such large files. This function is useful if the XML file essentially contains a list of records; the parse() function is a generator function that returns these records one by one.

--Michiel.


      



More information about the Biopython-dev mailing list