[Biopython-dev] XML parsing library for new modules
Brad Chapman
chapmanb at 50mail.com
Tue May 5 12:50:59 UTC 2009
Peter, Michiel and Eric;
> > Another thing to consider is what to do with the result
> > returned by ElementTree. Whereas it will contain all the
> > information in the XML file, it may not represent it in a
> > user-friendly way. You may want to take the output from
> > ElementTree and store it in a more biopython-like object.
Agreed. Most of the fun creative parts of the project, as opposed to
the parsing nuts and bolts, will be in developing the object
representations.
> > Also keep in mind memory usage: ElementTree will keep
> > the complete XML file in memory, whereas the SAX
> > parser gives you more flexibility here (see below).
ElementTree can do incremental parsing, so you can also deal with
large files using it:
http://effbot.org/zone/element-iterparse.htm
Brad
More information about the Biopython-dev
mailing list