[Biopython-dev] Bio.Entrez / next Biopython release.
Michiel de Hoon
mjldehoon at yahoo.com
Thu Feb 21 14:00:43 UTC 2008
Hi everybody,
As discussed previously, I created a module Bio.Entrez for interacting with NCBI's Entrez databases (GenBank, PubMed, and many others). This is essentially Bio.WWW.NCBI renamed to Bio.Entrez; Bio.WWW.NCBI still exists in the same location but is deprecated. In the process, I updated this module to include all of NCBI Entrez Programming Utilities, and deprecated those that have been superseded at NCBI. The code is now in CVS as Bio/Entrez.py. In hindsight, it would probably have been a better idea to use Bio/Entrez/__init__.py in case we want to expand Bio.Entrez, but anyway this can be rectified before creating the next Biopython release.
I also wrote some documentation for Bio.Entrez. You can have a preview at http://biopython.org/DIST/docs/tutorial/Tutorial-proposal.html; Chapter 6 describes Bio.Entrez, and gives a good overview of the current status of this module.
The module Bio.Entrez was created in response to Bug #2393:
http://bugzilla.open-bio.org/show_bug.cgi?id=2393
Using Bio.Entrez, we can fix this bug easily, and then create a new release. This is one thing to consider though:
Like Bio.WWW.NCBI, Bio.Entrez provides access to NCBI's Entrez databases but does not provide parsers for the output generated by NCBI (note: some file formats generated by NCBI Entrez' sequence databases can be parsed by Bio.SeqIO). Our options are then:
1) Keep Bio.Entrez as a module only to access NCBI Entrez, but not to parse the results.
2) Add parsers to Bio.Entrez.
3) Make a new Biopython release now, and potentially add parsers later.
Suggestions, preferences, comments, anybody?
--Michiel.
---------------------------------
Be a better friend, newshound, and know-it-all with Yahoo! Mobile. Try it now.
More information about the Biopython-dev
mailing list