[Biopython-dev] Fwd: [Fwd: missing NCBI DTDs]
Michiel de Hoon
mjldehoon at yahoo.com
Tue Mar 25 22:15:52 EDT 2014
We could consider to not include any DTDs with Biopython, and rely on downloading them automatically.
This seems a better test case than what we currently have, because as NCBI updates their DTDs, Bio.Entrez depends on this automatic download capability.
Best,
-Michiel.
--------------------------------------------
On Mon, 3/24/14, Peter Cock <p.j.a.cock at googlemail.com> wrote:
Subject: [Biopython-dev] Fwd: [Fwd: missing NCBI DTDs]
To: "Biopython-Dev Mailing List" <biopython-dev at biopython.org>
Date: Monday, March 24, 2014, 12:42 PM
Three more NCBI Entrez DTD files
added:
https://github.com/biopython/biopython/commit/58e024f7704c3b7d3694fda42be6fa47808dad7d
The downside of the new Entrez code is it silently
downloads
and caches missing DTD files, so we may want to add
something
to the manual release process to check what missing DTD
files
have been cached locally... e.g.
$ ls ~/.config/biopython/Bio/Entrez/DTDs/
Regards,
Peter
---------- Forwarded message ----------
From: Peter Cock <p.j.a.cock at googlemail.com>
Date: Mon, Mar 24, 2014 at 4:19 PM
Subject: Re: [Fwd: missing NCBI DTDs]
To: xxx at uci.edu
Cc: "biopython-owner at lists.open-bio.org"
<biopython-owner at lists.open-bio.org>
Thanks for getting in touch.
Sadly back when anyone could email the list we had far too
much spam. Unfortunately the only practical solution was to
insist people join the mailing list before posting.
With hindsight the missing DTD message should have also
said please check the latest code / issue tracker - in this
case we've fixed the missing esummary-v1.dtd file:
https://github.com/biopython/biopython/commit/cb560e79def4b24c831725308f17123af4e8eeff
We do seem to be missing the other three through,
bookdoc_140101.dtd
nlmmedlinecitationset_140101.dtd
pubmed_140101.dtd
The sample code you provided makes testing this easier,
thank you.
Peter
---------- Forwarded message ----------
On Mon, Mar 24, 2014 at 4:04 PM, <xxx at uci.edu> wrote:
When I attempted to run the following python script:
from Bio import Entrez
Entrez.email = "esharman at uci.edu"
handle = Entrez.efetch(db="pubmed", id="24653700",
retmode="xml")
record = Entrez.read(handle)
handle.close()
print record[0]["ArticleTitle"]
the following DTDs were reported missing:
bookdoc_140101.dtd
nlmmedlinecitationset_140101.dtd
pubmed_140101.dtd
When I ran a similar script to access the SNP database, the
following DTD
was reported missing:
esummary-v1.dtd
Downloading and saving these files to the requested python
directory
eliminated the error messages.
Biopython is an absolutely super package! Hope this helps.
_______________________________________________
Biopython-dev mailing list
Biopython-dev at lists.open-bio.org
http://lists.open-bio.org/mailman/listinfo/biopython-dev
More information about the Biopython-dev
mailing list