[Biopython] need help! how to retrieve full text from Pubmed central ?

ning luwen bioinformaticsing at gmail.com
Sat Dec 26 09:54:23 EST 2009


more about the problem.

     From http://eutils.ncbi.nlm.nih.gov/corehtml/query/static/efetchlit_help.html,
I can learn:
      PubMed Central contains a number of articles classified as "open
access" for which you may download the full text as XML. For the
remaining articles in PMC you may download only the abstracts as XML.

but when try to
handle=Entrez.efetch(db='pmc',id=idlist,rettype='full',retmode='xml')
record=Entrez.read(handle)

got following errors:
    Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/local/lib/python2.6/dist-packages/Bio/Entrez/__init__.py",
line 258, in read
    record = handler.read(handle)
  File "/usr/local/lib/python2.6/dist-packages/Bio/Entrez/Parser.py",
line 114, in read
    raise CorruptedXMLError
Bio.Entrez.Parser.CorruptedXMLError: Failed to parse the XML data.
Please make sure that the input data are in XML format, and that the
data are not corrupted.

the python version is 1.53 and my system is ubuntu 9.10.

On Sat, Dec 26, 2009 at 10:37 PM, ning luwen
<bioinformaticsing at gmail.com> wrote:
> Dear everyone,
>    I need to download full text from Pubmed central. After see the
> Entrez manual, maybe Entrez(not the web interface) doesn't give a way
> to download .pdf full text file, is this true?
>
>
>
>
> --
> regards,
> ningluwen
>



-- 
regards,
luwening,bioinformatics center in uestc:
www.bioinformaticsinuestc.cz.cc



More information about the Biopython mailing list