[Biopython] Entrez.efetch from gene

Edith Schlagenhauf ediths at botinst.uzh.ch
Wed Jan 26 11:05:35 UTC 2011


Using retmode='xml' returns the data in XML format, i.e.:

handle = Entrez.efetch(db=search_database, id=reclist[0], retmode='xml')

# The Bio.Entrez.read() function can parse most (if not all) XML output
# returned by Entrez.

record = Entrez.read(handle)
handle.close()

print(record[0].keys())                 # prints list of available keys
print(record[0]["Entrezgene_rna"])      # example key
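
The parsed record is a nested structure of list-like and dict-like objects, so the top-level keys are often only the start. The following is a minimal sketch of a helper that lists key paths a few levels deep. The name key_paths and the sample data are made up for illustration (the sample is a hand-written stand-in, not real Entrez output); with a live query you would pass the result of Entrez.read(handle) instead.

```python
def key_paths(node, max_depth=2, prefix=""):
    """Return the sorted, unique key paths found in nested dict/list data.

    max_depth is the number of dictionary levels to descend into.
    Lists are walked through without counting as a level.
    """
    if max_depth <= 0:
        return []
    paths = set()
    if isinstance(node, dict):
        for key, value in node.items():
            path = prefix + "/" + str(key) if prefix else str(key)
            paths.add(path)
            paths.update(key_paths(value, max_depth - 1, path))
    elif isinstance(node, list):
        for item in node:
            paths.update(key_paths(item, max_depth, prefix))
    return sorted(paths)


# Hand-written stand-in for a parsed record (NOT real Entrez output).
# Only "Entrezgene_rna" is a key name taken from the example above;
# "outer", "inner" and "deep" are placeholders.
sample = [{"Entrezgene_rna": "made-up value",
           "outer": {"inner": {"deep": 1}}}]

for path in key_paths(sample, max_depth=2):
    print(path)
```

Raising max_depth shows more of the tree; keeping it small avoids flooding the terminal, since gene records can be deeply nested.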



HTH,
Edith


******************************************
Dr Edith Schlagenhauf
University of Zurich
SWITZERLAND

e-mail: ediths AT botinst DOT uzh DOT ch
******************************************


On Tue, 25 Jan 2011, Michael Fahy wrote:

> Trying to use Entrez.efetch() to query the gene database.
>
> The efetch help at
> http://www.ncbi.nlm.nih.gov/entrez/query/static/efetchseq_help.html says
> there are no retrieval types supported by the gene database.  If I do an
> efetch query without specifying a value for rettype, it returns html.  Is
> there a way in Biopython to parse this html?  Or is there another way to
> query the gene database so it will return data that can be parsed?
>
> Sample code:
>
> from Bio import Entrez
> Entrez.email = 'email at chapman.edu'
>
> search_database = 'gene'
> search_term = 'YIL065C'
>
> handle = Entrez.esearch(db='gene',term=search_term)
> record = Entrez.read(handle)
> reclist = record['IdList']
>
> handle = Entrez.efetch(db=search_database,  id =reclist[0])
>
> myrecord = handle.read()
> print myrecord
>
> ----------------------------------------------------------
> Michael A. Fahy
> fahy at chapman.edu
>
>
> _______________________________________________
> Biopython mailing list  -  Biopython at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/biopython
>


