[BioPython] Getting protein structure names from primary IDs

Nick Matzke matzke at berkeley.edu
Sat Mar 14 05:05:47 UTC 2009


Hi again -- Esummary was what I needed, so nevermind!

Sorry for the trouble,
Nick


Nick Matzke wrote:
> Hi all,
> 
> This has got to be trivial, but I can't find a hint about the solution 
> online.
> 
> I want to:
> 
> 1. Search NCBI's structure database for structures from a certain group
> 
> from Bio import Entrez
> handle = Entrez.einfo()
> record = Entrez.read(handle)
> print "Search the structure database on Organism = Drosophila"
> Entrez.email = "A.N.Other at example.com"     # Always tell NCBI who you are
> #handle = Entrez.esearch(db="structure", term="Drosophila")
> handle = Entrez.esearch(db="structure", term="Drosophila[Orgn]")
> 
> pdb_record = Entrez.read(handle)
> print pdb_record    #["IdList"]
> 
> pdblist = pdb_record["IdList"]
> 
> 
> 
> OK, now I have a list of primary IDs for the protein structures from 
> Drosophila.
> 
> 
> 
> 2. Download those structures.  Apparently I have to do this from RSCB 
> and not NCBI? (NCBI efetch has no information on efetching from the 
> structure database, and I tried a few obvious methods on analogy to 
> other databases without result)
> 
> This will download from RSCB, but apparently you need the structure 
> name, not the NCBI primary ID.
> 
> 
> from Bio.PDB import *
> pdbl=PDBList()
> pdbl.retrieve_pdb_file('1FAT')
> 
> 
> So, how do I get from primary ID to structure name?  I'm sure I'm 
> missing something obvious.
> 
> Cheers,
> Nick
> 
> 
> 
> 

-- 
====================================================
Nicholas J. Matzke
Ph.D. student, Graduate Student Researcher
Huelsenbeck Lab
Center for Theoretical Evolutionary Genomics
4151 VLSB (Valley Life Sciences Building)
Department of Integrative Biology
University of California, Berkeley

Lab websites:
http://ib.berkeley.edu/people/lab_detail.php?lab=54
http://fisher.berkeley.edu/cteg/hlab.html
Dept. personal page: 
http://ib.berkeley.edu/people/students/person_detail.php?person=370
Lab personal page: http://fisher.berkeley.edu/cteg/members/matzke.html
Lab phone: 510-643-6299
Dept. fax: 510-643-6264
Cell phone: 510-301-0179
Email: matzke at berkeley.edu

Mailing address:
Department of Integrative Biology
3060 VLSB #3140
Berkeley, CA 94720-3140

-----------------------------------------------------
"[W]hen people thought the earth was flat, they were wrong. When people 
thought the earth was spherical, they were wrong. But if you think that 
thinking the earth is spherical is just as wrong as thinking the earth 
is flat, then your view is wronger than both of them put together."

Isaac Asimov (1989). "The Relativity of Wrong." The Skeptical Inquirer, 
14(1), 35-44. Fall 1989.
http://chem.tufts.edu/AnswersInScience/RelativityofWrong.htm
====================================================



More information about the Biopython mailing list