[Biopython] download sequences by date from Genbank

Michiel de Hoon mjldehoon at yahoo.com
Thu Jun 28 01:01:59 UTC 2012


Hi Lifeng,

Have a look at ESearch in the NCBI E-Utilities, which accepts date-range parameters (datetype, mindate, maxdate, reldate):

http://www.ncbi.nlm.nih.gov/books/NBK25499/#chapter4.ESearch

You can access the E-Utilities and parse the results with Bio.Entrez as described in the Biopython manual.
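
As a rough sketch of what that could look like (not a tested recipe): the snippet below builds the ESearch arguments for a date-restricted query and passes them to Bio.Entrez.esearch. The helper names build_esearch_params and fetch_new_ids, the species names, the e-mail address and the retmax value are all made up for illustration. The sketch also assumes that the modification date ("mdat") is the right cutoff for a sync; "pdat" (publication date) may suit you better.

```python
from datetime import date


def build_esearch_params(organisms, since, until=None, db="nucleotide"):
    """Build keyword arguments for Bio.Entrez.esearch (hypothetical helper).

    organisms: list of species names, OR-ed together in the query.
    since, until: datetime.date objects bounding the date range.
    """
    until = until or date.today()
    term = " OR ".join('"%s"[Organism]' % name for name in organisms)
    return {
        "db": db,
        "term": term,
        "datetype": "mdat",  # assumption: filter on modification date
        "mindate": since.strftime("%Y/%m/%d"),
        "maxdate": until.strftime("%Y/%m/%d"),
        "retmax": 500,  # illustrative; ESearch returns only 20 IDs by default
    }


def fetch_new_ids(params, email):
    """Run the search and return the list of matching IDs (needs network)."""
    from Bio import Entrez  # imported here so the helper above works without Biopython

    Entrez.email = email  # NCBI asks for a contact address
    handle = Entrez.esearch(**params)
    result = Entrez.read(handle)
    handle.close()
    return result["IdList"]


if __name__ == "__main__":
    params = build_esearch_params(
        ["Homo sapiens", "Mus musculus"],  # placeholder species
        since=date(2012, 6, 1),
        until=date(2012, 6, 27),
    )
    print(params["term"])
    # Uncomment to query NCBI (placeholder address):
    # print(fetch_new_ids(params, "you@example.org"))
```

The returned IDs could then be passed to Entrez.efetch to download only the records added or changed since the last sync. If the result count exceeds retmax, the search would need to be paged with retstart.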

Best,
-Michiel

--- On Wed, 6/27/12, Lifeng Lin <linlifeng at gmail.com> wrote:

> From: Lifeng Lin <linlifeng at gmail.com>
> Subject: [Biopython] download sequences by date from Genbank
> To: biopython at lists.open-bio.org
> Date: Wednesday, June 27, 2012, 5:20 PM
> Hi folks,
> 
> Is there an elegant way of downloading sequences from Genbank using a
> date as a cutoff?
> 
> I am trying to maintain an up-to-date local version of all sequences for
> a certain number of species. When "syncing" with Genbank, all I can think
> of is retrieving all GI numbers for these species once again, comparing
> them with what I have locally, and generating a list of new sequences to
> append. I have a hunch that there might be a better way of doing this:
> for example, if there is a date filter that we can apply to a Genbank
> download, then all the trouble of the comparisons would be saved.
> 
> Any suggestions?
> 
> best,
> L.