[Bioperl-l] Fetching > 500 sequences

Brian Osborne brian_osborne at cognia.com
Wed Mar 3 12:04:19 EST 2004


Brad,

Thank you, I'll apply the patch in bioperl-live.

Brian O.

-----Original Message-----
From: bioperl-l-bounces at portal.open-bio.org
[mailto:bioperl-l-bounces at portal.open-bio.org]On Behalf Of Brad Chapman
Sent: Wednesday, March 03, 2004 10:51 AM
To: bioperl
Subject: Re: [Bioperl-l] Fetching > 500 sequences

Hi Rolf, Martin;

> > It seems that I have problems with fetching more than 500 sequences from
> > Genbank using Bioperl. It looks like the script (attached below) fetches
all
> > the 7000+ sequences, but only 500 make it to the output file. Is there
any
> > way to get all these 7000+ sequences written to the file - that is, is
it
> > possible to sidestep the 500 seq. limit?

I actually debugged and fixed this problem recently for Biopython --
it looks like a change in the way EUtils works. If you pass 'retmax'
to the eutils URL then it will only give you back at max 500
sequences, no matter what you pass for this parameter. The fix I
found that worked was to not pass 'retmax'.

The attached patch to Bio/DB/Query/GenBank.pm should fix the
problem, if similar symptoms equal similar fixes in this case. An
actual Perl/BioPerl person should look at this, though, as I'm not
to be trusted for coding Perl :-).

Hope this helps.
Brad




More information about the Bioperl-l mailing list