[BioPython] Big GenBank files
Peter
biopython at maubp.freeserve.co.uk
Mon Apr 25 07:16:35 EDT 2005
aurelie.bornot at free.fr wrote:
> Hi !
>
> I am trying to make a program that do automatically blasts of a base of
> sequences against the genbank sequences. And I would like to retrieve (also
> automatically) the most interesting GenBank files..... to keep informations
> about them in my database.
>
> But I've got a problem (again..sorry ! :'( ) :
>
> I've 2*512 Mega of RAM but it seems that my computer can't deal with 'big'
> GenBank files like 'BA000028.3'(7 M) or 'AP008212' (37 M)
Have a look at bug 1747 which should help with reading large GenBank
files (however I'm not sure if it will affect GenBank.NCBIDictionary)
http://bugzilla.open-bio.org/show_bug.cgi?id=1747
The test code used was based on Section 3.4.2 of the Tutorial, Parsing
GenBank records:
http://www.biopython.org/docs/tutorial/Tutorial.html#htoc35
See also the discussion last month:-
http://www.biopython.org/pipermail/biopython/2005-March/002568.html
Peter
--
PhD Student
MOAC Doctoral Training Centre
University of Warwick, UK
More information about the BioPython
mailing list