[BioPython] Bug in GenBank.FeatureParser detected

Bonis Sanz, Julio JBonis at imim.es
Thu Nov 25 10:39:40 EST 2004


Hi, 

I was trying to parse a genbank record obtained by using eutils:

http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=nucleotide&id=51511729&rettype=gbwithparts&retmode=text&seq_start=46267941&seq_stop=46467941

It means, human chromosome 13, from 46267941 to 46467941. My idea is to give a position and a chromosome and then get the surrounding genes/mRNAs/CDS/SNPs in the area....

Well, I have found that when trying to use GenBank.FeatureParser() it shows an error here:

LOCUS       NC_000013             200001 bp    DNA     linear   CON 25-OCT-2004
DEFINITION  Homo sapiens chromosome 13, complete sequence.
ACCESSION   NC_000013 REGION: 46267941..46467941
VERSION     NC_000013.9  GI:51511729
KEYWORDS    HTG.

Specifically in line:

ACCESSION   NC_000013 REGION: 46267941..46467941

...

If you remove 'REGION: 46267941..46467941' then the parser works fine.

I guess it should be easy to fix in the Biopython code... but I get lost when trying to find where....

Someone can help? I suggest to add the fix as soon as possible...

Regards, 

Julio Bonis Sanz MD http://www.juliobonis.com/portal/

Research Group on Biomedical Informatics http://www.imim.es/grib/
Barcelona - Spain




More information about the BioPython mailing list