[BioPython] genbank parser returns start position of the location decreased by one

Martin MOKREJŠ mmokrejs at ribosome.natur.cuni.cz
Fri Oct 14 17:12:27 EDT 2005


Hi,
  I am either too tired or have missed some point. I use bipython 1.40b to
fetch data from genbank. The
location: (467..2863) from Genbank as seen on their web pages differs to the string
returned by biopython. I get location: (466..2863) instead. The latter number is
never decreased, only the first-one. What's wrong? ;)
http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=nucleotide&val=56117851
It happends with CDS feature data, but also with source, just anything:

FEATURES             Location/Qualifiers
     source          1..4115


$ python
Python 2.4.2 (#1, Oct  2 2005, 05:43:55) 
[GCC 3.4.4 (Gentoo 3.4.4-r1, ssp-3.4.4-1.0, pie-8.7.8)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> from Bio import GenBank
>>> record_parser = GenBank.FeatureParser()
>>> ncbi_dict = GenBank.NCBIDictionary('nucleotide', 'genbank', parser = record_parser)
>>> gb_seqrecord = ncbi_dict['56117851']
>>> print _feature.location
(0..4115)
>>> 


More information about the BioPython mailing list