[Biojava-l] Exception thrown when parsing GenBank file
Peter Cock
p.j.a.cock at googlemail.com
Mon Nov 14 12:29:31 EST 2011
On Mon, Nov 14, 2011 at 4:53 PM, Dietmar Birzer wrote:
>
> Hi all,
>
> I am currently trying to debug a little software application which
> uses BioJava's core-1.8.1.jar library because it has started to
> throw exceptions a while ago.
>
> I guess the problem is, that the GenbankLocationParser is not
> able to handle "Het" entries in the features section of the
> GenBank/GenPept format, e.g.
>
> Het join(bond(9),bond(125))
> /heterogen="( NA, 5 )"
>
> for database id 14719485 (http://www.ncbi.nlm.nih.gov/protein/14719485) .
Interesting - note the bond locations are not in the official
DDBJ/EMBL/GenBank feature table specification (v9, Oct 2011):
http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html
However, as noted on http://www.bioperl.org/wiki/BioPerl_Locations
that seems to be intended only for nucleotides and not proteins as here.
It might be worth contacting the NCBI to find out if there is an official
specification covering these location strings?
Peter
More information about the Biojava-l
mailing list