[Bioperl-l] load_seqdatabase error with a specific locus from genbank
Hilmar Lapp
hlapp at gmx.net
Thu Apr 9 03:51:52 UTC 2009
On Apr 8, 2009, at 11:35 PM, Hilmar Lapp wrote:
>
> On Apr 8, 2009, at 11:29 AM, Johann PELLET wrote:
>
>> [...]
>> and finally EU608407 and EU608559 made a crash:
>>
>> [...]
>> --------------------- WARNING ---------------------
>> MSG: Unexpected error in feature table for Skipping feature,
>> attempting to recover
>> ---------------------------------------------------
>> #######...14 times ...############
>
> I would assume that you figured out that this was triggered by or
> affected EU608407? Would you mind sharing how?
Looking at EU608407, it most likely wasn't the culprit or stumbling
stone. It must have been triggered before that.
> [...]
> So are there other sequences associated with that PubMed ID?
To answer my own question, it's indeed EU608407 that's from the same
PubMed ID, and so am I correct in assuming that you didn't get the
exception for that record, which would mean that the reference was
properly inserted when that sequence was loaded.
The second occurrence of the same PubMed ID should have actually
triggered a successful lookup of the previously inserted record, which
would then have skipped the insert. The fact that that didn't happen
suggests that the PubMed ID also wasn't properly extracted from the
Genbank record. So my first suspicion remains that your file is
corrupted.
Otherwise, if you download this record:
http://www.ncbi.nlm.nih.gov/nuccore/183191257
in GenBank format and try to load it alone, it should yield the same
error. Can you indeed reproduce the problem in that way?
-hilmar
--
===========================================================
: Hilmar Lapp -:- Durham, NC -:- hlapp at gmx dot net :
===========================================================
More information about the Bioperl-l
mailing list