[Bioperl-l] bp_genbank2gff3.pl error with circular genomes
Chris Fields
cjfields at illinois.edu
Tue Aug 17 15:24:23 EDT 2010
On Aug 17, 2010, at 10:53 AM, Chris Mungall wrote:
> You can merge this in. It should allow David to proceed.
Will do. I'll go ahead and delete the remote branch as well.
> I haven't kept up on synchrony between bioperl and GFF on circular genomes. The above fix is conservative in that it essentially preserves the GenBank coordinates even when the origin is crossed:
>
> http://github.com/bioperl/bioperl-live/commit/d752a4cb5168d1bb01f8c80247a57f66b2bd9daf
>
> However, if this is to conform to GFF3, then the resulting coordinates that cross the origin should have start/end incremented by the genome length.
Yes, that is a problem that needs to be addressed. Might be worth filing a bug report for tracking this; we can use David's example, or the one I recently added for phi-X174.
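To make the coordinate adjustment described above concrete, here is a rough sketch in Python (not BioPerl code, and not what the linked commit does). It takes the two ranges from David's error message and shifts everything after the origin crossing by the genome length. The function name `unwrap_segments` is made up for illustration, the genome length of 208369 is an assumption based on where the first range ends, and only forward-strand features that cross the origin once are handled.

```python
def unwrap_segments(segments, genome_length):
    """Shift segments that lie past the origin by the genome length.

    `segments` is an ordered list of (start, end) pairs, 1-based and
    inclusive, as they appear in a GenBank join(). Hypothetical helper:
    assumes a forward-strand feature that crosses the origin at most once.
    """
    shifted = []
    offset = 0
    prev_start = None
    for start, end in segments:
        # A start that falls before the previous segment's start means
        # the feature has wrapped around the origin.
        if prev_start is not None and start + offset < prev_start:
            offset += genome_length
        shifted.append((start + offset, end + offset))
        prev_start = start + offset
    return shifted


# Ranges from the error message; 208369 is an assumed genome length.
parts = unwrap_segments([(207497, 208369), (1, 687)], 208369)
print(parts)                         # [(207497, 208369), (208370, 209056)]
print(parts[0][0], parts[-1][1])     # 207497 209056
```

With the shift applied, start <= end holds for the feature as a whole, and the end coordinate runs past the length of the landmark sequence, which is how GFF3 represents a feature that crosses the origin of a circular genome.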
chris
> On Aug 17, 2010, at 6:51 AM, Chris Fields wrote:
>
>> I think Chris Mungall has a branch set up for this in bioperl:
>>
>> http://github.com/bioperl/bioperl-live/tree/circular
>>
>> Is that correct? Should we merge that code into the master branch?
>>
>> chris
>>
>> On Aug 17, 2010, at 8:44 AM, David Breimann wrote:
>>
>>> Hello,
>>>
>>> The following GenBank file has a gene that runs over the "end" of the
>>> chromosome and into its "beginning", and the script generates an
>>> error.
>>>
>>> ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Bacillus_cereus_ATCC_10987/NC_005707.gbk
>>>
>>> NC_005707 Unflattening error:
>>> Details:
>>> ------------- EXCEPTION: Bio::Root::Exception -------------
>>> MSG: PROBLEM, SEVERITY==2
>>> Ranges not in correct order. Strange ensembl genbank entry? Range:
>>> [207497,208369] [1,687]
>>> STACK: Error::throw
>>> STACK: Bio::Root::Root::throw /usr/local/share/perl/5.10.1/Bio/Root/Root.pm:473
>>> STACK: Bio::SeqFeature::Tools::Unflattener::problem
>>> /usr/local/share/perl/5.10.1/Bio/SeqFeature/Tools/Unflattener.pm:952
>>> STACK: Bio::SeqFeature::Tools::Unflattener::_check_order_is_consistent
>>> /usr/local/share/perl/5.10.1/Bio/SeqFeature/Tools/Unflattener.pm:2842
>>> STACK: Bio::SeqFeature::Tools::Unflattener::infer_mRNA_from_CDS
>>> /usr/local/share/perl/5.10.1/Bio/SeqFeature/Tools/Unflattener.pm:2713
>>> STACK: Bio::SeqFeature::Tools::Unflattener::unflatten_seq
>>> /usr/local/share/perl/5.10.1/Bio/SeqFeature/Tools/Unflattener.pm:1532
>>> STACK: main::unflatten_seq /usr/local/bin/bp_genbank2gff3.pl:1023
>>> STACK: /usr/local/bin/bp_genbank2gff3.pl:506
>>> -----------------------------------------------------------
>>>
>>> Best,
>>> Dave
>>> _______________________________________________
>>> Bioperl-l mailing list
>>> Bioperl-l at lists.open-bio.org
>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l
>>
>