[Bioperl-l] AlignIO crashes when reading stockholm alignment with negative PDB ranges
Chris Fields
cjfields at illinois.edu
Mon Jan 18 21:19:30 EST 2010
Alexandr,
Posting the bug report would be great, should be an easy enough fix.
chris
On Jan 18, 2010, at 7:04 PM, Jason Stajich wrote:
> Alexandr -
>
> Thanks for getting back to us - I am guessing the parser needs to recognize negative coordinates around about line 370 in Bio/AlignIO/Handler/GenericAlignHandler.pm which assumes a split on '-' will be sufficient.
>
> Can you post it as a bug to bugzilla along with attaching a record and script that replicates the problem so a test can be written for this. http://bugzilla.open-bio.org/
>
> -jason
> On Jan 18, 2010, at 4:54 PM, Alexandr Bezginov wrote:
>
>> I have contacted Pfam, and I have been told that The PDB file actually
>> does include a reference to residue "-1":
>>
>> DBREF 1E5N A -1 347 UNP P14768 XYNA_PSEFL 264 611
>>
>> DBREF 1E5N B -1 347 UNP P14768 XYNA_PSEFL 264 611
>>
>>
>> Since negative numbers are allowed in PDB, the data should probably be
>> considered valid.
>>
>> There are quite a few records like this, so this is not an isolated issue.
>>
>> Alexandr
>>
>> On 1/14/2010 7:20 PM, Jason Stajich wrote:
>>> Seems like improper data really -- "-1" is an improper coordinate as far
>>> as the parser is concerned. You may want to tell Pfam that there is
>>> possible error in the dumper since that was the only record that had
>>> this problem?
>>>
>>> -jason
>>> On Jan 13, 2010, at 5:57 PM, albezg wrote:
>>>
>>>> Hi all,
>>>>
>>>> I have a problem using AlignIO to read Pfam database:
>>>> ftp://ftp.sanger.ac.uk/pub/databases/Pfam/current_release/Pfam-A.seed.gz
>>>> The database is in STOCKHOLM 1.0 format. AlignIO can read the
>>>> alignment OK until the alignment PF00331.13. There it crashes with the
>>>> following message:
>>>>
>>>> ------------- EXCEPTION: Bio::Root::Exception -------------
>>>> MSG: '1-344' is not an integer.
>>>>
>>>> STACK: Error::throw
>>>> STACK: Bio::Root::Root::throw
>>>> /home/albezg/lib/perl5/site_perl/5.10.0/Bio/Root/Root.pm:368
>>>> STACK: Bio::Range::end
>>>> /home/albezg/lib/perl5/site_perl/5.10.0/Bio/Range.pm:228
>>>> STACK: Bio::Annotation::Target::new
>>>> /home/albezg/lib/perl5/site_perl/5.10.0/Bio/Annotation/Target.pm:82
>>>> STACK: Bio::AlignIO::Handler::GenericAlignHandler::_stockholm_target
>>>> /home/albezg/lib/perl5/site_perl/5.10.0/Bio/AlignIO/Handler/GenericAlignHandler.pm:293
>>>>
>>>> STACK: Bio::AlignIO::Handler::GenericAlignHandler::data_handler
>>>> /home/albezg/lib/perl5/site_perl/5.10.0/Bio/AlignIO/Handler/GenericAlignHandler.pm:73
>>>>
>>>> STACK: Bio::AlignIO::stockholm::next_aln
>>>> /home/albezg/lib/perl5/site_perl/5.10.0/Bio/AlignIO/stockholm.pm:471
>>>> STACK: /home/albezg/scripts/pfam2fasta.pl:22
>>>> -----------------------------------------------------------
>>>>
>>>> It appears this is caused by this entry:
>>>> #=GS XYNA_PSEFL/263-608 DR PDB; 1e5n B; -1-344;
>>>>
>>>> I don't care about residues in PDB, so I have just removed minus signs
>>>> from the ranges. This seems to have fixed the crashing.
>>>>
>>>> Is it a known problem? Is there a solution for it?
>>>>
>>>> Thanks,
>>>> Alexandr
>>>>
>>>>
>>>> On 03/20/2009 05:09 PM, albezg wrote:
>>>>>
>>>>> I'm trying to change FASTA header(display_id) for a sequence in an
>>>>> alignment(SimpleAlign).
>>>>>
>>>>> There are no issues when I print it, however when I use AlignIO to write
>>>>> the alignment to a FASTA file, it does not work. Is this behavior
>>>>> intended?
>>>>>
>>>>> Demo code: http://github.com/jhannah/sandbox/tree/master/Bio_AlignIO_bug
>>>>>
>>>>> The error:
>>>>> ------------- EXCEPTION -------------
>>>>> MSG: No sequence with name [1/1-11]
>>>>> STACK Bio::SimpleAlign::displayname
>>>>> /scratch/BioSoftware/bioperl-live/Bio/SimpleAlign.pm:2659
>>>>> STACK Bio::AlignIO::fasta::write_aln
>>>>> /scratch/BioSoftware/bioperl-live/Bio/AlignIO/fasta.pm:200
>>>>> STACK toplevel ./demo.pl:14
>>>>> -------------------------------------
>>>>>
>>>>> Alexandr
>>>>
>>>> _______________________________________________
>>>> Bioperl-l mailing list
>>>> Bioperl-l at lists.open-bio.org
>>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l
>>>
>>> --
>>> Jason Stajich
>>> jason.stajich at gmail.com
>>> jason at bioperl.org
>>> http://fungalgenomes.org/
>>>
>>
>
> --
> Jason Stajich
> jason.stajich at gmail.com
> jason at bioperl.org
> http://fungalgenomes.org/
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l
More information about the Bioperl-l
mailing list