[Bioperl-l] Bio::DB::GFF/Postgres test failures

Chris Fields cjfields at illinois.edu
Fri Apr 8 18:56:40 UTC 2011


Scott, 

I'll try documenting the Pg error for SF::Store in the next hour.  Had my hands full with the GSoC onslaught of emails and local $job stuff.  Would like to get it fixed for the CPAN release.

chris

On Apr 8, 2011, at 12:58 PM, Scott Cain wrote:

> OK, I'll take it out and move on to the next problem.
> 
> Thanks,
> Scott
> 
> 
> On Fri, Apr 8, 2011 at 1:51 PM, Lincoln Stein <lincoln.stein at gmail.com> wrote:
>> Oh right. The Bio::DB::GFF adaptor has that broken behavior and it is too
>> late to change it now. (Bio::DB::SeqFeature::Store had better not!) Best to
>> remove the test altogether.
>> Lincoln
>> 
>> On Fri, Apr 8, 2011 at 1:18 PM, Scott Cain <scott at scottcain.net> wrote:
>>> 
>>> Hi Lincoln,
>>> 
>>> Yes, apparently, it does.  It does this for both the memory and the
>>> postgres adaptors.  I looked at how the data was stored in the feature
>>> object with Data::Dumper and that is how it is represented in the hash
>>> too.  Perhaps this test should be calling the "absolute" method first?
>>> 
>>> Scott
>>> 
>>> 
>>> On Fri, Apr 8, 2011 at 1:10 PM, Lincoln Stein <lincoln.stein at gmail.com>
>>> wrote:
>>>> Do start() and end() flip values for minus strand features? This isn't
>>>> supposed to happen.
>>>> Lincoln
>>>> 
>>>> On Fri, Apr 8, 2011 at 11:41 AM, Scott Cain <scott at scottcain.net> wrote:
>>>>> 
>>>>> Hi Lincoln,
>>>>> 
>>>>> I've been looking into some test failures with the postgres adaptor
>>>>> for Bio::DB::GFF and I wanted to check with you that I'm interpreting
>>>>> this correctly.  In t/LocalDB/BioDBGFF.t there are these lines:
>>>>> 
>>>>>  @features = sort {$a->start<=>$b->start} @features;
>>>>> 
>>>>>  is($features[0]->type,'Component:reference');
>>>>>  is($features[-1]->type,'exon:confirmed');
>>>>> 
>>>>> So that the features in the data set are sorted by their start values
>>>>> and the beginning and end of the list are checked.  The test refers to
>>>>> the test.gff data file, that contains among others these lines:
>>>>> 
>>>>> Contig1 confirmed   transcript      30001   31000   .   -   .
>>>>> Transcript trans-2; Gene "xyz-2"; Note "Terribly interesting"
>>>>> Contig1 confirmed   exon    30001   30100   .   -   .   Transcript
>>>>> trans-2; Gene "abc-1"; Note "function unknown"
>>>>> Contig1 confirmed   exon    30701   30800   .   -   .   Transcript
>>>>> trans-2
>>>>> Contig1 confirmed   exon    30801   31000   .   -   .   Transcript
>>>>> trans-2
>>>>> 
>>>>> Since this transcript and its exons are on the minus strand, the
>>>>> values that the start and stop method return will be reversed, so that
>>>>> start for the transcript will be 31000 and stop will be 30001.  The
>>>>> problem with this test is since the last exon and the transcript share
>>>>> a start value (31000), you can't really be sure which one will be at
>>>>> the bottom of the list after sorting, right?  In the case of the
>>>>> postgres adaptor, it fails this test on my machine because the
>>>>> transcript is at the bottom of the list.  The test for the beginning
>>>>> of the list similarly could fail though it didn't in my case, as other
>>>>> features that have 1 as a start are of type "Component:clone".
>>>>> 
>>>>> So, my question is this: am I missing something, and the postgres
>>>>> adaptor is not behaving as expected, or are these tests ambiguous?
>>>>> 
>>>>> Thanks,
>>>>> Scott
>>>>> 
>>>>> 
>>>>> --
>>>>> 
>>>>> ------------------------------------------------------------------------
>>>>> Scott Cain, Ph. D.                                   scott at scottcain
>>>>> dot net
>>>>> GMOD Coordinator (http://gmod.org/)                     216-392-3087
>>>>> Ontario Institute for Cancer Research
>>>> 
>>>> 
>>>> 
>>>> --
>>>> Lincoln D. Stein
>>>> Director, Informatics and Biocomputing Platform
>>>> Ontario Institute for Cancer Research
>>>> 101 College St., Suite 800
>>>> Toronto, ON, Canada M5G0A3
>>>> 416 673-8514
>>>> Assistant: Renata Musa <Renata.Musa at oicr.on.ca>
>>>> 
>>> 
>>> 
>>> 
>>> --
>>> ------------------------------------------------------------------------
>>> Scott Cain, Ph. D.                                   scott at scottcain
>>> dot net
>>> GMOD Coordinator (http://gmod.org/)                     216-392-3087
>>> Ontario Institute for Cancer Research
>> 
>> 
>> 
>> --
>> Lincoln D. Stein
>> Director, Informatics and Biocomputing Platform
>> Ontario Institute for Cancer Research
>> 101 College St., Suite 800
>> Toronto, ON, Canada M5G0A3
>> 416 673-8514
>> Assistant: Renata Musa <Renata.Musa at oicr.on.ca>
>> 
> 
> 
> 
> -- 
> ------------------------------------------------------------------------
> Scott Cain, Ph. D.                                   scott at scottcain dot net
> GMOD Coordinator (http://gmod.org/)                     216-392-3087
> Ontario Institute for Cancer Research
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l





More information about the Bioperl-l mailing list