[Bioperl-l] Clear range from Bio::Seq::Quality?
Chris Fields
cjfields at illinois.edu
Fri Apr 24 14:56:34 EDT 2009
You could submit this as a diff against Bio::Seq::Quality to
bugzilla. If possible, tests don't hurt either!
chris
On Apr 24, 2009, at 11:20 AM, Dan Bolser wrote:
> Its a bit rough and ready, but it does what I need...
>
>
>
>
> =head2 get_clear_range
>
> Title : get_clear_range
>
> Title : subqual
> Usage : $subobj = $obj->get_clear_range();
> $subobj = $obj->get_clear_range(20);
> Function : Get the clear range using the given quality score as a
> cutoff or a default value of 13.
>
> Returns : a new Bio::Seq::Quality object
> Args : a minimum quality value, optional, devault = 13
>
> =cut
>
> sub get_clear_range
> {
> my $self = shift;
> my $qual = $self->qual;
> my $minQual = shift || 13;
>
> my (@ranges, $rangeFlag);
>
> for(my $i=0; $i<@$qual; $i++){
> ## Are we currently within a clear range or not?
> if(defined($rangeFlag)){
> ## Did we just leave the clear range?
> if($qual->[$i]<$minQual){
> ## Log the range
> push @ranges, [$rangeFlag, $i-1, $i-$rangeFlag];
> ## and reset the range flag.
> $rangeFlag = undef;
> }
> ## else nothing changes
> }
> else{
> ## Did we just enter a clear range?
> if($qual->[$i]>=$minQual){
> ## Better set the range flag!
> $rangeFlag = $i;
> }
> ## else nothing changes
> }
> }
> ## Did we exit the last clear range?
> if(defined($rangeFlag)){
> my $i = scalar(@$qual);
> ## Log the range
> push @ranges, [$rangeFlag, $i-1, $i-$rangeFlag];
> }
>
> unless(@ranges){
> die "There is no clear range... I don't know what to do here!\n";
> }
>
> print "there are ", scalar(@ranges), " clear ranges\n";
>
> my $sum; map {$sum += $_->[2]} @ranges;
>
> print "of ", scalar(@$qual), " bases, there are $sum with ".
> "quality scores above the given threshold\n";
>
> for (sort {$b->[2] <=> $a->[2]} @ranges){
> if($_->[2]/$sum < 0.5){
> warn "not so much a clear range as a clear chunk...\n";
> }
> print $_->[2], "\t", $_->[2]/$sum, "\n";
>
> return Bio::Seq::QualityDB->new( -seq => $self->subseq( $_->[0]+1,
> $_->[1]+1),
> -qual => $self->subqual($_->[0]+1, $_->[1]+1)
> );
> }
> }
>
>
>
>
> Note, for testing I made a package called Bio/Seq/QualityDB.pm (which
> is a copy of Bio/Seq/Quality.pm that just has the above method added).
> That is why the 'new Bio::Seq::Quality object' is actually a
> Bio::Seq::QualityDB object, but other than that it should slot right
> in (apart from all the debugging output that I spit out).
>
>
> Cheers,
> Dan.
>
>
> 2009/4/24 Dan Bolser <dan.bolser at gmail.com>:
>> Hi all,
>>
>> I couldn't find out how to get the 'clear range' from a
>> Bio::Seq::Quality object... Am I looking in the wrong place, or
>> should
>> this method be a part of the Bio::Seq::Quality class?
>>
>> In the latter case I'm on my way to an implementation, but I am not
>> good at navigating the bioperl docs, so I thought I should ask before
>> I take the time to finish that off.
>>
>>
>> Cheers,
>> Dan.
>>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l
More information about the Bioperl-l
mailing list