[Bioperl-l] commandexts and array refs bug - found problem, not sure of best fix

Ben Bimber bimber at wisc.edu
Thu Jun 10 12:46:04 EDT 2010


i suspect this piece of code is seldom encountered.  if any existing
bioperl wrapper actually tried to pass an arrayref to commandexts, it
would hit this same error.  i only encountered this using samtools
merge.

I looked a little more into exactly what @switch is doing.  it seems
to allow you to supply switches specific to a given file.  the bowtie
wrapper uses it with paired end data (the first fasta gets a -1 and
and second gets a -2).  however, i dont see anywhere where this
actually applies in a multi-file situation.  nonetheless, reversing
the order of the two existing splice() (lines 991-992) I think will
preserve whatever functionality was there and prevent the error:

    my @files = @args{@specs};
    # expand arrayrefs
    my $l = $#files;
    for (0..$l) {
       if (ref($files[$_]) eq 'ARRAY') {
           splice(@switches, $_, 1, ($switches[$_]) x @{$files[$_]});
           splice(@files, $_, 1, @{$files[$_]});
       }
    }

As I write this email and look at the code above a little more, I
think that the current approach used to expand these arrayrefs will
also break down in other situations, including when the arrayref is
not in the last element of the @files array or when 2 arrayrefs are
passed.  However, it seems that this is barely used and it really
depends how much time/thought you want to put into it.

-Ben





On Thu, Jun 10, 2010 at 9:58 AM, Chris Fields <cjfields at illinois.edu> wrote:
> On Jun 10, 2010, at 8:43 AM, Ben Bimber wrote:
>
>> hello,
>>
>> a little while ago i posted a bug about commandexts when you pass an
>> array ref.  when you pass an arrayref of files, instead of a string
>> with 1 file, you get an error saying 'cannot use string '......' as an
>> array ref while strict refs in place'.  i believe i understand the
>> bug, but am not entirely sure the best way to fix it.  here's the
>> original code from commandexts.pm, starting line 989:
>>
>>    my @files = @args{@specs};
>>    # expand arrayrefs
>>    my $l = $#files;
>>    for (0..$l) {
>>       if (ref($files[$_]) eq 'ARRAY') {
>>           splice(@files, $_, 1, @{$files[$_]});
>>           splice(@switches, $_, 1, ($switches[$_]) x @{$files[$_]});
>>       }
>>    }
>>
>> It throws the error on the second splice().  The reason is b/c
>> $files[$_] isnt an array ref anymore, b/c we just expanded it in the
>> line before.  Truthfully, I dont entirely understand why we're
>> expanding out @switches for each file.  Maybe i'm missing something
>> obvious, but why would we want these copied?
>
> Maybe @file and @switches are in-sync and thus need similar expansion?  Not sure, personally.
>
>> however, there's couple ways around it:
>>
>> 1. save the value of  @{$files[$_]} by adding: my @tmp =
>> @{$files[$_]};, then using @tmp on that line
>> 2. put the second splice() before the first
>> 3. if the idea is actually to expand @switches against every file,
>> then move this outside the for loop
>
> The first one seems safest.  The real question is, do any of these solutions work?  Do any of the modules requiring the command extensions in bioperl-run show problems, for instance?
>
>> I originally posted that this could be fixed by surrounding the
>> problem line with 'no strict'/ 'use strict'; however, that really only
>> worked b/c the command I was using did not actually have any switches.
>> I doubt it would work properly if your command had them.
>>
>> -Ben
>
> I think, if one of the proposed solutions works, pick which you think is the best and go with it.  Test it out, then we can commit this to github.
>
> BTW, sorry about the warnocking, a number of us (myself included) have been extremely busy as of late.
>
> chris
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l
>




More information about the Bioperl-l mailing list