From heikki at ebi.ac.uk  Wed Oct  1 12:30:47 2003
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Wed Oct  1 12:28:42 2003
Subject: [Bioperl-l] Bioperl Developer snapshot 1.3.01
Message-ID: <1065025847.2553.112.camel@localhost>


   Bioperl developer snap shot  1.3.01
   -----------------------------------


This is the first developer snap shot from the BioPerl CVS head
that will eventually lead to release 1.4.

Note that this is called snap shot rather than release. Number of things
are guarantied to fail. The meaning of the snapshot is get more people
to move from 1.2 series into using the latest code base to have enough
momentum to put together a solid release.

1.3 series will be available only from the BioPerl FTP site and its
mirrors, not from CPAN. Depending on developer activity new snapshots
will be made available weekly or forth-nightly. These snap shots will be
taken from the CVS head until there is a good reason to branch.

http://bioperl.org/DIST/current_core_unstable.tar.gz
http://bioperl.org/DIST/bioperl-1.3.01.tar.gz


New things is featured for the first time in 1.3 which will be relaesed
in 1.4 include:

- installable scripts
- Bio::Popgen
- Bio::Restriction
- Bio::Tools::Analysis - web based DNA and Protein analysis tools
- Bio::Seq::Meta - per residue annotable sequences
- Bio::Matrix::IO
- improved Bio::Tools::Genewise
- Bio::Tools::SiRNA, Bio::SeqFeature::SiRNA - small inhibitory RNA
- Bio::SeqFeature::Tools - seqFeature mapping tools
- Bio::Tools::dpAlign - pure perl dynamic programming sequence alignment
- new Bio::SearchIO formats
- new parsers in Bio::Tools:
      Blat,  Geneid, Lagan, Mdust, Promoterwise,PrositeScan,  
- hundreds of new and improved files 


Start these now! All feed back to bioperl-l@bioperl.org

	-Heikki
	

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________

From bmadsen at mathworks.com  Wed Oct  1 18:25:06 2003
From: bmadsen at mathworks.com (Brian Madsen)
Date: Wed Oct  1 18:22:57 2003
Subject: [Bioperl-l] Installation Problems
Message-ID: <DB873318D1A41648BEAC4B5AAB63B3E101017FFB@MESSAGE-AH.ad.mathworks.com>

I'm new to Bioperl and am trying to complete an installation. The
company has a Solaris server, and I don't have write privileges to the
perl5/site_perl source area, so I followed the instructions to install
to a personal module area.
In addition to the bioperl-1.2.3 tar file, I also ran installations for
bioperl-ext-06, Bundle-BioPerl-2.05 and a separate installation for
IO-String-1.02. In every case, I used: PREFIX=/home/bmadsen/bjm_perl as
a parameter in building the makefile, and all of the installations were
successful.
Still, when I run the bptutorial.pl file, I get the error: "Can't locate
IO/String.pm in @INC (@INC contains: ..." followed by a half-dozen
notices about failed compilation. I've even tried copy the String.pm
file into the bioperl-1.2.3 directory, but it doesn't help.
I have already printed bptutorial.pl and am reading through it, but I
had hoped to run it interactively. Any advice?
Thanks
-Brian

From cjm at fruitfly.org  Wed Oct  1 20:12:06 2003
From: cjm at fruitfly.org (Chris Mungall)
Date: Wed Oct  1 19:26:27 2003
Subject: [Bioperl-l] Bio::Tools::GFF GFF3 parsing
In-Reply-To: <200309261144.22482.lstein@cshl.edu>
Message-ID: <Pine.LNX.4.33.0310011707270.1678-100000@heartbroken.lbl.gov>


The main thing it seems to be missing is use of the ID and Parent tags to
indicate the relationship of features to one another.

I'm not really sure how best to do this, as this requires generating a
unique identifier for any feature that contains other features. The code
on the freaky-dev branch had a way of dealing with this, didn't it? I seem
to recall this would require some big changes to the core model.

Otherwise it looks great!

On Fri, 26 Sep 2003, Lincoln Stein wrote:

> Hi Jason,
>
> This is very good and I'm looking it over now.  Does it handle the CIGAR lines
> for gapped alignments?
>
> Lincoln
>
> On Thursday 18 September 2003 07:02 pm, Jason Stajich wrote:
> > I added support for GFF3 parsing to Bio::Tools::GFF and added some simple
> > tests.  I'm not 100% I have GFF3 output correct so Chris Mungall, Lincoln
> > if you don't mind giving it a look over that would be great.  If I've
> > duplicated functionality from somewhere else let me know, but I think
> > Bio::Tools::GFF needs to be able to parse in GFF3 format at some point.
> >
> > The GFF3 parsing seems to work fine for processing the BGDP annotations so
> > I feel confident it is working correctly, but more testing is welcomed!
> >
> > This is no real support for Unicode type output, the simpliest solution
> > would be to rely on HTML::Entities for encoding non-ASCII codes.  I wasn't
> > sure I wanted to make Tools::GFF depend on this right now so I've just
> > implemented a simple encoding for '=,;'.  Feel free to fix this if it
> > needs to be more aggresive.
> >
> > -jason
>
>

From heikki at ebi.ac.uk  Thu Oct  2 06:00:57 2003
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Thu Oct  2 05:58:51 2003
Subject: [Bioperl-l] Installation Problems
In-Reply-To: <DB873318D1A41648BEAC4B5AAB63B3E101017FFB@MESSAGE-AH.ad.mathworks.com>
References: <DB873318D1A41648BEAC4B5AAB63B3E101017FFB@MESSAGE-AH.ad.mathworks.com>
Message-ID: <1065088856.4556.49.camel@localhost>

Brian,

Have you set up environmental variable PERL5LIB? 

Here is a command line example (which is best put into shell resource
file):

   bash: % export PERL5LIB='$HOME/src/bioperl'
   tcsh: % setenv PERL5LIB '$HOME/src/bioperl'

Using ':' as a separator you can add more than one directory.

In your case for perl to find IO::String, PERL5LIB should contain
'/home/bmadsen/bjm_perl' and in that directory you should have directory
'IO' with file 'String.pm'

Hope this helps,

	-Heikki


On Wed, 2003-10-01 at 23:25, Brian Madsen wrote:
> I'm new to Bioperl and am trying to complete an installation. The
> company has a Solaris server, and I don't have write privileges to the
> perl5/site_perl source area, so I followed the instructions to install
> to a personal module area.
> In addition to the bioperl-1.2.3 tar file, I also ran installations for
> bioperl-ext-06, Bundle-BioPerl-2.05 and a separate installation for
> IO-String-1.02. In every case, I used: PREFIX=/home/bmadsen/bjm_perl as
> a parameter in building the makefile, and all of the installations were
> successful.
> Still, when I run the bptutorial.pl file, I get the error: "Can't locate
> IO/String.pm in @INC (@INC contains: ..." followed by a half-dozen
> notices about failed compilation. I've even tried copy the String.pm
> file into the bioperl-1.2.3 directory, but it doesn't help.
> I have already printed bptutorial.pl and am reading through it, but I
> had hoped to run it interactively. Any advice?
> Thanks
> -Brian
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________

From e_lyn_goh at yahoo.com  Thu Oct  2 01:36:53 2003
From: e_lyn_goh at yahoo.com (Eileen goh)
Date: Thu Oct  2 08:22:37 2003
Subject: [Bioperl-l] Can't locate loadable object for module GD 
Message-ID: <20031002053653.6202.qmail@web40505.mail.yahoo.com>

hi, 
 i am trying create graphics for my output but i encounter the following exception :
 
Can't locate loadable object for module GD in @INC (@INC contains: C:/Perl/lib C
:/Perl/site/lib .) at C:/Perl/site/lib/Bio/Graphics/Glyph/Factory.pm line 55
Compilation failed in require at C:/Perl/site/lib/Bio/Graphics/Glyph/Factory.pm
line 55.
BEGIN failed--compilation aborted at C:/Perl/site/lib/Bio/Graphics/Glyph/Factory
.pm line 55.
Compilation failed in require at C:/Perl/site/lib/Bio/Graphics/Panel.pm line 4.
BEGIN failed--compilation aborted at C:/Perl/site/lib/Bio/Graphics/Panel.pm line
 4.
Compilation failed in require at C:/Perl/site/lib/Bio/Graphics.pm line 3.
BEGIN failed--compilation aborted at C:/Perl/site/lib/Bio/Graphics.pm line 3.
Compilation failed in require at chart.pl line 4.
BEGIN failed--compilation aborted at chart.pl line 4.
 
i have downloaded and installed the module GD but i doesnt work. could it be i installed wrongly?
 
 
my codes:
#!/usr/bin/perl
use strict;
use Bio::Graphics;
use Bio::SeqFeature::Generic;
my $panel = Bio::Graphics::Panel->new(-length => 1000,-width => 800);
my $track = $panel->add_track(-glyph => 'generic',-label =>1);
while(<>) {
 chomp;
 next if /^\#/;
 my($name, $score, $start, $end) = spilt/\t+/;
 my $feature = Bio::SeqFeature::Generic->new(-display_name=>$name,-score=>$score,
          -start=>$start, -end=>$end);
 $tracj->add_feature($feature);
}
print $panel->png;

 
Thanks 
blur queen


---------------------------------
Do you Yahoo!?
The New Yahoo! Shopping - with improved product search
From jason at cgt.duhs.duke.edu  Thu Oct  2 09:01:31 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Thu Oct  2 08:59:43 2003
Subject: [Bioperl-l] RE: problem connecting to remote blast
In-Reply-To: <EF4D1D640345BE4890E5594F109B5E8329B573@motivation.sit>
References: <EF4D1D640345BE4890E5594F109B5E8329B573@motivation.sit>
Message-ID: <Pine.LNX.4.50.0310020901010.20483-100000@tenero.duhs.duke.edu>

I really don't know - i don't really have time to debug this for you I'm
sorry - please post your question to the bioperl list.

On Mon, 29 Sep 2003, 010381P GOH PHUAY CHENG wrote:

> hi,
> i have already remove the entrez query part but i still encountered the same
> warning.
>
> my codes:
>
> #!/usr/bin/perl -w
> use Bio::SeqIO;
> use Bio::Tools::Run::RemoteBlast;
> use strict;
>   my $prog = 'blastn';
>   my $db   = 'ecoli.nt';
>   my $e_val= '1e-10';
>   my @params = ( '-prog' => $prog,
>          '-data' => $db,
>          '-expect' => $e_val,
>          '-readmethod' => 'SearchIO' );
>   my $factory = Bio::Tools::Run::RemoteBlast->new(@params);
>
>   #delete $Bio::Tools::Run::RemoteBlast::HEADER{'FILTER'};
>   my $v = 1;
>
>   my $str = Bio::SeqIO->new(-file=>'test.txt' , '-format' => 'fasta' );
>   while (my $input = $str->next_seq()){
>
>     #my $r = $factory->submit_blast($input);
>     my $r = $factory->submit_blast('test.txt');
>     print STDERR "waiting..." if( $v > 0 );
>     while ( my @rids = $factory->each_rid ) {
>       foreach my $rid ( @rids ) {
>         my $rc = $factory->retrieve_blast($rid);
>         if( !ref($rc) ) {
>           if( $rc < 0 ) {
>             $factory->remove_rid($rid);
>           }
>           print STDERR "." if ( $v > 0 );
>           sleep 5;
>         } else {
>           my $result = $rc->next_result();
>           #save the output
>           my $filename = $result->query_name()."\.out";
>           $factory->save_output($filename);
>           $factory->remove_rid($rid);
>           print "\nQuery Name: ", $result->query_name(), "\n";
>           while ( my $hit = $result->next_hit ) {
>             next unless ( $v > 0);
>             print "\thit name is ", $hit->name, "\n";
>             while( my $hsp = $hit->next_hsp ) {
>               print "\t\tscore is ", $hsp->score, "\n";
>             }
>           }
>         }
>       }
>     }
>   }
>
> -------------------- WARNING ---------------------
> MSG: <p><!--
> QBlastInfoBegin
>         Status=READY
> QBlastInfoEnd
> --><p>
> <b><FONT color="red">INFO: [blastsrv4.REAL]: Error: Segmentation violation
> (or m
> emory usage limit was exceeded) SIGSEGV (11).
> </FONT></B><p><HR><p>
> <PRE>
> BLASTN 2.2.6 [Apr-09-2003]
>
> <pre>RID: 1064815728-21972-271666.BLASTQ3
> Query= Test
>          (560 letters)
>
> <b>No significant similarity found.</b> For reasons why, <A HREF =
> "/blast/blast
> _FAQs.html#no hits"><b>click here</A>.</b><br><br>
>
>
> ---------------------------------------------------
>
>
>
>
> -----Original Message-----
> From:	Jason Stajich [mailto:jason@cgt.duhs.duke.edu]
> Sent:	Mon 9/29/2003 9:32 AM
> To:	010381P GOH PHUAY CHENG
> Cc:
> Subject:	Re: problem connecting to remote blast
>
> I suppose it might be because you specified an entrez query for homo
> sapiens on the ecoli db - that is not going to return any hits.  remove
> the entrez query part perhaps.
>
> please post your questions to the bioperl list in the future -
> bioperl-l@bioperl.org
>
> Thanks,
> -jason
>
> On Mon, 29 Sep 2003, 010381P GOH PHUAY CHENG wrote:
>
> > hi Jason,
> > i encountered some problems running the remote blast.could u please check
> for
> > me wats wrong with my codes. Thanks.
> >
> > phuay cheng
> >
> > -------------------- WARNING ---------------------
> > MSG: <p><!--
> > QBlastInfoBegin
> >         Status=READY
> > QBlastInfoEnd
> > --><p>
> > <b><FONT color="red">INFO: [blastsrv4.REAL]: Error: Segmentation violation
> > (or m
> > emory usage limit was exceeded) SIGSEGV (11).
> > </FONT></B><p><HR><p>
> > <PRE>
> > BLASTN 2.2.6 [Apr-09-2003]
> >
> > <pre>RID: 1064796976-1676-2574126.BLASTQ3
> > Query= Test
> >          (560 letters)
> >
> > <b>No significant similarity found.</b> For reasons why, <A HREF =
> > "/blast/blast
> > _FAQs.html#no hits"><b>click here</A>.</b><br><br>
> >
> >
> > ---------------------------------------------------
> >
> > #!/usr/bin/perl -w
> > use Bio::SeqIO;
> > use Bio::Tools::Run::RemoteBlast;
> >   use strict;
> >   my $prog = 'blastn';
> >   my $db   = 'ecoli.nt';
> >   #my $e_val= '1e-10';
> >   my @params = ( '-prog' => $prog,
> >          '-data' => $db,
> >          '-readmethod' => 'SearchIO' );
> >   my $factory = Bio::Tools::Run::RemoteBlast->new(@params);
> >
> >   $Bio::Tools::Run::RemoteBlast::HEADER{'ENTREZ_QUERY'} = 'Homo sapiens
> > [ORGN]';
> >
> >   delete $Bio::Tools::Run::RemoteBlast::HEADER{'FILTER'};
> >   my $v = 1;
> >
> >   my $str = Bio::SeqIO->new(-file=>'test.txt' , '-format' => 'fasta' );
> >   while (my $input = $str->next_seq()){
> >
> >     #my $r = $factory->submit_blast($input);
> >     my $r = $factory->submit_blast('test.txt');
> >     print STDERR "waiting..." if( $v > 0 );
> >     while ( my @rids = $factory->each_rid ) {
> >       foreach my $rid ( @rids ) {
> >         my $rc = $factory->retrieve_blast($rid);
> >         if( !ref($rc) ) {
> >           if( $rc < 0 ) {
> >             $factory->remove_rid($rid);
> >           }
> >           print STDERR "." if ( $v > 0 );
> >           sleep 5;
> >         } else {
> >           my $result = $rc->next_result();
> >           #save the output
> >           my $filename = $result->query_name()."\.out";
> >           $factory->save_output($filename);
> >           $factory->remove_rid($rid);
> >           print "\nQuery Name: ", $result->query_name(), "\n";
> >           while ( my $hit = $result->next_hit ) {
> >             next unless ( $v > 0);
> >             print "\thit name is ", $hit->name, "\n";
> >             while( my $hsp = $hit->next_hsp ) {
> >               print "\t\tscore is ", $hsp->score, "\n";
> >             }
> >           }
> >         }
> >       }
> >     }
> >   }
> >
>
> --
> Jason Stajich
> Duke University
> jason at cgt.mc.duke.edu
>
>
>
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From skirov at utk.edu  Thu Oct  2 10:20:59 2003
From: skirov at utk.edu (Stefan Kirov)
Date: Thu Oct  2 10:19:13 2003
Subject: [Bioperl-l] RE: problem connecting to remote blast
In-Reply-To: <Pine.LNX.4.50.0310020901010.20483-100000@tenero.duhs.duke.edu>
References: <EF4D1D640345BE4890E5594F109B5E8329B573@motivation.sit>
	<Pine.LNX.4.50.0310020901010.20483-100000@tenero.duhs.duke.edu>
Message-ID: <3F7C344B.30404@utk.edu>

Did you try to run your search from the web interface as well? Did it 
work? Looking at the error you get I guess something might be wrong with 
your test.txt file: format violation, size, etc. Try it from the web 
interface (or if you have the db and blastn locally) to validate the input.
Good luck
Stefan

Jason Stajich wrote:

>I really don't know - i don't really have time to debug this for you I'm
>sorry - please post your question to the bioperl list.
>
>On Mon, 29 Sep 2003, 010381P GOH PHUAY CHENG wrote:
>
>  
>
>>hi,
>>i have already remove the entrez query part but i still encountered the same
>>warning.
>>
>>my codes:
>>
>>#!/usr/bin/perl -w
>>use Bio::SeqIO;
>>use Bio::Tools::Run::RemoteBlast;
>>use strict;
>>  my $prog = 'blastn';
>>  my $db   = 'ecoli.nt';
>>  my $e_val= '1e-10';
>>  my @params = ( '-prog' => $prog,
>>         '-data' => $db,
>>         '-expect' => $e_val,
>>         '-readmethod' => 'SearchIO' );
>>  my $factory = Bio::Tools::Run::RemoteBlast->new(@params);
>>
>>  #delete $Bio::Tools::Run::RemoteBlast::HEADER{'FILTER'};
>>  my $v = 1;
>>
>>  my $str = Bio::SeqIO->new(-file=>'test.txt' , '-format' => 'fasta' );
>>  while (my $input = $str->next_seq()){
>>
>>    #my $r = $factory->submit_blast($input);
>>    my $r = $factory->submit_blast('test.txt');
>>    print STDERR "waiting..." if( $v > 0 );
>>    while ( my @rids = $factory->each_rid ) {
>>      foreach my $rid ( @rids ) {
>>        my $rc = $factory->retrieve_blast($rid);
>>        if( !ref($rc) ) {
>>          if( $rc < 0 ) {
>>            $factory->remove_rid($rid);
>>          }
>>          print STDERR "." if ( $v > 0 );
>>          sleep 5;
>>        } else {
>>          my $result = $rc->next_result();
>>          #save the output
>>          my $filename = $result->query_name()."\.out";
>>          $factory->save_output($filename);
>>          $factory->remove_rid($rid);
>>          print "\nQuery Name: ", $result->query_name(), "\n";
>>          while ( my $hit = $result->next_hit ) {
>>            next unless ( $v > 0);
>>            print "\thit name is ", $hit->name, "\n";
>>            while( my $hsp = $hit->next_hsp ) {
>>              print "\t\tscore is ", $hsp->score, "\n";
>>            }
>>          }
>>        }
>>      }
>>    }
>>  }
>>
>>-------------------- WARNING ---------------------
>>MSG: <p><!--
>>QBlastInfoBegin
>>        Status=READY
>>QBlastInfoEnd
>>--><p>
>><b><FONT color="red">INFO: [blastsrv4.REAL]: Error: Segmentation violation
>>(or m
>>emory usage limit was exceeded) SIGSEGV (11).
>></FONT></B><p><HR><p>
>><PRE>
>>BLASTN 2.2.6 [Apr-09-2003]
>>
>><pre>RID: 1064815728-21972-271666.BLASTQ3
>>Query= Test
>>         (560 letters)
>>
>><b>No significant similarity found.</b> For reasons why, <A HREF =
>>"/blast/blast
>>_FAQs.html#no hits"><b>click here</A>.</b><br><br>
>>
>>
>>---------------------------------------------------
>>
>>
>>
>>
>>-----Original Message-----
>>From:	Jason Stajich [mailto:jason@cgt.duhs.duke.edu]
>>Sent:	Mon 9/29/2003 9:32 AM
>>To:	010381P GOH PHUAY CHENG
>>Cc:
>>Subject:	Re: problem connecting to remote blast
>>
>>I suppose it might be because you specified an entrez query for homo
>>sapiens on the ecoli db - that is not going to return any hits.  remove
>>the entrez query part perhaps.
>>
>>please post your questions to the bioperl list in the future -
>>bioperl-l@bioperl.org
>>
>>Thanks,
>>-jason
>>
>>On Mon, 29 Sep 2003, 010381P GOH PHUAY CHENG wrote:
>>
>>    
>>
>>>hi Jason,
>>>i encountered some problems running the remote blast.could u please check
>>>      
>>>
>>for
>>    
>>
>>>me wats wrong with my codes. Thanks.
>>>
>>>phuay cheng
>>>
>>>-------------------- WARNING ---------------------
>>>MSG: <p><!--
>>>QBlastInfoBegin
>>>        Status=READY
>>>QBlastInfoEnd
>>>--><p>
>>><b><FONT color="red">INFO: [blastsrv4.REAL]: Error: Segmentation violation
>>>(or m
>>>emory usage limit was exceeded) SIGSEGV (11).
>>></FONT></B><p><HR><p>
>>><PRE>
>>>BLASTN 2.2.6 [Apr-09-2003]
>>>
>>><pre>RID: 1064796976-1676-2574126.BLASTQ3
>>>Query= Test
>>>         (560 letters)
>>>
>>><b>No significant similarity found.</b> For reasons why, <A HREF =
>>>"/blast/blast
>>>_FAQs.html#no hits"><b>click here</A>.</b><br><br>
>>>
>>>
>>>---------------------------------------------------
>>>
>>>#!/usr/bin/perl -w
>>>use Bio::SeqIO;
>>>use Bio::Tools::Run::RemoteBlast;
>>>  use strict;
>>>  my $prog = 'blastn';
>>>  my $db   = 'ecoli.nt';
>>>  #my $e_val= '1e-10';
>>>  my @params = ( '-prog' => $prog,
>>>         '-data' => $db,
>>>         '-readmethod' => 'SearchIO' );
>>>  my $factory = Bio::Tools::Run::RemoteBlast->new(@params);
>>>
>>>  $Bio::Tools::Run::RemoteBlast::HEADER{'ENTREZ_QUERY'} = 'Homo sapiens
>>>[ORGN]';
>>>
>>>  delete $Bio::Tools::Run::RemoteBlast::HEADER{'FILTER'};
>>>  my $v = 1;
>>>
>>>  my $str = Bio::SeqIO->new(-file=>'test.txt' , '-format' => 'fasta' );
>>>  while (my $input = $str->next_seq()){
>>>
>>>    #my $r = $factory->submit_blast($input);
>>>    my $r = $factory->submit_blast('test.txt');
>>>    print STDERR "waiting..." if( $v > 0 );
>>>    while ( my @rids = $factory->each_rid ) {
>>>      foreach my $rid ( @rids ) {
>>>        my $rc = $factory->retrieve_blast($rid);
>>>        if( !ref($rc) ) {
>>>          if( $rc < 0 ) {
>>>            $factory->remove_rid($rid);
>>>          }
>>>          print STDERR "." if ( $v > 0 );
>>>          sleep 5;
>>>        } else {
>>>          my $result = $rc->next_result();
>>>          #save the output
>>>          my $filename = $result->query_name()."\.out";
>>>          $factory->save_output($filename);
>>>          $factory->remove_rid($rid);
>>>          print "\nQuery Name: ", $result->query_name(), "\n";
>>>          while ( my $hit = $result->next_hit ) {
>>>            next unless ( $v > 0);
>>>            print "\thit name is ", $hit->name, "\n";
>>>            while( my $hsp = $hit->next_hsp ) {
>>>              print "\t\tscore is ", $hsp->score, "\n";
>>>            }
>>>          }
>>>        }
>>>      }
>>>    }
>>>  }
>>>
>>>      
>>>
>>--
>>Jason Stajich
>>Duke University
>>jason at cgt.mc.duke.edu
>>
>>
>>
>>
>>
>>    
>>
>
>--
>Jason Stajich
>Duke University
>jason at cgt.mc.duke.edu
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>  
>

-- 
Stefan Kirov, Ph.D.
University of Tennessee/Oak Ridge National Laboratory
1060 Commerce Park, Oak Ridge
TN 37830-8026
USA
tel +865 576 5120
fax +865 241 1965
e-mail: skirov@utk.edu
sao@ornl.gov


From scamargo at inf.ufrgs.br  Thu Oct  2 12:45:03 2003
From: scamargo at inf.ufrgs.br (Sandro da Silva Camargo)
Date: Thu Oct  2 12:45:25 2003
Subject: [Bioperl-l] Remote Blast
Message-ID: <1065113103.3f7c560fbc1a1@webmail.inf.ufrgs.br>

Hi,

I am trying to connect to the remote blast, but i got the following
error. 

waiting...Error in tempfile() using /tmp/XXXXXXXXXX: Have exceeded the maximum
number of attempts (10) to open temp file/dir at
/usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/Root/IO.pm line 659

Can somebody help me?

Thanks in advance.

Sandro

Sandro da Silva Camargo
Instituto de Informatica - Universidade Federal do Rio Grande do Sul
E-mail: scamargo@inf.ufrgs.br
URL: www.inf.ufrgs.br/~scamargo

----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.
From jason at cgt.duhs.duke.edu  Thu Oct  2 12:51:19 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Thu Oct  2 12:49:23 2003
Subject: [Bioperl-l] help (fwd)
Message-ID: <Pine.LNX.4.50.0310021251050.20483-100000@tenero.duhs.duke.edu>


---------- Forwarded message ----------
Date: Thu, 02 Oct 2003 18:47:46 +0200
From: "[iso-8859-1] Antonio Ramos Fern?ndez" <tniram@hotmail.com>
Subject: help

Help ! Help please !

Sorry about this question (it could seem some silly)  but I have some doubts
about bioperl installation and I don?t know anybody who could help me (I?ve
read the installation info at README, INSTALL, INSTALL_WIN, and bptutorial
core documentation files, but i think it?s not clear enough for me). I have
downloaded the zip-compressed 1.2.3 bioperl release, and my Win98 has the
5.8.0 Activestate Perl binary distribution installed. I have unzipped the
file and the directory tree is intact. I guess I have to move the ?Bio?
folder to c:\perl\lib (I?ve read in INSTALL_WIN that the other stuff is
useful but not recommended to install). To install it, I understand I have
to execute the makefile.pl script, but if i move it out its original folder
it gives error messages about ?tests?. If I move the full content of the
zip.file (decompressed) ot the c:\perl\lib folder makefile.pl runs and
prints info about external module requirements for some of the bioperl
modules (and does not seem to do anythhing else). It also prints:

Enjoy the rest of bioperl, which you can use after going 'make install'
Writing Makefile for Bio

I dont? understand the steps I have to follow to install bioperl. MS-DOS
does not recognise these commands (I refer to the three steps make- make
install-make tests) and makefile.pl does not run without other files in the
zip ?root directory?. I don?t know what I have to do exactly with the zip
content. I don?t remember all the command combinations and folders I?ve
tested, but I?ve noticed I don?t know what the hell I?m doing. I think three
new files have appeared in the perl\lib folder after running makefile.pl:
?bioperl.conf?,  ?makefile? and ?&1? (have I to erase them to try again?
-I?ve removed the bio folder from perl\lib after getting tired of all this).
Perl only notices the presence o specific modules when I include in my
scripts the statement [ use lib ?c:\perl\lib\bio\module_name?;   ], but I
don?t think this is the best way to use the modules. Sorry, I started with
perl some months ago and I?m not a skilled programmer (and I have never been
the best friend of my computer). Buff!!! I know I need a makefile utility
(?makefile.pl?), but I don?t know what exactly it is and how does it work.
Thanks a lot if you can help me. Sorry for possible disturbances/time
wasted/economical losses/etc etc. Long life for bioperl.

NOTE: I?ve noticed that modules generally aren?t bigger than 100Kb. Should I
search for and install additional modules using the activestate perl package
manager? (I haven?t done it for bioperl because of its size. I have a 56K
modem and sometimes connection is interrupted by phone calls, so I used a
download manager program called Go!zilla).

Antonio Ramos Fern?ndez, Girona, Spain.

_________________________________________________________________
Localiza y ponte en contacto con tus antiguos compa?eros de clase en MSN
Compa?eros.  http://mipasado.msn.es/

From birney at ebi.ac.uk  Thu Oct  2 13:11:56 2003
From: birney at ebi.ac.uk (Ewan Birney)
Date: Thu Oct  2 13:09:51 2003
Subject: [Bioperl-l] Remote Blast
In-Reply-To: <1065113103.3f7c560fbc1a1@webmail.inf.ufrgs.br>
Message-ID: <Pine.LNX.4.44.0310021811340.8175-100000@pigeon.ebi.ac.uk>

On Thu, 2 Oct 2003, Sandro da Silva Camargo wrote:

> Hi,
> 
> I am trying to connect to the remote blast, but i got the following
> error. 
> 
> waiting...Error in tempfile() using /tmp/XXXXXXXXXX: Have exceeded the maximum
> number of attempts (10) to open temp file/dir at
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/Root/IO.pm line 659
> 
> Can somebody help me?

Which version of bioperl are you using - I think this bug has been 
squashed in 1.2.2 onwards


> 
> Thanks in advance.
> 
> Sandro
> 
> Sandro da Silva Camargo
> Instituto de Informatica - Universidade Federal do Rio Grande do Sul
> E-mail: scamargo@inf.ufrgs.br
> URL: www.inf.ufrgs.br/~scamargo
> 
> ----------------------------------------------------------------
> This message was sent using IMP, the Internet Messaging Program.
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

-----------------------------------------------------------------
Ewan Birney. Mobile: +44 (0)7970 151230, Work: +44 1223 494420
<birney@ebi.ac.uk>. 
-----------------------------------------------------------------

From birney at ebi.ac.uk  Thu Oct  2 13:12:46 2003
From: birney at ebi.ac.uk (Ewan Birney)
Date: Thu Oct  2 13:10:41 2003
Subject: [Bioperl-l] Remote Blast
In-Reply-To: <1065113103.3f7c560fbc1a1@webmail.inf.ufrgs.br>
Message-ID: <Pine.LNX.4.44.0310021812350.8175-100000@pigeon.ebi.ac.uk>

On Thu, 2 Oct 2003, Sandro da Silva Camargo wrote:

> Hi,
> 
> I am trying to connect to the remote blast, but i got the following
> error. 
> 
> waiting...Error in tempfile() using /tmp/XXXXXXXXXX: Have exceeded the maximum
> number of attempts (10) to open temp file/dir at
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/Root/IO.pm line 659

Also - what system are you running on?


> 
> Can somebody help me?
> 
> Thanks in advance.
> 
> Sandro
> 
> Sandro da Silva Camargo
> Instituto de Informatica - Universidade Federal do Rio Grande do Sul
> E-mail: scamargo@inf.ufrgs.br
> URL: www.inf.ufrgs.br/~scamargo
> 
> ----------------------------------------------------------------
> This message was sent using IMP, the Internet Messaging Program.
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

-----------------------------------------------------------------
Ewan Birney. Mobile: +44 (0)7970 151230, Work: +44 1223 494420
<birney@ebi.ac.uk>. 
-----------------------------------------------------------------

From skirov at utk.edu  Thu Oct  2 13:37:00 2003
From: skirov at utk.edu (Stefan Kirov)
Date: Thu Oct  2 13:35:12 2003
Subject: [Bioperl-l] help (fwd)
In-Reply-To: <Pine.LNX.4.50.0310021251050.20483-100000@tenero.duhs.duke.edu>
References: <Pine.LNX.4.50.0310021251050.20483-100000@tenero.duhs.duke.edu>
Message-ID: <3F7C623C.9000508@utk.edu>

Use the ActiveState Perl PPM3:
(MS-DOS console):

C:\ppm3

PPM>repository add Bioperl http://bioperl.org/DIST/

PPM>search bioperl

PPM>install <bioperl package number>

(for example install 1).
This should be in the documentation somewhere I believe- read carefully 
again.

Jason Stajich wrote:

>---------- Forwarded message ----------
>Date: Thu, 02 Oct 2003 18:47:46 +0200
>From: "[iso-8859-1] Antonio Ramos Fern?ndez" <tniram@hotmail.com>
>Subject: help
>
>Help ! Help please !
>
>Sorry about this question (it could seem some silly)  but I have some doubts
>about bioperl installation and I don?t know anybody who could help me (I?ve
>read the installation info at README, INSTALL, INSTALL_WIN, and bptutorial
>core documentation files, but i think it?s not clear enough for me). I have
>downloaded the zip-compressed 1.2.3 bioperl release, and my Win98 has the
>5.8.0 Activestate Perl binary distribution installed. I have unzipped the
>file and the directory tree is intact. I guess I have to move the ?Bio?
>folder to c:\perl\lib (I?ve read in INSTALL_WIN that the other stuff is
>useful but not recommended to install). To install it, I understand I have
>to execute the makefile.pl script, but if i move it out its original folder
>it gives error messages about ?tests?. If I move the full content of the
>zip.file (decompressed) ot the c:\perl\lib folder makefile.pl runs and
>prints info about external module requirements for some of the bioperl
>modules (and does not seem to do anythhing else). It also prints:
>
>Enjoy the rest of bioperl, which you can use after going 'make install'
>Writing Makefile for Bio
>
>I dont? understand the steps I have to follow to install bioperl. MS-DOS
>does not recognise these commands (I refer to the three steps make- make
>install-make tests) and makefile.pl does not run without other files in the
>zip ?root directory?. I don?t know what I have to do exactly with the zip
>content. I don?t remember all the command combinations and folders I?ve
>tested, but I?ve noticed I don?t know what the hell I?m doing. I think three
>new files have appeared in the perl\lib folder after running makefile.pl:
>?bioperl.conf?,  ?makefile? and ?&1? (have I to erase them to try again?
>-I?ve removed the bio folder from perl\lib after getting tired of all this).
>Perl only notices the presence o specific modules when I include in my
>scripts the statement [ use lib ?c:\perl\lib\bio\module_name?;   ], but I
>don?t think this is the best way to use the modules. Sorry, I started with
>perl some months ago and I?m not a skilled programmer (and I have never been
>the best friend of my computer). Buff!!! I know I need a makefile utility
>(?makefile.pl?), but I don?t know what exactly it is and how does it work.
>Thanks a lot if you can help me. Sorry for possible disturbances/time
>wasted/economical losses/etc etc. Long life for bioperl.
>
>NOTE: I?ve noticed that modules generally aren?t bigger than 100Kb. Should I
>search for and install additional modules using the activestate perl package
>manager? (I haven?t done it for bioperl because of its size. I have a 56K
>modem and sometimes connection is interrupted by phone calls, so I used a
>download manager program called Go!zilla).
>
>Antonio Ramos Fern?ndez, Girona, Spain.
>
>_________________________________________________________________
>Localiza y ponte en contacto con tus antiguos compa?eros de clase en MSN
>Compa?eros.  http://mipasado.msn.es/
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>  
>

-- 
Stefan Kirov, Ph.D.
University of Tennessee/Oak Ridge National Laboratory
1060 Commerce Park, Oak Ridge
TN 37830-8026
USA
tel +865 576 5120
fax +865 241 1965
e-mail: skirov@utk.edu
sao@ornl.gov


From bmadsen at mathworks.com  Thu Oct  2 14:56:40 2003
From: bmadsen at mathworks.com (Brian Madsen)
Date: Thu Oct  2 14:54:31 2003
Subject: [Bioperl-l] Installation Problems
Message-ID: <DB873318D1A41648BEAC4B5AAB63B3E101017FFD@MESSAGE-AH.ad.mathworks.com>

Thanks for the help. I moved everything to home/bmadsen/bioperl and was
able to set the variable by adding:
setenv PERL5LIB '$HOME/bmadsen/bioperl'
to .login and:
DTSOURCEPROFILE=true
to .dtprofile

-----Original Message-----
From: Heikki Lehvaslaiho [mailto:heikki@ebi.ac.uk] 
Sent: Thursday, October 02, 2003 6:01 AM
To: Brian Madsen
Cc: Bioperl
Subject: Re: [Bioperl-l] Installation Problems

Brian,

Have you set up environmental variable PERL5LIB? 

Here is a command line example (which is best put into shell resource
file):

   bash: % export PERL5LIB='$HOME/src/bioperl'
   tcsh: % setenv PERL5LIB '$HOME/src/bioperl'

Using ':' as a separator you can add more than one directory.

In your case for perl to find IO::String, PERL5LIB should contain
'/home/bmadsen/bjm_perl' and in that directory you should have directory
'IO' with file 'String.pm'

Hope this helps,

	-Heikki


On Wed, 2003-10-01 at 23:25, Brian Madsen wrote:
> I'm new to Bioperl and am trying to complete an installation. The
> company has a Solaris server, and I don't have write privileges to the
> perl5/site_perl source area, so I followed the instructions to install
> to a personal module area.
> In addition to the bioperl-1.2.3 tar file, I also ran installations
for
> bioperl-ext-06, Bundle-BioPerl-2.05 and a separate installation for
> IO-String-1.02. In every case, I used: PREFIX=/home/bmadsen/bjm_perl
as
> a parameter in building the makefile, and all of the installations
were
> successful.
> Still, when I run the bptutorial.pl file, I get the error: "Can't
locate
> IO/String.pm in @INC (@INC contains: ..." followed by a half-dozen
> notices about failed compilation. I've even tried copy the String.pm
> file into the bioperl-1.2.3 directory, but it doesn't help.
> I have already printed bptutorial.pl and am reading through it, but I
> had hoped to run it interactively. Any advice?
> Thanks
> -Brian
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________


From brian_osborne at cognia.com  Thu Oct  2 15:29:14 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Thu Oct  2 15:31:18 2003
Subject: [Bioperl-l] Bioperl PPM for 1.2.3
Message-ID: <GAEDKMGOKFBLJPKCLKCCGELDDCAA.brian_osborne@cognia.com>

bioperl-l,

>Cool. bioperl-1.2.3-ppm.tar.gz tested for installation on perl 5.6 and 5.8
>from the bioperl.org site. The PPM's are functional and everything (almost)
>seems to works.

>Nigam Shah

That one failed test is probably due to a missing module.

Thanks very much Nigam!


Brian O.


From ng at cs.usask.ca  Thu Oct  2 15:50:53 2003
From: ng at cs.usask.ca (Ambrose Ng)
Date: Thu Oct  2 22:07:51 2003
Subject: [Bioperl-l] something wrong with my code?
Message-ID: <200310021350.53962.ng@cs.usask.ca>

my code here:
		
my @s_ind = $hsp->seq_inds('hit', 'id', 1);
print "conserved=", my $s_ind = join (" ",@s_ind), "\n";

reports this:

conserved=7896-8225 8249-8949


but the numbers are different from the blast report.

Is it more likely to do with my code or is there something going on with the 
bioperl module compatibility with this blast report that I'm 
misunderstanding?

Ambrose

-------------- next part --------------
Query=  gnl|UG|Os#S16394561 Oryza sativa (japonica cultivar-group) cDNA clone:00
2-141-A12, full insert sequence /gb=AK108259 /gi=32993468 /ug=Os.4 /len=1031
        (1031 letters)

Database:  /fdf/coe05/gm0/0/rice_Chr1f.dna
           426 sequences; 63,195,892 total letters.
Searching.......................................................done.

                                                               Score     E
Sequences producing significant alignments:                    (bits)  Value

o_sativa|chr_1|OJ1116_H09|6985                                  1933  0.0

>o_sativa|chr_1|OJ1116_H09|6985
            Length = 110086

 Score = 1933 bits (975), Expect = 0.0
 Identities = 1031/1124 (91%)
 Strand = Plus / Plus


Query: 1    aagctagctcgcgcgcaaattaatcgccgatcgatcctacgatgaagcggaagcggccgg 60
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 7826 aagctagctcgcgcgcaaattaatcgccgatcgatcctacgatgaagcggaagcggccgg 7885


Query: 61   cggcgctgcgcggcggggaggaggcggcggcggcggcgctgaagcgtgggccgtggacgc 120
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 7886 cggcgctgcgcggcggggaggaggcggcggcggcggcgctgaagcgtgggccgtggacgc 7945


Query: 121  ccgaggaggacgaggtgctggcgcggttcgtggcgcgggaggggtgcgaccggtggcgca 180
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 7946 ccgaggaggacgaggtgctggcgcggttcgtggcgcgggaggggtgcgaccggtggcgca 8005


Query: 181  cgctgccgcggcgcgcgggcctgctgcgctgcggcaagagctgccgcctccggtggatga 240
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8006 cgctgccgcggcgcgcgggcctgctgcgctgcggcaagagctgccgcctccggtggatga 8065


Query: 241  actacctccgccccgacatcaagcgctgccccatcgccgacgacgaggaggacctcatcc 300
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8066 actacctccgccccgacatcaagcgctgccccatcgccgacgacgaggaggacctcatcc 8125


Query: 301  tccgcctccaccgcctcctcggcaaccggt---------------------------ggt 333
            ||||||||||||||||||||||||||||||                           |||
Sbjct: 8126 tccgcctccaccgcctcctcggcaaccggtaagca<--[8161:8243]-->taggtggt 8251


Query: 334  cgctgatcgccgggaggttgccggggcgcacggacaacgagatcaagaactactggaact 393
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8252 cgctgatcgccgggaggttgccggggcgcacggacaacgagatcaagaactactggaact 8311


Query: 394  cgcatctcagcaagaagctcatcgcgcagggcatcgacccgcggacgcacaagccgctga 453
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8312 cgcatctcagcaagaagctcatcgcgcagggcatcgacccgcggacgcacaagccgctga 8371


Query: 454  cggccgccgccgatcactccaacgccgccgctgccgtcgccgccacttcttacaagaagg 513
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8372 cggccgccgccgatcactccaacgccgccgctgccgtcgccgccacttcttacaagaagg 8431


Query: 514  cggtgccggccaagccgccgaggacggcatcctcgccggccgctggcattgagtgcagcg 573
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8432 cggtgccggccaagccgccgaggacggcatcctcgccggccgctggcattgagtgcagcg 8491


Query: 574  acgatcgtgcccgaccggccgacggtggcggtgacttcgcagcgatggtgagcgccgccg 633
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8492 acgatcgtgcccgaccggccgacggtggcggtgacttcgcagcgatggtgagcgccgccg 8551


Query: 634  atgccgagggattcgaaggaggatttggcgatcagttctgtgccgaggatgcagttcatg 693
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8552 atgccgagggattcgaaggaggatttggcgatcagttctgtgccgaggatgcagttcatg 8611


Query: 694  gtggcttcgacatgggttccgcttccgccatggtgggtgacgacgacttctcctcgtttc 753
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8612 gtggcttcgacatgggttccgcttccgccatggtgggtgacgacgacttctcctcgtttc 8671


Query: 754  ttgattctctgatcaacgacgagcagttaggcgatctcttcgtcgtcgagggcaacgatc 813
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8672 ttgattctctgatcaacgacgagcagttaggcgatctcttcgtcgtcgagggcaacgatc 8731


Query: 814  acgagcatggcaatggtgagattggtcatggagacgtcatggaatccaaacagtaatctt 873
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8732 acgagcatggcaatggtgagattggtcatggagacgtcatggaatccaaacagtaatctt 8791


Query: 874  tcgggagattgatcagggaataagttgaccatgagaaaacatgtaagaacagcttgtctg 933
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8792 tcgggagattgatcagggaataagttgaccatgagaaaacatgtaagaacagcttgtctg 8851


Query: 934  ctgagtcatgaccggtgatgtgtatgtgatggaagaaaacgtatgtacgacttaatgctt 993
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 8852 ctgagtcatgaccggtgatgtgtatgtgatggaagaaaacgtatgtacgacttaatgctt 8911


Query: 994  gcacctatactactgctgtgtgcattattcgagtgatc 1031
            ||||||||||||||||||||||||||||||||||||||
Sbjct: 8912 gcacctatactactgctgtgtgcattattcgagtgatc 8949
From jason at cgt.duhs.duke.edu  Fri Oct  3 09:42:31 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Oct  3 09:40:37 2003
Subject: [Bioperl-l] something wrong with my code?
In-Reply-To: <200310021350.53962.ng@cs.usask.ca>
References: <200310021350.53962.ng@cs.usask.ca>
Message-ID: <Pine.LNX.4.50.0310030939130.28471-100000@tenero.duhs.duke.edu>

Looks like a bug alright, but it has to do with this part of the
alignment in your report.

Query: 301  tccgcctccaccgcctcctcggcaaccggt---------------------------ggt 333
            ||||||||||||||||||||||||||||||                           |||
Sbjct: 8126 tccgcctccaccgcctcctcggcaaccggtaagca<--[8161:8243]-->taggtggt 8251

We currently can't parse that an turn it into anything appropriate - does
it mean there is a gap there or should those bases be 'N's?

Can you report it to http://bugzilla.open-bio.org for the time being.

-jason
On Thu, 2 Oct 2003, Ambrose Ng wrote:

> my code here:
>
> my @s_ind = $hsp->seq_inds('hit', 'id', 1);
> print "conserved=", my $s_ind = join (" ",@s_ind), "\n";
>
> reports this:
>
> conserved=7896-8225 8249-8949
>
>
> but the numbers are different from the blast report.
>
> Is it more likely to do with my code or is there something going on with the
> bioperl module compatibility with this blast report that I'm
> misunderstanding?
>
> Ambrose
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From scamargo at inf.ufrgs.br  Fri Oct  3 12:25:27 2003
From: scamargo at inf.ufrgs.br (Sandro da Silva Camargo)
Date: Fri Oct  3 12:25:03 2003
Subject: [Bioperl-l] Remote Blast
Message-ID: <1065198327.3f7da2f7d030c@webmail.inf.ufrgs.br>

Hi,

I am using:

Solaris 8
ActivePerl 5.8
BioPerl 1.2.3

Thanks,

Sandro


----- Mensagem encaminhada de Ewan Birney <birney@ebi.ac.uk> -----
    Data: Thu, 2 Oct 2003 18:12:46 +0100 (BST)
      De: Ewan Birney <birney@ebi.ac.uk>
Reponder para: Ewan Birney <birney@ebi.ac.uk>
 Assunto: Re: [Bioperl-l] Remote Blast
    Para: Sandro da Silva Camargo <scamargo@inf.ufrgs.br>

On Thu, 2 Oct 2003, Sandro da Silva Camargo wrote:

> Hi,
> 
> I am trying to connect to the remote blast, but i got the following
> error. 
> 
> waiting...Error in tempfile() using /tmp/XXXXXXXXXX: Have exceeded the
maximum
> number of attempts (10) to open temp file/dir at
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/Root/IO.pm line 659

Also - what system are you running on?


> 
> Can somebody help me?
> 
> Thanks in advance.
> 
> Sandro
> 
> Sandro da Silva Camargo
> Instituto de Informatica - Universidade Federal do Rio Grande do Sul
> E-mail: scamargo@inf.ufrgs.br
> URL: www.inf.ufrgs.br/~scamargo
> 
> ----------------------------------------------------------------
> This message was sent using IMP, the Internet Messaging Program.
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

-----------------------------------------------------------------
Ewan Birney. Mobile: +44 (0)7970 151230, Work: +44 1223 494420
<birney@ebi.ac.uk>. 
-----------------------------------------------------------------

----- Finalizar mensagem encaminhada -----


Sandro da Silva Camargo
Instituto de Informatica - Universidade Federal do Rio Grande do Sul
Porto Alegre - RS - Brasil
E-mail: scamargo@inf.ufrgs.br
URL: www.inf.ufrgs.br/~scamargo

----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.
From lstein at cshl.edu  Fri Oct  3 12:27:37 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Fri Oct  3 12:26:20 2003
Subject: [Bioperl-l] Bio::Graphics output
In-Reply-To: <200309122139.47541.nel@birc.dk>
References: <BEE28BF86078B6429D6C780635718E212E7395@morelia.be.devgen.com>
	<200309122139.47541.nel@birc.dk>
Message-ID: <200310031227.37523.lstein@cshl.edu>

Then you have to generate a temporary file and point at it.

Lincoln

On Friday 12 September 2003 03:39 pm, Niels Larsen wrote:
> > You have to output the image to a file and make an html img tag pointing
> > to that file. HTH,
> > Marc
>
> Or have a script make it while the page is loaded,
>
> <img src="make_image.cgi">
>
> That works good for few large images; many mall ones would
> slow the server.
>
> Niels L
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
========================================================================
Lincoln D. Stein                           Cold Spring Harbor Laboratory
lstein@cshl.org			                  Cold Spring Harbor, NY
========================================================================


From jnolan at tulane.edu  Fri Oct  3 17:06:02 2003
From: jnolan at tulane.edu (Jim Nolan)
Date: Fri Oct  3 17:05:20 2003
Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
Message-ID: <a05210601bba3888a3e88@[129.81.38.94]>

I have been unable to install bioperl versions later than 1.2.1 on 
Mac OS X. The first error I get is during 'make test':
"The extension 'Bio::SeqIO::staden::read' is not properly installed in path:
   '/Library/Perl'
If this is a CPAN/distributed module, you may need to reinstall it on your
system.
To allow Inline to compile the module in a temporary cache, simply remove the
Inline config option 'VERSION=' from the Bio::SeqIO::staden::read module.
  at t/FootPrinter.t line 0"
and numerous other tests.

So I attempted to install bioperl-ext, which meant instaling the 
Staden io_lib, though I don't use Staden. This had the difficulty of 
errors caused by the locations of os.h and config.h 
(http://portal.open-bio.org/mailman/htdig/bioperl-l/2003-June/012472.html). 
I proceeded with the bioperl-ext install, using the directions at 
http://www.tc.umn.edu/~cann0010/Bioperl_OSX_install.html#bioperlxs
(which worked great until recently) I did the -fPIC edit, I did the 
ranlib, but on make test for bioper-ext, I get:
/usr/bin/perl -I/System/Library/Perl/darwin -I/System/Library/Perl 
/System/Library/Perl/ExtUtils/xsubpp  -typemap 
/System/Library/Perl/ExtUtils/typemap  read.xs > read.xsc && mv 
read.xsc read.c
cc -c -I/Users/jnolan/Desktop/download/bioperl-ext/Bio/SeqIO/staden 
-I/usr/local/include/io_lib -g -pipe -pipe -fno-common 
-no-cpp-precomp -flat_namespace -DHAS_TELLDIR_PROTOTYPE 
-fno-strict-aliasing -Os     -DVERSION=\"0.01\" -DXS_VERSION=\"0.01\" 
-I/System/Library/Perl/darwin/CORE  read.c
In file included from /usr/local/include/io_lib/os.h:4,
                  from /usr/local/include/io_lib/Read.h:43,
                  from read.xs:5:
/usr/local/include/io_lib/config.h:45:1: warning: "VERSION" redefined
read.c:1:1: warning: this is the location of the previous definition
In file included from /usr/local/include/io_lib/translate.h:20,
                  from /usr/local/include/io_lib/Read.h:227,
                  from read.xs:5:
/usr/local/include/io_lib/expFileIO.h:42: parse error before "Exp_entries"
/usr/local/include/io_lib/expFileIO.h:42: warning: data definition 
has no type o"
and lots of similar warnings.

So it seems like there is still a problem with these libraries, 
though I copied them where the compiler seems to be looking for them.

Can someone point me in the right direction?


It would be great if there was a way to get rid of the dependency on 
installing bioperl-ext to get bioperl to work. Even if the solution 
for my particular problem is simple, the need to install additional 
layers of dependencies takes a lot of the fun out of it for us lame 
biological types ;-)
.

Thanks.
Jim
-- 
---------------------------------------------------------
Jim Nolan			email:JNolan@tulane.edu
Department of Biochemistry SL43
Tulane University Health Sciences Center
1430 Tulane Ave.		Phone: (504)-584-2453
New Orleans, LA 70112-2699	FAX:   (504)-584-2739
http://www.tulane.edu/~biochem/faculty/nolan.htm
http://phage.bioc.tulane.edu
The requirements said: Windows 2000 or better.
So I got a Macintosh.
From jason at cgt.duhs.duke.edu  Fri Oct  3 17:40:27 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Oct  3 17:39:07 2003
Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
In-Reply-To: <a05210601bba3888a3e88@[129.81.38.94]>
References: <a05210601bba3888a3e88@[129.81.38.94]>
Message-ID: <Pine.LNX.4.50.0310031730260.28471-100000@tenero.duhs.duke.edu>

And you've tried with bioperl 1.2.3?

But I'm a little confused -

 a) you do not need bioperl-ext unless you want to use the specialized
    stuff in there (c-extensions to bioperl modules).

 b) What test is giving you the warnings about IO::Staden - I can
    find no reference to it in any of the tests.  Very weird error
    t/FootPrinter.t has nothing to do with Staden.

We have tried very hard (or at least I advocate it very strongly) not to
have a lot of dependancies, I think we are using a bare minimum of them,
IO::String being one of the most essential. Lincoln even wrote
his own socket based HTTP access to begin to remove LWP as a dependancy.

What people need to remember is when it warns you about extra modules when
you run perl Makefile.PL you don't HAVE To install all of those things,
only those which are dependancies for functionality you want.


-jason
On Fri, 3 Oct 2003, Jim Nolan wrote:

> I have been unable to install bioperl versions later than 1.2.1 on
> Mac OS X. The first error I get is during 'make test':
> "The extension 'Bio::SeqIO::staden::read' is not properly installed in path:
>    '/Library/Perl'
> If this is a CPAN/distributed module, you may need to reinstall it on your
> system.
> To allow Inline to compile the module in a temporary cache, simply remove the
> Inline config option 'VERSION=' from the Bio::SeqIO::staden::read module.
>   at t/FootPrinter.t line 0"
> and numerous other tests.
>
> So I attempted to install bioperl-ext, which meant instaling the
> Staden io_lib, though I don't use Staden. This had the difficulty of
> errors caused by the locations of os.h and config.h
> (http://portal.open-bio.org/mailman/htdig/bioperl-l/2003-June/012472.html).
> I proceeded with the bioperl-ext install, using the directions at
> http://www.tc.umn.edu/~cann0010/Bioperl_OSX_install.html#bioperlxs
> (which worked great until recently) I did the -fPIC edit, I did the
> ranlib, but on make test for bioper-ext, I get:
> /usr/bin/perl -I/System/Library/Perl/darwin -I/System/Library/Perl
> /System/Library/Perl/ExtUtils/xsubpp  -typemap
> /System/Library/Perl/ExtUtils/typemap  read.xs > read.xsc && mv
> read.xsc read.c
> cc -c -I/Users/jnolan/Desktop/download/bioperl-ext/Bio/SeqIO/staden
> -I/usr/local/include/io_lib -g -pipe -pipe -fno-common
> -no-cpp-precomp -flat_namespace -DHAS_TELLDIR_PROTOTYPE
> -fno-strict-aliasing -Os     -DVERSION=\"0.01\" -DXS_VERSION=\"0.01\"
> -I/System/Library/Perl/darwin/CORE  read.c
> In file included from /usr/local/include/io_lib/os.h:4,
>                   from /usr/local/include/io_lib/Read.h:43,
>                   from read.xs:5:
> /usr/local/include/io_lib/config.h:45:1: warning: "VERSION" redefined
> read.c:1:1: warning: this is the location of the previous definition
> In file included from /usr/local/include/io_lib/translate.h:20,
>                   from /usr/local/include/io_lib/Read.h:227,
>                   from read.xs:5:
> /usr/local/include/io_lib/expFileIO.h:42: parse error before "Exp_entries"
> /usr/local/include/io_lib/expFileIO.h:42: warning: data definition
> has no type o"
> and lots of similar warnings.
>
> So it seems like there is still a problem with these libraries,
> though I copied them where the compiler seems to be looking for them.
>
> Can someone point me in the right direction?
>
>
> It would be great if there was a way to get rid of the dependency on
> installing bioperl-ext to get bioperl to work. Even if the solution
> for my particular problem is simple, the need to install additional
> layers of dependencies takes a lot of the fun out of it for us lame
> biological types ;-)
> .
>
> Thanks.
> Jim
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From brian_osborne at cognia.com  Fri Oct  3 18:08:10 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Fri Oct  3 18:11:30 2003
Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
In-Reply-To: <a05210601bba3888a3e88@[129.81.38.94]>
Message-ID: <GAEDKMGOKFBLJPKCLKCCIEMHDCAA.brian_osborne@cognia.com>

Jim,

> It would be great if there was a way to get rid of the dependency on
installing bioperl-ext to get bioperl to work.

This is odd. I've done 'make install' many, many times but I've never
installed nor used the bioperl-ext library. Ever. Do you think you could
have done something unconventional to see the behavior you're seeing?

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Jim Nolan
Sent: Friday, October 03, 2003 5:06 PM
To: bioperl-l@portal.open-bio.org
Subject: [Bioperl-l] OS X bioperl, staden/read, install problems

I have been unable to install bioperl versions later than 1.2.1 on
Mac OS X. The first error I get is during 'make test':
"The extension 'Bio::SeqIO::staden::read' is not properly installed in path:
   '/Library/Perl'
If this is a CPAN/distributed module, you may need to reinstall it on your
system.
To allow Inline to compile the module in a temporary cache, simply remove
the
Inline config option 'VERSION=' from the Bio::SeqIO::staden::read module.
  at t/FootPrinter.t line 0"
and numerous other tests.

So I attempted to install bioperl-ext, which meant instaling the
Staden io_lib, though I don't use Staden. This had the difficulty of
errors caused by the locations of os.h and config.h
(http://portal.open-bio.org/mailman/htdig/bioperl-l/2003-June/012472.html).
I proceeded with the bioperl-ext install, using the directions at
http://www.tc.umn.edu/~cann0010/Bioperl_OSX_install.html#bioperlxs
(which worked great until recently) I did the -fPIC edit, I did the
ranlib, but on make test for bioper-ext, I get:
/usr/bin/perl -I/System/Library/Perl/darwin -I/System/Library/Perl
/System/Library/Perl/ExtUtils/xsubpp  -typemap
/System/Library/Perl/ExtUtils/typemap  read.xs > read.xsc && mv
read.xsc read.c
cc -c -I/Users/jnolan/Desktop/download/bioperl-ext/Bio/SeqIO/staden
-I/usr/local/include/io_lib -g -pipe -pipe -fno-common
-no-cpp-precomp -flat_namespace -DHAS_TELLDIR_PROTOTYPE
-fno-strict-aliasing -Os     -DVERSION=\"0.01\" -DXS_VERSION=\"0.01\"
-I/System/Library/Perl/darwin/CORE  read.c
In file included from /usr/local/include/io_lib/os.h:4,
                  from /usr/local/include/io_lib/Read.h:43,
                  from read.xs:5:
/usr/local/include/io_lib/config.h:45:1: warning: "VERSION" redefined
read.c:1:1: warning: this is the location of the previous definition
In file included from /usr/local/include/io_lib/translate.h:20,
                  from /usr/local/include/io_lib/Read.h:227,
                  from read.xs:5:
/usr/local/include/io_lib/expFileIO.h:42: parse error before "Exp_entries"
/usr/local/include/io_lib/expFileIO.h:42: warning: data definition
has no type o"
and lots of similar warnings.

So it seems like there is still a problem with these libraries,
though I copied them where the compiler seems to be looking for them.

Can someone point me in the right direction?


It would be great if there was a way to get rid of the dependency on
installing bioperl-ext to get bioperl to work. Even if the solution
for my particular problem is simple, the need to install additional
layers of dependencies takes a lot of the fun out of it for us lame
biological types ;-)
.

Thanks.
Jim
--
---------------------------------------------------------
Jim Nolan                       email:JNolan@tulane.edu
Department of Biochemistry SL43
Tulane University Health Sciences Center
1430 Tulane Ave.                Phone: (504)-584-2453
New Orleans, LA 70112-2699      FAX:   (504)-584-2739
http://www.tulane.edu/~biochem/faculty/nolan.htm
http://phage.bioc.tulane.edu
The requirements said: Windows 2000 or better.
So I got a Macintosh.
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From jnolan at tulane.edu  Fri Oct  3 18:54:29 2003
From: jnolan at tulane.edu (Jim Nolan)
Date: Fri Oct  3 18:52:41 2003
Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCIEMHDCAA.brian_osborne@cognia.com>
References: <GAEDKMGOKFBLJPKCLKCCIEMHDCAA.brian_osborne@cognia.com>
Message-ID: <a05210602bba3ab395f97@[129.81.38.94]>

I sent the full list of test errors to Jason, but I got 29 errors 
looking for Staden::read when I tested Bioperl 1.2.3.
These were the tests that asked for it:
BioDBGFF, FootPrin, Genewise, Genpred, Index, LocusLin, Mutator, 
Perl, RepeatMa, SeqAnaly, SeqBuild, SeqFeatC, SeqFeatu, SeqIO, 
Sequence, Similari, StandAlo, Swiss, Tools, Variatio, ePCR, game, 
largefas, multiple, phd, primaryq, qual, scf, splicedseq

I think I get even more for the CVS version.
I guess it must be something specific to my machine, since nobody's 
heard of this one. When I saw the Staden::read errors, then I figured 
I had to have bioperl-ext.
Sorry I whined, I thought that it was just a new dependency that 
wasn't as much of a hassle for most machines as it is for mine.

If anyone has any clues as to what I did wrong, I'd be glad to hear them.

Thanks
Jim

>Jim,
>
>>  It would be great if there was a way to get rid of the dependency on
>installing bioperl-ext to get bioperl to work.
>
>This is odd. I've done 'make install' many, many times but I've never
>installed nor used the bioperl-ext library. Ever. Do you think you could
>have done something unconventional to see the behavior you're seeing?
>
>Brian O.
>
>-----Original Message-----
>From: bioperl-l-bounces@portal.open-bio.org
>[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Jim Nolan
>Sent: Friday, October 03, 2003 5:06 PM
>To: bioperl-l@portal.open-bio.org
>Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
>
>I have been unable to install bioperl versions later than 1.2.1 on
>Mac OS X. The first error I get is during 'make test':
>"The extension 'Bio::SeqIO::staden::read' is not properly installed in path:
>    '/Library/Perl'
>If this is a CPAN/distributed module, you may need to reinstall it on your
>system.
>To allow Inline to compile the module in a temporary cache, simply remove
>the
>Inline config option 'VERSION=' from the Bio::SeqIO::staden::read module.
>   at t/FootPrinter.t line 0"
>and numerous other tests.
>
>So I attempted to install bioperl-ext, which meant instaling the
>Staden io_lib, though I don't use Staden. This had the difficulty of
>errors caused by the locations of os.h and config.h
>(http://portal.open-bio.org/mailman/htdig/bioperl-l/2003-June/012472.html).
>I proceeded with the bioperl-ext install, using the directions at
>http://www.tc.umn.edu/~cann0010/Bioperl_OSX_install.html#bioperlxs
>(which worked great until recently) I did the -fPIC edit, I did the
>ranlib, but on make test for bioper-ext, I get:
>/usr/bin/perl -I/System/Library/Perl/darwin -I/System/Library/Perl
>/System/Library/Perl/ExtUtils/xsubpp  -typemap
>/System/Library/Perl/ExtUtils/typemap  read.xs > read.xsc && mv
>read.xsc read.c
>cc -c -I/Users/jnolan/Desktop/download/bioperl-ext/Bio/SeqIO/staden
>-I/usr/local/include/io_lib -g -pipe -pipe -fno-common
>-no-cpp-precomp -flat_namespace -DHAS_TELLDIR_PROTOTYPE
>-fno-strict-aliasing -Os     -DVERSION=\"0.01\" -DXS_VERSION=\"0.01\"
>-I/System/Library/Perl/darwin/CORE  read.c
>In file included from /usr/local/include/io_lib/os.h:4,
>                   from /usr/local/include/io_lib/Read.h:43,
>                   from read.xs:5:
>/usr/local/include/io_lib/config.h:45:1: warning: "VERSION" redefined
>read.c:1:1: warning: this is the location of the previous definition
>In file included from /usr/local/include/io_lib/translate.h:20,
>                   from /usr/local/include/io_lib/Read.h:227,
>                   from read.xs:5:
>/usr/local/include/io_lib/expFileIO.h:42: parse error before "Exp_entries"
>/usr/local/include/io_lib/expFileIO.h:42: warning: data definition
>has no type o"
>and lots of similar warnings.
>
>So it seems like there is still a problem with these libraries,
>though I copied them where the compiler seems to be looking for them.
>
>Can someone point me in the right direction?
>
>
>It would be great if there was a way to get rid of the dependency on
>installing bioperl-ext to get bioperl to work. Even if the solution
>for my particular problem is simple, the need to install additional
>layers of dependencies takes a lot of the fun out of it for us lame
>biological types ;-)
>.
>
>Thanks.
>Jim
>--
>---------------------------------------------------------
>Jim Nolan                       email:JNolan@tulane.edu
>Department of Biochemistry SL43
>Tulane University Health Sciences Center
>1430 Tulane Ave.                Phone: (504)-584-2453
>New Orleans, LA 70112-2699      FAX:   (504)-584-2739
>http://www.tulane.edu/~biochem/faculty/nolan.htm
>http://phage.bioc.tulane.edu
>The requirements said: Windows 2000 or better.
>So I got a Macintosh.
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l


-- 
---------------------------------------------------------
Jim Nolan			email:JNolan@tulane.edu
Department of Biochemistry SL43
Tulane University Health Sciences Center
1430 Tulane Ave.		Phone: (504)-584-2453
New Orleans, LA 70112-2699	FAX:   (504)-584-2739
http://www.tulane.edu/~biochem/faculty/nolan.htm
http://phage.bioc.tulane.edu
The requirements said: Windows 2000 or better.
So I got a Macintosh.
From vamsi at warta.bio.psu.edu  Fri Oct  3 20:48:25 2003
From: vamsi at warta.bio.psu.edu (Vamsi)
Date: Fri Oct  3 20:46:09 2003
Subject: [Bioperl-l] Bio::Root::IO possible bug?
Message-ID: <200310040048.h940mPN24316@warta.bio.psu.edu>

Hi,

After I upgraded to Bioperl 1.2.3 having problems with even
simple scripts.

For instance

--------------
use Bio::SeqIO;
my $in = Bio::SeqIO->new(-file => $ARGV[0], -format => 'Fasta');
$in->next_seq;
$in->close;
--------------

results in

Can't call method "isa" on unblessed reference at /usr/perl5/site_perl/5.005/Bio/Root/IO.pm line 487, <GEN0> chunk 1.

Is there a bug in Bio::Root::IO.pm?

Thanks,
Vamsi

From 010381P at nyp.edu.sg  Fri Oct  3 21:35:43 2003
From: 010381P at nyp.edu.sg (010381P GOH PHUAY CHENG)
Date: Sat Oct  4 00:09:47 2003
Subject: [Bioperl-l] (no subject)
Message-ID: <EF4D1D640345BE4890E5594F109B5E8329B575@motivation.sit>

hi,
i am trying to use the GD module module but i encountered the following
error.. pls help me.. wat should i do.

GD object version 2.07 does not match $GD::VERSION 1.19 at
C:/Perl/lib/DynaLoade
r.pm line 249.
Compilation failed in require at
C:/Perl/site/lib/Bio/Graphics/Glyph/Factory.pm
line 55.
BEGIN failed--compilation aborted at
C:/Perl/site/lib/Bio/Graphics/Glyph/Factory
.pm line 55.
Compilation failed in require at C:/Perl/site/lib/Bio/Graphics/Panel.pm line
4.
BEGIN failed--compilation aborted at C:/Perl/site/lib/Bio/Graphics/Panel.pm
line
 4.
Compilation failed in require at C:/Perl/site/lib/Bio/Graphics.pm line 3.
BEGIN failed--compilation aborted at C:/Perl/site/lib/Bio/Graphics.pm line 3.
Compilation failed in require at chart.pl line 4.
BEGIN failed--compilation aborted at chart.pl line 4.

thanks 
blur queen
From jason at cgt.duhs.duke.edu  Sat Oct  4 08:37:23 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Sat Oct  4 08:35:00 2003
Subject: [Bioperl-l] OS X bioperl, staden/read, install problems (fwd)
Message-ID: <Pine.LNX.4.50.0310040837130.3344-100000@tenero.duhs.duke.edu>


--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu

---------- Forwarded message ----------
Date: Fri, 3 Oct 2003 17:12:08 -0500
From: Jim Nolan <jnolan@tulane.edu>
To: Jason Stajich <jason@cgt.duhs.duke.edu>
Subject: Re: [Bioperl-l] OS X bioperl, staden/read, install problems

I get 29  errors in 'make test' for bioperl 1.2.3, asking for staden:read.
Here is a truncated output from the tests. I removed all the "ok"s
and truncated this part of the error message for staden:read:
"If this is a CPAN/distributed module, you may need to reinstall it on your
rsystem.
To allow Inline to compile the module in a temporary cache, simply remove the
Inline config option 'VERSION=' from the Bio::SeqIO::staden::read module."

I get this error when I try to run my old scripts now.

The rest of the test output is below:
t/AlignIO....................ok 67/67One or more DATA sections were
not processed by Inline.

t/Biblio_biofetch............ok 11/11One or more DATA sections were
not processed by Inline.

t/BioDBGFF...................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/BioDBGFF.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/BioDBGFF...................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-120
         Failed 120/120 tests, 0.00% okay
t/BioFetch_DB................ok 27/27One or more DATA sections were
not processed by Inline.

t/DB.........................ok 78/78One or more DATA sections were
not processed by Inline.

         1/78 skipped:
t/EMBL_DB....................ok 15/15One or more DATA sections were
not processed by Inline.

t/FootPrinter................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/FootPrinter.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/FootPrinter................dubious
         Test returned status 255 (wstat 65280, 0xff00)
         after all the subtests completed successfully
t/Genewise...................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/Genewise.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/Genewise...................dubious
         Test returned status 255 (wstat 65280, 0xff00)
         after all the subtests completed successfully
t/Genpred....................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/Genpred.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/Genpred....................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-35
         Failed 35/35 tests, 0.00% okay
t/Index......................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/Index.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/Index......................dubious
         Test returned status 255 (wstat 65280, 0xff00)
         after all the subtests completed successfully
t/LiveSeq....................ok 48/48One or more DATA sections were
not processed by Inline.

t/LocusLink..................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/LocusLink.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/LocusLink..................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-23
         Failed 23/23 tests, 0.00% okay
t/Mutator....................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/Mutator.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/Mutator....................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-13
         Failed 13/13 tests, 0.00% okay
t/Perl.......................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/Perl.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/Perl.......................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-14
         Failed 14/14 tests, 0.00% okay
t/RefSeq.....................ok 3/13sleeping for 3 seconds
One or more DATA sections were not processed by Inline.

t/RepeatMasker...............The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/RepeatMasker.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/RepeatMasker...............dubious
         Test returned status 255 (wstat 65280, 0xff00)
         after all the subtests completed successfully
t/RootI......................ok 10/10One or more DATA sections were
not processed by Inline.

t/SeqAnalysisParser..........The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/SeqAnalysisParser.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/SeqAnalysisParser..........dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-11
         Failed 11/11 tests, 0.00% okay
t/SeqBuilder.................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/SeqBuilder.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/SeqBuilder.................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-101
         Failed 101/101 tests, 0.00% okay
t/SeqFeatCollection..........The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/SeqFeatCollection.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/SeqFeatCollection..........dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-428
         Failed 428/428 tests, 0.00% okay
t/SeqFeature.................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/SeqFeature.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/SeqFeature.................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-62
         Failed 62/62 tests, 0.00% okay
t/SeqIO......................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/SeqIO.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/SeqIO......................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-184
         Failed 184/184 tests, 0.00% okay
t/SequenceFamily.............The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/SequenceFamily.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/SequenceFamily.............dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-17
         Failed 17/17 tests, 0.00% okay
t/SimilarityPair.............The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/SimilarityPair.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/SimilarityPair.............dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-5
         Failed 5/5 tests, 0.00% okay
t/StandAloneBlast............The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/StandAloneBlast.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/StandAloneBlast............dubious
         Test returned status 255 (wstat 65280, 0xff00)
         after all the subtests completed successfully
t/Swiss......................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/Swiss.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/Swiss......................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-5
         Failed 5/5 tests, 0.00% okay
t/Tools......................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/Tools.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/Tools......................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-8
         Failed 8/8 tests, 0.00% okay
t/Variation_IO...............The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/Variation_IO.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/Variation_IO...............dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-25
         Failed 25/25 tests, 0.00% okay
t/XEMBL_DB...................ok 2/9One or more DATA sections were not
processed by Inline.

t/ePCR.......................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/ePCR.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/ePCR.......................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-23
         Failed 23/23 tests, 0.00% okay
t/flat.......................ok 11/11One or more DATA sections were
not processed by Inline.

t/game.......................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/game.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/game.......................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-23
         Failed 23/23 tests, 0.00% okay
t/largefasta.................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/largefasta.t line 0
INIT failed--call queue aborted, <DATA> line 1.
Use of uninitialized value in unlink at t/largefasta.t line 16, <DATA> line 1.
t/largefasta.................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-15
         Failed 15/15 tests, 0.00% okay
t/multiple_fasta.............The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/multiple_fasta.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/multiple_fasta.............dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-8
         Failed 8/8 tests, 0.00% okay
t/phd........................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/phd.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/phd........................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-5
         Failed 5/5 tests, 0.00% okay
t/primaryqual................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/primaryqual.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/primaryqual................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-31
         Failed 31/31 tests, 0.00% okay
t/qual.......................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/qual.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/qual.......................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-12
         Failed 12/12 tests, 0.00% okay
t/scf........................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/scf.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/scf........................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-15
         Failed 15/15 tests, 0.00% okay
t/splicedseq.................The extension 'Bio::SeqIO::staden::read'
is not properly installed in path:
   '/Library/Perl'
  at t/splicedseq.t line 0
INIT failed--call queue aborted, <DATA> line 1.
t/splicedseq.................dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-9
         Failed 9/9 tests, 0.00% okay
Failed Test           Stat Wstat Total Fail  Failed  List of Failed
-------------------------------------------------------------------------------
t/BioDBGFF.t           255 65280   120  240 200.00%  1-120
t/FootPrinter.t        255 65280    20    0   0.00%  ??
t/Genewise.t           255 65280    32    0   0.00%  ??
t/Genpred.t            255 65280    35   70 200.00%  1-35
t/Index.t              255 65280    32    0   0.00%  ??
t/LocusLink.t          255 65280    23   46 200.00%  1-23
t/Mutator.t            255 65280    13   26 200.00%  1-13
t/Perl.t               255 65280    14   28 200.00%  1-14
t/RepeatMasker.t       255 65280     6    0   0.00%  ??
t/SeqAnalysisParser.t  255 65280    11   22 200.00%  1-11
t/SeqBuilder.t         255 65280   101  202 200.00%  1-101
t/SeqFeatCollection.t  255 65280   428  856 200.00%  1-428
t/SeqFeature.t         255 65280    62  124 200.00%  1-62
t/SeqIO.t              255 65280   184  368 200.00%  1-184
t/SequenceFamily.t     255 65280    17   34 200.00%  1-17
t/SimilarityPair.t     255 65280     5   10 200.00%  1-5
t/StandAloneBlast.t    255 65280    10    0   0.00%  ??
t/Swiss.t              255 65280     5   10 200.00%  1-5
t/Tools.t              255 65280     8   16 200.00%  1-8
t/Variation_IO.t       255 65280    25   50 200.00%  1-25
t/ePCR.t               255 65280    23   46 200.00%  1-23
t/game.t               255 65280    23   46 200.00%  1-23
t/largefasta.t         255 65280    15   30 200.00%  1-15
t/multiple_fasta.t     255 65280     8   16 200.00%  1-8
t/phd.t                255 65280     5   10 200.00%  1-5
t/primaryqual.t        255 65280    31   62 200.00%  1-31
t/qual.t               255 65280    12   24 200.00%  1-12
t/scf.t                255 65280    15   30 200.00%  1-15
t/splicedseq.t         255 65280     9   18 200.00%  1-9
1 subtest skipped.
Failed 29/137 test scripts, 78.83% okay. 1192/6319 subtests failed,
81.14% okay.
make: *** [test_dynamic] Error 35

I never tried to install bioperl-ext before, but with all of these
errors, I figured it was a new dependency. some of the usual
dependencies I installed with fink, which mostly works OK (I
sometimes have to put a use/sw/libs/perl... line in scripts)

I appreciate your efforts to keep down the dependencies. Everything
was smooth for the last year or so on OS X.

Thanks again,
Jim

PS I was in the Duke Genetics Program, too.  I was in the same class
as Tai-ping Sun.
I enjoyed the dUPG web page

>And you've tried with bioperl 1.2.3?
>
>But I'm a little confused -
>
>  a) you do not need bioperl-ext unless you want to use the specialized
>     stuff in there (c-extensions to bioperl modules).
>
>  b) What test is giving you the warnings about IO::Staden - I can
>     find no reference to it in any of the tests.  Very weird error
>     t/FootPrinter.t has nothing to do with Staden.
>
>We have tried very hard (or at least I advocate it very strongly) not to
>have a lot of dependancies, I think we are using a bare minimum of them,
>IO::String being one of the most essential. Lincoln even wrote
>his own socket based HTTP access to begin to remove LWP as a dependancy.
>
>What people need to remember is when it warns you about extra modules when
>you run perl Makefile.PL you don't HAVE To install all of those things,
>only those which are dependancies for functionality you want.
>
>
>-jason
>On Fri, 3 Oct 2003, Jim Nolan wrote:
>
>>  I have been unable to install bioperl versions later than 1.2.1 on
>  > Mac OS X. The first error I get is during 'make test':
>  > "The extension 'Bio::SeqIO::staden::read' is not properly
>installed in path:
>  >    '/Library/Perl'
>>  If this is a CPAN/distributed module, you may need to reinstall it on your
>>  system.
>>  To allow Inline to compile the module in a temporary cache, simply
>>remove the
>>  Inline config option 'VERSION=' from the Bio::SeqIO::staden::read module.
>>    at t/FootPrinter.t line 0"
>>  and numerous other tests.
>>
>>  So I attempted to install bioperl-ext, which meant instaling the
>>  Staden io_lib, though I don't use Staden. This had the difficulty of
>>  errors caused by the locations of os.h and config.h
>>  (http://portal.open-bio.org/mailman/htdig/bioperl-l/2003-June/012472.html).
>>  I proceeded with the bioperl-ext install, using the directions at
>>  http://www.tc.umn.edu/~cann0010/Bioperl_OSX_install.html#bioperlxs
>>  (which worked great until recently) I did the -fPIC edit, I did the
>>  ranlib, but on make test for bioper-ext, I get:
>>  /usr/bin/perl -I/System/Library/Perl/darwin -I/System/Library/Perl
>>  /System/Library/Perl/ExtUtils/xsubpp  -typemap
>>  /System/Library/Perl/ExtUtils/typemap  read.xs > read.xsc && mv
>>  read.xsc read.c
>>  cc -c -I/Users/jnolan/Desktop/download/bioperl-ext/Bio/SeqIO/staden
>>  -I/usr/local/include/io_lib -g -pipe -pipe -fno-common
>>  -no-cpp-precomp -flat_namespace -DHAS_TELLDIR_PROTOTYPE
>>  -fno-strict-aliasing -Os     -DVERSION=\"0.01\" -DXS_VERSION=\"0.01\"
>  > -I/System/Library/Perl/darwin/CORE  read.c
>  > In file included from /usr/local/include/io_lib/os.h:4,
>  >                   from /usr/local/include/io_lib/Read.h:43,
>>                    from read.xs:5:
>>  /usr/local/include/io_lib/config.h:45:1: warning: "VERSION" redefined
>>  read.c:1:1: warning: this is the location of the previous definition
>>  In file included from /usr/local/include/io_lib/translate.h:20,
>>                    from /usr/local/include/io_lib/Read.h:227,
>>                    from read.xs:5:
>>  /usr/local/include/io_lib/expFileIO.h:42: parse error before "Exp_entries"
>>  /usr/local/include/io_lib/expFileIO.h:42: warning: data definition
>>  has no type o"
>>  and lots of similar warnings.
>>
>>  So it seems like there is still a problem with these libraries,
>>  though I copied them where the compiler seems to be looking for them.
>>
>>  Can someone point me in the right direction?
>>
>>
>>  It would be great if there was a way to get rid of the dependency on
>>  installing bioperl-ext to get bioperl to work. Even if the solution
>>  for my particular problem is simple, the need to install additional
>>  layers of dependencies takes a lot of the fun out of it for us lame
>>  biological types ;-)
>>  .
>>
>>  Thanks.
>>  Jim
>>
>
>--
>Jason Stajich
>Duke University
>jason at cgt.mc.duke.edu


-- 
---------------------------------------------------------
Jim Nolan			email:JNolan@tulane.edu
Department of Biochemistry SL43
Tulane University Health Sciences Center
1430 Tulane Ave.		Phone: (504)-584-2453
New Orleans, LA 70112-2699	FAX:   (504)-584-2739
http://www.tulane.edu/~biochem/faculty/nolan.htm
http://phage.bioc.tulane.edu
The requirements said: Windows 2000 or better.
So I got a Macintosh.
From jason at cgt.duhs.duke.edu  Sat Oct  4 08:48:23 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Sat Oct  4 08:45:59 2003
Subject: [Bioperl-l] Bio::Root::IO possible bug?
In-Reply-To: <200310040048.h940mPN24316@warta.bio.psu.edu>
References: <200310040048.h940mPN24316@warta.bio.psu.edu>
Message-ID: <Pine.LNX.4.50.0310040839590.3344-100000@tenero.duhs.duke.edu>

This seems like a perl 5.005 bug I guess - we obviously did't see this
error in any of our tests.

You can try and add this to the logic in the if statement on line 487
which is causing you grief:

if( ref($self->{'_filehandle'}) =~ /GLOB/i  ||
    ! ref($self->{'_filehandle'}) ||
    ! $self->{'_filehandle'}->isa('IO::String') ) {
  close($self->{'_filehandle'});
}

-jason
On Fri, 3 Oct 2003, Vamsi wrote:

> Hi,
>
> After I upgraded to Bioperl 1.2.3 having problems with even
> simple scripts.
>
> For instance
>
> --------------
> use Bio::SeqIO;
> my $in = Bio::SeqIO->new(-file => $ARGV[0], -format => 'Fasta');
> $in->next_seq;
> $in->close;
> --------------
>
> results in
>
> Can't call method "isa" on unblessed reference at /usr/perl5/site_perl/5.005/Bio/Root/IO.pm line 487, <GEN0> chunk 1.
>
> Is there a bug in Bio::Root::IO.pm?
>
> Thanks,
> Vamsi
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From eFredcadx at hotmail.com  Sat Oct  4 08:49:34 2003
From: eFredcadx at hotmail.com (C B R C)
Date: Sat Oct  4 08:47:22 2003
Subject: [Bioperl-l] Available; Government and private financing in Canada
Message-ID: <200310041247.h94Cl8db019641@portal.open-bio.org>


Canadian Subsidy Directory
4865 Hwy. 138
St-Andrews west
Ontario
K0C 2A0

CANADIAN SUBSIDY DIRECTORY YEAR 2003 EDITION

PRESS RELEASE


Legal Deposit-National Library of Canada
ISBN 2-922870-05-7

The new revised edition of the Canadian Subsidy Directory 2003 is now
available. 
The new edition is the most complete and affordable reference for anyone
looking for financial support.
It is deemed to be the perfect tool for new or existing businesses,
individual ventures, foundations and associations.

This Publication contains  more than 2000 direct and indirect financial
subsidies, grants and loans offered by government departments and
agencies, foundations, associations and organisations.  In this new 2003
edition
all programs are well described.

The Canadian Subsidy Directory is the most comprehensive tool to start up
a business, improve existent activities, set up a business plan, or obtain
assistance from experts in fields such as: Industry, transport,
agriculture, communications, municipal infrastructure, education,
import-export, labor, construction and renovation, the service sector,
hi-tech industries, research and development, joint ventures, arts,
cinema, theatre, music and recording industry, the self employed,
contests, and new talents.
Assistance from and for foundations and associations, guidance to prepare
a business plan, market surveys, computers, and much more!

The Canadian Subsidy Directory is sold $ 69.95, to obtain a copy please
call:
Canadian Business Ressource Center at  (250) 381-4822 or Canadian
Publications (819) 322-5756
From jason at cgt.duhs.duke.edu  Sat Oct  4 08:55:51 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Sat Oct  4 08:54:10 2003
Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
In-Reply-To: <a05210602bba3ab395f97@[129.81.38.94]>
References: <GAEDKMGOKFBLJPKCLKCCIEMHDCAA.brian_osborne@cognia.com>
	<a05210602bba3ab395f97@[129.81.38.94]>
Message-ID: <Pine.LNX.4.50.0310040848560.3344-100000@tenero.duhs.duke.edu>

I would try to get things back to a simple state.

Are you using the system installed perl 5.6.0?

Hopefully you've made your changes on your system by adding things to
PERL5LIB so you can just reset it to a simple state.   If not, you'll be
picking out files and dirs from /Library/Perl I guess

Get rid of any of the bioperl-ext stuff from your paths, you don't need
them.

At a very minimum look at Bio::SeqIO.pm - you can comment out the
following lines and see if your warnings go away.

#sub BEGIN {
#    eval { require Bio::SeqIO::staden::read; };
#}

We've managed to have so many problems with this - I would just suggest
people comment it out from their local installs if they have problems
until Aaron or someone can get detailed, reproduceable problem reports
from folks.

I would really try and whittle things down to a bare minimum and then
try and run the tests.  You can always run a specific test by saying
% make test_FootPrinter
or
% perl -I. -w t/FootPrinter.t

to see more details and when the error is occuring.

-jason

On Fri, 3 Oct 2003, Jim Nolan wrote:

> I sent the full list of test errors to Jason, but I got 29 errors
> looking for Staden::read when I tested Bioperl 1.2.3.
> These were the tests that asked for it:
> BioDBGFF, FootPrin, Genewise, Genpred, Index, LocusLin, Mutator,
> Perl, RepeatMa, SeqAnaly, SeqBuild, SeqFeatC, SeqFeatu, SeqIO,
> Sequence, Similari, StandAlo, Swiss, Tools, Variatio, ePCR, game,
> largefas, multiple, phd, primaryq, qual, scf, splicedseq
>
> I think I get even more for the CVS version.
> I guess it must be something specific to my machine, since nobody's
> heard of this one. When I saw the Staden::read errors, then I figured
> I had to have bioperl-ext.
> Sorry I whined, I thought that it was just a new dependency that
> wasn't as much of a hassle for most machines as it is for mine.
>
> If anyone has any clues as to what I did wrong, I'd be glad to hear them.
>
> Thanks
> Jim
>
> >Jim,
> >
> >>  It would be great if there was a way to get rid of the dependency on
> >installing bioperl-ext to get bioperl to work.
> >
> >This is odd. I've done 'make install' many, many times but I've never
> >installed nor used the bioperl-ext library. Ever. Do you think you could
> >have done something unconventional to see the behavior you're seeing?
> >
> >Brian O.
> >
> >-----Original Message-----
> >From: bioperl-l-bounces@portal.open-bio.org
> >[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Jim Nolan
> >Sent: Friday, October 03, 2003 5:06 PM
> >To: bioperl-l@portal.open-bio.org
> >Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
> >
> >I have been unable to install bioperl versions later than 1.2.1 on
> >Mac OS X. The first error I get is during 'make test':
> >"The extension 'Bio::SeqIO::staden::read' is not properly installed in path:
> >    '/Library/Perl'
> >If this is a CPAN/distributed module, you may need to reinstall it on your
> >system.
> >To allow Inline to compile the module in a temporary cache, simply remove
> >the
> >Inline config option 'VERSION=' from the Bio::SeqIO::staden::read module.
> >   at t/FootPrinter.t line 0"
> >and numerous other tests.
> >
> >So I attempted to install bioperl-ext, which meant instaling the
> >Staden io_lib, though I don't use Staden. This had the difficulty of
> >errors caused by the locations of os.h and config.h
> >(http://portal.open-bio.org/mailman/htdig/bioperl-l/2003-June/012472.html).
> >I proceeded with the bioperl-ext install, using the directions at
> >http://www.tc.umn.edu/~cann0010/Bioperl_OSX_install.html#bioperlxs
> >(which worked great until recently) I did the -fPIC edit, I did the
> >ranlib, but on make test for bioper-ext, I get:
> >/usr/bin/perl -I/System/Library/Perl/darwin -I/System/Library/Perl
> >/System/Library/Perl/ExtUtils/xsubpp  -typemap
> >/System/Library/Perl/ExtUtils/typemap  read.xs > read.xsc && mv
> >read.xsc read.c
> >cc -c -I/Users/jnolan/Desktop/download/bioperl-ext/Bio/SeqIO/staden
> >-I/usr/local/include/io_lib -g -pipe -pipe -fno-common
> >-no-cpp-precomp -flat_namespace -DHAS_TELLDIR_PROTOTYPE
> >-fno-strict-aliasing -Os     -DVERSION=\"0.01\" -DXS_VERSION=\"0.01\"
> >-I/System/Library/Perl/darwin/CORE  read.c
> >In file included from /usr/local/include/io_lib/os.h:4,
> >                   from /usr/local/include/io_lib/Read.h:43,
> >                   from read.xs:5:
> >/usr/local/include/io_lib/config.h:45:1: warning: "VERSION" redefined
> >read.c:1:1: warning: this is the location of the previous definition
> >In file included from /usr/local/include/io_lib/translate.h:20,
> >                   from /usr/local/include/io_lib/Read.h:227,
> >                   from read.xs:5:
> >/usr/local/include/io_lib/expFileIO.h:42: parse error before "Exp_entries"
> >/usr/local/include/io_lib/expFileIO.h:42: warning: data definition
> >has no type o"
> >and lots of similar warnings.
> >
> >So it seems like there is still a problem with these libraries,
> >though I copied them where the compiler seems to be looking for them.
> >
> >Can someone point me in the right direction?
> >
> >
> >It would be great if there was a way to get rid of the dependency on
> >installing bioperl-ext to get bioperl to work. Even if the solution
> >for my particular problem is simple, the need to install additional
> >layers of dependencies takes a lot of the fun out of it for us lame
> >biological types ;-)
> >.
> >
> >Thanks.
> >Jim
> >--
> >---------------------------------------------------------
> >Jim Nolan                       email:JNolan@tulane.edu
> >Department of Biochemistry SL43
> >Tulane University Health Sciences Center
> >1430 Tulane Ave.                Phone: (504)-584-2453
> >New Orleans, LA 70112-2699      FAX:   (504)-584-2739
> >http://www.tulane.edu/~biochem/faculty/nolan.htm
> >http://phage.bioc.tulane.edu
> >The requirements said: Windows 2000 or better.
> >So I got a Macintosh.
> >_______________________________________________
> >Bioperl-l mailing list
> >Bioperl-l@portal.open-bio.org
> >http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From vamsi at warta.bio.psu.edu  Sat Oct  4 13:46:43 2003
From: vamsi at warta.bio.psu.edu (Vamsi)
Date: Sat Oct  4 13:44:33 2003
Subject: [Bioperl-l] Bio::Root::IO possible bug?
In-Reply-To: <Pine.LNX.4.50.0310040839590.3344-100000@tenero.duhs.duke.edu>
	from "Jason Stajich" at Oct 04, 2003 08:48:23 AM
Message-ID: <200310041746.h94Hkh325784@warta.bio.psu.edu>

Yes, I rewrote the lines as

if ( ! ref($self->{'_filehandle'}) ||
     (ref($self->{'_filehandle'}) eq 'IO::String') ) {
   close($self->{'_filehandle'});
}

and it works fine. The trouble seems to be with the use of 'isa'

Thanks,
Vamsi
> 
> This seems like a perl 5.005 bug I guess - we obviously did't see this
> error in any of our tests.
> 
> You can try and add this to the logic in the if statement on line 487
> which is causing you grief:
> 
> if( ref($self->{'_filehandle'}) =~ /GLOB/i  ||
>     ! ref($self->{'_filehandle'}) ||
>     ! $self->{'_filehandle'}->isa('IO::String') ) {
>   close($self->{'_filehandle'});
> }
> 
> -jason
> On Fri, 3 Oct 2003, Vamsi wrote:
> 
> > Hi,
> >
> > After I upgraded to Bioperl 1.2.3 having problems with even
> > simple scripts.
> >
> > For instance
> >
> > --------------
> > use Bio::SeqIO;
> > my $in = Bio::SeqIO->new(-file => $ARGV[0], -format => 'Fasta');
> > $in->next_seq;
> > $in->close;
> > --------------
> >
> > results in
> >
> > Can't call method "isa" on unblessed reference at /usr/perl5/site_perl/5.005/Bio/Root/IO.pm line 487, <GEN0> chunk 1.
> >
> > Is there a bug in Bio::Root::IO.pm?
> >
> > Thanks,
> > Vamsi
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
> 
> --
> Jason Stajich
> Duke University
> jason at cgt.mc.duke.edu
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

From lstein at cshl.edu  Sat Oct  4 14:18:54 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Sat Oct  4 14:16:51 2003
Subject: [Bioperl-l] Bio::Graphics segments problem
In-Reply-To: <200309291926.h8TJQik06123@warta.bio.psu.edu>
References: <200309291926.h8TJQik06123@warta.bio.psu.edu>
Message-ID: <200310041418.54706.lstein@cshl.edu>

Hi,

I know it looks weird, but a Bio::Graphics now defines the 1..2 as:

	1-------2-------3
        =========

That is, the width of the line will be exactly two units long.  The position 
of the dimensionless coordinate is to the left of the graphic.  In graphics 
terminology it is a "half open interval".  I know you would prefer this:

	1------2-------3
        =====

But this causes visual anomalies at high magnifications when the sequence is 
shown:

	0------1-----2------3
         A   GC?

I am willing to consider this alternative:

	0-----1------2------3
            ========

but it might be troublesome.

Lincoln

On Monday 29 September 2003 03:26 pm, Vamsi wrote:
> use Bio::Graphics::Panel;
> use Bio::SeqFeature::Generic;
>
> my $length = 100;
> my $seq = Bio::SeqFeature::Generic->new(-start => 1,
>                                         -end => $length);
>
> my $p = new Bio::Graphics::Panel(
>                                  -length => $length,
>                                  -width => 800,
>                                  );
> $p->add_track($seq,
>               -glyph => 'arrow',
>               -tick => 2,
>               -double => 1,
>               );
>
> #$p->add_track(segments => [[1,40],[61,90]]);
> my $g = $p->add_track();
> $g->add_group([[1,40],[61,90]]);
>
> open(F, '>b.png');
> binmode F;
> print F $p->png();
> close F;

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)
From jnolan at tulane.edu  Sat Oct  4 14:42:56 2003
From: jnolan at tulane.edu (Jim Nolan)
Date: Sat Oct  4 14:32:10 2003
Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
In-Reply-To: <Pine.LNX.4.50.0310040848560.3344-100000@tenero.duhs.duke.edu>
References: <GAEDKMGOKFBLJPKCLKCCIEMHDCAA.brian_osborne@cognia.com>
	<a05210602bba3ab395f97@[129.81.38.94]>
	<Pine.LNX.4.50.0310040848560.3344-100000@tenero.duhs.duke.edu>
Message-ID: <a05200f00bba4c44a09d0@[137.30.89.89]>

Yes, I am using the system perl 5.6.0. I will try your suggestions on 
Monday, and let you know if anything interesting happens.

Thanks!
Jim

At 8:55 AM -0400 10/4/03, Jason Stajich wrote:
>I would try to get things back to a simple state.
>
>Are you using the system installed perl 5.6.0?
>
>Hopefully you've made your changes on your system by adding things to
>PERL5LIB so you can just reset it to a simple state.   If not, you'll be
>picking out files and dirs from /Library/Perl I guess
>
>Get rid of any of the bioperl-ext stuff from your paths, you don't need
>them.
>
>At a very minimum look at Bio::SeqIO.pm - you can comment out the
>following lines and see if your warnings go away.
>
>#sub BEGIN {
>#    eval { require Bio::SeqIO::staden::read; };
>#}
>
>We've managed to have so many problems with this - I would just suggest
>people comment it out from their local installs if they have problems
>until Aaron or someone can get detailed, reproduceable problem reports
>from folks.
>
>I would really try and whittle things down to a bare minimum and then
>try and run the tests.  You can always run a specific test by saying
>% make test_FootPrinter
>or
>% perl -I. -w t/FootPrinter.t
>
>to see more details and when the error is occuring.
>
>-jason
>
>On Fri, 3 Oct 2003, Jim Nolan wrote:
>
>>  I sent the full list of test errors to Jason, but I got 29 errors
>>  looking for Staden::read when I tested Bioperl 1.2.3.
>>  These were the tests that asked for it:
>>  BioDBGFF, FootPrin, Genewise, Genpred, Index, LocusLin, Mutator,
>>  Perl, RepeatMa, SeqAnaly, SeqBuild, SeqFeatC, SeqFeatu, SeqIO,
>>  Sequence, Similari, StandAlo, Swiss, Tools, Variatio, ePCR, game,
>>  largefas, multiple, phd, primaryq, qual, scf, splicedseq
>>
>>  I think I get even more for the CVS version.
>>  I guess it must be something specific to my machine, since nobody's
>>  heard of this one. When I saw the Staden::read errors, then I figured
>>  I had to have bioperl-ext.
>>  Sorry I whined, I thought that it was just a new dependency that
>>  wasn't as much of a hassle for most machines as it is for mine.
>>
>>  If anyone has any clues as to what I did wrong, I'd be glad to hear them.
>>
>>  Thanks
>>  Jim
>>
>>  >Jim,
>>  >
>>  >>  It would be great if there was a way to get rid of the dependency on
>>  >installing bioperl-ext to get bioperl to work.
>>  >
>>  >This is odd. I've done 'make install' many, many times but I've never
>>  >installed nor used the bioperl-ext library. Ever. Do you think you could
>>  >have done something unconventional to see the behavior you're seeing?
>>  >
>>  >Brian O.
>>  >
>>  >-----Original Message-----
>>  >From: bioperl-l-bounces@portal.open-bio.org
>>  >[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Jim Nolan
>>  >Sent: Friday, October 03, 2003 5:06 PM
>>  >To: bioperl-l@portal.open-bio.org
>>  >Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
>>  >
>>  >I have been unable to install bioperl versions later than 1.2.1 on
>>  >Mac OS X. The first error I get is during 'make test':
>>  >"The extension 'Bio::SeqIO::staden::read' is not properly 
>>installed in path:
>>  >    '/Library/Perl'
>>  >If this is a CPAN/distributed module, you may need to reinstall it on your
>>  >system.
>>  >To allow Inline to compile the module in a temporary cache, simply remove
>>  >the
>>  >Inline config option 'VERSION=' from the Bio::SeqIO::staden::read module.
>>  >   at t/FootPrinter.t line 0"
>>  >and numerous other tests.
>>  >
>>  >So I attempted to install bioperl-ext, which meant instaling the
>>  >Staden io_lib, though I don't use Staden. This had the difficulty of
>>  >errors caused by the locations of os.h and config.h
>>  >(http://portal.open-bio.org/mailman/htdig/bioperl-l/2003-June/012472.html).
>>  >I proceeded with the bioperl-ext install, using the directions at
>>  >http://www.tc.umn.edu/~cann0010/Bioperl_OSX_install.html#bioperlxs
>>  >(which worked great until recently) I did the -fPIC edit, I did the
>  > >ranlib, but on make test for bioper-ext, I get:
>>  >/usr/bin/perl -I/System/Library/Perl/darwin -I/System/Library/Perl
>>  >/System/Library/Perl/ExtUtils/xsubpp  -typemap
>>  >/System/Library/Perl/ExtUtils/typemap  read.xs > read.xsc && mv
>>  >read.xsc read.c
>>  >cc -c -I/Users/jnolan/Desktop/download/bioperl-ext/Bio/SeqIO/staden
>>  >-I/usr/local/include/io_lib -g -pipe -pipe -fno-common
>>  >-no-cpp-precomp -flat_namespace -DHAS_TELLDIR_PROTOTYPE
>>  >-fno-strict-aliasing -Os     -DVERSION=\"0.01\" -DXS_VERSION=\"0.01\"
>>  >-I/System/Library/Perl/darwin/CORE  read.c
>>  >In file included from /usr/local/include/io_lib/os.h:4,
>>  >                   from /usr/local/include/io_lib/Read.h:43,
>>  >                   from read.xs:5:
>>  >/usr/local/include/io_lib/config.h:45:1: warning: "VERSION" redefined
>>  >read.c:1:1: warning: this is the location of the previous definition
>>  >In file included from /usr/local/include/io_lib/translate.h:20,
>>  >                   from /usr/local/include/io_lib/Read.h:227,
>>  >                   from read.xs:5:
>>  >/usr/local/include/io_lib/expFileIO.h:42: parse error before "Exp_entries"
>>  >/usr/local/include/io_lib/expFileIO.h:42: warning: data definition
>>  >has no type o"
>>  >and lots of similar warnings.
>>  >
>>  >So it seems like there is still a problem with these libraries,
>>  >though I copied them where the compiler seems to be looking for them.
>>  >
>>  >Can someone point me in the right direction?
>>  >
>>  >
>>  >It would be great if there was a way to get rid of the dependency on
>>  >installing bioperl-ext to get bioperl to work. Even if the solution
>>  >for my particular problem is simple, the need to install additional
>>  >layers of dependencies takes a lot of the fun out of it for us lame
>>  >biological types ;-)
>>  >.
>>  >
>>  >Thanks.
>>  >Jim
>>  >--
>>  >---------------------------------------------------------
>>  >Jim Nolan                       email:JNolan@tulane.edu
>>  >Department of Biochemistry SL43
>>  >Tulane University Health Sciences Center
>>  >1430 Tulane Ave.                Phone: (504)-584-2453
>>  >New Orleans, LA 70112-2699      FAX:   (504)-584-2739
>>  >http://www.tulane.edu/~biochem/faculty/nolan.htm
>>  >http://phage.bioc.tulane.edu
>>  >The requirements said: Windows 2000 or better.
>>  >So I got a Macintosh.
>>  >_______________________________________________
>>  >Bioperl-l mailing list
>>  >Bioperl-l@portal.open-bio.org
>>  >http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>
>>
>>
>
>--
>Jason Stajich
>Duke University
>jason at cgt.mc.duke.edu

From lstein at cshl.edu  Sat Oct  4 14:20:31 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Sat Oct  4 18:55:52 2003
Subject: [Bioperl-l] GD  problem
In-Reply-To: <871010117.1063954521108.JavaMail.nobody@storage.ni.bg>
References: <871010117.1063954521108.JavaMail.nobody@storage.ni.bg>
Message-ID: <200310041420.31209.lstein@cshl.edu>

When you installed GD, did "make test" pass all its tests?  If so, then 
there's something awry in your script with respect to the @INC path or with 
PERL5LIB environment variable.  If not, then you should send me a copy of the 
GD make test output.

Lincoln

On Friday 19 September 2003 02:55 am, Vesko Baev wrote:
> Hello,
> We have Linux SuSE, Perl(latest ver.),Bioperl(latest ver.).We installed GD
> package and then we started the programm we have this error (but we have
> libgd): Can't locate loadible object for module GD
>
> What's that mean?
> Thanks!
>
>
> -----------------------------------------------------------------
> http://club.abv.bg/jsp/abvCard.jsp - ???? ??? - ??????? ? ?????? ????? ?
> ???? ! _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)

From martin_artist at yahoo.com  Sun Oct  5 09:46:17 2003
From: martin_artist at yahoo.com (ENRON  =  Swarzenegger)
Date: Sun Oct  5 12:45:52 2003
Subject: [Bioperl-l] Stealing CA $9 billions = Recall Election
Message-ID: <200310051645.h95GjSdb025656@portal.open-bio.org>


Bush bible PNAC 

 Swarzeneger got money from ENRON to forgive 9 billions pay back

I'm very worry, desperate PNAC may do 3rd Pearl Harbor to US.  -  Second is here:  1997

http://www.newamericancentury.org/RebuildingAmericasDefenses.pdf

page 48 Clinton economy is good   -   page 51 on document or 63 / on Acrobat viewer.. 

- We have to have major attack on America first, to convince people

OPEN YOUR MIND FOR YOURSELF AND YOUR KIDS LIFE; 	http://www.alternetnews.com/

Arnold or Adolph, what the different;   D A N K E,    will Terminate US all & California first
Prescott  help Adolph, George him & Enron;  Gasoline $4 per gallon is just the beginning.

Actor = skilled liar, how can we trust them ?  I heard,  his porno job is for sale; & He is ENRON BOY

http://www.tarpley.net/bushb.htm               http://www.bartcop.com/421102.htm

http://www.bk2k.com/bushbodycount/prescott-bush/index.shtml        http://hnn.us/comments/15105.html

 Hitler, Noriega, Hussein, bin Laden, Pinochet   -  did business with the Bush family.
Now we do business with Bush as well;   -  $500 000 000 000 is gone, missing, and make US poor
Perfect Taxpayers Rip Off -  MONEY LAUNDRY    -   true Iraq repair kidding,  - direct to own pocket,


Isn't this a crime ??  + Lie of war reasons, and lie, and lie, and lie.
Please look on this and think by yourself; where we are going ??  
Or PNAC push us there, like stick to a blind sheep. - Start thinking on your own, please. 

http://pnac.tvnewslies.org							http://www.tvnewslies.org
START THINKING AGAIN !!!
www.moveon.org/news/1577.html						www.prisonplanet.com

www.moveon.org	www.votetoimpeach.com

www.kgo810.com		listen       810 AM;    Bernie,   Genie,  Ray only    (night time)

How many Concentration Camps do they need today ?    one for sure is on Cuba.

If,  after that news - you can't sleep too well -;   get Warren book;   " The Governor "   try understand  - how to blackmail Congress   -  Or movie films;       " Enemy of the State"  that most of us  today         " Spy like US "    - abusing power by top managers    " Air America "   - how marihuana come to US

It may happened, Pentagon will have to arrest White House  and his own " Liar in Chief" to save USA  
On the base of George Washington Legacy
War President & his bible "PNAC" inventing Terrorism & new Pearl Harbor: 

http://www.newamericancentury.org/RebuildingAmericasDefenses.pdf )

From martin_artist at yahoo.com  Sun Oct  5 09:46:18 2003
From: martin_artist at yahoo.com (ENRON  =  Swarzenegger)
Date: Sun Oct  5 12:45:59 2003
Subject: [Bioperl-l] Stealing CA $9 billions = Recall Election
Message-ID: <200310051645.h95GjSdb025657@portal.open-bio.org>


Bush bible PNAC 

 Swarzeneger got money from ENRON to forgive 9 billions pay back

I'm very worry, desperate PNAC may do 3rd Pearl Harbor to US.  -  Second is here:  1997

http://www.newamericancentury.org/RebuildingAmericasDefenses.pdf

page 48 Clinton economy is good   -   page 51 on document or 63 / on Acrobat viewer.. 

- We have to have major attack on America first, to convince people

OPEN YOUR MIND FOR YOURSELF AND YOUR KIDS LIFE; 	http://www.alternetnews.com/

Arnold or Adolph, what the different;   D A N K E,    will Terminate US all & California first
Prescott  help Adolph, George him & Enron;  Gasoline $4 per gallon is just the beginning.

Actor = skilled liar, how can we trust them ?  I heard,  his porno job is for sale; & He is ENRON BOY

http://www.tarpley.net/bushb.htm               http://www.bartcop.com/421102.htm

http://www.bk2k.com/bushbodycount/prescott-bush/index.shtml        http://hnn.us/comments/15105.html

 Hitler, Noriega, Hussein, bin Laden, Pinochet   -  did business with the Bush family.
Now we do business with Bush as well;   -  $500 000 000 000 is gone, missing, and make US poor
Perfect Taxpayers Rip Off -  MONEY LAUNDRY    -   true Iraq repair kidding,  - direct to own pocket,


Isn't this a crime ??  + Lie of war reasons, and lie, and lie, and lie.
Please look on this and think by yourself; where we are going ??  
Or PNAC push us there, like stick to a blind sheep. - Start thinking on your own, please. 

http://pnac.tvnewslies.org							http://www.tvnewslies.org
START THINKING AGAIN !!!
www.moveon.org/news/1577.html						www.prisonplanet.com

www.moveon.org	www.votetoimpeach.com

www.kgo810.com		listen       810 AM;    Bernie,   Genie,  Ray only    (night time)

How many Concentration Camps do they need today ?    one for sure is on Cuba.

If,  after that news - you can't sleep too well -;   get Warren book;   " The Governor "   try understand  - how to blackmail Congress   -  Or movie films;       " Enemy of the State"  that most of us  today         " Spy like US "    - abusing power by top managers    " Air America "   - how marihuana come to US

It may happened, Pentagon will have to arrest White House  and his own " Liar in Chief" to save USA  
On the base of George Washington Legacy
War President & his bible "PNAC" inventing Terrorism & new Pearl Harbor: 

http://www.newamericancentury.org/RebuildingAmericasDefenses.pdf )

From nel at birc.dk  Sun Oct  5 12:49:57 2003
From: nel at birc.dk (Niels Larsen)
Date: Sun Oct  5 12:52:44 2003
Subject: [Bioperl-l] Bio::Graphics output
In-Reply-To: <200310031227.37523.lstein@cshl.edu>
References: <BEE28BF86078B6429D6C780635718E212E7395@morelia.be.devgen.com>
	<200309122139.47541.nel@birc.dk>
	<200310031227.37523.lstein@cshl.edu>
Message-ID: <200310051849.57173.nel@birc.dk>

Hmm .. unless one doesnt like temporary files, which someone have 
to come and clean up. The crude piece below does it without, and 
with a separate low-overhead image-serving script and a low number
of images per page it would work reasonably well. 

Niels

On Friday 03 Oct 2003 6:27 pm, Lincoln Stein wrote:
> Then you have to generate a temporary file and point at it.
> 
> Lincoln
> 
> On Friday 12 September 2003 03:39 pm, Niels Larsen wrote:
> > > You have to output the image to a file and make an html img tag pointing
> > > to that file. HTH,
> > > Marc
> >
> > Or have a script make it while the page is loaded,
> >
> > <img src="make_image.cgi">
> >
> > That works good for few large images; many mall ones would
> > slow the server.
> >
> > Niels L


#!/usr/bin/env perl

# Place script in document root

use strict;
use warnings;

use GD;
use CGI;

my ( $cgi, $img, $black, $red, $script );

$cgi = new CGI;

print qq (Content-type: text/html\n\n);

if ( $cgi->param('make_image') )
{
    $img = new GD::Image(100,100);

    $black = $img->colorAllocate( 0,0,0 );       
    $red = $img->colorAllocate( 255,0,0 );      

    $img->rectangle( 0,0,99,99,$black );
    $img->fill( 50,50,$red );

    binmode STDOUT;

    print $img->png;
}
else
{
    $script = ( split "/", $0 )[-1];
    print qq (<img src="/$script?make_image=1">\n);
}

From lstein at cshl.edu  Sun Oct  5 12:44:40 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Sun Oct  5 14:02:58 2003
Subject: [Bioperl-l] Re: [apollo] Guidance for a newbie
In-Reply-To: <F0568A55-F6A0-11D7-8649-000A95A001EE@oriongenomics.com>
References: <F0568A55-F6A0-11D7-8649-000A95A001EE@oriongenomics.com>
Message-ID: <200310051244.40031.lstein@cshl.edu>

On Saturday 04 October 2003 03:28 pm, Andrew Nunberg wrote:
> I am trying to convert genbank records into a format that is readable
> by Appollo.
> I can achieve this with gff format but I would like to do some
> curation/annotation so I am now trying to create game xml.
>

Hi,

This is the same problem that I have.  The approach that I am taking is to 
rewrite the GAME SequenceIO module in Bioperl so that it creates the correct 
version of GAME (the version it currently creates is incompatible with 
Bioperl).  Then what you want to do will be as easy as:

	$in = Bio::SeqIO->new(-file=>'genbank_file',-format=>'genbank');
	$out = Bio::SeqIO->new(-file=>'genbank_file.game',-format=>'GAME');
	while (my $seq = $in->next_seq) {
		$out->write_seq($seq);
	}

Because of many other obligations, this is moving very slowly and don't expect 
much progress until things lighten up for me in November.  If you or anyone 
else on these lists want to work on this, I will gladly share my notes and 
what I've got so far.

Lincoln

On Saturday 04 October 2003 03:28 pm, Andrew Nunberg wrote:
> I am trying to convert genbank records into a format that is readable
> by Appollo.
> I can achieve this with gff format but I would like to do some
> curation/annotation so I am now trying to create game xml.
>
> I see two basic strategies, creating annotation or
> computational_analysis tracks
> In either case, i am lacking data to make the xml
>
> For annotation there is a requirement for one more sequences to go with
> the 'gene'.
> for computational_analysis i am require to give output
> this is according to the dtd i have.
>
> Has anyone tried to do this and what approach did you take?
> Is it possible to fake the sequence requirement for annotations?
> so far the only xml that has worked is one with just the root sequence
> and nothing else..
>
> Thanks
> Andy
> ---------------------------------------------------
> Andrew Nunberg Ph.D
> Bioinfomagician
> Orion Genomics
> 4041 Forest Park
> St Louis, MO
> 314-615-6989
> anunberg@oriongenomics.com
> www.oriongenomics.com
>
> _______________________________________________
> apollo mailing list
> apollo@mail.fruitfly.org
> http://mail.fruitfly.org/mailman/listinfo/apollo

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)
From e_lyn_goh at yahoo.com  Sun Oct  5 22:09:59 2003
From: e_lyn_goh at yahoo.com (Eileen goh)
Date: Mon Oct  6 00:11:39 2003
Subject: [Bioperl-l] problems running Bio::Graphics
Message-ID: <20031006020959.96255.qmail@web40502.mail.yahoo.com>

hi,
i am trying to use the Bio::Graphics. i kept encountering the following error while running the Bio:: graphics.. wat should i do..
 
GD object version 2.07 does not match $GD::VERSION 1.19 at C:/Perl/lib/DynaLoade
r.pm line 249.
Compilation failed in require at C:/Perl/site/lib/Bio/Graphics/Glyph/Factory.pm
line 55.
BEGIN failed--compilation aborted at C:/Perl/site/lib/Bio/Graphics/Glyph/Factory
.pm line 55.
Compilation failed in require at C:/Perl/site/lib/Bio/Graphics/Panel.pm line 4.
BEGIN failed--compilation aborted at C:/Perl/site/lib/Bio/Graphics/Panel.pm line
 4.
Compilation failed in require at C:/Perl/site/lib/Bio/Graphics.pm line 3.
BEGIN failed--compilation aborted at C:/Perl/site/lib/Bio/Graphics.pm line 3.
Compilation failed in require at chart.pl line 4.
BEGIN failed--compilation aborted at chart.pl line 4.
 

---------------------------------
Do you Yahoo!?
The New Yahoo! Shopping - with improved product search
From Richard.Adams at ed.ac.uk  Mon Oct  6 04:44:33 2003
From: Richard.Adams at ed.ac.uk (Richard Adams)
Date: Mon Oct  6 04:42:14 2003
Subject: [Bioperl-l] Bio::Graphics sort_order
Message-ID: <3F812B71.8AB6C28A@ed.ac.uk>

Hello,
    I'm trying to sort the display order  of features based on values
in  one of the features' tags using the syntax
    described in the Bio::Graphics::Panel :doc.
        i.e.,
sort_order =>    sub {(  ($a->each_tag_value('Conserved'))[0]
<=>($b->each_tag_value('Conserved'))[0]  )};
But I get the error message
"Can't call method "each_tag_value" on an undefined value at
Draw_Output.pm". The code doesn't seem to be
recognizing that $a and $b should refer to the features. I've tried
delving into the internal workings of Bio::Graphics::Glyph::Factory
with little enlightenment, would be grateful for any ideas.
Cheers
Richard


--
Dr Richard Adams
Bioinformatician,
Psychiatric Genetics Group,
Medical Genetics,
Molecular Medicine Centre,
Western General Hospital,
Crewe Rd West,
Edinburgh UK
EH4 2XU

Tel: 44 131 651 1084
richard.adams@ed.ac.uk


From lstein at cshl.edu  Mon Oct  6 08:13:14 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Mon Oct  6 08:28:44 2003
Subject: [Bioperl-l] Re: [apollo] Guidance for a newbie
In-Reply-To: <200310051244.40031.lstein@cshl.edu>
References: <F0568A55-F6A0-11D7-8649-000A95A001EE@oriongenomics.com>
	<200310051244.40031.lstein@cshl.edu>
Message-ID: <200310060813.14881.lstein@cshl.edu>

> This is the same problem that I have.  The approach that I am taking is to
> rewrite the GAME SequenceIO module in Bioperl so that it creates the
> correct version of GAME (the version it currently creates is incompatible
> with Bioperl). 

I meant to say that bioperl currently creates a version of game that is 
incompatible with apollo, and vice-versa.

Lincoln

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)
From lstein at cshl.edu  Mon Oct  6 10:04:11 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Mon Oct  6 10:02:28 2003
Subject: [Bioperl-l] problems running Bio::Graphics
In-Reply-To: <20031006020959.96255.qmail@web40502.mail.yahoo.com>
References: <20031006020959.96255.qmail@web40502.mail.yahoo.com>
Message-ID: <200310061004.11515.lstein@cshl.edu>

Hi Eileen,

The problem is with the GD library.  You appear to have a newer version of the 
GD .dll code library installed, but an older version of the GD.pm file.  I 
suggest that you go through your Perl library directory and remove everything 
that has a GD in its name.  Then use the ActiveState PPM manager to download 
and install a fresh copy of GD.

Lincoln


On Sunday 05 October 2003 10:09 pm, Eileen goh wrote:
> hi,
> i am trying to use the Bio::Graphics. i kept encountering the following
> error while running the Bio:: graphics.. wat should i do..
>
> GD object version 2.07 does not match $GD::VERSION 1.19 at
> C:/Perl/lib/DynaLoade r.pm line 249.
> Compilation failed in require at
> C:/Perl/site/lib/Bio/Graphics/Glyph/Factory.pm line 55.
> BEGIN failed--compilation aborted at
> C:/Perl/site/lib/Bio/Graphics/Glyph/Factory .pm line 55.
> Compilation failed in require at C:/Perl/site/lib/Bio/Graphics/Panel.pm
> line 4. BEGIN failed--compilation aborted at
> C:/Perl/site/lib/Bio/Graphics/Panel.pm line 4.
> Compilation failed in require at C:/Perl/site/lib/Bio/Graphics.pm line 3.
> BEGIN failed--compilation aborted at C:/Perl/site/lib/Bio/Graphics.pm line
> 3. Compilation failed in require at chart.pl line 4.
> BEGIN failed--compilation aborted at chart.pl line 4.
>
>
>
> ---------------------------------
> Do you Yahoo!?
> The New Yahoo! Shopping - with improved product search

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)
From charles at moulinette.dyndns.org  Mon Oct  6 10:05:04 2003
From: charles at moulinette.dyndns.org (Charles Plessy)
Date: Mon Oct  6 10:02:44 2003
Subject: [Bioperl-l] Bio::Biblio
Message-ID: <20031006140504.GA2631@plessy.org>

Hi,

	My goal is to write a quick-and-dirty medline2docbook fliter
for immediate use, using bioperl since it has already a library able
to query medline.

	However, I don't know how to quickly extract basic fields,
like author or journal, to use them in this kind of output system:

print << DOCBOOK
(...)
<biblioset>
   <authorgroup>
    <author>
     <surname>$object->Author_surname</surname>
     <firstname>$object->Author_firstname</firstname>
    </author>
(...)
DOCBOOK

Can anybody point me to the correct object and its construction method?

-- 
Charles
From jnolan at tulane.edu  Mon Oct  6 10:47:57 2003
From: jnolan at tulane.edu (Jim Nolan)
Date: Mon Oct  6 10:46:32 2003
Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
In-Reply-To: <Pine.LNX.4.50.0310040848560.3344-100000@tenero.duhs.duke.edu>
References: <GAEDKMGOKFBLJPKCLKCCIEMHDCAA.brian_osborne@cognia.com>
	<a05210602bba3ab395f97@[129.81.38.94]>
	<Pine.LNX.4.50.0310040848560.3344-100000@tenero.duhs.duke.edu>
Message-ID: <a05210603bba729e7103b@[129.81.38.94]>

All is right with the world. I rooted around and only found that 
SeqIO/staden/read had been installed, but not the rest of 
bioperl-ext, since I never passed all the tests. I commented the 
lines from SeqIO.pm which required staden/read and ran make test. 
Essentially all passed, with some minor errors.
PERL_DL_NONLAZY=1 /usr/bin/perl -Iblib/arch -Iblib/lib 
-I/System/Library/Perl/darwin -I/System/Library/Perl -e 'use 
Test::Harness qw(&runtests $verbose); $verbose=0; runtests @ARGV;' 
t/*.t
t/DB.........................ok
         1/78 skipped:
t/SeqIO......................ok
         3/184 skipped:
t/StandAloneBlast............ok 4/10Blast Database swissprot not 
found at t/StandAloneBlast.t line 72.
Blast databases(s) not found, skipping remaining  tests at 
t/StandAloneBlast.t line 76.
t/StandAloneBlast............ok
t/game.......................ok 20/23Use of uninitialized value in 
pattern match (m//) at /Library/Perl/XML/Writer.pm line 307.
Use of uninitialized value in print at 
/System/Library/Perl/darwin/IO/Handle.pm line 379.
t/game.......................ok
All tests successful, 4 subtests skipped.
Files=137, Tests=6319, 460 wallclock secs (93.89 cusr + 16.76 csys = 
110.65 CPU)

Thanks for your help. I would like to reinstate my whine about adding 
new dependencies, though ;-).

I would be happy to provide feedback on errors in installing 
bioperl-ext on OS X if needed.

Thanks again.
Jim

At 8:55 AM -0400 10/4/03, Jason Stajich wrote:
>I would try to get things back to a simple state.
>
>Are you using the system installed perl 5.6.0?
>
>Hopefully you've made your changes on your system by adding things to
>PERL5LIB so you can just reset it to a simple state.   If not, you'll be
>picking out files and dirs from /Library/Perl I guess
>
>Get rid of any of the bioperl-ext stuff from your paths, you don't need
>them.
>
>At a very minimum look at Bio::SeqIO.pm - you can comment out the
>following lines and see if your warnings go away.
>
>#sub BEGIN {
>#    eval { require Bio::SeqIO::staden::read; };
>#}
>
>We've managed to have so many problems with this - I would just suggest
>people comment it out from their local installs if they have problems
>until Aaron or someone can get detailed, reproduceable problem reports
>from folks.
>
>I would really try and whittle things down to a bare minimum and then
>try and run the tests.  You can always run a specific test by saying
>% make test_FootPrinter
>or
>% perl -I. -w t/FootPrinter.t
>
>to see more details and when the error is occuring.
>
>-jason
>
>On Fri, 3 Oct 2003, Jim Nolan wrote:
>
>>  I sent the full list of test errors to Jason, but I got 29 errors
>>  looking for Staden::read when I tested Bioperl 1.2.3.
>>  These were the tests that asked for it:
>>  BioDBGFF, FootPrin, Genewise, Genpred, Index, LocusLin, Mutator,
>>  Perl, RepeatMa, SeqAnaly, SeqBuild, SeqFeatC, SeqFeatu, SeqIO,
>>  Sequence, Similari, StandAlo, Swiss, Tools, Variatio, ePCR, game,
>>  largefas, multiple, phd, primaryq, qual, scf, splicedseq
>>
>>  I think I get even more for the CVS version.
>>  I guess it must be something specific to my machine, since nobody's
>>  heard of this one. When I saw the Staden::read errors, then I figured
>>  I had to have bioperl-ext.
>>  Sorry I whined, I thought that it was just a new dependency that
>>  wasn't as much of a hassle for most machines as it is for mine.
>>
>>  If anyone has any clues as to what I did wrong, I'd be glad to hear them.
>>
>>  Thanks
>>  Jim
>>
>>  >Jim,
>>  >
>>  >>  It would be great if there was a way to get rid of the dependency on
>>  >installing bioperl-ext to get bioperl to work.
>>  >
>>  >This is odd. I've done 'make install' many, many times but I've never
>>  >installed nor used the bioperl-ext library. Ever. Do you think you could
>>  >have done something unconventional to see the behavior you're seeing?
>  > >
>>  >Brian O.
>  > >
>  > >-----Original Message-----
>  > >From: bioperl-l-bounces@portal.open-bio.org
>  > >[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Jim Nolan
>  > >Sent: Friday, October 03, 2003 5:06 PM
>  > >To: bioperl-l@portal.open-bio.org
>>  >Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
>>  >
>>  >I have been unable to install bioperl versions later than 1.2.1 on
>>  >Mac OS X. The first error I get is during 'make test':
>>  >"The extension 'Bio::SeqIO::staden::read' is not properly 
>>installed in path:
>>  >    '/Library/Perl'
>>  >If this is a CPAN/distributed module, you may need to reinstall it on your
>>  >system.
>>  >To allow Inline to compile the module in a temporary cache, simply remove
>>  >the
>>  >Inline config option 'VERSION=' from the Bio::SeqIO::staden::read module.
>>  >   at t/FootPrinter.t line 0"
>>  >and numerous other tests.
>>  >
>>  >So I attempted to install bioperl-ext, which meant instaling the
>>  >Staden io_lib, though I don't use Staden. This had the difficulty of
>>  >errors caused by the locations of os.h and config.h
>>  >(http://portal.open-bio.org/mailman/htdig/bioperl-l/2003-June/012472.html).
>>  >I proceeded with the bioperl-ext install, using the directions at
>>  >http://www.tc.umn.edu/~cann0010/Bioperl_OSX_install.html#bioperlxs
>>  >(which worked great until recently) I did the -fPIC edit, I did the
>>  >ranlib, but on make test for bioper-ext, I get:
>>  >/usr/bin/perl -I/System/Library/Perl/darwin -I/System/Library/Perl
>>  >/System/Library/Perl/ExtUtils/xsubpp  -typemap
>>  >/System/Library/Perl/ExtUtils/typemap  read.xs > read.xsc && mv
>>  >read.xsc read.c
>>  >cc -c -I/Users/jnolan/Desktop/download/bioperl-ext/Bio/SeqIO/staden
>>  >-I/usr/local/include/io_lib -g -pipe -pipe -fno-common
>>  >-no-cpp-precomp -flat_namespace -DHAS_TELLDIR_PROTOTYPE
>>  >-fno-strict-aliasing -Os     -DVERSION=\"0.01\" -DXS_VERSION=\"0.01\"
>>  >-I/System/Library/Perl/darwin/CORE  read.c
>>  >In file included from /usr/local/include/io_lib/os.h:4,
>>  >                   from /usr/local/include/io_lib/Read.h:43,
>>  >                   from read.xs:5:
>>  >/usr/local/include/io_lib/config.h:45:1: warning: "VERSION" redefined
>>  >read.c:1:1: warning: this is the location of the previous definition
>>  >In file included from /usr/local/include/io_lib/translate.h:20,
>>  >                   from /usr/local/include/io_lib/Read.h:227,
>>  >                   from read.xs:5:
>>  >/usr/local/include/io_lib/expFileIO.h:42: parse error before "Exp_entries"
>>  >/usr/local/include/io_lib/expFileIO.h:42: warning: data definition
>>  >has no type o"
>>  >and lots of similar warnings.
>>  >
>>  >So it seems like there is still a problem with these libraries,
>>  >though I copied them where the compiler seems to be looking for them.
>>  >
>>  >Can someone point me in the right direction?
>>  >
>>  >
>>  >It would be great if there was a way to get rid of the dependency on
>>  >installing bioperl-ext to get bioperl to work. Even if the solution
>>  >for my particular problem is simple, the need to install additional
>>  >layers of dependencies takes a lot of the fun out of it for us lame
>>  >biological types ;-)
>>  >.
>>  >
>>  >Thanks.
>>  >Jim
>>  >--
>>  >---------------------------------------------------------
>>  >Jim Nolan                       email:JNolan@tulane.edu
>>  >Department of Biochemistry SL43
>>  >Tulane University Health Sciences Center
>>  >1430 Tulane Ave.                Phone: (504)-584-2453
>>  >New Orleans, LA 70112-2699      FAX:   (504)-584-2739
>>  >http://www.tulane.edu/~biochem/faculty/nolan.htm
>>  >http://phage.bioc.tulane.edu
>>  >The requirements said: Windows 2000 or better.
>>  >So I got a Macintosh.
>>  >_______________________________________________
>>  >Bioperl-l mailing list
>>  >Bioperl-l@portal.open-bio.org
>>  >http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>
>>
>>
>
>--
>Jason Stajich
>Duke University
>jason at cgt.mc.duke.edu


-- 
---------------------------------------------------------
Jim Nolan			email:JNolan@tulane.edu
Department of Biochemistry SL43
Tulane University Health Sciences Center
1430 Tulane Ave.		Phone: (504)-584-2453
New Orleans, LA 70112-2699	FAX:   (504)-584-2739
http://www.tulane.edu/~biochem/faculty/nolan.htm
http://phage.bioc.tulane.edu
The requirements said: Windows 2000 or better.
So I got a Macintosh.
From jason at cgt.duhs.duke.edu  Mon Oct  6 10:52:28 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Mon Oct  6 10:50:43 2003
Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
In-Reply-To: <a05210603bba729e7103b@[129.81.38.94]>
References: <GAEDKMGOKFBLJPKCLKCCIEMHDCAA.brian_osborne@cognia.com>
	<a05210602bba3ab395f97@[129.81.38.94]>
	<Pine.LNX.4.50.0310040848560.3344-100000@tenero.duhs.duke.edu>
	<a05210603bba729e7103b@[129.81.38.94]>
Message-ID: <Pine.LNX.4.50.0310061050480.27277-100000@tenero.duhs.duke.edu>

> Thanks for your help. I would like to reinstate my whine about adding
> new dependencies, though ;-).

Noted - it is always a trade-off, feel free to pipe up when we debate
on-list about whether or not add another dependancy.

>
> I would be happy to provide feedback on errors in installing
> bioperl-ext on OS X if needed.

When I get together with Aaron at the end of the month I will try and go
through it with him so I understand what is supposed to happen and/or how
we might reorg bioperl-ext to be less problematic.


>
> Thanks again.
> Jim
>
> At 8:55 AM -0400 10/4/03, Jason Stajich wrote:
> >I would try to get things back to a simple state.
> >
> >Are you using the system installed perl 5.6.0?
> >
> >Hopefully you've made your changes on your system by adding things to
> >PERL5LIB so you can just reset it to a simple state.   If not, you'll be
> >picking out files and dirs from /Library/Perl I guess
> >
> >Get rid of any of the bioperl-ext stuff from your paths, you don't need
> >them.
> >
> >At a very minimum look at Bio::SeqIO.pm - you can comment out the
> >following lines and see if your warnings go away.
> >
> >#sub BEGIN {
> >#    eval { require Bio::SeqIO::staden::read; };
> >#}
> >
> >We've managed to have so many problems with this - I would just suggest
> >people comment it out from their local installs if they have problems
> >until Aaron or someone can get detailed, reproduceable problem reports
> >from folks.
> >
> >I would really try and whittle things down to a bare minimum and then
> >try and run the tests.  You can always run a specific test by saying
> >% make test_FootPrinter
> >or
> >% perl -I. -w t/FootPrinter.t
> >
> >to see more details and when the error is occuring.
> >
> >-jason
> >
> >On Fri, 3 Oct 2003, Jim Nolan wrote:
> >
> >>  I sent the full list of test errors to Jason, but I got 29 errors
> >>  looking for Staden::read when I tested Bioperl 1.2.3.
> >>  These were the tests that asked for it:
> >>  BioDBGFF, FootPrin, Genewise, Genpred, Index, LocusLin, Mutator,
> >>  Perl, RepeatMa, SeqAnaly, SeqBuild, SeqFeatC, SeqFeatu, SeqIO,
> >>  Sequence, Similari, StandAlo, Swiss, Tools, Variatio, ePCR, game,
> >>  largefas, multiple, phd, primaryq, qual, scf, splicedseq
> >>
> >>  I think I get even more for the CVS version.
> >>  I guess it must be something specific to my machine, since nobody's
> >>  heard of this one. When I saw the Staden::read errors, then I figured
> >>  I had to have bioperl-ext.
> >>  Sorry I whined, I thought that it was just a new dependency that
> >>  wasn't as much of a hassle for most machines as it is for mine.
> >>
> >>  If anyone has any clues as to what I did wrong, I'd be glad to hear them.
> >>
> >>  Thanks
> >>  Jim
> >>
> >>  >Jim,
> >>  >
> >>  >>  It would be great if there was a way to get rid of the dependency on
> >>  >installing bioperl-ext to get bioperl to work.
> >>  >
> >>  >This is odd. I've done 'make install' many, many times but I've never
> >>  >installed nor used the bioperl-ext library. Ever. Do you think you could
> >>  >have done something unconventional to see the behavior you're seeing?
> >  > >
> >>  >Brian O.
> >  > >
> >  > >-----Original Message-----
> >  > >From: bioperl-l-bounces@portal.open-bio.org
> >  > >[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Jim Nolan
> >  > >Sent: Friday, October 03, 2003 5:06 PM
> >  > >To: bioperl-l@portal.open-bio.org
> >>  >Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
> >>  >
> >>  >I have been unable to install bioperl versions later than 1.2.1 on
> >>  >Mac OS X. The first error I get is during 'make test':
> >>  >"The extension 'Bio::SeqIO::staden::read' is not properly
> >>installed in path:
> >>  >    '/Library/Perl'
> >>  >If this is a CPAN/distributed module, you may need to reinstall it on your
> >>  >system.
> >>  >To allow Inline to compile the module in a temporary cache, simply remove
> >>  >the
> >>  >Inline config option 'VERSION=' from the Bio::SeqIO::staden::read module.
> >>  >   at t/FootPrinter.t line 0"
> >>  >and numerous other tests.
> >>  >
> >>  >So I attempted to install bioperl-ext, which meant instaling the
> >>  >Staden io_lib, though I don't use Staden. This had the difficulty of
> >>  >errors caused by the locations of os.h and config.h
> >>  >(http://portal.open-bio.org/mailman/htdig/bioperl-l/2003-June/012472.html).
> >>  >I proceeded with the bioperl-ext install, using the directions at
> >>  >http://www.tc.umn.edu/~cann0010/Bioperl_OSX_install.html#bioperlxs
> >>  >(which worked great until recently) I did the -fPIC edit, I did the
> >>  >ranlib, but on make test for bioper-ext, I get:
> >>  >/usr/bin/perl -I/System/Library/Perl/darwin -I/System/Library/Perl
> >>  >/System/Library/Perl/ExtUtils/xsubpp  -typemap
> >>  >/System/Library/Perl/ExtUtils/typemap  read.xs > read.xsc && mv
> >>  >read.xsc read.c
> >>  >cc -c -I/Users/jnolan/Desktop/download/bioperl-ext/Bio/SeqIO/staden
> >>  >-I/usr/local/include/io_lib -g -pipe -pipe -fno-common
> >>  >-no-cpp-precomp -flat_namespace -DHAS_TELLDIR_PROTOTYPE
> >>  >-fno-strict-aliasing -Os     -DVERSION=\"0.01\" -DXS_VERSION=\"0.01\"
> >>  >-I/System/Library/Perl/darwin/CORE  read.c
> >>  >In file included from /usr/local/include/io_lib/os.h:4,
> >>  >                   from /usr/local/include/io_lib/Read.h:43,
> >>  >                   from read.xs:5:
> >>  >/usr/local/include/io_lib/config.h:45:1: warning: "VERSION" redefined
> >>  >read.c:1:1: warning: this is the location of the previous definition
> >>  >In file included from /usr/local/include/io_lib/translate.h:20,
> >>  >                   from /usr/local/include/io_lib/Read.h:227,
> >>  >                   from read.xs:5:
> >>  >/usr/local/include/io_lib/expFileIO.h:42: parse error before "Exp_entries"
> >>  >/usr/local/include/io_lib/expFileIO.h:42: warning: data definition
> >>  >has no type o"
> >>  >and lots of similar warnings.
> >>  >
> >>  >So it seems like there is still a problem with these libraries,
> >>  >though I copied them where the compiler seems to be looking for them.
> >>  >
> >>  >Can someone point me in the right direction?
> >>  >
> >>  >
> >>  >It would be great if there was a way to get rid of the dependency on
> >>  >installing bioperl-ext to get bioperl to work. Even if the solution
> >>  >for my particular problem is simple, the need to install additional
> >>  >layers of dependencies takes a lot of the fun out of it for us lame
> >>  >biological types ;-)
> >>  >.
> >>  >
> >>  >Thanks.
> >>  >Jim
> >>  >--
> >>  >---------------------------------------------------------
> >>  >Jim Nolan                       email:JNolan@tulane.edu
> >>  >Department of Biochemistry SL43
> >>  >Tulane University Health Sciences Center
> >>  >1430 Tulane Ave.                Phone: (504)-584-2453
> >>  >New Orleans, LA 70112-2699      FAX:   (504)-584-2739
> >>  >http://www.tulane.edu/~biochem/faculty/nolan.htm
> >>  >http://phage.bioc.tulane.edu
> >>  >The requirements said: Windows 2000 or better.
> >>  >So I got a Macintosh.
> >>  >_______________________________________________
> >>  >Bioperl-l mailing list
> >>  >Bioperl-l@portal.open-bio.org
> >>  >http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >>
> >>
> >>
> >
> >--
> >Jason Stajich
> >Duke University
> >jason at cgt.mc.duke.edu
>
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From lstein at cshl.edu  Mon Oct  6 11:19:12 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Mon Oct  6 12:03:24 2003
Subject: [Bioperl-l] Bio::Graphics sort_order
In-Reply-To: <3F812B71.8AB6C28A@ed.ac.uk>
References: <3F812B71.8AB6C28A@ed.ac.uk>
Message-ID: <200310061119.12141.lstein@cshl.edu>

Hi Richard,

This has now been fixed in both the live and 1.2 branches.  You will have to 
get them via CVS.

As it happens, passing an anonymous subroutine to -sort_order could never work 
in the way that was described in the documentation because the automagic 
variables $a and $b are package variables.  To illustrate, consider:

	package main;
	my $sort_sub = sub {warn "comparing $a to $b\n"; $a cmp $b};
	my @a           = sort $sort_sub qw(one two three four five);
	@a                 = Foo::package_sort($sort_sub);

	package Foo;
	sub package_sort {
		my $sort_sub = shift;
		return sort $sort_sub qw(one two three four five);
	}

The first invocation of sort will work because the anonymous sub and the call 
to sort are both in package main.  The second try will not work because the 
anonymous sub will look in package main for $a and $b, but they are defined 
in package Foo.

To work around this in Bio::Graphics, you must create an anonymous subroutine 
that has the $$ prototype:

	-sort_order => sub ($$) {
		my ($a,$b) = @_; 
		$a->feature->display_name cmp $b->feature->display_name; 
	}

When perl sees a sort sub with the $$ prototype, it passes $a and $b on the 
stack rather than as package variables.  Another issue is that the original 
documentation is wrong about what appears in $a and $b.  They are glyph 
objects, not feature objects.  To get at the feature, you must call their 
feature() method as shown above.

The documentation has now been fixed to explain this behavior correctly.

Lincoln


On Monday 06 October 2003 04:44 am, Richard Adams wrote:
> Hello,
>     I'm trying to sort the display order  of features based on values
> in  one of the features' tags using the syntax
>     described in the Bio::Graphics::Panel :doc.
>         i.e.,
> sort_order =>    sub {(  ($a->each_tag_value('Conserved'))[0]
> <=>($b->each_tag_value('Conserved'))[0]  )};
> But I get the error message
> "Can't call method "each_tag_value" on an undefined value at
> Draw_Output.pm". The code doesn't seem to be
> recognizing that $a and $b should refer to the features. I've tried
> delving into the internal workings of Bio::Graphics::Glyph::Factory
> with little enlightenment, would be grateful for any ideas.
> Cheers
> Richard
>
>
> --
> Dr Richard Adams
> Bioinformatician,
> Psychiatric Genetics Group,
> Medical Genetics,
> Molecular Medicine Centre,
> Western General Hospital,
> Crewe Rd West,
> Edinburgh UK
> EH4 2XU
>
> Tel: 44 131 651 1084
> richard.adams@ed.ac.uk
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)
From senger at ebi.ac.uk  Mon Oct  6 12:21:24 2003
From: senger at ebi.ac.uk (Martin Senger)
Date: Mon Oct  6 12:20:07 2003
Subject: [Bioperl-l] Re: Bio::Biblio
Message-ID: <Pine.OSF.4.10.10310061705550.795273-100000@ice.ebi.ac.uk>

Hi Charles,
   I will reply to you (and to the list) with a detailed script that can
do what you asked for (hopefully) in two days when I am back in my office.
Sorry for the delay.

   Martin

-- 
Martin Senger

EMBL Outstation - Hinxton                Senger@EBI.ac.uk     
European Bioinformatics Institute        Phone: (+44) 1223 494636      
Wellcome Trust Genome Campus             (Switchboard:     494444)
Hinxton                                  Fax  : (+44) 1223 494468
Cambridge CB10 1SD
United Kingdom                           http://industry.ebi.ac.uk/~senger


From ypeng at sfu.ca  Mon Oct  6 13:24:30 2003
From: ypeng at sfu.ca (ypeng@sfu.ca)
Date: Mon Oct  6 13:22:15 2003
Subject: [Bioperl-l] Bio::DB::Query::GenBank - how to specify Keyword
Message-ID: <200310061724.h96HOU1n014721@rm-rstar.sfu.ca>

Hi,

I wish to use this module to get the GI number (or accession number) of
somes genes in human, given the species and protein name (or gene name). But
if I use the protein name as the keyword and species as Organism, "phrase
not found" was returned. I was wondering if I can achieve this, and how to
specify the keyword for this module?

Many thanks in advance.


Fred Peng
----------------------------------------------
Department of Molecular Biology & Biochemsitry
Simon Fraser University
8888 University Drive
Burnaby, British Columbia
Canada V5A 1S6
From difazios at ornl.gov  Mon Oct  6 13:50:34 2003
From: difazios at ornl.gov (Stephen DiFazio)
Date: Mon Oct  6 13:48:21 2003
Subject: [Bioperl-l] GD installation problems for Solaris 9
Message-ID: <5.1.1.5.2.20031006134505.01510d38@email.cind.ornl.gov>

Hello,

I'd like to use Bio::Graphics but can't get GD.pm to work under Solaris 9 
on a Sun Blade 2000.  I'm attempting GD 2.07, though I've tried various 
earlier versions as well with no luck.  I've installed the necessary 
prerequisites:

zlib-1.1.3
libgd-2.0.15
libpng-1.2.5
jpeg-6b

Makefile.PL throws the following warnings for GD.pm (only):

WARNING: CAPI is not a known parameter.
Warning: prerequisite Math::Trig 1 not found.

In fact, Math::Trig 1.01 is installed and functioning on my system.

'make install' gives no errors.

Here's the output I get from 'make test':

PERL_DL_NONLAZY=1 /usr/bin/perl "-MExtUtils::Command::MM" "-e" 
"test_harness(0, 'blib/lib', 'blib/arch')" t/*.t
t/GD..........Can't load './blib/arch/auto/GD/GD.so' for module GD: 
ld.so.1: /usr/bin/perl: fatal: relocation error: file 
./blib/arch/auto/GD/GD.so: symbol gdImageCreateFromPng: referenced symbol 
not found at /usr/local/lib/perl5/5.8.0/sun4-solaris/DynaLoader.pm line 229.
  at t/GD.t line 13
Compilation failed in require at t/GD.t line 13.
BEGIN failed--compilation aborted at t/GD.t line 13.
t/GD..........dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 1-10
         Failed 10/10 tests, 0.00% okay
t/Polyline....Can't load '/usr/local/GD-2.07/blib/arch/auto/GD/GD.so' for 
module GD: ld.so.1: /usr/bin/perl: fatal: relocation error: file 
/usr/local/GD-2.07/blib/arch/auto/GD/GD.so: symbol gdImageCreateFromPng: 
referenced symbol not found at 
/usr/local/lib/perl5/5.8.0/sun4-solaris/DynaLoader.pm line 229.
  at /usr/local/GD-2.07/blib/lib/GD/Polyline.pm line 49
Compilation failed in require at /usr/local/GD-2.07/blib/lib/GD/Polyline.pm 
line 49.
BEGIN failed--compilation aborted at 
/usr/local/GD-2.07/blib/lib/GD/Polyline.pm line 49.
Compilation failed in require at t/Polyline.t line 10.
BEGIN failed--compilation aborted at t/Polyline.t line 10.
t/Polyline....dubious
         Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED test 1
         Failed 1/1 tests, 0.00% okay
Failed Test  Stat Wstat Total Fail  Failed  List of Failed
-------------------------------------------------------------------------------
t/GD.t        255 65280    10   10 100.00%  1-10
t/Polyline.t  255 65280     1    1 100.00%  1
Failed 2/2 test scripts, 0.00% okay. 11/11 subtests failed, 0.00% okay.
make: *** [test_dynamic] Error 2

thanks in advance for any suggestions you may have.

Steve

Stephen P. DiFazio, Ph.D

Staff Scientist
Oak Ridge National Laboratory
Bldg. 1062-206
P.O. Box 2008
Oak Ridge, TN 37831-6422
(865) 574-7844; fax:(865) 576-8143
difazios@ornl.gov
http://www.esd.ornl.gov/PGG/home.html
Stephen P. DiFazio, Ph.D
Staff Scientist
Oak Ridge National Laboratory
Bldg. 1062-206
P.O. Box 2008
Oak Ridge, TN 37831-6422
(865) 574-7844; fax:(865) 576-8143
difazios@ornl.gov
http://www.esd.ornl.gov/PGG/home.html

From fsanchez at cifn.unam.mx  Mon Oct  6 14:30:17 2003
From: fsanchez at cifn.unam.mx (=?ISO-8859-1?Q?Fabiola_S=E1nchez?=)
Date: Mon Oct  6 14:18:17 2003
Subject: [Bioperl-l] How can i get lines without tag, next to organism
Message-ID: <3F81B4B9.3030909@cifn.unam.mx>

Hello!
I'm reading  files in genbank format
i want to parser the lines below  or next to organism, but these don't 
have tag
how can i get this
i can get organism but  i can't get the lines next to organism
for example:

ORGANISM  Pseudospirillum japonicum
            Bacteria; Proteobacteria; Gammaproteobacteria; 
Oceanospirillales;
            Pseudospirillum.
I need to get: 
            Bacteria; Proteobacteria; Gammaproteobacteria; 
Oceanospirillales;
            Pseudospirillum.

Thank you.

Fabi


LOCUS       AB006766                1459 bp    DNA     linear   BCT 
13-FEB-1999
DEFINITION  Oceanospirillum japonicum gene for 16S rRNA, partial sequence,
            strain:IFO19191.
ACCESSION   AB006766
VERSION     AB006766.1  GI:4049363
KEYWORDS    16S ribosomal RNA.
SOURCE      Pseudospirillum japonicum
  ORGANISM  Pseudospirillum japonicum
            Bacteria; Proteobacteria; Gammaproteobacteria; 
Oceanospirillales;
            Pseudospirillum.
REFERENCE   1  (sites)
  AUTHORS   Satomi,M., Kimura,B., Hayashi,M., Shouzen,Y., Okuzumi,M. and
            Fujii,T.
  TITLE     Marinospirillum gen. nov., with descriptions of Marinospirillum
            megaterium sp. nov., isolated from kusaya gravy, and transfer of
            Oceanospirillum minutulum to Marinospirillum minutulum comb. nov
  JOURNAL   Int. J. Syst. Bacteriol. 48 Pt 4, 1341-1348 (1998)
  MEDLINE   99045875
   PUBMED   9828435
REFERENCE   2  (bases 1 to 1459)
  AUTHORS   Satomi,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1997) Masataka Satomi, National Reserch 
Institute
            of Fisheries Science, Food Processing and Preservation Division;
            Fukuura 2-12-4, Kanazawa-ku,, Yokohama, Kanagawa 236, Japan
            (E-mail:msatomi@nrifs.affrc.go.jp, Tel:81-45-788-7670,
            Fax:81-45-788-7670)
FEATURES             Location/Qualifiers
     source          1..1459
                     /organism="Pseudospirillum japonicum"
                     /mol_type="genomic DNA"
                     /strain="ATCC19191"
                     /db_xref="taxon:64971"
     rRNA            <1..>1459
                     /product="16S ribosomal RNA"
BASE COUNT      368 a    326 c    462 g    302 t      1 others
ORIGIN     
        1 attgaacgct ggcggcaggc ctaacacatg caagtcgagc ggcagcgggg agtagcttgc
       61 tactttgccg gcgagcggcg gacgggtgag taacgcatag gaatctgccc agtagagggg
      121 gatagccagg ggaaactctg attaataccg catacgccct acgggggaaa aggggctttt
      181 agctcctgct attggatgag cctatgtcgg attagctagt tggtagggta aaggcctacc
      241 aaggcgacga tccgtagctg ttctgagagg atgatcagcc acactgggac tgagacacgg
      301 cccagactcc tacgggaggc agcagtgggg aatattgcac aatgggggga accctgatgc
      361 acccatcccg cgtgtgtgaa gaaggccttc gggttgtaaa gcactttcag caaggaggaa
      421 ggccgtatgc ttaataggca tgcggattga cgttacttgc agaagaagca ccggctaact
      481 ccgtgccagc agccgcggta atacggaggg tgcgagcgtt aatcggaatt actgggcgta
      541 aagcgcgcgt aggcggatag gtcagtcaga tgtgaaagcc ctgggctcaa cctaggacgt
      601 gcacctgata ctgcttatct agagtaaggt agagggtagt agaatttcct gtgtagcggt
      661 gaaatgcgta gatataggaa ggaataccgg tggcgaaggc ggctacctgg actattactg
      721 acgctgaggt gcgaaagcgt ggggatcaaa caggattaga taccctggta gtccacgctg
      781 taaacgatgt cgactagccg ttgccgacct tgagttggga gtggcgcagc taacgcgata
      841 agtcgaccgc ctggggagta cggccgcaag gttaaaactc aaatgaattg acgggggccc
      901 gcacaagcgg tggagcatgt ggtttaattc gatgcaacgc gaagaacctt acctactctt
      961 gacatccaga gaactttcca gagatggata ggtgccttcg ggaactctga gacaggtgct
     1021 gcatggctgt cgtcagctcg tgttgtgaaa tgttgggtta agtcccgtaa cgagcgcaac
     1081 ccttatcctt atttgccagc gggttatgcc gggaacttta aggaaactgc cggtgacaaa
     1141 ccggaggaag gtggggacga cgtcaagtca tcatggccct tacgagtagg gctacacacg
     1201 tgctacaatg gtaagtacag agggttgcaa gaccgcgagg tggagctaat ctcagaaaac
     1261 ttatcgtagt ccggattgga gtctgcaact cgactccatg aagtcggaat cgctagtaat
     1321 cgcgaatcag aatgtcgcgg tgaatacgtt cccgggcctt gtacacaccg cccgtcacac
     1381 catgggagtg gacttcacca gaagtagtta gtctaaccgn aagggggacg attaccacgg
     1441 tggggttcat gactggggt
//

From jason at cgt.duhs.duke.edu  Mon Oct  6 14:42:41 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Mon Oct  6 14:41:13 2003
Subject: [Bioperl-l] How can i get lines without tag, next to organism
In-Reply-To: <3F81B4B9.3030909@cifn.unam.mx>
References: <3F81B4B9.3030909@cifn.unam.mx>
Message-ID: <Pine.LNX.4.50.0310061441320.27277-100000@tenero.duhs.duke.edu>

See docs for Bio::Species
 my $species = $seq->species;
 my @classification = $species->classification;

-jason
On Mon, 6 Oct 2003, [ISO-8859-1] Fabiola S?nchez wrote:

> Hello!
> I'm reading  files in genbank format
> i want to parser the lines below  or next to organism, but these don't
> have tag
> how can i get this
> i can get organism but  i can't get the lines next to organism
> for example:
>
> ORGANISM  Pseudospirillum japonicum
>             Bacteria; Proteobacteria; Gammaproteobacteria;
> Oceanospirillales;
>             Pseudospirillum.
> I need to get:
>             Bacteria; Proteobacteria; Gammaproteobacteria;
> Oceanospirillales;
>             Pseudospirillum.
>
> Thank you.
>
> Fabi
>
>
>
> LOCUS       AB006766                1459 bp    DNA     linear   BCT
> 13-FEB-1999
> DEFINITION  Oceanospirillum japonicum gene for 16S rRNA, partial sequence,
>             strain:IFO19191.
> ACCESSION   AB006766
> VERSION     AB006766.1  GI:4049363
> KEYWORDS    16S ribosomal RNA.
> SOURCE      Pseudospirillum japonicum
>   ORGANISM  Pseudospirillum japonicum
>             Bacteria; Proteobacteria; Gammaproteobacteria;
> Oceanospirillales;
>             Pseudospirillum.
> REFERENCE   1  (sites)
>   AUTHORS   Satomi,M., Kimura,B., Hayashi,M., Shouzen,Y., Okuzumi,M. and
>             Fujii,T.
>   TITLE     Marinospirillum gen. nov., with descriptions of Marinospirillum
>             megaterium sp. nov., isolated from kusaya gravy, and transfer of
>             Oceanospirillum minutulum to Marinospirillum minutulum comb. nov
>   JOURNAL   Int. J. Syst. Bacteriol. 48 Pt 4, 1341-1348 (1998)
>   MEDLINE   99045875
>    PUBMED   9828435
> REFERENCE   2  (bases 1 to 1459)
>   AUTHORS   Satomi,M.
>   TITLE     Direct Submission
>   JOURNAL   Submitted (20-AUG-1997) Masataka Satomi, National Reserch
> Institute
>             of Fisheries Science, Food Processing and Preservation Division;
>             Fukuura 2-12-4, Kanazawa-ku,, Yokohama, Kanagawa 236, Japan
>             (E-mail:msatomi@nrifs.affrc.go.jp, Tel:81-45-788-7670,
>             Fax:81-45-788-7670)
> FEATURES             Location/Qualifiers
>      source          1..1459
>                      /organism="Pseudospirillum japonicum"
>                      /mol_type="genomic DNA"
>                      /strain="ATCC19191"
>                      /db_xref="taxon:64971"
>      rRNA            <1..>1459
>                      /product="16S ribosomal RNA"
> BASE COUNT      368 a    326 c    462 g    302 t      1 others
> ORIGIN
>         1 attgaacgct ggcggcaggc ctaacacatg caagtcgagc ggcagcgggg agtagcttgc
>        61 tactttgccg gcgagcggcg gacgggtgag taacgcatag gaatctgccc agtagagggg
>       121 gatagccagg ggaaactctg attaataccg catacgccct acgggggaaa aggggctttt
>       181 agctcctgct attggatgag cctatgtcgg attagctagt tggtagggta aaggcctacc
>       241 aaggcgacga tccgtagctg ttctgagagg atgatcagcc acactgggac tgagacacgg
>       301 cccagactcc tacgggaggc agcagtgggg aatattgcac aatgggggga accctgatgc
>       361 acccatcccg cgtgtgtgaa gaaggccttc gggttgtaaa gcactttcag caaggaggaa
>       421 ggccgtatgc ttaataggca tgcggattga cgttacttgc agaagaagca ccggctaact
>       481 ccgtgccagc agccgcggta atacggaggg tgcgagcgtt aatcggaatt actgggcgta
>       541 aagcgcgcgt aggcggatag gtcagtcaga tgtgaaagcc ctgggctcaa cctaggacgt
>       601 gcacctgata ctgcttatct agagtaaggt agagggtagt agaatttcct gtgtagcggt
>       661 gaaatgcgta gatataggaa ggaataccgg tggcgaaggc ggctacctgg actattactg
>       721 acgctgaggt gcgaaagcgt ggggatcaaa caggattaga taccctggta gtccacgctg
>       781 taaacgatgt cgactagccg ttgccgacct tgagttggga gtggcgcagc taacgcgata
>       841 agtcgaccgc ctggggagta cggccgcaag gttaaaactc aaatgaattg acgggggccc
>       901 gcacaagcgg tggagcatgt ggtttaattc gatgcaacgc gaagaacctt acctactctt
>       961 gacatccaga gaactttcca gagatggata ggtgccttcg ggaactctga gacaggtgct
>      1021 gcatggctgt cgtcagctcg tgttgtgaaa tgttgggtta agtcccgtaa cgagcgcaac
>      1081 ccttatcctt atttgccagc gggttatgcc gggaacttta aggaaactgc cggtgacaaa
>      1141 ccggaggaag gtggggacga cgtcaagtca tcatggccct tacgagtagg gctacacacg
>      1201 tgctacaatg gtaagtacag agggttgcaa gaccgcgagg tggagctaat ctcagaaaac
>      1261 ttatcgtagt ccggattgga gtctgcaact cgactccatg aagtcggaat cgctagtaat
>      1321 cgcgaatcag aatgtcgcgg tgaatacgtt cccgggcctt gtacacaccg cccgtcacac
>      1381 catgggagtg gacttcacca gaagtagtta gtctaaccgn aagggggacg attaccacgg
>      1441 tggggttcat gactggggt
> //
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu

From heikki at nildram.co.uk  Mon Oct  6 14:47:57 2003
From: heikki at nildram.co.uk (Heikki Lehvaslaiho)
Date: Mon Oct  6 14:49:03 2003
Subject: [Bioperl-l] How can i get lines without tag, next to organism
In-Reply-To: <3F81B4B9.3030909@cifn.unam.mx>
References: <3F81B4B9.3030909@cifn.unam.mx>
Message-ID: <1065466076.2475.8.camel@localhost>

Fabiola,

Organism information is put into a Bio::Species object which can be
accessed by $seq->species().

$seq->species->classification() gives you the whole parsed taxomomy
hierarchy from species upwards in an array.

	-Heikki


On Mon, 2003-10-06 at 19:30, Fabiola S?nchez wrote:
> Hello!
> I'm reading  files in genbank format
> i want to parser the lines below  or next to organism, but these don't 
> have tag
> how can i get this
> i can get organism but  i can't get the lines next to organism
> for example:
> 
> ORGANISM  Pseudospirillum japonicum
>             Bacteria; Proteobacteria; Gammaproteobacteria; 
> Oceanospirillales;
>             Pseudospirillum.
> I need to get: 
>             Bacteria; Proteobacteria; Gammaproteobacteria; 
> Oceanospirillales;
>             Pseudospirillum.
> 
> Thank you.
> 
> Fabi
> 
> 
> 
> LOCUS       AB006766                1459 bp    DNA     linear   BCT 
> 13-FEB-1999
> DEFINITION  Oceanospirillum japonicum gene for 16S rRNA, partial sequence,
>             strain:IFO19191.
> ACCESSION   AB006766
> VERSION     AB006766.1  GI:4049363
> KEYWORDS    16S ribosomal RNA.
> SOURCE      Pseudospirillum japonicum
>   ORGANISM  Pseudospirillum japonicum
>             Bacteria; Proteobacteria; Gammaproteobacteria; 
> Oceanospirillales;
>             Pseudospirillum.
> REFERENCE   1  (sites)
>   AUTHORS   Satomi,M., Kimura,B., Hayashi,M., Shouzen,Y., Okuzumi,M. and
>             Fujii,T.
>   TITLE     Marinospirillum gen. nov., with descriptions of Marinospirillum
>             megaterium sp. nov., isolated from kusaya gravy, and transfer of
>             Oceanospirillum minutulum to Marinospirillum minutulum comb. nov
>   JOURNAL   Int. J. Syst. Bacteriol. 48 Pt 4, 1341-1348 (1998)
>   MEDLINE   99045875
>    PUBMED   9828435
> REFERENCE   2  (bases 1 to 1459)
>   AUTHORS   Satomi,M.
>   TITLE     Direct Submission
>   JOURNAL   Submitted (20-AUG-1997) Masataka Satomi, National Reserch 
> Institute
>             of Fisheries Science, Food Processing and Preservation Division;
>             Fukuura 2-12-4, Kanazawa-ku,, Yokohama, Kanagawa 236, Japan
>             (E-mail:msatomi@nrifs.affrc.go.jp, Tel:81-45-788-7670,
>             Fax:81-45-788-7670)
> FEATURES             Location/Qualifiers
>      source          1..1459
>                      /organism="Pseudospirillum japonicum"
>                      /mol_type="genomic DNA"
>                      /strain="ATCC19191"
>                      /db_xref="taxon:64971"
>      rRNA            <1..>1459
>                      /product="16S ribosomal RNA"
> BASE COUNT      368 a    326 c    462 g    302 t      1 others
> ORIGIN     
>         1 attgaacgct ggcggcaggc ctaacacatg caagtcgagc ggcagcgggg agtagcttgc
>        61 tactttgccg gcgagcggcg gacgggtgag taacgcatag gaatctgccc agtagagggg
>       121 gatagccagg ggaaactctg attaataccg catacgccct acgggggaaa aggggctttt
>       181 agctcctgct attggatgag cctatgtcgg attagctagt tggtagggta aaggcctacc
>       241 aaggcgacga tccgtagctg ttctgagagg atgatcagcc acactgggac tgagacacgg
>       301 cccagactcc tacgggaggc agcagtgggg aatattgcac aatgggggga accctgatgc
>       361 acccatcccg cgtgtgtgaa gaaggccttc gggttgtaaa gcactttcag caaggaggaa
>       421 ggccgtatgc ttaataggca tgcggattga cgttacttgc agaagaagca ccggctaact
>       481 ccgtgccagc agccgcggta atacggaggg tgcgagcgtt aatcggaatt actgggcgta
>       541 aagcgcgcgt aggcggatag gtcagtcaga tgtgaaagcc ctgggctcaa cctaggacgt
>       601 gcacctgata ctgcttatct agagtaaggt agagggtagt agaatttcct gtgtagcggt
>       661 gaaatgcgta gatataggaa ggaataccgg tggcgaaggc ggctacctgg actattactg
>       721 acgctgaggt gcgaaagcgt ggggatcaaa caggattaga taccctggta gtccacgctg
>       781 taaacgatgt cgactagccg ttgccgacct tgagttggga gtggcgcagc taacgcgata
>       841 agtcgaccgc ctggggagta cggccgcaag gttaaaactc aaatgaattg acgggggccc
>       901 gcacaagcgg tggagcatgt ggtttaattc gatgcaacgc gaagaacctt acctactctt
>       961 gacatccaga gaactttcca gagatggata ggtgccttcg ggaactctga gacaggtgct
>      1021 gcatggctgt cgtcagctcg tgttgtgaaa tgttgggtta agtcccgtaa cgagcgcaac
>      1081 ccttatcctt atttgccagc gggttatgcc gggaacttta aggaaactgc cggtgacaaa
>      1141 ccggaggaag gtggggacga cgtcaagtca tcatggccct tacgagtagg gctacacacg
>      1201 tgctacaatg gtaagtacag agggttgcaa gaccgcgagg tggagctaat ctcagaaaac
>      1261 ttatcgtagt ccggattgga gtctgcaact cgactccatg aagtcggaat cgctagtaat
>      1321 cgcgaatcag aatgtcgcgg tgaatacgtt cccgggcctt gtacacaccg cccgtcacac
>      1381 catgggagtg gacttcacca gaagtagtta gtctaaccgn aagggggacg attaccacgg
>      1441 tggggttcat gactggggt
> //
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From cain at cshl.edu  Mon Oct  6 16:06:40 2003
From: cain at cshl.edu (Scott Cain)
Date: Mon Oct  6 16:04:18 2003
Subject: [Bioperl-l] bp 1.2.3 ppm install fails
Message-ID: <Pine.GSO.4.05.10310061559240.19978-100000@phage.cshl.edu>

Hello all,

First, thanks to the kind soul who provided the 1.2.3 version of bioperl
as a ppm.  Unfortunately, I can't seem to get it installed due to a failed
dependency.  While using AS Perl 5.8, it seems that none of the
repositories know what 'libxml-perl' is, and since it is listed as a
prerequisite in the ppd file, installation cannot continue.  Is there
somewhere special to get that from?  At the moment, I am only searching
the ActiveState and bioperl repositories.

As a side note, it seems like a bad idea to force the installation of
every prereq module for bioperl.  In my case, I have no need of
libxml-perl, but can't install bioperl for a lack of it.

Thanks,
Scott

----------------------------------------------------------------------
Scott Cain, Ph. D.				 	 cain@cshl.org
GMOD Coordinator, http://www.gmod.org/			 (216)392-3087
----------------------------------------------------------------------


From vesko_baev at abv.bg  Mon Oct  6 19:23:50 2003
From: vesko_baev at abv.bg (Vesko Baev)
Date: Mon Oct  6 19:21:32 2003
Subject: [Bioperl-l] protein localiz. & signal peptide
Message-ID: <699462842.1065482630568.JavaMail.nobody@app1.ni.bg>

Hi,
Are there any modules for prediction of protein localization and signal peptides (excluding SigCleave)?

Thanks!
Vesko

-----------------------------------------------------------------
http://club.abv.bg/jsp/abvCard.jsp - ???? ??? - ??????? ? ?????? ????? ? ???? !
From lstein at cshl.edu  Mon Oct  6 18:58:26 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Mon Oct  6 19:48:17 2003
Subject: [Bioperl-l] GD installation problems for Solaris 9
In-Reply-To: <5.1.1.5.2.20031006134505.01510d38@email.cind.ornl.gov>
References: <5.1.1.5.2.20031006134505.01510d38@email.cind.ornl.gov>
Message-ID: <200310061858.26935.lstein@cshl.edu>

Hi,

The CAPI warning comes from using a newer version of perl.  I will issue a new 
version of GD that avoids this warning.  The Math::Trig errors warning 
because Jarkko forgot to return a 1; at the bottom of Math::Trig in the 
latest version, and hence Perl things it didn't load properly.  You will have 
to wait for version 1.02 of Math::Trig or add a 1; to the Trig.pm file 
yourself.

To track down the runtime errors, make sure that libgd passed its regression 
tests when you installed it (there are a number of example applications that 
it compiles; make sure they run).  Then make sure that it is installed in a 
library that is included in your standard library path.  Finally, make sure 
that there is not an older version of libgd lurking somewhere.

Lincoln


On Monday 06 October 2003 01:50 pm, Stephen DiFazio wrote:
> Hello,
>
> I'd like to use Bio::Graphics but can't get GD.pm to work under Solaris 9
> on a Sun Blade 2000.  I'm attempting GD 2.07, though I've tried various
> earlier versions as well with no luck.  I've installed the necessary
> prerequisites:
>
> zlib-1.1.3
> libgd-2.0.15
> libpng-1.2.5
> jpeg-6b
>
> Makefile.PL throws the following warnings for GD.pm (only):
>
> WARNING: CAPI is not a known parameter.
> Warning: prerequisite Math::Trig 1 not found.
>
> In fact, Math::Trig 1.01 is installed and functioning on my system.
>
> 'make install' gives no errors.
>
> Here's the output I get from 'make test':
>
> PERL_DL_NONLAZY=1 /usr/bin/perl "-MExtUtils::Command::MM" "-e"
> "test_harness(0, 'blib/lib', 'blib/arch')" t/*.t
> t/GD..........Can't load './blib/arch/auto/GD/GD.so' for module GD:
> ld.so.1: /usr/bin/perl: fatal: relocation error: file
> ./blib/arch/auto/GD/GD.so: symbol gdImageCreateFromPng: referenced symbol
> not found at /usr/local/lib/perl5/5.8.0/sun4-solaris/DynaLoader.pm line
> 229. at t/GD.t line 13
> Compilation failed in require at t/GD.t line 13.
> BEGIN failed--compilation aborted at t/GD.t line 13.
> t/GD..........dubious
>          Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 1-10
>          Failed 10/10 tests, 0.00% okay
> t/Polyline....Can't load '/usr/local/GD-2.07/blib/arch/auto/GD/GD.so' for
> module GD: ld.so.1: /usr/bin/perl: fatal: relocation error: file
> /usr/local/GD-2.07/blib/arch/auto/GD/GD.so: symbol gdImageCreateFromPng:
> referenced symbol not found at
> /usr/local/lib/perl5/5.8.0/sun4-solaris/DynaLoader.pm line 229.
>   at /usr/local/GD-2.07/blib/lib/GD/Polyline.pm line 49
> Compilation failed in require at /usr/local/GD-2.07/blib/lib/GD/Polyline.pm
> line 49.
> BEGIN failed--compilation aborted at
> /usr/local/GD-2.07/blib/lib/GD/Polyline.pm line 49.
> Compilation failed in require at t/Polyline.t line 10.
> BEGIN failed--compilation aborted at t/Polyline.t line 10.
> t/Polyline....dubious
>          Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED test 1
>          Failed 1/1 tests, 0.00% okay
> Failed Test  Stat Wstat Total Fail  Failed  List of Failed
> ---------------------------------------------------------------------------
>---- t/GD.t        255 65280    10   10 100.00%  1-10
> t/Polyline.t  255 65280     1    1 100.00%  1
> Failed 2/2 test scripts, 0.00% okay. 11/11 subtests failed, 0.00% okay.
> make: *** [test_dynamic] Error 2
>
> thanks in advance for any suggestions you may have.
>
> Steve
>
> Stephen P. DiFazio, Ph.D
>
> Staff Scientist
> Oak Ridge National Laboratory
> Bldg. 1062-206
> P.O. Box 2008
> Oak Ridge, TN 37831-6422
> (865) 574-7844; fax:(865) 576-8143
> difazios@ornl.gov
> http://www.esd.ornl.gov/PGG/home.html
> Stephen P. DiFazio, Ph.D
> Staff Scientist
> Oak Ridge National Laboratory
> Bldg. 1062-206
> P.O. Box 2008
> Oak Ridge, TN 37831-6422
> (865) 574-7844; fax:(865) 576-8143
> difazios@ornl.gov
> http://www.esd.ornl.gov/PGG/home.html
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)
From Richard.Holland at agresearch.co.nz  Mon Oct  6 20:58:11 2003
From: Richard.Holland at agresearch.co.nz (Holland, Richard)
Date: Mon Oct  6 20:56:00 2003
Subject: [Bioperl-l] RE: [Pubmednew] Problem with esearch.fcgi
Message-ID: <AF026AF0FF4B054590228FD1F1DE516501337550@inbox.agresearch.co.nz>

Hi,

Thanks for your answer regarding non-accessible accessions via
esearch.fcgi, but it doesn't quite solve my problem.

I am accessing GenBank programmatically via a set of modules called
BioPerl (http://www.bioperl.org/), specifically Bio::DB::GenBank. These
access the database on my behalf via the esearch.fcgi script. They
depend on the script returning the plain GenBank file without any HTML
markup.

The URL you sent me in response to my original question marks up the
response in HTML, and doesn't return just the plain GenBank file on its
own. 

My question was why are the two scripts unable to agree on the existence
of a particular accession (XM_055766)? Surely they are accessing the
same database under the hood? Are there any plans to make esearch.fcgi
recognise these more recent accessions?

I am copying this email to the BioPerl mailing list in case somebody
there can help out too.

BioPerl people - what are the alternatives? I am having the same
problems with the EBI servers, dbfetch, and Bio::DB::EMBL, which also
does not believe that the accession XM_055766 exists (although I can see
it quite clearly using Entrez at the NCBI).

cheers,
Richard

---
Richard Holland
Bioinformatics Database Developer
ITS, Agresearch Invermay x3279


-----Original Message-----
From: Monica Romiti [mailto:romiti@ncbi.nlm.nih.gov] 
Sent: Tuesday, 7 October 2003 1:42 p.m.
To: Holland, Richard
Cc: romiti@ncbi.nlm.nih.gov
Subject: FW: [Pubmednew] Problem with esearch.fcgi


Dear Colleague,

Use:


http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=search&db=nucleotide&t
erm=XM_0
55766[accn]&doptcmdl=GenBank


Best regards,

Monica at NCBI
-----Original Message-----

From:  custhelp@mail.nlm.nih.gov
Sent:  10/4/2003 04:24:45 PM
To:  Richard.Holland@agresearch.co.nz
Subject:  [Pubmednew] Problem with esearch.fcgi

Richard Holland:

Your email has been forwarded to the National Center for Biotechnology 
Information (info@ncbi.nlm.nih.gov)

Ellen M. Layman
National Library of Medicine


-----Original Message-----

From:  Richard.Holland@agresearch.co.nz
Sent:  10/1/2003 05:03:16 PM
To:  <pubmednew@ncbi.nlm.nih.gov>
Subject:  [Pubmednew] Problem with esearch.fcgi

Hi,

I can successfully search Entrez using the web-based forms for the
following term:

XM_055766

However, the same search via the eutils tool esearch.fcgi
(?db=nucleotide&term=XM_055766) returns no results. What's going on?

cheers,
Richard


=======================================================================
Attention: The information contained in this message and/or attachments
from AgResearch Limited is intended only for the persons or entities to
which it is addressed and may contain confidential and/or privileged
material. Any review, retransmission, dissemination or other use of, or
taking of any action in reliance upon, this information by persons or
entities other than the intended recipients is prohibited by AgResearch
Limited. If you have received this message in error, please notify the
sender immediately.
=======================================================================

------------- End Forwarded Message -------------


------------- End Forwarded Message -------------


Best regards,

Monica L. Romiti
NCBI User Services

=======================================================================
Attention: The information contained in this message and/or attachments
from AgResearch Limited is intended only for the persons or entities
to which it is addressed and may contain confidential and/or privileged
material. Any review, retransmission, dissemination or other use of, or
taking of any action in reliance upon, this information by persons or
entities other than the intended recipients is prohibited by AgResearch
Limited. If you have received this message in error, please notify the
sender immediately.
=======================================================================

From btclimd at nus.edu.sg  Mon Oct  6 22:43:05 2003
From: btclimd at nus.edu.sg (Desmond Lim)
Date: Mon Oct  6 22:40:47 2003
Subject: [Bioperl-l] seqIO
Message-ID: <653C572D8746D341AAA25C494CDFB9E338001C@MBXSRV05.stf.nus.edu.sg>

Hi,

I'm trying to make an array into a SeqIO object

my $seq_in = Bio::SeqIO->new('-ph' => @arr, '-format' => 'fasta');

but it doesn't work. I have gone through the documentation for the SeqIO module but have no idea how to 'read' the sequences directly from an array.

I need to do this because there is a sequence manipulation that happens to the file that is opened in the beginning.

I have a work around of creating a file with the array and then opening it using -file but I think reading from an array should be possible.

Thanks.

From btclimd at nus.edu.sg  Mon Oct  6 22:43:48 2003
From: btclimd at nus.edu.sg (Desmond Lim)
Date: Mon Oct  6 22:41:29 2003
Subject: [Bioperl-l] seqIO
Message-ID: <653C572D8746D341AAA25C494CDFB9E338001D@MBXSRV05.stf.nus.edu.sg>

Hi,

I'm trying to make an array into a SeqIO object

my $seq_in = Bio::SeqIO->new('-ph' => @arr, '-format' => 'fasta');

but it doesn't work. I have gone through the documentation for the SeqIO module but have no idea how to 'read' the sequences directly from an array.

I need to do this because there is a sequence manipulation that happens to the file that is opened in the beginning.

I have a work around of creating a file with the array and then opening it using -file but I think reading from an array should be possible.

Thanks.

From hlapp at gnf.org  Tue Oct  7 00:31:06 2003
From: hlapp at gnf.org (Hilmar Lapp)
Date: Tue Oct  7 00:28:46 2003
Subject: [Bioperl-l] seqIO
In-Reply-To: <653C572D8746D341AAA25C494CDFB9E338001C@MBXSRV05.stf.nus.edu.sg>
Message-ID: <BBA78F9A.1DCB%hlapp@gnf.org>

On 10/6/03 7:43 PM, "Desmond Lim" <btclimd@nus.edu.sg> wrote:

> Hi,
> 
> I'm trying to make an array into a SeqIO object
> 
> my $seq_in = Bio::SeqIO->new('-ph' => @arr, '-format' => 'fasta');
> 

What made you think it would accept a -ph parameter? Besides, if there was
one you'd have to provide an array reference.

> but it doesn't work. I have gone through the documentation for the SeqIO
> module but have no idea how to 'read' the sequences directly from an array.
> 
> I need to do this because there is a sequence manipulation that happens to the
> file that is opened in the beginning.
> 

I'm not sure I understand your use case. Could you try to elaborate a bit.

If what you're attempting to do is apply post-processing to the sequences
coming off of a sequence file through a SeqIO module and have the result
appear as a SeqIO-compatible stream to pass on, Bio::Seq::BaseSeqProcessor
may be for you (that's what it was designed for).

    -hilmar


> I have a work around of creating a file with the array and then opening it
> using -file but I think reading from an array should be possible.
> 
> Thanks.
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From btclimd at nus.edu.sg  Tue Oct  7 00:45:18 2003
From: btclimd at nus.edu.sg (Desmond Lim)
Date: Tue Oct  7 00:42:59 2003
Subject: [Bioperl-l] seqIO
Message-ID: <653C572D8746D341AAA25C494CDFB9E338001E@MBXSRV05.stf.nus.edu.sg>

> What made you think it would accept a -ph parameter? Besides, if
> there was
> one you'd have to provide an array reference.

I didn't think it would work but I wanted to try it out.

What I'm trying to do is get SeqIO to accept an array of sequences and not a file.

Desmond

From laurichj at bioinfo.ucr.edu  Tue Oct  7 01:25:01 2003
From: laurichj at bioinfo.ucr.edu (Josh Lauricha)
Date: Tue Oct  7 01:22:39 2003
Subject: [Bioperl-l] seqIO
In-Reply-To: <653C572D8746D341AAA25C494CDFB9E338001E@MBXSRV05.stf.nus.edu.sg>
References: <653C572D8746D341AAA25C494CDFB9E338001E@MBXSRV05.stf.nus.edu.sg>
Message-ID: <20031007052501.GA26215@bioinfo.ucr.edu>

On Tue 10/07/03 12:45, Desmond Lim wrote:
> I didn't think it would work but I wanted to try it out.
> 
> What I'm trying to do is get SeqIO to accept an array of sequences and
> not a file.

At the risk of sounding stupid, why? If you already have the sequences
in memory why do you want to read them in twice?

If the array is something like:
@a = <IN>;

then join them and use IO::String (or IO::Scalar).

-- 

----------------------------
| Josh Lauricha            |
| laurichj@bioinfo.ucr.edu |
| Bioinformatics, UCR      |
|--------------------------|
From heikki at ebi.ac.uk  Tue Oct  7 05:23:15 2003
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Tue Oct  7 05:21:07 2003
Subject: [Bioperl-l] RE: [Pubmednew] Problem with esearch.fcgi
In-Reply-To: <AF026AF0FF4B054590228FD1F1DE516501337550@inbox.agresearch.co.nz>
References: <AF026AF0FF4B054590228FD1F1DE516501337550@inbox.agresearch.co.nz>
Message-ID: <1065518595.7939.14.camel@localhost>

Richard,

If you look carefully at the entry XM_055766 in the Entrez server, you
notice this:

COMMENT     MODEL REFSEQ:  This record is predicted by automated  
            computational analysis. This record is derived from an 
            annotated genomic sequence (NT_024812) using gene 
            prediction method: BLAST, supported by mRNA and EST 
            evidence.

It is not a GenBank entry although it looks like one and NCBI Entrez
claims it is one but in fact it is from the RefSeq database. 

The XP_* entries can not be retrieved using Bio::DB::RefSeq, either,
because it fetches entries from EBI copy of the datebase which doeas not
contain all the latest sequence subclasses (only NC_*, NT_*, NM_*,
NP_*). I'll try to get that updated.

	-Heikki


On Tue, 2003-10-07 at 14:58, Holland, Richard wrote:
> Hi,
> 
> Thanks for your answer regarding non-accessible accessions via
> esearch.fcgi, but it doesn't quite solve my problem.
> 
> I am accessing GenBank programmatically via a set of modules called
> BioPerl (http://www.bioperl.org/), specifically Bio::DB::GenBank. These
> access the database on my behalf via the esearch.fcgi script. They
> depend on the script returning the plain GenBank file without any HTML
> markup.
> 
> The URL you sent me in response to my original question marks up the
> response in HTML, and doesn't return just the plain GenBank file on its
> own. 
> 
> My question was why are the two scripts unable to agree on the existence
> of a particular accession (XM_055766)? Surely they are accessing the
> same database under the hood? Are there any plans to make esearch.fcgi
> recognise these more recent accessions?
> 
> I am copying this email to the BioPerl mailing list in case somebody
> there can help out too.
> 
> BioPerl people - what are the alternatives? I am having the same
> problems with the EBI servers, dbfetch, and Bio::DB::EMBL, which also
> does not believe that the accession XM_055766 exists (although I can see
> it quite clearly using Entrez at the NCBI).
> 
> cheers,
> Richard
> 
> ---
> Richard Holland
> Bioinformatics Database Developer
> ITS, Agresearch Invermay x3279
> 
> 
> 
> -----Original Message-----
> From: Monica Romiti [mailto:romiti@ncbi.nlm.nih.gov] 
> Sent: Tuesday, 7 October 2003 1:42 p.m.
> To: Holland, Richard
> Cc: romiti@ncbi.nlm.nih.gov
> Subject: FW: [Pubmednew] Problem with esearch.fcgi
> 
> 
> Dear Colleague,
> 
> Use:
> 
> 
> http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=search&db=nucleotide&t
> erm=XM_0
> 55766[accn]&doptcmdl=GenBank
> 
> 
> Best regards,
> 
> Monica at NCBI
> -----Original Message-----
> 
> From:  custhelp@mail.nlm.nih.gov
> Sent:  10/4/2003 04:24:45 PM
> To:  Richard.Holland@agresearch.co.nz
> Subject:  [Pubmednew] Problem with esearch.fcgi
> 
> Richard Holland:
> 
> Your email has been forwarded to the National Center for Biotechnology 
> Information (info@ncbi.nlm.nih.gov)
> 
> Ellen M. Layman
> National Library of Medicine
> 
> 
> -----Original Message-----
> 
> From:  Richard.Holland@agresearch.co.nz
> Sent:  10/1/2003 05:03:16 PM
> To:  <pubmednew@ncbi.nlm.nih.gov>
> Subject:  [Pubmednew] Problem with esearch.fcgi
> 
> Hi,
> 
> I can successfully search Entrez using the web-based forms for the
> following term:
> 
> XM_055766
> 
> However, the same search via the eutils tool esearch.fcgi
> (?db=nucleotide&term=XM_055766) returns no results. What's going on?
> 
> cheers,
> Richard
> 
> 
> 
> =======================================================================
> Attention: The information contained in this message and/or attachments
> from AgResearch Limited is intended only for the persons or entities to
> which it is addressed and may contain confidential and/or privileged
> material. Any review, retransmission, dissemination or other use of, or
> taking of any action in reliance upon, this information by persons or
> entities other than the intended recipients is prohibited by AgResearch
> Limited. If you have received this message in error, please notify the
> sender immediately.
> =======================================================================
> 
> ------------- End Forwarded Message -------------
> 
> 
> ------------- End Forwarded Message -------------
> 
> 
> Best regards,
> 
> Monica L. Romiti
> NCBI User Services
> 
> =======================================================================
> Attention: The information contained in this message and/or attachments
> from AgResearch Limited is intended only for the persons or entities
> to which it is addressed and may contain confidential and/or privileged
> material. Any review, retransmission, dissemination or other use of, or
> taking of any action in reliance upon, this information by persons or
> entities other than the intended recipients is prohibited by AgResearch
> Limited. If you have received this message in error, please notify the
> sender immediately.
> =======================================================================
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________

From Richard.Adams at ed.ac.uk  Tue Oct  7 07:19:33 2003
From: Richard.Adams at ed.ac.uk (Richard Adams)
Date: Tue Oct  7 07:17:11 2003
Subject: [Bioperl-l] Homologene parsing
Message-ID: <3F82A145.E9782924@ed.ac.uk>

Hello,
    I'm writing some code to access data from NCBI's homologene dataset
of reciprocal best blast matches
 (see http://www.ncbi.nlm.nih.gov/HomoloGene/details.shtml) and am
thinking of Bioperlifying it.
At the moment I just have code for trawling through the file,  parsing
individual entries and accessing their attributes
     e.g.,

            my @orthologueget_orthologous_accessions ()
             my @locus_ids     =get_orthologous_locusIds()
             my $cluster_size = orthologue_count()
           my $boolean =  has_orthologue('mouse') etc

    but to be useful for accessing clusters quickly I guess it needs
some sort of indexing - there are about 21 000 entries. I don't know
much about databases /indexing etc but would inheriting from
Bio::Index::Abstract be the way to go,, in a similar way to swissPfam?
Ideally one would get a db handle and use that to retrieve a particular
entry indexed by any of the identifiers in the cluster

    e.g., my $db= new Bio::DB::Homologene
                    $db->fetch_cluster_by_name('UBE2B'); access by gene
name
                    $db->fetch_cluster_by_id('NM_005197');

    of by taxon id
                  my @human_clusters =   $db->fetch_all_by_taxon(9606);

Any broad pointers in the right direction concerning indexing would be
greatly appreciated, I am quite happy to do all the coding
but want to make sure it's useful at the end of the day.

Cheers

Richard


--
Dr Richard Adams
Bioinformatician,
Psychiatric Genetics Group,
Medical Genetics,
Molecular Medicine Centre,
Western General Hospital,
Crewe Rd West,
Edinburgh UK
EH4 2XU

Tel: 44 131 651 1084
richard.adams@ed.ac.uk


From brian_osborne at cognia.com  Tue Oct  7 07:57:07 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Tue Oct  7 07:59:35 2003
Subject: [Bioperl-l] bp 1.2.3 ppm install fails
In-Reply-To: <Pine.GSO.4.05.10310061559240.19978-100000@phage.cshl.edu>
Message-ID: <GAEDKMGOKFBLJPKCLKCCOENFDCAA.brian_osborne@cognia.com>

Scott,

> As a side note, it seems like a bad idea to force the installation of
every prereq module for bioperl

Definitely a bad idea, that's why Bioperl doesn't do this. In the CPAN
shell, for example, you can do "force install". There is also a "force
install" option in ActiveState, something like setting force-install to 1?
Please check the ActiveState documentation and tell me if I'm wrong.

A few months ago I took a look at the "required modules" list in Bioperl and
found that many of them were not supplied by ActiveState, just as you're
finding.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Scott Cain
Sent: Monday, October 06, 2003 4:07 PM
To: bioperl-l@bioperl.org
Cc: cain@cshl.org
Subject: [Bioperl-l] bp 1.2.3 ppm install fails

Hello all,

First, thanks to the kind soul who provided the 1.2.3 version of bioperl
as a ppm.  Unfortunately, I can't seem to get it installed due to a failed
dependency.  While using AS Perl 5.8, it seems that none of the
repositories know what 'libxml-perl' is, and since it is listed as a
prerequisite in the ppd file, installation cannot continue.  Is there
somewhere special to get that from?  At the moment, I am only searching
the ActiveState and bioperl repositories.

As a side note, it seems like a bad idea to force the installation of
every prereq module for bioperl.  In my case, I have no need of
libxml-perl, but can't install bioperl for a lack of it.

Thanks,
Scott

----------------------------------------------------------------------
Scott Cain, Ph. D.                                       cain@cshl.org
GMOD Coordinator, http://www.gmod.org/                   (216)392-3087
----------------------------------------------------------------------


_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From brian_osborne at cognia.com  Tue Oct  7 08:19:13 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Tue Oct  7 08:21:11 2003
Subject: [Bioperl-l] Bio::Biblio
In-Reply-To: <20031006140504.GA2631@plessy.org>
Message-ID: <GAEDKMGOKFBLJPKCLKCCKENGDCAA.brian_osborne@cognia.com>

Charles,

Take a look at examples/biblio/biblio_examples.pl, there are some
one-liner's there that you might find useful.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Charles Plessy
Sent: Monday, October 06, 2003 10:05 AM
To: bioperl-l@bioperl.org
Subject: [Bioperl-l] Bio::Biblio

Hi,

        My goal is to write a quick-and-dirty medline2docbook fliter
for immediate use, using bioperl since it has already a library able
to query medline.

        However, I don't know how to quickly extract basic fields,
like author or journal, to use them in this kind of output system:

print << DOCBOOK
(...)
<biblioset>
   <authorgroup>
    <author>
     <surname>$object->Author_surname</surname>
     <firstname>$object->Author_firstname</firstname>
    </author>
(...)
DOCBOOK

Can anybody point me to the correct object and its construction method?

--
Charles
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From fsanchez at cifn.unam.mx  Mon Oct  6 14:25:26 2003
From: fsanchez at cifn.unam.mx (=?ISO-8859-1?Q?Fabiola_S=E1nchez?=)
Date: Tue Oct  7 08:32:12 2003
Subject: [Bioperl-l] How can i get ...
Message-ID: <3F81B396.3010804@cifn.unam.mx>

Hello!
I'm reading  files in genbank format
i want to parser the lines below  or next to organism, but these don't 
have tag
how can i get this
i can get organism but  i can't get the lines next to organism
for example:

ORGANISM  Pseudospirillum japonicum
            Bacteria; Proteobacteria; Gammaproteobacteria; 
Oceanospirillales;
            Pseudospirillum.
I need to get: 
            Bacteria; Proteobacteria; Gammaproteobacteria; 
Oceanospirillales;
            Pseudospirillum.
but i can't

Thank you.

Fabi


LOCUS       AB006766                1459 bp    DNA     linear   BCT 
13-FEB-1999
DEFINITION  Oceanospirillum japonicum gene for 16S rRNA, partial sequence,
            strain:IFO19191.
ACCESSION   AB006766
VERSION     AB006766.1  GI:4049363
KEYWORDS    16S ribosomal RNA.
SOURCE      Pseudospirillum japonicum
  ORGANISM  Pseudospirillum japonicum
            Bacteria; Proteobacteria; Gammaproteobacteria; 
Oceanospirillales;
            Pseudospirillum.
REFERENCE   1  (sites)
  AUTHORS   Satomi,M., Kimura,B., Hayashi,M., Shouzen,Y., Okuzumi,M. and
            Fujii,T.
  TITLE     Marinospirillum gen. nov., with descriptions of Marinospirillum
            megaterium sp. nov., isolated from kusaya gravy, and transfer of
            Oceanospirillum minutulum to Marinospirillum minutulum comb. nov
  JOURNAL   Int. J. Syst. Bacteriol. 48 Pt 4, 1341-1348 (1998)
  MEDLINE   99045875
   PUBMED   9828435
REFERENCE   2  (bases 1 to 1459)
  AUTHORS   Satomi,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1997) Masataka Satomi, National Reserch 
Institute
            of Fisheries Science, Food Processing and Preservation Division;
            Fukuura 2-12-4, Kanazawa-ku,, Yokohama, Kanagawa 236, Japan
            (E-mail:msatomi@nrifs.affrc.go.jp, Tel:81-45-788-7670,
            Fax:81-45-788-7670)
FEATURES             Location/Qualifiers
     source          1..1459
                     /organism="Pseudospirillum japonicum"
                     /mol_type="genomic DNA"
                     /strain="ATCC19191"
                     /db_xref="taxon:64971"
     rRNA            <1..>1459
                     /product="16S ribosomal RNA"
BASE COUNT      368 a    326 c    462 g    302 t      1 others
ORIGIN     
        1 attgaacgct ggcggcaggc ctaacacatg caagtcgagc ggcagcgggg agtagcttgc
       61 tactttgccg gcgagcggcg gacgggtgag taacgcatag gaatctgccc agtagagggg
      121 gatagccagg ggaaactctg attaataccg catacgccct acgggggaaa aggggctttt
      181 agctcctgct attggatgag cctatgtcgg attagctagt tggtagggta aaggcctacc
      241 aaggcgacga tccgtagctg ttctgagagg atgatcagcc acactgggac tgagacacgg
      301 cccagactcc tacgggaggc agcagtgggg aatattgcac aatgggggga accctgatgc
      361 acccatcccg cgtgtgtgaa gaaggccttc gggttgtaaa gcactttcag caaggaggaa
      421 ggccgtatgc ttaataggca tgcggattga cgttacttgc agaagaagca ccggctaact
      481 ccgtgccagc agccgcggta atacggaggg tgcgagcgtt aatcggaatt actgggcgta
      541 aagcgcgcgt aggcggatag gtcagtcaga tgtgaaagcc ctgggctcaa cctaggacgt
      601 gcacctgata ctgcttatct agagtaaggt agagggtagt agaatttcct gtgtagcggt
      661 gaaatgcgta gatataggaa ggaataccgg tggcgaaggc ggctacctgg actattactg
      721 acgctgaggt gcgaaagcgt ggggatcaaa caggattaga taccctggta gtccacgctg
      781 taaacgatgt cgactagccg ttgccgacct tgagttggga gtggcgcagc taacgcgata
      841 agtcgaccgc ctggggagta cggccgcaag gttaaaactc aaatgaattg acgggggccc
      901 gcacaagcgg tggagcatgt ggtttaattc gatgcaacgc gaagaacctt acctactctt
      961 gacatccaga gaactttcca gagatggata ggtgccttcg ggaactctga gacaggtgct
     1021 gcatggctgt cgtcagctcg tgttgtgaaa tgttgggtta agtcccgtaa cgagcgcaac
     1081 ccttatcctt atttgccagc gggttatgcc gggaacttta aggaaactgc cggtgacaaa
     1141 ccggaggaag gtggggacga cgtcaagtca tcatggccct tacgagtagg gctacacacg
     1201 tgctacaatg gtaagtacag agggttgcaa gaccgcgagg tggagctaat ctcagaaaac
     1261 ttatcgtagt ccggattgga gtctgcaact cgactccatg aagtcggaat cgctagtaat
     1321 cgcgaatcag aatgtcgcgg tgaatacgtt cccgggcctt gtacacaccg cccgtcacac
     1381 catgggagtg gacttcacca gaagtagtta gtctaaccgn aagggggacg attaccacgg
     1441 tggggttcat gactggggt
//
From walsh at cenix-bioscience.com  Tue Oct  7 07:33:59 2003
From: walsh at cenix-bioscience.com (Andrew Walsh)
Date: Tue Oct  7 08:32:36 2003
Subject: [Bioperl-l] RE: [Pubmednew] Problem with esearch.fcgi
In-Reply-To: <1065518595.7939.14.camel@localhost>
References: <AF026AF0FF4B054590228FD1F1DE516501337550@inbox.agresearch.co.nz>
	<1065518595.7939.14.camel@localhost>
Message-ID: <3F82A4A7.8070205@cenix-bioscience.com>

I seem to be able to retrieve X[MP] sequences using Bioperl version 1.2.2.

Depending on the type of accession I'm dealing with, I use either 
Bio::DB::GenBank or Bio::DB::RefSeq.

The following code retrieves both XM_055766 and XP_055766 (the 
corresponding peptide) from the web:

As a side note (maybe someone with more knowledge of the SeqIO parser 
knows why). The 'KEYWORD' part of the Genbank file does not seem to be 
retrieved properly.
I get this line in the sequence file: 'KEYWORDS    ARRAY(0x8ba8f30).'

# the code...

# $fmt has been set to 'genbank' or 'fasta'
# get $accs, ref to array of accessions

for my $acc (@$accs) {
     my %ext_map = (genbank => 'gbff',
                    fasta   => 'fasta');
     my $file = "$acc.$ext_map{$fmt}";
     my $io   = Bio::SeqIO->new(-file => ">$file", -format => $fmt);
     my $seq;
     eval {
         $seq = get_seq_obj_from_web($acc);
     };
     if ($@) {
         warn "Could not get $acc: $@\n";
     }
     else {
         $io->write_seq($seq);
     }
}

sub get_seq_obj_from_web {
     my ($acc_num) = @_;
     warn "Getting $acc_num from web\n";
     my $ncbi_db = get_web_database_obj($acc_num);
     my $seq;
     eval {
	$seq = $ncbi_db->get_Seq_by_id($acc_num);
     };
     if ($@ || !$seq) {
	cluck "Could not retrieve $acc_num from NCBI: $@";
	return undef;
     }
     else {
         return $seq;
     }
}

sub get_web_database_obj {
     my $acc = shift;
     my $ncbi_db;
     # appears that XM #'s only retrievable from GenBank, not RefSeq.
     if ($acc =~ /^[N]M_/) {
	$ncbi_db = new Bio::DB::RefSeq()

     elsif ($acc =~ /^[NX]P_/) {
	$ncbi_db = new Bio::DB::GenPept()
     }
     else {
	$ncbi_db = new Bio::DB::GenBank()
     }
     my $proxy = 'http://my.proxy.com:3128/';
     $ncbi_db->proxy(['http','ftp'], $proxy);
     return $ncbi_db;
}

Heikki Lehvaslaiho wrote:
> Richard,
> 
> If you look carefully at the entry XM_055766 in the Entrez server, you
> notice this:
> 
> COMMENT     MODEL REFSEQ:  This record is predicted by automated  
>             computational analysis. This record is derived from an 
>             annotated genomic sequence (NT_024812) using gene 
>             prediction method: BLAST, supported by mRNA and EST 
>             evidence.
> 
> It is not a GenBank entry although it looks like one and NCBI Entrez
> claims it is one but in fact it is from the RefSeq database. 
> 
> The XP_* entries can not be retrieved using Bio::DB::RefSeq, either,
> because it fetches entries from EBI copy of the datebase which doeas not
> contain all the latest sequence subclasses (only NC_*, NT_*, NM_*,
> NP_*). I'll try to get that updated.
> 
> 	-Heikki
> 
> 
> 
> On Tue, 2003-10-07 at 14:58, Holland, Richard wrote:
> 
>>Hi,
>>
>>Thanks for your answer regarding non-accessible accessions via
>>esearch.fcgi, but it doesn't quite solve my problem.
>>
>>I am accessing GenBank programmatically via a set of modules called
>>BioPerl (http://www.bioperl.org/), specifically Bio::DB::GenBank. These
>>access the database on my behalf via the esearch.fcgi script. They
>>depend on the script returning the plain GenBank file without any HTML
>>markup.
>>
>>The URL you sent me in response to my original question marks up the
>>response in HTML, and doesn't return just the plain GenBank file on its
>>own. 
>>
>>My question was why are the two scripts unable to agree on the existence
>>of a particular accession (XM_055766)? Surely they are accessing the
>>same database under the hood? Are there any plans to make esearch.fcgi
>>recognise these more recent accessions?
>>
>>I am copying this email to the BioPerl mailing list in case somebody
>>there can help out too.
>>
>>BioPerl people - what are the alternatives? I am having the same
>>problems with the EBI servers, dbfetch, and Bio::DB::EMBL, which also
>>does not believe that the accession XM_055766 exists (although I can see
>>it quite clearly using Entrez at the NCBI).
>>
>>cheers,
>>Richard
>>
>>---
>>Richard Holland
>>Bioinformatics Database Developer
>>ITS, Agresearch Invermay x3279
>>
>>
>>
>>-----Original Message-----
>>From: Monica Romiti [mailto:romiti@ncbi.nlm.nih.gov] 
>>Sent: Tuesday, 7 October 2003 1:42 p.m.
>>To: Holland, Richard
>>Cc: romiti@ncbi.nlm.nih.gov
>>Subject: FW: [Pubmednew] Problem with esearch.fcgi
>>
>>
>>Dear Colleague,
>>
>>Use:
>>
>>
>>http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=search&db=nucleotide&t
>>erm=XM_0
>>55766[accn]&doptcmdl=GenBank
>>
>>
>>Best regards,
>>
>>Monica at NCBI
>>-----Original Message-----
>>
>>From:  custhelp@mail.nlm.nih.gov
>>Sent:  10/4/2003 04:24:45 PM
>>To:  Richard.Holland@agresearch.co.nz
>>Subject:  [Pubmednew] Problem with esearch.fcgi
>>
>>Richard Holland:
>>
>>Your email has been forwarded to the National Center for Biotechnology 
>>Information (info@ncbi.nlm.nih.gov)
>>
>>Ellen M. Layman
>>National Library of Medicine
>>
>>
>>-----Original Message-----
>>
>>From:  Richard.Holland@agresearch.co.nz
>>Sent:  10/1/2003 05:03:16 PM
>>To:  <pubmednew@ncbi.nlm.nih.gov>
>>Subject:  [Pubmednew] Problem with esearch.fcgi
>>
>>Hi,
>>
>>I can successfully search Entrez using the web-based forms for the
>>following term:
>>
>>XM_055766
>>
>>However, the same search via the eutils tool esearch.fcgi
>>(?db=nucleotide&term=XM_055766) returns no results. What's going on?
>>
>>cheers,
>>Richard
>>
>>
>>
>>=======================================================================
>>Attention: The information contained in this message and/or attachments
>>from AgResearch Limited is intended only for the persons or entities to
>>which it is addressed and may contain confidential and/or privileged
>>material. Any review, retransmission, dissemination or other use of, or
>>taking of any action in reliance upon, this information by persons or
>>entities other than the intended recipients is prohibited by AgResearch
>>Limited. If you have received this message in error, please notify the
>>sender immediately.
>>=======================================================================
>>
>>------------- End Forwarded Message -------------
>>
>>
>>------------- End Forwarded Message -------------
>>
>>
>>Best regards,
>>
>>Monica L. Romiti
>>NCBI User Services
>>
>>=======================================================================
>>Attention: The information contained in this message and/or attachments
>>from AgResearch Limited is intended only for the persons or entities
>>to which it is addressed and may contain confidential and/or privileged
>>material. Any review, retransmission, dissemination or other use of, or
>>taking of any action in reliance upon, this information by persons or
>>entities other than the intended recipients is prohibited by AgResearch
>>Limited. If you have received this message in error, please notify the
>>sender immediately.
>>=======================================================================
>>
>>_______________________________________________
>>Bioperl-l mailing list
>>Bioperl-l@portal.open-bio.org
>>http://portal.open-bio.org/mailman/listinfo/bioperl-l


-- 
------------------------------------------------------------------
Andrew Walsh, M.Sc.
Bioinformatics Software Engineer
IT Unit
Cenix BioScience GmbH
Pfotenhauerstr. 108
01307 Dresden, Germany
Tel. +49(351)210-2699
Fax  +49(351)210-1309

public key: http://www.cenix-bioscience.com/public_keys/walsh.gpg
------------------------------------------------------------------


From Marc.Logghe at devgen.com  Tue Oct  7 08:58:42 2003
From: Marc.Logghe at devgen.com (Marc Logghe)
Date: Tue Oct  7 08:56:38 2003
Subject: [Bioperl-l] How can i get ...
Message-ID: <BEE28BF86078B6429D6C780635718E212E73E1@morelia.be.devgen.com>

Hi,
it is like Heikki already mentioned, you need the classification() method of Bio::Species.
You get the output 
"Bacteria; Proteobacteria; Gammaproteobacteria; Oceanospirillales; Pseudospirillum"
when you run the code:
my $species = $seq->species;
my @classification = reverse $species->classification;
pop @classification;
print join( '; ', @classification), "\n";

ML
> -----Original Message-----
> From: Fabiola S?nchez [mailto:fsanchez@cifn.unam.mx]
> Sent: Monday, October 06, 2003 8:25 PM
> To: bioperl-l@bioperl.org
> Subject: [Bioperl-l] How can i get ...
> 
> 
> Hello!
> I'm reading  files in genbank format
> i want to parser the lines below  or next to organism, but 
> these don't 
> have tag
> how can i get this
> i can get organism but  i can't get the lines next to organism
> for example:
> 
> ORGANISM  Pseudospirillum japonicum
>             Bacteria; Proteobacteria; Gammaproteobacteria; 
> Oceanospirillales;
>             Pseudospirillum.
> I need to get: 
>             Bacteria; Proteobacteria; Gammaproteobacteria; 
> Oceanospirillales;
>             Pseudospirillum.
> but i can't
> 
> Thank you.
> 
> Fabi
> 
> 
> 
> LOCUS       AB006766                1459 bp    DNA     linear   BCT 
> 13-FEB-1999
> DEFINITION  Oceanospirillum japonicum gene for 16S rRNA, 
> partial sequence,
>             strain:IFO19191.
> ACCESSION   AB006766
> VERSION     AB006766.1  GI:4049363
> KEYWORDS    16S ribosomal RNA.
> SOURCE      Pseudospirillum japonicum
>   ORGANISM  Pseudospirillum japonicum
>             Bacteria; Proteobacteria; Gammaproteobacteria; 
> Oceanospirillales;
>             Pseudospirillum.
> REFERENCE   1  (sites)
>   AUTHORS   Satomi,M., Kimura,B., Hayashi,M., Shouzen,Y., 
> Okuzumi,M. and
>             Fujii,T.
>   TITLE     Marinospirillum gen. nov., with descriptions of 
> Marinospirillum
>             megaterium sp. nov., isolated from kusaya gravy, 
> and transfer of
>             Oceanospirillum minutulum to Marinospirillum 
> minutulum comb. nov
>   JOURNAL   Int. J. Syst. Bacteriol. 48 Pt 4, 1341-1348 (1998)
>   MEDLINE   99045875
>    PUBMED   9828435
> REFERENCE   2  (bases 1 to 1459)
>   AUTHORS   Satomi,M.
>   TITLE     Direct Submission
>   JOURNAL   Submitted (20-AUG-1997) Masataka Satomi, National Reserch 
> Institute
>             of Fisheries Science, Food Processing and 
> Preservation Division;
>             Fukuura 2-12-4, Kanazawa-ku,, Yokohama, Kanagawa 
> 236, Japan
>             (E-mail:msatomi@nrifs.affrc.go.jp, Tel:81-45-788-7670,
>             Fax:81-45-788-7670)
> FEATURES             Location/Qualifiers
>      source          1..1459
>                      /organism="Pseudospirillum japonicum"
>                      /mol_type="genomic DNA"
>                      /strain="ATCC19191"
>                      /db_xref="taxon:64971"
>      rRNA            <1..>1459
>                      /product="16S ribosomal RNA"
> BASE COUNT      368 a    326 c    462 g    302 t      1 others
> ORIGIN     
>         1 attgaacgct ggcggcaggc ctaacacatg caagtcgagc 
> ggcagcgggg agtagcttgc
>        61 tactttgccg gcgagcggcg gacgggtgag taacgcatag 
> gaatctgccc agtagagggg
>       121 gatagccagg ggaaactctg attaataccg catacgccct 
> acgggggaaa aggggctttt
>       181 agctcctgct attggatgag cctatgtcgg attagctagt 
> tggtagggta aaggcctacc
>       241 aaggcgacga tccgtagctg ttctgagagg atgatcagcc 
> acactgggac tgagacacgg
>       301 cccagactcc tacgggaggc agcagtgggg aatattgcac 
> aatgggggga accctgatgc
>       361 acccatcccg cgtgtgtgaa gaaggccttc gggttgtaaa 
> gcactttcag caaggaggaa
>       421 ggccgtatgc ttaataggca tgcggattga cgttacttgc 
> agaagaagca ccggctaact
>       481 ccgtgccagc agccgcggta atacggaggg tgcgagcgtt 
> aatcggaatt actgggcgta
>       541 aagcgcgcgt aggcggatag gtcagtcaga tgtgaaagcc 
> ctgggctcaa cctaggacgt
>       601 gcacctgata ctgcttatct agagtaaggt agagggtagt 
> agaatttcct gtgtagcggt
>       661 gaaatgcgta gatataggaa ggaataccgg tggcgaaggc 
> ggctacctgg actattactg
>       721 acgctgaggt gcgaaagcgt ggggatcaaa caggattaga 
> taccctggta gtccacgctg
>       781 taaacgatgt cgactagccg ttgccgacct tgagttggga 
> gtggcgcagc taacgcgata
>       841 agtcgaccgc ctggggagta cggccgcaag gttaaaactc 
> aaatgaattg acgggggccc
>       901 gcacaagcgg tggagcatgt ggtttaattc gatgcaacgc 
> gaagaacctt acctactctt
>       961 gacatccaga gaactttcca gagatggata ggtgccttcg 
> ggaactctga gacaggtgct
>      1021 gcatggctgt cgtcagctcg tgttgtgaaa tgttgggtta 
> agtcccgtaa cgagcgcaac
>      1081 ccttatcctt atttgccagc gggttatgcc gggaacttta 
> aggaaactgc cggtgacaaa
>      1141 ccggaggaag gtggggacga cgtcaagtca tcatggccct 
> tacgagtagg gctacacacg
>      1201 tgctacaatg gtaagtacag agggttgcaa gaccgcgagg 
> tggagctaat ctcagaaaac
>      1261 ttatcgtagt ccggattgga gtctgcaact cgactccatg 
> aagtcggaat cgctagtaat
>      1321 cgcgaatcag aatgtcgcgg tgaatacgtt cccgggcctt 
> gtacacaccg cccgtcacac
>      1381 catgggagtg gacttcacca gaagtagtta gtctaaccgn 
> aagggggacg attaccacgg
>      1441 tggggttcat gactggggt
> //
> 

From cain at cshl.org  Tue Oct  7 09:32:19 2003
From: cain at cshl.org (Scott Cain)
Date: Tue Oct  7 09:29:56 2003
Subject: [Bioperl-l] bp 1.2.3 ppm install fails
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCOENFDCAA.brian_osborne@cognia.com>
References: <GAEDKMGOKFBLJPKCLKCCOENFDCAA.brian_osborne@cognia.com>
Message-ID: <1065533538.1757.5.camel@localhost.localdomain>

Brian,

As far as I can tell, the '-force' option in ppm (for perl 5.8) is only
useful for reinstalling modules that it thinks are already installed
(and even then, it doesn't always do that).  I'm not sure what a good
solution is, since I don't think the ActiveState framework allows
"optional" modules.   :-/

As a side note, I did find a libxml that I could install via ppm
elsewhere with google, but this mess means that I have to give
directions to users who want to install GBrowse via ppm how to get
libxml even though they don't need it for GBrowse.

Thanks,
Scott

On Tue, 2003-10-07 at 07:57, Brian Osborne wrote:
> Scott,
> 
> > As a side note, it seems like a bad idea to force the installation of
> every prereq module for bioperl
> 
> Definitely a bad idea, that's why Bioperl doesn't do this. In the CPAN
> shell, for example, you can do "force install". There is also a "force
> install" option in ActiveState, something like setting force-install to 1?
> Please check the ActiveState documentation and tell me if I'm wrong.
> 
> A few months ago I took a look at the "required modules" list in Bioperl and
> found that many of them were not supplied by ActiveState, just as you're
> finding.
> 
> Brian O.
> 
> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org
> [mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Scott Cain
> Sent: Monday, October 06, 2003 4:07 PM
> To: bioperl-l@bioperl.org
> Cc: cain@cshl.org
> Subject: [Bioperl-l] bp 1.2.3 ppm install fails
> 
> Hello all,
> 
> First, thanks to the kind soul who provided the 1.2.3 version of bioperl
> as a ppm.  Unfortunately, I can't seem to get it installed due to a failed
> dependency.  While using AS Perl 5.8, it seems that none of the
> repositories know what 'libxml-perl' is, and since it is listed as a
> prerequisite in the ppd file, installation cannot continue.  Is there
> somewhere special to get that from?  At the moment, I am only searching
> the ActiveState and bioperl repositories.
> 
> As a side note, it seems like a bad idea to force the installation of
> every prereq module for bioperl.  In my case, I have no need of
> libxml-perl, but can't install bioperl for a lack of it.
> 
> Thanks,
> Scott
> 
> ----------------------------------------------------------------------
> Scott Cain, Ph. D.                                       cain@cshl.org
> GMOD Coordinator, http://www.gmod.org/                   (216)392-3087
> ----------------------------------------------------------------------
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
-- 
------------------------------------------------------------------------
Scott Cain, Ph. D.                                         cain@cshl.org
GMOD Coordinator (http://www.gmod.org/)                     216-392-3087
Cold Spring Harbor Laboratory

From cjfields at uiuc.edu  Tue Oct  7 10:49:02 2003
From: cjfields at uiuc.edu (Chris Fields)
Date: Tue Oct  7 10:46:53 2003
Subject: [Bioperl-l] bp 1.2.3 ppm install fails
In-Reply-To: <1065533538.1757.5.camel@localhost.localdomain>
References: <GAEDKMGOKFBLJPKCLKCCOENFDCAA.brian_osborne@cognia.com>
	<1065533538.1757.5.camel@localhost.localdomain>
Message-ID: <6.0.0.22.2.20031007092544.01cd2c70@staff.uiuc.edu>

Scott,

Randy Kobes has a number of PPM's for Windows that ActiveState doesn't 
provide, for both Perl 5.6 and 5.8.  I believe ActiveState has an automated 
build engine that will not post PPM's to the PPM depositories unless they 
successfully build (no failed tests).  The only other way to get them to 
post is to specifically ask for it on ASPN, and that may take them a 
while.  I think I picked up most of the required external modules from 
there, including GD (v 2.07).

If using ppm3:

rep add Kobes http://theoryx5.uwinnipeg.ca/cgi-bin/ppmserver?urn:/PPMServer

I think ppm2 is similar, but I haven't used it in a while.

Chris Fields

At 08:32 AM 10/7/2003, you wrote:
>Brian,
>
>As far as I can tell, the '-force' option in ppm (for perl 5.8) is only
>useful for reinstalling modules that it thinks are already installed
>(and even then, it doesn't always do that).  I'm not sure what a good
>solution is, since I don't think the ActiveState framework allows
>"optional" modules.   :-/
>
>As a side note, I did find a libxml that I could install via ppm
>elsewhere with google, but this mess means that I have to give
>directions to users who want to install GBrowse via ppm how to get
>libxml even though they don't need it for GBrowse.
>
>Thanks,
>Scott
>
>On Tue, 2003-10-07 at 07:57, Brian Osborne wrote:
> > Scott,
> >
> > > As a side note, it seems like a bad idea to force the installation of
> > every prereq module for bioperl
> >
> > Definitely a bad idea, that's why Bioperl doesn't do this. In the CPAN
> > shell, for example, you can do "force install". There is also a "force
> > install" option in ActiveState, something like setting force-install to 1?
> > Please check the ActiveState documentation and tell me if I'm wrong.
> >
> > A few months ago I took a look at the "required modules" list in 
> Bioperl and
> > found that many of them were not supplied by ActiveState, just as you're
> > finding.
> >
> > Brian O.
> >
> > -----Original Message-----
> > From: bioperl-l-bounces@portal.open-bio.org
> > [mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Scott Cain
> > Sent: Monday, October 06, 2003 4:07 PM
> > To: bioperl-l@bioperl.org
> > Cc: cain@cshl.org
> > Subject: [Bioperl-l] bp 1.2.3 ppm install fails
> >
> > Hello all,
> >
> > First, thanks to the kind soul who provided the 1.2.3 version of bioperl
> > as a ppm.  Unfortunately, I can't seem to get it installed due to a failed
> > dependency.  While using AS Perl 5.8, it seems that none of the
> > repositories know what 'libxml-perl' is, and since it is listed as a
> > prerequisite in the ppd file, installation cannot continue.  Is there
> > somewhere special to get that from?  At the moment, I am only searching
> > the ActiveState and bioperl repositories.
> >
> > As a side note, it seems like a bad idea to force the installation of
> > every prereq module for bioperl.  In my case, I have no need of
> > libxml-perl, but can't install bioperl for a lack of it.
> >
> > Thanks,
> > Scott
> >
> > ----------------------------------------------------------------------
> > Scott Cain, Ph. D.                                       cain@cshl.org
> > GMOD Coordinator, http://www.gmod.org/                   (216)392-3087
> > ----------------------------------------------------------------------
> >
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
>--
>------------------------------------------------------------------------
>Scott Cain, Ph. D.                                         cain@cshl.org
>GMOD Coordinator (http://www.gmod.org/)                     216-392-3087
>Cold Spring Harbor Laboratory
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l

__________________________________

Chris Fields - Postdoctoral Researcher
Lab of Dr. Robert Switzer

Address:

University of Illinois at Urbana-Champaign
Dept. of Biochemistry - 323 RAL
600 S. Mathews Ave.
Urbana, IL 61801

Phone : (217) 333-7098
Fax : (217) 244-5858 

From lstein at cshl.edu  Tue Oct  7 11:09:01 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Tue Oct  7 11:06:45 2003
Subject: [Bioperl-l] bp 1.2.3 ppm install fails
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCOENFDCAA.brian_osborne@cognia.com>
References: <GAEDKMGOKFBLJPKCLKCCOENFDCAA.brian_osborne@cognia.com>
Message-ID: <200310071109.01780.lstein@cshl.edu>

It sounds like the modules that aren't supported by ActiveState should be 
removed from the PPM dependencies list.  The CPAN version of BioPerl doesn't 
have ANY dependencies right now.  I think this is an issue because one almost 
immediately wants IO::String to work with the seqio modules and it wouldn't 
hurt to add it as a dependency.

Lincoln

On Tuesday 07 October 2003 07:57 am, Brian Osborne wrote:
> Scott,
>
> > As a side note, it seems like a bad idea to force the installation of
>
> every prereq module for bioperl
>
> Definitely a bad idea, that's why Bioperl doesn't do this. In the CPAN
> shell, for example, you can do "force install". There is also a "force
> install" option in ActiveState, something like setting force-install to 1?
> Please check the ActiveState documentation and tell me if I'm wrong.
>
> A few months ago I took a look at the "required modules" list in Bioperl
> and found that many of them were not supplied by ActiveState, just as
> you're finding.
>
> Brian O.
>
> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org
> [mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Scott Cain
> Sent: Monday, October 06, 2003 4:07 PM
> To: bioperl-l@bioperl.org
> Cc: cain@cshl.org
> Subject: [Bioperl-l] bp 1.2.3 ppm install fails
>
> Hello all,
>
> First, thanks to the kind soul who provided the 1.2.3 version of bioperl
> as a ppm.  Unfortunately, I can't seem to get it installed due to a failed
> dependency.  While using AS Perl 5.8, it seems that none of the
> repositories know what 'libxml-perl' is, and since it is listed as a
> prerequisite in the ppd file, installation cannot continue.  Is there
> somewhere special to get that from?  At the moment, I am only searching
> the ActiveState and bioperl repositories.
>
> As a side note, it seems like a bad idea to force the installation of
> every prereq module for bioperl.  In my case, I have no need of
> libxml-perl, but can't install bioperl for a lack of it.
>
> Thanks,
> Scott
>
> ----------------------------------------------------------------------
> Scott Cain, Ph. D.                                       cain@cshl.org
> GMOD Coordinator, http://www.gmod.org/                   (216)392-3087
> ----------------------------------------------------------------------
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
========================================================================
Lincoln D. Stein                           Cold Spring Harbor Laboratory
lstein@cshl.org			                  Cold Spring Harbor, NY
========================================================================


From jason at cgt.duhs.duke.edu  Tue Oct  7 11:33:34 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Oct  7 11:31:11 2003
Subject: [Bioperl-l] RE: [Pubmednew] Problem with esearch.fcgi
In-Reply-To: <3F82A4A7.8070205@cenix-bioscience.com>
References: <AF026AF0FF4B054590228FD1F1DE516501337550@inbox.agresearch.co.nz>
	<1065518595.7939.14.camel@localhost>
	<3F82A4A7.8070205@cenix-bioscience.com>
Message-ID: <Pine.LNX.4.50.0310071131470.2882-100000@tenero.duhs.duke.edu>

On Tue, 7 Oct 2003, Andrew Walsh wrote:

> As a side note (maybe someone with more knowledge of the SeqIO parser
> knows why). The 'KEYWORD' part of the Genbank file does not seem to be
> retrieved properly.
> I get this line in the sequence file: 'KEYWORDS    ARRAY(0x8ba8f30).'

This was fixed in bioperl 1.2.3.

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From nigam at psu.edu  Tue Oct  7 12:05:06 2003
From: nigam at psu.edu (Nigam Shah)
Date: Tue Oct  7 12:02:44 2003
Subject: [Bioperl-l] PPM Install for libxml-perl
Message-ID: <000501c38cec$c65f7120$34167680@Vivek>

Hi! Scott,

add the following repository:
repository add "Winnipeg_server" http://theoryx5.uwinnipeg.ca/ppms

(the repository name _has_ to be in double quotes and no trailing '/' after
the URL other wise ppm will die with: Missing base argument at
C:/Perl8/site/lib/PPM/Repository.pm line 174)

You will find most of the dependencies there. Look at
http://www.personal.psu.edu/faculty/n/h/nhs109/Bioperl/ for more details
about installation on perl 5.8.

Regards,
Nigam.


Nigam Shah
Graduate Fellow, 
Huck Institute for Life Sciences,
Penn State University.
Ph:(814)863-5720
Web: www.personal.psu.edu/nhs109/

From raphael.bauer at informatik.hu-berlin.de  Tue Oct  7 12:24:44 2003
From: raphael.bauer at informatik.hu-berlin.de (Raphael A. Bauer)
Date: Tue Oct  7 12:22:24 2003
Subject: [Bioperl-l] bioperl-db: Updating existing Databases with
 load_seqdatabase.pl and load_ontology
Message-ID: <3F82E8CC.3050703@informatik.hu-berlin.de>

Hello bioperl-l members,

i wonder how to perform a "safe" (incremental) update on a
working biosql schema.

My setup is the following: BioSQL schema on Postgres equipped with 
Swissprot (via load_seqdatabase.pl and GeneOntology (via load_ontology.pl).

Now i simply want to feed the existing BioSQL schema with
new versions of both databases.
The question ist how to do it in a safe way - especially so that no 
links in tables are dead and all information in the relevant tables and
dbxrefs is upadated too.

There is of course the --lookup flag for both scripts (and the 
update-on-new-date.pl for SProt) - but does it take care of everything?
Or would it be better to build up the database from scratch in order
to remain in a consistent state...

Many thanks in advance,

Raphael


From lstein at cshl.edu  Tue Oct  7 12:52:26 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Tue Oct  7 12:50:14 2003
Subject: [Bioperl-l] bp 1.2.3 ppm install fails
In-Reply-To: <6.0.0.22.2.20031007092544.01cd2c70@staff.uiuc.edu>
References: <GAEDKMGOKFBLJPKCLKCCOENFDCAA.brian_osborne@cognia.com>
	<1065533538.1757.5.camel@localhost.localdomain>
	<6.0.0.22.2.20031007092544.01cd2c70@staff.uiuc.edu>
Message-ID: <200310071252.26364.lstein@cshl.edu>

More fundamentally, however, the BioPerl PPM should have no more and no less 
dependencies than the BioPerl CPAN.  We need to do it this way so that we 
have a common platform on which to evaluate bug reports, &c.

Lincoln

On Tuesday 07 October 2003 10:49 am, Chris Fields wrote:
> Scott,
>
> Randy Kobes has a number of PPM's for Windows that ActiveState doesn't
> provide, for both Perl 5.6 and 5.8.  I believe ActiveState has an automated
> build engine that will not post PPM's to the PPM depositories unless they
> successfully build (no failed tests).  The only other way to get them to
> post is to specifically ask for it on ASPN, and that may take them a
> while.  I think I picked up most of the required external modules from
> there, including GD (v 2.07).
>
> If using ppm3:
>
> rep add Kobes http://theoryx5.uwinnipeg.ca/cgi-bin/ppmserver?urn:/PPMServer
>
> I think ppm2 is similar, but I haven't used it in a while.
>
> Chris Fields
>
> At 08:32 AM 10/7/2003, you wrote:
> >Brian,
> >
> >As far as I can tell, the '-force' option in ppm (for perl 5.8) is only
> >useful for reinstalling modules that it thinks are already installed
> >(and even then, it doesn't always do that).  I'm not sure what a good
> >solution is, since I don't think the ActiveState framework allows
> >"optional" modules.   :-/
> >
> >As a side note, I did find a libxml that I could install via ppm
> >elsewhere with google, but this mess means that I have to give
> >directions to users who want to install GBrowse via ppm how to get
> >libxml even though they don't need it for GBrowse.
> >
> >Thanks,
> >Scott
> >
> >On Tue, 2003-10-07 at 07:57, Brian Osborne wrote:
> > > Scott,
> > >
> > > > As a side note, it seems like a bad idea to force the installation of
> > >
> > > every prereq module for bioperl
> > >
> > > Definitely a bad idea, that's why Bioperl doesn't do this. In the CPAN
> > > shell, for example, you can do "force install". There is also a "force
> > > install" option in ActiveState, something like setting force-install to
> > > 1? Please check the ActiveState documentation and tell me if I'm wrong.
> > >
> > > A few months ago I took a look at the "required modules" list in
> >
> > Bioperl and
> >
> > > found that many of them were not supplied by ActiveState, just as
> > > you're finding.
> > >
> > > Brian O.
> > >
> > > -----Original Message-----
> > > From: bioperl-l-bounces@portal.open-bio.org
> > > [mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Scott Cain
> > > Sent: Monday, October 06, 2003 4:07 PM
> > > To: bioperl-l@bioperl.org
> > > Cc: cain@cshl.org
> > > Subject: [Bioperl-l] bp 1.2.3 ppm install fails
> > >
> > > Hello all,
> > >
> > > First, thanks to the kind soul who provided the 1.2.3 version of
> > > bioperl as a ppm.  Unfortunately, I can't seem to get it installed due
> > > to a failed dependency.  While using AS Perl 5.8, it seems that none of
> > > the repositories know what 'libxml-perl' is, and since it is listed as
> > > a prerequisite in the ppd file, installation cannot continue.  Is there
> > > somewhere special to get that from?  At the moment, I am only searching
> > > the ActiveState and bioperl repositories.
> > >
> > > As a side note, it seems like a bad idea to force the installation of
> > > every prereq module for bioperl.  In my case, I have no need of
> > > libxml-perl, but can't install bioperl for a lack of it.
> > >
> > > Thanks,
> > > Scott
> > >
> > > ----------------------------------------------------------------------
> > > Scott Cain, Ph. D.                                       cain@cshl.org
> > > GMOD Coordinator, http://www.gmod.org/                   (216)392-3087
> > > ----------------------------------------------------------------------
> > >
> > >
> > > _______________________________________________
> > > Bioperl-l mailing list
> > > Bioperl-l@portal.open-bio.org
> > > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
> >--
> >------------------------------------------------------------------------
> >Scott Cain, Ph. D.                                         cain@cshl.org
> >GMOD Coordinator (http://www.gmod.org/)                     216-392-3087
> >Cold Spring Harbor Laboratory
> >
> >_______________________________________________
> >Bioperl-l mailing list
> >Bioperl-l@portal.open-bio.org
> >http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
> __________________________________
>
> Chris Fields - Postdoctoral Researcher
> Lab of Dr. Robert Switzer
>
> Address:
>
> University of Illinois at Urbana-Champaign
> Dept. of Biochemistry - 323 RAL
> 600 S. Mathews Ave.
> Urbana, IL 61801
>
> Phone : (217) 333-7098
> Fax : (217) 244-5858
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
========================================================================
Lincoln D. Stein                           Cold Spring Harbor Laboratory
lstein@cshl.org			                  Cold Spring Harbor, NY
========================================================================


From amackey at pcbi.upenn.edu  Tue Oct  7 14:26:58 2003
From: amackey at pcbi.upenn.edu (Aaron J. Mackey)
Date: Tue Oct  7 14:24:36 2003
Subject: [Bioperl-l] Fwd: having problems with abi.pm
Message-ID: <D5F5C725-F8F3-11D7-B8A7-000A958C5008@pcbi.upenn.edu>


I'm stumped on this one ...


> From: "Pinar Kondu" <pkondu@yahoo.com>
> Date: Tue Oct 7, 2003  12:03:02 PM US/Eastern
> To: "Aaron J. Mackey" <amackey@pcbi.upenn.edu>
> Subject: Re: having problems with abi.pm
>
> Hi again Aaron,
>
> It looks like it is working now, but I received an error message in 
> read.pm
> Since I assume I am not supposed to make any modifications in the 
> modules, I
> wanted to ask you what this message means:
>
> Here is how I ran my program:
> [root@iontek pinarperl]# perl test2.pl -a
> /home/ORTAKSKY/SEQUENCES/myfile.abi -p 
> /home/ORTAKSKY/SEQUENCES/myfileout
>
> Here is the message I got:
> Can't use an undefined value as filehandle reference at
> /usr/lib/perl5/site_perl/5.8.0/Bio/SeqIO/staden/read.pm line 115.
>
> The script is below.
>
> many thanks! -pinar
>
>
>>>>>>>
>>>>>>> #!/usr/bin/perl -w
>>>>>>>
>>>>>>> use strict;
>>>>>>> use Bio::SeqIO;
>>>>>>> use Getopt::Std;
>>>>>>>
>>>>>>> my %args;
>>>>>>> getopt("ap", \%args);
>>>>>>> if (!$args{a}) {
>>>>>>>   die "Please specify the input ABI file name!";
>>>>>>> }
>>>>>>> if (!$args{p}) {
>>>>>>>   die "Please specify the output PHD file name!";
>>>>>>> }
>>>>>>>
>>>>>>> my $ABIin = new Bio::SeqIO( -format => "ABI", -file => "$args{a}"
>>>>>>> );
>>>>>>>
>>>>>>> my $PHDout = new Bio::SeqIO( -format => "phd", -file =>
>>>>>>> ">$args{p}" );
>>>>>>>
>>>>>>> while (my $seq = $ABIin->next_seq) {
>>>>>>>     $PHDout->write_seq($seq);
>>>>>>> }
>>>>>>>
>>>>>>>
>>>>>>
>>>>>
>>>>
>>>
>

From hlapp at gnf.org  Tue Oct  7 14:55:49 2003
From: hlapp at gnf.org (Hilmar Lapp)
Date: Tue Oct  7 14:53:27 2003
Subject: [Bioperl-l] bioperl-db: Updating existing Databases with
	load_seqdatabase.pl and load_ontology
In-Reply-To: <3F82E8CC.3050703@informatik.hu-berlin.de>
Message-ID: <BBA85A45.1DFB%hlapp@gnf.org>

I don't know what you mean by 'everything'. Update is a known and known to
be completely non-trivial problem.

The load_XXX scripts themselves just issue an update to the persistent
objects (if they have been found by prior lookup). The update call as
implemented in the bioperl-db adaptors translate to SQL UPDATE statements
*only* (where undefined attributes are prevented from overriding database
column values). By 'only' I mean that there won't be any magically performed
DELETEs on entity associations as far as the adaptors are concerned.

'Updating' the associations is the main issue that makes a database update
non-trivial, because there is no SQL UPDATE equivalent: the association is
either present or absent. If this is confusing, take sequences and their
associated seqfeatures as a very simple example. If the number of features
changes for a sequence, you can't reflect that by issuing a SQL UPDATE
against some table, because seqfeature and bioentry are two tables connected
by a foreign key. You either a) figure out which features are modified,
added, and deleted, and correspondingly update, insert, and delete those, or
b) you delete all existing features and insert all from the updated sequence
entry from scratch.

There are arguments for and against both a) and b). The approach of the
load_XXX scripts is to leave this decision to the user. It is the script
supplied via --mergeobjs that does the magic if any. A couple of (working)
example scripts for this purpose come with the package in the scripts/biosql
directory, and if you check them out you'll see exactly what's going on (as
there is no magic anywhere else other than cascading plain UPDATEs):

    freshen-annot.pl    # replace all old annotations with the new
annotations regardless
    merge-unique-ann.pl # leave old annotation untouched, add on new that
are different from any of the old annotations
    update-on-new-version.pl # like freshen, but only for bioentries that
changed version
    update-on-new-date.pl    # like freshen, but only for bioentries that
have a more recent date

Note that databanks not only update and add entries, they also obsolete
entries. This is not at all handled by load_seqdatabase.pl.

So, returning to your question, it depends on your situation. If you are
never going to add your own annotations, purging and re-populating is
guaranteed to leave you in a state consistent with the datasource, but is
likely to be slower than a smart update (smart meaning update only if the
entry indeed changed). If you add your own annotations, purging the content
is not an option, and you need to decide how to merge an existing entry with
its updated counterpart using either one of the aforementioned scriptlets,
or one you write yourself. Similarly for ontology update: if you want to
update ontologies w/o updating everything else too, purging is not an
option, and you'll need to decide how you want to merge.

Merging entries currently leaves you with possibly obsoleted bioentries
hanging around in your biosql instance. I have this very problem myself and
can't present a good solution yet as I'll just start to address it within
the next few weeks.

Hth.

    -hilmar

On 10/7/03 9:24 AM, "Raphael A. Bauer"
<raphael.bauer@informatik.hu-berlin.de> wrote:

> Hello bioperl-l members,
> 
> i wonder how to perform a "safe" (incremental) update on a
> working biosql schema.
> 
> My setup is the following: BioSQL schema on Postgres equipped with
> Swissprot (via load_seqdatabase.pl and GeneOntology (via load_ontology.pl).
> 
> Now i simply want to feed the existing BioSQL schema with
> new versions of both databases.
> The question ist how to do it in a safe way - especially so that no
> links in tables are dead and all information in the relevant tables and
> dbxrefs is upadated too.
> 
> There is of course the --lookup flag for both scripts (and the
> update-on-new-date.pl for SProt) - but does it take care of everything?
> Or would it be better to build up the database from scratch in order
> to remain in a consistent state...
> 
> Many thanks in advance,
> 
> Raphael
> 
> 
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From lstein at cshl.edu  Tue Oct  7 19:18:16 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Tue Oct  7 19:18:47 2003
Subject: [Bioperl-l] seqIO
In-Reply-To: <653C572D8746D341AAA25C494CDFB9E338001E@MBXSRV05.stf.nus.edu.sg>
References: <653C572D8746D341AAA25C494CDFB9E338001E@MBXSRV05.stf.nus.edu.sg>
Message-ID: <200310071918.16412.lstein@cshl.edu>

If you have an array of sequences then you can just create them using 
Bio::Seq:

	@bioperl_sequences = map {Bio::Seq->new(-seq=>$_)} @raw_seq;

Lincoln


On Tuesday 07 October 2003 12:45 am, Desmond Lim wrote:
> > What made you think it would accept a -ph parameter? Besides, if
> > there was
> > one you'd have to provide an array reference.
>
> I didn't think it would work but I wanted to try it out.
>
> What I'm trying to do is get SeqIO to accept an array of sequences and not
> a file.
>
> Desmond
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)
From btclimd at nus.edu.sg  Tue Oct  7 21:16:22 2003
From: btclimd at nus.edu.sg (Desmond Lim)
Date: Tue Oct  7 21:14:05 2003
Subject: [Bioperl-l] seqIO
Message-ID: <653C572D8746D341AAA25C494CDFB9E3380020@MBXSRV05.stf.nus.edu.sg>

Hi Lincoln,

Thanks but what am I to do after this?

I have a file that contains EMBL sequences. And I'm reading it into an array, manipulating it and then print it a file in fasta format.

I know how to use the:

my $seq_in = Bio::SeqIO->new('-file' => filename1, '-format' => 'embl');
my $seq_out = Bio::SeqIO->new('-file' => filename2, '-format' => 'fasta'); 

while ( my $inseq = $seq_in->next_seq ) {
  $seq_out->write_seq($inseq);
}

But I don't have a file for filename1 but an array. Is there a way for reading the array (the manipulated embl one) and then write it as a fasta file?

Thanks.

-----Original Message-----
From: Lincoln Stein [mailto:lstein@cshl.edu]
Sent: Wednesday, October 08, 2003 7:18 AM
To: Desmond Lim; Hilmar Lapp; BioPerl
Subject: Re: [Bioperl-l] seqIO


If you have an array of sequences then you can just create them using 
Bio::Seq:

	@bioperl_sequences = map {Bio::Seq->new(-seq=>$_)} @raw_seq;

Lincoln


On Tuesday 07 October 2003 12:45 am, Desmond Lim wrote:
> > What made you think it would accept a -ph parameter? Besides, if
> > there was
> > one you'd have to provide an array reference.
>
> I didn't think it would work but I wanted to try it out.
>
> What I'm trying to do is get SeqIO to accept an array of sequences and not
> a file.
>
> Desmond
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)

From shawnh at stanford.edu  Tue Oct  7 21:46:41 2003
From: shawnh at stanford.edu (Shawn Hoon)
Date: Tue Oct  7 21:40:33 2003
Subject: [Bioperl-l] seqIO
In-Reply-To: <653C572D8746D341AAA25C494CDFB9E3380020@MBXSRV05.stf.nus.edu.sg>
References: <653C572D8746D341AAA25C494CDFB9E3380020@MBXSRV05.stf.nus.edu.sg>
Message-ID: <436625B0-F931-11D7-A809-000A95783436@stanford.edu>


> Hi Lincoln,
>
> Thanks but what am I to do after this?
>
> I have a file that contains EMBL sequences. And I'm reading it into an 
> array, manipulating it and then print it a file in fasta format.
>
> I know how to use the:
>
> my $seq_in = Bio::SeqIO->new('-file' => filename1, '-format' => 
> 'embl');
> my $seq_out = Bio::SeqIO->new('-file' => filename2, '-format' => 
> 'fasta');
>
> while ( my $inseq = $seq_in->next_seq ) {
>   $seq_out->write_seq($inseq);
> }
>
> But I don't have a file for filename1 but an array. Is there a way for 
> reading the array (the manipulated embl one) and then write it as a 
> fasta file?
>

Uhm I don't get it. Once you have read the sequences into memory (i.e. 
into the array) you have Seq objects which SeqIO objects
can take regardless of the source format, embl/genbank etc so you can 
pass it to SeqIO in a loop like so:

my $seq_in = Bio::SeqIO->new('-file' => filename1, '-format' => 'embl');
my @array;

while ( my $inseq = $seq_in->next_seq ) {
  push @array, $inseq;
}

#do some manipulation here...


my $sout = Bio::SeqIO->new(-file=>">output.fa",-format=>'fasta');
foreach my $s(@array){
	$sout->write_seq($s);
}


> Thanks.
>
> -----Original Message-----
> From: Lincoln Stein [mailto:lstein@cshl.edu]
> Sent: Wednesday, October 08, 2003 7:18 AM
> To: Desmond Lim; Hilmar Lapp; BioPerl
> Subject: Re: [Bioperl-l] seqIO
>
>
> If you have an array of sequences then you can just create them using
> Bio::Seq:
>
> 	@bioperl_sequences = map {Bio::Seq->new(-seq=>$_)} @raw_seq;
>
> Lincoln
>
>
> On Tuesday 07 October 2003 12:45 am, Desmond Lim wrote:
>>> What made you think it would accept a -ph parameter? Besides, if
>>> there was
>>> one you'd have to provide an array reference.
>>
>> I didn't think it would work but I wanted to try it out.
>>
>> What I'm trying to do is get SeqIO to accept an array of sequences 
>> and not
>> a file.
>>
>> Desmond
>>
>> _______________________________________________
>> Bioperl-l mailing list
>> Bioperl-l@portal.open-bio.org
>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
> -- 
> Lincoln Stein
> lstein@cshl.edu
> Cold Spring Harbor Laboratory
> 1 Bungtown Road
> Cold Spring Harbor, NY 11724
> (516) 367-8380 (voice)
> (516) 367-8389 (fax)
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
-shawn

From jason at cgt.duhs.duke.edu  Tue Oct  7 21:43:55 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Oct  7 21:41:22 2003
Subject: [Bioperl-l] seqIO
In-Reply-To: <653C572D8746D341AAA25C494CDFB9E3380020@MBXSRV05.stf.nus.edu.sg>
References: <653C572D8746D341AAA25C494CDFB9E3380020@MBXSRV05.stf.nus.edu.sg>
Message-ID: <Pine.LNX.4.50.0310072141340.6423-100000@tenero.duhs.duke.edu>

As has already been suggest on the list - use IO::String.

You don't have Bioperl Bio::Seq sequences yet, but an array of
strings which are EMBL format sequence data.

See the FAQ -
  http://www.bioperl.org/Core/Latest/faq.html#Q2.4
(hint - turn the array back into a single string with join )

-jason
On Wed, 8 Oct 2003, Desmond Lim wrote:

> Hi Lincoln,
>
> Thanks but what am I to do after this?
>
> I have a file that contains EMBL sequences. And I'm reading it into an
> array, manipulating it and then print it a file in fasta format.
>
> I know how to use the:
>
> my $seq_in = Bio::SeqIO->new('-file' => filename1, '-format' => 'embl');
> my $seq_out = Bio::SeqIO->new('-file' => filename2, '-format' => 'fasta');
>
> while ( my $inseq = $seq_in->next_seq ) {
>   $seq_out->write_seq($inseq);
> }
>
> But I don't have a file for filename1 but an array. Is there a way for reading the array (the manipulated embl one) and then write it as a fasta file?
>
> Thanks.
>
> -----Original Message-----
> From: Lincoln Stein [mailto:lstein@cshl.edu]
> Sent: Wednesday, October 08, 2003 7:18 AM
> To: Desmond Lim; Hilmar Lapp; BioPerl
> Subject: Re: [Bioperl-l] seqIO
>
>
> If you have an array of sequences then you can just create them using
> Bio::Seq:
>
> 	@bioperl_sequences = map {Bio::Seq->new(-seq=>$_)} @raw_seq;
>
> Lincoln
>
>
> On Tuesday 07 October 2003 12:45 am, Desmond Lim wrote:
> > > What made you think it would accept a -ph parameter? Besides, if
> > > there was
> > > one you'd have to provide an array reference.
> >
> > I didn't think it would work but I wanted to try it out.
> >
> > What I'm trying to do is get SeqIO to accept an array of sequences and not
> > a file.
> >
> > Desmond
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From lstein at cshl.edu  Wed Oct  8 10:18:38 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Wed Oct  8 10:16:31 2003
Subject: Asking for help on the bioperl mailing list.  WAS: [Bioperl-l] seqIO
In-Reply-To: <653C572D8746D341AAA25C494CDFB9E3380020@MBXSRV05.stf.nus.edu.sg>
References: <653C572D8746D341AAA25C494CDFB9E3380020@MBXSRV05.stf.nus.edu.sg>
Message-ID: <200310081018.38994.lstein@cshl.edu>

Hi Desmond and Others who Write to this Mailing List for Help,

The bioperl mailing list welcomes requests for help from programmers of all 
levels of experience.  However, as the recent thread indicates, a great deal 
of unhelpful and redundant traffic can be generated when the question is 
vague.

It would really help us all if next time you state the problem explicitly and 
concisely, as in "I have a file that contains EMBL sequences, I've read them 
into an array and want to print the results out as FASTA format."  Note that 
this is still underspecified because some people will understand this to mean 
that you've created an array in which each element is a line from the EMBL 
file, while others will think you've created one element per EMBL entry.  But 
it's much better than showing us a fragment of nonfunctional code taken out 
of context and then asking us "why doesn't bioperl work the way I think it 
should."

Also your problem raises another question.  Why did you read the EMBL file 
into a string array rather than using Bio::SeqIO from the beginning to do it?

Lincoln

On Tuesday 07 October 2003 09:16 pm, Desmond Lim wrote:
> Hi Lincoln,
>
> Thanks but what am I to do after this?
>
> I have a file that contains EMBL sequences. And I'm reading it into an
> array, manipulating it and then print it a file in fasta format.
>
> I know how to use the:
>
> my $seq_in = Bio::SeqIO->new('-file' => filename1, '-format' => 'embl');
> my $seq_out = Bio::SeqIO->new('-file' => filename2, '-format' => 'fasta');
>
> while ( my $inseq = $seq_in->next_seq ) {
>   $seq_out->write_seq($inseq);
> }
>
> But I don't have a file for filename1 but an array. Is there a way for
> reading the array (the manipulated embl one) and then write it as a fasta
> file?
>
> Thanks.
>
> -----Original Message-----
> From: Lincoln Stein [mailto:lstein@cshl.edu]
> Sent: Wednesday, October 08, 2003 7:18 AM
> To: Desmond Lim; Hilmar Lapp; BioPerl
> Subject: Re: [Bioperl-l] seqIO
>
>
> If you have an array of sequences then you can just create them using
> Bio::Seq:
>
> 	@bioperl_sequences = map {Bio::Seq->new(-seq=>$_)} @raw_seq;
>
> Lincoln
>
> On Tuesday 07 October 2003 12:45 am, Desmond Lim wrote:
> > > What made you think it would accept a -ph parameter? Besides, if
> > > there was
> > > one you'd have to provide an array reference.
> >
> > I didn't think it would work but I wanted to try it out.
> >
> > What I'm trying to do is get SeqIO to accept an array of sequences and
> > not a file.
> >
> > Desmond
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
========================================================================
Lincoln D. Stein                           Cold Spring Harbor Laboratory
lstein@cshl.org			                  Cold Spring Harbor, NY
========================================================================


From hlapp at gmx.net  Wed Oct  8 12:40:15 2003
From: hlapp at gmx.net (Hilmar Lapp)
Date: Wed Oct  8 12:37:57 2003
Subject: Asking for help on the bioperl mailing list. WAS: [Bioperl-l]
	seqIO
In-Reply-To: <200310081018.38994.lstein@cshl.edu>
Message-ID: <1861501A-F9AE-11D7-A595-000A959EB4C4@gmx.net>

I agree wholeheartedly. The better the question is formulated the  
better your answer is going to be. It's not productive to waste  
people's time on figuring out what you wanted to ask in the first place.

There is such a thing as doing your homework first. Good and quick  
answers don't come for free even though it may seem so sometimes.

	-hilmar

On Wednesday, October 8, 2003, at 07:18  AM, Lincoln Stein wrote:

> Hi Desmond and Others who Write to this Mailing List for Help,
>
> The bioperl mailing list welcomes requests for help from programmers  
> of all
> levels of experience.  However, as the recent thread indicates, a  
> great deal
> of unhelpful and redundant traffic can be generated when the question  
> is
> vague.
>
> It would really help us all if next time you state the problem  
> explicitly and
> concisely, as in "I have a file that contains EMBL sequences, I've  
> read them
> into an array and want to print the results out as FASTA format."   
> Note that
> this is still underspecified because some people will understand this  
> to mean
> that you've created an array in which each element is a line from the  
> EMBL
> file, while others will think you've created one element per EMBL  
> entry.  But
> it's much better than showing us a fragment of nonfunctional code  
> taken out
> of context and then asking us "why doesn't bioperl work the way I  
> think it
> should."
>
> Also your problem raises another question.  Why did you read the EMBL  
> file
> into a string array rather than using Bio::SeqIO from the beginning to  
> do it?
>
> Lincoln
>
> On Tuesday 07 October 2003 09:16 pm, Desmond Lim wrote:
>> Hi Lincoln,
>>
>> Thanks but what am I to do after this?
>>
>> I have a file that contains EMBL sequences. And I'm reading it into an
>> array, manipulating it and then print it a file in fasta format.
>>
>> I know how to use the:
>>
>> my $seq_in = Bio::SeqIO->new('-file' => filename1, '-format' =>  
>> 'embl');
>> my $seq_out = Bio::SeqIO->new('-file' => filename2, '-format' =>  
>> 'fasta');
>>
>> while ( my $inseq = $seq_in->next_seq ) {
>>   $seq_out->write_seq($inseq);
>> }
>>
>> But I don't have a file for filename1 but an array. Is there a way for
>> reading the array (the manipulated embl one) and then write it as a  
>> fasta
>> file?
>>
>> Thanks.
>>
>> -----Original Message-----
>> From: Lincoln Stein [mailto:lstein@cshl.edu]
>> Sent: Wednesday, October 08, 2003 7:18 AM
>> To: Desmond Lim; Hilmar Lapp; BioPerl
>> Subject: Re: [Bioperl-l] seqIO
>>
>>
>> If you have an array of sequences then you can just create them using
>> Bio::Seq:
>>
>> 	@bioperl_sequences = map {Bio::Seq->new(-seq=>$_)} @raw_seq;
>>
>> Lincoln
>>
>> On Tuesday 07 October 2003 12:45 am, Desmond Lim wrote:
>>>> What made you think it would accept a -ph parameter? Besides, if
>>>> there was
>>>> one you'd have to provide an array reference.
>>>
>>> I didn't think it would work but I wanted to try it out.
>>>
>>> What I'm trying to do is get SeqIO to accept an array of sequences  
>>> and
>>> not a file.
>>>
>>> Desmond
>>>
>>> _______________________________________________
>>> Bioperl-l mailing list
>>> Bioperl-l@portal.open-bio.org
>>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
> --  
> ======================================================================= 
> =
> Lincoln D. Stein                           Cold Spring Harbor  
> Laboratory
> lstein@cshl.org			                  Cold Spring Harbor, NY
> ======================================================================= 
> =
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From Daniel.Lang at biologie.uni-freiburg.de  Wed Oct  8 14:05:04 2003
From: Daniel.Lang at biologie.uni-freiburg.de (Daniel Lang)
Date: Wed Oct  8 14:02:40 2003
Subject: [Bioperl-l] HTMLResultWriter questions
Message-ID: <3F8451D0.70707@biologie.uni-freiburg.de>

Hi,

I have quite some problems with generating BLAST HTML reports with local
database links and own layout...

All the examples I found about HTMLResultWriter cover filtering of
results, but I haven?t found anything about the usage of the
"hit_link_desc" method etc.
Does anyone have some example code for me?
Thanx

-- 
Daniel Lang
University of Freiburg, Plant Biotechnology
Sonnenstr. 5, D-79104 Freiburg
phone: +49 761 203 6988
homepage:  http://www.plant-biotech.net/
e-mail: daniel.lang@biologie.uni-freiburg.de

#################################################
 >REALITY.SYS corrupted: Reboot universe? (Y/N/A)
#################################################

From hlapp at gnf.org  Wed Oct  8 14:54:04 2003
From: hlapp at gnf.org (Hilmar Lapp)
Date: Wed Oct  8 14:51:41 2003
Subject: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl
In-Reply-To: <BAEPLGNHBBCGIFPHMAHLGELFCCAA.robert.roth@home.se>
Message-ID: <BBA9AB5C.1E48%hlapp@gnf.org>

This has nothing to do with the input file but everything with run-time
loading of modules in perl on Windows. The error you get is generated by the
following line of code:

    $self->debug("attempting to load driver for adaptor class $class\n");

The class Bio::DB::BioSQL::BasePersistenceAdaptor.pm has a 'use
Bio::Root::Root' statement, it inherits directly from Bio::Root::Root, and
Bio::Root::Root does have a debug() method as you can (and should) easily
convince yourself of.

Have you run the bioperl-db test suite? If so, what was the result? If not,
run it.

So, I'm copying the bioperl list here in case somebody has more clues as to
why you might be seeing this error on WinXP. We've had a case before on the
list where it turned out eventually that there had been a version mix-up on
the machine and everything worked fine after re-installing from scratch.

    -hilmar

On 10/8/03 4:24 AM, "Robert Roth" <robert.roth@home.se> wrote:

> 
> Hello,
> 
> I'm using Mysql 4.0.15, Bioperl 1.2.3, bioperl-db from the cvs and perl
> 5.6.1 on WinXP.
> After installation I pre-loaded the the biosql database with the NCBI taxon
> database using
> load_ncbi_taxonomy.pl. Now when I try to load sequence data with
> load_seqdatabase.pl I get
> the following error,
> 
> perl load_seqdatabase.pl --host localhost --dbname biosql --dbus
> er root --namespace bioperl --debug --safe --testonly --format
> genbank test.genbank
> Loading test.genbank ...
> attempting to load adaptor class for Bio::Seq::RichSeq
>       attempting to load module Bio::DB::BioSQL::RichSeqAdaptor
> attempting to load adaptor class for Bio::Seq
>       attempting to load module Bio::DB::BioSQL::SeqAdaptor
> instantiating adaptor class Bio::DB::BioSQL::SeqAdaptor
> attempting to load adaptor class for Bio::SeqFeature::Generic
>       attempting to load module Bio::DB::BioSQL::GenericAdaptor
> attempting to load adaptor class for Bio::Root::Root
>       attempting to load module Bio::DB::BioSQL::RootAdaptor
> Undefined subroutine &Bio::Root::Root::debug called at
> E:/Perl/site/lib/Bio/DB/B
> ioSQL/BasePersistenceAdaptor.pm line 1502, <GEN0> line 41.
> 
> I've tried different formats (swiss and genbank) and different files with
> the same
> result. The line number given in the error always correspond to the end of
> the first
> record (and it doesn't matter if the file contains one or more records).
> Any ideas?
> 
> Many thanks in advance,
> 
> Robert
> 
> 
> _______________________________________________
> BioSQL-l mailing list
> BioSQL-l@open-bio.org
> http://open-bio.org/mailman/listinfo/biosql-l
> 

-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From ajm6q at virginia.edu  Wed Oct  8 15:59:30 2003
From: ajm6q at virginia.edu (Aaron J Mackey)
Date: Wed Oct  8 15:57:08 2003
Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
In-Reply-To: <Pine.LNX.4.50.0310061050480.27277-100000@tenero.duhs.duke.edu>
Message-ID: <Pine.OSF.4.33.0310081556120.15105-100000@alpha10.bioch.virginia.edu>


On Mon, 6 Oct 2003, Jason Stajich wrote:

> When I get together with Aaron at the end of the month I will try and go
> through it with him so I understand what is supposed to happen and/or how
> we might reorg bioperl-ext to be less problematic.

The problem (over-and-over-again) has been that while people think they
have the staden io-lib successfully installed, they don't (because they
have to manually do a few things that the io-lib install doesn't do for
them).  It's a royal pain, of course.  And then they try to install
bioperl-ext, which doesn't successfully complete (you can, btw, install
only parts of bioperl-ext, so the whole staden read lib stuff shouldn't
prevent someone from installing the other alignment code).

Then, if they ignore that and try to run the normal bioperl test suite,
they get various odd cryptic errors about read not being successfully
installed.

-Aaron

From hlapp at gnf.org  Wed Oct  8 20:00:41 2003
From: hlapp at gnf.org (Hilmar Lapp)
Date: Wed Oct  8 19:58:19 2003
Subject: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl
In-Reply-To: <BAEPLGNHBBCGIFPHMAHLGELICCAA.robert.roth@home.se>
Message-ID: <BBA9F339.1E76%hlapp@gnf.org>

This is definitely related to the perl setup I'd claim. Have you been able
to run the bioperl test suite without similar trouble?

Again, I'm cc'ing the bioperl list in case anybody has seen this also and
has clues as to what the cause is and how to remedy it.
 
    -hilmar
 
On 10/8/03 3:18 PM, "Robert Roth" <robert.roth@home.se> wrote:

> Hi,
> 
> I've tried to run the test suite and most tests failed with outputs like the
> one below. Only the
> test_dbadaptor was succesful and but also there a lot of comments about
> redefined subroutines like the ones below were generated.
> I will reinstall everything tomorrow to see if that helps.
> 
> /Robert
> 
> E:\temp\bioperl-db>e:\Perl\bin\perl.exe -Iblib\arch -Ilib\lib -IE:\perl\lib
> -IE:
> Perl\lib -e "use Test::Harness qw(&runtests $verbose); $verbose=0; runtests
> @ARG
> V;" t/comment.t
> t/comment...........ok 2/11Subroutine new redefined at
> e:/Perl/site/lib/Bio\Seq\
> RichSeq.pm line 112, <GEN0> line 72.
> Subroutine division redefined at e:/Perl/site/lib/Bio\Seq\RichSeq.pm line
> 172, <
> GEN0> line 72.
> Subroutine molecule redefined at e:/Perl/site/lib/Bio\Seq\RichSeq.pm line
> 193, <
> GEN0> line 72.
> Subroutine add_date redefined at e:/Perl/site/lib/Bio\Seq\RichSeq.pm line
> 215, <
> GEN0> line 72.
> Subroutine get_dates redefined at e:/Perl/site/lib/Bio\Seq\RichSeq.pm line
> 234,
> <GEN0> line 72.
> Subroutine pid redefined at e:/Perl/site/lib/Bio\Seq\RichSeq.pm line 253,
> <GEN0>
> line 72.
> Subroutine accession redefined at e:/Perl/site/lib/Bio\Seq\RichSeq.pm line
> 279,
> <GEN0> line 72.
> Subroutine add_secondary_accession redefined at
> e:/Perl/site/lib/Bio\Seq\RichSeq
> .pm line 296, <GEN0> line 72.
> Subroutine get_secondary_accessions redefined at
> e:/Perl/site/lib/Bio\Seq\RichSe
> q.pm line 315, <GEN0> line 72.
> Subroutine seq_version redefined at e:/Perl/site/lib/Bio\Seq\RichSeq.pm line
> 332
> , <GEN0> line 72.
> Subroutine keywords redefined at e:/Perl/site/lib/Bio\Seq\RichSeq.pm line
> 353, <
> GEN0> line 72.
> Subroutine each_date redefined at e:/Perl/site/lib/Bio\Seq\RichSeq.pm line
> 369,
> <GEN0> line 72.
> Subroutine each_secondary_accession redefined at
> e:/Perl/site/lib/Bio\Seq\RichSe
> q.pm line 376, <GEN0> line 72.
> Subroutine sv redefined at e:/Perl/site/lib/Bio\Seq\RichSeq.pm line 383,
> <GEN0>
> line 72.
> Subroutine new redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm line
> 146,
> <GEN0> line 72.
> Subroutine set_attributes redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.p
> m line 188, <GEN0> line 72.
> Subroutine direct_new redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm li
> ne 253, <GEN0> line 72.
> Subroutine location redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line
> 274, <GEN0> line 72.
> Subroutine start redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line 30
> 4, <GEN0> line 72.
> Subroutine end redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm line
> 321,
> <GEN0> line 72.
> Subroutine length redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line 3
> 38, <GEN0> line 72.
> Subroutine strand redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line 3
> 55, <GEN0> line 72.
> Subroutine score redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line 37
> 2, <GEN0> line 72.
> Subroutine frame redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line 39
> 7, <GEN0> line 72.
> Subroutine primary_tag redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm l
> ine 423, <GEN0> line 72.
> Subroutine source_tag redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm li
> ne 444, <GEN0> line 72.
> Subroutine has_tag redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line
> 465, <GEN0> line 72.
> Subroutine add_tag_value redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line 480, <GEN0> line 72.
> Subroutine get_tag_values redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.p
> m line 499, <GEN0> line 72.
> Subroutine get_all_tags redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line 521, <GEN0> line 72.
> Subroutine remove_tag redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm li
> ne 537, <GEN0> line 72.
> Subroutine attach_seq redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm li
> ne 562, <GEN0> line 72.
> Subroutine seq redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm line
> 592,
> <GEN0> line 72.
> Subroutine entire_seq redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm li
> ne 633, <GEN0> line 72.
> Subroutine seq_id redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line 6
> 58, <GEN0> line 72.
> Subroutine display_name redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line 677, <GEN0> line 72.
> Subroutine annotation redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm li
> ne 702, <GEN0> line 72.
> Subroutine get_SeqFeatures redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.
> pm line 733, <GEN0> line 72.
> Subroutine add_SeqFeature redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.p
> m line 763, <GEN0> line 72.
> Subroutine remove_SeqFeatures redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Gener
> ic.pm line 800, <GEN0> line 72.
> Subroutine gff_format redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm li
> ne 832, <GEN0> line 72.
> Subroutine gff_string redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm li
> ne 862, <GEN0> line 72.
> Subroutine slurp_gff_file redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.p
> m line 885, <GEN0> line 72.
> Subroutine _from_gff_string redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic
> .pm line 921, <GEN0> line 72.
> Subroutine _expand_region redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.p
> m line 943, <GEN0> line 72.
> Subroutine _parse redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line 9
> 73, <GEN0> line 72.
> Subroutine _tag_value redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm li
> ne 991, <GEN0> line 72.
> Subroutine seqname redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line
> 1006, <GEN0> line 72.
> Subroutine display_id redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.pm li
> ne 1012, <GEN0> line 72.
> Subroutine each_tag_value redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.p
> m line 1019, <GEN0> line 72.
> Subroutine all_tags redefined at e:/Perl/site/lib/Bio\SeqFeature\Generic.pm
> line
> 1020, <GEN0> line 72.
> Subroutine sub_SeqFeature redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Generic.p
> m line 1024, <GEN0> line 72.
> Subroutine add_sub_SeqFeature redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Gener
> ic.pm line 1025, <GEN0> line 72.
> Subroutine flush_sub_SeqFeatures redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Ge
> neric.pm line 1026, <GEN0> line 72.
> Subroutine flush_sub_SeqFeature redefined at
> e:/Perl/site/lib/Bio\SeqFeature\Gen
> eric.pm line 1028, <GEN0> line 72.
> Subroutine new redefined at e:/Perl/site/lib/Bio\Root\Root.pm line 195,
> <GEN0> l
> ine 72.
> Subroutine verbose redefined at e:/Perl/site/lib/Bio\Root\Root.pm line 228,
> <GEN
> 0> line 72.
> Subroutine _register_for_cleanup redefined at
> e:/Perl/site/lib/Bio\Root\Root.pm
> line 240, <GEN0> line 72.
> Subroutine _unregister_for_cleanup redefined at
> e:/Perl/site/lib/Bio\Root\Root.p
> m line 250, <GEN0> line 72.
> Subroutine _cleanup_methods redefined at e:/Perl/site/lib/Bio\Root\Root.pm
> line
> 257, <GEN0> line 72.
> Subroutine throw redefined at e:/Perl/site/lib/Bio\Root\Root.pm line 307,
> <GEN0>
> line 72.
> Subroutine debug redefined at e:/Perl/site/lib/Bio\Root\Root.pm line 370,
> <GEN0>
> line 72.
> Subroutine _load_module redefined at e:/Perl/site/lib/Bio\Root\Root.pm line
> 390,
> <GEN0> line 72.
> Subroutine DESTROY redefined at e:/Perl/site/lib/Bio\Root\Root.pm line 416,
> <GEN
> 0> line 72.
>       (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
> at e:
> /Perl/site/lib/Bio\DB\BioSQL\DBAdaptor.pm line 173, <GEN0> line 72.
> Undefined subroutine &Bio::Root::Root::debug called at
> e:/Perl/site/lib/Bio/DB/B
> ioSQL/BasePersistenceAdaptor.pm line 689, <GEN0> line 72.
>       (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called.
>       (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
> at e:
> /Perl/site/lib/Bio/DB/BioSQL/BasePersistenceAdaptor.pm line 1756 during
> global d
> estruction.
>       (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
> durin
> g global destruction.
>       (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
> at e:
> /Perl/site/lib/Bio/DB/DBI/base.pm line 411 during global destruction.
>       (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
> durin
> g global destruction.
>       (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
> durin
> g global destruction.
> t/comment...........dubious
>       Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-11
>       Failed 9/11 tests, 18.18% okay
> Failed Test Status Wstat Total Fail  Failed  List of Failed
> ----------------------------------------------------------------------------
> ----
> 
> t/comment.t      255 65280    11    9  81.82%  3-11
> Failed 1/1 test scripts, 0.00% okay. 9/11 subtests failed, 18.18% okay.
> 
> -----Original Message-----
> From: Hilmar Lapp [mailto:hlapp@gnf.org]
> Sent: 08 October 2003 20:54
> To: Robert Roth; OBDA BioSQL
> Cc: BioPerl
> Subject: Re: [BioSQL-l] Error using load_seqdatabase.pl
> 
> 
> This has nothing to do with the input file but everything with run-time
> loading of modules in perl on Windows. The error you get is generated by the
> following line of code:
> 
>   $self->debug("attempting to load driver for adaptor class $class\n");
> 
> The class Bio::DB::BioSQL::BasePersistenceAdaptor.pm has a 'use
> Bio::Root::Root' statement, it inherits directly from Bio::Root::Root, and
> Bio::Root::Root does have a debug() method as you can (and should) easily
> convince yourself of.
> 
> Have you run the bioperl-db test suite? If so, what was the result? If not,
> run it.
> 
> So, I'm copying the bioperl list here in case somebody has more clues as to
> why you might be seeing this error on WinXP. We've had a case before on the
> list where it turned out eventually that there had been a version mix-up on
> the machine and everything worked fine after re-installing from scratch.
> 
>   -hilmar
> 
> On 10/8/03 4:24 AM, "Robert Roth" <robert.roth@home.se> wrote:
> 
>> 
>> Hello,
>> 
>> I'm using Mysql 4.0.15, Bioperl 1.2.3, bioperl-db from the cvs and perl
>> 5.6.1 on WinXP.
>> After installation I pre-loaded the the biosql database with the NCBI
> taxon
>> database using
>> load_ncbi_taxonomy.pl. Now when I try to load sequence data with
>> load_seqdatabase.pl I get
>> the following error,
>> 
>> perl load_seqdatabase.pl --host localhost --dbname biosql --dbus
>> er root --namespace bioperl --debug --safe --testonly --format
>> genbank test.genbank
>> Loading test.genbank ...
>> attempting to load adaptor class for Bio::Seq::RichSeq
>>       attempting to load module Bio::DB::BioSQL::RichSeqAdaptor
>> attempting to load adaptor class for Bio::Seq
>>       attempting to load module Bio::DB::BioSQL::SeqAdaptor
>> instantiating adaptor class Bio::DB::BioSQL::SeqAdaptor
>> attempting to load adaptor class for Bio::SeqFeature::Generic
>>       attempting to load module Bio::DB::BioSQL::GenericAdaptor
>> attempting to load adaptor class for Bio::Root::Root
>>       attempting to load module Bio::DB::BioSQL::RootAdaptor
>> Undefined subroutine &Bio::Root::Root::debug called at
>> E:/Perl/site/lib/Bio/DB/B
>> ioSQL/BasePersistenceAdaptor.pm line 1502, <GEN0> line 41.
>> 
>> I've tried different formats (swiss and genbank) and different files with
>> the same
>> result. The line number given in the error always correspond to the end of
>> the first
>> record (and it doesn't matter if the file contains one or more records).
>> Any ideas?
>> 
>> Many thanks in advance,
>> 
>> Robert
>> 
>> 
>> _______________________________________________
>> BioSQL-l mailing list
>> BioSQL-l@open-bio.org
>> http://open-bio.org/mailman/listinfo/biosql-l
>> 
> 
> --
> -------------------------------------------------------------
> Hilmar Lapp                            email: lapp at gnf.org
> GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
> -------------------------------------------------------------
> 
> 
> 
> 
> 
> 
> 

-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From wafzig at axaron.com  Thu Oct  9 04:21:10 2003
From: wafzig at axaron.com (Oliver Wafzig)
Date: Thu Oct  9 04:18:42 2003
Subject: [Bioperl-l] HTMLResultWriter questions
In-Reply-To: <3F8451D0.70707@biologie.uni-freiburg.de>
References: <3F8451D0.70707@biologie.uni-freiburg.de>
Message-ID: <200310091021.10195.wafzig@axaron.com>

On Wednesday 08 October 2003 20:05, Daniel Lang wrote:
Hi Daniel,

Generating local database links:
Method 1: call your HTMLResultWriter like this

		 my $htmlwriter =  
			new Bio::SearchIO::Writer::HTMLResultWriter('foo1%s','foo2%s' );
		where 'foo1' points to your protein-db source and 'foo2' points to your
		nucleotide db.

Methode 2: write your own 'hit_link' method and pass it as a coderef to 
"hit_link_desc" like  
 	 my $htmlwriter =
		new Bio::SearchIO::Writer::HTMLResultWriter('bla1%s','bla2%s' );
 	 $htmlwriter->hit_link_desc(\&my_hit_link);

You can use Bio::SearchIO::Writer::HTMLResultWriter::default_hit_link_desc as 
a starting piont for your own method.

Here is some code, linking all 'Hits' starting with '>gnl|database|Acc' to one 
URL, while all others ('>emb|....' or '>sw|....') will be fetched with our 
local BioRS system.   

Oliver
 
######################################################

#!/usr/bin/perl -w
use strict; 
use Bio::SearchIO; 
use Bio::SearchIO::Writer::HTMLResultWriter;

my %dbsource = ();
my $biors = 
"http://pink:4700/searchtool/searchtool.cgi?start_guest_session=1&request=query_adv&file=results&out_format=fasta&query=%s";
my $fastacmd = "http://pink/blast2ncbi/dispfastaseq?%s:%s:%s:%s";
$dbsource{"emb"} = "{embl_nest,embl_est}:[([AccNumber+EQ+text:%s;])]";
$dbsource{"sw"}  = "{swissprot,swissnew}:[([AccNumber+EQ+text:%s;])]";

die "no infile" unless defined ($ARGV[0]) && (-f $ARGV[0]);
my $infile = $ARGV[0];

my $in = new Bio::SearchIO(-format => 'blast',
			   -file => $infile); 

my $result = $in->next_result;
my $htmlwriter = new Bio::SearchIO::Writer::HTMLResultWriter();
$htmlwriter->hit_link_desc(\&my_hit_link);
$htmlwriter->hit_link_align(\&my_hit_link);
my $outhtml = new Bio::SearchIO(-writer => $htmlwriter,
				-file => ">searchio.html");
$outhtml->write_result($result);


#bereitet den html-link vor#
#dient als codereferenz in $writer->hit_link_desc (\&CODE)
#                   und in $writer->hit_link_align(\&CODE)
sub my_hit_link {
  my($self, $hit, $result) = @_;
  my $urlpattern = undef;
  my $source = undef; #biors oder fastacmd
  my $database = undef;
  my @spl = split( /[|]/, $hit->name);
  if ($spl[0] eq "gnl"){
    my ($beg, $end) = $hit->hsp('best')->range('sbjct');
    $urlpattern = sprintf ($fastacmd, $spl[1] ,"%s", $beg, $end);
  }else{
    $urlpattern = sprintf ($biors, $dbsource{$spl[0]});
  }
  #print STDERR "URLPATTERN:\t$urlpattern\n";
  
  my $type = ( $result->algorithm =~ /(P|X|Y)$/i ) ? 'PROTEIN' : 'NUCLEOTIDE';
  my ($gi,$acc) = &{$self->id_parser}($hit->name);
  
  
  my $url = length($self->remote_database_url($type,)) > 0 ? 
    sprintf('<a href="%s">%s</a>',
	    sprintf($self->remote_database_url($type, $urlpattern),$gi || $acc), 
	    $hit->name()) :  $hit->name();
  
  return $url;
} 


From robert.roth at home.se  Thu Oct  9 07:51:13 2003
From: robert.roth at home.se (Robert Roth)
Date: Thu Oct  9 07:48:52 2003
Subject: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl
In-Reply-To: <BBA9F339.1E76%hlapp@gnf.org>
Message-ID: <BAEPLGNHBBCGIFPHMAHLEELKCCAA.robert.roth@home.se>


I started over again and also upgraded to perl 5.8 (from ActiveState). The
claim in INSTALL.WIN that not all external modules are available from the
ppm repositories is no longer true.
Since the the installation of Bioperl with ppm does not give the possiblity
to run the test suite I skipped it and installed Bioperl-1.2.3 with nmake
instead. The test suite for bioperl ran without any major problems. With all
test that just returned "ok" removed for brevity the output was,

************************************************************************
t\Assembly...................DB_File not installed. This means the Assembly
modules are not available.  Skipping tests.
t\Assembly...................ok
t\DB.........................ok
        1/78 skipped:
t\flat.......................DB_File not loaded. This means flat.t test
cannot
be executed. Skipping
t\flat.......................ok
t\Index......................ok 11/32Can't call method "accession" on an
undefined value at E:\temp\bioperl-1.2.3\blib\lib/Bio/DB/InMemoryCache.pm
line 220, <GN7> line 54.
t\Index......................dubious
        Test returned status 22 (wstat 5632, 0x1600)
        after all the subtests completed successfully
t\SeqFeatCollection..........DB_File not installed. This means the
SeqFeatCollection wont work
t\SeqFeatCollection..........ok
Failed Test Stat Wstat Total Fail  Failed  List of Failed
----------------------------------------------------------------------------
---
t\Index.t     22  5632    32    0   0.00%  ??
4 subtests skipped.
Failed 1/123 test scripts, 99.19% okay. 0/5977 subtests failed, 100.00%
okay.
************************************************************************


After this I moved on and installed bioperl-db. Running the test suite gives
the lengthy (sorry...) output below. They all (except dbadaptor.t and
query.t) fail with the same output. Attempting to run load_seqdatabase.pl
fails with the same output as before.
The complains about redefined subroutines that I showed in an earlier
message that I get when running load_seqdatabase.pl is something that is
known and attributed to a more "sensitive" perl on windows.
If the problem still seems to be related to the perl setup I guess I have to
build perl also and abandon activestate. Is there someone else who have a
working installation of Bioperl/bioperl-db/biosql under windows?
Your help is very much appreciated.

/Robert


*************************************************************************
 C:\Perl\bin\perl.exe "-MExtUtils::Command::MM" "-e" "test_harness(0, 'bl
ib\lib', 'blib\arch')" t\cluster.t t\comment.t t\dbadaptor.t t\dblink.t
t\ensemb
l.t t\fuzzy2.t t\genbank.t t\locuslink.t t\ontology.t t\query.t t\remove.t
t\seq
feature.t t\simpleseq.t t\species.t t\swiss.t
t\cluster.......ok 2/160Undefined subroutine &Bio::Root::Root::throw called
at E
:\temp\bioperl-db\blib\lib/Bio\DB\BioSQL\DBAdaptor.pm line 205, <GEN0> line
1.
t\cluster.......dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-160
        Failed 158/160 tests, 1.25% okay
t\comment.......ok 2/11Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 72.
t\comment.......dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-11
        Failed 9/11 tests, 18.18% okay
t\dbadaptor.....ok
t\dblink........ok 2/18Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 72.
t\dblink........dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-18
        Failed 16/18 tests, 11.11% okay
t\ensembl.......ok 2/15Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 1420.
t\ensembl.......dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-15
        Failed 13/15 tests, 13.33% okay
t\fuzzy2........ok 2/21Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 64.
t\fuzzy2........dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-21
        Failed 19/21 tests, 9.52% okay
t\genbank.......ok 2/18Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 72.
t\genbank.......dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-18
        Failed 16/18 tests, 11.11% okay
t\locuslink.....ok 3/110Undefined subroutine &Bio::Root::Root::throw called
at E
:\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0
> line 1.
t\locuslink.....dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 4-110
        Failed 107/110 tests, 2.73% okay
t\ontology......ok 4/291Undefined subroutine &Bio::Root::Root::throw called
at E
:\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0
> line 98.
t\ontology......dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 5-291
        Failed 287/291 tests, 1.37% okay
t\query.........ok
t\remove........ok 1/59Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 41.
        (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
at t\
remove.t line 32, <GEN0> line 41.
Undefined subroutine &Bio::Root::Root::throw called at
E:\temp\bioperl-db\blib\l
ib/Bio\DB\BioSQL\DBAdaptor.pm line 174, <GEN0> line 41.
t\remove........dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 2-59
        Failed 58/59 tests, 1.69% okay
t\seqfeature....ok 2/48Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 72.
t\seqfeature....dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-48
        Failed 46/48 tests, 4.17% okay
t\simpleseq.....ok 8/27Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 1.
        (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
at t\
simpleseq.t line 45, <GEN0> line 1.
        (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
at t\
simpleseq.t line 73, <GEN0> line 1.
Undefined subroutine &Bio::Root::Root::debug called at
E:\temp\bioperl-db\blib\l
ib/Bio/DB/BioSQL/BaseDriver.pm line 410, <GEN0> line 1.
t\simpleseq.....dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 9-27
        Failed 19/27 tests, 29.63% okay
t\species.......ok 2/65Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198.
t\species.......dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-65
        Failed 63/65 tests, 3.08% okay
t\swiss.........ok 2/52Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 78.
t\swiss.........dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-52
        Failed 50/52 tests, 3.85% okay
Failed Test    Stat Wstat Total Fail  Failed  List of Failed
----------------------------------------------------------------------------
---
t\cluster.t     255 65280   160  158  98.75%  3-160
t\comment.t     255 65280    11    9  81.82%  3-11
t\dblink.t      255 65280    18   16  88.89%  3-18
t\ensembl.t     255 65280    15   13  86.67%  3-15
t\fuzzy2.t      255 65280    21   19  90.48%  3-21
t\genbank.t     255 65280    18   16  88.89%  3-18
t\locuslink.t   255 65280   110  107  97.27%  4-110
t\ontology.t    255 65280   291  287  98.63%  5-291
t\remove.t      255 65280    59   58  98.31%  2-59
t\seqfeature.t  255 65280    48   46  95.83%  3-48
t\simpleseq.t   255 65280    27   19  70.37%  9-27
t\species.t     255 65280    65   63  96.92%  3-65
t\swiss.t       255 65280    52   50  96.15%  3-52
Failed 13/15 test scripts, 13.33% okay. 861/919 subtests failed, 6.31% okay.
****************************************************************************
****


-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Hilmar Lapp
Sent: 09 October 2003 02:01
To: Robert Roth
Cc: BioPerl
Subject: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl


This is definitely related to the perl setup I'd claim. Have you been able
to run the bioperl test suite without similar trouble?

Again, I'm cc'ing the bioperl list in case anybody has seen this also and
has clues as to what the cause is and how to remedy it.

    -hilmar


From brian_osborne at cognia.com  Thu Oct  9 08:44:26 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Thu Oct  9 08:46:31 2003
Subject: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl
In-Reply-To: <BAEPLGNHBBCGIFPHMAHLEELKCCAA.robert.roth@home.se>
Message-ID: <GAEDKMGOKFBLJPKCLKCCMEOJDCAA.brian_osborne@cognia.com>

Robert,

> Is there someone else who have a working installation of
Bioperl/bioperl-db/biosql under windows?

I've not done extensive use or testing of my Mysql database but I've set up
a small one using load_taxonomy.pl followed by load_seqdatabase.pl. This was
a month or so ago. I used Cygwin, the latest bioperl from CVS, latest
biosql-schema, and the latest bioperl-db, installed in that order. Mysql
4.0.13. The only tricky part is getting DBD::Mysql running under Cygwin. I
was getting a bit frustrated but then I stumbled on this:

http://search.cpan.org/src/JWIED/DBD-mysql-2.1025/INSTALL.html#windows/cygwi
n


Brian O.


-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Robert Roth
Sent: Thursday, October 09, 2003 7:51 AM
To: bioperl-l@bioperl.org
Subject: RE: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl


I started over again and also upgraded to perl 5.8 (from ActiveState). The
claim in INSTALL.WIN that not all external modules are available from the
ppm repositories is no longer true.
Since the the installation of Bioperl with ppm does not give the possiblity
to run the test suite I skipped it and installed Bioperl-1.2.3 with nmake
instead. The test suite for bioperl ran without any major problems. With all
test that just returned "ok" removed for brevity the output was,

************************************************************************
t\Assembly...................DB_File not installed. This means the Assembly
modules are not available.  Skipping tests.
t\Assembly...................ok
t\DB.........................ok
        1/78 skipped:
t\flat.......................DB_File not loaded. This means flat.t test
cannot
be executed. Skipping
t\flat.......................ok
t\Index......................ok 11/32Can't call method "accession" on an
undefined value at E:\temp\bioperl-1.2.3\blib\lib/Bio/DB/InMemoryCache.pm
line 220, <GN7> line 54.
t\Index......................dubious
        Test returned status 22 (wstat 5632, 0x1600)
        after all the subtests completed successfully
t\SeqFeatCollection..........DB_File not installed. This means the
SeqFeatCollection wont work
t\SeqFeatCollection..........ok
Failed Test Stat Wstat Total Fail  Failed  List of Failed
----------------------------------------------------------------------------
---
t\Index.t     22  5632    32    0   0.00%  ??
4 subtests skipped.
Failed 1/123 test scripts, 99.19% okay. 0/5977 subtests failed, 100.00%
okay.
************************************************************************


After this I moved on and installed bioperl-db. Running the test suite gives
the lengthy (sorry...) output below. They all (except dbadaptor.t and
query.t) fail with the same output. Attempting to run load_seqdatabase.pl
fails with the same output as before.
The complains about redefined subroutines that I showed in an earlier
message that I get when running load_seqdatabase.pl is something that is
known and attributed to a more "sensitive" perl on windows.
If the problem still seems to be related to the perl setup I guess I have to
build perl also and abandon activestate. Is there someone else who have a
working installation of Bioperl/bioperl-db/biosql under windows?
Your help is very much appreciated.

/Robert


*************************************************************************
 C:\Perl\bin\perl.exe "-MExtUtils::Command::MM" "-e" "test_harness(0, 'bl
ib\lib', 'blib\arch')" t\cluster.t t\comment.t t\dbadaptor.t t\dblink.t
t\ensemb
l.t t\fuzzy2.t t\genbank.t t\locuslink.t t\ontology.t t\query.t t\remove.t
t\seq
feature.t t\simpleseq.t t\species.t t\swiss.t
t\cluster.......ok 2/160Undefined subroutine &Bio::Root::Root::throw called
at E
:\temp\bioperl-db\blib\lib/Bio\DB\BioSQL\DBAdaptor.pm line 205, <GEN0> line
1.
t\cluster.......dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-160
        Failed 158/160 tests, 1.25% okay
t\comment.......ok 2/11Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 72.
t\comment.......dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-11
        Failed 9/11 tests, 18.18% okay
t\dbadaptor.....ok
t\dblink........ok 2/18Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 72.
t\dblink........dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-18
        Failed 16/18 tests, 11.11% okay
t\ensembl.......ok 2/15Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 1420.
t\ensembl.......dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-15
        Failed 13/15 tests, 13.33% okay
t\fuzzy2........ok 2/21Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 64.
t\fuzzy2........dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-21
        Failed 19/21 tests, 9.52% okay
t\genbank.......ok 2/18Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 72.
t\genbank.......dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-18
        Failed 16/18 tests, 11.11% okay
t\locuslink.....ok 3/110Undefined subroutine &Bio::Root::Root::throw called
at E
:\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0
> line 1.
t\locuslink.....dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 4-110
        Failed 107/110 tests, 2.73% okay
t\ontology......ok 4/291Undefined subroutine &Bio::Root::Root::throw called
at E
:\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0
> line 98.
t\ontology......dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 5-291
        Failed 287/291 tests, 1.37% okay
t\query.........ok
t\remove........ok 1/59Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 41.
        (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
at t\
remove.t line 32, <GEN0> line 41.
Undefined subroutine &Bio::Root::Root::throw called at
E:\temp\bioperl-db\blib\l
ib/Bio\DB\BioSQL\DBAdaptor.pm line 174, <GEN0> line 41.
t\remove........dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 2-59
        Failed 58/59 tests, 1.69% okay
t\seqfeature....ok 2/48Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 72.
t\seqfeature....dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-48
        Failed 46/48 tests, 4.17% okay
t\simpleseq.....ok 8/27Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 1.
        (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
at t\
simpleseq.t line 45, <GEN0> line 1.
        (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
at t\
simpleseq.t line 73, <GEN0> line 1.
Undefined subroutine &Bio::Root::Root::debug called at
E:\temp\bioperl-db\blib\l
ib/Bio/DB/BioSQL/BaseDriver.pm line 410, <GEN0> line 1.
t\simpleseq.....dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 9-27
        Failed 19/27 tests, 29.63% okay
t\species.......ok 2/65Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198.
t\species.......dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-65
        Failed 63/65 tests, 3.08% okay
t\swiss.........ok 2/52Undefined subroutine &Bio::Root::Root::throw called
at E:
\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
<GEN0>
 line 78.
t\swiss.........dubious
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 3-52
        Failed 50/52 tests, 3.85% okay
Failed Test    Stat Wstat Total Fail  Failed  List of Failed
----------------------------------------------------------------------------
---
t\cluster.t     255 65280   160  158  98.75%  3-160
t\comment.t     255 65280    11    9  81.82%  3-11
t\dblink.t      255 65280    18   16  88.89%  3-18
t\ensembl.t     255 65280    15   13  86.67%  3-15
t\fuzzy2.t      255 65280    21   19  90.48%  3-21
t\genbank.t     255 65280    18   16  88.89%  3-18
t\locuslink.t   255 65280   110  107  97.27%  4-110
t\ontology.t    255 65280   291  287  98.63%  5-291
t\remove.t      255 65280    59   58  98.31%  2-59
t\seqfeature.t  255 65280    48   46  95.83%  3-48
t\simpleseq.t   255 65280    27   19  70.37%  9-27
t\species.t     255 65280    65   63  96.92%  3-65
t\swiss.t       255 65280    52   50  96.15%  3-52
Failed 13/15 test scripts, 13.33% okay. 861/919 subtests failed, 6.31% okay.
****************************************************************************
****


-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Hilmar Lapp
Sent: 09 October 2003 02:01
To: Robert Roth
Cc: BioPerl
Subject: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl


This is definitely related to the perl setup I'd claim. Have you been able
to run the bioperl test suite without similar trouble?

Again, I'm cc'ing the bioperl list in case anybody has seen this also and
has clues as to what the cause is and how to remedy it.

    -hilmar


_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From Steven.Roels at mpi.com  Thu Oct  9 14:26:23 2003
From: Steven.Roels at mpi.com (Roels, Steven)
Date: Thu Oct  9 14:25:14 2003
Subject: [Bioperl-l] cvs access problems?
Message-ID: <C50012B03D77BC4CBBE8FE39E0B3B31C56B46B@US-VS1.corp.mpi.com>


Is the open-bio cvs server up and running?

I'm trying to login as I've done before, but today I'm timing out.  As
near as we can tell its not a firewall issue and I can (for example)
access ensembl.

I'm doing the following:

% cvs -d :pserver:cvs@cvs.open-bio.org:/home/repository/bioperl login

when prompted, the password is 'cvs'


Thanks,

-Steve

*****************************************************************
Steve Roels, Ph.D.                       
Senior Scientist I - Computational Biology     Phone: (617) 761-6820
Millennium Pharmaceuticals, Inc.         FAX:   (617) 577-3555
640 Memorial Drive                       Email: roels@mpi.com
Cambridge, MA 02139-4853
*****************************************************************


This e-mail, including any attachments, is a confidential business communication, and may contain information that is confidential, proprietary and/or privileged.  This e-mail is intended only for the individual(s) to whom it is addressed, and may not be saved, copied, printed, disclosed or used by anyone else.  If you are not the(an) intended recipient, please immediately delete this e-mail from your computer system and notify the sender.  Thank you.


From Steven.Roels at mpi.com  Thu Oct  9 15:04:13 2003
From: Steven.Roels at mpi.com (Roels, Steven)
Date: Thu Oct  9 15:02:59 2003
Subject: [Bioperl-l] cvs access problems?
Message-ID: <C50012B03D77BC4CBBE8FE39E0B3B31C06C7DD64@US-VS1.corp.mpi.com>


Before anyone else troubles themselves...

It was just a firewall problem (the permissions were tied to IP address,
which had apparently changed since I last connected).

Thanks Jason for checking,

-Steve

*****************************************************************
Steve Roels, Ph.D.                       
Senior Scientist I - Computational Biology     Phone: (617) 761-6820
Millennium Pharmaceuticals, Inc.         FAX:   (617) 577-3555
640 Memorial Drive                       Email: roels@mpi.com
Cambridge, MA 02139-4853
*****************************************************************

>-----Original Message-----
>From: Jason Stajich [mailto:jason@cgt.duhs.duke.edu]
>Sent: Thursday, October 09, 2003 2:36 PM
>To: Roels, Steven
>Subject: Re: [Bioperl-l] cvs access problems?
>
>works for me....
>
>-j
>
>On Thu, 9 Oct 2003, Roels, Steven wrote:
>
>>
>> Is the open-bio cvs server up and running?
>>
>> I'm trying to login as I've done before, but today I'm timing out.
As
>> near as we can tell its not a firewall issue and I can (for example)
>> access ensembl.
>>
>> I'm doing the following:
>>
>> % cvs -d :pserver:cvs@cvs.open-bio.org:/home/repository/bioperl login
>>
>> when prompted, the password is 'cvs'
>>
>>
>> Thanks,
>>
>> -Steve
>>
>> *****************************************************************
>> Steve Roels, Ph.D.
>> Senior Scientist I - Computational Biology     Phone: (617) 761-6820
>> Millennium Pharmaceuticals, Inc.         FAX:   (617) 577-3555
>> 640 Memorial Drive                       Email: roels@mpi.com
>> Cambridge, MA 02139-4853
>> *****************************************************************
>>
>>
>>
>>
>> This e-mail, including any attachments, is a confidential business
>communication, and may contain information that is confidential,
>proprietary and/or privileged.  This e-mail is intended only for the
>individual(s) to whom it is addressed, and may not be saved, copied,
>printed, disclosed or used by anyone else.  If you are not the(an)
intended
>recipient, please immediately delete this e-mail from your computer
system
>and notify the sender.  Thank you.
>>
>>
>> _______________________________________________
>> Bioperl-l mailing list
>> Bioperl-l@portal.open-bio.org
>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>
>
>--
>Jason Stajich
>Duke University
>jason at cgt.mc.duke.edu


This e-mail, including any attachments, is a confidential business communication, and may contain information that is confidential, proprietary and/or privileged.  This e-mail is intended only for the individual(s) to whom it is addressed, and may not be saved, copied, printed, disclosed or used by anyone else.  If you are not the(an) intended recipient, please immediately delete this e-mail from your computer system and notify the sender.  Thank you.


From vamsi at warta.bio.psu.edu  Thu Oct  9 15:23:03 2003
From: vamsi at warta.bio.psu.edu (Vamsi)
Date: Thu Oct  9 15:20:36 2003
Subject: [Bioperl-l] load_gff.pl error
Message-ID: <200310091923.h99JN3k23009@warta.bio.psu.edu>

Hi,

I am trying to use the load_gff.pl located in the Bio-DB-GFF/scripts directory
of the bioperl-1.2.3 distribution. 

When I try
"perl load_gff.pl --dsn dbi:mysql:XXX --user XXX --pass XXX a.gff"

I get the error

Can't use string ("ARGV") as a symbol ref while "strict refs" in use at 
/usr/perl5/site_perl/5.005/Bio/DB/GFF.pm line 2130.

When I modify sub do_load_gff in GFF.pm by having as the first line
"no strict 'refs';"
the error disappears.

Is this something anyone else has faced?

Thanks,
Vamsi

From jnolan at tulane.edu  Thu Oct  9 16:20:10 2003
From: jnolan at tulane.edu (Jim Nolan)
Date: Thu Oct  9 16:20:28 2003
Subject: [Bioperl-l] OS X bioperl, staden/read, install problems
In-Reply-To: <Pine.OSF.4.33.0310081556120.15105-100000@alpha10.bioch.virginia.edu>
References: <Pine.OSF.4.33.0310081556120.15105-100000@alpha10.bioch.virginia.edu>
Message-ID: <a05210604bbab63714615@[129.81.38.94]>

I tried installing bioperl-1.2.3 again on a relatively naive machine 
(has bioperl 1.1.1, but no -ext). Installation went off without a 
hitch. I think I must have gotten the idea of installing bioper-ext 
from the www.bioperl.org/Core/external.shtml, after having looked 
there for some other dependency like SOAP::Lite. Modules that are 
this tricky to install (I did try to do the manual installation of 
the missing io_lib files) should have a big skull and crossbones on 
them to keep guys like me out. Or better yet, instructions in the 
readme to tell us how to do it right.

At any rate, I ask that the evaluation for modules outside the core 
release be minimized and that they not cause failure of make test.

Whining aside, thanks to everyone for making such a great resource 
free to all of us, and Thanks for getting me going again after the 
little detour.

Jim

At 3:59 PM -0400 10/8/03, Aaron J Mackey wrote:
>On Mon, 6 Oct 2003, Jason Stajich wrote:
>
>>  When I get together with Aaron at the end of the month I will try and go
>>  through it with him so I understand what is supposed to happen and/or how
>>  we might reorg bioperl-ext to be less problematic.
>
>The problem (over-and-over-again) has been that while people think they
>have the staden io-lib successfully installed, they don't (because they
>have to manually do a few things that the io-lib install doesn't do for
>them).  It's a royal pain, of course.  And then they try to install
>bioperl-ext, which doesn't successfully complete (you can, btw, install
>only parts of bioperl-ext, so the whole staden read lib stuff shouldn't
>prevent someone from installing the other alignment code).
>
>Then, if they ignore that and try to run the normal bioperl test suite,
>they get various odd cryptic errors about read not being successfully
>installed.
>
>-Aaron


-- 
---------------------------------------------------------
Jim Nolan			email:JNolan@tulane.edu
Department of Biochemistry SL43
Tulane University Health Sciences Center
1430 Tulane Ave.		Phone: (504)-584-2453
New Orleans, LA 70112-2699	FAX:   (504)-584-2739
http://www.tulane.edu/~biochem/faculty/nolan.htm
http://phage.bioc.tulane.edu
The requirements said: Windows 2000 or better.
So I got a Macintosh.
From lstein at cshl.edu  Thu Oct  9 16:26:59 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Thu Oct  9 16:24:41 2003
Subject: [Bioperl-l] load_gff.pl error
In-Reply-To: <200310091923.h99JN3k23009@warta.bio.psu.edu>
References: <200310091923.h99JN3k23009@warta.bio.psu.edu>
Message-ID: <200310091626.59317.lstein@cshl.edu>

The issue is fixed in bioperl live.

Lincoln

PS: Mark, remember when I told you that your soul would be consigned to 
purgatory if you broke anything when you changed the ARGV handling?  Well, it 
happened.

On Thursday 09 October 2003 03:23 pm, Vamsi wrote:
> Hi,
>
> I am trying to use the load_gff.pl located in the Bio-DB-GFF/scripts
> directory of the bioperl-1.2.3 distribution.
>
> When I try
> "perl load_gff.pl --dsn dbi:mysql:XXX --user XXX --pass XXX a.gff"
>
> I get the error
>
> Can't use string ("ARGV") as a symbol ref while "strict refs" in use at
> /usr/perl5/site_perl/5.005/Bio/DB/GFF.pm line 2130.
>
> When I modify sub do_load_gff in GFF.pm by having as the first line
> "no strict 'refs';"
> the error disappears.
>
> Is this something anyone else has faced?
>
> Thanks,
> Vamsi
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
========================================================================
Lincoln D. Stein                           Cold Spring Harbor Laboratory
lstein@cshl.org			                  Cold Spring Harbor, NY
========================================================================


From markw at illuminae.com  Thu Oct  9 16:57:03 2003
From: markw at illuminae.com (Mark Wilkinson)
Date: Thu Oct  9 16:57:04 2003
Subject: [BioPerl] Re: [Bioperl-l] load_gff.pl error
In-Reply-To: <200310091626.59317.lstein@cshl.edu>
References: <200310091923.h99JN3k23009@warta.bio.psu.edu>
	<200310091626.59317.lstein@cshl.edu>
Message-ID: <1065732852.2767.29.camel@localhost.localdomain>

Uh... no, you threatened me with purgatory if I broke the way data
loading from filehandles worked - you said nothing about messing with
ARGV ;-)

..besides, it doesn't look like this error is my fault??  I don't see
any code in there that came from my fingers.  do you know what bit of
code I wrote that is causing that error?

M

On Thu, 2003-10-09 at 14:26, Lincoln Stein wrote:
> The issue is fixed in bioperl live.
> 
> Lincoln
> 
> PS: Mark, remember when I told you that your soul would be consigned to 
> purgatory if you broke anything when you changed the ARGV handling?  Well, it 
> happened.
> 
> On Thursday 09 October 2003 03:23 pm, Vamsi wrote:
> > Hi,
> >
> > I am trying to use the load_gff.pl located in the Bio-DB-GFF/scripts
> > directory of the bioperl-1.2.3 distribution.
> >
> > When I try
> > "perl load_gff.pl --dsn dbi:mysql:XXX --user XXX --pass XXX a.gff"
> >
> > I get the error
> >
> > Can't use string ("ARGV") as a symbol ref while "strict refs" in use at
> > /usr/perl5/site_perl/5.005/Bio/DB/GFF.pm line 2130.
> >
> > When I modify sub do_load_gff in GFF.pm by having as the first line
> > "no strict 'refs';"
> > the error disappears.
> >
> > Is this something anyone else has faced?
> >
> > Thanks,
> > Vamsi
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
Mark Wilkinson <markw@illuminae.com>
Illuminae

From lcarey at odd.bio.sunysb.edu  Thu Oct  9 18:48:13 2003
From: lcarey at odd.bio.sunysb.edu (Lucas Carey)
Date: Thu Oct  9 18:45:45 2003
Subject: [Bioperl-l] search for only C-terminal degenerate motifs
Message-ID: <20031009224813.GA11656@odd.bio.sunysb.edu>

Hi,
I'm not sure if this is the correct list, but I figured someone here would be able to help me.
I'd like to use blast to search for a protein motif within a certain number of residues of the C-terminus. What is the best way to go about doing this?
It seems like someone would have already had to do this, and written software to do it. That's what made me think of bioperl.
-Lucas
From palle at birc.dk  Fri Oct 10 05:09:59 2003
From: palle at birc.dk (Palle Villesen)
Date: Fri Oct 10 05:07:49 2003
Subject: [Bioperl-l] Problem retrieving peptides using GenPept
Message-ID: <3F867767.3090609@birc.dk>

Hi,

I have the following code (below), but I get the following error and I'm 
stuck. I have tried various queries and I always get <300 seqs back 
before the exception occurs. I'm not sure if it a speed problem, i.e. 
retrieving the datastream quickjly enough, but I have tried the 
-retrievaltype=>'tempfile' as well - which didn't help.

I couldn't find anything in the docs, hence this mail. Sorry to disturb.

Greetings,
P.
--

   Palle Villesen, Ph.D. 
   BiRC, Build. 090, University of Aarhus
   DK - 8000 Aarhus C, Denmark

   Http://www.daimi.au.dk/~biopv - +45 61708600
---------------------------------------------------------------------


OUTPUT:

Getting accession number for the following query:
BSE

Hits: 118
1 Seq length 176
2 Seq length 253
3 Seq length 176
4 Seq length 178
5 Seq length 96
6 Seq length 179
7 Seq length 179
8 Seq length 253
9 Seq length 360

-------------------- WARNING ---------------------
MSG: exception while parsing location line [bond(179,214)] in reading 
EMBL/GenBank/SwissProt, ignoring feature Bond (seqid=Q60506):
------------- EXCEPTION  -------------
MSG: operator "bond" unrecognized by parser
STACK Bio::Factory::FTLocationFactory::from_string 
/home/serine/palle/bioperl-live/Bio/Factory/FTLocationFactory.pm:160
STACK (eval) /home/serine/palle/bioperl-live/Bio/SeqIO/FTHelper.pm:124
STACK Bio::SeqIO::FTHelper::_generic_seqfeature 
/home/serine/palle/bioperl-live/Bio/SeqIO/FTHelper.pm:123
STACK Bio::SeqIO::genbank::next_seq 
/home/serine/palle/bioperl-live/Bio/SeqIO/genbank.pm:396
STACK main::get_query get_RVproteins_from_GB.pl:30
STACK (eval) get_RVproteins_from_GB.pl:8
STACK toplevel get_RVproteins_from_GB.pl:7

--------------------------------------

---------------------------------------------------
Caught exception
Done


Program:

#!/usr/local/bin/perl
my $query_string = "BSE";

print "Getting the following query:$query_string\n";

eval {
  get_query($query_string);
};

if ($@) {
  print "Caught exception\n";
}
print "Done\n\n";
exit();

sub get_query {
  use Bio::DB::GenPept;
  use Bio::DB::Query::GenBank;
  my ($query_string) = @_;
  my $query = Bio::DB::Query::GenBank->new(-db=>'protein',
					   -query=>$query_string
					  );
  my $gb = Bio::DB::GenPept->new ();
  print "Hits: ".$query->count."\n";
  my @ids = $query->ids;
  my $current =1;
  my $seqio = $gb->get_Stream_by_query($query);
  my $current =1;
  while (my $seq = $seqio->next_seq()) {
    print $current++." Seq length ".$seq->length()."\n";
  }
return 1;
}


From senger at ebi.ac.uk  Fri Oct 10 07:16:50 2003
From: senger at ebi.ac.uk (Martin Senger)
Date: Fri Oct 10 07:15:04 2003
Subject: [Bioperl-l] Re: Bio::Biblio
Message-ID: <Pine.LNX.4.44.0310101157430.20688-100000@bagheera.ebi.ac.uk>

Hi again,

>        My goal is to write a quick-and-dirty medline2docbook fliter for
> immediate use, using bioperl since it has already a library able to
> query medline.
> ...
> Can anybody point me to the correct object and its construction method?>
>

   As mentioned already, the good source for that is the script provided 
within bioperl. The script is: examples/biblio/biblio.PLS.
   Here I am ataching an example doing hopefully what you wish (and few 
explanations labelled by '(digit)' pattern:

#!/usr/bin/perl -w

use Bio::Biblio;       # to read data via SOAP
use Bio::Biblio::IO;   # to convert resulting XML to Biblio objects

# (1)
# --- taking citations from MEDLINE at EBI using SOAP
my $biblio = new Bio::Biblio;
my $citation = $biblio->get_by_id ('11112222');

# (2)
# --- make an instance of a converter
my $io = new Bio::Biblio::IO ('-data' => $citation);

# (3)
# --- and make the conversion (use the citation)
print << 'DOCBOOK_START';
<biblioset>
   <authorgroup>
DOCBOOK_START

while (my $bibref = $io->next_bibref) {
    foreach (@ {$$bibref{'_authors'}} ) {
	print << "DOCBOOK";
	 <author>
	    <surname>$$_{_lastname}</surname>
	    <firstname>$$_{_forename}</firstname>
	 </author>
DOCBOOK
    }
}
print << 'DOCBOOK_END';
   </authorgroup>
</biblioset>
DOCBOOK_END

__END__

Notes:

(1) First you need to get a citation (or a collection of citations. You
may get it from MEDLINEW at EBI (as shown in the script above), but you
can also get it from the PubMed web page, or you have it already in your
local file. The script shows how to get a citation by its ID, but you can 
slo get them by using a query, e.g. using:
   my $citation = $biblio->find ('brazma')->get_all;
   Anyway, the citation (or citations) is now in your possesion in an XML
format.

(2) You may specify various arguments in the constructor. By default it 
expects that you wish result as perl biblio objects. If you rather wish a 
raw hashtable use ('-result' => 'raw') in the constructor.
   Also you say where is your citation to be converted. If you have it in 
a file use ('-file' => $citation).
   If you got your citatioin from PubMed which is slightly different XML 
from Medline) use ('-format' => 'pubmedxml').

(3) The created biblio objects do not have any methods, you just use their 
attributes. It is not ideal I know but there is no object representing all 
attributes as get methods.
   The best way hiw to find what attributes are available is to print the 
whole citation:
   print $bibref->print_me;

   I hope this helps you. Please let me know if you need more.
   Regards, Martin

PS. BTW, this is what you should get when running the script above:
<biblioset>
   <authorgroup>
         <author>
            <surname>Oyer</surname>
            <firstname>C E</firstname>
         </author>
         <author>
            <surname>Ongcapin</surname>
            <firstname>E H</firstname>
         </author>
         <author>
            <surname>Ni</surname>
            <firstname>J</firstname>
         </author>
         <author>
            <surname>Bowles</surname>
            <firstname>N E</firstname>
         </author>
         <author>
            <surname>Towbin</surname>
            <firstname>J A</firstname>
         </author>
   </authorgroup>
</biblioset>

-- 
Martin Senger

EMBL Outstation - Hinxton                Senger@EBI.ac.uk     
European Bioinformatics Institute        Phone: (+44) 1223 494636      
Wellcome Trust Genome Campus             (Switchboard:     494444)
Hinxton                                  Fax  : (+44) 1223 494468
Cambridge CB10 1SD
United Kingdom                           http://industry.ebi.ac.uk/~senger

From brian_osborne at cognia.com  Fri Oct 10 08:54:36 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Fri Oct 10 08:56:42 2003
Subject: [Bioperl-l] Problem retrieving peptides using GenPept
In-Reply-To: <3F867767.3090609@birc.dk>
Message-ID: <GAEDKMGOKFBLJPKCLKCCGEPEDCAA.brian_osborne@cognia.com>

Palle,

This is not a problem with your code or your connection, it's the "bond"
operator in the entry you're trying to retrieve. The Genbank and GenPept
parsers were written to respect the formal specifications for the
Genbank/EMBL/DDBJ feature tables (see
http://www.ncbi.nlm.nih.gov/projects/collab/FT/ for the details). The bond
operator is not in the standard, yet Genbank, for some reason, has allowed
it to appear so Bioperl is complaining. One possible work-around is to use
eval{} around the appropriate lines in your get_query().

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Palle Villesen
Sent: Friday, October 10, 2003 5:10 AM
To: bioperl-l@bioperl.org
Subject: [Bioperl-l] Problem retrieving peptides using GenPept

Hi,

I have the following code (below), but I get the following error and I'm
stuck. I have tried various queries and I always get <300 seqs back
before the exception occurs. I'm not sure if it a speed problem, i.e.
retrieving the datastream quickjly enough, but I have tried the
-retrievaltype=>'tempfile' as well - which didn't help.

I couldn't find anything in the docs, hence this mail. Sorry to disturb.

Greetings,
P.
--

   Palle Villesen, Ph.D.
   BiRC, Build. 090, University of Aarhus
   DK - 8000 Aarhus C, Denmark

   Http://www.daimi.au.dk/~biopv - +45 61708600
---------------------------------------------------------------------


OUTPUT:

Getting accession number for the following query:
BSE

Hits: 118
1 Seq length 176
2 Seq length 253
3 Seq length 176
4 Seq length 178
5 Seq length 96
6 Seq length 179
7 Seq length 179
8 Seq length 253
9 Seq length 360

-------------------- WARNING ---------------------
MSG: exception while parsing location line [bond(179,214)] in reading
EMBL/GenBank/SwissProt, ignoring feature Bond (seqid=Q60506):
------------- EXCEPTION  -------------
MSG: operator "bond" unrecognized by parser
STACK Bio::Factory::FTLocationFactory::from_string
/home/serine/palle/bioperl-live/Bio/Factory/FTLocationFactory.pm:160
STACK (eval) /home/serine/palle/bioperl-live/Bio/SeqIO/FTHelper.pm:124
STACK Bio::SeqIO::FTHelper::_generic_seqfeature
/home/serine/palle/bioperl-live/Bio/SeqIO/FTHelper.pm:123
STACK Bio::SeqIO::genbank::next_seq
/home/serine/palle/bioperl-live/Bio/SeqIO/genbank.pm:396
STACK main::get_query get_RVproteins_from_GB.pl:30
STACK (eval) get_RVproteins_from_GB.pl:8
STACK toplevel get_RVproteins_from_GB.pl:7

--------------------------------------

---------------------------------------------------
Caught exception
Done


Program:

#!/usr/local/bin/perl
my $query_string = "BSE";

print "Getting the following query:$query_string\n";

eval {
  get_query($query_string);
};

if ($@) {
  print "Caught exception\n";
}
print "Done\n\n";
exit();

sub get_query {
  use Bio::DB::GenPept;
  use Bio::DB::Query::GenBank;
  my ($query_string) = @_;
  my $query = Bio::DB::Query::GenBank->new(-db=>'protein',
                                           -query=>$query_string
                                          );
  my $gb = Bio::DB::GenPept->new ();
  print "Hits: ".$query->count."\n";
  my @ids = $query->ids;
  my $current =1;
  my $seqio = $gb->get_Stream_by_query($query);
  my $current =1;
  while (my $seq = $seqio->next_seq()) {
    print $current++." Seq length ".$seq->length()."\n";
  }
return 1;
}


_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From robert.roth at home.se  Fri Oct 10 09:21:22 2003
From: robert.roth at home.se (Robert Roth)
Date: Fri Oct 10 09:18:54 2003
Subject: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCMEOJDCAA.brian_osborne@cognia.com>
Message-ID: <BAEPLGNHBBCGIFPHMAHLEEMACCAA.robert.roth@home.se>


Brian,

Thank you for your advice. I got the database set up under cygwin now
and everything seems to work so far. The only thing that caused some trouble
was that I am not able to use 'localhost' as host when running the test for
bioperl-db. Using 127.0.0.1 as described at the link below for the
installation of DBD-mysql also solved these problems. Now
load_seqdatabase.pl seem to work so I guess I'll load the data under cygwin.

/Robert


>
>
> Robert,
>
> > Is there someone else who have a working installation of
> Bioperl/bioperl-db/biosql under windows?
>
> I've not done extensive use or testing of my Mysql database but
> I've set up
> a small one using load_taxonomy.pl followed by
> load_seqdatabase.pl. This was
> a month or so ago. I used Cygwin, the latest bioperl from CVS, latest
> biosql-schema, and the latest bioperl-db, installed in that order. Mysql
> 4.0.13. The only tricky part is getting DBD::Mysql running under Cygwin. I
> was getting a bit frustrated but then I stumbled on this:
>
> http://search.cpan.org/src/JWIED/DBD-mysql-2.1025/INSTALL.html#win
> dows/cygwi
> n
>
>
> Brian O.
>
>
>
>
> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org
> [mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Robert Roth
> Sent: Thursday, October 09, 2003 7:51 AM
> To: bioperl-l@bioperl.org
> Subject: RE: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl
>
>
> I started over again and also upgraded to perl 5.8 (from ActiveState). The
> claim in INSTALL.WIN that not all external modules are available from the
> ppm repositories is no longer true.
> Since the the installation of Bioperl with ppm does not give the
> possiblity
> to run the test suite I skipped it and installed Bioperl-1.2.3 with nmake
> instead. The test suite for bioperl ran without any major
> problems. With all
> test that just returned "ok" removed for brevity the output was,
>
> ************************************************************************
> t\Assembly...................DB_File not installed. This means
> the Assembly
> modules are not available.  Skipping tests.
> t\Assembly...................ok
> t\DB.........................ok
>         1/78 skipped:
> t\flat.......................DB_File not loaded. This means flat.t test
> cannot
> be executed. Skipping
> t\flat.......................ok
> t\Index......................ok 11/32Can't call method "accession" on an
> undefined value at E:\temp\bioperl-1.2.3\blib\lib/Bio/DB/InMemoryCache.pm
> line 220, <GN7> line 54.
> t\Index......................dubious
>         Test returned status 22 (wstat 5632, 0x1600)
>         after all the subtests completed successfully
> t\SeqFeatCollection..........DB_File not installed. This means the
> SeqFeatCollection wont work
> t\SeqFeatCollection..........ok
> Failed Test Stat Wstat Total Fail  Failed  List of Failed
> ------------------------------------------------------------------
> ----------
> ---
> t\Index.t     22  5632    32    0   0.00%  ??
> 4 subtests skipped.
> Failed 1/123 test scripts, 99.19% okay. 0/5977 subtests failed, 100.00%
> okay.
> ************************************************************************
>
>
>
> After this I moved on and installed bioperl-db. Running the test
> suite gives
> the lengthy (sorry...) output below. They all (except dbadaptor.t and
> query.t) fail with the same output. Attempting to run load_seqdatabase.pl
> fails with the same output as before.
> The complains about redefined subroutines that I showed in an earlier
> message that I get when running load_seqdatabase.pl is something that is
> known and attributed to a more "sensitive" perl on windows.
> If the problem still seems to be related to the perl setup I
> guess I have to
> build perl also and abandon activestate. Is there someone else who have a
> working installation of Bioperl/bioperl-db/biosql under windows?
> Your help is very much appreciated.
>
> /Robert
>
>
> *************************************************************************
>  C:\Perl\bin\perl.exe "-MExtUtils::Command::MM" "-e" "test_harness(0, 'bl
> ib\lib', 'blib\arch')" t\cluster.t t\comment.t t\dbadaptor.t t\dblink.t
> t\ensemb
> l.t t\fuzzy2.t t\genbank.t t\locuslink.t t\ontology.t t\query.t t\remove.t
> t\seq
> feature.t t\simpleseq.t t\species.t t\swiss.t
> t\cluster.......ok 2/160Undefined subroutine
> &Bio::Root::Root::throw called
> at E
> :\temp\bioperl-db\blib\lib/Bio\DB\BioSQL\DBAdaptor.pm line 205,
> <GEN0> line
> 1.
> t\cluster.......dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-160
>         Failed 158/160 tests, 1.25% okay
> t\comment.......ok 2/11Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 72.
> t\comment.......dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-11
>         Failed 9/11 tests, 18.18% okay
> t\dbadaptor.....ok
> t\dblink........ok 2/18Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 72.
> t\dblink........dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-18
>         Failed 16/18 tests, 11.11% okay
> t\ensembl.......ok 2/15Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 1420.
> t\ensembl.......dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-15
>         Failed 13/15 tests, 13.33% okay
> t\fuzzy2........ok 2/21Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 64.
> t\fuzzy2........dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-21
>         Failed 19/21 tests, 9.52% okay
> t\genbank.......ok 2/18Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 72.
> t\genbank.......dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-18
>         Failed 16/18 tests, 11.11% okay
> t\locuslink.....ok 3/110Undefined subroutine
> &Bio::Root::Root::throw called
> at E
> :\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0
> > line 1.
> t\locuslink.....dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 4-110
>         Failed 107/110 tests, 2.73% okay
> t\ontology......ok 4/291Undefined subroutine
> &Bio::Root::Root::throw called
> at E
> :\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0
> > line 98.
> t\ontology......dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 5-291
>         Failed 287/291 tests, 1.37% okay
> t\query.........ok
> t\remove........ok 1/59Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 41.
>         (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
> at t\
> remove.t line 32, <GEN0> line 41.
> Undefined subroutine &Bio::Root::Root::throw called at
> E:\temp\bioperl-db\blib\l
> ib/Bio\DB\BioSQL\DBAdaptor.pm line 174, <GEN0> line 41.
> t\remove........dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 2-59
>         Failed 58/59 tests, 1.69% okay
> t\seqfeature....ok 2/48Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 72.
> t\seqfeature....dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-48
>         Failed 46/48 tests, 4.17% okay
> t\simpleseq.....ok 8/27Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 1.
>         (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
> at t\
> simpleseq.t line 45, <GEN0> line 1.
>         (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
> at t\
> simpleseq.t line 73, <GEN0> line 1.
> Undefined subroutine &Bio::Root::Root::debug called at
> E:\temp\bioperl-db\blib\l
> ib/Bio/DB/BioSQL/BaseDriver.pm line 410, <GEN0> line 1.
> t\simpleseq.....dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 9-27
>         Failed 19/27 tests, 29.63% okay
> t\species.......ok 2/65Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198.
> t\species.......dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-65
>         Failed 63/65 tests, 3.08% okay
> t\swiss.........ok 2/52Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 78.
> t\swiss.........dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-52
>         Failed 50/52 tests, 3.85% okay
> Failed Test    Stat Wstat Total Fail  Failed  List of Failed
> ------------------------------------------------------------------
> ----------
> ---
> t\cluster.t     255 65280   160  158  98.75%  3-160
> t\comment.t     255 65280    11    9  81.82%  3-11
> t\dblink.t      255 65280    18   16  88.89%  3-18
> t\ensembl.t     255 65280    15   13  86.67%  3-15
> t\fuzzy2.t      255 65280    21   19  90.48%  3-21
> t\genbank.t     255 65280    18   16  88.89%  3-18
> t\locuslink.t   255 65280   110  107  97.27%  4-110
> t\ontology.t    255 65280   291  287  98.63%  5-291
> t\remove.t      255 65280    59   58  98.31%  2-59
> t\seqfeature.t  255 65280    48   46  95.83%  3-48
> t\simpleseq.t   255 65280    27   19  70.37%  9-27
> t\species.t     255 65280    65   63  96.92%  3-65
> t\swiss.t       255 65280    52   50  96.15%  3-52
> Failed 13/15 test scripts, 13.33% okay. 861/919 subtests failed,
> 6.31% okay.
> ******************************************************************
> **********
> ****
>
>
>
>
> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org
> [mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Hilmar Lapp
> Sent: 09 October 2003 02:01
> To: Robert Roth
> Cc: BioPerl
> Subject: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl
>
>
> This is definitely related to the perl setup I'd claim. Have you been able
> to run the bioperl test suite without similar trouble?
>
> Again, I'm cc'ing the bioperl list in case anybody has seen this also and
> has clues as to what the cause is and how to remedy it.
>
>     -hilmar
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
>
>


From brian_osborne at cognia.com  Fri Oct 10 09:27:36 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Fri Oct 10 09:29:35 2003
Subject: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl
In-Reply-To: <BAEPLGNHBBCGIFPHMAHLEEMACCAA.robert.roth@home.se>
Message-ID: <GAEDKMGOKFBLJPKCLKCCIEPGDCAA.brian_osborne@cognia.com>

Cool.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Robert Roth
Sent: Friday, October 10, 2003 9:21 AM
To: Brian Osborne; bioperl-l@bioperl.org
Subject: RE: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl


Brian,

Thank you for your advice. I got the database set up under cygwin now
and everything seems to work so far. The only thing that caused some trouble
was that I am not able to use 'localhost' as host when running the test for
bioperl-db. Using 127.0.0.1 as described at the link below for the
installation of DBD-mysql also solved these problems. Now
load_seqdatabase.pl seem to work so I guess I'll load the data under cygwin.

/Robert


>
>
> Robert,
>
> > Is there someone else who have a working installation of
> Bioperl/bioperl-db/biosql under windows?
>
> I've not done extensive use or testing of my Mysql database but
> I've set up
> a small one using load_taxonomy.pl followed by
> load_seqdatabase.pl. This was
> a month or so ago. I used Cygwin, the latest bioperl from CVS, latest
> biosql-schema, and the latest bioperl-db, installed in that order. Mysql
> 4.0.13. The only tricky part is getting DBD::Mysql running under Cygwin. I
> was getting a bit frustrated but then I stumbled on this:
>
> http://search.cpan.org/src/JWIED/DBD-mysql-2.1025/INSTALL.html#win
> dows/cygwi
> n
>
>
> Brian O.
>
>
>
>
> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org
> [mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Robert Roth
> Sent: Thursday, October 09, 2003 7:51 AM
> To: bioperl-l@bioperl.org
> Subject: RE: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl
>
>
> I started over again and also upgraded to perl 5.8 (from ActiveState). The
> claim in INSTALL.WIN that not all external modules are available from the
> ppm repositories is no longer true.
> Since the the installation of Bioperl with ppm does not give the
> possiblity
> to run the test suite I skipped it and installed Bioperl-1.2.3 with nmake
> instead. The test suite for bioperl ran without any major
> problems. With all
> test that just returned "ok" removed for brevity the output was,
>
> ************************************************************************
> t\Assembly...................DB_File not installed. This means
> the Assembly
> modules are not available.  Skipping tests.
> t\Assembly...................ok
> t\DB.........................ok
>         1/78 skipped:
> t\flat.......................DB_File not loaded. This means flat.t test
> cannot
> be executed. Skipping
> t\flat.......................ok
> t\Index......................ok 11/32Can't call method "accession" on an
> undefined value at E:\temp\bioperl-1.2.3\blib\lib/Bio/DB/InMemoryCache.pm
> line 220, <GN7> line 54.
> t\Index......................dubious
>         Test returned status 22 (wstat 5632, 0x1600)
>         after all the subtests completed successfully
> t\SeqFeatCollection..........DB_File not installed. This means the
> SeqFeatCollection wont work
> t\SeqFeatCollection..........ok
> Failed Test Stat Wstat Total Fail  Failed  List of Failed
> ------------------------------------------------------------------
> ----------
> ---
> t\Index.t     22  5632    32    0   0.00%  ??
> 4 subtests skipped.
> Failed 1/123 test scripts, 99.19% okay. 0/5977 subtests failed, 100.00%
> okay.
> ************************************************************************
>
>
>
> After this I moved on and installed bioperl-db. Running the test
> suite gives
> the lengthy (sorry...) output below. They all (except dbadaptor.t and
> query.t) fail with the same output. Attempting to run load_seqdatabase.pl
> fails with the same output as before.
> The complains about redefined subroutines that I showed in an earlier
> message that I get when running load_seqdatabase.pl is something that is
> known and attributed to a more "sensitive" perl on windows.
> If the problem still seems to be related to the perl setup I
> guess I have to
> build perl also and abandon activestate. Is there someone else who have a
> working installation of Bioperl/bioperl-db/biosql under windows?
> Your help is very much appreciated.
>
> /Robert
>
>
> *************************************************************************
>  C:\Perl\bin\perl.exe "-MExtUtils::Command::MM" "-e" "test_harness(0, 'bl
> ib\lib', 'blib\arch')" t\cluster.t t\comment.t t\dbadaptor.t t\dblink.t
> t\ensemb
> l.t t\fuzzy2.t t\genbank.t t\locuslink.t t\ontology.t t\query.t t\remove.t
> t\seq
> feature.t t\simpleseq.t t\species.t t\swiss.t
> t\cluster.......ok 2/160Undefined subroutine
> &Bio::Root::Root::throw called
> at E
> :\temp\bioperl-db\blib\lib/Bio\DB\BioSQL\DBAdaptor.pm line 205,
> <GEN0> line
> 1.
> t\cluster.......dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-160
>         Failed 158/160 tests, 1.25% okay
> t\comment.......ok 2/11Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 72.
> t\comment.......dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-11
>         Failed 9/11 tests, 18.18% okay
> t\dbadaptor.....ok
> t\dblink........ok 2/18Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 72.
> t\dblink........dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-18
>         Failed 16/18 tests, 11.11% okay
> t\ensembl.......ok 2/15Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 1420.
> t\ensembl.......dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-15
>         Failed 13/15 tests, 13.33% okay
> t\fuzzy2........ok 2/21Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 64.
> t\fuzzy2........dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-21
>         Failed 19/21 tests, 9.52% okay
> t\genbank.......ok 2/18Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 72.
> t\genbank.......dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-18
>         Failed 16/18 tests, 11.11% okay
> t\locuslink.....ok 3/110Undefined subroutine
> &Bio::Root::Root::throw called
> at E
> :\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0
> > line 1.
> t\locuslink.....dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 4-110
>         Failed 107/110 tests, 2.73% okay
> t\ontology......ok 4/291Undefined subroutine
> &Bio::Root::Root::throw called
> at E
> :\temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0
> > line 98.
> t\ontology......dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 5-291
>         Failed 287/291 tests, 1.37% okay
> t\query.........ok
> t\remove........ok 1/59Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 41.
>         (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
> at t\
> remove.t line 32, <GEN0> line 41.
> Undefined subroutine &Bio::Root::Root::throw called at
> E:\temp\bioperl-db\blib\l
> ib/Bio\DB\BioSQL\DBAdaptor.pm line 174, <GEN0> line 41.
> t\remove........dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 2-59
>         Failed 58/59 tests, 1.69% okay
> t\seqfeature....ok 2/48Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 72.
> t\seqfeature....dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-48
>         Failed 46/48 tests, 4.17% okay
> t\simpleseq.....ok 8/27Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 1.
>         (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
> at t\
> simpleseq.t line 45, <GEN0> line 1.
>         (in cleanup) Undefined subroutine &Bio::Root::Root::DESTROY called
> at t\
> simpleseq.t line 73, <GEN0> line 1.
> Undefined subroutine &Bio::Root::Root::debug called at
> E:\temp\bioperl-db\blib\l
> ib/Bio/DB/BioSQL/BaseDriver.pm line 410, <GEN0> line 1.
> t\simpleseq.....dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 9-27
>         Failed 19/27 tests, 29.63% okay
> t\species.......ok 2/65Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198.
> t\species.......dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-65
>         Failed 63/65 tests, 3.08% okay
> t\swiss.........ok 2/52Undefined subroutine &Bio::Root::Root::throw called
> at E:
> \temp\bioperl-db\blib\lib/Bio/DB/Persistent/PersistentObject.pm line 198,
> <GEN0>
>  line 78.
> t\swiss.........dubious
>         Test returned status 255 (wstat 65280, 0xff00)
> DIED. FAILED tests 3-52
>         Failed 50/52 tests, 3.85% okay
> Failed Test    Stat Wstat Total Fail  Failed  List of Failed
> ------------------------------------------------------------------
> ----------
> ---
> t\cluster.t     255 65280   160  158  98.75%  3-160
> t\comment.t     255 65280    11    9  81.82%  3-11
> t\dblink.t      255 65280    18   16  88.89%  3-18
> t\ensembl.t     255 65280    15   13  86.67%  3-15
> t\fuzzy2.t      255 65280    21   19  90.48%  3-21
> t\genbank.t     255 65280    18   16  88.89%  3-18
> t\locuslink.t   255 65280   110  107  97.27%  4-110
> t\ontology.t    255 65280   291  287  98.63%  5-291
> t\remove.t      255 65280    59   58  98.31%  2-59
> t\seqfeature.t  255 65280    48   46  95.83%  3-48
> t\simpleseq.t   255 65280    27   19  70.37%  9-27
> t\species.t     255 65280    65   63  96.92%  3-65
> t\swiss.t       255 65280    52   50  96.15%  3-52
> Failed 13/15 test scripts, 13.33% okay. 861/919 subtests failed,
> 6.31% okay.
> ******************************************************************
> **********
> ****
>
>
>
>
> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org
> [mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Hilmar Lapp
> Sent: 09 October 2003 02:01
> To: Robert Roth
> Cc: BioPerl
> Subject: [Bioperl-l] Re: [BioSQL-l] Error using load_seqdatabase.pl
>
>
> This is definitely related to the perl setup I'd claim. Have you been able
> to run the bioperl test suite without similar trouble?
>
> Again, I'm cc'ing the bioperl list in case anybody has seen this also and
> has clues as to what the cause is and how to remedy it.
>
>     -hilmar
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
>
>


_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From lstein at cshl.edu  Fri Oct 10 12:26:37 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Fri Oct 10 12:24:21 2003
Subject: [Bioperl-l] Problem retrieving peptides using GenPept
In-Reply-To: <3F867767.3090609@birc.dk>
References: <3F867767.3090609@birc.dk>
Message-ID: <200310101226.37841.lstein@cshl.edu>

Hi,

This looks like a problem in the parser when it hits an entry that is not 
prepared for.  My prediction would be that you will always get the exception 
at the same accession number, but it sounds like you get sometimes more and 
sometimes less?  Please see whether you can isolate the issue to a particular 
entry.  This will help debug the problem.

Lincoln

On Friday 10 October 2003 05:09 am, Palle Villesen wrote:
> Hi,
>
> I have the following code (below), but I get the following error and I'm
> stuck. I have tried various queries and I always get <300 seqs back
> before the exception occurs. I'm not sure if it a speed problem, i.e.
> retrieving the datastream quickjly enough, but I have tried the
> -retrievaltype=>'tempfile' as well - which didn't help.
>
> I couldn't find anything in the docs, hence this mail. Sorry to disturb.
>
> Greetings,
> P.

-- 
========================================================================
Lincoln D. Stein                           Cold Spring Harbor Laboratory
lstein@cshl.org			                  Cold Spring Harbor, NY
========================================================================


From brian_osborne at cognia.com  Fri Oct 10 15:31:44 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Fri Oct 10 15:34:14 2003
Subject: [Bioperl-l] search for only C-terminal degenerate motifs
In-Reply-To: <20031009224813.GA11656@odd.bio.sunysb.edu>
Message-ID: <GAEDKMGOKFBLJPKCLKCCEEPLDCAA.brian_osborne@cognia.com>

Lucas,

I don't think a script exactly like this exists in Bioperl but certainly the
means to write this script does. A good start might be the SearchIO HOWTO
(bioperl.org/HOWTOs). The combination of SearchIO and StandAloneBlast
(section III.4.5 of the bptutorial) enables you to run BLAST and parse the
results. The details of the C-terminal match should be straightforward, I
think, but it depends on your familiarity with Perl.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Lucas Carey
Sent: Thursday, October 09, 2003 6:48 PM
To: bioperl-l@bioperl.org
Subject: [Bioperl-l] search for only C-terminal degenerate motifs

Hi,
I'm not sure if this is the correct list, but I figured someone here would
be able to help me.
I'd like to use blast to search for a protein motif within a certain number
of residues of the C-terminus. What is the best way to go about doing this?
It seems like someone would have already had to do this, and written
software to do it. That's what made me think of bioperl.
-Lucas
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From amackey at pcbi.upenn.edu  Fri Oct 10 16:08:26 2003
From: amackey at pcbi.upenn.edu (Aaron J. Mackey)
Date: Fri Oct 10 16:05:54 2003
Subject: [Bioperl-l] search for only C-terminal degenerate motifs
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCEEPLDCAA.brian_osborne@cognia.com>
Message-ID: <81E173BD-FB5D-11D7-9104-000A958C5008@pcbi.upenn.edu>

Yeah, the problem is you'd really like to *not* search all of the 
database (for both speed and statistical reasons), only the first n 
C-terminal residues of each sequence in the database.  Both BLAST and 
FASTA have mechanisms to allow you to focus the search on a given part 
of the query, but not on a given part of the library.  Unfortunately, I 
think the best/quickest answer is to make your own database of 
C-terminal sequences [perhaps using Bio::SeqIO and $seq->trunc()].

But you said "motif" - are you trying to find:

   a) exact matches to a given short sequence
   b) exact matches to a consensus regular expression (e.g.: CX[S|T]C)
   c) near-exact, homologous matches using a query sequence and a 
stringent, shallow scoring matrix
   d) high scoring matches to a position-specific profile/weight matrix

Depending on the answer, you should be able to use current tools, 
filter your output based on where hits occur, and then alter any 
statistical results (P, E()) to accomodate your effectively reduced 
search space.

-Aaron

On Friday, October 10, 2003, at 03:31 PM, Brian Osborne wrote:

> Lucas,
>
> I don't think a script exactly like this exists in Bioperl but 
> certainly the
> means to write this script does. A good start might be the SearchIO 
> HOWTO
> (bioperl.org/HOWTOs). The combination of SearchIO and StandAloneBlast
> (section III.4.5 of the bptutorial) enables you to run BLAST and parse 
> the
> results. The details of the C-terminal match should be 
> straightforward, I
> think, but it depends on your familiarity with Perl.
>
> Brian O.
>
> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org
> [mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Lucas Carey
> Sent: Thursday, October 09, 2003 6:48 PM
> To: bioperl-l@bioperl.org
> Subject: [Bioperl-l] search for only C-terminal degenerate motifs
>
> Hi,
> I'm not sure if this is the correct list, but I figured someone here 
> would
> be able to help me.
> I'd like to use blast to search for a protein motif within a certain 
> number
> of residues of the C-terminus. What is the best way to go about doing 
> this?
> It seems like someone would have already had to do this, and written
> software to do it. That's what made me think of bioperl.
> -Lucas
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From grossman at molgen.mpg.de  Fri Oct 10 16:45:39 2003
From: grossman at molgen.mpg.de (Steffen Grossmann)
Date: Fri Oct 10 16:42:05 2003
Subject: [Bioperl-l] Method dnaseq in EncodedSeq not working properly?
Message-ID: <3F871A73.7080908@molgen.mpg.de>

Dear all,

am I wrong or does the method "dnaseq" of Bio::Seq::EncodedSeq not do 
what is promised in the documentation? I understand the documentation 
such that e.g. with the command

@introns = $obj->$dnaseq(-encoding => 'I')

I would get a list of all the parts in the sequence which are coded with 
an 'I'. (From a look at the method's code I'd say it doesn't do anything 
at all...)

Actually, what I really need is a way to identify pieces of repeated X's 
or N's from DNA sequences in order to exclude them from a sequence-wide 
screening for good hits of a position-weight-matrix. Any hint concerning 
that problem is highly appreciated! Since I am a bioperl-newbie maybe I 
only suffer from a lack of overview...

Steffen
-- 
%---------------------------------------------%
%            Steffen Grossmann                %
%                                             %
% Max Planck Institute for Molecular Genetics %
%      Computational Molecular Biology        %
%---------------------------------------------%
%              Ihnestrasse 73                 %
%               14195 Berlin                  %
%                 Germany                     %
%---------------------------------------------%
%         Tel: (++49 +30) 8413-1167           %
%         Fax: (++49 +30) 8413-1152           %
%---------------------------------------------%

From lstein at cshl.edu  Fri Oct 10 16:59:47 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Fri Oct 10 16:57:23 2003
Subject: [BioPerl] Re: [Bioperl-l] load_gff.pl error
In-Reply-To: <1065732852.2767.29.camel@localhost.localdomain>
References: <200310091923.h99JN3k23009@warta.bio.psu.edu>
	<200310091626.59317.lstein@cshl.edu>
	<1065732852.2767.29.camel@localhost.localdomain>
Message-ID: <200310101659.47813.lstein@cshl.edu>

Hi Mark,

Oh dear.  I maligned you.  I personally broke the code in order to satisfy 
your request that load_gff() handle tied filehandles.  Now we both have to 
spend time in purgatory.

Lincoln

On Thursday 09 October 2003 04:54 pm, Mark Wilkinson wrote:
> Uh... no, you threatened me with purgatory if I broke the way data
> loading from filehandles worked - you said nothing about messing with
> ARGV ;-)
>
> .besides, it doesn't look like this error is my fault??  I don't see
> any code in there that came from my fingers.  do you know what bit of
> code I wrote that is causing that error?
>
> M
>
> On Thu, 2003-10-09 at 14:26, Lincoln Stein wrote:
> > The issue is fixed in bioperl live.
> >
> > Lincoln
> >
> > PS: Mark, remember when I told you that your soul would be consigned to
> > purgatory if you broke anything when you changed the ARGV handling? 
> > Well, it happened.
> >
> > On Thursday 09 October 2003 03:23 pm, Vamsi wrote:
> > > Hi,
> > >
> > > I am trying to use the load_gff.pl located in the Bio-DB-GFF/scripts
> > > directory of the bioperl-1.2.3 distribution.
> > >
> > > When I try
> > > "perl load_gff.pl --dsn dbi:mysql:XXX --user XXX --pass XXX a.gff"
> > >
> > > I get the error
> > >
> > > Can't use string ("ARGV") as a symbol ref while "strict refs" in use at
> > > /usr/perl5/site_perl/5.005/Bio/DB/GFF.pm line 2130.
> > >
> > > When I modify sub do_load_gff in GFF.pm by having as the first line
> > > "no strict 'refs';"
> > > the error disappears.
> > >
> > > Is this something anyone else has faced?
> > >
> > > Thanks,
> > > Vamsi
> > >
> > > _______________________________________________
> > > Bioperl-l mailing list
> > > Bioperl-l@portal.open-bio.org
> > > http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
========================================================================
Lincoln D. Stein                           Cold Spring Harbor Laboratory
lstein@cshl.org			                  Cold Spring Harbor, NY
========================================================================


From amackey at pcbi.upenn.edu  Fri Oct 10 17:09:12 2003
From: amackey at pcbi.upenn.edu (Aaron J. Mackey)
Date: Fri Oct 10 17:06:40 2003
Subject: [Bioperl-l] Method dnaseq in EncodedSeq not working properly?
In-Reply-To: <3F871A73.7080908@molgen.mpg.de>
Message-ID: <FF27B4A0-FB65-11D7-9104-000A958C5008@pcbi.upenn.edu>

You're completely correct, it's entirely documented but entirely 
unimplemented.  Since you're the first to notice, you must also be the 
first to need it ;)  If you continue to need it, I'll see what I can do.

> Actually, what I really need is a way to identify pieces of repeated 
> X's or N's from DNA sequences in order to exclude them from a 
> sequence-wide screening for good hits of a position-weight-matrix.

If I understand you correctly, your weight matrix search results in 
hits that "align" against X/N's, and you want to filter out those hits. 
  I see two choices:

    1) alter your profile to give very bad scores to X/N (if you have 
this capability)
    2) use some Bio::Range objects representing hits and X/N stretches, 
and call
       $hit->overlap($xn) to see if they overlap.

-Aaron

From markw at illuminae.com  Fri Oct 10 18:36:50 2003
From: markw at illuminae.com (Mark Wilkinson)
Date: Fri Oct 10 18:36:51 2003
Subject: [BioPerl] Re: [Bioperl-l] load_gff.pl error
In-Reply-To: <200310101659.47813.lstein@cshl.edu>
References: <200310091923.h99JN3k23009@warta.bio.psu.edu>
	<200310091626.59317.lstein@cshl.edu>
	<1065732852.2767.29.camel@localhost.localdomain>
	<200310101659.47813.lstein@cshl.edu>
Message-ID: <1065825558.1698.90.camel@localhost.localdomain>

hey... wait a minute!  

Why do *I* have to go to purgatory?!  I didn't do anything wrong - I
only *suggested* that the code be changed... and it wasn't even for ME! 
It was for Sheldon - damn HIM!

This just isn't fair at all... I hate divine thought-police   }-/

M


On Fri, 2003-10-10 at 14:59, Lincoln Stein wrote:
> Hi Mark,
> 
> Oh dear.  I maligned you.  I personally broke the code in order to satisfy 
> your request that load_gff() handle tied filehandles.  Now we both have to 
> spend time in purgatory.
> 
> Lincoln
> 
> On Thursday 09 October 2003 04:54 pm, Mark Wilkinson wrote:
> > Uh... no, you threatened me with purgatory if I broke the way data
> > loading from filehandles worked - you said nothing about messing with
> > ARGV ;-)
> >
> > .besides, it doesn't look like this error is my fault??  I don't see
> > any code in there that came from my fingers.  do you know what bit of
> > code I wrote that is causing that error?
> >
> > M
> >
> > On Thu, 2003-10-09 at 14:26, Lincoln Stein wrote:
> > > The issue is fixed in bioperl live.
> > >
> > > Lincoln
> > >
> > > PS: Mark, remember when I told you that your soul would be consigned to
> > > purgatory if you broke anything when you changed the ARGV handling? 
> > > Well, it happened.
> > >
> > > On Thursday 09 October 2003 03:23 pm, Vamsi wrote:
> > > > Hi,
> > > >
> > > > I am trying to use the load_gff.pl located in the Bio-DB-GFF/scripts
> > > > directory of the bioperl-1.2.3 distribution.
> > > >
> > > > When I try
> > > > "perl load_gff.pl --dsn dbi:mysql:XXX --user XXX --pass XXX a.gff"
> > > >
> > > > I get the error
> > > >
> > > > Can't use string ("ARGV") as a symbol ref while "strict refs" in use at
> > > > /usr/perl5/site_perl/5.005/Bio/DB/GFF.pm line 2130.
> > > >
> > > > When I modify sub do_load_gff in GFF.pm by having as the first line
> > > > "no strict 'refs';"
> > > > the error disappears.
> > > >
> > > > Is this something anyone else has faced?
> > > >
> > > > Thanks,
> > > > Vamsi
> > > >
> > > > _______________________________________________
> > > > Bioperl-l mailing list
> > > > Bioperl-l@portal.open-bio.org
> > > > http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
Mark Wilkinson <markw@illuminae.com>
Illuminae

From smckay at bcgsc.bc.ca  Fri Oct 10 19:11:08 2003
From: smckay at bcgsc.bc.ca (Sheldon McKay)
Date: Fri Oct 10 19:08:35 2003
Subject: [BioPerl] Re: [Bioperl-l] load_gff.pl error
References: <200310091923.h99JN3k23009@warta.bio.psu.edu>
	<200310091626.59317.lstein@cshl.edu>
	<1065732852.2767.29.camel@localhost.localdomain>
	<200310101659.47813.lstein@cshl.edu>
	<1065825558.1698.90.camel@localhost.localdomain>
Message-ID: <001101c38f83$da88ec00$c201010a@celegans01>

Mark is indeed correct, it was I who wanted tied filehandles.  I don't think
third party damning is allowed, sorry Mark.
When you get released from purgatory, please don't remove the tied
filehandle functionality.

Sheldon

----- Original Message -----
From: "Mark Wilkinson" <markw@illuminae.com>
To: "Lincoln Stein" <lstein@cshl.org>; "Doreen Ware" <ware@cshl.org>;
"Sheldon McKay" <smckay@bcgsc.bc.ca>; "me" <markw@illuminae.com>
Cc: <bioperl-l@bioperl.org>
Sent: Friday, October 10, 2003 3:39 PM
Subject: Re: [BioPerl] Re: [Bioperl-l] load_gff.pl error


> hey... wait a minute!
>
> Why do *I* have to go to purgatory?!  I didn't do anything wrong - I
> only *suggested* that the code be changed... and it wasn't even for ME!
> It was for Sheldon - damn HIM!
>
> This just isn't fair at all... I hate divine thought-police   }-/
>
> M
>
>


From bmadsen at mathworks.com  Fri Oct 10 11:29:35 2003
From: bmadsen at mathworks.com (Brian Madsen)
Date: Fri Oct 10 21:16:58 2003
Subject: [Bioperl-l] Suggestions for MATLAB-Bioperl Interface
Message-ID: <DB873318D1A41648BEAC4B5AAB63B3E1D8DB1A@MESSAGE-AH.ad.mathworks.com>

I am studying Bioinformatics through the Master's program at
Northeastern and am working as a co-op at MathWorks on their
Bioinformatics Toolbox.

In August, I met a few researchers around Boston to get ideas on the
Toolbox. One of the best suggestions was from a current user, Dr. Vamsi
Mootha at the Whitehead Institute. He was interested in calling Bioperl
apps from MATLAB and calling MATLAB functions from Bioperl. While I
don't yet have a streamlined approach for calling MATLAB from Perl, I
have gone through the installation of Bioperl and tested calling Perl
apps from the toolbox (attached is an html of a demo).

2 questions:
1 - Are there any suggestions on the interface for MATLAB and
Bioperl--any procedures or issues I should research?
2 - What intellectual property issues might be involved? With the base
of Bioperl modules available, I thought it might be a good idea to
include an installation of Bioperl on the Bioinformatics Toolbox CD.

Thanks for any ideas.

-Brian Madsen
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://portal.open-bio.org/pipermail/bioperl-l/attachments/20031010/246f3edb/bioperldemo-0001.html
From bio at masrao.homelinux.com  Sat Oct 11 06:07:24 2003
From: bio at masrao.homelinux.com (Koichi Minamijima)
Date: Sat Oct 11 05:49:26 2003
Subject: [Bioperl-l] Bio::Graphics HOWTO :: about of the color of graph
Message-ID: <20031011190724.D09D3770.bio@masrao.com>

Hi.
If it knows, please let me know.

I looked at the web page of "BioGraphics HOWTO."
It was used having edited "Example 4."
	http://bioperl.org/HOWTOs/html/Graphics-HOWTO.html#parsing-
blast

[next unless $hit->significance < 1E-20;] 
of the 38th line was only changed into 
[next unless $hit->significance < 1;].

rscu.fas (ftp://ftp.ncbi.nlm.nih.gov/refseq/cumulative/rscu.fas.Z) 
and "One arrangement file" were used, and blastall was done.
The picture was generated when "Exsample 4." was performed using the 
result.

?The picture of a result?
	http://masrao.com/img/render4.png (total)
	 or
	 http://masrao.com/img/render4.jpg (A part)

?A part of result of BLAST?
//-------------------------------------------start--------------------
-----------------------//

	>gi|32563818|ref|NP_871912.1| (NM_182112) ROLler: helically 
twisted,animals roll when moving ROL-8, COLlagenstructural gene (rol-8)
 [Caenorhabdtis elegans]
           Length = 265

	  Score = 70.1 bits (170), Expect = 2e-11
	  Identities = 37/95 (38%), Positives = 56/95 (58%), Gaps = 8/
95 (8%)

	Query: 11  
EVGRFRRNAFSAVALSTSCILAIVVLTPIAYQAIQRIHSNLLNDAHFCQARNRDLWTETM 
	70
	            E  + RR AF AV +ST+ ++A VV  P+ Y  +Q   S+L+ +  +C+
AR+RD+W E
	Sbjct: 8   
EQRQMRRIAFVAVVVSTAAVIASVVTLPMLYNYVQSFQSHLMVETDYCKARSRDMWLEMT 
	67
	
	Query: 71  KMARSRGRDEELVARTKRAVKGTWLFGQYIPDRSS 105
	             +   +G       R KRA    WLFGQ+IP+ S+
	Sbjct: 68  ALQAGKGMGH----RMKRA----WLFGQWIPETSA 94


	  Score = 59.7 bits (143), Expect = 3e-08
	  Identities = 24/37 (64%), Positives = 25/37 (67%)

	Query: 291 PGPYGKPGQPGAPGPDGSAGEKGGCDHCPPPRTPPGY 327
	            PG  G+ G  G  GP G  GEKG C HCPPPRTPPGY
	Sbjct: 229 PGHEGRAGPNGPAGPPGDNGEKGDCGHCPPPRTPPGY 265

	 >gi|17533809|ref|NP_496362.1| (NM_063961) COLlagen 
structural gene(29.2 kD) (col-82) [Caenorhabditis elegans]Length = 306

	  Score = 57.8 bits (138), Expect = 1e-07
	  Identities = 23/36 (63%), Positives = 24/36 (66%)

	Query: 292 GPYGKPGQPGAPGPDGSAGEKGGCDHCPPPRTPPGY 327
	            G  G+PG  G  GP G  G  GGCDHCPPPRT PGY
	Sbjct: 271 GAAGQPGAAGEQGPAGEPGAGGGCDHCPPPRTAPGY 306


	  Score = 42.7 bits (99), Expect = 0.004
	  Identities = 20/76 (26%), Positives = 38/76 (50%)

	Query: 10 
LEVGRFRRNAFSAVALSTSCILAIVVLTPIAYQAIQRIHSNLLNDAHFCQARNRDLWTET 
	69
           +E    R+ AF  + +ST   L  +V  P+ Y  +Q + S L  +  FC+ R   L+ +
	Sbjct: 12 
VEAESLRKVAFFGICISTVATLTAIVAIPLLYNYMQHVQSTLQTEVDFCKHRTNGLFDQY 
	71

	Query: 70 MKMARSRGRDEELVAR 85
	            +M + +G   +L+ +
	Sbjct: 72 ERMQQLKGVRGKLIVK 87

//---------------------------------------------------end--------------
-----------------------//

Please compare the portions of 292-327 of NM_182112 and NM_063961.
Graph is white, although NM_182112 are higher as for a score and e-
value is low.

It pats, although it is written to "http://bioperl.org/HOWTOs/html/
Graphics-HOWTO.html#addingscale" as "We specify the base background 
color (-bgcolor => 'blue') and and the minimum and maximum scores to 
scale to. (-min_score and-max_score)", and it is ?
Is it different in HSP?

It is a long text and is pardon. 
Teach, if there are those who understand some.
It is it and English which is hard to read, and is pardon.

From letondal at pasteur.fr  Sat Oct 11 08:26:57 2003
From: letondal at pasteur.fr (Catherine Letondal)
Date: Sat Oct 11 08:24:27 2003
Subject: [Bioperl-l] Suggestions for MATLAB-Bioperl Interface
In-Reply-To: <DB873318D1A41648BEAC4B5AAB63B3E1D8DB1A@MESSAGE-AH.ad.mathworks.com>;
	from bmadsen@mathworks.com on Fri, Oct 10, 2003 at 11:29:35AM -0400
References: <DB873318D1A41648BEAC4B5AAB63B3E1D8DB1A@MESSAGE-AH.ad.mathworks.com>
Message-ID: <20031011142657.A13680@electre.pasteur.fr>

On Fri, Oct 10, 2003 at 11:29:35AM -0400, Brian Madsen wrote:
> I am studying Bioinformatics through the Master's program at
> Northeastern and am working as a co-op at MathWorks on their
> Bioinformatics Toolbox.
> 
> In August, I met a few researchers around Boston to get ideas on the
> Toolbox. One of the best suggestions was from a current user, Dr. Vamsi
> Mootha at the Whitehead Institute. He was interested in calling Bioperl
> apps from MATLAB and calling MATLAB functions from Bioperl. While I
> don't yet have a streamlined approach for calling MATLAB from Perl, I
> have gone through the installation of Bioperl and tested calling Perl
> apps from the toolbox (attached is an html of a demo).
> 
> 2 questions:
> 1 - Are there any suggestions on the interface for MATLAB and
> Bioperl--any procedures or issues I should research?
> 2 - What intellectual property issues might be involved? With the base
> of Bioperl modules available, I thought it might be a good idea to
> include an installation of Bioperl on the Bioinformatics Toolbox CD.
> 
> Thanks for any ideas.
> 

This is probably at a research stage only, but I came accross
"Lightweight Languages" which are (as far as I have understood) an
attempt to address complex programming situations such as this
one. Communication with differing subsystems is addresed also, but the
idea goes much further.

http://www.mail-archive.com/perl6-internals@perl.org/msg17089.html

A more detailed description of the field is provided here:
http://www.usenix.org/publications/library/proceedings/dsl97/full_papers/spinellis/spinellis_html/spinellis.html


-- 
Catherine Letondal -- Pasteur Institute Computing Center
From lcarey at odd.bio.sunysb.edu  Sun Oct 12 00:56:10 2003
From: lcarey at odd.bio.sunysb.edu (Lucas Carey)
Date: Sun Oct 12 00:53:38 2003
Subject: [Bioperl-l] search for only C-terminal degenerate motifs
In-Reply-To: <81E173BD-FB5D-11D7-9104-000A958C5008@pcbi.upenn.edu>
References: <GAEDKMGOKFBLJPKCLKCCEEPLDCAA.brian_osborne@cognia.com>
	<81E173BD-FB5D-11D7-9104-000A958C5008@pcbi.upenn.edu>
Message-ID: <20031012045610.GA18891@odd.bio.sunysb.edu>

On Fri, Oct 10, 2003 at 04:08:26PM -0400, Aaron J. Mackey wrote:
> Yeah, the problem is you'd really like to *not* search all of the 
> database (for both speed and statistical reasons), only the first n 
> C-terminal residues of each sequence in the database.  Both BLAST and 
Search time isn't an issue, I'll be using mpiBLAST on a 128cpu cluster. I don't think the statistics will matter, because I'm just looking for genes to test biologically, not to draw any conclusions from the db search alone.

> But you said "motif" - are you trying to find:
> 
>   a) exact matches to a given short sequence
exact matches to one of ~7 sequences located at : (C-3) - (C-2) - (C-1) - C
The query file would just be a FASTA file with 7 4aa query sequences.

>   b) exact matches to a consensus regular expression (e.g.: CX[S|T]C)
is this possible? can i search for [R|K][D|E] if I wanted to search for a negativly charged aa that follows a postitivly charged one?

My thought, because I'm fairly competent with perl but have never used bioperl before, was to do this:

>gi|17647257|ref|NP_523617.1|   Chitinase 1 [Drosophila melanogaster]
 gi|37078008|sp|Q9W5U3|CHI1_DROME   Probable chitinase 1
            Length = 508

				Score = 14.6 bits (27), Expect =  9048
				Identities = 4/4 (100%), Positives = 4/4 (100%)

				Query: 1   AGDK 4
				AGDK
				Sbjct: 226 AGDK 229

If my signal sequence is AGDK. I want to look for matches where XXX in
'Length = XXX' is equal to XXX in 'Sbjct: nnn AGDK XXX'.

I would not mind using this as an excuse to learn bioperl, assuming that there is a reasonable straight forward way to go about doing this in bioperl. In the BioSearch::IO HOWTO I see end('hit') and length('query') but no length('db_sequence') or something like that. Does this method exist?
I could do 
if( (end('hit') == length('db_sequence')) && (matches('query')[0] == 4)) { print 	query_description ;}
-Lucas
thank you for your assistance

From bounce at gaintrafficfast.com  Sun Oct 12 01:56:30 2003
From: bounce at gaintrafficfast.com (Sharon Brown)
Date: Sun Oct 12 02:03:40 2003
Subject: [Bioperl-l] www.bioperl.org
Message-ID: <3195CR1000034300@s2h6a0w3.trafficglue.com>

Hi,

I visited www.bioperl.org, and noticed that you're not listed
on some search engines! I would like to introduce to you an affordable
service where we can help enhance your business' global online presence 
and increase the number of visitors to your website.

Our unique technology at http://www.gaintrafficfast.com/index.html
submits your website to over 300,000 search engines and directories
every month. 

You'll be surprised how simple it is to now reach out to an international
market and increase the visibility of your products and services, at a low
cost to you.

Why not visit us at http://www.gaintrafficfast.com/index.html
and let us help you direct traffic your way?

I look forward to hearing from you!

Best Regards,
Sharon Brown
Sales and Marketing
E-mail: sharon_brown@gaintrafficfast.com
http://www.gaintrafficfast.com

------------------------------------------------------------------------------------------
Not interested in our service?
To be taken off our mailing list, please follow the instructions below
http://s2h6a0w3.trafficglue.com/mnkr/www/r?1000034300.3195.1.GXMH+omosu5agt
From W.Kenworthy at murdoch.edu.au  Sun Oct 12 08:16:43 2003
From: W.Kenworthy at murdoch.edu.au (William Kenworthy)
Date: Sun Oct 12 08:14:16 2003
Subject: [Bioperl-l] bioperl-pipeline
Message-ID: <1065961003.8277.15.camel@rattus.localdomain>

the link to biopipe.org on the openbioinformatics website broke some
months back.  Has this project become "bioperl-pipeline"?

Also, is there a tutorial for bioperl-pipeline, or some document that
can give me a start on it? (what to install, module listing etc)

BillK


From ajm6q at virginia.edu  Sun Oct 12 09:58:51 2003
From: ajm6q at virginia.edu (Aaron J Mackey)
Date: Sun Oct 12 09:56:23 2003
Subject: [Bioperl-l] search for only C-terminal degenerate motifs
In-Reply-To: <20031012045610.GA18891@odd.bio.sunysb.edu>
Message-ID: <Pine.OSF.4.33.0310120956490.29637-100000@alpha10.bioch.virginia.edu>


Since you're looking for exact matches (in a defined location, no less),
why do you need BLAST, or any bioinfomatics tool?  Doesn't simple string
comparison, or regexp matching get you what you need with minimal fuss?

-Aaron

On Sun, 12 Oct 2003, Lucas Carey wrote:

> On Fri, Oct 10, 2003 at 04:08:26PM -0400, Aaron J. Mackey wrote:
> > Yeah, the problem is you'd really like to *not* search all of the
> > database (for both speed and statistical reasons), only the first n
> > C-terminal residues of each sequence in the database.  Both BLAST and
> Search time isn't an issue, I'll be using mpiBLAST on a 128cpu cluster. I don't think the statistics will matter, because I'm just looking for genes to test biologically, not to draw any conclusions from the db search alone.
>
> > But you said "motif" - are you trying to find:
> >
> >   a) exact matches to a given short sequence
> exact matches to one of ~7 sequences located at : (C-3) - (C-2) - (C-1) - C
> The query file would just be a FASTA file with 7 4aa query sequences.
>
> >   b) exact matches to a consensus regular expression (e.g.: CX[S|T]C)
> is this possible? can i search for [R|K][D|E] if I wanted to search for a negativly charged aa that follows a postitivly charged one?
>
> My thought, because I'm fairly competent with perl but have never used bioperl before, was to do this:
>
> >gi|17647257|ref|NP_523617.1|   Chitinase 1 [Drosophila melanogaster]
>  gi|37078008|sp|Q9W5U3|CHI1_DROME   Probable chitinase 1
>             Length = 508
>
> 				Score = 14.6 bits (27), Expect =  9048
> 				Identities = 4/4 (100%), Positives = 4/4 (100%)
>
> 				Query: 1   AGDK 4
> 				AGDK
> 				Sbjct: 226 AGDK 229
>
> If my signal sequence is AGDK. I want to look for matches where XXX in
> 'Length = XXX' is equal to XXX in 'Sbjct: nnn AGDK XXX'.
>
> I would not mind using this as an excuse to learn bioperl, assuming that there is a reasonable straight forward way to go about doing this in bioperl. In the BioSearch::IO HOWTO I see end('hit') and length('query') but no length('db_sequence') or something like that. Does this method exist?
> I could do
> if( (end('hit') == length('db_sequence')) && (matches('query')[0] == 4)) { print 	query_description ;}
> -Lucas
> thank you for your assistance
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

-- 
 Aaron J Mackey
 Pearson Laboratory
 University of Virginia
 (434) 924-2821
 amackey@virginia.edu


From charles at gizmotronics.dyndns.org  Sun Oct 12 12:05:13 2003
From: charles at gizmotronics.dyndns.org (Charles Plessy)
Date: Sun Oct 12 12:02:42 2003
Subject: A script to automagically generate DocBook formatted references. (was
	: Re: [Bioperl-l] Re: Bio::Biblio)
In-Reply-To: <Pine.LNX.4.44.0310101157430.20688-100000@bagheera.ebi.ac.uk>
References: <Pine.LNX.4.44.0310101157430.20688-100000@bagheera.ebi.ac.uk>
Message-ID: <20031012160513.GC1237@gizmotronics.dyndns.org>

>    Here I am ataching an example doing hopefully what you wish (and few 
> explanations labelled by '(digit)' pattern:

Thank you very much! Your script is very useful to me!

I now search for the articles'pmid on pubmed, run the script with the
pmid as an argument, and cut-paste the output in my docbook
bibliography file.

Here are my modifications. I plan some cosmetic changes, and to make
this script able to process a list of pmids.

-- 
Charles Plessy


#!/usr/bin/perl -w

#$Revision: 1.1 $

# This script was originally written by Martin Senger.
# I adapted the output fo facilitate the writing of my biology thesis.
# Charles Plessy

use Bio::Biblio;       # to read data via SOAP
use Bio::Biblio::IO;   # to convert resulting XML to Biblio objects

# (1)
# --- taking citations from MEDLINE at EBI using SOAP

my $pmID = shift;
my $biblio = new Bio::Biblio;
my $citation = $biblio->get_by_id ("$pmID");

# (2)
# --- make an instance of a converter
my $io = new Bio::Biblio::IO ('-data' => $citation);

# (3)
# --- and make the conversion (use the citation)

while (my $bibref = $io->next_bibref) {

$bibref->{_date} =~ /(\d*)/;
my $year=$1 ;

print << "DOCBOOK_START";
<biblioentry id="$bibref->{_authors}->[0]->{_lastname}-$year">
  <abbrev>$bibref->{_authors}->[0]->{_lastname} <emphasis>et al</emphasis>, $year</abbrev>
  <biblioset relation="journal">
    <titleabbrev>$bibref->{_journal}->{_medline_ta}</titleabbrev>
  </biblioset>
  <biblioset relation="article" id="pmid-$bibref->{_pmid}">
    <authorgroup>
DOCBOOK_START

# To fix : no 'et al' if 1, 2 or 3 authors
#          if multiple pmids, check unicity of biblioentry ids
#          manage languages

    foreach (@ {$$bibref{'_authors'}} ) {
        print << "DOCBOOK_AUTHORS";
      <author>
        <surname>$$_{_lastname}</surname>
        <firstname>$$_{_forename}</firstname>
      </author>
DOCBOOK_AUTHORS
    }

print << "DOCBOOK_END";
    </authorgroup>
    <title>$bibref->{_title}</title>
    <date>$bibref->{_date}</date>
    <volumenum>$bibref->{_volume}</volumenum>
    <issuenum>$bibref->{_issue}</issuenum>
    <pagenums>$bibref->{_medline_page}</pagenums>
  </biblioset>
</biblioentry>


DOCBOOK_END

# To fix    : put a nice &ndash: in <pagenums>

}


__END__

From shawnh at stanford.edu  Sun Oct 12 13:18:00 2003
From: shawnh at stanford.edu (Shawn Hoon)
Date: Sun Oct 12 13:11:44 2003
Subject: [Bioperl-l] bioperl-pipeline
In-Reply-To: <1065961003.8277.15.camel@rattus.localdomain>
References: <1065961003.8277.15.camel@rattus.localdomain>
Message-ID: <07CD0901-FCD8-11D7-95C5-000A95783436@stanford.edu>

We are still trying to resolve some dns issues. See if  
biopipe-tmp.open-bio.org
or add

65.246.187.176 www.biopipe.org

to /etc/hosts
I recommend reading the INSTALL document and some of the HOWTOs
and use the bioperl-pipeline@bioperl.org mailing list for any queries.


cheers,

shawn

On Sunday, October 12, 2003, at 5:16AM, William Kenworthy wrote:

> the link to biopipe.org on the openbioinformatics website broke some
> months back.  Has this project become "bioperl-pipeline"?
>
> Also, is there a tutorial for bioperl-pipeline, or some document that
> can give me a start on it? (what to install, module listing etc)
>
> BillK
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From zayed at sanbi.ac.za  Mon Oct 13 09:17:38 2003
From: zayed at sanbi.ac.za (Zayed Albertyn)
Date: Mon Oct 13 09:15:11 2003
Subject: [Bioperl-l] [Bioperl] LocusLink
Message-ID: <20031013150113.C89754@fling.sanbi.ac.za>

Hi

I have been trying to use Bio::SeqIO to parse Locuslink format for
obtaining all RefSeq mRNAs associated with a Locuslink ID.

I receive the following error message when I use the hash_tree method from
Bio:Annotation:
Can't locate object method "each_synonym" via package "Bio::Annotation::OntologyTerm" at OntologyTerm.pm line 187,
<GEN0> line 1.
The strange thing is that it does support the as_text() method and I can
pull values this way yet I would like to store and retrieve the values as
I like in a hash (as described in the Bio:Annotation documentation).

Is there some reason why I cannot access this "each_synonym" method and
how would I obtain the RefSeq entries if I could initialize the hash_tree()
method?

Thanks
Zayed
-----------------------------------------------
From: Zayed Albertyn
Electric Genetics PTY Ltd
Tel: +27 21 959 3645; Mobile: +2782 480 6097
www.egenetics.com
From lstein at cshl.edu  Mon Oct 13 11:14:06 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Mon Oct 13 11:11:44 2003
Subject: [BioPerl] Re: [Bioperl-l] load_gff.pl error
In-Reply-To: <001101c38f83$da88ec00$c201010a@celegans01>
References: <200310091923.h99JN3k23009@warta.bio.psu.edu>
	<1065825558.1698.90.camel@localhost.localdomain>
	<001101c38f83$da88ec00$c201010a@celegans01>
Message-ID: <200310131114.06086.lstein@cshl.edu>

I think we're all going to purgatory together....

Lincoln

On Friday 10 October 2003 07:11 pm, Sheldon McKay wrote:
> Mark is indeed correct, it was I who wanted tied filehandles.  I don't
> think third party damning is allowed, sorry Mark.
> When you get released from purgatory, please don't remove the tied
> filehandle functionality.
>
> Sheldon
>
> ----- Original Message -----
> From: "Mark Wilkinson" <markw@illuminae.com>
> To: "Lincoln Stein" <lstein@cshl.org>; "Doreen Ware" <ware@cshl.org>;
> "Sheldon McKay" <smckay@bcgsc.bc.ca>; "me" <markw@illuminae.com>
> Cc: <bioperl-l@bioperl.org>
> Sent: Friday, October 10, 2003 3:39 PM
> Subject: Re: [BioPerl] Re: [Bioperl-l] load_gff.pl error
>
> > hey... wait a minute!
> >
> > Why do *I* have to go to purgatory?!  I didn't do anything wrong - I
> > only *suggested* that the code be changed... and it wasn't even for ME!
> > It was for Sheldon - damn HIM!
> >
> > This just isn't fair at all... I hate divine thought-police   }-/
> >
> > M
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
========================================================================
Lincoln D. Stein                           Cold Spring Harbor Laboratory
lstein@cshl.org			                  Cold Spring Harbor, NY
========================================================================


From markw at illuminae.com  Mon Oct 13 13:40:23 2003
From: markw at illuminae.com (Mark Wilkinson)
Date: Mon Oct 13 13:37:43 2003
Subject: [unclassified] Re: [BioPerl] Re: [Bioperl-l] load_gff.pl error
References: <200310091923.h99JN3k23009@warta.bio.psu.edu>	<1065825558.1698.90.camel@localhost.localdomain>	<001101c38f83$da88ec00$c201010a@celegans01>
	<200310131114.06086.lstein@cshl.edu>
Message-ID: <3F8AE387.5080605@illuminae.com>

I think the correct ideom is "going to hell in a handbasket"...

Happy Thanksgiving everyone!

M

Lincoln Stein wrote:

>I think we're all going to purgatory together....
>
>Lincoln
>  
>


From hlapp at gnf.org  Mon Oct 13 16:02:39 2003
From: hlapp at gnf.org (Hilmar Lapp)
Date: Mon Oct 13 15:59:59 2003
Subject: [Bioperl-l] [Bioperl] LocusLink
In-Reply-To: <20031013150113.C89754@fling.sanbi.ac.za>
Message-ID: <BBB052EF.2075%hlapp@gnf.org>

This is due to a bug that was fixed on the main trunk in May already but not
migrated to the stable branch. I migrated (and corrected) the fix. Either
wait a few hours and update from anonymous cvs, or pull up
Bio/Annotation/OntologyTerm.pm in your favorite editor change the call to
each_synonym to get_synonyms.

The refseq Ids are not ontology terms though, they are dbxrefs. In order to
get at them you need to extract the correct dbxrefs.

    -hilmar

On 10/13/03 6:17 AM, "Zayed  Albertyn" <zayed@sanbi.ac.za> wrote:

> Hi
> 
> I have been trying to use Bio::SeqIO to parse Locuslink format for
> obtaining all RefSeq mRNAs associated with a Locuslink ID.
> 
> I receive the following error message when I use the hash_tree method from
> Bio:Annotation:
> Can't locate object method "each_synonym" via package
> "Bio::Annotation::OntologyTerm" at OntologyTerm.pm line 187,
> <GEN0> line 1.
> The strange thing is that it does support the as_text() method and I can
> pull values this way yet I would like to store and retrieve the values as
> I like in a hash (as described in the Bio:Annotation documentation).
> 
> Is there some reason why I cannot access this "each_synonym" method and
> how would I obtain the RefSeq entries if I could initialize the hash_tree()
> method?
> 
> Thanks
> Zayed
> -----------------------------------------------
> From: Zayed Albertyn
> Electric Genetics PTY Ltd
> Tel: +27 21 959 3645; Mobile: +2782 480 6097
> www.egenetics.com
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From shawnh at stanford.edu  Mon Oct 13 21:18:13 2003
From: shawnh at stanford.edu (Shawn Hoon)
Date: Mon Oct 13 21:11:41 2003
Subject: [Bioperl-l] Bio::Index::PDB
Message-ID: <4837EE60-FDE4-11D7-95C5-000A95783436@stanford.edu>

Hi,
I wrote an indexer along the lines of the sequence index modules  for 
PDB structure files.
Basically a lot of copying from Bio::Index::Fasta

Basically 2 modules added:

Bio::Index::AbstractStructure (which inherits from Bio::Index::Abstract)
Bio::Index::PDB

Usage:

     use Bio::Index::PDB;
     use strict;

     my $Index_File_Name = shift;
     my $inx = Bio::Index::PDB->new('-filename' => $Index_File_Name);
     my $out = Bio::Structure::IO->new('-format' => 'pdb','-fh' => 
\*STDOUT);

     foreach my $id (@ARGV) {
         my $struct= $inx->fetch($id); # Returns Bio::Structure object
         $out->write_structure($struct);
     }

Is this along the lines of how indexers are implemented? If so, I will 
go ahead and commit them.


shawn

From brian_osborne at cognia.com  Tue Oct 14 08:18:56 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Tue Oct 14 08:20:50 2003
Subject: [Bioperl-l] Bio::Index::PDB
In-Reply-To: <4837EE60-FDE4-11D7-95C5-000A95783436@stanford.edu>
Message-ID: <GAEDKMGOKFBLJPKCLKCCCEANDDAA.brian_osborne@cognia.com>

Shawn,

Both Bio::Index::Fasta and Bio::DB::Fasta allow you to stipulate the key you
want to use, as in:

my $inx = Bio::Index::Fasta->new( -filename   => $file_name . ".idx",
                                    -write_flag => 1 );
$inx->id_parser(\&get_id);
# make the index
$inx->make_index($file_name);

# where get_id() could look like:

# here is where the retrieval key is specified
  sub get_id {
     my $header = shift;
     $header =~ /^>.*\bsp\|([A-Z]\d{5}\b)/;
     $1;
  }

I've found this capability to be very useful myself.

Brian O.


-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Shawn Hoon
Sent: Monday, October 13, 2003 9:18 PM
To: BioPerl List
Subject: [Bioperl-l] Bio::Index::PDB

Hi,
I wrote an indexer along the lines of the sequence index modules  for
PDB structure files.
Basically a lot of copying from Bio::Index::Fasta

Basically 2 modules added:

Bio::Index::AbstractStructure (which inherits from Bio::Index::Abstract)
Bio::Index::PDB

Usage:

     use Bio::Index::PDB;
     use strict;

     my $Index_File_Name = shift;
     my $inx = Bio::Index::PDB->new('-filename' => $Index_File_Name);
     my $out = Bio::Structure::IO->new('-format' => 'pdb','-fh' =>
\*STDOUT);

     foreach my $id (@ARGV) {
         my $struct= $inx->fetch($id); # Returns Bio::Structure object
         $out->write_structure($struct);
     }

Is this along the lines of how indexers are implemented? If so, I will
go ahead and commit them.


shawn

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From jerome.eteve at it-omics.com  Tue Oct 14 11:42:07 2003
From: jerome.eteve at it-omics.com (=?ISO-8859-1?Q?J=E9r=F4me_Et=E9v=E9?=)
Date: Tue Oct 14 11:42:09 2003
Subject: [Bioperl-l] [OBDA] proxy settings.
Message-ID: <1066146197.22653.142.camel@neuron.itomics.local>

Hi everyone.
 I wonder if it is possible to set OBDA in order to make it use an http
proxy for the biofetch database flavour. This would be a great help
since many organisations do use a proxy to access the web.

On my linuxbox, I've set the 
HTTP_PROXY and the http_proxy environment variables to
'http://myproxy:8080/' and it don't seem to work.

Thanx for your help !
Regards.

-- 
--
Of course there's no reason for it, it's just our policy.
----------------------------
jerome.eteve_at_it-omics.com

From michael.watson at bbsrc.ac.uk  Tue Oct 14 12:03:01 2003
From: michael.watson at bbsrc.ac.uk (michael watson (IAH-C))
Date: Tue Oct 14 12:02:34 2003
Subject: [Bioperl-l] Codon Table
Message-ID: <20B7EB075F2D4542AFFAF813E98ACD9301C00BEB@cl-exsrv1.irad.bbsrc.ac.uk>

Hi

I want to use a different codon table than any that are present in CodonTable.pm.  Does Bioperl have a way to implement this?  Or will I have to hack CodonTable.pm?

Thanks
Mick
From cf2 at ualberta.ca  Tue Oct 14 15:58:13 2003
From: cf2 at ualberta.ca (Carlos Fernandez-Patron)
Date: Tue Oct 14 15:53:12 2003
Subject: [Bioperl-l] carlos
Message-ID: <5.1.0.14.0.20031014135619.028e6e50@pop.srv.ualberta.ca>

Lobvi,
Deje tu tarjeta en la casa.
No se si este es tu email.
Respondeme ASAP y pasame por fax una copia de los permisos de trabajos tuyo 
y de tu esposa para human resources de la Universidad.

Gracias,

Carlos

Dr. Carlos Fernandez-Patron
Heart and Stroke Foundation New Investigator
Assistant Professor
Department of Biochemistry
3-08 Medical Sciences Building
The University of Alberta
Edmonton, Alberta T6G 2H7,
Canada
--------------------------------
Lab phone: (780) 492 9540
Fax:           (780) 492 0095
e-mail:        cf2@ualberta.ca
http://www.biochem.ualberta.ca/Biochem/Faculty/CFPatron/cfp.html


From shawnh at stanford.edu  Tue Oct 14 16:11:58 2003
From: shawnh at stanford.edu (Shawn Hoon)
Date: Tue Oct 14 16:05:32 2003
Subject: [Bioperl-l] Bio::Index::PDB
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCCEANDDAA.brian_osborne@cognia.com>
References: <GAEDKMGOKFBLJPKCLKCCCEANDDAA.brian_osborne@cognia.com>
Message-ID: <AA730EA8-FE82-11D7-95C5-000A95783436@stanford.edu>

Yeah, but I needed to have it return a Bio::Structure type object so I 
needed to inherit
from a Bio::Index::AbstractStructure as opposed to a 
Bio::Index::AbstractSeq

But yes, definitely this looks like a good feature to have especially 
as I'm now finding,
some of the astral pdb type files have different headers from the 
normal pdb files..

shawn


On Tuesday, October 14, 2003, at 5:18AM, Brian Osborne wrote:

> Shawn,
>
> Both Bio::Index::Fasta and Bio::DB::Fasta allow you to stipulate the 
> key you
> want to use, as in:
>
> my $inx = Bio::Index::Fasta->new( -filename   => $file_name . ".idx",
>                                     -write_flag => 1 );
> $inx->id_parser(\&get_id);
> # make the index
> $inx->make_index($file_name);
>
> # where get_id() could look like:
>
> # here is where the retrieval key is specified
>   sub get_id {
>      my $header = shift;
>      $header =~ /^>.*\bsp\|([A-Z]\d{5}\b)/;
>      $1;
>   }
>
> I've found this capability to be very useful myself.
>
> Brian O.
>
>
> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org
> [mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Shawn Hoon
> Sent: Monday, October 13, 2003 9:18 PM
> To: BioPerl List
> Subject: [Bioperl-l] Bio::Index::PDB
>
> Hi,
> I wrote an indexer along the lines of the sequence index modules  for
> PDB structure files.
> Basically a lot of copying from Bio::Index::Fasta
>
> Basically 2 modules added:
>
> Bio::Index::AbstractStructure (which inherits from 
> Bio::Index::Abstract)
> Bio::Index::PDB
>
> Usage:
>
>      use Bio::Index::PDB;
>      use strict;
>
>      my $Index_File_Name = shift;
>      my $inx = Bio::Index::PDB->new('-filename' => $Index_File_Name);
>      my $out = Bio::Structure::IO->new('-format' => 'pdb','-fh' =>
> \*STDOUT);
>
>      foreach my $id (@ARGV) {
>          my $struct= $inx->fetch($id); # Returns Bio::Structure object
>          $out->write_structure($struct);
>      }
>
> Is this along the lines of how indexers are implemented? If so, I will
> go ahead and commit them.
>
>
> shawn
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From babenko at ncbi.nlm.nih.gov  Tue Oct 14 09:32:18 2003
From: babenko at ncbi.nlm.nih.gov (babenko)
Date: Tue Oct 14 22:15:15 2003
Subject: [Bioperl-l] Abandoned GBentries
Message-ID: <002001c39257$96a6f500$78140e82@ncbi.nlm.nih.gov>

     Hi, 

I'm using the somewhat old version of genbank. So when I assess the
results on the ncbi list, I could often see the red title:

This record was replaced or removed

E.g.:


1: AAF56887
<http://www.ncbi.nlm.nih.gov:80/entrez/viewer.fcgi?cmd=Retrieve&db=prote
in&list_uids=7301776&dopt=GenPept&term=7301776&qty=1> [gi:7301776] This
record was replaced or removed. See revision
<http://www.ncbi.nlm.nih.gov/entrez/sutils/girevhist.cgi?val=AAF56887>
history for details.

 
       Is

Is there any means to automatically trace that?

    Thanks,

        Vladimir

 
From zayed at sanbi.ac.za  Wed Oct 15 04:27:23 2003
From: zayed at sanbi.ac.za (Zayed Albertyn)
Date: Wed Oct 15 04:24:54 2003
Subject: [Bioperl-l] Bioperl parser for Gene Expression Omnibus
Message-ID: <20031015102227.I43095@fling.sanbi.ac.za>

Hi

I know that there exists a biopython module for handling GEO's SOFT format
but is there a similar module in development for bioperl. It would be
really great to have objects that handle experiment results.

Thanks
Zayed

-----------------------------------------------
From: Zayed Albertyn
Electric Genetics PTY Ltd
Tel: +27 21 959 3645; Mobile: +2782 480 6097
www.egenetics.com
From tania at egenetics.com  Wed Oct 15 06:06:10 2003
From: tania at egenetics.com (Tania Broveak Hide)
Date: Wed Oct 15 06:03:35 2003
Subject: [Bioperl-l] BioPerl programmer opening at Electric Genetics
Message-ID: <023b01c39303$f5c3f130$6402a8c0@boing>


Want to...
  live in one of the most beautiful cities in the world, 
    develop open source software
      and get PAID for it???
 
If you have:
  - 3+ years Perl coding experience
  - 1+ year experience with another programming language widely used 
    in bioinformatics open source projects (e.g., Java, Python, etc)
  - 3+ years UNIX experience (Linux or *BSD a plus)
  - participated actively in open source projects, preferably as a 
    core developer
  - great productivity and enthusiasm
  - a (community) service attitude
  - good written and verbal communication skills
 
Then you qualify for this unique position!
 
Your application is strengthen with any of the following:
  - experience with the BioPerl project
  - familiarity or existing relationship with key open source developers

    in bioinformatics
  - training or exposure to bioinformatics, genetics, molecular biology 
    or other life science discipline
  - experience using UML modeling techniques and tools.
 
Job description follows:
 
Open Source Programmer
----------------------
Build and maintain relationship with Bioinformatics open source
community through active participation (i.e., programming) in open
source projects relevant to the company's strategic aims.  Participate
actively in Bioinformatics open source community and key open source
events such as BOSC and Biohackathon.  It is expected that the Open
Source Programmer will become a 'major contributor' on the BioPerl
project and, as time permits, participate in other application or
platform projects. The open source programmer will implement the
company's BioPerl Support offering as well as provide support internally
to programming and documentation staff interacting heavily with BioPerl
and other open source projects.

 
Who are we?
-----------
Electric Genetics is an international leader in the development of
Bioinformatics tools and services, specializing in tools and techniques
for making the most of transcriptomic and gene expression data.
Electric Genetics has a strong philosophy of using and supporting open
source programs and open standards and are currently preparing technical
and user documentation for the Ensembl system.  With offices in the USA
and South Africa, Electric Genetics' software is used by over 300 key
research sites around the world.

The Open Source Programmer will be located at Electric Genetics' Cape
Town R&D centre.  

How to apply
------------
To apply, please send a CV and one example of your Perl work to
recruit@egenetics.com.  Be sure to tell us why you think you are the
best candidate for this "ideal job".  Applications are open until 10
November 2003.

From janenerz at web.de  Wed Oct 15 06:44:56 2003
From: janenerz at web.de (Christiane Nerz)
Date: Wed Oct 15 06:42:03 2003
Subject: [Bioperl-l] Does bioperl is the answer?
Message-ID: <3F8D2528.1050105@web.de>

Hi everybody!
I never worked with bioperl and even worse: I'm an absolute perl-newbie.
Does bioperl provides the key to the following problem:
I have a list with positions on bacterial-DNA. (Start-Stop-Positions of 
patterns, about 600 hits) and would like to know if the patterns are 
located within orfs or outside. So I need a programm to search hit by 
hit in an Genome-Database.
Any idea?
Jane


From jason at cgt.duhs.duke.edu  Wed Oct 15 06:55:05 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Oct 15 06:52:23 2003
Subject: [Bioperl-l] Codon Table
In-Reply-To: <20B7EB075F2D4542AFFAF813E98ACD9301C00BEB@cl-exsrv1.irad.bbsrc.ac.uk>
References: <20B7EB075F2D4542AFFAF813E98ACD9301C00BEB@cl-exsrv1.irad.bbsrc.ac.uk>
Message-ID: <Pine.LNX.4.50.0310150650010.8531-100000@tenero.duhs.duke.edu>

If the table is not there then you obviously have to hack CodonTable - you
need to add values to the @NAMES, @TABLES, @STARTS

You specify which table you want to use as
$table->id($n)
where $n is 1->23 - so you would want to add them to the bottom I suppose
but we

Is this table avaialble at
     http://www.ncbi.nlm.nih.gov/htbin-post/Taxonomy/wprintgc?mode=c

perhaps we need to update the module for new tables?


If this is a LOCAL only think you can add values to the available tables
by directly updating the value of these variables in your script

push @Bio::Tools::CodonTable::NAMES, 'NewName';
... etc (update the other values)
-jason
On Tue, 14 Oct 2003, michael watson (IAH-C) wrote:

> Hi
>
> I want to use a different codon table than any that are present in
> CodonTable.pm.  Does Bioperl have a way to implement this?  Or will I
> have to hack CodonTable.pm?
>
> Thanks
> Mick
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From daniel.lang at biologie.uni-freiburg.de  Wed Oct 15 09:15:54 2003
From: daniel.lang at biologie.uni-freiburg.de (Daniel Lang)
Date: Wed Oct 15 09:13:00 2003
Subject: [Bioperl-l] Bio::DB::Registry
Message-ID: <3F8D488A.2080401@biologie.uni-freiburg.de>

Hi,
I encountered problems using the obda registry system...
Sequence retrieval with the example script fails:
	use Bio::DB::Registry();
	my $reg = new Bio::DB::Registry();
	my $db = $reg->get_database("ppp");
	my $seq = $db->get_Seq_by_acc("PPP_3085_C1");
	print $seq->seq,"\n";
With following Exception:
	-------------------- WARNING ---------------------
	MSG: Couldn't call new_from_registry on [Bio::DB::Flat]
	
	------------- EXCEPTION  -------------
	MSG: you must specify an indexing scheme
	STACK Bio::DB::Flat::new 		
/usr/lib/perl5/site_perl/5.6.1/Bio/DB/Flat.pm:163
	STACK Bio::DB::Flat::new_from_registry
/usr/lib/perl5/site_perl/5.6.1/Bio/DB/Flat.pm:250
	STACK (eval) /usr/lib/perl5/site_perl/5.6.1/Bio/DB/Registry.pm:194
	STACK Bio::DB::Registry::_load_registry
/usr/lib/perl5/site_perl/5.6.1/Bio/DB/Registry.pm:193
	STACK Bio::DB::Registry::new
/usr/lib/perl5/site_perl/5.6.1/Bio/DB/Registry.pm:96
	STACK toplevel fetch_test.pl:8

	--------------------------------------

	---------------------------------------------------
	Can't call method "seq" on an undefined value at fetch_test.pl line 11,
<F> line 14.

I?m using latest tarball with Linux/Perl 5.61.
I indexed a flat fasta file using the bioflat_index.pl from the recent
tarball  with BDB. No Errors and warning.
I also created a /etc/bioinformatics/seqdatabase.ini as follows:
	VERSION=1.00

	[ppp]
	protocol=flat
	location=/export/flatfiles/ppp/
	dbname=ppp
The permissions for the corresponding directories should be ok also..
Any hints for me?

Thanks in advance,
	Daniel


-- 
Daniel Lang
University of Freiburg, Plant Biotechnology
Sonnenstr. 5, D-79104 Freiburg
phone: +49 761 203 6988
homepage:  http://www.plant-biotech.net/
e-mail: daniel.lang@biologie.uni-freiburg.de

#################################################
 >REALITY.SYS corrupted: Reboot universe? (Y/N/A)
#################################################

From brian_osborne at cognia.com  Wed Oct 15 10:31:44 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Wed Oct 15 10:33:44 2003
Subject: [Bioperl-l] Bio::DB::Registry
In-Reply-To: <3F8D488A.2080401@biologie.uni-freiburg.de>
Message-ID: <GAEDKMGOKFBLJPKCLKCCOECDDDAA.brian_osborne@cognia.com>

Daniel,

* MSG: you must specify an indexing scheme

My guess is that you need to set the variable OBDA_INDEX. A value of "flat"
means use the built-in scheme, a value of "bdb" means use BerkeleyDB, which
you'll have to install if it's not installed. Check out the file
doc/howto/FLAT-DATABASES-HOWTO.txt.

We probably should convert this to Docbook, make it more accessible. Did you
take a look at the OBDA_Access HOWTO
(http://bioperl.org/HOWTOs/html/OBDA_Access.html)?

Brian O.


-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Daniel Lang
Sent: Wednesday, October 15, 2003 9:16 AM
To: Bioperl-List
Subject: [Bioperl-l] Bio::DB::Registry

Hi,
I encountered problems using the obda registry system...
Sequence retrieval with the example script fails:
        use Bio::DB::Registry();
        my $reg = new Bio::DB::Registry();
        my $db = $reg->get_database("ppp");
        my $seq = $db->get_Seq_by_acc("PPP_3085_C1");
        print $seq->seq,"\n";
With following Exception:
        -------------------- WARNING ---------------------
        MSG: Couldn't call new_from_registry on [Bio::DB::Flat]

        ------------- EXCEPTION  -------------
        MSG: you must specify an indexing scheme
        STACK Bio::DB::Flat::new
/usr/lib/perl5/site_perl/5.6.1/Bio/DB/Flat.pm:163
        STACK Bio::DB::Flat::new_from_registry
/usr/lib/perl5/site_perl/5.6.1/Bio/DB/Flat.pm:250
        STACK (eval) /usr/lib/perl5/site_perl/5.6.1/Bio/DB/Registry.pm:194
        STACK Bio::DB::Registry::_load_registry
/usr/lib/perl5/site_perl/5.6.1/Bio/DB/Registry.pm:193
        STACK Bio::DB::Registry::new
/usr/lib/perl5/site_perl/5.6.1/Bio/DB/Registry.pm:96
        STACK toplevel fetch_test.pl:8

        --------------------------------------

        ---------------------------------------------------
        Can't call method "seq" on an undefined value at fetch_test.pl line
11,
<F> line 14.

I?m using latest tarball with Linux/Perl 5.61.
I indexed a flat fasta file using the bioflat_index.pl from the recent
tarball  with BDB. No Errors and warning.
I also created a /etc/bioinformatics/seqdatabase.ini as follows:
        VERSION=1.00

        [ppp]
        protocol=flat
        location=/export/flatfiles/ppp/
        dbname=ppp
The permissions for the corresponding directories should be ok also..
Any hints for me?

Thanks in advance,
        Daniel


--
Daniel Lang
University of Freiburg, Plant Biotechnology
Sonnenstr. 5, D-79104 Freiburg
phone: +49 761 203 6988
homepage:  http://www.plant-biotech.net/
e-mail: daniel.lang@biologie.uni-freiburg.de

#################################################
 >REALITY.SYS corrupted: Reboot universe? (Y/N/A)
#################################################

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From daniel.lang at biologie.uni-freiburg.de  Wed Oct 15 10:52:39 2003
From: daniel.lang at biologie.uni-freiburg.de (Daniel Lang)
Date: Wed Oct 15 10:49:40 2003
Subject: [Bioperl-l] Bio::DB::Registry
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCOECDDDAA.brian_osborne@cognia.com>
References: <GAEDKMGOKFBLJPKCLKCCOECDDDAA.brian_osborne@cognia.com>
Message-ID: <200310151652.43405.daniel.lang@biologie.uni-freiburg.de>

Thanks Brian, but still not working:(

On Wednesday 15 October 2003 16:31, Brian Osborne wrote:
> My guess is that you need to set the variable OBDA_INDEX. A value of "flat"
> means use the built-in scheme, a value of "bdb" means use BerkeleyDB, which
I tried both flat and bdb with the corresponding  OBDA_INDEX enviroment 
variable  setting.
 
> (http://bioperl.org/HOWTOs/html/OBDA_Access.html)?
Yes I configured everything correspondingly.

I was also confused, that in the current cvs the bioflat_index.pl script can 
only be found in scripts/DB/attic and not in examples/db?
So the version I?m using is the one from attic...

Maybe somehing is wrong with my config.dat:
index   flat/1
fileid_0        /export/flatfiles/ppp.seq       13155893
primary_namespace       ACC
secondary_namespaces    ID
format  URN:LSID:open-bio.org:fasta

Thanks in advance,
Daniel

  
-- 
Daniel Lang
University of Freiburg, Plant Biotechnology
Sonnenstr. 5, D-79104 Freiburg
phone: +49 761 203 6988
homepage:  http://www.plant-biotech.net/
e-mail: daniel.lang@biologie.uni-freiburg.de

#################################################
>REALITY.SYS corrupted: Reboot universe? (Y/N/A)
#################################################


From brian_osborne at cognia.com  Wed Oct 15 11:13:09 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Wed Oct 15 11:15:08 2003
Subject: [Bioperl-l] Bio::DB::Registry
In-Reply-To: <200310151652.43405.daniel.lang@biologie.uni-freiburg.de>
Message-ID: <GAEDKMGOKFBLJPKCLKCCMECFDDAA.brian_osborne@cognia.com>

Daniel,

Hmm. Here's a config.dat that worked for me:

index   BerkeleyDB/1
format  URN:LSID:open-bio.org:fasta/protein
fileid_0        /home/bosborne/programming/perl/Bioperl/out.fa  7405
primary_namespace       ACC
secondary_namespaces

Note that I'm saying "fasta/protein", not simply "fasta". Don't know if it's
important.

I'll look into bioflat_index.pl.

Brian O.


-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Daniel Lang
Sent: Wednesday, October 15, 2003 10:53 AM
To: Brian Osborne; Bioperl-List
Subject: Re: [Bioperl-l] Bio::DB::Registry

Thanks Brian, but still not working:(

On Wednesday 15 October 2003 16:31, Brian Osborne wrote:
> My guess is that you need to set the variable OBDA_INDEX. A value of
"flat"
> means use the built-in scheme, a value of "bdb" means use BerkeleyDB,
which
I tried both flat and bdb with the corresponding  OBDA_INDEX enviroment
variable  setting.

> (http://bioperl.org/HOWTOs/html/OBDA_Access.html)?
Yes I configured everything correspondingly.

I was also confused, that in the current cvs the bioflat_index.pl script can
only be found in scripts/DB/attic and not in examples/db?
So the version I?m using is the one from attic...

Maybe somehing is wrong with my config.dat:
index   flat/1
fileid_0        /export/flatfiles/ppp.seq       13155893
primary_namespace       ACC
secondary_namespaces    ID
format  URN:LSID:open-bio.org:fasta

Thanks in advance,
Daniel


--
Daniel Lang
University of Freiburg, Plant Biotechnology
Sonnenstr. 5, D-79104 Freiburg
phone: +49 761 203 6988
homepage:  http://www.plant-biotech.net/
e-mail: daniel.lang@biologie.uni-freiburg.de

#################################################
>REALITY.SYS corrupted: Reboot universe? (Y/N/A)
#################################################


_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From rjdrouse at patentinformatics.com  Wed Oct 15 17:36:10 2003
From: rjdrouse at patentinformatics.com (Richard Rouse)
Date: Wed Oct 15 17:33:30 2003
Subject: [Bioperl-l] installation problems
Message-ID: <ADEFIHIJAHBGLBNJCIADAEMGCCAA.rjdrouse@patentinformatics.com>

Dear bioperl-db,

I have mysql 3.23, slackware 8.1 and bioperl-1.2.2 I recently installed
bioperl-db and it was working nicely until I updated perl from 5.8.0 to
5.8.1. Now I get errors when running make test and have problems with the
load_seqdatabase.pl script:

Using this command I get the following problem:

perl ./load_seqdatabase.pl --host localhost --dbname biosql2 --dbuser
richard --dbpass xxxx --lookup --safe --format genbank patent
/home/richard/bioperl-1.2.2/bioperl-db/scripts/biosql/db/gbpat1.seq


-------------------- WARNING ---------------------
MSG: failed to load dbadaptor:
------------- EXCEPTION  -------------
MSG: Failed to load module Bio::DB::BioSQL::DBAdaptor. Can't locate
Bio/DB/DBAdaptorI.pm in @INC (@INC contains:
/usr/local/lib/perl5/5.8.1/i686-linux /usr/local/lib/perl5/5.8.1
/usr/local/lib/perl5/site_perl/5.8.1/i686-linux
/usr/local/lib/perl5/site_perl/5.8.1 /usr/local/lib/perl5/site_perl .) at
/usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioSQL/DBAdaptor.pm line 58.
BEGIN failed--compilation aborted at
/usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioSQL/DBAdaptor.pm line 58.
Compilation failed in require at
/usr/local/lib/perl5/site_perl/5.8.1/Bio/Root/Root.pm line 407.

STACK Bio::Root::Root::_load_module
/usr/local/lib/perl5/site_perl/5.8.1/Bio/Root/Root.pm:409
STACK (eval) /usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:155
STACK Bio::DB::BioDB::_load_dbadaptor
/usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:154
STACK Bio::DB::BioDB::new
/usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:121
STACK toplevel ./load_seqdatabase.pl:353

--------------------------------------


------------- EXCEPTION  -------------
MSG: Failed to load module Bio::DB::BioSQL::dbadaptor. Can't locate
Bio/DB/BioSQL/dbadaptor.pm in @INC (@INC contains:
/usr/local/lib/perl5/5.8.1/i686-linux /usr/local/lib/perl5/5.8.1
/usr/local/lib/perl5/site_perl/5.8.1/i686-linux
/usr/local/lib/perl5/site_perl/5.8.1 /usr/local/lib/perl5/site_perl .) at
/usr/local/lib/perl5/site_perl/5.8.1/Bio/Root/Root.pm line 407.

STACK Bio::Root::Root::_load_module
/usr/local/lib/perl5/site_perl/5.8.1/Bio/Root/Root.pm:409
STACK (eval) /usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:155
STACK Bio::DB::BioDB::_load_dbadaptor
/usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:154
STACK Bio::DB::BioDB::new
/usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:121
STACK toplevel ./load_seqdatabase.pl:353

--------------------------------------

---------------------------------------------------

------------- EXCEPTION  -------------
MSG: fatal: unable to load DBAdaptor for database:
Bio::DB::BioSQL::{DBAdaptor,dbadaptor} all failed to load
STACK Bio::DB::BioDB::new
/usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:122
STACK toplevel ./load_seqdatabase.pl:353

--------------------------------------

Richard Rouse JD
Rouse Patent Services
USPTO Registered Patent Agent
P.O. Box 948586
La Jolla, CA 92037
web: www.patentinformatics.com
email: rjdrouse@patentinformatics.com
phone: 619-846-8291


From jason at cgt.duhs.duke.edu  Wed Oct 15 17:41:53 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Oct 15 17:39:02 2003
Subject: [Bioperl-l] installation problems
In-Reply-To: <ADEFIHIJAHBGLBNJCIADAEMGCCAA.rjdrouse@patentinformatics.com>
References: <ADEFIHIJAHBGLBNJCIADAEMGCCAA.rjdrouse@patentinformatics.com>
Message-ID: <Pine.LNX.4.50.0310151741230.11591-100000@tenero.duhs.duke.edu>

You upgraded your perl - did you reinstall Bioperl?

You might as well get bioperl 1.2.3 as well as it has several bugfixes.

-jason
On Wed, 15 Oct 2003, Richard Rouse wrote:

> Dear bioperl-db,
>
> I have mysql 3.23, slackware 8.1 and bioperl-1.2.2 I recently installed
> bioperl-db and it was working nicely until I updated perl from 5.8.0 to
> 5.8.1. Now I get errors when running make test and have problems with the
> load_seqdatabase.pl script:
>
> Using this command I get the following problem:
>
> perl ./load_seqdatabase.pl --host localhost --dbname biosql2 --dbuser
> richard --dbpass xxxx --lookup --safe --format genbank patent
> /home/richard/bioperl-1.2.2/bioperl-db/scripts/biosql/db/gbpat1.seq
>
>
>
> -------------------- WARNING ---------------------
> MSG: failed to load dbadaptor:
> ------------- EXCEPTION  -------------
> MSG: Failed to load module Bio::DB::BioSQL::DBAdaptor. Can't locate
> Bio/DB/DBAdaptorI.pm in @INC (@INC contains:
> /usr/local/lib/perl5/5.8.1/i686-linux /usr/local/lib/perl5/5.8.1
> /usr/local/lib/perl5/site_perl/5.8.1/i686-linux
> /usr/local/lib/perl5/site_perl/5.8.1 /usr/local/lib/perl5/site_perl .) at
> /usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioSQL/DBAdaptor.pm line 58.
> BEGIN failed--compilation aborted at
> /usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioSQL/DBAdaptor.pm line 58.
> Compilation failed in require at
> /usr/local/lib/perl5/site_perl/5.8.1/Bio/Root/Root.pm line 407.
>
> STACK Bio::Root::Root::_load_module
> /usr/local/lib/perl5/site_perl/5.8.1/Bio/Root/Root.pm:409
> STACK (eval) /usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:155
> STACK Bio::DB::BioDB::_load_dbadaptor
> /usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:154
> STACK Bio::DB::BioDB::new
> /usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:121
> STACK toplevel ./load_seqdatabase.pl:353
>
> --------------------------------------
>
>
> ------------- EXCEPTION  -------------
> MSG: Failed to load module Bio::DB::BioSQL::dbadaptor. Can't locate
> Bio/DB/BioSQL/dbadaptor.pm in @INC (@INC contains:
> /usr/local/lib/perl5/5.8.1/i686-linux /usr/local/lib/perl5/5.8.1
> /usr/local/lib/perl5/site_perl/5.8.1/i686-linux
> /usr/local/lib/perl5/site_perl/5.8.1 /usr/local/lib/perl5/site_perl .) at
> /usr/local/lib/perl5/site_perl/5.8.1/Bio/Root/Root.pm line 407.
>
> STACK Bio::Root::Root::_load_module
> /usr/local/lib/perl5/site_perl/5.8.1/Bio/Root/Root.pm:409
> STACK (eval) /usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:155
> STACK Bio::DB::BioDB::_load_dbadaptor
> /usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:154
> STACK Bio::DB::BioDB::new
> /usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:121
> STACK toplevel ./load_seqdatabase.pl:353
>
> --------------------------------------
>
> ---------------------------------------------------
>
> ------------- EXCEPTION  -------------
> MSG: fatal: unable to load DBAdaptor for database:
> Bio::DB::BioSQL::{DBAdaptor,dbadaptor} all failed to load
> STACK Bio::DB::BioDB::new
> /usr/local/lib/perl5/site_perl/5.8.1/Bio/DB/BioDB.pm:122
> STACK toplevel ./load_seqdatabase.pl:353
>
> --------------------------------------
>
> Richard Rouse JD
> Rouse Patent Services
> USPTO Registered Patent Agent
> P.O. Box 948586
> La Jolla, CA 92037
> web: www.patentinformatics.com
> email: rjdrouse@patentinformatics.com
> phone: 619-846-8291
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From hlapp at gmx.net  Thu Oct 16 03:12:27 2003
From: hlapp at gmx.net (Hilmar Lapp)
Date: Thu Oct 16 03:09:44 2003
Subject: [Bioperl-l] installation problems
In-Reply-To: <Pine.LNX.4.50.0310151741230.11591-100000@tenero.duhs.duke.edu>
Message-ID: <195AB679-FFA8-11D7-AD4B-000A959EB4C4@gmx.net>


On Wednesday, October 15, 2003, at 02:41  PM, Jason Stajich wrote:

> You upgraded your perl - did you reinstall Bioperl?
>

You'll also need to reinstall bioperl-db obviously.

	-hilmar

-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From William.Cannon at pnl.gov  Thu Oct 16 18:28:50 2003
From: William.Cannon at pnl.gov (Cannon, William R)
Date: Thu Oct 16 18:26:17 2003
Subject: [Bioperl-l] RemoteBlast: can't get a TaxBlast report
Message-ID: <62AE0CF1D4875C4BBDEC29DB9924ACE88B1D44@pnlmse25.pnl.gov>

Hi,
I tried running RemoteBlast using the synopsis script found on at http://doc.bioperl.org/releases/bioperl-1.2/Bio/Tools/Run/RemoteBlast.html. Everything worked fine, as is. However, when I try to get a Taxonomy report by adding the line:
$Bio::Tools::Run::RemoteBlast::HEADER{'FORMAT_OBJECT'} = 'TaxBlast';

I still get the alignment report back. Anybody know what else I should be doing?

Thanks.
Bill

(Using :
perl 5.6
bioperl 1.2.3 on redhat linux 7.2.

Here is the full script:
#! /msrc/apps/perl-5.6/bin/perl

  #Remote-blast "factory object" creation and blast-parameter initialization

  use Bio::Tools::Run::RemoteBlast;
  use strict;
  my $prog = 'blastp';
  my $db   = 'nr';
  my $e_val= '1.0';

  my @params = ( '-prog' => $prog,
         '-data' => $db,
         '-expect' => $e_val,
         '-readmethod' => 'SearchIO' );

  my $factory = Bio::Tools::Run::RemoteBlast->new(@params);

  #change a paramter
  $Bio::Tools::Run::RemoteBlast::HEADER{'ENTREZ_QUERY'} = 'Homo sapiens [ORGN]';
  $Bio::Tools::Run::RemoteBlast::HEADER{'FORMAT_OBJECT'} = 'TaxBlast';

  #remove a parameter
  delete $Bio::Tools::Run::RemoteBlast::RETRIEVALHEADER{'ALIGNMENT_VIEW'};
  delete $Bio::Tools::Run::RemoteBlast::RETRIEVALHEADER{'DESCRIPTIONS'};
  delete $Bio::Tools::Run::RemoteBlast::RETRIEVALHEADER{'ALIGNMENTS'};

  my $v = 1;
  #$v is just to turn on and off the messages
  
#  my $str = Bio::SeqIO->new(-file=>'amino.fa' , '-format' => 'fasta' );
  my $str = Bio::SeqIO->new(-file=>'roa1.fasta' , '-format' => 'fasta' );

  while (my $input = $str->next_seq()){
    #Blast a sequence against a database:

    #Alternatively, you could  pass in a file with many
    #sequences rather than loop through sequence one at a time
    #Remove the loop starting 'while (my $input = $str->next_seq())'
    #and swap the two lines below for an example of that.
    print STDERR "%s\n", $Bio::Tools::Run::RemoteBlast::HEADER{'FORMAT_OBJECT'};
    my $r = $factory->submit_blast($input);
    #my $r = $factory->submit_blast('amino.fa');

    print STDERR "waiting..." if( $v > 0 );
    while ( my @rids = $factory->each_rid ) {
      foreach my $rid ( @rids ) {
        my $rc = $factory->retrieve_blast($rid);
        if( !ref($rc) ) {
          if( $rc < 0 ) {
            $factory->remove_rid($rid);
          }
          print STDERR "." if ( $v > 0 );
          sleep 5;
        } else {
          my $result = $rc->next_result();
          #save the output
          my $filename = $result->query_name()."\.out";
          $factory->save_output($filename);
          $factory->remove_rid($rid);
          print "\nQuery Name: ", $result->query_name(), "\n";
          while ( my $hit = $result->next_hit ) {
            next unless ( $v > 0);
            print "\thit name is ", $hit->name, "\n";
            while( my $hsp = $hit->next_hsp ) {
              print "\t\tscore is ", $hsp->score, "\n";
            }
          }
        }
      }
    }
  }


From jason at cgt.duhs.duke.edu  Thu Oct 16 20:19:13 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Thu Oct 16 20:16:19 2003
Subject: [Bioperl-l] need a devloper for RemoteBlast (was RemoteBlast: can't
 get a TaxBlast report)
In-Reply-To: <62AE0CF1D4875C4BBDEC29DB9924ACE88B1D44@pnlmse25.pnl.gov>
References: <62AE0CF1D4875C4BBDEC29DB9924ACE88B1D44@pnlmse25.pnl.gov>
Message-ID: <Pine.LNX.4.50.0310162013001.20891-100000@tenero.duhs.duke.edu>

I really have no idea how to parse this so I suspect SearchIO doesn't
either.  This is not something we have ever tried to support because it
isn't in part of the standard output from stand alone blast reports.

Frankly this module started as a hack to provide and illustrate minimal
access just to running BLAST on a remote server such as NCBI when the cgi
page itsself was pretty minimal.

If people want to do more complicated stuff the module needs a dedicated
soul actively developing it who will adapt it to the new features NCBI is
providing.  In case it isn't clear, I am not going to be that person any
time soon.

Currently there are also new features which allow sending multiple
sequences in and using discontinuous megablast which will allow, for
example, in parallel searching of multiple sequences reducing the total
search time to a fraction of the time versus compared to each query
in-serial.  This would be a big time save against things like dbest.

(I bang this drum every few months so it isn't directed at you Bill.)

-jason
On Thu, 16 Oct 2003, Cannon, William R wrote:

> Hi, I tried running RemoteBlast using the synopsis script found on at
> http://doc.bioperl.org/releases/bioperl-1.2/Bio/Tools/Run/RemoteBlast.html.
> Everything worked fine, as is. However, when I try to get a Taxonomy
> report by adding the line:
> $Bio::Tools::Run::RemoteBlast::HEADER{'FORMAT_OBJECT'} = 'TaxBlast';
>
> I still get the alignment report back. Anybody know what else I should
> be doing?
>
> Thanks.
> Bill
>
> (Using :
> perl 5.6
> bioperl 1.2.3 on redhat linux 7.2.
>
> Here is the full script:
> #! /msrc/apps/perl-5.6/bin/perl
>
>   #Remote-blast "factory object" creation and blast-parameter initialization
>
>   use Bio::Tools::Run::RemoteBlast;
>   use strict;
>   my $prog = 'blastp';
>   my $db   = 'nr';
>   my $e_val= '1.0';
>
>   my @params = ( '-prog' => $prog,
>          '-data' => $db,
>          '-expect' => $e_val,
>          '-readmethod' => 'SearchIO' );
>
>   my $factory = Bio::Tools::Run::RemoteBlast->new(@params);
>
>   #change a paramter
>   $Bio::Tools::Run::RemoteBlast::HEADER{'ENTREZ_QUERY'} = 'Homo sapiens [ORGN]';
>   $Bio::Tools::Run::RemoteBlast::HEADER{'FORMAT_OBJECT'} = 'TaxBlast';
>
>   #remove a parameter
>   delete $Bio::Tools::Run::RemoteBlast::RETRIEVALHEADER{'ALIGNMENT_VIEW'};
>   delete $Bio::Tools::Run::RemoteBlast::RETRIEVALHEADER{'DESCRIPTIONS'};
>   delete $Bio::Tools::Run::RemoteBlast::RETRIEVALHEADER{'ALIGNMENTS'};
>
>   my $v = 1;
>   #$v is just to turn on and off the messages
>
> #  my $str = Bio::SeqIO->new(-file=>'amino.fa' , '-format' => 'fasta' );
>   my $str = Bio::SeqIO->new(-file=>'roa1.fasta' , '-format' => 'fasta' );
>
>   while (my $input = $str->next_seq()){
>     #Blast a sequence against a database:
>
>     #Alternatively, you could  pass in a file with many
>     #sequences rather than loop through sequence one at a time
>     #Remove the loop starting 'while (my $input = $str->next_seq())'
>     #and swap the two lines below for an example of that.
>     print STDERR "%s\n", $Bio::Tools::Run::RemoteBlast::HEADER{'FORMAT_OBJECT'};
>     my $r = $factory->submit_blast($input);
>     #my $r = $factory->submit_blast('amino.fa');
>
>     print STDERR "waiting..." if( $v > 0 );
>     while ( my @rids = $factory->each_rid ) {
>       foreach my $rid ( @rids ) {
>         my $rc = $factory->retrieve_blast($rid);
>         if( !ref($rc) ) {
>           if( $rc < 0 ) {
>             $factory->remove_rid($rid);
>           }
>           print STDERR "." if ( $v > 0 );
>           sleep 5;
>         } else {
>           my $result = $rc->next_result();
>           #save the output
>           my $filename = $result->query_name()."\.out";
>           $factory->save_output($filename);
>           $factory->remove_rid($rid);
>           print "\nQuery Name: ", $result->query_name(), "\n";
>           while ( my $hit = $result->next_hit ) {
>             next unless ( $v > 0);
>             print "\thit name is ", $hit->name, "\n";
>             while( my $hsp = $hit->next_hsp ) {
>               print "\t\tscore is ", $hsp->score, "\n";
>             }
>           }
>         }
>       }
>     }
>   }
>
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From wes.barris at csiro.au  Fri Oct 17 01:48:17 2003
From: wes.barris at csiro.au (Wes Barris)
Date: Fri Oct 17 01:45:57 2003
Subject: [Bioperl-l] How do you create a genbank file?
Message-ID: <3F8F82A1.5060504@csiro.au>

Hi,

I need to convert a set of fasta sequences into a set of genbank entries.
This is easy to do except that there are a few subtle parts of the genbank
format that I have not figured out how to set yet using bioperl.

Here is an example of what I have been able to produce:

LOCUS       AB050006                 588 bp    dna     linear   UNK
DEFINITION  Bos taurus bASC mRNA for apoptosis-associated speck-like protein
             containing a CARD, complete cds.
ACCESSION   AB050006
FEATURES             Location/Qualifiers
      source          1..588
                      /organism="Bos taurus"
.
.
.

Here is an example of what I would like it too look like:

LOCUS       AB050006                 588 bp    mRNA     linear   MAM
DEFINITION  Bos taurus bASC mRNA for apoptosis-associated speck-like protein
             containing a CARD, complete cds.
ACCESSION   AB050006
VERSION     AB050006.1  GI:26453358
SOURCE      Bos taurus (cow)
   ORGANISM  Bos taurus
FEATURES             Location/Qualifiers
      source          1..588
                      /organism="Bos taurus"
.
.
.

My question is how do I set the following?

mRNA (instead of dna)
MAM (instead of UNK)
VERSION     AB050006.1  GI:26453358		<- I can't get this line to appear
SOURCE      Bos taurus (cow)
   ORGANISM  Bos taurus


Here is a portion of my code:

my $seq_in  = Bio::SeqIO->new( -format => 'fasta', -file => $infile);
my $seq_out = Bio::SeqIO->new( -format => 'genbank',   -file => ">$outfile");
while (my $seq = $seq_in->next_seq()) {
    my ($gi, $accession, $clone, $clonelib, $length, $file, $direction, $description, 
$tissuetype, $organism) = &parseStackDefline($seq->description());
    $seq->desc($description);
    $seq->accession_number($accession);
    $seq->primary_id($gi);
    my $feat = Bio::SeqFeature::Generic->new(-primary=>'source', -start=>1, -end=>$length);
    $feat->add_tag_value('clone',$clone)           if ($clone ne 'unknown');
    $feat->add_tag_value('clonelib',$clonelib)     if ($clonelib ne 'unknown');
    $feat->add_tag_value('tissuetype',$tissuetype) if ($tissuetype ne 'unknown');
    $feat->add_tag_value('organism',$organism)     if ($organism ne 'unknown');
    $seq->add_SeqFeature($feat);

    $seq_out->write_seq($seq);
    }

-- 
Wes Barris
E-Mail: Wes.Barris@csiro.au


From Marc.Logghe at devgen.com  Fri Oct 17 05:49:03 2003
From: Marc.Logghe at devgen.com (Marc Logghe)
Date: Fri Oct 17 05:46:23 2003
Subject: [Bioperl-l] How do you create a genbank file?
Message-ID: <BEE28BF86078B6429D6C780635718E2103955F@morelia.be.devgen.com>

> My question is how do I set the following?
> 
> mRNA (instead of dna)
> MAM (instead of UNK)
> VERSION     AB050006.1  GI:26453358		<- I can't get 
> this line to appear
> SOURCE      Bos taurus (cow)
>    ORGANISM  Bos taurus
> 
> 
To set the version you should use:
$seq->seq_version($version);  # $version is e.g. 1
Problem is, it is not possible to set the GI number. As far as I know, when you pass a genbank file, Bio::SeqIO does not even  parse it, at least it does not show up when you Data::Dump the resulting Bio::Seq::RichSeq object.
There is no slot for that information, because it does not exist in e.g. an EMBL sequence record.
Concerning the organism, first create the Bio::Species object. In case you only have the string 'Bos taurus' in your fasta, of course you can not generate the full classification. At least not using only your fasta data.
my $species = Bio::Species->new([reverse split /\s/, $organism]);
$seq->species($species);

gives you:
SOURCE      Bos taurus
  ORGANISM  Bos taurus
            Bos.

HTH,
Marc

From heikki at ebi.ac.uk  Fri Oct 17 06:20:34 2003
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Fri Oct 17 06:18:04 2003
Subject: [Bioperl-l] Bioperl Developer snapshot 1.3.02
Message-ID: <1066386033.2446.128.camel@localhost>


   Bioperl developer snap shot  1.3.02
   ---------------------------------


This is the second developer snap shot from the BioPerl CVS head
that will eventually lead to release 1.4.

http://bioperl.org/DIST/current_core_unstable.tar.gz
http://bioperl.org/DIST/bioperl-1.3.02.tar.gz


Last fortnight has been dominated by bug fixing (but more is needed; see
bugzilla). The only new stuff added has been on comparison matrix
modules by Stafan Kirov.

The most important fixes have been tweaks in ontology handling, GFF
histograms, some Bio::Graphics glyphs, feature table output and
Gene/Genomewise parsing. The full list is below.

Enjoy,

	-Heikki


NEW DIRECTORIES and FILES
=========================

Bio::Matrix::PSM
  IO InstanceSite InstanceSiteI Psm PsmHeader 
  PsmHeaderI PsmI SiteMatrix SiteMatrixI 

  Added DNA PSM (Position Scoring Matrix) modules This includes meme,
  mast and transfac parsers

examples/rnai_finder.cgi 
  cgi script for RNAi reagent design


CHANGES
=======

Bio::DB::GFF::Adaptor::dbi 
  fixed bug 1384; histograms not working

Bio::DB::GFF::Feature
bulk_load_gff.PLS fast_load_gff.PLS load_gff.PLS
  fixed overly-promiscuous regexp for detecting FASTA files

Bio::DB::Query::WebQuery 
  quell undef warnings

Bio::DB::WebDBSeqI 
  reset MODVERSION to use Packagewide Version system - change agent
  string to be 'bioperl-Bio_DB_GenBank/1.3' for example

Bio::Graphics::FeatureFile 
  All functionality moved to Bio::DB::GFF->split_group, however still
  see major perf problems, not sure if this is localized to
  FeatureFile or not

Bio::Graphics::Glyph::Factory 
  fixed implementation of user-defined glyph sort routines

Bio::Graphics::Glyph::arrow
Bio::Graphics::Glyph
  partial fix for incorrectly-rendered half-open intervals

Bio::Index::Abstract 
  don't warn if we set verbose to -1

Bio::Location::Simple::to_FTstring()
  was never returning a location of the form "complement(12345)

Bio::Ontology::SimpleGOEngine 
Bio::OntologyIO::dagflat 
Bio::Expression::FeatureGroup 
  allowing an underscore in ontology IDs.  this is necessary to be
  able to parse cjm's OBO_REL relationship ontology, which otherwise
  observes DAG-Edit format

Bio::Ontology::SimpleGOEngine
Bio::OntologyIO::dagflat.pm 
  MPATH ontology doesn't have at least 3 digits in all identifiers

Bio::SearchIO::(chado&chadosxpr)
  removed as redundant

Bio::SeqFeature::Tools::TypeMapper 
  updated type sequence_variant to reflect SO

Bio::SeqIO::chadoxml 
  fix bug of null srcfeature_id for featureloc when srcfeature is not
  given on write_seq() invocation; fix bug of not calling unflattener
  when -seq_so_type is explicitly supplied.

  fixed bug 1532 with regard to filehandle; modified code to eliminate
  warnings when run with 'perl -w'.
  debugging code clean up

Bio::SearchIO::wise 
  was swapping query/target improperly

Bio::Restriction::IO::base 
  fix suggested in bug report #1538, correctly parse negative cut site
  locations

Bio::Tools::Phylo::PAML::Result
  doc fixes

+ several minor fixes to quiet warnings when Test::Harness runs tests
with 'use warnings'


-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________

From heikki at ebi.ac.uk  Fri Oct 17 06:47:35 2003
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Fri Oct 17 06:45:02 2003
Subject: [Bioperl-l] bioperl mirror at iubio.bio.indiana.edu now daily
	updated
In-Reply-To: <01993920-E940-11D7-AB8B-000393B8D01C@indiana.edu>
References: <01993920-E940-11D7-AB8B-000393B8D01C@indiana.edu>
Message-ID: <1066387655.7927.4.camel@localhost>

Thanks Don,

I changed the text on the page. Sorry for not reacting earlier.
Your mirror is much appreciated.

	-Heikki

On Wed, 2003-09-17 at 19:51, Don Gilbert wrote:
> Re: http://www.bioperl.org/Core/Latest/index.shtml
> MIRRORS:
> iubio.bio.indiana.edu - Updates on a 20 day cycle so may not always be 
> current.
> 
> I updated IUBio mirroring of this to daily, in case you all start 
> cranking
> out new revisions.
> 
> -- Don
> 
> -- d.gilbert--bioinformatics--indiana-u--bloomington-in-47405
> -- gilbertd@indiana.edu -- http://marmot.bio.indiana.edu/
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________

From jason at cgt.duhs.duke.edu  Fri Oct 17 08:27:27 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Oct 17 08:25:00 2003
Subject: [Bioperl-l] How do you create a genbank file?
In-Reply-To: <BEE28BF86078B6429D6C780635718E2103955F@morelia.be.devgen.com>
References: <BEE28BF86078B6429D6C780635718E2103955F@morelia.be.devgen.com>
Message-ID: <Pine.LNX.4.50.0310170821030.24935-100000@tenero.duhs.duke.edu>

If you are parsing Fasta files with properly set NCBI headers I thought we
had added a way to make this all get set properly, perhaps not...

You can set GI number with
$seq->primary_id($ginumber);

If you want to be creating RichSeq instead of Bio::Seq objects (in the
event you want to set some fields which are only available in RichSeq
objects, initialize your  Bio::SeqIO fasta parser like this:

use Bio::SeqIO;
use Bio::Seq::SeqFactory;
my $seqio = new Bio::SeqIO(-format => 'fasta',
			   -file   => $file,
			   -seqfactory => new Bio::Seq::SeqFactory
			( -type => 'Bio::Seq::RichSeq'));

(Or alternatively you can set the seqfactory after you have initialized
the SeqIO object with
$seqio->sequence_factory(new Bio::Seq::SeqFactory(-type =>
'Bio::Seq::RichSeq'));

-jason

On Fri, 17 Oct 2003, Marc Logghe wrote:

> > My question is how do I set the following?
> >
> > mRNA (instead of dna)
> > MAM (instead of UNK)
> > VERSION     AB050006.1  GI:26453358		<- I can't get
> > this line to appear
> > SOURCE      Bos taurus (cow)
> >    ORGANISM  Bos taurus
> >
> >
> To set the version you should use:
> $seq->seq_version($version);  # $version is e.g. 1
> Problem is, it is not possible to set the GI number. As far as I know, when you pass a genbank file, Bio::SeqIO does not even  parse it, at least it does not show up when you Data::Dump the resulting Bio::Seq::RichSeq object.
> There is no slot for that information, because it does not exist in e.g. an EMBL sequence record.
> Concerning the organism, first create the Bio::Species object. In case you only have the string 'Bos taurus' in your fasta, of course you can not generate the full classification. At least not using only your fasta data.
> my $species = Bio::Species->new([reverse split /\s/, $organism]);
> $seq->species($species);
>
> gives you:
> SOURCE      Bos taurus
>   ORGANISM  Bos taurus
>             Bos.
>
> HTH,
> Marc
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From Marc.Logghe at devgen.com  Fri Oct 17 08:38:06 2003
From: Marc.Logghe at devgen.com (Marc Logghe)
Date: Fri Oct 17 08:35:23 2003
Subject: [Bioperl-l] How do you create a genbank file?
Message-ID: <BEE28BF86078B6429D6C780635718E21039560@morelia.be.devgen.com>


> You can set GI number with
> $seq->primary_id($ginumber);

True. You can set it up in the object but it is not shown in the genbank dump. And, also the way around, when a genbank record is parsed containing the line 'VERSION     AB050006.1  GI:26453358' it does not get into the Bio::Seq::RichSeq object. As far as I can see.

> 
> If you want to be creating RichSeq instead of Bio::Seq objects (in the
> event you want to set some fields which are only available in RichSeq
> objects, initialize your  Bio::SeqIO fasta parser like this:
> 
> use Bio::SeqIO;
> use Bio::Seq::SeqFactory;
> my $seqio = new Bio::SeqIO(-format => 'fasta',
> 			   -file   => $file,
> 			   -seqfactory => new Bio::Seq::SeqFactory
> 			( -type => 'Bio::Seq::RichSeq'));
That is a really cool one. Never used it like this. Thanks
ML

From jason at cgt.duhs.duke.edu  Fri Oct 17 08:58:08 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Oct 17 08:55:25 2003
Subject: [Bioperl-l] How do you create a genbank file?
In-Reply-To: <BEE28BF86078B6429D6C780635718E21039560@morelia.be.devgen.com>
References: <BEE28BF86078B6429D6C780635718E21039560@morelia.be.devgen.com>
Message-ID: <Pine.LNX.4.50.0310170851310.25151-100000@tenero.duhs.duke.edu>

The should all be captured when you parse a genbank file (and be
available when you write a genbank file) that is how we can round trip
the format and have it be 99% identical.

gi number goes to primary_id()
accession number goes to accession_number()
version goes to seq_version() and version() (not sure why we have 2 methods here???)

  # From Bio::SeqIO::genbank
  if( /^ACCESSION\s+(\S.*\S)/ ) {
	      push(@acc, split(/\s+/,$1));
	      while( defined($_ = $self->_readline) ) {
		  /^\s+(.*)/ && do { push (@acc, split(/\s+/,$1)); next };
		  last;
	      }
	      $buffer = $_;
	      next;
	  }
  ....
  elsif( /^VERSION\s+(.+)$/ ) {
	      my ($acc,$gi) = split(' ',$1);
	      if($acc =~ /^\w+\.(\d+)/) {
		  $params{'-version'} = $1;
		  $params{'-seq_version'} = $1;
	      }
	      if($gi && (index($gi,"GI:") == 0)) {
		  $params{'-primary_id'} = substr($gi,3);
	      }
	  }

   the first item in @acc is passed to the appropriate accession_number
   slot and the rest are passed to secondard accessions slot.

   The %params hash gets passed to the appropriate Seq object
through the factory.  The arguments are passed through the hierarchy

   SEQ_VERSION is consumed by Seq::RichSeq
   VERSION     is consumed by Seq::PrimarySeq
   PRIMARY_ID  is consumed by Seq::PrimarySeq

-jason

On Fri, 17 Oct 2003, Marc Logghe wrote:

>
> > You can set GI number with
> > $seq->primary_id($ginumber);
>
> True. You can set it up in the object but it is not shown in the genbank
> dump. And, also the way around, when a genbank record is parsed
> containing the line 'VERSION AB050006.1 GI:26453358' it does not get
> into the Bio::Seq::RichSeq object. As far as I can see.
>
> >
> > If you want to be creating RichSeq instead of Bio::Seq objects (in the
> > event you want to set some fields which are only available in RichSeq
> > objects, initialize your  Bio::SeqIO fasta parser like this:
> >
> > use Bio::SeqIO;
> > use Bio::Seq::SeqFactory;
> > my $seqio = new Bio::SeqIO(-format => 'fasta',
> > 			   -file   => $file,
> > 			   -seqfactory => new Bio::Seq::SeqFactory
> > 			( -type => 'Bio::Seq::RichSeq'));
> That is a really cool one. Never used it like this. Thanks
> ML
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From Marc.Logghe at devgen.com  Fri Oct 17 09:23:42 2003
From: Marc.Logghe at devgen.com (Marc Logghe)
Date: Fri Oct 17 09:21:00 2003
Subject: [Bioperl-l] How do you create a genbank file?
Message-ID: <BEE28BF86078B6429D6C780635718E212E740D@morelia.be.devgen.com>

I must have had huge flies in my eyes: the GI is there indeed, in the object *and* in the genbank dump.
Thanks for your patience, Jason !!
Cheers,
Marc

> -----Original Message-----
> From: Jason Stajich [mailto:jason@cgt.duhs.duke.edu]
> Sent: Friday, October 17, 2003 2:58 PM
> To: Marc Logghe
> Cc: Wes Barris; Bioperl Mailing List
> Subject: RE: [Bioperl-l] How do you create a genbank file?
> 
> 
> The should all be captured when you parse a genbank file (and be
> available when you write a genbank file) that is how we can round trip
> the format and have it be 99% identical.
> 
> gi number goes to primary_id()
> accession number goes to accession_number()
> version goes to seq_version() and version() (not sure why we 
> have 2 methods here???)
> 
>   # From Bio::SeqIO::genbank
>   if( /^ACCESSION\s+(\S.*\S)/ ) {
> 	      push(@acc, split(/\s+/,$1));
> 	      while( defined($_ = $self->_readline) ) {
> 		  /^\s+(.*)/ && do { push (@acc, 
> split(/\s+/,$1)); next };
> 		  last;
> 	      }
> 	      $buffer = $_;
> 	      next;
> 	  }
>   ....
>   elsif( /^VERSION\s+(.+)$/ ) {
> 	      my ($acc,$gi) = split(' ',$1);
> 	      if($acc =~ /^\w+\.(\d+)/) {
> 		  $params{'-version'} = $1;
> 		  $params{'-seq_version'} = $1;
> 	      }
> 	      if($gi && (index($gi,"GI:") == 0)) {
> 		  $params{'-primary_id'} = substr($gi,3);
> 	      }
> 	  }
> 
>    the first item in @acc is passed to the appropriate 
> accession_number
>    slot and the rest are passed to secondard accessions slot.
> 
>    The %params hash gets passed to the appropriate Seq object
> through the factory.  The arguments are passed through the hierarchy
> 
>    SEQ_VERSION is consumed by Seq::RichSeq
>    VERSION     is consumed by Seq::PrimarySeq
>    PRIMARY_ID  is consumed by Seq::PrimarySeq
> 
> -jason
> 
> On Fri, 17 Oct 2003, Marc Logghe wrote:
> 
> >
> > > You can set GI number with
> > > $seq->primary_id($ginumber);
> >
> > True. You can set it up in the object but it is not shown 
> in the genbank
> > dump. And, also the way around, when a genbank record is parsed
> > containing the line 'VERSION AB050006.1 GI:26453358' it does not get
> > into the Bio::Seq::RichSeq object. As far as I can see.
> >
> > >
> > > If you want to be creating RichSeq instead of Bio::Seq 
> objects (in the
> > > event you want to set some fields which are only 
> available in RichSeq
> > > objects, initialize your  Bio::SeqIO fasta parser like this:
> > >
> > > use Bio::SeqIO;
> > > use Bio::Seq::SeqFactory;
> > > my $seqio = new Bio::SeqIO(-format => 'fasta',
> > > 			   -file   => $file,
> > > 			   -seqfactory => new Bio::Seq::SeqFactory
> > > 			( -type => 'Bio::Seq::RichSeq'));
> > That is a really cool one. Never used it like this. Thanks
> > ML
> >
> 
> --
> Jason Stajich
> Duke University
> jason at cgt.mc.duke.edu
> 

From heikki at ebi.ac.uk  Fri Oct 17 09:51:21 2003
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Fri Oct 17 09:48:44 2003
Subject: [Bioperl-l] Codon Table
In-Reply-To: <20B7EB075F2D4542AFFAF813E98ACD9301C00BEB@cl-exsrv1.irad.bbsrc.ac.uk>
References: <20B7EB075F2D4542AFFAF813E98ACD9301C00BEB@cl-exsrv1.irad.bbsrc.ac.uk>
Message-ID: <1066398681.7861.47.camel@localhost>

Mick,

I really have not thought about this possibility. CodonTable can easily
be modified, but there is no way of passing the modified codontable
object to sequence::translate. The translate function has really complex
argument list already, so there is no way we can add anything in there. 

Depending what you want to do, it might be enough to modify the
codontable object and use its translate function. Compare it to
Bio::PrimarySeqI::translate to see what I mean. If you need full
seq::translate functionality, Bio::SeqUtils is the place to put the
code, e.g. $polypept = Bio::SeqUtils::translate($seq, $codontable);

I wrote the following method to be added into Bio::Tools::CodonTable. If
you think it is useful, I can commit it.

Yours,
	-Heikki


=head2 add_table

 Title   : add_table
 Usage   : $newid = $ct->add_table($name, $table, $starts)
 Function: Add a custom Codon Table into the object.
           Know what you are doing, only the length of
           the argument strings is checked!
 Returns : the id of the new codon table
 Args    : name, a string, optional (can be empty)
           table, a string of 64 characters
           startcodons, a string of 64 characters, defaults to standard

=cut

sub add_table {
    my ($self, $name, $table, $starts) = @_;

    $name ||= 'Custom'. scalar @NAMES + 1;
    $starts ||= $STARTS[0]; 
    $self->throw('Suspect input!')
        unless length($table) == 64 and length($starts) == 64;

    push @NAMES, $name;
    push @TABLES, $table;
    push @STARTS, $starts;

    return scalar @NAMES;

}

usage:

$ct_id = $ct->add_table(@custom_table);
$ct->id($ct_id);
$string = $ct->translate($nt_string);


On Tue, 2003-10-14 at 17:03, michael watson (IAH-C) wrote:
> Hi
> 
> I want to use a different codon table than any that are present in CodonTable.pm.  Does Bioperl have a way to implement this?  Or will I have to hack CodonTable.pm?
> 
> Thanks
> Mick
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________

From hyupaik at indiana.edu  Fri Oct 17 14:15:52 2003
From: hyupaik at indiana.edu (Henry Hyun-il Paik)
Date: Fri Oct 17 14:13:10 2003
Subject: [Bioperl-l] (no subject)
In-Reply-To: <Pine.LNX.4.50.0310170851310.25151-100000@tenero.duhs.duke.edu>
Message-ID: <Pine.GSO.3.96.1031017130455.2728D-100000@ariel.ucs.indiana.edu>


 Hello list,

 It is impossible to have a space in Feature key, Right?

 I downlaoded some data from embl cpgisle. They look like below.

---------------------------------------------------------------------------
ID   GAPDHG
AC   J04038;
LE   5378
DE   Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene, complete
cds.
DE   7/95
EX   Gene expression widespread
FT   CpG island      871..1673
FT                   /size=803
FT                   /%(C+G)=69.12
FT                   /Obs/Exp CpG=0.82
FT   CpG island      1683..2063
FT                   /size=381
FT                   /%(C+G)=67.19
FT                   /Obs/Exp CpG=0.77
XX
FT                   /CAAT-box.1="884"
FT                   /CAAT-box.2_complement="2156"
FT                   /GC-box="1064"
FT                   /E2F_CS.1="1785"
FT                   /SpI="158,1198,1244,1290,1310,1314"
FT                   /SpI_complement="174,584,1519,1668,1736,2271"
FT                   /SpI_complement="2625"
FT                   /AccII="717,727,1093,1268,1334,1423"
FT                   /AccII="1489,1531,1788,2006,3650,4278"
//
------------------------------------------------------------------------
 
 I tried to parse this by using SeqIO. It didn't work.

 I got an error message like below.

 -----------------------------------------------------------------------

Argument "island" isn't numeric in numeric gt (>) at
/home/hy1001/bin/Bio/Location/Atomic.pm line 91, <GEN0> line 15.
Argument "island" isn't numeric in numeric gt (>) at
/home/hy1001/bin/Bio/Location/Atomic.pm line 91, <GEN0> line 15.

------------- EXCEPTION  -------------
MSG: Got a sequence with no letters in - cannot guess alphabet []
STACK Bio::PrimarySeq::_guess_alphabet
/home/hy1001/bin/Bio/PrimarySeq.pm:817
STACK Bio::PrimarySeq::seq /home/hy1001/bin/Bio/PrimarySeq.pm:276
STACK Bio::PrimarySeq::new /home/hy1001/bin/Bio/PrimarySeq.pm:214
STACK Bio::Seq::new /home/hy1001/bin/Bio/Seq.pm:498
STACK Bio::Seq::RichSeq::new /home/hy1001/bin/Bio/Seq/RichSeq.pm:115
STACK Bio::Seq::SeqFactory::create
/home/hy1001/bin/Bio/Seq/SeqFactory.pm:126
STACK Bio::SeqIO::embl::next_seq /home/hy1001/bin/Bio/SeqIO/embl.pm:344
STACK toplevel extracted.pl:13

 -----------------------------------------------------------------------

 So I changed 'CpG island' to 'CpG_island'. Then it worked fine.

 I am using perl 5.8.0 and bioperl 1.2.3 on linux.

 Thank you.

 - Henry.

 
From hyupaik at indiana.edu  Fri Oct 17 14:18:37 2003
From: hyupaik at indiana.edu (hyupaik@indiana.edu)
Date: Fri Oct 17 14:15:54 2003
Subject: [Bioperl-l] a space in Feature key in the Feature Table.
Message-ID: <1066414717.3f90327d3f8eb@webmail.iu.edu>


 Hello list,

 It is impossible to have a space in Feature key, Right?

 I downlaoded some data from embl cpgisle. They look like below.

---------------------------------------------------------------------------
ID   GAPDHG
AC   J04038;
LE   5378
DE   Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene, complete
cds.
DE   7/95
EX   Gene expression widespread
FT   CpG island      871..1673
FT                   /size=803
FT                   /%(C+G)=69.12
FT                   /Obs/Exp CpG=0.82
FT   CpG island      1683..2063
FT                   /size=381
FT                   /%(C+G)=67.19
FT                   /Obs/Exp CpG=0.77
XX
FT                   /CAAT-box.1="884"
FT                   /CAAT-box.2_complement="2156"
FT                   /GC-box="1064"
FT                   /E2F_CS.1="1785"
FT                   /SpI="158,1198,1244,1290,1310,1314"
FT                   /SpI_complement="174,584,1519,1668,1736,2271"
FT                   /SpI_complement="2625"
FT                   /AccII="717,727,1093,1268,1334,1423"
FT                   /AccII="1489,1531,1788,2006,3650,4278"
//
------------------------------------------------------------------------
 
 I tried to parse this by using SeqIO. It didn't work.

 I got an error message like below.

 -----------------------------------------------------------------------

Argument "island" isn't numeric in numeric gt (>) at
/home/hy1001/bin/Bio/Location/Atomic.pm line 91, <GEN0> line 15.
Argument "island" isn't numeric in numeric gt (>) at
/home/hy1001/bin/Bio/Location/Atomic.pm line 91, <GEN0> line 15.

------------- EXCEPTION  -------------
MSG: Got a sequence with no letters in - cannot guess alphabet []
STACK Bio::PrimarySeq::_guess_alphabet
/home/hy1001/bin/Bio/PrimarySeq.pm:817
STACK Bio::PrimarySeq::seq /home/hy1001/bin/Bio/PrimarySeq.pm:276
STACK Bio::PrimarySeq::new /home/hy1001/bin/Bio/PrimarySeq.pm:214
STACK Bio::Seq::new /home/hy1001/bin/Bio/Seq.pm:498
STACK Bio::Seq::RichSeq::new /home/hy1001/bin/Bio/Seq/RichSeq.pm:115
STACK Bio::Seq::SeqFactory::create
/home/hy1001/bin/Bio/Seq/SeqFactory.pm:126
STACK Bio::SeqIO::embl::next_seq /home/hy1001/bin/Bio/SeqIO/embl.pm:344
STACK toplevel extracted.pl:13

 -----------------------------------------------------------------------

 So I changed 'CpG island' to 'CpG_island'. Then it worked fine.

 I am using perl 5.8.0 and bioperl 1.2.3 on linux.

 Thank you.

 - Henry.

From birney at ebi.ac.uk  Fri Oct 17 14:31:11 2003
From: birney at ebi.ac.uk (Ewan Birney)
Date: Fri Oct 17 14:28:23 2003
Subject: [Bioperl-l] (no subject)
In-Reply-To: <Pine.GSO.3.96.1031017130455.2728D-100000@ariel.ucs.indiana.edu>
Message-ID: <15052214-00D0-11D8-919D-000393CBD5AE@ebi.ac.uk>


On Friday, October 17, 2003, at 07:15  pm, Henry Hyun-il Paik wrote:

>
>  Hello list,
>
>  It is impossible to have a space in Feature key, Right?
>
>  I downlaoded some data from embl cpgisle. They look like below.
>

I don't think you are allowed spaces. Where did you get this from?


> ----------------------------------------------------------------------- 
> ----
> ID   GAPDHG
> AC   J04038;
> LE   5378
> DE   Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene,  
> complete
> cds.
> DE   7/95
> EX   Gene expression widespread
> FT   CpG island      871..1673
> FT                   /size=803
> FT                   /%(C+G)=69.12
> FT                   /Obs/Exp CpG=0.82
> FT   CpG island      1683..2063
> FT                   /size=381
> FT                   /%(C+G)=67.19
> FT                   /Obs/Exp CpG=0.77
> XX
> FT                   /CAAT-box.1="884"
> FT                   /CAAT-box.2_complement="2156"
> FT                   /GC-box="1064"
> FT                   /E2F_CS.1="1785"
> FT                   /SpI="158,1198,1244,1290,1310,1314"
> FT                   /SpI_complement="174,584,1519,1668,1736,2271"
> FT                   /SpI_complement="2625"
> FT                   /AccII="717,727,1093,1268,1334,1423"
> FT                   /AccII="1489,1531,1788,2006,3650,4278"
> //
> ----------------------------------------------------------------------- 
> -
>
>  I tried to parse this by using SeqIO. It didn't work.
>
>  I got an error message like below.
>
>   
> -----------------------------------------------------------------------
>
> Argument "island" isn't numeric in numeric gt (>) at
> /home/hy1001/bin/Bio/Location/Atomic.pm line 91, <GEN0> line 15.
> Argument "island" isn't numeric in numeric gt (>) at
> /home/hy1001/bin/Bio/Location/Atomic.pm line 91, <GEN0> line 15.
>
> ------------- EXCEPTION  -------------
> MSG: Got a sequence with no letters in - cannot guess alphabet []
> STACK Bio::PrimarySeq::_guess_alphabet
> /home/hy1001/bin/Bio/PrimarySeq.pm:817
> STACK Bio::PrimarySeq::seq /home/hy1001/bin/Bio/PrimarySeq.pm:276
> STACK Bio::PrimarySeq::new /home/hy1001/bin/Bio/PrimarySeq.pm:214
> STACK Bio::Seq::new /home/hy1001/bin/Bio/Seq.pm:498
> STACK Bio::Seq::RichSeq::new /home/hy1001/bin/Bio/Seq/RichSeq.pm:115
> STACK Bio::Seq::SeqFactory::create
> /home/hy1001/bin/Bio/Seq/SeqFactory.pm:126
> STACK Bio::SeqIO::embl::next_seq /home/hy1001/bin/Bio/SeqIO/embl.pm:344
> STACK toplevel extracted.pl:13
>
>   
> -----------------------------------------------------------------------
>
>  So I changed 'CpG island' to 'CpG_island'. Then it worked fine.
>
>  I am using perl 5.8.0 and bioperl 1.2.3 on linux.
>
>  Thank you.
>
>  - Henry.
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From hyupaik at indiana.edu  Fri Oct 17 16:03:15 2003
From: hyupaik at indiana.edu (Henry Hyun-il Paik)
Date: Fri Oct 17 16:00:41 2003
Subject: [Bioperl-l] a space in Feature key
In-Reply-To: <15052214-00D0-11D8-919D-000393CBD5AE@ebi.ac.uk>
Message-ID: <Pine.GSO.3.96.1031017145905.17907B-100000@ariel.ucs.indiana.edu>


 Hello Ewan,

 I downloaded data from 

	ftp://ftp.ebi.ac.uk/pub/databases/cpgisle/

 the file name is cpgisle.dat

 - Henry

On Fri, 17 Oct 2003, Ewan Birney wrote:

> 
> On Friday, October 17, 2003, at 07:15  pm, Henry Hyun-il Paik wrote:
> 
> >
> >  Hello list,
> >
> >  It is impossible to have a space in Feature key, Right?
> >
> >  I downlaoded some data from embl cpgisle. They look like below.
> >
> 
> I don't think you are allowed spaces. Where did you get this from?
> 
> 
> 
> > ----------------------------------------------------------------------- 
> > ----
> > ID   GAPDHG
> > AC   J04038;
> > LE   5378
> > DE   Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene,  
> > complete
> > cds.
> > DE   7/95
> > EX   Gene expression widespread
> > FT   CpG island      871..1673
> > FT                   /size=803
> > FT                   /%(C+G)=69.12
> > FT                   /Obs/Exp CpG=0.82
> > FT   CpG island      1683..2063
> > FT                   /size=381
> > FT                   /%(C+G)=67.19
> > FT                   /Obs/Exp CpG=0.77
> > XX
> > FT                   /CAAT-box.1="884"
> > FT                   /CAAT-box.2_complement="2156"
> > FT                   /GC-box="1064"
> > FT                   /E2F_CS.1="1785"
> > FT                   /SpI="158,1198,1244,1290,1310,1314"
> > FT                   /SpI_complement="174,584,1519,1668,1736,2271"
> > FT                   /SpI_complement="2625"
> > FT                   /AccII="717,727,1093,1268,1334,1423"
> > FT                   /AccII="1489,1531,1788,2006,3650,4278"
> > //
> > ----------------------------------------------------------------------- 
> > -
> >
> >  I tried to parse this by using SeqIO. It didn't work.
> >
> >  I got an error message like below.
> >
> >   
> > -----------------------------------------------------------------------
> >
> > Argument "island" isn't numeric in numeric gt (>) at
> > /home/hy1001/bin/Bio/Location/Atomic.pm line 91, <GEN0> line 15.
> > Argument "island" isn't numeric in numeric gt (>) at
> > /home/hy1001/bin/Bio/Location/Atomic.pm line 91, <GEN0> line 15.
> >
> > ------------- EXCEPTION  -------------
> > MSG: Got a sequence with no letters in - cannot guess alphabet []
> > STACK Bio::PrimarySeq::_guess_alphabet
> > /home/hy1001/bin/Bio/PrimarySeq.pm:817
> > STACK Bio::PrimarySeq::seq /home/hy1001/bin/Bio/PrimarySeq.pm:276
> > STACK Bio::PrimarySeq::new /home/hy1001/bin/Bio/PrimarySeq.pm:214
> > STACK Bio::Seq::new /home/hy1001/bin/Bio/Seq.pm:498
> > STACK Bio::Seq::RichSeq::new /home/hy1001/bin/Bio/Seq/RichSeq.pm:115
> > STACK Bio::Seq::SeqFactory::create
> > /home/hy1001/bin/Bio/Seq/SeqFactory.pm:126
> > STACK Bio::SeqIO::embl::next_seq /home/hy1001/bin/Bio/SeqIO/embl.pm:344
> > STACK toplevel extracted.pl:13
> >
> >   
> > -----------------------------------------------------------------------
> >
> >  So I changed 'CpG island' to 'CpG_island'. Then it worked fine.
> >
> >  I am using perl 5.8.0 and bioperl 1.2.3 on linux.
> >
> >  Thank you.
> >
> >  - Henry.
> >
> >
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
> 
> 

From muratet at hiwaay.net  Fri Oct 17 20:03:47 2003
From: muratet at hiwaay.net (Michael Muratet)
Date: Fri Oct 17 18:32:28 2003
Subject: [Bioperl-l] 'Suspicious' headers
Message-ID: <20031017190347.139d2cb9.muratet@hiwaay.net>

Greetings

I sent an email to the list regarding modules in bioperl that would
format data for phylip (other than simple file format changes) from a
Windows machine and it bounced to the moderator for a 'suspicious
header'. Can anyone tell me what was suspicious, so I can fix it?

Regarding the original question: I'm trying to format data from a
clustalw alignment for use in a phylogeny program. There must be a
simple way to clip the ends of the sequences so they're all the same
length, pick a region without gaps, etc., but I can't find it in the
documentation. Does anybody have any suggestions?

Thanks

Mike
From cvelasc at gwdg.de  Wed Oct 15 08:52:32 2003
From: cvelasc at gwdg.de (velasco)
Date: Fri Oct 17 18:36:14 2003
Subject: [Bioperl-l] (no subject)
Message-ID: <000001c3931b$33eebd70$277b4c86@helsinki>

To whom it concern
 
I'm strongly interested in bioperl. I have not any bioinformatics
experience and I would like to start with bioperl.
I have already installed Perl (ActiveState Perl 5.6) on Windows. I have
already downloaded the bioperl modules by Perl Package Maneger and from
http://bioperl.org/Core/external.shtml . I have read many tutorials
available on internet, but I cannot access to the bioperl programs.
Maybe I miss some important details. I hope to get help.
 
Best Regards
 
 
Velasco Cimica
velasco.cimica@virgilio.it
cvelasc@gwdg.de
+49/551/396365 (Lab)
 
Clinic of Goettingen
Departement of Gastroenterology 
2.D4 675 
Roberth Koch Str 40
37075 Goettingen
Germany
 
From mike.muratet at torchtechnologies.com  Fri Oct 17 09:38:31 2003
From: mike.muratet at torchtechnologies.com (Mike Muratet)
Date: Fri Oct 17 18:36:37 2003
Subject: [Bioperl-l] Phylip format for phylogeny
Message-ID: <016801c394b3$f55ebf70$5301a8c0@TORCHQR1Q6JDEG>

Greetings

In the "there must be a way to do this" department, I am working on a phylogeny project, and I would like to take clustalw output, find the common aligned region (i.e., the region without gaps) write it out in phylip format and use it in one or more of the available phylogeny tools. Converting clustalw to phylip format is trivial, but is there a way to find and extract a region from all of the sequences in an alignment? (I can't find it in the documentation.)

Thanks

Mike
From dag at sonsorol.org  Fri Oct 17 19:09:49 2003
From: dag at sonsorol.org (chris dagdigian)
Date: Fri Oct 17 19:08:17 2003
Subject: [Bioperl-l] 'Suspicious' headers
In-Reply-To: <20031017190347.139d2cb9.muratet@hiwaay.net>
References: <20031017190347.139d2cb9.muratet@hiwaay.net>
Message-ID: <3F9076BD.5060203@sonsorol.org>


For spam and virus reasons just about any email sent to our mailing 
lists that _is not_ written in plain ASCII text will be caught and held 
for moderation (one of our admin volunteers looks at the message and 
makes an approve/discard decision).

These are the most common scenarios for people getting their messages 
held up:

  o Email written in HTML (ie Outlook users)
  o Email with any sort of attachment
  o Any sort of MIME-encoded email, inline or attachment
  o Odd character encoding

We do quite a bit of behind the scenes mail filtering and we still can't 
stop all of the spam from leaking through. Right now any incoming email 
that has a spamassasin score of >= 8.0 is deleted pretty much instantly 
at the server level.

Messages that pass that first step get to our Mailman mailing list 
software which will also 'hold for moderation' any messages that have 
odd encoding or attachment issues.

-Chris


Michael Muratet wrote:

> Greetings
> 
> I sent an email to the list regarding modules in bioperl that would
> format data for phylip (other than simple file format changes) from a
> Windows machine and it bounced to the moderator for a 'suspicious
> header'. Can anyone tell me what was suspicious, so I can fix it?
> 
> Regarding the original question: I'm trying to format data from a
> clustalw alignment for use in a phylogeny program. There must be a
> simple way to clip the ends of the sequences so they're all the same
> length, pick a region without gaps, etc., but I can't find it in the
> documentation. Does anybody have any suggestions?
> 
> Thanks
> 
> Mike
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
Chris Dagdigian, <dag@sonsorol.org>
Independent life science IT & informatics consulting
Office: 617-666-6454, Mobile: 617-877-5498, Fax: 425-699-0193
PGP KeyID: 83D4310E Yahoo IM: craffi Web: http://bioteam.net

From brian_osborne at cognia.com  Fri Oct 17 09:11:19 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Fri Oct 17 21:09:37 2003
Subject: [Bioperl-l] (no subject)
In-Reply-To: <000001c3931b$33eebd70$277b4c86@helsinki>
Message-ID: <GAEDKMGOKFBLJPKCLKCCKEEIDDAA.brian_osborne@cognia.com>

Velasco,

Please send us an example of the code you've written and the error messages
that you're seeing.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of velasco
Sent: Wednesday, October 15, 2003 8:53 AM
To: bioperl-l@bioperl.org
Subject: [Bioperl-l] (no subject)

To whom it concern

I'm strongly interested in bioperl. I have not any bioinformatics
experience and I would like to start with bioperl.
I have already installed Perl (ActiveState Perl 5.6) on Windows. I have
already downloaded the bioperl modules by Perl Package Maneger and from
http://bioperl.org/Core/external.shtml . I have read many tutorials
available on internet, but I cannot access to the bioperl programs.
Maybe I miss some important details. I hope to get help.

Best Regards


Velasco Cimica
velasco.cimica@virgilio.it
cvelasc@gwdg.de
+49/551/396365 (Lab)

Clinic of Goettingen
Departement of Gastroenterology
2.D4 675
Roberth Koch Str 40
37075 Goettingen
Germany


From kvddrift at earthlink.net  Sat Oct 18 15:05:57 2003
From: kvddrift at earthlink.net (Koen van der Drift)
Date: Sat Oct 18 15:03:25 2003
Subject: [Bioperl-l] some tests fail on Mac OS X and Bioperl 1.302
Message-ID: <1AB133AF-019E-11D8-BA0B-003065A5FDCC@earthlink.net>

Hi,

The Bioperl install of 1.3 (release October 17) gave some errors during
the test phase. I am using perl 5.8.0 on Mac OS X 10.2.8.

Most errors came from the fact that the test was done while my Mac
was off-line (I have dialup). All these tests passed with version 1.2.3
Is it now necessary to be online for the tests? I didn't see anything
about this in the docs.

These tests gave errors while online:

in DBCUTG.t :

# Test 9 got: '10.06' (t/DBCUTG.t at line 67)
#   Expected: '10.07'

# Test 14 got: '10.06' (t/DBCUTG.t at line 75)
#    Expected: '10.07'


in psm.t:

# Failed test 7 in t/psm.t at line 39

# Test 25 got: '1' (t/psm.t at line 89)
#    Expected: '0'
a is 13

# Test 29 got: 'NNNNNNNNNNNNN' (t/psm.t at line 102)
#    Expected: 'VVDCAGGTGBYD'

# Failed test 34 in t/psm.t at line 117


thanks,

- Koen.

From redwards at utmem.edu  Sat Oct 18 21:34:43 2003
From: redwards at utmem.edu (Rob Edwards)
Date: Sat Oct 18 21:31:58 2003
Subject: [Bioperl-l] 1.3.02 failed tests
In-Reply-To: <1065025847.2553.112.camel@localhost>
Message-ID: <6A819219-01D4-11D8-9995-000A959E1622@utmem.edu>

Building bioperl-1.302 on Mac OSX with perl 5.8.0

Failed Test        Stat Wstat Total Fail  Failed  List of Failed
------------------------------------------------------------------------ 
-------
t/Coalescent.t                   11    1   9.09%  9
t/DBCUTG.t                       23    2   8.70%  9 14
t/psm.t                          41    4   9.76%  7 25 29 34
t/simpleGOparser.t               88    1   1.14%  11
27 subtests skipped.
Failed 4/174 test scripts, 97.70% okay. 8/7953 subtests failed, 99.90%  
okay.

Building bioperl-1.302 on Debian linux with perl 5.8.0

Failed Test        Stat Wstat Total Fail  Failed  List of Failed
------------------------------------------------------------------------ 
-------
t/DBCUTG.t                       23    2   8.70%  9 14
t/psm.t                          41    4   9.76%  7 25 29 34
t/simpleGOparser.t               88    1   1.14%  11
27 subtests skipped.
Failed 3/174 test scripts, 98.28% okay. 7/7953 subtests failed, 99.91%  
okay.
make: *** [test_dynamic] Error 29

Note that t/Coalescent.t  only fails on OS X.

Rob

From redwards at utmem.edu  Sat Oct 18 22:52:47 2003
From: redwards at utmem.edu (Rob Edwards)
Date: Sat Oct 18 22:50:02 2003
Subject: [Bioperl-l] New modules
Message-ID: <5204FDB8-01DF-11D8-9995-000A959E1622@utmem.edu>

I have written a couple of modules that are bioperl-ish. They are by no 
means finished, polished objects, and there is not t/ scripts at the 
moment. (But there are some docs!).

Bio::Tools::RepeatFinder for finding direct and indirect, perfect, and 
imperfect repeats in DNA sequences.

It is not necessarily speed-optimized, but it works well for sequences 
upto several hundred kb. The imperfect repeats part of it (a) just 
joins other repeats that it finds, so there is a limitation on the 
minimum length of imperfect repeats (2n+1 where n is the user-defined 
length of one half of a repeat); (b) in the current implementation 
requires Tie::RefHash; (c) the distance between repeats can be 
specified; and (d) this really slows things down because all the 
repeats are compared to all other repeats. You don't have to calculate 
these though, if you are only looking for perfect repeats.

Bio::SearchIO::Blink is a parser for BLINK from NCBI

Blink is NCBI's Blast Link - precalculated BLAST searches for every 
protein in the NR database (I believe). Unfortunately these reports 
don't contain useful things like starts and stops of matches so at the 
moment this doesn't return a very bioperl-ish result, but I am hoping 
that as NCBI develop BLINK this module can evolve into something. Its 
useful as is, though, depending on what you need.

The modules are available from http://www.salmonella.org/bioperl/

Take a look and let me know what you think or if you find bugs.

Rob

From brian_osborne at cognia.com  Sun Oct 19 02:01:56 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Sun Oct 19 14:00:35 2003
Subject: [Bioperl-l] New modules
In-Reply-To: <5204FDB8-01DF-11D8-9995-000A959E1622@utmem.edu>
Message-ID: <GAEDKMGOKFBLJPKCLKCCOEEKDDAA.brian_osborne@cognia.com>

Rob,

Can the repeats overlap? There was a recent feature request for something
like this...

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Rob Edwards
Sent: Saturday, October 18, 2003 10:53 PM
To: Bioperl
Subject: [Bioperl-l] New modules

I have written a couple of modules that are bioperl-ish. They are by no
means finished, polished objects, and there is not t/ scripts at the
moment. (But there are some docs!).

Bio::Tools::RepeatFinder for finding direct and indirect, perfect, and
imperfect repeats in DNA sequences.

It is not necessarily speed-optimized, but it works well for sequences
upto several hundred kb. The imperfect repeats part of it (a) just
joins other repeats that it finds, so there is a limitation on the
minimum length of imperfect repeats (2n+1 where n is the user-defined
length of one half of a repeat); (b) in the current implementation
requires Tie::RefHash; (c) the distance between repeats can be
specified; and (d) this really slows things down because all the
repeats are compared to all other repeats. You don't have to calculate
these though, if you are only looking for perfect repeats.

Bio::SearchIO::Blink is a parser for BLINK from NCBI

Blink is NCBI's Blast Link - precalculated BLAST searches for every
protein in the NR database (I believe). Unfortunately these reports
don't contain useful things like starts and stops of matches so at the
moment this doesn't return a very bioperl-ish result, but I am hoping
that as NCBI develop BLINK this module can evolve into something. Its
useful as is, though, depending on what you need.

The modules are available from http://www.salmonella.org/bioperl/

Take a look and let me know what you think or if you find bugs.

Rob

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From redwards at utmem.edu  Sun Oct 19 14:25:05 2003
From: redwards at utmem.edu (Rob Edwards)
Date: Sun Oct 19 14:22:20 2003
Subject: [Bioperl-l] New modules
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCOEEKDDAA.brian_osborne@cognia.com>
Message-ID: <8FC39619-0261-11D8-9995-000A959E1622@utmem.edu>


On Sunday, October 19, 2003, at 01:01  AM, Brian Osborne wrote:

> Rob,
>
> Can the repeats overlap? There was a recent feature request for 
> something
> like this...
>
> Brian O.
>
>


Yep, the repeats can and do overlap. I suppose I could add something to 
not allow this, but I haven't yet.

there is a known "issue" that if you have repeats like this

GGAAAAAAAGG  and CCAAAAAAATTAAAAAAACC

and ask it to merge the nearby repeats the left half will have length 7 
and the right half will have length 16. I just keep forgetting to add 
something to make sure the left and right halves of the repeats are the 
same length. (Note: this is only with merged repeats; if you don't 
merge things together you'll get two separate repeats).

The most recent queries I remember about repeats had to do with 
repeatmasking sequences. I haven't given this much thought but I don't 
see why you couldn't use this module for that, but I'd set the minimum 
lengths of the repeats to be quite long to start with otherwise it will 
be horribly slow. The speed is really determined by the length of the 
sequence and the length of the repeats (obviously-a short sequence will 
have fewer repeats, and longer repeats will occur less frequently).

What was the other request ? I couldn't find anything in a search 
through the mailing archive.

Rob

From letondal at pasteur.fr  Sun Oct 19 17:11:16 2003
From: letondal at pasteur.fr (Catherine Letondal)
Date: Sun Oct 19 17:08:31 2003
Subject: [Bioperl-l] Pise: control on submissions
Message-ID: <20031019231116.A219070@electre.pasteur.fr>

Hi,

This is just an information for those of you who use the bioperl Pise
API. It's about a new type of error that can happen
when running programs under Pise: recently, we had difficulties to
cope with the growing number of requests (mainly from interactive
users) and I had to add a limit on the number of submissions - even
queued ones. So, it's really a good idea to check for the job
status when using the bioperl Pise API, for the server may be too busy
and reject the request.

my $job = $fastdnaml->run(-alignment => $ARGV[0]);
if ($job->error) {
    print $job->error_message, "\n";
    # exit or whatever
}

In this case, a possibility is to use an alternative Pise servers (such as
kun.homelinux.com - which is faster, see examples).

-- 
Catherine Letondal -- Pasteur Institute Computing Center
From jason at cgt.duhs.duke.edu  Sun Oct 19 19:56:54 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Sun Oct 19 19:53:54 2003
Subject: [Bioperl-l] Phylip format for phylogeny
In-Reply-To: <016801c394b3$f55ebf70$5301a8c0@TORCHQR1Q6JDEG>
References: <016801c394b3$f55ebf70$5301a8c0@TORCHQR1Q6JDEG>
Message-ID: <Pine.LNX.4.50.0310191858240.19166-100000@tenero.duhs.duke.edu>

You'd need to write something that would determine where the cutoffs of
your sub-alignment need to be if you are talking about a contiguous slice
of the alignment you want to preserve.

I have something I wrote a long time ago which basically walked out from
the start and end of the alignment and identify the part I want to
include.  This might be something like setting a cutoff % of sequences in
a column which are gapped and then finding either a single or multiple
regions which were of a minimum length and met the gap restriction cutoff.
This seemed to work for my needs - it reduced some of the noise for later
analyses but it definitely an alg that needs tweaking for a specific
question.

Anyways, you can get a slice of contiguous columns with
$aln->slice($start,$end)

There are also cmds to get non-contiguous columns but I'm not sure if that
is what you want.

-jason

On Fri, 17 Oct 2003, Mike Muratet wrote:

> Greetings
>
> In the "there must be a way to do this" department, I am working on a
> phylogeny project, and I would like to take clustalw output, find the
> common aligned region (i.e., the region without gaps) write it out in
> phylip format and use it in one or more of the available phylogeny
> tools. Converting clustalw to phylip format is trivial, but is there a
> way to find and extract a region from all of the sequences in an
> alignment? (I can't find it in the documentation.)
>
> Thanks
>
> Mike

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From copley at well.ox.ac.uk  Mon Oct 20 04:30:49 2003
From: copley at well.ox.ac.uk (Richard Copley)
Date: Mon Oct 20 04:28:02 2003
Subject: [Bioperl-l] Phylip format for phylogeny
In-Reply-To: <016801c394b3$f55ebf70$5301a8c0@TORCHQR1Q6JDEG>
References: <016801c394b3$f55ebf70$5301a8c0@TORCHQR1Q6JDEG>
Message-ID: <3F939D39.1010601@well.ox.ac.uk>


> In the "there must be a way to do this" department, I am working on a phylogeny project, and I would like to take clustalw output, find the common aligned region (i.e., the region without gaps) write it out in phylip format and use it in one or more of the available phylogeny tools. Converting clustalw to phylip format is trivial, but is there a way to find and extract a region from all of the sequences in an alignment? (I can't find it in the documentation.)

Jose Castresana's Gblocks program is a useful way to extract well 
aligned blocks for phylogenetic analysis.

http://monstre1.imim.es/~castresa/Gblocks/Gblocks.html


-- 
Richard Copley      | Wellcome Trust Centre for Human Genetics
tel +44 1865 287597 | Roosevelt Drive
fax +44 1865 287664 | Oxford OX3 7BN, England

From brian_osborne at cognia.com  Sun Oct 19 20:01:13 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Mon Oct 20 07:59:30 2003
Subject: [Bioperl-l] Bio::DB::Registry
In-Reply-To: <web-14573023@uni-freiburg.de>
Message-ID: <GAEDKMGOKFBLJPKCLKCCKEENDDAA.brian_osborne@cognia.com>

Daniel,

I've made a change that fixes the OBDA/Registry system for me, but only when
I create "flat" indices with bioflat_index.pl, not "bdb". Now I'll take a
look at why "bdb" doesn't work, it has to do with the code's inability to
direct the search properly to BinarySearch.pm or BDB.pm, I think.
Anotherwords, using Bio::DB::Flat directly using "flat" (or "binarysearch")
or "bdb" works, but when you use Registry with "bdb" the search is
incorrectly directed to BinarySearch.

If you'd like to try this then get the latest Bioperl from CVS. I don't
think there's anything wrong with your config.dat, and no, I didn't make any
changes to bioflat_index.pl.


Brian O.

-----Original Message-----
From: Daniel Lang [mailto:daniel.lang@biologie.uni-freiburg.de]
Sent: Friday, October 17, 2003 3:29 PM
To: Brian Osborne
Subject: Re: [Bioperl-l] Bio::DB::Registry

Hi Brian,
No, I?ve tried it with the enviroment variable using both
methods(flat and bsd), but the error remains:(
I?v posted my config.dat to the list(Wed Oct 15 10:52:39
EDT 2003 ), maybe there is something wrong?
As I?ve mentioned, I?ve downladed the bioflat_index.pl
seperately from the attic...
I see you moved it back to examples, did you change
anything?

Thanks for your help!
Enjoy your weekend:)
Daniel


From brian_osborne at cognia.com  Sun Oct 19 20:58:39 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Mon Oct 20 08:56:56 2003
Subject: [Bioperl-l] New modules
In-Reply-To: <8FC39619-0261-11D8-9995-000A959E1622@utmem.edu>
Message-ID: <GAEDKMGOKFBLJPKCLKCCIEEODDAA.brian_osborne@cognia.com>

Rob,

>What was the other request ? I couldn't find anything in a search
>through the mailing archive.

Perhaps not exactly what you've done, it was a request to make SeqWords
collect overlapping matches.

Brian O.

-----Original Message-----
From: Rob Edwards [mailto:redwards@utmem.edu]
Sent: Sunday, October 19, 2003 2:25 PM
To: Brian Osborne
Cc: Bioperl
Subject: Re: [Bioperl-l] New modules


On Sunday, October 19, 2003, at 01:01  AM, Brian Osborne wrote:

> Rob,
>
> Can the repeats overlap? There was a recent feature request for
> something
> like this...
>
> Brian O.
>
>


Yep, the repeats can and do overlap. I suppose I could add something to
not allow this, but I haven't yet.

there is a known "issue" that if you have repeats like this

GGAAAAAAAGG  and CCAAAAAAATTAAAAAAACC

and ask it to merge the nearby repeats the left half will have length 7
and the right half will have length 16. I just keep forgetting to add
something to make sure the left and right halves of the repeats are the
same length. (Note: this is only with merged repeats; if you don't
merge things together you'll get two separate repeats).

The most recent queries I remember about repeats had to do with
repeatmasking sequences. I haven't given this much thought but I don't
see why you couldn't use this module for that, but I'd set the minimum
lengths of the repeats to be quite long to start with otherwise it will
be horribly slow. The speed is really determined by the length of the
sequence and the length of the repeats (obviously-a short sequence will
have fewer repeats, and longer repeats will occur less frequently).

What was the other request ? I couldn't find anything in a search
through the mailing archive.

Rob


From mike.muratet at torchtechnologies.com  Mon Oct 20 13:34:17 2003
From: mike.muratet at torchtechnologies.com (Mike Muratet)
Date: Mon Oct 20 13:30:49 2003
Subject: [Bioperl-l] Sequence from PDB
Message-ID: <01dc01c39730$6461a0d0$5301a8c0@TORCHQR1Q6JDEG>

Greetings

Is there a method associated with the Bio::Structure family that will
extract the sequence (primary structure) from PDB files? I don't see one in
the documentation.

Thanks

Mike

From brian_osborne at cognia.com  Mon Oct 20 01:47:29 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Mon Oct 20 13:46:11 2003
Subject: [Bioperl-l] Sequence from PDB
In-Reply-To: <01dc01c39730$6461a0d0$5301a8c0@TORCHQR1Q6JDEG>
Message-ID: <GAEDKMGOKFBLJPKCLKCCKEFEDDAA.brian_osborne@cognia.com>

Mike,

>From bptutorial:

  $structio = Bio::Structure::IO->new( -file => "1XYZ.pdb");
  $struc = $structio->next_structure; # returns an Entry object
  $pseq = $struc->seqres;             # returns a PrimarySeq object, thus
  $pseq->subseq(1,20);                # returns a sequence string

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Mike Muratet
Sent: Monday, October 20, 2003 1:34 PM
To: bioperl-l@bioperl.org
Subject: [Bioperl-l] Sequence from PDB

Greetings

Is there a method associated with the Bio::Structure family that will
extract the sequence (primary structure) from PDB files? I don't see one in
the documentation.

Thanks

Mike

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From allenday at ucla.edu  Mon Oct 20 21:46:36 2003
From: allenday at ucla.edu (Allen Day)
Date: Mon Oct 20 21:43:47 2003
Subject: [Bioperl-l] Bio::SeqIO::kegg
Message-ID: <Pine.LNX.4.44.0310201839140.3261-100000@sumo.genetics.ucla.edu>

Hi all,

I just added a parser for KEGG sequence records, available from:

ftp://ftp.genome.ad.jp/pub/kegg/genomes/genes/

I've parsed all tags except CODON_USAGE and POSITION into a 
Bio::Seq::RichSeq.  I didn't implement CODON_USAGE because I wasn't sure 
what class would hold it, and also because the codon frequencies can be 
recalculated from the primary sequence.

For the POSITION tag, I didn't implement it because I wasn't sure how to 
deal with the reference sequence and exon positions for a spliced mRNA.  
If someone wants to explain to me how to do this, I'd be happy to 
implement it.  Or if someone wants to DIY, please feel free.

I'm in the process of writing a parser for KGML:

ftp://ftp.genome.ad.jp/pub/kegg/pathways/KGML_v0.3/

to read in KEGG's pathway data, are people interested in representing
metabolic networks in bioperl, or am I better off doing this as a separate
project?  Or better yet, has someone already done this?

-Allen

From birney at ebi.ac.uk  Tue Oct 21 03:54:46 2003
From: birney at ebi.ac.uk (Ewan Birney)
Date: Tue Oct 21 04:43:26 2003
Subject: [Bioperl-l] Bio::SeqIO::kegg
In-Reply-To: <Pine.LNX.4.44.0310201839140.3261-100000@sumo.genetics.ucla.edu>
Message-ID: <D7098785-039B-11D8-9D1E-000393CBD5AE@ebi.ac.uk>


On Tuesday, October 21, 2003, at 02:46  am, Allen Day wrote:

> Hi all,
>
> I just added a parser for KEGG sequence records, available from:
>
> ftp://ftp.genome.ad.jp/pub/kegg/genomes/genes/
>
> I've parsed all tags except CODON_USAGE and POSITION into a
> Bio::Seq::RichSeq.  I didn't implement CODON_USAGE because I wasn't 
> sure
> what class would hold it, and also because the codon frequencies can be
> recalculated from the primary sequence.
>
> For the POSITION tag, I didn't implement it because I wasn't sure how 
> to
> deal with the reference sequence and exon positions for a spliced mRNA.
> If someone wants to explain to me how to do this, I'd be happy to
> implement it.  Or if someone wants to DIY, please feel free.
>
> I'm in the process of writing a parser for KGML:
>
> ftp://ftp.genome.ad.jp/pub/kegg/pathways/KGML_v0.3/
>
> to read in KEGG's pathway data, are people interested in representing
> metabolic networks in bioperl, or am I better off doing this as a 
> separate
> project?  Or better yet, has someone already done this?
>

Allen - whatever happens in Kegg parsing in bioperl, over at GK we'd 
like to
make sure that we can provide similar information and try to round trip
information. Imre Vastrik is the technical lead on GK and possibly
has some ideas about a generic "pathway" object model (GK has a rich
pathway object model which is defined in Protege and then Perl and
Java code run-time bind to the protege-generated and manipulated
datastructures).


> -Allen
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From chauser at duke.edu  Tue Oct 21 13:34:56 2003
From: chauser at duke.edu (Charles Hauser)
Date: Tue Oct 21 16:49:36 2003
Subject: [Bioperl-l] Bio::SeqFeature::Generic
Message-ID: <1066757868.17682.40.camel@pandorina.biology.duke.edu>

Don't know if this is a bug or a problem on my end.

Using Bio::SeqFeature::Generic to generate gff:

	my $feat = new Bio::SeqFeature::Generic();
	$feat->seq_id($seq->id);
	$feat->source_tag('assembly');
	$feat->primary_tag('contig');
	$feat->start('1');
	$feat->end($seq->length);
	$feat->add_tag_value('Contig',$seq->id);
	$out->write_feature($feat);

I end up with 2 extra spaces following the group tag ('Contig
scaffold_1') which results in gbrowse returning: ...landmark not
recognized...

deleting these spaces makes all right with the world.

Is the problem in my code or in  Bio::SeqFeature::Generic?

Charles

-------------- next part --------------
scaffold_1	assembly	contig	1	2159858	.	.	.	Contig scaffold_1  
scaffold_1	ace	match	50886	52846	.	+	.	Target "ACE:20021010.8753.1"  9 687  
scaffold_1	ace	HSP	50886	51245	.	+	.	Target "ACE:20021010.8753.1"  9 368  
scaffold_1	ace	HSP	51578	51703	.	+	.	Target "ACE:20021010.8753.1"  369 494  
scaffold_1	ace	HSP	52280	52345	.	+	.	Target "ACE:20021010.8753.1"  495 560  
scaffold_1	ace	HSP	52719	52834	.	+	.	Target "ACE:20021010.8753.1"  561 676  
scaffold_1	ace	HSP	52836	52846	.	+	.	Target "ACE:20021010.8753.1"  677 687  
scaffold_1	ace	match	2852	3829	.	+	.	Target "ACE:20021010.432.1"  948 1577  
scaffold_1	ace	HSP	2852	2998	.	+	.	Target "ACE:20021010.432.1"  948 1094  
scaffold_1	ace	HSP	3175	3218	.	+	.	Target "ACE:20021010.432.1"  1095 1138  
scaffold_1	ace	HSP	3219	3305	.	+	.	Target "ACE:20021010.432.1"  1140 1226  
scaffold_1	ace	HSP	3479	3829	.	+	.	Target "ACE:20021010.432.1"  1227 1577  
scaffold_1	ace	match	47657	48119	.	+	.	Target "ACE:20021010.1025.1"  5 484  
scaffold_1	ace	HSP	47657	47746	.	+	.	Target "ACE:20021010.1025.1"  5 94  
scaffold_1	ace	HSP	47747	48119	.	+	.	Target "ACE:20021010.1025.1"  112 484  
scaffold_1	ace	match	44560	45221	.	+	.	Target "ACE:20021010.3151.1"  8 669  
scaffold_1	ace	HSP	44560	45221	.	+	.	Target "ACE:20021010.3151.1"  8 669  
scaffold_1	ace	match	47550	50747	.	+	.	Target "ACE:20021010.8359.1"  9 1212  
scaffold_1	ace	HSP	47550	48210	.	+	.	Target "ACE:20021010.8359.1"  9 669  
scaffold_1	ace	HSP	48741	48862	.	+	.	Target "ACE:20021010.8359.1"  670 791  
scaffold_1	ace	HSP	49384	49502	.	+	.	Target "ACE:20021010.8359.1"  792 910  
scaffold_1	ace	HSP	50261	50383	.	+	.	Target "ACE:20021010.8359.1"  911 1033  
scaffold_1	ace	HSP	50569	50747	.	+	.	Target "ACE:20021010.8359.1"  1034 1212  
From hlapp at gnf.org  Tue Oct 21 21:37:40 2003
From: hlapp at gnf.org (Hilmar Lapp)
Date: Tue Oct 21 21:34:34 2003
Subject: [Bioperl-l] 1.3.02 failed tests
In-Reply-To: <6A819219-01D4-11D8-9995-000A959E1622@utmem.edu>
Message-ID: <BBBB2D74.22C0%hlapp@gnf.org>

On 10/18/03 6:34 PM, "Rob Edwards" <redwards@utmem.edu> wrote:

> t/simpleGOparser.t               88    1   1.14%  11

Do these just fail under Mac OSX perl 5.8.x? I can only check for perl
5.6.0.

    -hilmar
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From jason at cgt.duhs.duke.edu  Tue Oct 21 21:57:54 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Oct 21 21:55:03 2003
Subject: [Bioperl-l] 1.3.02 failed tests
In-Reply-To: <BBBB2D74.22C0%hlapp@gnf.org>
References: <BBBB2D74.22C0%hlapp@gnf.org>
Message-ID: <Pine.LNX.4.50.0310212157220.2815-100000@tenero.duhs.duke.edu>

I also get this on perl 5.8.0

not ok 11
# Test 11 got: <UNDEF> (t/simpleGOparser.t at line 82)
#    Expected: 'EC:5.3.99.5'


and perl 5.6.0
not ok 11
# Test 11 got: <UNDEF> (t/simpleGOparser.t at line 82)
#    Expected: 'EC:5.3.99.5'


On Tue, 21 Oct 2003, Hilmar Lapp wrote:

> On 10/18/03 6:34 PM, "Rob Edwards" <redwards@utmem.edu> wrote:
>
> > t/simpleGOparser.t               88    1   1.14%  11
>
> Do these just fail under Mac OSX perl 5.8.x? I can only check for perl
> 5.6.0.
>
>     -hilmar
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From hlapp at gnf.org  Tue Oct 21 22:01:27 2003
From: hlapp at gnf.org (Hilmar Lapp)
Date: Tue Oct 21 21:58:22 2003
Subject: [Bioperl-l] 1.3.02 failed tests
In-Reply-To: <Pine.LNX.4.50.0310212157220.2815-100000@tenero.duhs.duke.edu>
Message-ID: <BBBB3307.22C5%hlapp@gnf.org>

Will check. This used to work; somebody changed the parser lately though.

    -hilmar

On 10/21/03 6:57 PM, "Jason Stajich" <jason@cgt.duhs.duke.edu> wrote:

> I also get this on perl 5.8.0
> 
> not ok 11
> # Test 11 got: <UNDEF> (t/simpleGOparser.t at line 82)
> #    Expected: 'EC:5.3.99.5'
> 
> 
> and perl 5.6.0
> not ok 11
> # Test 11 got: <UNDEF> (t/simpleGOparser.t at line 82)
> #    Expected: 'EC:5.3.99.5'
> 
> 
> On Tue, 21 Oct 2003, Hilmar Lapp wrote:
> 
>> On 10/18/03 6:34 PM, "Rob Edwards" <redwards@utmem.edu> wrote:
>> 
>>> t/simpleGOparser.t               88    1   1.14%  11
>> 
>> Do these just fail under Mac OSX perl 5.8.x? I can only check for perl
>> 5.6.0.
>> 
>>     -hilmar
>> 
> 
> --
> Jason Stajich
> Duke University
> jason at cgt.mc.duke.edu
> 

-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From redwards at utmem.edu  Tue Oct 21 22:28:55 2003
From: redwards at utmem.edu (Rob Edwards)
Date: Tue Oct 21 22:26:08 2003
Subject: [Bioperl-l] 1.3.02 failed tests
In-Reply-To: <Pine.LNX.4.50.0310212157220.2815-100000@tenero.duhs.duke.edu>
Message-ID: <7BB5C6EE-0437-11D8-9995-000A959E1622@utmem.edu>

I'm probably just going to confuse things here:

When I run make test
It fails on OSX/perl 5.8.0 and Debian/perl 5.8.0
It passes on Redhat/perl 5.6.1
($dblinks[0] is undef as Jason got)


However, it passes on all systems if you run 'perl -w 
t/simpleGOparser.t' and $dblinks[0] has the right value.

See, I warned you I'd confuse you.

Rob


On Tuesday, October 21, 2003, at 08:57  PM, Jason Stajich wrote:

> I also get this on perl 5.8.0
>
> not ok 11
> # Test 11 got: <UNDEF> (t/simpleGOparser.t at line 82)
> #    Expected: 'EC:5.3.99.5'
>
>
> and perl 5.6.0
> not ok 11
> # Test 11 got: <UNDEF> (t/simpleGOparser.t at line 82)
> #    Expected: 'EC:5.3.99.5'
>
>
> On Tue, 21 Oct 2003, Hilmar Lapp wrote:
>
>> On 10/18/03 6:34 PM, "Rob Edwards" <redwards@utmem.edu> wrote:
>>
>>> t/simpleGOparser.t               88    1   1.14%  11
>>
>> Do these just fail under Mac OSX perl 5.8.x? I can only check for perl
>> 5.6.0.
>>
>>     -hilmar
>>
>
> --
> Jason Stajich
> Duke University
> jason at cgt.mc.duke.edu

From billym at orionbiosolutions.com  Tue Oct 21 23:57:50 2003
From: billym at orionbiosolutions.com (billym@orionbiosolutions.com)
Date: Wed Oct 22 00:04:27 2003
Subject: [Bioperl-l] Orion BioSolutions Kits, $25 to you with Purchase
Message-ID: <20031022035750.8874.qmail@crm.orionbiosolutions.com>


Introducing Orion BioSolutions Research Kits for Cell and
Molecular Biology. Try our kits and we will give you a gift
of $25, or you can donate your $25 to charity.


Cloning Kits: CloneFinders - High Efficiency Chemically
Competent Ecoli

Protein Expression Kits: ProExpressors - High level inducible
expression in BL21 cells.

FACS Kits: IntraCyte - Intracellular FACS Kits - Superior New
Technology, Not just for lymphocytes, works on many different
cell and tissue types.


Visit our website at: http://www.orionbiosolutions.com to place
your order today.


_________________________________________________________________

You have been recognized as a customer of Life Science Research
Kits. If we have reached you in error, and you are not interested
in our new technologies and promotional offers, please opt out
at: http://www.orionbiosolutions.com/bin/optout
We do not want to invade your privacy. Please read our privacy
policy at: http://www.orionbiosolutions.com

From daniel.lang at biologie.uni-freiburg.de  Wed Oct 22 03:27:40 2003
From: daniel.lang at biologie.uni-freiburg.de (Daniel Lang)
Date: Wed Oct 22 03:24:23 2003
Subject: [Bioperl-l] Bio::DB::Registry
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCKEENDDAA.brian_osborne@cognia.com>
References: <GAEDKMGOKFBLJPKCLKCCKEENDDAA.brian_osborne@cognia.com>
Message-ID: <3F96316C.3080107@biologie.uni-freiburg.de>

Hi Brian,
I hope I?m not getting on your nerves...
Now, there occurs a new problem when using flat indices:(
When I call services on the Registry object, it returns the list of the
available services, but when I want to "get_databases" on one of them I
get a Bio::DB::Failover Object...
I?ve registered a nt (FASTA) and an aa (SWISSPROT) flat file...
Of course, I?m still interested in using the bdb indices:)

Thanks again,
Daniel

-- 
Daniel Lang
University of Freiburg, Plant Biotechnology
Sonnenstr. 5, D-79104 Freiburg
phone: +49 761 203 6988
homepage:  http://www.plant-biotech.net/
e-mail: daniel.lang@biologie.uni-freiburg.de

#################################################
 >REALITY.SYS corrupted: Reboot universe? (Y/N/A)
#################################################

From hlapp at gnf.org  Wed Oct 22 03:56:14 2003
From: hlapp at gnf.org (Hilmar Lapp)
Date: Wed Oct 22 03:53:09 2003
Subject: [Bioperl-l] Bio::SeqIO::kegg
In-Reply-To: <Pine.LNX.4.44.0310201839140.3261-100000@sumo.genetics.ucla.edu>
Message-ID: <35B1A9DE-0465-11D8-B5E7-000A959EB4C4@gnf.org>

t/SeqIO......................ok 48/230"my" variable $annotation masks 
earlier declaration in same scope at blib/lib/Bio/SeqIO/kegg.pm line 
202, <GEN13> line 235

	-hilmar

On Monday, October 20, 2003, at 06:46  PM, Allen Day wrote:

> Hi all,
>
> I just added a parser for KEGG sequence records, available from:
>
> ftp://ftp.genome.ad.jp/pub/kegg/genomes/genes/
>
> I've parsed all tags except CODON_USAGE and POSITION into a
> Bio::Seq::RichSeq.  I didn't implement CODON_USAGE because I wasn't 
> sure
> what class would hold it, and also because the codon frequencies can be
> recalculated from the primary sequence.
>
> For the POSITION tag, I didn't implement it because I wasn't sure how 
> to
> deal with the reference sequence and exon positions for a spliced mRNA.
> If someone wants to explain to me how to do this, I'd be happy to
> implement it.  Or if someone wants to DIY, please feel free.
>
> I'm in the process of writing a parser for KGML:
>
> ftp://ftp.genome.ad.jp/pub/kegg/pathways/KGML_v0.3/
>
> to read in KEGG's pathway data, are people interested in representing
> metabolic networks in bioperl, or am I better off doing this as a 
> separate
> project?  Or better yet, has someone already done this?
>
> -Allen
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From hlapp at gnf.org  Wed Oct 22 04:25:20 2003
From: hlapp at gnf.org (Hilmar Lapp)
Date: Wed Oct 22 04:22:14 2003
Subject: [Bioperl-l] 1.3.02 failed tests
In-Reply-To: <Pine.LNX.4.50.0310212157220.2815-100000@tenero.duhs.duke.edu>
Message-ID: <4609B7E9-0469-11D8-ABB3-000A959EB4C4@gnf.org>

Allen, your recent change to require only 1 digit instead of 3 after 
the colon broke the parser. The change makes the regular expressions 
pick up EC numbers as secondary GO accessions. Did the test quoted 
below not fail on your machine?

I'm going to reverse that change. The MPATH ontology needs to be 
accommodated in a way that doesn't break the parser, possibly by having 
its own.

Also, I should urge you to add test cases for all the ontologies for 
which you found yourself having to adapt the parser. Otherwise the 
ability to parse those too may be abolished silently at any time by 
anyone changing the code.

	-hilmar

On Tuesday, October 21, 2003, at 06:57  PM, Jason Stajich wrote:

> I also get this on perl 5.8.0
>
> not ok 11
> # Test 11 got: <UNDEF> (t/simpleGOparser.t at line 82)
> #    Expected: 'EC:5.3.99.5'
>
>
> and perl 5.6.0
> not ok 11
> # Test 11 got: <UNDEF> (t/simpleGOparser.t at line 82)
> #    Expected: 'EC:5.3.99.5'
>
>
> On Tue, 21 Oct 2003, Hilmar Lapp wrote:
>
>> On 10/18/03 6:34 PM, "Rob Edwards" <redwards@utmem.edu> wrote:
>>
>>> t/simpleGOparser.t               88    1   1.14%  11
>>
>> Do these just fail under Mac OSX perl 5.8.x? I can only check for perl
>> 5.6.0.
>>
>>     -hilmar
>>
>
> --
> Jason Stajich
> Duke University
> jason at cgt.mc.duke.edu
>
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From brian_osborne at cognia.com  Wed Oct 22 08:28:19 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Wed Oct 22 08:27:13 2003
Subject: [Bioperl-l] Bio::DB::Registry
In-Reply-To: <3F96316C.3080107@biologie.uni-freiburg.de>
Message-ID: <GAEDKMGOKFBLJPKCLKCCAEGKDDAA.brian_osborne@cognia.com>

Daniel,

No problem, you're not bugging me. I have both BinarySearch and BDB working
with Registry and DB/Flat on my machine now. I'll take a look at your
get_databases() problem. Then I have to add some BDB/DB_File tests to
Registry.t. I'll tell you when this is all done, probably after the weekend.

What you should do is submit this get_databases() error into the tracking
system at bugzilla.bioperl.org with details, that would help.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Daniel Lang
Sent: Wednesday, October 22, 2003 3:28 AM
To: Brian Osborne
Cc: bioperl-l@bioperl.org
Subject: Re: [Bioperl-l] Bio::DB::Registry

Hi Brian,
I hope I?m not getting on your nerves...
Now, there occurs a new problem when using flat indices:(
When I call services on the Registry object, it returns the list of the
available services, but when I want to "get_databases" on one of them I
get a Bio::DB::Failover Object...
I?ve registered a nt (FASTA) and an aa (SWISSPROT) flat file...
Of course, I?m still interested in using the bdb indices:)

Thanks again,
Daniel

--
Daniel Lang
University of Freiburg, Plant Biotechnology
Sonnenstr. 5, D-79104 Freiburg
phone: +49 761 203 6988
homepage:  http://www.plant-biotech.net/
e-mail: daniel.lang@biologie.uni-freiburg.de

#################################################
 >REALITY.SYS corrupted: Reboot universe? (Y/N/A)
#################################################

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From heikki at ebi.ac.uk  Wed Oct 22 09:11:45 2003
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Wed Oct 22 09:08:58 2003
Subject: Bio::Tools::GFF [Re: [Bioperl-l] Bio::SeqFeature::Generic]
In-Reply-To: <1066757868.17682.40.camel@pandorina.biology.duke.edu>
References: <1066757868.17682.40.camel@pandorina.biology.duke.edu>
Message-ID: <1066828304.2631.18.camel@localhost>

Charles,

Your code should be fine.

I can see the extra spaces in the output by running the
following script as 'gfftest.pl |od'.

The method that needs changing is Bio::Tools::GFF::_gff2_string(), I'd
rather have feedback from GFF people first. Lincoln or someone from his
group should be able to say what is the correct way to solve this
problem.

Yours,
	-Heikki

---------------------gfftest.pl--------------------------------
use Bio::Tools::GFF;
use Bio::SeqFeature::Generic;
use Data::Dumper;

my $gffout = Bio::Tools::GFF->new(-gff_version => 2);

my $feat = new Bio::SeqFeature::Generic();
$feat->seq_id('111');
$feat->source_tag('assembly');
$feat->primary_tag('contig');
$feat->start('1');
$feat->end(123);
$feat->add_tag_value('Contig','111');
$gffout->write_feature($feat);
#print Dumper $feat;
--------------------------------------------------------

On Tue, 2003-10-21 at 18:37, Charles Hauser wrote:
> Don't know if this is a bug or a problem on my end.
> 
> Using Bio::SeqFeature::Generic to generate gff:
> 
> 	my $feat = new Bio::SeqFeature::Generic();
> 	$feat->seq_id($seq->id);
> 	$feat->source_tag('assembly');
> 	$feat->primary_tag('contig');
> 	$feat->start('1');
> 	$feat->end($seq->length);
> 	$feat->add_tag_value('Contig',$seq->id);
> 	$out->write_feature($feat);
> 
> I end up with 2 extra spaces following the group tag ('Contig
> scaffold_1') which results in gbrowse returning: ...landmark not
> recognized...
> 
> deleting these spaces makes all right with the world.
> 
> Is the problem in my code or in  Bio::SeqFeature::Generic?
> 
> Charles
> 
> 
> ______________________________________________________________________
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________

From jason at cgt.duhs.duke.edu  Wed Oct 22 09:39:09 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Oct 22 09:36:36 2003
Subject: Bio::Tools::GFF [Re: [Bioperl-l] Bio::SeqFeature::Generic]
In-Reply-To: <1066828304.2631.18.camel@localhost>
References: <1066757868.17682.40.camel@pandorina.biology.duke.edu>
	<1066828304.2631.18.camel@localhost>
Message-ID: <Pine.LNX.4.50.0310220935430.7276-100000@tenero.duhs.duke.edu>

I've got a fix - this code really needed to be written better.  I'm using
arrays and joins instead of chop-ing strings post-hoc.

Index: Bio/Tools/GFF.pm
===================================================================
RCS file: /home/repository/bioperl/bioperl-live/Bio/Tools/GFF.pm,v
retrieving revision 1.29
diff -r1.29 GFF.pm
570c570
<    my $valuestr;
---
>
571a572
>    my @group;
573,575d573
<        $str .= "\t"; # my interpretation of the GFF2
<                      # specification suggests the need
<                      # for this additional TAB character...??
577,579c575
< 	   my $valuestr; # a string which will hold one or more values
< 	                 # for this tag, with quoted free text and
< 	                 # space-separated individual values.
---
> 	   my @v;
585c581,582
< 		   $value = '"' . $value . '" '} # if the value contains
---
> 		   $value = '"' . $value . '" ';
> 	       }                                 # if the value contains
589c586
< 	       $value = "\"\"" unless defined $value;
---
> 	       $value = '\""' unless defined $value;
593,595c590,591
< 	       $valuestr .=  $value . " "; # with a trailing space in case
< 	                                   # there are multiple values
< 	       # for this tag (allowed in GFF2 and .ace format)
---
> 	       push @v, $value;
> 	       # for this tag (allowed in GFF2 and .ace format)
597c593
< 	   $str .= "$tag $valuestr ; ";	# semicolon delimited with no '='
sign
---
> 	   push @group, "$tag ".join(" ", @v);
599d594
<        chop $str; chop $str;  # remove the trailing semicolon and space
600a596
>    $str .= "\t" . join(' ; ', @group);

-jason

-jason
On Wed, 22 Oct 2003, Heikki Lehvaslaiho wrote:

> Charles,
>
> Your code should be fine.
>
> I can see the extra spaces in the output by running the
> following script as 'gfftest.pl |od'.
>
> The method that needs changing is Bio::Tools::GFF::_gff2_string(), I'd
> rather have feedback from GFF people first. Lincoln or someone from his
> group should be able to say what is the correct way to solve this
> problem.
>
> Yours,
> 	-Heikki
>
> ---------------------gfftest.pl--------------------------------
> use Bio::Tools::GFF;
> use Bio::SeqFeature::Generic;
> use Data::Dumper;
>
> my $gffout = Bio::Tools::GFF->new(-gff_version => 2);
>
> my $feat = new Bio::SeqFeature::Generic();
> $feat->seq_id('111');
> $feat->source_tag('assembly');
> $feat->primary_tag('contig');
> $feat->start('1');
> $feat->end(123);
> $feat->add_tag_value('Contig','111');
> $gffout->write_feature($feat);
> #print Dumper $feat;
> --------------------------------------------------------
>
> On Tue, 2003-10-21 at 18:37, Charles Hauser wrote:
> > Don't know if this is a bug or a problem on my end.
> >
> > Using Bio::SeqFeature::Generic to generate gff:
> >
> > 	my $feat = new Bio::SeqFeature::Generic();
> > 	$feat->seq_id($seq->id);
> > 	$feat->source_tag('assembly');
> > 	$feat->primary_tag('contig');
> > 	$feat->start('1');
> > 	$feat->end($seq->length);
> > 	$feat->add_tag_value('Contig',$seq->id);
> > 	$out->write_feature($feat);
> >
> > I end up with 2 extra spaces following the group tag ('Contig
> > scaffold_1') which results in gbrowse returning: ...landmark not
> > recognized...
> >
> > deleting these spaces makes all right with the world.
> >
> > Is the problem in my code or in  Bio::SeqFeature::Generic?
> >
> > Charles
> >
> >
> > ______________________________________________________________________
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From allenday at ucla.edu  Wed Oct 22 14:40:42 2003
From: allenday at ucla.edu (Allen Day)
Date: Wed Oct 22 14:37:50 2003
Subject: [Bioperl-l] 1.3.02 failed tests
In-Reply-To: <4609B7E9-0469-11D8-ABB3-000A959EB4C4@gnf.org>
Message-ID: <Pine.LNX.4.44.0310221033010.15197-100000@sumo.genetics.ucla.edu>

> Allen, your recent change to require only 1 digit instead of 3 after 
> the colon broke the parser. The change makes the regular expressions 
> pick up EC numbers as secondary GO accessions. Did the test quoted 
> below not fail on your machine?

i tested my changes on Ontology.t and OntologyEngine.t, i didn't see
simpleGOparser.t, and so didn't run it before i checked in.  it does fail 
on my machine.

> I'm going to reverse that change. The MPATH ontology needs to be 
> accommodated in a way that doesn't break the parser, possibly by having 
> its own.

i keep loosening the regular expression to accommodate the different 
DAG-Edit IDs i enounter.  how about i write a regular expression that is 
as strict as possible but still matches everything that DAG-Edit allows?

> Also, I should urge you to add test cases for all the ontologies for 
> which you found yourself having to adapt the parser. Otherwise the 
> ability to parse those too may be abolished silently at any time by 
> anyone changing the code.
> 

sure.

-allen

> 	-hilmar
> 
> On Tuesday, October 21, 2003, at 06:57  PM, Jason Stajich wrote:
> 
> > I also get this on perl 5.8.0
> >
> > not ok 11
> > # Test 11 got: <UNDEF> (t/simpleGOparser.t at line 82)
> > #    Expected: 'EC:5.3.99.5'
> >
> >
> > and perl 5.6.0
> > not ok 11
> > # Test 11 got: <UNDEF> (t/simpleGOparser.t at line 82)
> > #    Expected: 'EC:5.3.99.5'
> >
> >
> > On Tue, 21 Oct 2003, Hilmar Lapp wrote:
> >
> >> On 10/18/03 6:34 PM, "Rob Edwards" <redwards@utmem.edu> wrote:
> >>
> >>> t/simpleGOparser.t               88    1   1.14%  11
> >>
> >> Do these just fail under Mac OSX perl 5.8.x? I can only check for perl
> >> 5.6.0.
> >>
> >>     -hilmar
> >>
> >
> > --
> > Jason Stajich
> > Duke University
> > jason at cgt.mc.duke.edu
> >
> 


From hlapp at gnf.org  Wed Oct 22 15:25:40 2003
From: hlapp at gnf.org (Hilmar Lapp)
Date: Wed Oct 22 15:22:33 2003
Subject: [Bioperl-l] 1.3.02 failed tests
In-Reply-To: <Pine.LNX.4.44.0310221033010.15197-100000@sumo.genetics.ucla.edu>
Message-ID: <BBBC27C4.2301%hlapp@gnf.org>

On 10/22/03 11:40 AM, "Allen Day" <allenday@ucla.edu> wrote:

>> Allen, your recent change to require only 1 digit instead of 3 after
>> the colon broke the parser. The change makes the regular expressions
>> pick up EC numbers as secondary GO accessions. Did the test quoted
>> below not fail on your machine?
> 
> i tested my changes on Ontology.t and OntologyEngine.t, i didn't see
> simpleGOparser.t, and so didn't run it before i checked in.  it does fail
> on my machine.

It's generally not a bad idea to run *all* tests before checking in, as
bioperl is so complex meanwhile that it is difficult to predict with
certainty what knock-on effects a change may or may not have. Yes I know
running all tests takes a while and is painful, but I don't think there is a
good reason not to.

> 
>> I'm going to reverse that change. The MPATH ontology needs to be
>> accommodated in a way that doesn't break the parser, possibly by having
>> its own.
> 
> i keep loosening the regular expression to accommodate the different
> DAG-Edit IDs i enounter.  how about i write a regular expression that is
> as strict as possible but still matches everything that DAG-Edit allows?

Sure, you'd be welcome. Note though that non-conforming or
not-so-well-conforming formats may clearly justify their own parser in order
to avoid making a brittle one-catch-all monolithic parser. Also, complex
regular expressions have a tendency to become arcane and impossible to
maintain, so I'm actually not a big fan of coming up with ever more
sophisticated patterns with all kinds of subtle recognition features.

Rather, I'd try to find a pattern that identifies the token as an ID,
whether seconday ID or dbxref ID, and then leave it up to a method that may
be overridden to actually parse the ID and assign it to one of the term's
properties. (You'd then have 2 parsers, which would share most of the code
though.)

Event-based parsing may also be a way out, but that'd be big change.

    -hilmar

> 
>> Also, I should urge you to add test cases for all the ontologies for
>> which you found yourself having to adapt the parser. Otherwise the
>> ability to parse those too may be abolished silently at any time by
>> anyone changing the code.
>> 
> 
> sure.
> 
> -allen
> 
>> -hilmar
>> 
>> On Tuesday, October 21, 2003, at 06:57  PM, Jason Stajich wrote:
>> 
>>> I also get this on perl 5.8.0
>>> 
>>> not ok 11
>>> # Test 11 got: <UNDEF> (t/simpleGOparser.t at line 82)
>>> #    Expected: 'EC:5.3.99.5'
>>> 
>>> 
>>> and perl 5.6.0
>>> not ok 11
>>> # Test 11 got: <UNDEF> (t/simpleGOparser.t at line 82)
>>> #    Expected: 'EC:5.3.99.5'
>>> 
>>> 
>>> On Tue, 21 Oct 2003, Hilmar Lapp wrote:
>>> 
>>>> On 10/18/03 6:34 PM, "Rob Edwards" <redwards@utmem.edu> wrote:
>>>> 
>>>>> t/simpleGOparser.t               88    1   1.14%  11
>>>> 
>>>> Do these just fail under Mac OSX perl 5.8.x? I can only check for perl
>>>> 5.6.0.
>>>> 
>>>>     -hilmar
>>>> 
>>> 
>>> --
>>> Jason Stajich
>>> Duke University
>>> jason at cgt.mc.duke.edu
>>> 
>> 
> 
> 
> 
> 

-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From ryan.doherty at mrc.ac.za  Thu Oct 23 04:30:15 2003
From: ryan.doherty at mrc.ac.za (Ryan Doherty)
Date: Thu Oct 23 04:24:45 2003
Subject: [Bioperl-l] Retrieve mail
Message-ID: <OAEIIDNNDAKNLOGNGNLEMEOGDFAA.rdoherty@mrc.ac.za>

Hello,

Can anyone recommend a module (either BioPerl module or other Perl module) that
will assist me in retrieving e-mail messages from a POP3 mail-server?

Much appreciated,


Ryan Doherty

From heikki at ebi.ac.uk  Thu Oct 23 04:41:31 2003
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Thu Oct 23 04:38:40 2003
Subject: [Bioperl-l] a space in Feature key
In-Reply-To: <Pine.GSO.3.96.1031017145905.17907B-100000@ariel.ucs.indiana.edu>
References: <Pine.GSO.3.96.1031017145905.17907B-100000@ariel.ucs.indiana.edu>
Message-ID: <1066898490.2226.51.camel@localhost>

Checking with EMBL databank guys, I find out that since feature table 
format is defined in characters, space characters are not specifically
banned in feature keys, but they are actively avoided.

Also, I talked to Rodrigo Lopez who promised to release an updated
CpGIsle database soon (next week?) and remove the offending space.

Yours,

	-Heikki


On Fri, 2003-10-17 at 21:03, Henry Hyun-il Paik wrote:
>  Hello Ewan,
> 
>  I downloaded data from 
> 
> 	ftp://ftp.ebi.ac.uk/pub/databases/cpgisle/
> 
>  the file name is cpgisle.dat
> 
>  - Henry
> 
> On Fri, 17 Oct 2003, Ewan Birney wrote:
> 
> > 
> > On Friday, October 17, 2003, at 07:15  pm, Henry Hyun-il Paik wrote:
> > 
> > >
> > >  Hello list,
> > >
> > >  It is impossible to have a space in Feature key, Right?
> > >
> > >  I downlaoded some data from embl cpgisle. They look like below.
> > >
> > 
> > I don't think you are allowed spaces. Where did you get this from?
> > 
> > 
> > 
> > > ----------------------------------------------------------------------- 
> > > ----
> > > ID   GAPDHG
> > > AC   J04038;
> > > LE   5378
> > > DE   Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene,  
> > > complete
> > > cds.
> > > DE   7/95
> > > EX   Gene expression widespread
> > > FT   CpG island      871..1673
> > > FT                   /size=803
> > > FT                   /%(C+G)=69.12
> > > FT                   /Obs/Exp CpG=0.82
> > > FT   CpG island      1683..2063
> > > FT                   /size=381
> > > FT                   /%(C+G)=67.19
> > > FT                   /Obs/Exp CpG=0.77
> > > XX
> > > FT                   /CAAT-box.1="884"
> > > FT                   /CAAT-box.2_complement="2156"
> > > FT                   /GC-box="1064"
> > > FT                   /E2F_CS.1="1785"
> > > FT                   /SpI="158,1198,1244,1290,1310,1314"
> > > FT                   /SpI_complement="174,584,1519,1668,1736,2271"
> > > FT                   /SpI_complement="2625"
> > > FT                   /AccII="717,727,1093,1268,1334,1423"
> > > FT                   /AccII="1489,1531,1788,2006,3650,4278"
> > > //
> > > ----------------------------------------------------------------------- 
> > > -
> > >
> > >  I tried to parse this by using SeqIO. It didn't work.
> > >
> > >  I got an error message like below.
> > >
> > >   
> > > -----------------------------------------------------------------------
> > >
> > > Argument "island" isn't numeric in numeric gt (>) at
> > > /home/hy1001/bin/Bio/Location/Atomic.pm line 91, <GEN0> line 15.
> > > Argument "island" isn't numeric in numeric gt (>) at
> > > /home/hy1001/bin/Bio/Location/Atomic.pm line 91, <GEN0> line 15.
> > >
> > > ------------- EXCEPTION  -------------
> > > MSG: Got a sequence with no letters in - cannot guess alphabet []
> > > STACK Bio::PrimarySeq::_guess_alphabet
> > > /home/hy1001/bin/Bio/PrimarySeq.pm:817
> > > STACK Bio::PrimarySeq::seq /home/hy1001/bin/Bio/PrimarySeq.pm:276
> > > STACK Bio::PrimarySeq::new /home/hy1001/bin/Bio/PrimarySeq.pm:214
> > > STACK Bio::Seq::new /home/hy1001/bin/Bio/Seq.pm:498
> > > STACK Bio::Seq::RichSeq::new /home/hy1001/bin/Bio/Seq/RichSeq.pm:115
> > > STACK Bio::Seq::SeqFactory::create
> > > /home/hy1001/bin/Bio/Seq/SeqFactory.pm:126
> > > STACK Bio::SeqIO::embl::next_seq /home/hy1001/bin/Bio/SeqIO/embl.pm:344
> > > STACK toplevel extracted.pl:13
> > >
> > >   
> > > -----------------------------------------------------------------------
> > >
> > >  So I changed 'CpG island' to 'CpG_island'. Then it worked fine.
> > >
> > >  I am using perl 5.8.0 and bioperl 1.2.3 on linux.
> > >
> > >  Thank you.
> > >
> > >  - Henry.
> > >
> > >
> > >
> > > _______________________________________________
> > > Bioperl-l mailing list
> > > Bioperl-l@portal.open-bio.org
> > > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> > 
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> > 
> > 
> > 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________

From ak at ebi.ac.uk  Thu Oct 23 04:51:12 2003
From: ak at ebi.ac.uk (Andreas Kahari)
Date: Thu Oct 23 04:48:18 2003
Subject: [offtopic] Re: [Bioperl-l] Retrieve mail
In-Reply-To: <OAEIIDNNDAKNLOGNGNLEMEOGDFAA.rdoherty@mrc.ac.za>
References: <OAEIIDNNDAKNLOGNGNLEMEOGDFAA.rdoherty@mrc.ac.za>
Message-ID: <20031023085112.GA32526@ebi.ac.uk>

On Thu, Oct 23, 2003 at 10:30:15AM +0200, Ryan Doherty wrote:
> Hello,
>
> Can anyone recommend a module (either BioPerl module or other
> Perl module) that will assist me in retrieving e-mail messages
> from a POP3 mail-server?

Search for POP3 at search.cpan.org

-- 
a n d r e (  Andreas K?h?ri                           ) 0 1 0 0 0
a s . k a  ) EMBL, European Bioinformatics Institute (  1 0 0 0 1
h a r i @ (  Wellcome Trust Genome Campus, Hinxton    ) 0 0 1 1 1
e b i . a  ) Cambridge, CB10 1SD                     (  0 0 1 0 0
c . u k   (  United Kingdom                           ) 0 0 0 1
From andreas.boehm at virchow.uni-wuerzburg.de  Thu Oct 23 05:10:48 2003
From: andreas.boehm at virchow.uni-wuerzburg.de (Andreas Boehm)
Date: Thu Oct 23 05:12:44 2003
Subject: [Bioperl-l] modified Fasta-Headers?
In-Reply-To: <OAEIIDNNDAKNLOGNGNLEMEOGDFAA.rdoherty@mrc.ac.za>
References: <OAEIIDNNDAKNLOGNGNLEMEOGDFAA.rdoherty@mrc.ac.za>
Message-ID: <3F979B18.2000302@virchow.uni-wuerzburg.de>

Hello,

analyzing fasta-files distributed with software from thermo, we found 
some undocumented characters in the header. There are some 0x01 (chr(1)) 
as field separator.
Is the meaning of theses chars known?

regards,
Andreas Boehm

From andreas.boehm at virchow.uni-wuerzburg.de  Thu Oct 23 05:16:25 2003
From: andreas.boehm at virchow.uni-wuerzburg.de (Andreas Boehm)
Date: Thu Oct 23 05:18:22 2003
Subject: [Bioperl-l] modified Fasta-Headers?
Message-ID: <3F979C69.3090105@virchow.uni-wuerzburg.de>

Hello,

analyzing fasta-files distributed with software from thermo, we found
some undocumented characters in the header. There are some 0x01 (chr(1))
as field separator.
Is the meaning of theses chars known?

regards,
Andreas Boehm

From Martin.Schenker at ogs.co.uk  Thu Oct 23 05:41:51 2003
From: Martin.Schenker at ogs.co.uk (Martin Schenker)
Date: Thu Oct 23 05:39:01 2003
Subject: [Bioperl-l] modified Fasta-Headers?
Message-ID: <934C1A6D9596D511B59E0002B34BC52A02F1394B@selene.ogs.co.uk>

Hi Andreas,

Are your sure that's not the "normal" nr header with ^A (^A => oct001 =>
\001) seperated chunks from different sources?

Best, Martin

> -----Original Message-----
> From: Andreas Boehm [mailto:andreas.boehm@virchow.uni-wuerzburg.de] 
> Sent: 23 October 2003 10:16
> To: bioperl-l@bioperl.org
> Subject: [Bioperl-l] modified Fasta-Headers?
> 
> 
> Hello,
> 
> analyzing fasta-files distributed with software from thermo, we found
> some undocumented characters in the header. There are some 
> 0x01 (chr(1))
> as field separator.
> Is the meaning of theses chars known?
> 
> regards,
> Andreas Boehm
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 


********************************************************************
The information transmitted by this email is private and
confidential and is intended for the use of the intended 
recipients specified therein.
If you are neither an intended recipient nor an employee 
or agent responsible for delivery to an intended recipient, 
you should be aware that any dissemination, distribution 
or copying of this mail is strictly prohibited.
If this communication was received in error, please 
notify us immediately.
********************************************************************

From andreas.boehm at virchow.uni-wuerzburg.de  Thu Oct 23 06:28:21 2003
From: andreas.boehm at virchow.uni-wuerzburg.de (Andreas Boehm)
Date: Thu Oct 23 06:30:18 2003
Subject: [Bioperl-l] modified Fasta-Headers?
In-Reply-To: <934C1A6D9596D511B59E0002B34BC52A02F1394B@selene.ogs.co.uk>
References: <934C1A6D9596D511B59E0002B34BC52A02F1394B@selene.ogs.co.uk>
Message-ID: <3F97AD45.5070703@virchow.uni-wuerzburg.de>

Hi,

> Are your sure that's not the "normal" nr header with ^A (^A => oct001 =>
> \001) seperated chunks from different sources?

No. Take a look:
 >gi|84670|pir||B27257 coagulogen II precursor - horseshoe crab =
(Tachypleus tridentatus)=01gi|10809 (X04192) coagulogen type II =
[Tachypleus tridentatus]=01gi|217395|gnl|PID|d1000491 (D00077) =
coagulogen type 2 [Tachypleus tridentatus]=01gi|356167|prf||1208319A =
coagulogen [Tachypleus sp.] [MASS=3D21826]
MEKKLFGIALLLTTVASVLAADTNAPICLCDEPGVLGRTQIVTTEIKDKIEKAVEAVAQESGVSGRGFSIFSH=
HPVFREC
GKYECRTVRPEHSRCYNFPPFIHFKSECPVSTRDCEPVFGYTVAGEFRVIVQAPRAGFRQCVWQHKCRFGSNS=
CGYNGRC
TQQRSVVRLVTYNLEKDGFLCESFRTCCGCPCRSF

from horse.fasta

I do not know, what the meaning of these chr(1) should be.
Any help?


regards,
Andreas Boehm

From deejoe at iastate.edu  Thu Oct 23 10:18:16 2003
From: deejoe at iastate.edu (D. Joe Anderson)
Date: Thu Oct 23 10:15:23 2003
Subject: [Bioperl-l] bioperl-pipeline
In-Reply-To: <07CD0901-FCD8-11D7-95C5-000A95783436@stanford.edu>
References: <1065961003.8277.15.camel@rattus.localdomain>
	<07CD0901-FCD8-11D7-95C5-000A95783436@stanford.edu>
Message-ID: <20031023141816.GM23429@iastate.edu>

On Sun, Oct 12, 2003 at 10:18:00AM -0700, Shawn Hoon wrote:
> We are still trying to resolve some dns issues. See if  
> biopipe-tmp.open-bio.org
> or add
> 
> 65.246.187.176 www.biopipe.org
> 
> to /etc/hosts
 
Just for the record, it looks like the DNS resolution problems have been
cleared up.

http://www.biopipe.org now just works (at least for me).


--Joe


From jason at cgt.duhs.duke.edu  Thu Oct 23 11:26:26 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Thu Oct 23 11:23:30 2003
Subject: [Bioperl-l] [Bioperl-guts-l] [Bug 1544] New: Trouble dissecting
 problem Bio::SeqIO (fwd)
Message-ID: <Pine.LNX.4.50.0310231124070.15950-100000@tenero.duhs.duke.edu>

It is just a warning because the tag line doesn't wrap the way the parser
expects it to - it isn't a big deal.  If you don't want to see the
warnings you can set -verbose => -1 when you instantiate a
Bio::DB::GenBank.

my $db = Bio::DB::GenBank(-verbose => -1);


-jason
--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu

---------- Forwarded message ----------
Date: Thu, 23 Oct 2003 09:24:59 -0400
From: bugzilla-daemon@portal.open-bio.org
To: bioperl-guts-l@bioperl.org
Subject: [Bioperl-guts-l] [Bug 1544] New: Trouble dissecting problem
    Bio::SeqIO

http://bugzilla.bioperl.org/show_bug.cgi?id=1544

           Summary: Trouble dissecting problem Bio::SeqIO
           Product: Bioperl
           Version: 1.2 branch
          Platform: PC
               URL: http://www.smd.qmul.ac.uk/gc/homepages/dbuchan/
        OS/Version: Linux
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Bio::SeqIO
        AssignedTo: bioperl-guts-l@bioperl.org
        ReportedBy: d.w.a.buchan@qmul.ac.uk


I assume this is a problem with Bio::SeqIO but may be a problem with
Bio::DB::GenBank (but I doubt it). Currently I'm trying to download a file
~67,000 est from genbank, however upon trying to download and output some of the
records bioperl throws the following error (depending on the est record)

-------------------- WARNING ---------------------
MSG: trouble dissecting
"IPISGKPLPKVTLSRDGVPLKATMRFNTEITAENLTINLKESVTADAGRYEITAANSSGTTKAFINI VVLDRPGPPT
GPVVISDITEESVTLKWEPPKYDGGSQVTNYILLKRETSTAVWTEVSATVARTMMKVMKL ... ;, mRNA
sequence." into chunks of 66 chars or less - this tag won't print right
---------------------------------------------------

or

-------------------- WARNING ---------------------
MSG: trouble dissecting
"TGTTACCAATCTGAAGTGGGAGCGGCCGCATTGCTTTTTTTTTTTTTTTTTTTTTTTT T 3'];
double-stranded cDNA was ligated to Eco RI adaptors (Pharmacia),  digested with
Not I and cloned into the Not I and Eco RI sites of  the modified pT7T3 vector.
Library is normalized, and was constructed  by Bento Soares and M.Fatima
Bonaldo."" into chunks of 57 chars or less - this tag won't print right
---------------------------------------------------

In exchange it prints outputs nothing.  Unfortunately I can't give specific
accession numbers as I'm using the get_stream_by_id() method.  Although I will
gladly foward the file of accessions that I'm using and the script I'm using to
do this.

The perl version I'm using is 5.8.0, the Bio version I'm using is 1.2.3 both
running in RedHat 8.  We are experiencing the same problem when trying to
accomplish the same task on a Sun Solaris 8 machine


------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
_______________________________________________
Bioperl-guts-l mailing list
Bioperl-guts-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-guts-l
From chauser at duke.edu  Thu Oct 23 12:09:34 2003
From: chauser at duke.edu (Charles Hauser)
Date: Thu Oct 23 12:09:35 2003
Subject: [Bioperl-l] GFF3: preliminary or implemented?
Message-ID: <1066925550.20083.4.camel@pandorina.biology.duke.edu>

My last notes re:GFF3 were that it was in the 'preliminary' stage, but
it appears as though Jason update Tools::GFF to handle '='.

Question: is GFF3 a supported version now & is  GBrowse happy w/ GFF3
formatting?


Charles


From lstein at cshl.edu  Thu Oct 23 12:33:07 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Thu Oct 23 12:30:16 2003
Subject: [Bioperl-l] GFF3: preliminary or implemented?
In-Reply-To: <1066925550.20083.4.camel@pandorina.biology.duke.edu>
References: <1066925550.20083.4.camel@pandorina.biology.duke.edu>
Message-ID: <200310231233.07501.lstein@cshl.edu>

Essentially the answer is yes.  A few details need to be worked out, most 
importantly handling the CIGAR gap lines.

Lincoln

On Thursday 23 October 2003 12:12 pm, Charles Hauser wrote:
> My last notes re:GFF3 were that it was in the 'preliminary' stage, but
> it appears as though Jason update Tools::GFF to handle '='.
>
> Question: is GFF3 a supported version now & is  GBrowse happy w/ GFF3
> formatting?
>
>
> Charles
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
========================================================================
Lincoln D. Stein                           Cold Spring Harbor Laboratory
lstein@cshl.org			                  Cold Spring Harbor, NY
========================================================================


From cjm at fruitfly.org  Thu Oct 23 12:48:27 2003
From: cjm at fruitfly.org (Chris Mungall)
Date: Thu Oct 23 12:46:02 2003
Subject: [Bioperl-l] GFF3: preliminary or implemented?
In-Reply-To: <1066925550.20083.4.camel@pandorina.biology.duke.edu>
Message-ID: <Pine.LNX.4.33.0310230945200.30090-100000@sos.lbl.gov>


It seems to me that we need to extend Bio::Tools::GFF to read/write the
seqfeature nesting hierarchy using ID= and Parent= tags

This seems a little problemmatic because it requires the notion of a
unique id for features, which is lacking in bioperl

How should we progress here? Should we add add a unique_id method to
SeqFeature? Should this be autogenerated?

On 23 Oct 2003, Charles Hauser wrote:

> My last notes re:GFF3 were that it was in the 'preliminary' stage, but
> it appears as though Jason update Tools::GFF to handle '='.
>
> Question: is GFF3 a supported version now & is  GBrowse happy w/ GFF3
> formatting?
>
>
> Charles
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

From michael at galton.ucl.ac.uk  Thu Oct 23 16:01:59 2003
From: michael at galton.ucl.ac.uk (michael)
Date: Thu Oct 23 15:59:05 2003
Subject: [Bioperl-l] What's that number?
In-Reply-To: <200309241917.h8OJGoMj031893@portal.open-bio.org>
Message-ID: <Pine.GSO.4.33.0310231600110.3297-100000@ash.gene.ucl.ac.uk>


	Is there any code out there that can make an educated guess to the
origin of any given (biological) ID/accession number?

	X93993 AJ010957 are both valid EMBL accession numbers, P47202
looks similar but is from swissprot (a bit of digging reveals the
swissprot patten as /[OPQ]\d[A-Z\d][A-Z\d][A-Z\d]\d/ ).  Failing code is
there a biological name space document out there?

	I'm writing a public gene symbol submission form and want it to do
the right thing regardless of where the data is pasted in.

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Michael John Lush PhD		 	 Tel:44-20-7679-5027
Nomenclature Bioinformatics Support 	 Fax:44-20-7387-3496
HUGO Gene Nomenclature Committee Email:  nome@galton.ucl.ac.uk
The Galton Laboratory
University College London, UK
URL: http://www.gene.ucl.ac.uk/nomenclature/
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

From o02u at unb.ca  Thu Oct 23 19:42:59 2003
From: o02u at unb.ca (Cooper, Marc Emmerson)
Date: Thu Oct 23 19:40:06 2003
Subject: [Bioperl-l] Question re: Phred Base Quality Data & Bio::SeqIO::scf 
Message-ID: <1066952579.3f986783186c9@webmail1.unb.ca>

Hi everyone, 

I'm looking for some information on base quality data via the Bio::SeqIO::scf package. 
Specifically, I recognize that there are methods to extract the base qualities for both
called and uncalled bases within the SCF file using _dump_base_qualities() or
_dump_base_qualities_incoming().  

I have generated an SCF file using phred and have written a small portion of perl code
to extract the quality data from the SCF file.  However upon examination of the output
I noticed that only the quality data for called bases is available and zero values are
assigned to the quality data for the three other potential bases at each base position.
   

Could anyone tell me whether it is possible to export quality data for the uncalled
bases using phred?  Or do SCF files generated by phred only contain quality data for
called bases?

Cheers,

Marc Cooper
Master's Candidate
Bioinformatics Research Group
Faculty of Computer Science
University of New Brunswick
Fredericton, NB  Canada  
E3B 5A3
email: marc.cooper@unb.ca

From pwilkinson at videotron.ca  Thu Oct 23 21:41:56 2003
From: pwilkinson at videotron.ca (Peter Wilkinson)
Date: Thu Oct 23 21:39:03 2003
Subject: [Bioperl-l] What's that number?
In-Reply-To: <Pine.GSO.4.33.0310231600110.3297-100000@ash.gene.ucl.ac.uk>
References: <200309241917.h8OJGoMj031893@portal.open-bio.org>
Message-ID: <5.2.0.9.0.20031023213506.00b242b8@pop.videotron.ca>

Well it would be nice if there existed some type of global namespace 
management for that sort of thing, however there is none.

I am afraid that we will have to put up with the mountainous clutter of 
sequences and annotations.

Now if someone wants to start some type of public registry for defining 
theses namespaces .... which would not be such a bad idea if one could 
prove how it would help the community. GO is an attempt for function, and 
HUGO for gene names ... no reason not to do it for namespaces 
(databasenamespaces ;) ).

Peter

p.s. perhaps a big pair of billy boots is good enough to wade through it all.


At 09:01 PM 23/10/2003 +0100, michael wrote:


>         Is there any code out there that can make an educated guess to the
>origin of any given (biological) ID/accession number?
>
>         X93993 AJ010957 are both valid EMBL accession numbers, P47202
>looks similar but is from swissprot (a bit of digging reveals the
>swissprot patten as /[OPQ]\d[A-Z\d][A-Z\d][A-Z\d]\d/ ).  Failing code is
>there a biological name space document out there?
>
>         I'm writing a public gene symbol submission form and want it to do
>the right thing regardless of where the data is pasted in.
>
>~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>Michael John Lush PhD                   Tel:44-20-7679-5027
>Nomenclature Bioinformatics Support     Fax:44-20-7387-3496
>HUGO Gene Nomenclature Committee Email:  nome@galton.ucl.ac.uk
>The Galton Laboratory
>University College London, UK
>URL: http://www.gene.ucl.ac.uk/nomenclature/
>~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l


-------------------------------------
Peter Wilkinson
Bioinformatics Consultant

-------------------------------------  


From lstein at cshl.edu  Thu Oct 23 22:23:30 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Thu Oct 23 22:20:51 2003
Subject: [Gmod-gbrowse] Re: [Bioperl-l] GFF3: preliminary or implemented?
In-Reply-To: <Pine.LNX.4.33.0310230945200.30090-100000@sos.lbl.gov>
References: <Pine.LNX.4.33.0310230945200.30090-100000@sos.lbl.gov>
Message-ID: <200310232223.30051.lstein@cshl.edu>

The idea of the ID is that it is just unique within the GFF3 file.  It does 
not necessarily translate into an ID for the feature itself.  I did some work 
on Bio::Tools::GFF so that it would deal with this correctly, but it hasn't 
been torture tested.  GBrowse handles the IDs correctly, but doesn't rely on 
Bio::Tools::GFF.

Lincoln


On Thursday 23 October 2003 12:48 pm, Chris Mungall wrote:
> It seems to me that we need to extend Bio::Tools::GFF to read/write the
> seqfeature nesting hierarchy using ID= and Parent= tags
>
> This seems a little problemmatic because it requires the notion of a
> unique id for features, which is lacking in bioperl
>
> How should we progress here? Should we add add a unique_id method to
> SeqFeature? Should this be autogenerated?
>
> On 23 Oct 2003, Charles Hauser wrote:
> > My last notes re:GFF3 were that it was in the 'preliminary' stage, but
> > it appears as though Jason update Tools::GFF to handle '='.
> >
> > Question: is GFF3 a supported version now & is  GBrowse happy w/ GFF3
> > formatting?
> >
> >
> > Charles
> >
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
> -------------------------------------------------------
> This SF.net email is sponsored by: The SF.net Donation Program.
> Do you like what SourceForge.net is doing for the Open
> Source Community?  Make a contribution, and help us add new
> features and functionality. Click here: http://sourceforge.net/donate/
> _______________________________________________
> Gmod-gbrowse mailing list
> Gmod-gbrowse@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/gmod-gbrowse

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)
From zbigk at yahoo.com  Thu Oct 23 23:09:43 2003
From: zbigk at yahoo.com (ADV; . Energy Systems & Electrical Design)
Date: Fri Oct 24 02:09:31 2003
Subject: [Bioperl-l] ADV:  .  Resume for JOB or Service
Message-ID: <200310240609.h9O69Bdb004596@portal.open-bio.org>


Rich' S 	Resume	

							Santa Clara, CA 95050


  (408) 482-2840          rrsiek@yahoo.com
 

Electrical Designer &  Drafting


Electro-Mechanical Designer & Drafting


Drafting & Design Shopping Centers; grocery stores, hardware stores, restaurants & residential - housing areas, computer business & fast food units installation & testing; 

Energy systems; Solar Panels, Wind Energy, portable & emergency generators; Factory production lines, food & mechanical process machinery Installations & trouble shooting.  

Equipment & production line installations, MCC, Sensors, Wiring, Alarm, Network, Security, Electrical Design & Installations;  Network Sketches, one line diagrams, and "as is" drawings update. 

Customizing Electronic and Electrical Components & Parts, Layouts electronic and electrical schematic, connectors and mechanical detailing. Use CAD, Windows and applications; Programming & Detailing, Production Equipment, Machinery, Conveyors Spiral Elevator, Fast Cannery Transportation, Electronic and Electrical Components, Parts, Schematics & Layouts, Master Control Center, Can Sheet Metal Oven Rebuilding Project, Combustion Remodeling, PD 5,  PLC setup. 

Designing Electric Cars using AutoCAD, Spreadsheet, Basic, dB; electronic and electrical schematic, Layouts, mechanical detailing & redesigning components for manufacturing, Assembly drawings, Customizing Electronic, Mechanical and Electrical Equipment.

Quotes, supply, bids and job estimating. Customers contact, inspection, project mgmt & supervision of electricians & material handling; Commercial & Residential Project; Mgr. for satellite office & shop. 

Commercial, Industrial, Residential, Fire Alarms, Smoke Detectors, Lights, Plugs, Panels, Power Supply, Electro-Solar Installations, Emergency Generators, Transformers, Power Lines, Fire sprinklers design / control, CAD automation. Foreman, Estimator, Designer & CAD Operator (New & "as is" drawings update)

Hands on electrical installation performing, fitting wires & power lines;  Panels, Light, receptacles & Fuse boxes, emergency power supply, parking lighting & post installation; Installing lamps, switches, alarms, plugs, receptacles, fire alarms, smoke detectors, fire & safety installations; Computer & data network wiring; underground installations & conduit layout, bending and mounting; Job Estimating & bid preparation.

Environmental Energy Systems and Green Building Coordinating Programs;  P E C,  C E C,  PG&E training, Solar Living Institute, High Performance Schools, Health Analyzing, using Solar and Wind Energy Friendly Systems, Savings by Design recommendations & new Title 24 Standards - LEED, CEC, AIA & COTE Ratings. 

Electrical & Electrical Drafting, Electrical Design & management, Site Inspection and Quality Control, coordination with General Contractor, Estimating and Supplying, Document Control & Upgrade, daily performance checking & schedule update. 


Electrical Installations & Service

Solar Energy Installations & Service

Electro-Mechanical Assembly & Service

ELECTRICAL PROJECT MANAGER -  COORDINATOR

ELECTRICAL & MAINTENANCE SERVICE

HANDS ON WIRING & INSTALLATION

ELECTRICAL AND MECHANICAL PROJECTS 


Electrical Installations & Service

Solar Energy Installations & Service

Electro-Mechanical Assembly & Service

ELECTRICAL PROJECT MANAGER -  COORDINATOR

ELECTRICAL & MAINTENANCE SERVICE

HANDS ON WIRING & INSTALLATION

ELECTRICAL AND MECHANICAL PROJECTS 

Sun Tracking  double  SOLAR ENERGY


SOLAR ENERGY SYSTEMS

WIND ENERGY SYSTEMS

SUN TRACKING SYSTEMS

SOLAR PALETTE SYSTEMS

FLYWHEEL STORAGE SYSTEMS


Solar Engineer


From zbigk at yahoo.com  Thu Oct 23 23:09:43 2003
From: zbigk at yahoo.com (ADV; . Energy Systems & Electrical Design)
Date: Fri Oct 24 02:09:37 2003
Subject: [Bioperl-l] ADV:  .  Resume for JOB or Service
Message-ID: <200310240609.h9O69Bdb004597@portal.open-bio.org>


Rich' S 	Resume	

							Santa Clara, CA 95050


  (408) 482-2840          rrsiek@yahoo.com
 

Electrical Designer &  Drafting


Electro-Mechanical Designer & Drafting


Drafting & Design Shopping Centers; grocery stores, hardware stores, restaurants & residential - housing areas, computer business & fast food units installation & testing; 

Energy systems; Solar Panels, Wind Energy, portable & emergency generators; Factory production lines, food & mechanical process machinery Installations & trouble shooting.  

Equipment & production line installations, MCC, Sensors, Wiring, Alarm, Network, Security, Electrical Design & Installations;  Network Sketches, one line diagrams, and "as is" drawings update. 

Customizing Electronic and Electrical Components & Parts, Layouts electronic and electrical schematic, connectors and mechanical detailing. Use CAD, Windows and applications; Programming & Detailing, Production Equipment, Machinery, Conveyors Spiral Elevator, Fast Cannery Transportation, Electronic and Electrical Components, Parts, Schematics & Layouts, Master Control Center, Can Sheet Metal Oven Rebuilding Project, Combustion Remodeling, PD 5,  PLC setup. 

Designing Electric Cars using AutoCAD, Spreadsheet, Basic, dB; electronic and electrical schematic, Layouts, mechanical detailing & redesigning components for manufacturing, Assembly drawings, Customizing Electronic, Mechanical and Electrical Equipment.

Quotes, supply, bids and job estimating. Customers contact, inspection, project mgmt & supervision of electricians & material handling; Commercial & Residential Project; Mgr. for satellite office & shop. 

Commercial, Industrial, Residential, Fire Alarms, Smoke Detectors, Lights, Plugs, Panels, Power Supply, Electro-Solar Installations, Emergency Generators, Transformers, Power Lines, Fire sprinklers design / control, CAD automation. Foreman, Estimator, Designer & CAD Operator (New & "as is" drawings update)

Hands on electrical installation performing, fitting wires & power lines;  Panels, Light, receptacles & Fuse boxes, emergency power supply, parking lighting & post installation; Installing lamps, switches, alarms, plugs, receptacles, fire alarms, smoke detectors, fire & safety installations; Computer & data network wiring; underground installations & conduit layout, bending and mounting; Job Estimating & bid preparation.

Environmental Energy Systems and Green Building Coordinating Programs;  P E C,  C E C,  PG&E training, Solar Living Institute, High Performance Schools, Health Analyzing, using Solar and Wind Energy Friendly Systems, Savings by Design recommendations & new Title 24 Standards - LEED, CEC, AIA & COTE Ratings. 

Electrical & Electrical Drafting, Electrical Design & management, Site Inspection and Quality Control, coordination with General Contractor, Estimating and Supplying, Document Control & Upgrade, daily performance checking & schedule update. 


Electrical Installations & Service

Solar Energy Installations & Service

Electro-Mechanical Assembly & Service

ELECTRICAL PROJECT MANAGER -  COORDINATOR

ELECTRICAL & MAINTENANCE SERVICE

HANDS ON WIRING & INSTALLATION

ELECTRICAL AND MECHANICAL PROJECTS 


Electrical Installations & Service

Solar Energy Installations & Service

Electro-Mechanical Assembly & Service

ELECTRICAL PROJECT MANAGER -  COORDINATOR

ELECTRICAL & MAINTENANCE SERVICE

HANDS ON WIRING & INSTALLATION

ELECTRICAL AND MECHANICAL PROJECTS 

Sun Tracking  double  SOLAR ENERGY


SOLAR ENERGY SYSTEMS

WIND ENERGY SYSTEMS

SUN TRACKING SYSTEMS

SOLAR PALETTE SYSTEMS

FLYWHEEL STORAGE SYSTEMS


Solar Engineer


From david.konopnicki at hp.com  Fri Oct 24 05:47:06 2003
From: david.konopnicki at hp.com (Konopnicki, David)
Date: Fri Oct 24 05:44:13 2003
Subject: [Bioperl-l] Netwok problem?
Message-ID: <711B92DC11556E428989346ECC5D9E0C028350B7@raaexc01.emea.cpqcorp.net>

Hi,
I just installed BioPerl and am trying to run:

get_sequence('swissprot',"ROA1_HUMAN");

I receive an error: can't connect to www.ebi.ac.uk:80 (connection refused)
My proxy settings look fine and I get run lwp-download to get www.ebi.ac.uk:80
Any ideas? 
Thanks
-David

From walsh at cenix-bioscience.com  Fri Oct 24 07:25:07 2003
From: walsh at cenix-bioscience.com (Andrew Walsh)
Date: Fri Oct 24 08:39:54 2003
Subject: [Bioperl-l] What's that number?
In-Reply-To: <5.2.0.9.0.20031023213506.00b242b8@pop.videotron.ca>
References: <200309241917.h8OJGoMj031893@portal.open-bio.org>
	<5.2.0.9.0.20031023213506.00b242b8@pop.videotron.ca>
Message-ID: <3F990C13.3040806@cenix-bioscience.com>

There had been work done on something like that for seq retrievals- 
Bio::DB::Universal.  It probably still wouldn't help with the ambiguous 
identifiers though.

Andrew


Peter Wilkinson wrote:
> Well it would be nice if there existed some type of global namespace 
> management for that sort of thing, however there is none.
> 
> I am afraid that we will have to put up with the mountainous clutter of 
> sequences and annotations.
> 
> Now if someone wants to start some type of public registry for defining 
> theses namespaces .... which would not be such a bad idea if one could 
> prove how it would help the community. GO is an attempt for function, 
> and HUGO for gene names ... no reason not to do it for namespaces 
> (databasenamespaces ;) ).
> 
> Peter
> 
> p.s. perhaps a big pair of billy boots is good enough to wade through it 
> all.
> 
> 
> 
> 
> 
> At 09:01 PM 23/10/2003 +0100, michael wrote:
> 
> 
>>         Is there any code out there that can make an educated guess to 
>> the
>> origin of any given (biological) ID/accession number?
>>
>>         X93993 AJ010957 are both valid EMBL accession numbers, P47202
>> looks similar but is from swissprot (a bit of digging reveals the
>> swissprot patten as /[OPQ]\d[A-Z\d][A-Z\d][A-Z\d]\d/ ).  Failing code is
>> there a biological name space document out there?
>>
>>         I'm writing a public gene symbol submission form and want it 
>> to do
>> the right thing regardless of where the data is pasted in.
>>
>> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>> Michael John Lush PhD                   Tel:44-20-7679-5027
>> Nomenclature Bioinformatics Support     Fax:44-20-7387-3496
>> HUGO Gene Nomenclature Committee Email:  nome@galton.ucl.ac.uk
>> The Galton Laboratory
>> University College London, UK
>> URL: http://www.gene.ucl.ac.uk/nomenclature/
>> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>>
>> _______________________________________________
>> Bioperl-l mailing list
>> Bioperl-l@portal.open-bio.org
>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
> 
> 
> -------------------------------------
> Peter Wilkinson
> Bioinformatics Consultant
> 
> ------------------------------------- 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 


-- 
------------------------------------------------------------------
Andrew Walsh, M.Sc.
Bioinformatics Software Engineer
IT Unit
Cenix BioScience GmbH
Pfotenhauerstr. 108
01307 Dresden, Germany
Tel. +49(351)210-2699
Fax  +49(351)210-1309

public key: http://www.cenix-bioscience.com/public_keys/walsh.gpg
------------------------------------------------------------------


From markw at illuminae.com  Fri Oct 24 10:26:26 2003
From: markw at illuminae.com (Mark Wilkinson)
Date: Fri Oct 24 10:23:41 2003
Subject: [BioPerl] Re: [Bioperl-l] What's that number?
In-Reply-To: <5.2.0.9.0.20031023213506.00b242b8@pop.videotron.ca>
References: <200309241917.h8OJGoMj031893@portal.open-bio.org>
	<5.2.0.9.0.20031023213506.00b242b8@pop.videotron.ca>
Message-ID: <1067005586.2036.24.camel@localhost.localdomain>

There are at least two "initiatives" looking at exactly this problem,
but they wont solve the precise problem that you raise (i.e. determining
the namespace of an in-hand ID number post facto):

1)  The LSID proposal from the I3C makes ID numbers "resolvable" so that
you can always determine the namespace of an ID by querying a resolver
service with your in-hand data.  In this system the ID number must take
the form of a LSID-type URI, and someone (the authority, or someone
else) must have set up a resolver service for URI's of that type.

2)  The BioMOBY project, in which ID numbers are always passed together
with a second identifier indicating the namespace under which that ID is
to be interpreted.  Although, unlike LSIDs, we in the BioMOBY project
use the existing ID numbers _verbatim_, this still wont help you in this
case since the person who gave you that ID is obligated by the BioMOBY
API to give you the namespace identifier at the same time.  We do have a
public registry of namespaces, however, largely based on the GO Xref
Abbreviations List.

in any case, once you get an ID number without namespace information you
are throwing your life on the mercy of regular expression searches of
loosely defined conventions...  bad luck for you!

All the more reason we should start using these two technologies as
quickly as possible!! (though call me biased ;-) )

Mark


On Thu, 2003-10-23 at 19:41, Peter Wilkinson wrote:
> Well it would be nice if there existed some type of global namespace 
> management for that sort of thing, however there is none.
> 
> I am afraid that we will have to put up with the mountainous clutter of 
> sequences and annotations.
> 
> Now if someone wants to start some type of public registry for defining 
> theses namespaces .... which would not be such a bad idea if one could 
> prove how it would help the community. GO is an attempt for function, and 
> HUGO for gene names ... no reason not to do it for namespaces 
> (databasenamespaces ;) ).
> 
> Peter
> 
> p.s. perhaps a big pair of billy boots is good enough to wade through it all.
> 
> 
> 
> 
> 
> At 09:01 PM 23/10/2003 +0100, michael wrote:
> 
> 
> >         Is there any code out there that can make an educated guess to the
> >origin of any given (biological) ID/accession number?
> >
> >         X93993 AJ010957 are both valid EMBL accession numbers, P47202
> >looks similar but is from swissprot (a bit of digging reveals the
> >swissprot patten as /[OPQ]\d[A-Z\d][A-Z\d][A-Z\d]\d/ ).  Failing code is
> >there a biological name space document out there?
> >
> >         I'm writing a public gene symbol submission form and want it to do
> >the right thing regardless of where the data is pasted in.
> >
> >~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> >Michael John Lush PhD                   Tel:44-20-7679-5027
> >Nomenclature Bioinformatics Support     Fax:44-20-7387-3496
> >HUGO Gene Nomenclature Committee Email:  nome@galton.ucl.ac.uk
> >The Galton Laboratory
> >University College London, UK
> >URL: http://www.gene.ucl.ac.uk/nomenclature/
> >~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> >
> >_______________________________________________
> >Bioperl-l mailing list
> >Bioperl-l@portal.open-bio.org
> >http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
> 
> -------------------------------------
> Peter Wilkinson
> Bioinformatics Consultant
> 
> -------------------------------------  
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
Mark Wilkinson <markw@illuminae.com>
Illuminae

From chenn at cshl.edu  Fri Oct 24 15:15:36 2003
From: chenn at cshl.edu (Jack Chen)
Date: Fri Oct 24 15:13:07 2003
Subject: [Bioperl-l] Bio::Graphics::Pictogram
Message-ID: <Pine.GSO.4.05.10310241513390.11090-100000@phage.cshl.edu>

A nice and useful module. Is there a plan to make it work with proteins?
Also, it currently does not recognize lower case sequences.

Jack

++++++++++++++++++++++++++++++++++++++++++++
    o-o     Jack Chen, Stein Laboratory      
    o---o   Cold Spring Harbor Laboratory    
  o----o    #5 Williams, 1 Bungtown Road           
 O----O     Cold Spring Harbor, NY, 11724   
 0--o       Tel: 1 516 367 8394              
   O        e-mail: chenn@cshl.org  	    
  o-o       Website: http://www.wormbase.org
+++++++++++++++++++++++++++++++++++++++++++++
        

From shawnh at stanford.edu  Fri Oct 24 15:59:09 2003
From: shawnh at stanford.edu (Shawn Hoon)
Date: Fri Oct 24 15:52:02 2003
Subject: [Bioperl-l] Bio::Graphics::Pictogram
In-Reply-To: <Pine.GSO.4.05.10310241513390.11090-100000@phage.cshl.edu>
References: <Pine.GSO.4.05.10310241513390.11090-100000@phage.cshl.edu>
Message-ID: <87CD5708-065C-11D8-A8A6-000A95783436@stanford.edu>


On Friday, October 24, 2003, at 12:15PM, Jack Chen wrote:

> A nice and useful module. Is there a plan to make it work with 
> proteins?
> Also, it currently does not recognize lower case sequences.
>
Thanks, just committed a fix for the lowercase.

I was supposed to work on a project with Allen Day moving the module to 
a SVG
package he was writing for cpan. This would make Pictogram a generic 
module that
should handle aa as well since the current module will need some work 
to get it to extend.
  However 1st year of grad school is keeping me busy right now
and its not going very far. If you would like to look at contributing, 
you might want to talk to Allen more on
this

cheers,

shawn


> Jack
>
> ++++++++++++++++++++++++++++++++++++++++++++
>     o-o     Jack Chen, Stein Laboratory
>     o---o   Cold Spring Harbor Laboratory
>   o----o    #5 Williams, 1 Bungtown Road
>  O----O     Cold Spring Harbor, NY, 11724
>  0--o       Tel: 1 516 367 8394
>    O        e-mail: chenn@cshl.org  	
>   o-o       Website: http://www.wormbase.org
> +++++++++++++++++++++++++++++++++++++++++++++
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From heikki at nildram.co.uk  Sat Oct 25 10:44:14 2003
From: heikki at nildram.co.uk (Heikki Lehvaslaiho)
Date: Sat Oct 25 10:41:37 2003
Subject: [Bioperl-l] Codon Table
In-Reply-To: <1066398681.7861.47.camel@localhost>
References: <20B7EB075F2D4542AFFAF813E98ACD9301C00BEB@cl-exsrv1.irad.bbsrc.ac.uk>
	<1066398681.7861.47.camel@localhost>
Message-ID: <1067093053.8427.59.camel@localhost>

Mick,

The PrimarySeq::translate interface is a mess, so adding one more
attribute the the end of the long list is not going to make much a
difference. I added the code into CVS.

	-Heikki


On Fri, 2003-10-17 at 14:51, Heikki Lehvaslaiho wrote:
> Mick,
> 
> I really have not thought about this possibility. CodonTable can easily
> be modified, but there is no way of passing the modified codontable
> object to sequence::translate. The translate function has really complex
> argument list already, so there is no way we can add anything in there. 
> 
> Depending what you want to do, it might be enough to modify the
> codontable object and use its translate function. Compare it to
> Bio::PrimarySeqI::translate to see what I mean. If you need full
> seq::translate functionality, Bio::SeqUtils is the place to put the
> code, e.g. $polypept = Bio::SeqUtils::translate($seq, $codontable);
> 
> I wrote the following method to be added into Bio::Tools::CodonTable. If
> you think it is useful, I can commit it.
> 
> Yours,
> 	-Heikki
> 
> 
> =head2 add_table
> 
>  Title   : add_table
>  Usage   : $newid = $ct->add_table($name, $table, $starts)
>  Function: Add a custom Codon Table into the object.
>            Know what you are doing, only the length of
>            the argument strings is checked!
>  Returns : the id of the new codon table
>  Args    : name, a string, optional (can be empty)
>            table, a string of 64 characters
>            startcodons, a string of 64 characters, defaults to standard
> 
> =cut
> 
> sub add_table {
>     my ($self, $name, $table, $starts) = @_;
> 
>     $name ||= 'Custom'. scalar @NAMES + 1;
>     $starts ||= $STARTS[0]; 
>     $self->throw('Suspect input!')
>         unless length($table) == 64 and length($starts) == 64;
> 
>     push @NAMES, $name;
>     push @TABLES, $table;
>     push @STARTS, $starts;
> 
>     return scalar @NAMES;
> 
> }
> 
> usage:
> 
> $ct_id = $ct->add_table(@custom_table);
> $ct->id($ct_id);
> $string = $ct->translate($nt_string);
> 
> 
>  
> 
> 
> 
> On Tue, 2003-10-14 at 17:03, michael watson (IAH-C) wrote:
> > Hi
> > 
> > I want to use a different codon table than any that are present in CodonTable.pm.  Does Bioperl have a way to implement this?  Or will I have to hack CodonTable.pm?
> > 
> > Thanks
> > Mick
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________

From heikki at nildram.co.uk  Sat Oct 25 11:06:27 2003
From: heikki at nildram.co.uk (Heikki Lehvaslaiho)
Date: Sat Oct 25 11:03:39 2003
Subject: [Bioperl-l] tests
Message-ID: <1067094387.8433.68.camel@localhost>


I went to a really boring meeting and there was no network connection,
so I ended up fixing bioperl tests to better skip individual tests when
network is down. 

The only remaining test files with failures (under linux and perl 5.8.1)
are BioGraphics.t and Registry.t. Brian is working on the latter and
Lincoln has previously hinted that there might be a better way of
testing png file generation. Would be nice to see it in place.

	-Heikki


From hcuong at ibt.ac.vn  Fri Oct 24 23:19:35 2003
From: hcuong at ibt.ac.vn (hcuong)
Date: Sat Oct 25 12:52:52 2003
Subject: [Bioperl-l] Help me to deal with "MSG: blastall call crashed: -1
	blastall ..."
Message-ID: <002f01c39aa6$d0e71680$12010b0a@nhc>

Hi, Please help me. I try to make BLAST search for the first time by the test script:
> #----------------
>
> #!/usr/bin/perl
>
>  use strict;
>
> use Bio::Tools::Run::StandAloneBlast;
>
>  my @params = (program => 'blastn',
>
>                             database => 'ecoli1.nt');
>
> my $factory = Bio::Tools::Run::StandAloneBlast->new(@params);
>
> my $input = Bio::Seq->new(-id => "test query",
>
>                                              -seq => "ACTAAGTGGGGG");
>
> my $blast_report = $factory->blastall ($input);
>
>  print "$blast_report\n";
>
> #----------------

But I always get the following error only:

>  ------------- EXCEPTION  -------------
>
> MSG: blastall call crashed: -1 blastall -p  blastn  -d  /ecoli1.nt  -i  /tmp/bSh3ZAfK7n  -o  /tmp/gv9wpcMj8g
>
>  STACK Bio::Tools::Run::StandAloneBlast::_runblast /usr/local/ActivePerl-5.6/lib/site_perl/5.6.1/Bio/Tools/Run/StandAloneBlast.pm:633
>
> STACK Bio::Tools::Run::StandAloneBlast::_generic_local_blast /usr/local/ActivePerl-5.6/lib/site_perl/5.6.1/Bio/Tools/Run/StandAloneBlast.pm:602
>
> STACK Bio::Tools::Run::StandAloneBlast::blastall /usr/local/ActivePerl-5.6/lib/site_perl/5.6.1/Bio/Tools/Run/StandAloneBlast.pm:489
>
> TACK toplevel blast_local.pl:15


---
Nguyen Huu Cuong
Plant Cell Biotechnology Department
Institute of Biotechnology, NCST
18 Hoang Quoc Viet, Cau Giay, Ha Noi
Tel: 84 - 4 - 7562368
Email: hcuong@ibt.ac.vn
From rpraca5 at yahoo.com  Sat Oct 25 10:45:36 2003
From: rpraca5 at yahoo.com (Richard's    Resume)
Date: Sat Oct 25 13:45:21 2003
Subject: [Bioperl-l] Mechanical   &   Design     E n g i n e e r
Message-ID: <200310251745.h9PHjBdb008499@portal.open-bio.org>


Richard's


Resume	 for Job, 	Consulting or Service

OBJECTIVE: 	

			
	Sr.  MECHANICAL  &  DESIGN    ENGINEER

Project Mgr, Electro-Mechanical Design, Mfg, Product Development, R&D, CADD

Tel: ( 408)  309-7006 	rpraca3@yahoo.com


EXPERIENCE:
08/93-present          Sr. MECHANICAL & DESIGN ENGINEER, CADD Mgr "Mech-Tronic" 
                   
Engineering, Design & Product Development Service; Project Management, Mfg, Tooling, Product Improvement; Developing Working Product, Design and Prototype Production;  Controlling & Managing testing,  redesign, manufacturing documentation preparation;  
and full realization of Task using technical approach, schedules,  and budget to 
achieve Final Product.  

Selecting vendors and specialist to supply performance  satisfying customers expectation and to create a new, modern product with advanced properties, quality and market desirable working abilities and sellable values.


Preparing propositions & presentations, Review Manufacturing Process and Quality 
Control, Inspecting standards, Engineering Computations and Technical Improvements,  
Supplying and Production Automation. Project Management & Development. Preparing technical documentation, calculations, engineering, design, layouts, drawings, 
3D and Solid Modeling, development & propositions. Hard Drive Design, testing, 
balancing, recalculating and redesign. 

Manufacturing and Assembly Equipment design and build.  Systems Integration.
Tooling and Operations Development, Implementation & Automation for mass production.  Energy, Electric Vehicles and Computerized Transportation System Design & Implement. Solar Panels. Solid Works, Pro-E, CAD Management and Operations, Analyzing, Micro Station, ACAD 10-2002 & LT, Win 3x & 2000', Net, Internet, Softdesk, Structural Design. Network, Security, Electrical Design & Installations; Commercial, Industrial, Residential, Fire Alarms, Smoke Detectors, Lights, Plugs, Panels, Power Supply,  Electro - Solar Installations, CADD automation. Dynamics, Kinematics, thermodynamics / heat transfer & FEA analyzing.

Manufacturing hydraulic & pneumatic equipment, machinery and control systems, mechanisms, robotics device, precision machine elements.  Inspecting and control manufacturing standards, analyzing stresses and tolerances, selecting materials, engineering computations and technical improvements, documentation, projects development, supplying.  Automation, Conveyors, Spiral Elevator, Fast Cannery Transportation, Electronic and Electrical Components, Master Control Center, Sheet Metal Oven Rebuilding Project. Power Systems, UPS.


Automation equipment and machine control using for machines and robotics precision mechanisms motion programming, fluid mechanics, pneumatics systems, mounting and positioning devices, electro-mechanical and vacuum mechanisms, design and analysis of structures, castings, welded frames, mechanical detailing. 


Computers: 

DOS, SUN UNIX, MAC, WP, dB, Lotus, Network, Windows & Applications; MS Project, Excel, Access, Word, PFS, Graphics, CAD / CAM, Excel, Basic, C, Fortran, Analyzes. 
CADD systems, Algor, VAX, Net.  MCAD, Acad's 2002 & Softdesk, Script, Nastran, Infusoft, LAN, Tektronix, Cadkey,  Lisp, Personal Designer, ACAD / Computer Instructor, Machine Design, Robotics & Automation, WIND, SOLAR, METRIC.


EDUCATION:

Institute for Business & Technology, California
CADD Engineer,  Programming,  Design,  Management

Electro - Mechanical  College
Mechanical Engineering - BSME,   ASEE,  MBA


Open for Travel  *  Salary open *  Permanent  preferred  or Consulting


.
From rpraca5 at yahoo.com  Sat Oct 25 10:45:36 2003
From: rpraca5 at yahoo.com (Richard's    Resume)
Date: Sat Oct 25 13:45:30 2003
Subject: [Bioperl-l] Mechanical   &   Design     E n g i n e e r
Message-ID: <200310251745.h9PHjEdb008506@portal.open-bio.org>


Richard's


Resume	 for Job, 	Consulting or Service

OBJECTIVE: 	

			
	Sr.  MECHANICAL  &  DESIGN    ENGINEER

Project Mgr, Electro-Mechanical Design, Mfg, Product Development, R&D, CADD

Tel: ( 408)  309-7006 	rpraca3@yahoo.com


EXPERIENCE:
08/93-present          Sr. MECHANICAL & DESIGN ENGINEER, CADD Mgr "Mech-Tronic" 
                   
Engineering, Design & Product Development Service; Project Management, Mfg, Tooling, Product Improvement; Developing Working Product, Design and Prototype Production;  Controlling & Managing testing,  redesign, manufacturing documentation preparation;  
and full realization of Task using technical approach, schedules,  and budget to 
achieve Final Product.  

Selecting vendors and specialist to supply performance  satisfying customers expectation and to create a new, modern product with advanced properties, quality and market desirable working abilities and sellable values.


Preparing propositions & presentations, Review Manufacturing Process and Quality 
Control, Inspecting standards, Engineering Computations and Technical Improvements,  
Supplying and Production Automation. Project Management & Development. Preparing technical documentation, calculations, engineering, design, layouts, drawings, 
3D and Solid Modeling, development & propositions. Hard Drive Design, testing, 
balancing, recalculating and redesign. 

Manufacturing and Assembly Equipment design and build.  Systems Integration.
Tooling and Operations Development, Implementation & Automation for mass production.  Energy, Electric Vehicles and Computerized Transportation System Design & Implement. Solar Panels. Solid Works, Pro-E, CAD Management and Operations, Analyzing, Micro Station, ACAD 10-2002 & LT, Win 3x & 2000', Net, Internet, Softdesk, Structural Design. Network, Security, Electrical Design & Installations; Commercial, Industrial, Residential, Fire Alarms, Smoke Detectors, Lights, Plugs, Panels, Power Supply,  Electro - Solar Installations, CADD automation. Dynamics, Kinematics, thermodynamics / heat transfer & FEA analyzing.

Manufacturing hydraulic & pneumatic equipment, machinery and control systems, mechanisms, robotics device, precision machine elements.  Inspecting and control manufacturing standards, analyzing stresses and tolerances, selecting materials, engineering computations and technical improvements, documentation, projects development, supplying.  Automation, Conveyors, Spiral Elevator, Fast Cannery Transportation, Electronic and Electrical Components, Master Control Center, Sheet Metal Oven Rebuilding Project. Power Systems, UPS.


Automation equipment and machine control using for machines and robotics precision mechanisms motion programming, fluid mechanics, pneumatics systems, mounting and positioning devices, electro-mechanical and vacuum mechanisms, design and analysis of structures, castings, welded frames, mechanical detailing. 


Computers: 

DOS, SUN UNIX, MAC, WP, dB, Lotus, Network, Windows & Applications; MS Project, Excel, Access, Word, PFS, Graphics, CAD / CAM, Excel, Basic, C, Fortran, Analyzes. 
CADD systems, Algor, VAX, Net.  MCAD, Acad's 2002 & Softdesk, Script, Nastran, Infusoft, LAN, Tektronix, Cadkey,  Lisp, Personal Designer, ACAD / Computer Instructor, Machine Design, Robotics & Automation, WIND, SOLAR, METRIC.


EDUCATION:

Institute for Business & Technology, California
CADD Engineer,  Programming,  Design,  Management

Electro - Mechanical  College
Mechanical Engineering - BSME,   ASEE,  MBA


Open for Travel  *  Salary open *  Permanent  preferred  or Consulting


.
From lstein at cshl.edu  Sat Oct 25 16:25:50 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Sat Oct 25 16:23:44 2003
Subject: [Bioperl-l] tests
In-Reply-To: <1067094387.8433.68.camel@localhost>
References: <1067094387.8433.68.camel@localhost>
Message-ID: <200310251625.50445.lstein@cshl.edu>

Hi Heikki,

Option 1:

Run BioGraphics.t with the --write switch:

	perl t/BioGraphics.t --write

This will create a set of pngs in t/data/biographics named 
t1.png, t2.png and t3.png.  Look at them visually and 
confirm that they are similar to the variants located in 
t/data/biographics/version*.  If so, move the new versions 
into the appropriate version directory.

Option 2:

I will disable the PNG tests completely.  Every different 
combination of libpng and libgd generates files that are 
visually identical but incompatible at the binary level.  
These tests aren't scaling.

The SVG version of Bio::Graphics is almost finished, and 
when done I will replace the PNG tests with SVG tests.  
Since SVG is a text-only XML format, it won't suffer these 
problems.

Lincoln

On Saturday 25 October 2003 11:06 am, Heikki Lehvaslaiho 
wrote:
> I went to a really boring meeting and there was no
> network connection, so I ended up fixing bioperl tests to
> better skip individual tests when network is down.
>
> The only remaining test files with failures (under linux
> and perl 5.8.1) are BioGraphics.t and Registry.t. Brian
> is working on the latter and Lincoln has previously
> hinted that there might be a better way of testing png
> file generation. Would be nice to see it in place.
>
> 	-Heikki
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
From heikki at nildram.co.uk  Sat Oct 25 18:07:32 2003
From: heikki at nildram.co.uk (Heikki Lehvaslaiho)
Date: Sat Oct 25 18:04:35 2003
Subject: [Bioperl-l] tests
In-Reply-To: <200310251625.50445.lstein@cshl.edu>
References: <1067094387.8433.68.camel@localhost>
	<200310251625.50445.lstein@cshl.edu>
Message-ID: <1067119651.8428.73.camel@localhost>

I'd prefer option 2: disabling the png tests and replacing them later
with SVG tests.

Thanks,
	-Heikki

On Sat, 2003-10-25 at 21:25, Lincoln Stein wrote:
> Hi Heikki,
> 
> Option 1:
> 
> Run BioGraphics.t with the --write switch:
> 
> 	perl t/BioGraphics.t --write
> 
> This will create a set of pngs in t/data/biographics named 
> t1.png, t2.png and t3.png.  Look at them visually and 
> confirm that they are similar to the variants located in 
> t/data/biographics/version*.  If so, move the new versions 
> into the appropriate version directory.
> 
> Option 2:
> 
> I will disable the PNG tests completely.  Every different 
> combination of libpng and libgd generates files that are 
> visually identical but incompatible at the binary level.  
> These tests aren't scaling.
> 
> The SVG version of Bio::Graphics is almost finished, and 
> when done I will replace the PNG tests with SVG tests.  
> Since SVG is a text-only XML format, it won't suffer these 
> problems.
> 
> Lincoln
> 
> On Saturday 25 October 2003 11:06 am, Heikki Lehvaslaiho 
> wrote:
> > I went to a really boring meeting and there was no
> > network connection, so I ended up fixing bioperl tests to
> > better skip individual tests when network is down.
> >
> > The only remaining test files with failures (under linux
> > and perl 5.8.1) are BioGraphics.t and Registry.t. Brian
> > is working on the latter and Lincoln has previously
> > hinted that there might be a better way of testing png
> > file generation. Would be nice to see it in place.
> >
> > 	-Heikki
> >
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________

From lstein at cshl.edu  Sat Oct 25 21:33:47 2003
From: lstein at cshl.edu (Lincoln Stein)
Date: Sat Oct 25 21:31:00 2003
Subject: [Bioperl-l] tests
In-Reply-To: <1067119651.8428.73.camel@localhost>
References: <1067094387.8433.68.camel@localhost>
	<200310251625.50445.lstein@cshl.edu>
	<1067119651.8428.73.camel@localhost>
Message-ID: <200310252133.47115.lstein@cshl.edu>

Done.

Lincoln

On Saturday 25 October 2003 06:07 pm, Heikki Lehvaslaiho wrote:
> I'd prefer option 2: disabling the png tests and replacing them later
> with SVG tests.
>
> Thanks,
> 	-Heikki
>
> On Sat, 2003-10-25 at 21:25, Lincoln Stein wrote:
> > Hi Heikki,
> >
> > Option 1:
> >
> > Run BioGraphics.t with the --write switch:
> >
> > 	perl t/BioGraphics.t --write
> >
> > This will create a set of pngs in t/data/biographics named
> > t1.png, t2.png and t3.png.  Look at them visually and
> > confirm that they are similar to the variants located in
> > t/data/biographics/version*.  If so, move the new versions
> > into the appropriate version directory.
> >
> > Option 2:
> >
> > I will disable the PNG tests completely.  Every different
> > combination of libpng and libgd generates files that are
> > visually identical but incompatible at the binary level.
> > These tests aren't scaling.
> >
> > The SVG version of Bio::Graphics is almost finished, and
> > when done I will replace the PNG tests with SVG tests.
> > Since SVG is a text-only XML format, it won't suffer these
> > problems.
> >
> > Lincoln
> >
> > On Saturday 25 October 2003 11:06 am, Heikki Lehvaslaiho
> >
> > wrote:
> > > I went to a really boring meeting and there was no
> > > network connection, so I ended up fixing bioperl tests to
> > > better skip individual tests when network is down.
> > >
> > > The only remaining test files with failures (under linux
> > > and perl 5.8.1) are BioGraphics.t and Registry.t. Brian
> > > is working on the latter and Lincoln has previously
> > > hinted that there might be a better way of testing png
> > > file generation. Would be nice to see it in place.
> > >
> > > 	-Heikki
> > >
> > >
> > > _______________________________________________
> > > Bioperl-l mailing list
> > > Bioperl-l@portal.open-bio.org
> > > http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)
From jbedell at oriongenomics.com  Sun Oct 26 01:51:51 2003
From: jbedell at oriongenomics.com (Joseph Bedell)
Date: Sun Oct 26 01:51:03 2003
Subject: [Bioperl-l] Help me to deal with "MSG: blastall call crashed:
	-1blastall ..."
Message-ID: <434AF352F9D03C4C896782B8CC78BC761A3CF4@vader.oriongenomics.com>


Hi-

This is only a guess, but by the looks of the error statement (the -d
/ecoli1.nt), I think you need to set an environment variable to point to
the location of your ecoli1.nt database. I believe that you can set
either $BLASTDATADIR or $BLASTDB to the correct directory and it should
work.

Joey


>-----Original Message-----
>From: bioperl-l-bounces@portal.open-bio.org [mailto:bioperl-l-
>bounces@portal.open-bio.org] On Behalf Of hcuong
>Sent: Friday, October 24, 2003 10:20 PM
>To: bioperl-l@bioperl.org
>Subject: [Bioperl-l] Help me to deal with "MSG: blastall call crashed:
-
>1blastall ..."
>
>Hi, Please help me. I try to make BLAST search for the first time by
the
>test script:
>> #----------------
>>
>> #!/usr/bin/perl
>>
>>  use strict;
>>
>> use Bio::Tools::Run::StandAloneBlast;
>>
>>  my @params = (program => 'blastn',
>>
>>                             database => 'ecoli1.nt');
>>
>> my $factory = Bio::Tools::Run::StandAloneBlast->new(@params);
>>
>> my $input = Bio::Seq->new(-id => "test query",
>>
>>                                              -seq => "ACTAAGTGGGGG");
>>
>> my $blast_report = $factory->blastall ($input);
>>
>>  print "$blast_report\n";
>>
>> #----------------
>
>But I always get the following error only:
>
>>  ------------- EXCEPTION  -------------
>>
>> MSG: blastall call crashed: -1 blastall -p  blastn  -d  /ecoli1.nt
-i
>/tmp/bSh3ZAfK7n  -o  /tmp/gv9wpcMj8g
>>
>>  STACK Bio::Tools::Run::StandAloneBlast::_runblast
/usr/local/ActivePerl-
>5.6/lib/site_perl/5.6.1/Bio/Tools/Run/StandAloneBlast.pm:633
>>
>> STACK Bio::Tools::Run::StandAloneBlast::_generic_local_blast
>/usr/local/ActivePerl-
>5.6/lib/site_perl/5.6.1/Bio/Tools/Run/StandAloneBlast.pm:602
>>
>> STACK Bio::Tools::Run::StandAloneBlast::blastall
/usr/local/ActivePerl-
>5.6/lib/site_perl/5.6.1/Bio/Tools/Run/StandAloneBlast.pm:489
>>
>> TACK toplevel blast_local.pl:15
>
>
>---
>Nguyen Huu Cuong
>Plant Cell Biotechnology Department
>Institute of Biotechnology, NCST
>18 Hoang Quoc Viet, Cau Giay, Ha Noi
>Tel: 84 - 4 - 7562368
>Email: hcuong@ibt.ac.vn

From michael.watson at bbsrc.ac.uk  Mon Oct 27 04:14:06 2003
From: michael.watson at bbsrc.ac.uk (michael watson (IAH-C))
Date: Mon Oct 27 04:12:30 2003
Subject: [Bioperl-l] Codon Table
Message-ID: <20B7EB075F2D4542AFFAF813E98ACD930282236E@cl-exsrv1.irad.bbsrc.ac.uk>

Thanks Heidi, your efforts are appreciated :-)

-----Original Message-----
From: Heikki Lehvaslaiho [mailto:heikki@nildram.co.uk]
Sent: 25 October 2003 15:44
To: michael watson (IAH-C)
Cc: 'bioperl-l@bioperl.org'
Subject: Re: [Bioperl-l] Codon Table


Mick,

The PrimarySeq::translate interface is a mess, so adding one more
attribute the the end of the long list is not going to make much a
difference. I added the code into CVS.

	-Heikki


On Fri, 2003-10-17 at 14:51, Heikki Lehvaslaiho wrote:
> Mick,
> 
> I really have not thought about this possibility. CodonTable can easily
> be modified, but there is no way of passing the modified codontable
> object to sequence::translate. The translate function has really complex
> argument list already, so there is no way we can add anything in there. 
> 
> Depending what you want to do, it might be enough to modify the
> codontable object and use its translate function. Compare it to
> Bio::PrimarySeqI::translate to see what I mean. If you need full
> seq::translate functionality, Bio::SeqUtils is the place to put the
> code, e.g. $polypept = Bio::SeqUtils::translate($seq, $codontable);
> 
> I wrote the following method to be added into Bio::Tools::CodonTable. If
> you think it is useful, I can commit it.
> 
> Yours,
> 	-Heikki
> 
> 
> =head2 add_table
> 
>  Title   : add_table
>  Usage   : $newid = $ct->add_table($name, $table, $starts)
>  Function: Add a custom Codon Table into the object.
>            Know what you are doing, only the length of
>            the argument strings is checked!
>  Returns : the id of the new codon table
>  Args    : name, a string, optional (can be empty)
>            table, a string of 64 characters
>            startcodons, a string of 64 characters, defaults to standard
> 
> =cut
> 
> sub add_table {
>     my ($self, $name, $table, $starts) = @_;
> 
>     $name ||= 'Custom'. scalar @NAMES + 1;
>     $starts ||= $STARTS[0]; 
>     $self->throw('Suspect input!')
>         unless length($table) == 64 and length($starts) == 64;
> 
>     push @NAMES, $name;
>     push @TABLES, $table;
>     push @STARTS, $starts;
> 
>     return scalar @NAMES;
> 
> }
> 
> usage:
> 
> $ct_id = $ct->add_table(@custom_table);
> $ct->id($ct_id);
> $string = $ct->translate($nt_string);
> 
> 
>  
> 
> 
> 
> On Tue, 2003-10-14 at 17:03, michael watson (IAH-C) wrote:
> > Hi
> > 
> > I want to use a different codon table than any that are present in CodonTable.pm.  Does Bioperl have a way to implement this?  Or will I have to hack CodonTable.pm?
> > 
> > Thanks
> > Mick
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From birney at ebi.ac.uk  Mon Oct 27 05:15:41 2003
From: birney at ebi.ac.uk (Ewan Birney)
Date: Mon Oct 27 05:13:14 2003
Subject: [Bioperl-l] Codon Table
In-Reply-To: <20B7EB075F2D4542AFFAF813E98ACD930282236E@cl-exsrv1.irad.bbsrc.ac.uk>
Message-ID: <Pine.LNX.4.44.0310271015140.21332-100000@pigeon.ebi.ac.uk>

On Mon, 27 Oct 2003, michael watson (IAH-C) wrote:

> Thanks Heidi, your efforts are appreciated :-)
> 


Heidi? Who's Heidi? (Heikki - time to get your hair extensions and 
pig-tails out!)


From Marc.Logghe at devgen.com  Mon Oct 27 05:20:53 2003
From: Marc.Logghe at devgen.com (Marc Logghe)
Date: Mon Oct 27 05:23:31 2003
Subject: [Bioperl-l] Codon Table
Message-ID: <BEE28BF86078B6429D6C780635718E212E742B@morelia.be.devgen.com>


> -----Original Message-----
> From: Ewan Birney [mailto:birney@ebi.ac.uk]
> Sent: Monday, October 27, 2003 11:16 AM
> To: michael watson (IAH-C)
> Cc: 'bioperl-l@bioperl.org'; 'heikki@ebi.ac.uk'
> Subject: RE: [Bioperl-l] Codon Table
> 
> 
> On Mon, 27 Oct 2003, michael watson (IAH-C) wrote:
> 
> > Thanks Heidi, your efforts are appreciated :-)
> > 
> 
> 
> Heidi? Who's Heidi? (Heikki - time to get your hair extensions and 
> pig-tails out!)
He already did, apparently. Don't know what was better though ;-)
http://www.ebi.ac.uk/Information/Staff/person_maint.php?person_id=305

From michael.watson at bbsrc.ac.uk  Mon Oct 27 05:36:07 2003
From: michael.watson at bbsrc.ac.uk (michael watson (IAH-C))
Date: Mon Oct 27 05:37:19 2003
Subject: [Bioperl-l] Codon Table
Message-ID: <20B7EB075F2D4542AFFAF813E98ACD9302822373@cl-exsrv1.irad.bbsrc.ac.uk>

It'S MONDAY MORNING!! ;-)

-----Original Message-----
From: Marc Logghe [mailto:Marc.Logghe@devgen.com]
Sent: 27 October 2003 10:21
To: Ewan Birney; michael watson (IAH-C)
Cc: bioperl-l@bioperl.org; heikki@ebi.ac.uk
Subject: RE: [Bioperl-l] Codon Table


> -----Original Message-----
> From: Ewan Birney [mailto:birney@ebi.ac.uk]
> Sent: Monday, October 27, 2003 11:16 AM
> To: michael watson (IAH-C)
> Cc: 'bioperl-l@bioperl.org'; 'heikki@ebi.ac.uk'
> Subject: RE: [Bioperl-l] Codon Table
> 
> 
> On Mon, 27 Oct 2003, michael watson (IAH-C) wrote:
> 
> > Thanks Heidi, your efforts are appreciated :-)
> > 
> 
> 
> Heidi? Who's Heidi? (Heikki - time to get your hair extensions and 
> pig-tails out!)
He already did, apparently. Don't know what was better though ;-)
http://www.ebi.ac.uk/Information/Staff/person_maint.php?person_id=305
From ssacrich at yahoo.com  Mon Oct 27 04:58:26 2003
From: ssacrich at yahoo.com (Civil & Structural Designer )
Date: Mon Oct 27 07:58:00 2003
Subject: [Bioperl-l] Resume from  CAD  Specialist / Operator / Drafter
Message-ID: <200310271257.h9RCvrdb011310@portal.open-bio.org>


RIC  SIE

Tel   (408)    482-2840    rzbig@yahoo.com


OBJECTIVE:		STRUCTURAL &  MECHANICAL  DESIGNER

CIVIL, ARCHITECTURAL, TRANSPORTATION,  CAD Operator 

EXPERIENCE:

89 - present
DESIGNER, ENGINEER,  CAD MANAGER; 

Engineering & Design Service, Project Management & Development.
Preparing technical documentation, calculations, layouts drawings &
propositions. CAD Management and Operations, drafting & redesigning.
Intergraph, MicroStation,  Autodesk, ACAD, Win, Net, Softdesk Mgmt
Civil, Bridges and Structural Design, Plans, Mapping, Detail Freeway
& Roadway, data translation & inserting. Script & CAD automation.
Geological Structures, Viaducts, Freeways, Highways, Shopping Center. 
Architectural and Environmental Projects and cooperation; military
facilities and plans, Cities, Airports, remediation drawings upgrade,
correcting and redesign. Traffic design & problem analyzes-reorganize.
Freeway Design & Drafting Support, Site analyzing for Caltrans,
Architectural, Archeotype & Electrical drawings, "as is" and initial design;          
Develop remediation procedure and equipment for lead painted buildings.
Construction management, Job site inspection, civil & structural support
Mechanical Evaluations - Design - Service and Maintenance; R&D.

EDUCATION:


 College -  BS  Degree - CAD, Engineering

DOS, UNIX, MAC, SUN computers;  WP, dBase, Lotus, Network, Lisp,
Windows & Applications:  Excel, Words, Access, Power Point & more
METRIC,  SOLAR,  AutoCAD/Computer Instructor. Transportation Spec.
Personal Designer, MS Project, C, Script, File Management, File transfer. 
Learn quickly, work independly, shift, overtime.


From ssacrich at yahoo.com  Mon Oct 27 04:58:26 2003
From: ssacrich at yahoo.com (Civil & Structural Designer )
Date: Mon Oct 27 07:58:21 2003
Subject: [Bioperl-l] Resume from  CAD  Specialist / Operator / Drafter
Message-ID: <200310271257.h9RCvrdb011311@portal.open-bio.org>


RIC  SIE

Tel   (408)    482-2840    rzbig@yahoo.com


OBJECTIVE:		STRUCTURAL &  MECHANICAL  DESIGNER

CIVIL, ARCHITECTURAL, TRANSPORTATION,  CAD Operator 

EXPERIENCE:

89 - present
DESIGNER, ENGINEER,  CAD MANAGER; 

Engineering & Design Service, Project Management & Development.
Preparing technical documentation, calculations, layouts drawings &
propositions. CAD Management and Operations, drafting & redesigning.
Intergraph, MicroStation,  Autodesk, ACAD, Win, Net, Softdesk Mgmt
Civil, Bridges and Structural Design, Plans, Mapping, Detail Freeway
& Roadway, data translation & inserting. Script & CAD automation.
Geological Structures, Viaducts, Freeways, Highways, Shopping Center. 
Architectural and Environmental Projects and cooperation; military
facilities and plans, Cities, Airports, remediation drawings upgrade,
correcting and redesign. Traffic design & problem analyzes-reorganize.
Freeway Design & Drafting Support, Site analyzing for Caltrans,
Architectural, Archeotype & Electrical drawings, "as is" and initial design;          
Develop remediation procedure and equipment for lead painted buildings.
Construction management, Job site inspection, civil & structural support
Mechanical Evaluations - Design - Service and Maintenance; R&D.

EDUCATION:


 College -  BS  Degree - CAD, Engineering

DOS, UNIX, MAC, SUN computers;  WP, dBase, Lotus, Network, Lisp,
Windows & Applications:  Excel, Words, Access, Power Point & more
METRIC,  SOLAR,  AutoCAD/Computer Instructor. Transportation Spec.
Personal Designer, MS Project, C, Script, File Management, File transfer. 
Learn quickly, work independly, shift, overtime.


From brian_osborne at cognia.com  Mon Oct 27 09:56:13 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Mon Oct 27 09:55:13 2003
Subject: [Bioperl-l] RE: :DB::Registry
In-Reply-To: <1067260843.2296.37.camel@localhost>
Message-ID: <GAEDKMGOKFBLJPKCLKCCKEINDDAA.brian_osborne@cognia.com>

Heikki,

Yes, with your modified Registry.t I see a problem. With my version,
everything was OK. The problem is that I can only work on this after-hours.
If you're trying to stay on schedule you have some choices: we can remove
Registry.t, and the code stays broken. We can solicit help from bioperl-l,
whoever's interested, or you can wait while I take a closer look.

Here's what I've done. No index method was being passed to Flat, that won't
work since it's required. Also, config.dat wasn't being read. That wouldn't
work since these databases are created before Registry is called, and the
only way Flat knows whether something is bdb or binarysearch (flat) is by
reading config.dat, then it knows which child class to call. Also, Registry
was reading only the first file in OBDA_SEARCH_PATH, that was also not
desirable. I think I see where the problem is now. If you take a look at the
$db passed to Failover using your test you'll see that there's a file path
missing:

db: Bio::DB::Flat::BDB::fasta=HASH(0x105b7d28)
key: flat_directory t/tmp
key: flat_flat_file_no HASH(0x105b7e6c)
key: flat_format fasta
key: flat_primary_namespace ACC
key: flat_indexing BerkeleyDB/1
key: flat_flat_file_path HASH(0x105b7e48)
Use of uninitialized value in print at
/usr/lib/perl5/site_perl/5.8.0/Bio/DB/Failover.pm line 113.
key: flat_verbose
key: flat_dbname testbdb
key: flat_secondary_namespaces ARRAY(0x105b7d7c)
key: flat_alphabet protein
key: bdb_maxopen 32

So the proper child class is involved, Flat::BDB, the correct flat_indexing
is there, the name is right. So that's what I'll be looking at next. I could
be wrong, of course, this is just my suspicion. You decide whether you want
someone who can work faster than me!

Brian O.


-----Original Message-----
From: Heikki Lehvaslaiho [mailto:heikki@ebi.ac.uk]
Sent: Monday, October 27, 2003 8:21 AM
To: Brian Osborne
Subject: RE: :DB::Registry

Brian,

I hope my mail yesterday made sense. I had to send it off in a hurry.

I think there is a fundamental problem in processing multiple registry
files. I know that you have take on this area, but I if you get stuck, I
recommend contacting Lincoln directly. I would. Now.

        -Heikki

On Sun, 2003-10-26 at 03:26, Brian Osborne wrote:
> Heikki,
>
> I have a guesss. Since you're running 11 tests that means you have DB_File
> installed. Here are the 9th and 10th tests:
>
> $db = $registry->get_database('testbdb');
>  ok defined($db);
>  $seq = $db->get_Seq_by_id("ALEU_HORVU");
>  ok defined($seq);
>
> So, apparently the flat database testdbd is getting made. One way to
confirm
> this is to comment out the END block in Registry.t and run the test. You
> should see t/tmp/testbdb, t/tmp/testbdb/config.dat, and the index file
> t/tmp/testbdb/key_ACC. Now, if you see those files the database has been
> made, so the question is why can't you retrieve the sequence?
>
> I've noticed this note in DB/Flat.pm:
>
> -index is one of "bdb" or "binarysearch" and indicates the type of
> index to generate.  "bdb" corresponds to Berkeley DB.  You *must* be
> using BerkeleyDB version 2 or higher, and have the Perl BerkeleyDB
> extension installed (DB_File will *not* work).
>
> Do you have this recent version of BerkeleyDB? I'll add this check to
> Registry.t.
>
>
> Brian O.
>
> -----Original Message-----
> From: Heikki Lehvaslaiho [mailto:heikki@nildram.co.uk]
> Sent: Saturday, October 25, 2003 10:41 AM
> To: Brian Osborne
> Subject: Bio::DB::Registry
>
> Brian,
>
> Your recent changes broke Registry.t for me. Can you give a hint what
> might be going on?
>
>         -Heikki
>
> localhost ~/src/bioperl/core> perl -w t/Registry.t
> 1..11
> ok 1
> ok 2
>
> -------------------- WARNING ---------------------
> MSG: Couldn't call new_from_registry on [Bio::DB::Flat]
>
> ------------- EXCEPTION  -------------
> MSG: First line not compatible with flat file index.  Should be
> something like
>
> index   flat/1
> STACK Bio::DB::Flat::BinarySearch::read_config_file
> /home/heikki/src/bioperl/core/Bio/DB/Flat/BinarySearch.pm:1192
> STACK Bio::DB::Flat::BinarySearch::new
> /home/heikki/src/bioperl/core/Bio/DB/Flat/BinarySearch.pm:270
> STACK Bio::DB::Flat::new
> /home/heikki/src/bioperl/core/Bio/DB/Flat.pm:178
> STACK Bio::DB::Flat::new_from_registry
> /home/heikki/src/bioperl/core/Bio/DB/Flat.pm:254
> STACK (eval) /home/heikki/src/bioperl/core/Bio/DB/Registry.pm:182
> STACK Bio::DB::Registry::_load_registry
> /home/heikki/src/bioperl/core/Bio/DB/Registry.pm:181
> STACK Bio::DB::Registry::new
> /home/heikki/src/bioperl/core/Bio/DB/Registry.pm:97
> STACK toplevel t/Registry.t:56
>
> --------------------------------------
>
> ---------------------------------------------------
> ok 3
> ok 4
> ok 5
> ok 6
> ok 7
> ok 8
> ok 9
> not ok 10
> # Failed test 10 in t/Registry.t at line 75
> Use of uninitialized value in scalar assignment at t/Registry.t line 85.
--
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________


From heikki at ebi.ac.uk  Mon Oct 27 11:24:52 2003
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Mon Oct 27 11:29:33 2003
Subject: [Bioperl-l] Re: Question on the scf file format
In-Reply-To: <Pine.OSF.4.50.0310271536430.1688764-100000@cbi1c.internal.sanger.ac.uk>
References: <001001c39c9a$fbb2c6c0$0201a8c0@distance>
	<Pine.OSF.4.50.0310271536430.1688764-100000@cbi1c.internal.sanger.ac.uk>
Message-ID: <1067271892.2301.52.camel@localhost>

Guillaume,

In other words: Tony's fixes are only in the CVS head and in developer
releases in our website. They never made it out into the 1.2 release
series. It was an oversight  which I apologise profusely.

Please install the 1.3.02 from 
http://bioperl.org/DIST/current_core_unstable.tar.gz
and try again.

Yours,
	-Heikki


On Mon, 2003-10-27 at 15:38, Tony Cox wrote:
> On Mon, 27 Oct 2003, Guillaume Giraudon wrote:
> 
> Hi Guillaume,
> 
> This looks like this may be the "cast" you have to apply if you are using
> 8bit/16bit data values. Check the code I added to the Bioperl SCF.pm module to
> make this work properly for all SCF files.
> 
> Tony
> 
> 
> 
> +>Hi Jason, Hi Tony, Hi Heikki
> +>
> +>I'm sorry to bother you with this simple question but I saw one of your source files (scf.pm) on the web and though you might be able to help me out on this matter : I am trying to write a web based scf file viewer (in php). I came across a lot of documents that seem to all be based on the RFC I found at
> +>http://www.mrc-lmb.cam.ac.uk/pubseq/scf-rfc.html
> +>
> +>I have attached a zip file of the files I'm working with so that you might take a look at them. I'm comparing the results I get from my program with what Chromas (v1.45) gives me. So far, I believe I'm parsing the header correctly. What I get makes sense :
> +>
> +>scf_header Object
> +>(
> +>    [magic_number] => 779314022
> +>    [samples] => 10934
> +>    [samples_offset] => 128
> +>    [bases] => 899
> +>    [bases_left_clip] => 0
> +>    [bases_right_clip] => 0
> +>    [bases_offset] => 87600
> +>    [comments_size] => 364
> +>    [comments_offset] => 98388
> +>    [version] => 3.00
> +>    [sample_size] => 2
> +>    [code_set] => 0
> +>    [private_size] => 0
> +>    [private_offset] => 0
> +>    [spare] =>
> +>)
> +>
> +>Now when I start parsing the Samples section, I get confused. From what I can gather, Its composed of delta differences between each sample (and not the values themselves as I originally thought).
> +>
> +>Strangely, I believe I'm calculating my offsets fine because the very first vales of all A,C,G and T match what I have in the raw_data.txt file (exported with chromas). But I cant seem to read the rest of the samples correctly.
> +>
> +>Here is a little HEX extract from the scf file :
> +>
> +>00000000h: 2E 73 63 66 00 00 2A B6 00 00 00 80 00 00 03 83 ; .scf..*?...?...?
> +>00000010h: 00 00 00 00 00 00 00 00 00 01 56 30 00 00 01 6C ; ..........V0...l
> +>00000020h: 00 01 80 54 33 2E 30 30 00 00 00 02 00 00 00 00 ; ..?T3.00........
> +>00000030h: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ; ................
> +>00000040h: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ; ................
> +>00000050h: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ; ................
> +>00000060h: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ; ................
> +>00000070h: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ; ................
> +>00000080h: 00 A0 FF 60 00 00 00 10 00 03 FF FE FF FD FF FB ; . ?`......??????
> +>00000090h: FF FE FF FA FF FB FF FD FF FB FF FF FF FF 00 00 ; ??????????????..
> +>000000a0h: FF FF 00 01 00 00 FF FD 00 01 FF FD 00 03 00 00 ; ??....??..??....
> +>000000b0h: 00 04 00 01 00 03 00 01 00 03 00 00 00 02 FF FF ; ..............??
> +>000000c0h: 00 02 00 01 00 00 00 00 00 00 00 00 00 00 00 00 ; ................
> +>
> +>The first 128 bytes are the header and I seem to read the part fine. My samples_offset is 128 (80h) and my sample_size is 2. At that location (80h), the first value is '00 A0' witch is 160 in decimal. That?s exactly my first value so that look about right. But then I have FF 60. Is this supposed to be a delta ?
> +>I though it could be but then, if its an unsigned value, it?s a bit huge.
> +>I tried to consider it as a signed value but then again, I cant seem to get the same thing as Chromas.
> +>
> +>Here is a short extract of what I get :
> +>
> +>A	C	G	T
> +>160	0	72	6
> +>-160	0	-84	-8
> +>0	0	-5	-1
> +>16	0	2	2
> +>3	0	3	1
> +>-2	0	2	0
> +>-3	0	5	0
> +>-5	0	4	0
> +>-2	0	1	0
> +>-6	0	0	0
> +>-5	0	0	0
> +>-3	0	0	0
> +>-5	0	0	0
> +>-1	0	0	0
> +>
> +>Any idea of what I might be doing wrong ?
> +>
> +>Thank you in advance,
> +>
> +>G.Giraudon
> +>
> +>
> +>
> 
> ******************************************************
> Tony Cox			Email:avc@sanger.ac.uk
> Sanger Institute		WWW:www.sanger.ac.uk
> Wellcome Trust Genome Campus	Head,Software Services
> Hinxton				Tel: +44 1223 834244
> Cambs. CB10 1SA			Fax: +44 1223 494919
> ******************************************************
-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________

From Tony.LeGall at bichat.inserm.fr  Mon Oct 27 15:48:07 2003
From: Tony.LeGall at bichat.inserm.fr (LE GALL Tony)
Date: Mon Oct 27 18:47:12 2003
Subject: [Bioperl-l] running T_coffee with bioperl on a windows system
Message-ID: <000801c39ccb$a01e99c0$129f39c2@INSERMY7LJC34Q>

Hello,
Can I run T_coffee with bioperl on a windows system. If it is possible, how can I do ? 
Thanks,
LE GALL
From Luc.Gauthier at aventis.com  Tue Oct 28 05:13:54 2003
From: Luc.Gauthier at aventis.com (Luc.Gauthier@aventis.com)
Date: Tue Oct 28 05:10:55 2003
Subject: [Bioperl-l] RangeI.pm warnings
Message-ID: <6FA8B454A1DF1E4A97F0A48DCC324EAB819CB3@crbsmxsusr05.pharma.aventis.com>

Hi everyone,

I use the SearchIO mechanism to parse BLASTN 2.1.3 reports.

I yesterday downloaded an upgrade to the bioperl-live library I currently use
for my developments.
The problem is that since then, I get the following warning messages from
RangeI.pm :

-----
Argument "start" isn't numeric in i_negate at
/home/local/lgauthie/dev/perl5lib/Bio/RangeI.pm line 505.
Argument "end" isn't numeric in i_negate at
/home/local/lgauthie/dev/perl5lib/Bio/RangeI.pm line 506.
Argument "strand" isn't numeric in i_negate at
/home/local/lgauthie/dev/perl5lib/Bio/RangeI.pm line 507.
-----

Here is how it happens...

First, I get a raw blastall report from a wrapper tool I wrote :
     my $result = $self->_run_ReadWrite($toolpath, $toolparams,
$params{query});
     my $raw_report = join "\n", @{$results->{OUTPUT}};

Then I instantiate an IO::String (v1.03) object from the obtained string :
     my $pseudoFH = IO::String->new( $raw_report );

Finally, I want to obtain a Bio::SearchIO report from that :
     my $searchIO = Bio::SearchIO->new( -format => 'blast' , -fh => $pseudoFH
);


And this is that command that brings the warnings messages from RangeI.pm.

One interesting thing is that when I parse blast reports in a loop, only the
first Bio::SearchIO
report creation will bring the warning messages...

Has anyone else seen that problem ?

Thank you for reading, and have a good day !
Luc Gauthier


From Richard.Adams at ed.ac.uk  Tue Oct 28 06:57:06 2003
From: Richard.Adams at ed.ac.uk (Richard Adams)
Date: Tue Oct 28 06:54:05 2003
Subject: [Bioperl-l] Bio::SimpleAlign methods
Message-ID: <3F9E5992.1539CAD5@ed.ac.uk>

Hello,
    I'm writing a wrapper for the SIFT program which predicts functional
effects of SNPs using protein sequence
alignments and have written 3 methods for formatting alignments
necessary for input into this program and maybe useful for others as
well.

    1. $aln->get_ungapped_slices($seq_id);
        returns an array of alignment slices corresponding to ungapped
sequence blocks of the specified  sequence.

    2. $aln->merge_slices();
        concatenates alignment slices into 1 big alignment.

    3. $aln->set_pos($seq_id, pos);
        changes the position of a sequence in the alignment

    If these are of any potential use should I CVS them into
Bio::SimpleAlign (or given they're rather esoteric) into
Bio::Align::Utilities?

Richard

--
Dr Richard Adams
Psychiatric Genetics Group,
Medical Genetics,
Molecular Medicine Centre,
Western General Hospital,
Crewe Rd West,
Edinburgh UK
EH4 2XU

Tel: 44 131 651 1084
richard.adams@ed.ac.uk


From brian_osborne at cognia.com  Tue Oct 28 10:19:55 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Tue Oct 28 10:19:00 2003
Subject: [Bioperl-l] running T_coffee with bioperl on a windows system
In-Reply-To: <000801c39ccb$a01e99c0$129f39c2@INSERMY7LJC34Q>
Message-ID: <GAEDKMGOKFBLJPKCLKCCCEJMDDAA.brian_osborne@cognia.com>

Tony,

Certainly. There are a few different ways of installing Bioperl on Windows.
Take a look at the INSTALL.WIN file.

Brian O.


-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of LE GALL Tony
Sent: Monday, October 27, 2003 3:48 PM
To: bioperl-l@bioperl.org
Subject: [Bioperl-l] running T_coffee with bioperl on a windows system

Hello,
Can I run T_coffee with bioperl on a windows system. If it is possible, how
can I do ?
Thanks,
LE GALL


From jason at cgt.duhs.duke.edu  Tue Oct 28 15:43:19 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Oct 28 15:41:09 2003
Subject: [Bioperl-l] bug #1546 and BPlite/BPpsilite
In-Reply-To: <3F9EBF99.3010306@chem.ucsd.edu>
References: <3F9EBF99.3010306@chem.ucsd.edu>
Message-ID: <Pine.LNX.4.50.0310281534470.603-100000@tenero.duhs.duke.edu>

WRT buggy code - Do you mean the parser or the running of BLAST from
withing Bioperl?

If in regard to StandAloneBlast, I totally agree.  I frequently ask for
someone to propose a new rewrite of this module which is more robust,
scalable to WU-BLAST, but so far have had no takers.  I don't personally
run BLAST that way so I've not had an itching desire to rewrite it.

As for buggy parsing, you're using BPpsilite which has been superceeded by
new code in Bio::SearchIO::blast on the main trunk (but not necessarily in
1.2.3).  I do not have any plans to personally maintain any of the
BPlite/BPpsilite/BPbl2seq modules as all of my own development effort is
focused on Bio::SearchIO.  Others are welcome to maintain these - we'll
get you an account to do this if someone volunteers.


As for the problems you are specifically reporting, hopefully someone will
have a chance to look at it soon.

-jason

On Tue, 28 Oct 2003, Michael Prentiss wrote:

> Jason,
> I did not think about the attachment.  I included the inputs, stderr,
> the code, and an example of the problem.
> I found this part of BioPerl really buggy, and the errors were transient
> !?!?.   Is this some sort of memory problem
> because perl is not careful enough with memory allocation?
> Thanks for you help, I am rather confused.
> Michael Prentiss
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From hawkins at scharp.org  Tue Oct 28 21:34:56 2003
From: hawkins at scharp.org (Natalie Hawkins)
Date: Tue Oct 28 21:31:54 2003
Subject: [Bioperl-l] bioperl-db schema
Message-ID: <3F9F2750.4010502@scharp.org>


Hello,

I was looking for a diagram of the bioperl-db schema.

I found a file in the /bioperl-db/model CVS dir called:

bioperl-maps.dia

I am not sure what to do w/that.  How do I use it?

thanks,
Natalie

From fangl at genomics.org.cn  Wed Oct 29 00:53:50 2003
From: fangl at genomics.org.cn (Magic Fang)
Date: Wed Oct 29 00:51:03 2003
Subject: [Bioperl-l] how to align the description to glyph
Message-ID: <3F9F55EE.6010805@genomics.org.cn>

Hi,
sometime, the description of a glyph will out of range of the panel, so 
if i can set the alignment to right, i will avoid this problem, how can i?

Magic

From Marc.Logghe at devgen.com  Wed Oct 29 03:12:36 2003
From: Marc.Logghe at devgen.com (Marc Logghe)
Date: Wed Oct 29 03:10:03 2003
Subject: [Bioperl-l] bioperl-db schema
Message-ID: <BEE28BF86078B6429D6C780635718E212E7439@morelia.be.devgen.com>


> -----Original Message-----
> From: Natalie Hawkins [mailto:hawkins@scharp.org]
> Sent: Wednesday, October 29, 2003 3:35 AM
> To: bioperl-l@bioperl.org
> Subject: [Bioperl-l] bioperl-db schema
> 
> 
> 
> 
> Hello,
> 
> I was looking for a diagram of the bioperl-db schema.
> 
> I found a file in the /bioperl-db/model CVS dir called:
> 
> bioperl-maps.dia
> 
> I am not sure what to do w/that.  How do I use it?
You have to install dia (http://www.lysator.liu.se/~alla/dia/) in order to load the file.
There is also a pdf version available at the biodas cvs site
http://cvs.biodas.org/cgi-bin/viewcvs/viewcvs.cgi/*checkout*/biosql-schema/doc/biosql-ERD.pdf?rev=1.5&cvsroot=biosql
HTH,
Marc 

From amackey at pcbi.upenn.edu  Wed Oct 29 07:15:48 2003
From: amackey at pcbi.upenn.edu (Aaron J Mackey)
Date: Wed Oct 29 07:12:42 2003
Subject: [Bioperl-l] Re: [Bioperl-guts-l] bioperl commit
In-Reply-To: <200310291030.h9TAUiJu017418@pub.open-bio.org>
Message-ID: <Pine.LNX.4.44.0310290712180.31785-100000@snowball.pcbi.upenn.edu>


Note that DSN's can also have "db=foo", "dbname=foo", "port=1234", etc.  
etc.  Additionally, one can pass DBI parameters (RaiseError, AutoCommit,
etc) via the DSN.  To make your day even worse, these parameters can
legitimately come in any order.

Why do you have to mangle the DSN?

On Wed, 29 Oct 2003, Chuah Aaron wrote:

> 
> baaron
> Wed Oct 29 05:30:44 EST 2003
> Update of /home/repository/bioperl/bioperl-live/scripts/Bio-DB-GFF
> In directory pub.open-bio.org:/tmp/cvs-serv17394
> 
> Modified Files:
> 	bulk_load_gff.PLS 
> Log Message:
> added support for dsn strings in the form of "dbi:mysql:database=xxx;host=xxx"
> 
> bioperl-live/scripts/Bio-DB-GFF bulk_load_gff.PLS,1.13,1.14
> ===================================================================
> RCS file: /home/repository/bioperl/bioperl-live/scripts/Bio-DB-GFF/bulk_load_gff.PLS,v
> retrieving revision 1.13
> retrieving revision 1.14
> diff -u -r1.13 -r1.14
> --- /home/repository/bioperl/bioperl-live/scripts/Bio-DB-GFF/bulk_load_gff.PLS	2003/10/28 06:32:40	1.13
> +++ /home/repository/bioperl/bioperl-live/scripts/Bio-DB-GFF/bulk_load_gff.PLS	2003/10/29 10:30:44	1.14
> @@ -155,14 +155,17 @@
>    $AUTH .= " -p$PASSWORD";
>  }
>  
> +$DSN=~s/database=//i;
> +$DSN=~s/;host=/:/i; #cater for dsn in the form of "dbi:mysql:database=$dbname;host=$host"
> +
>  my($DBI,$DBD,$DBNAME,$HOST)=split /:/,$DSN;
>  if (defined $HOST) {
>    $AUTH .= " -h$HOST";  
>  }
> +$DBNAME=$DSN unless $DSN=~/:/;
>  if (defined $DBNAME) {
>    $AUTH .= " -D$DBNAME";
>  }
> -
>  if (defined $LOCAL) {
>    $LOCAL='local';
>    $AUTH.=' --local-infile=1';
> @@ -170,6 +173,7 @@
>    $LOCAL='';
>  }
>  
> +
>  my $db = Bio::DB::GFF->new(-adaptor=>'faux',-dsn => $DSN,@auth)
>    or die "Can't open database: ",Bio::DB::GFF->error,"\n";
>  
> 
> _______________________________________________
> Bioperl-guts-l mailing list
> Bioperl-guts-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-guts-l
> 


From Alexandre.Irrthum at icr.ac.uk  Wed Oct 29 08:02:11 2003
From: Alexandre.Irrthum at icr.ac.uk (Alexandre Irrthum)
Date: Wed Oct 29 07:59:35 2003
Subject: [Bioperl-l] rpsblast from bioperl
Message-ID: <sf9fba6b.053@icr.ac.uk>

Hi everybody,

I have to call the NCBI rpsblast program from within a Perl script. I
wonder if there is a "bioperl" way to do that, similar to what the
StandAloneBlast object can do for blastall or blastpgp. I also wonder if
the Blast or BPlite parser objects support parsing of rpsblast reports.

Thanks a lot for your help,

alex


P.S. please cc your answer to my email address if possible 
From jason at cgt.duhs.duke.edu  Wed Oct 29 08:08:34 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Oct 29 08:06:07 2003
Subject: [Bioperl-l] bioperl-db schema
In-Reply-To: <3F9F2750.4010502@scharp.org>
References: <3F9F2750.4010502@scharp.org>
Message-ID: <Pine.LNX.4.50.0310290804190.6328-100000@tenero.duhs.duke.edu>

I think the dia diagram is for an OLD marker/map db schema I was writing.

If you want an ER diagram of the biosql schema check-out the docs
directory in biosql/schema

cvs -d:pserver:cvs@cvs.open-bio.org:/home/repository/biosql co schema

look in docs/biosql-ERD.pdf is the diagram.

Or just click here:
http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/*checkout*/biosql-schema/doc/biosql-ERD.pdf?rev=1.5&cvsroot=biosql&content-type=application/pdf

-jason
On Tue, 28 Oct 2003, Natalie Hawkins wrote:

>
>
> Hello,
>
> I was looking for a diagram of the bioperl-db schema.
>
> I found a file in the /bioperl-db/model CVS dir called:
>
> bioperl-maps.dia
>
> I am not sure what to do w/that.  How do I use it?
>
> thanks,
> Natalie
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From jason at cgt.duhs.duke.edu  Wed Oct 29 08:09:43 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Oct 29 08:06:34 2003
Subject: [Bioperl-l] rpsblast from bioperl
In-Reply-To: <sf9fba6b.053@icr.ac.uk>
References: <sf9fba6b.053@icr.ac.uk>
Message-ID: <Pine.LNX.4.50.0310290808450.6328-100000@tenero.duhs.duke.edu>

Yes we support parsing it (use Bio::SearchIO format 'blast').

No we don't have a StandAloneBlast way of doing it, but you're welcome to
add that one method and we'll add it back into the toolkit.

-jason
On Wed, 29 Oct 2003, Alexandre Irrthum wrote:

> Hi everybody,
>
> I have to call the NCBI rpsblast program from within a Perl script. I
> wonder if there is a "bioperl" way to do that, similar to what the
> StandAloneBlast object can do for blastall or blastpgp. I also wonder if
> the Blast or BPlite parser objects support parsing of rpsblast reports.
>
> Thanks a lot for your help,
>
> alex
>
>
> P.S. please cc your answer to my email address if possible
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From skirov at utk.edu  Wed Oct 29 11:54:55 2003
From: skirov at utk.edu (Stefan Kirov)
Date: Wed Oct 29 11:51:58 2003
Subject: [Bioperl-l] LargeSeq performance
Message-ID: <3F9FF0DF.6020302@utk.edu>

I have a problem with the performance of  LargeSeq. I am working with 
whole chromosomes (mouse, human) and next_seq takes forever.
I do not know if it is worth, since any portion can be read with random 
access, but I am still curious to know id pepople think it might be a 
good idea to create an object, that hadles extremely large sequences- 
whole chromosomes for example without impact on the performance?
If you think it's worth I can try to do it. What I have in mind is use 
grep to map the record separators ">" (in case you are mad enogh to put 
more than one chromosome in a single file). Thus next_seq will know 
where to look for the next sequence and, parse the id line and calc the 
length. And I doubt anyone will use this under Windows (anyway, OS can 
be checked to avoid problems). Also the object will use random 
accessinstead of  Bio::Root::IO to get sequence data.
Let me know what you think...
Stefan Kirov

From jason at cgt.duhs.duke.edu  Wed Oct 29 12:32:30 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Oct 29 12:29:29 2003
Subject: [Bioperl-l] LargeSeq performance
In-Reply-To: <3F9FF0DF.6020302@utk.edu>
References: <3F9FF0DF.6020302@utk.edu>
Message-ID: <Pine.LNX.4.50.0310291228350.6786-100000@tenero.duhs.duke.edu>

Yeah I think it is pretty much not a good solution in the end.
My strategy would be separate annotation from the actual sequence.
Put the sequence in fasta files and use Lincoln's Bio::DB::Fasta which
allows random access and subsequence retrieval quite nicely for large
files or lots of sequences.  Perhaps we should just think about abandoning
LargeSeq for the Indexed approach that Lincoln uses.

In addition to parse large chromosome files with annotations would need to
fix-up/check that SeqIO::genbank/embl can still parse a file which is a
CONTIG file or a genbank record which doesn't have any sequence (all you
want is the annotations/features anyways).  This might need a little more
tweaking to make sure it works.  I feel like the SeqIO parsing right now
is pretty fragile to certain types of changes so I can't say that it would
right now out of the box.

That's the way I'd go, personally but may be a more proper engineering
solution.

-jason

On Wed, 29 Oct 2003, Stefan Kirov wrote:

> I have a problem with the performance of  LargeSeq. I am working with
> whole chromosomes (mouse, human) and next_seq takes forever.
> I do not know if it is worth, since any portion can be read with random
> access, but I am still curious to know id pepople think it might be a
> good idea to create an object, that hadles extremely large sequences-
> whole chromosomes for example without impact on the performance?
> If you think it's worth I can try to do it. What I have in mind is use
> grep to map the record separators ">" (in case you are mad enogh to put
> more than one chromosome in a single file). Thus next_seq will know
> where to look for the next sequence and, parse the id line and calc the
> length. And I doubt anyone will use this under Windows (anyway, OS can
> be checked to avoid problems). Also the object will use random
> accessinstead of  Bio::Root::IO to get sequence data.
> Let me know what you think...
> Stefan Kirov
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From John.Gray at stjude.org  Wed Oct 29 11:39:09 2003
From: John.Gray at stjude.org (Gray, John)
Date: Wed Oct 29 18:39:58 2003
Subject: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
Message-ID: <1E0CC447E59C974CA5C7160D2A2854ECC51A71@SJMEMXMB04.stjude.sjcrh.local>

I am trying to find documentation for the Bio::Restriction module, to see if it can properly predict restriction fragment sizes for circular plasmids.

 
I'm sorry to have to ask.  The tutorial at http://www.bioperl.org/Core/Latest/bptutorial.html says that Bio::Restriction replaces the older Bio::RestrictionEnzyme, which I have used.  I can't find any documentation for this method at http://doc.bioperl.org/releases/bioperl-1.2.3/.  

 
The Seq documentation does not provide any description of the 'is_circular' method, and I was hoping that I could set the Seq object to be circular, and then just run something like $enyme_object->cut_seq($seq_object) to get my list of strings containing the resulting fragments.

 
Thank you for your help.

 
PS.  When I try and search the Bioperl-l archive, it doesn't return any matches, even when I try to match the word 'bioperl'.  Should I tell someone about this?

 
-------------------------------------------------------------------------------------
John T. Gray, Ph.D.
Director, Vector Development & Production
Experimental Hematology Division
Hematology-Oncology
St. Jude Children's Research Hospital
D2038A, Mail Stop 341
332 N. Lauderdale Street
Memphis, TN 38105 

(901) 495-4729 phone
(901) 495-2176 fax

John.Gray@stjude.org <mailto:John.Gray@stjude.org> 

 
From murmanmadi at yahoo.com  Wed Oct 29 12:03:53 2003
From: murmanmadi at yahoo.com (M V)
Date: Wed Oct 29 18:40:22 2003
Subject: [Bioperl-l] perl script to retrieve accession numbers
Message-ID: <20031029170353.14118.qmail@web14612.mail.yahoo.com>

Hi there,
 
Is there a perl script that retrieves the genbank accession numbers if I have a list of gene names? Thanks in advance.
 
Murmanmadi


---------------------------------
Do you Yahoo!?
Exclusive Video Premiere - Britney Spears
From redwards at utmem.edu  Wed Oct 29 19:31:04 2003
From: redwards at utmem.edu (Rob Edwards)
Date: Wed Oct 29 19:27:56 2003
Subject: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
In-Reply-To: <1E0CC447E59C974CA5C7160D2A2854ECC51A71@SJMEMXMB04.stjude.sjcrh.local>
Message-ID: <5850FFBC-0A70-11D8-A107-000A959E1622@utmem.edu>

The new Bio::Restriction modules should tell you the correct sizes of  
circular plasmids. This will do exactly what you want. The old  
Bio::Tools::RestrictionEnzyme does not have this ability, though you  
can just merge the first and last fragments of a sequence.

The newer Bio::Restriction are (I believe) only available in BioPerl  
1.3 and later. This is the upcoming release and the first RC seems  
quite stable.

You should probably tell someone about the searches not working, but I  
guess you just have :)

Rob


On Wednesday, October 29, 2003, at 10:39  AM, Gray, John wrote:

> I am trying to find documentation for the Bio::Restriction module, to  
> see if it can properly predict restriction fragment sizes for circular  
> plasmids.
>
>
>
> I'm sorry to have to ask.  The tutorial at  
> http://www.bioperl.org/Core/Latest/bptutorial.html says that  
> Bio::Restriction replaces the older Bio::RestrictionEnzyme, which I  
> have used.  I can't find any documentation for this method at  
> http://doc.bioperl.org/releases/bioperl-1.2.3/.
>
>
>
> The Seq documentation does not provide any description of the  
> 'is_circular' method, and I was hoping that I could set the Seq object  
> to be circular, and then just run something like  
> $enyme_object->cut_seq($seq_object) to get my list of strings  
> containing the resulting fragments.
>
>
>
> Thank you for your help.
>
>
>
> PS.  When I try and search the Bioperl-l archive, it doesn't return  
> any matches, even when I try to match the word 'bioperl'.  Should I  
> tell someone about this?
>
>
>
> ----------------------------------------------------------------------- 
> --------------
> John T. Gray, Ph.D.
> Director, Vector Development & Production
> Experimental Hematology Division
> Hematology-Oncology
> St. Jude Children's Research Hospital
> D2038A, Mail Stop 341
> 332 N. Lauderdale Street
> Memphis, TN 38105
>
> (901) 495-4729 phone
> (901) 495-2176 fax
>
> John.Gray@stjude.org <mailto:John.Gray@stjude.org>
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From jason at cgt.duhs.duke.edu  Wed Oct 29 21:21:06 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Oct 29 21:17:44 2003
Subject: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
In-Reply-To: <1E0CC447E59C974CA5C7160D2A2854ECC51A71@SJMEMXMB04.stjude.sjcrh.local>
References: <1E0CC447E59C974CA5C7160D2A2854ECC51A71@SJMEMXMB04.stjude.sjcrh.local>
Message-ID: <Pine.LNX.4.50.0310292114060.17851-100000@tenero.duhs.duke.edu>

On Wed, 29 Oct 2003, Gray, John wrote:

> I am trying to find documentation for the Bio::Restriction module, to
> see if it can properly predict restriction fragment sizes for circular
> plasmids.
>
See http://doc.bioperl.org/bioperl-live for docs for code on the latest
CVS not the stable 1.2.x release series.

>
>
> I'm sorry to have to ask.  The tutorial at
> http://www.bioperl.org/Core/Latest/bptutorial.html says that
> Bio::Restriction replaces the older Bio::RestrictionEnzyme, which I have
> used.  I can't find any documentation for this method at
> http://doc.bioperl.org/releases/bioperl-1.2.3/.
>

As Rob has said - it's new - not sure why the tutorial has been updated to
preceede the not-yet released code though.

>
>
> The Seq documentation does not provide any description of the
> 'is_circular' method, and I was hoping that I could set the Seq object
> to be circular, and then just run something like
> $enyme_object->cut_seq($seq_object) to get my list of strings containing
> the resulting fragments.
>

Bio::Seq is a Bio::PrimarySeq
http://doc.bioperl.org/releases/bioperl-1.2.3/Bio/PrimarySeqI.html#POD14

>
>
> Thank you for your help.
>
>
>
> PS.  When I try and search the Bioperl-l archive, it doesn't return any
> matches, even when I try to match the word 'bioperl'.  Should I tell
> someone about this?

http://search.open-bio.org/   ?  Seems to work just fine for me.  Putting
in 'Restriction' finds lots of hits and restricting that with 'Enzyme'
finds more hits too.

-jason
>
>
>
> -------------------------------------------------------------------------------------
> John T. Gray, Ph.D.
> Director, Vector Development & Production
> Experimental Hematology Division
> Hematology-Oncology
> St. Jude Children's Research Hospital
> D2038A, Mail Stop 341
> 332 N. Lauderdale Street
> Memphis, TN 38105
>
> (901) 495-4729 phone
> (901) 495-2176 fax
>
> John.Gray@stjude.org <mailto:John.Gray@stjude.org>
>
>
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From redwards at utmem.edu  Wed Oct 29 22:21:38 2003
From: redwards at utmem.edu (Rob Edwards)
Date: Wed Oct 29 22:18:29 2003
Subject: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
In-Reply-To: <Pine.LNX.4.50.0310292114060.17851-100000@tenero.duhs.duke.edu>
Message-ID: <2C8DDD99-0A88-11D8-A107-000A959E1622@utmem.edu>


>
> As Rob has said - it's new - not sure why the tutorial has been 
> updated to
> preceede the not-yet released code though.

Because it's so cool?


> http://search.open-bio.org/   ?  Seems to work just fine for me.  
> Putting
> in 'Restriction' finds lots of hits and restricting that with 'Enzyme'
>

Searching the web archive at http://bioperl.org/pipermail/bioperl-l/ 
doesn't work. The main difference seems to be these two lines:

http://search.open-bio.org/ : (works)
<input type="hidden" name="config" value="htdig">
<input type="hidden" name="restrict" value="bioperl.org/">

and

http://bioperl.org/pipermail/bioperl-l : (doesn't work)
<input type="hidden" name="config" value="htdig-mailman/bioperl-l">
<input type="hidden" name="restrict" value="">


(procrastination means always having something to do tomorrow).

Rob

From Richard.Adams at ed.ac.uk  Thu Oct 30 04:16:39 2003
From: Richard.Adams at ed.ac.uk (Richard Adams)
Date: Thu Oct 30 04:13:32 2003
Subject: [Bioperl-l] RemoteBlast
Message-ID: <3FA0D6F7.906983D4@ed.ac.uk>

I'd be willing to volunteer for maintaining RemoteBlast if there are no
other takers, I've used it quite a bit so I guess should try and put
something back .

Just from first thoughts I'd plan to

- incorporate Megablast / Blast2seqs /rpsblast functionality
-separate out the form submission bits of code from the Blast specific
code in the module, perhaps by inheriting
    from Bio::WebAgent.

Other than making sure that the output is parseable by Bio::SearchIO,
are there any strong feelings for  how new features should be
implemented?

If anyone has requests/ideas fornew functionality please say so!

Richard


--
Dr Richard Adams

Psychiatric Genetics Group,
Medical Genetics,
Molecular Medicine Centre,
Western General Hospital,
Crewe Rd West,
Edinburgh UK
EH4 2XU

Tel: 44 131 651 1084
richard.adams@ed.ac.uk


From brian_osborne at cognia.com  Thu Oct 30 07:30:23 2003
From: brian_osborne at cognia.com (Brian Osborne)
Date: Thu Oct 30 07:29:07 2003
Subject: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
In-Reply-To: <2C8DDD99-0A88-11D8-A107-000A959E1622@utmem.edu>
Message-ID: <GAEDKMGOKFBLJPKCLKCCCELDDDAA.brian_osborne@cognia.com>

John and Rob,

News of the release of these new classes appeared at the Bioperl Web site on
July 15, in the News section. I updated bptutorial shortly thereafter.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Rob Edwards
Sent: Wednesday, October 29, 2003 10:22 PM
To: Gray, John
Cc: bioperl-l@portal.open-bio.org
Subject: Re: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids


>
> As Rob has said - it's new - not sure why the tutorial has been
> updated to
> preceede the not-yet released code though.

Because it's so cool?


> http://search.open-bio.org/   ?  Seems to work just fine for me.
> Putting
> in 'Restriction' finds lots of hits and restricting that with 'Enzyme'
>

Searching the web archive at http://bioperl.org/pipermail/bioperl-l/
doesn't work. The main difference seems to be these two lines:

http://search.open-bio.org/ : (works)
<input type="hidden" name="config" value="htdig">
<input type="hidden" name="restrict" value="bioperl.org/">

and

http://bioperl.org/pipermail/bioperl-l : (doesn't work)
<input type="hidden" name="config" value="htdig-mailman/bioperl-l">
<input type="hidden" name="restrict" value="">


(procrastination means always having something to do tomorrow).

Rob

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From walsh at cenix-bioscience.com  Thu Oct 30 02:08:17 2003
From: walsh at cenix-bioscience.com (Andrew Walsh)
Date: Thu Oct 30 08:32:21 2003
Subject: [Bioperl-l] perl script to retrieve accession numbers
In-Reply-To: <20031029170353.14118.qmail@web14612.mail.yahoo.com>
References: <20031029170353.14118.qmail@web14612.mail.yahoo.com>
Message-ID: <3FA0B8E1.4090808@cenix-bioscience.com>

You should read the Bioperl tutorial section on this:

http://www.bioperl.org/Core/Latest/bptutorial.html

http://www.bioperl.org/Core/Latest/bptutorial.html#iii.1.1_accessing_remote_databases_(bio::db::genbank,_etc)

Also read the PODs for Bio::DB::GenBank (i.e. use perldoc).

If you are retrieving lots of sequences, try using Batch Entrez on the 
NCBI website to get local copies of the sequences, then use 
Bio::Index::GenBank and Bio::SeqIO.  Check out those POD and you should 
go through the tutorial.

Andrew


M V wrote:
> Hi there,
>  
> Is there a perl script that retrieves the genbank accession numbers if I have a list of gene names? Thanks in advance.
>  
> Murmanmadi
> 
> 
> ---------------------------------
> Do you Yahoo!?
> Exclusive Video Premiere - Britney Spears
> 
> 
> ------------------------------------------------------------------------
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l


-- 
------------------------------------------------------------------
Andrew Walsh, M.Sc.
Bioinformatics Software Engineer
IT Unit
Cenix BioScience GmbH
Pfotenhauerstr. 108
01307 Dresden, Germany
Tel. +49(351)210-2699
Fax  +49(351)210-1309

public key: http://www.cenix-bioscience.com/public_keys/walsh.gpg
------------------------------------------------------------------


From John.Gray at stjude.org  Thu Oct 30 09:37:34 2003
From: John.Gray at stjude.org (Gray, John)
Date: Thu Oct 30 09:34:27 2003
Subject: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
Message-ID: <1E0CC447E59C974CA5C7160D2A2854ECC51A72@SJMEMXMB04.stjude.sjcrh.local>

Thank you all for the help.

The broken archive search is at http://bioperl.org/pipermail/bioperl-l/.  I will use http://search.open-bio.org in the future.

Cutting circular DNAs is not as simple as concatenating the first and last fragments, as frequently the recognition sequence will span the origin of the plasmid, and in that case the site gets missed.  I have made a work-around within Bio::RestrictionEnzyme by appending the sequence with a short duplication of the beginning:

## Make circularity function properly AGAIN!!
## $concat is set to the length of the recognition sequence -1
$concat = 5;
$sequence = uc ($& . (substr $sequence, 0, $concat-1));

The additional sequence must then be later removed from the appropriate fragment.  (Here I am only after the length of the fragments):

## Cut the sequence with the restriction enzyme object.
@fragments = eval{ $re->cut_seq($seq); };
$@ && die "\n*** TROUBLE CUTTING:\n$@\n"; 

## pull lengths from array of sequences
my @fraglen;
my $i = 0;
foreach (@fragments) {
        $fraglen[$i] = length $_;
        $i++;
        }
## Sum first and last fragment lengths due to circularity, remove last
$fraglen[0] = $fraglen[0] + $fraglen[@fraglen-1] - $concat;
pop @fraglen;


This works well so far, but it is cumbersome because I must modify the sequence in a specific way for each enzyme recognition site length.

Reading the documentation for the new Bio::Restriction module I see that it has many improvements which would have made my code much easier, but it is not clear that the methods address the problem that I raised above.  Anyone have any idea?

Thanks again-

John Gray


-----Original Message-----
From: Rob Edwards [mailto:redwards@utmem.edu] 
Sent: Wednesday, October 29, 2003 6:31 PM
To: Gray, John
Cc: bioperl-l@portal.open-bio.org
Subject: Re: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids

The new Bio::Restriction modules should tell you the correct sizes of  
circular plasmids. This will do exactly what you want. The old  
Bio::Tools::RestrictionEnzyme does not have this ability, though you  
can just merge the first and last fragments of a sequence.

The newer Bio::Restriction are (I believe) only available in BioPerl  
1.3 and later. This is the upcoming release and the first RC seems  
quite stable.

You should probably tell someone about the searches not working, but I  
guess you just have :)

Rob


On Wednesday, October 29, 2003, at 10:39  AM, Gray, John wrote:

> I am trying to find documentation for the Bio::Restriction module, to  
> see if it can properly predict restriction fragment sizes for circular  
> plasmids.
>
>
>
> I'm sorry to have to ask.  The tutorial at  
> http://www.bioperl.org/Core/Latest/bptutorial.html says that  
> Bio::Restriction replaces the older Bio::RestrictionEnzyme, which I  
> have used.  I can't find any documentation for this method at  
> http://doc.bioperl.org/releases/bioperl-1.2.3/.
>
>
>
> The Seq documentation does not provide any description of the  
> 'is_circular' method, and I was hoping that I could set the Seq object  
> to be circular, and then just run something like  
> $enyme_object->cut_seq($seq_object) to get my list of strings  
> containing the resulting fragments.
>
>
>
> Thank you for your help.
>
>
>
> PS.  When I try and search the Bioperl-l archive, it doesn't return  
> any matches, even when I try to match the word 'bioperl'.  Should I  
> tell someone about this?
>
>
>
> ----------------------------------------------------------------------- 
> --------------
> John T. Gray, Ph.D.
> Director, Vector Development & Production
> Experimental Hematology Division
> Hematology-Oncology
> St. Jude Children's Research Hospital
> D2038A, Mail Stop 341
> 332 N. Lauderdale Street
> Memphis, TN 38105
>
> (901) 495-4729 phone
> (901) 495-2176 fax
>
> John.Gray@stjude.org <mailto:John.Gray@stjude.org>
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l


From John.Gray at stjude.org  Thu Oct 30 09:48:59 2003
From: John.Gray at stjude.org (Gray, John)
Date: Thu Oct 30 09:45:53 2003
Subject: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
Message-ID: <1E0CC447E59C974CA5C7160D2A2854ECC520A3@SJMEMXMB04.stjude.sjcrh.local>

Sorry, the code I pasted in my last email was missing some hard returns.  It should have looked like this:

## Make circularity function properly AGAIN!!
## $concat is set to the length of the recognition sequence -1 
$concat = 5; 
$sequence = uc ($& . (substr $sequence, 0, $concat-1));

The additional sequence must then be later removed from the appropriate fragment.  (Here I am only after the length of the fragments):

## Cut the sequence with the restriction enzyme object. 
@fragments = eval{ $re->cut_seq($seq); }; 
$@ && die "\n*** TROUBLE CUTTING:\n$@\n"; 

## pull lengths from array of sequences
my @fraglen;
my $i = 0;
foreach (@fragments) {
        $fraglen[$i] = length $_;
        $i++;
        }
## Sum first and last fragment lengths due to circularity, remove last 
$fraglen[0] = $fraglen[0] + $fraglen[@fraglen-1] - $concat; 
pop @fraglen;


-------------------------------------------------------------------------------------
John T. Gray, Ph.D.
Director, Vector Development & Production
Experimental Hematology Division
Hematology-Oncology
St. Jude Children's Research Hospital
D2038A, Mail Stop 341
332 N. Lauderdale Street
Memphis, TN 38105?
(901) 495-4729 phone
(901) 495-2176 fax
John.Gray@stjude.org


From redwards at utmem.edu  Thu Oct 30 10:37:44 2003
From: redwards at utmem.edu (Rob Edwards)
Date: Thu Oct 30 10:34:35 2003
Subject: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
In-Reply-To: <1E0CC447E59C974CA5C7160D2A2854ECC51A72@SJMEMXMB04.stjude.sjcrh.local>
Message-ID: <0141582E-0AEF-11D8-A107-000A959E1622@utmem.edu>

The best way to do it is to probably cut the sequences, join the first  
and last sequence as is done now, and then see if the enzyme cuts that  
fragment and if so sort out where the fragment should be. I think it is  
reasonably trivial and I can take a look at it soon.

Rob


On Thursday, October 30, 2003, at 08:37  AM, Gray, John wrote:

> Thank you all for the help.
>
> The broken archive search is at  
> http://bioperl.org/pipermail/bioperl-l/.  I will use  
> http://search.open-bio.org in the future.
>
> Cutting circular DNAs is not as simple as concatenating the first and  
> last fragments, as frequently the recognition sequence will span the  
> origin of the plasmid, and in that case the site gets missed.  I have  
> made a work-around within Bio::RestrictionEnzyme by appending the  
> sequence with a short duplication of the beginning:
>
> ## Make circularity function properly AGAIN!!
> ## $concat is set to the length of the recognition sequence -1
> $concat = 5;
> $sequence = uc ($& . (substr $sequence, 0, $concat-1));
>
> The additional sequence must then be later removed from the  
> appropriate fragment.  (Here I am only after the length of the  
> fragments):
>
> ## Cut the sequence with the restriction enzyme object.
> @fragments = eval{ $re->cut_seq($seq); };
> $@ && die "\n*** TROUBLE CUTTING:\n$@\n";
>
> ## pull lengths from array of sequences
> my @fraglen;
> my $i = 0;
> foreach (@fragments) {
>         $fraglen[$i] = length $_;
>         $i++;
>         }
> ## Sum first and last fragment lengths due to circularity, remove last
> $fraglen[0] = $fraglen[0] + $fraglen[@fraglen-1] - $concat;
> pop @fraglen;
>
>
> This works well so far, but it is cumbersome because I must modify the  
> sequence in a specific way for each enzyme recognition site length.
>
> Reading the documentation for the new Bio::Restriction module I see  
> that it has many improvements which would have made my code much  
> easier, but it is not clear that the methods address the problem that  
> I raised above.  Anyone have any idea?
>
> Thanks again-
>
> John Gray
>
>
>
>
> -----Original Message-----
> From: Rob Edwards [mailto:redwards@utmem.edu]
> Sent: Wednesday, October 29, 2003 6:31 PM
> To: Gray, John
> Cc: bioperl-l@portal.open-bio.org
> Subject: Re: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
>
> The new Bio::Restriction modules should tell you the correct sizes of
> circular plasmids. This will do exactly what you want. The old
> Bio::Tools::RestrictionEnzyme does not have this ability, though you
> can just merge the first and last fragments of a sequence.
>
> The newer Bio::Restriction are (I believe) only available in BioPerl
> 1.3 and later. This is the upcoming release and the first RC seems
> quite stable.
>
> You should probably tell someone about the searches not working, but I
> guess you just have :)
>
> Rob
>
>
>
> On Wednesday, October 29, 2003, at 10:39  AM, Gray, John wrote:
>
>> I am trying to find documentation for the Bio::Restriction module, to
>> see if it can properly predict restriction fragment sizes for circular
>> plasmids.
>>
>>
>>
>> I'm sorry to have to ask.  The tutorial at
>> http://www.bioperl.org/Core/Latest/bptutorial.html says that
>> Bio::Restriction replaces the older Bio::RestrictionEnzyme, which I
>> have used.  I can't find any documentation for this method at
>> http://doc.bioperl.org/releases/bioperl-1.2.3/.
>>
>>
>>
>> The Seq documentation does not provide any description of the
>> 'is_circular' method, and I was hoping that I could set the Seq object
>> to be circular, and then just run something like
>> $enyme_object->cut_seq($seq_object) to get my list of strings
>> containing the resulting fragments.
>>
>>
>>
>> Thank you for your help.
>>
>>
>>
>> PS.  When I try and search the Bioperl-l archive, it doesn't return
>> any matches, even when I try to match the word 'bioperl'.  Should I
>> tell someone about this?
>>
>>
>>
>> ---------------------------------------------------------------------- 
>> -
>> --------------
>> John T. Gray, Ph.D.
>> Director, Vector Development & Production
>> Experimental Hematology Division
>> Hematology-Oncology
>> St. Jude Children's Research Hospital
>> D2038A, Mail Stop 341
>> 332 N. Lauderdale Street
>> Memphis, TN 38105
>>
>> (901) 495-4729 phone
>> (901) 495-2176 fax
>>
>> John.Gray@stjude.org <mailto:John.Gray@stjude.org>
>>
>>
>>
>> _______________________________________________
>> Bioperl-l mailing list
>> Bioperl-l@portal.open-bio.org
>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From redwards at utmem.edu  Thu Oct 30 20:52:41 2003
From: redwards at utmem.edu (Rob Edwards)
Date: Thu Oct 30 20:49:31 2003
Subject: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
In-Reply-To: <1E0CC447E59C974CA5C7160D2A2854ECC520A3@SJMEMXMB04.stjude.sjcrh.local>
Message-ID: <E9FBA406-0B44-11D8-A107-000A959E1622@utmem.edu>

The following is a quick patch for Bio/Restriction/Analysis.pm so that 
it handles circular sequences correctly if there is another cut site in 
the region that has been linearized. At the moment it won't handle a 
single cut site at that point (e.g. pBR322 has a single EcoRI site at 
the point it is circularized). I am not sure how to deal with this and 
need to think about it (the fragments are right but the cut sites are 
not).

Can someone submit it for me?

I have submitted a Bugzilla report as #1548

120c120,121
< for further analysis. However, this will change the start of the
---
 > for further analysis. This fragment will also be checked for cuts
 > by the enzyme(s). However, this will change the start of the
737c738,749
<                 unshift (@re_frags, $last.$first);
---
 >               my $newfrag=$last.$first;
 >               my @cuts = split /($beforeseq)($afterseq)/i, $newfrag;
 >               my @newfrags;
 >               if ($#cuts) {
 >                # there is another cut
 >                for (my $i=0; $i<=$#cuts; $i+=2) {push (@newfrags, 
$cuts[$i].$cuts[$i+1])}
 >               }
 >               else {
 >                # there isn't another cut
 >                push (@newfrags, $newfrag);
 >               }
 >               push @re_frags, @newfrags;


Thanks

Rob

From wes.barris at csiro.au  Fri Oct 31 00:58:43 2003
From: wes.barris at csiro.au (Wes Barris)
Date: Fri Oct 31 00:55:51 2003
Subject: [Bioperl-l] How do you create a Bio::Align::AlignI object?
Message-ID: <3FA1FA13.8070806@csiro.au>

Hi,

I need to convert the contents of an ACE file into a series of MSF files
(one for each contig).  Since BioPerl does not offer a straight-forward
conversion process for this task, I have to manually read each contig
section of the ACE file and manually create the structures for an
AlignI object.  The only way I have found to create an AlignI object
was by opening an input stream to an existing file.  As a result, I
had to write the consensus to a file just so that I could open it again
and gain access to an AlignI object.

Here is my code.  Is there a slicker way to do this?

acetomsf.pl:

#!/usr/local/bin/perl -w
#
#
use strict;
use Bio::Assembly::IO;
use Bio::AlignIO;
use Bio::SeqIO;
#
my $usage = "Usage: $0 <infile.ace>\n";
my $infile = shift or die $usage;
my $prefix = 'btcn';
my $io = new Bio::Assembly::IO(-file=>$infile, -format=>'ace');
my $assembly = $io->next_assembly;

foreach my $contig ($assembly->all_contigs()) {
    my $contigName = $prefix.$contig->id;
#  $contig->match();    # not yet implemented
#
# Write the consensus to a file.
#
    my $consensusSeq = new Bio::Seq(
                         -seq=>$contig->get_consensus_sequence->seq,
                         -id=>$prefix.$contig->id);
    my $seqout = new Bio::SeqIO(-file=>">$contigName.fa", -format=>'fasta');
    $seqout->write_seq($consensusSeq);
#
# Snarf the consensus back in so that we can create a Bio::Align::AlignI object.
#
    my $instream  = new Bio::AlignIO(-format=>'fasta', -file=>"$contigName.fa");
    my $outstream = new Bio::AlignIO(-format=>'msf', -file=>">$contigName.msf");
    my $aln = $instream->next_aln();
#
# Loop through each sequence in the contig adding it to the new alignment.
#
    foreach my $seq ($contig->each_seq) {
#
# Some of the ACE starting locations are negative.
#
       if ($contig->get_seq_coord($seq)->start <= 0) {
          my $offset = -$contig->get_seq_coord($seq)->start + 2;
          $seq->seq($seq->subseq($offset,$seq->length));
          print($seq->primary_id," was clipped at the beginning by $offset\n");
          }
#
# Pad each sequence so it aligns with the consensus.
#
       my $before = $contig->get_seq_coord($seq)->start - 1;
       my $after = $contig->get_consensus_length - $contig->get_seq_coord($seq)->end;
       my $alignedSequence = '-' x $before . $seq->seq . '-' x $after;
#
# Some of the ACE ending locations go beyond the consensus.
#
       if (length($alignedSequence) > $contig->get_consensus_length) {
          $alignedSequence = substr($alignedSequence, 1 ,$contig->get_consensus_length);
          print($seq->primary_id," was clipped at the end\n");
          }
       $seq->seq($alignedSequence);
       $aln->add_seq($seq);
       }
    $aln->set_displayname_flat;
    $outstream->write_aln($aln);
    undef $outstream;
    }


-- 
Wes Barris
E-Mail: Wes.Barris@csiro.au


From Daniel.Lang at biologie.uni-freiburg.de  Fri Oct 31 05:30:48 2003
From: Daniel.Lang at biologie.uni-freiburg.de (Daniel Lang)
Date: Fri Oct 31 05:27:46 2003
Subject: [Bioperl-l] SearchIO::Writer::HTMLResultWriter question
Message-ID: <3FA239D8.4040609@biologie.uni-freiburg.de>

Hi,
Is it possible, that currently the search function on the Archives page 
doesn?t work...I don?t get any search results at all, no matter what 
search term I use:(
So mabye this has already been asked...

Is there a "No Hits found" function in HTMLResultWriter?
Or can I make the results-filter do something like this?

Thanks in advance,
Daniel


From d.gatherer at vir.gla.ac.uk  Fri Oct 31 07:13:21 2003
From: d.gatherer at vir.gla.ac.uk (Derek Gatherer)
Date: Fri Oct 31 07:09:45 2003
Subject: [Bioperl-l] gcg.pm
In-Reply-To: <200310311034.h9VAWjde004507@portal.open-bio.org>
Message-ID: <5.2.1.1.1.20031031120319.00af7328@udcf.gla.ac.uk>

Hi

Please excuse if this is an already known bug.

Seq.pm says you can do this:


# make from strings in script
     $seqobj = Bio::Seq->new( -display_id => 'my_id',
                              -seq => $sequence_as_string);

and then you can do:

     $seq_out->write_seq($seqobj);

which is fine, except if you previously defined a GCG-format IO, eg:

my $seq_out = Bio::SeqIO->new('-file' => ">$outfile", '-format' => "GCG");

in which case you have to do

$seqobj = Bio::Seq->new( -display_id => 'my_id',
                              -seq => $sequence_as_string,
                              -desc => "a description");

Otherwise, you get:

Use of uninitialized value in concatenation (.) or string at 
/usr/local/lib/site_perl/5.8.0//Bio/SeqIO/gcg.pm line 197,

The offending line in gcg.pm is:

197 push(@out,"$comment\n");

which causes offence since:

178     my $comment     = $seq->desc;

If the IO format is FASTA, none of this happens and the output is clean.

I suggest that a default $comment should be used in gcg.pm, which is 
"!!AA_SEQUENCE" or "!!NA_SEQUENCE" depending on sequence type, as that is 
the normal first line in most CGC files.

Cheers
Derek

From Richard.Adams at ed.ac.uk  Fri Oct 31 07:24:28 2003
From: Richard.Adams at ed.ac.uk (Richard Adams)
Date: Fri Oct 31 07:21:24 2003
Subject: [Bioperl-l] Re: default RemoteBlast user agent
References: <Pine.LNX.4.50.0310302126130.26634-200000@tenero.duhs.duke.edu>
Message-ID: <3FA2547C.9030202@ed.ac.uk>


Hi Jason,
    Do you think it's worthwhile including error checking for sensible 
blast parameters or  is it acceptable to just let people figure out the 
blast failed submsiion html for themselves?

Richard
Jason Stajich wrote:

> Richard -
>
> Grateful that you are willing to take on RemoteBlast.  Not my cup of tea
> any more.  I think SearchIO should be able to handle bl2seq, megablast,
> and rpsblast outputs just fine now.  May have to cleanup any <pre> 
> tags if
> NCBI throws them in there.
>
> Here is some correspondance I've had with someone at NCBI - presumably 
> the
> simple script he sent is probably what you already knew - but I'm sending
> it along at any rate.
>
> Let me know if you have any questions, etc.
>
> Best,
> -jason
>
> -- 
> Jason Stajich
> Duke University
> jason at cgt.mc.duke.edu
>
> ---------- Forwarded message ----------
> Date: Wed, 22 Oct 2003 11:57:51 -0400
> From: George Coulouris <coulouri@ncbi.nlm.nih.gov>
> Cc: jason@bioperl.org
> Subject: Re: default RemoteBlast user agent?
>
> George Coulouris wrote:
>
>> Jason-
>>
>> On an unrelated note, are there plans to support multiple queries /
>> megablast nucleotide searches?
>>
>> -George
>
>
> Jason,
>
> I've attached some perl code that demonstrates how to do batch
> submission of multiple nucleotide sequences with megablast. Usage is:
>
> ./remote_megablastp.pl file1 file2..
>
> Input files should be in fasta format.
>
> It will emit the CGI output from which the RID can be obtained. Polling
> for results / obtaining results / formatting results can proceed as usual.
>
>------------------------------------------------------------------------
>
>#!/opt/local/bin/perl
>
>use URI::Escape;
>use LWP::UserAgent;
>use HTTP::Request::Common qw(POST);
>
>$ua = LWP::UserAgent->new;
>
># read and encode the queries
>foreach $query (@ARGV)
>{
>open(QUERY,$query);
>while(<QUERY>)
>{
>$encoded_query = $encoded_query . uri_escape($_);
>}
>}
>
>$args = 'CMD=Put&PROGRAM=blastn&DATABASE=nt&MEGABLAST=on&QUERY=' . $encoded_query;
>
>$req = new HTTP::Request POST => 'http://www.ncbi.nlm.nih.gov/blast/Blast.cgi';
>$req->content_type('application/x-www-form-urlencoded');
>$req->content($args);
>
>#print $req->as_string();
>
>$response = $ua->request($req);
>
>if ($response->is_success)
>{
>print $response->content;
>}
>else
>{
>print $response->error_as_HTML;
>}
>  
>

-- 
Dr Richard Adams
Bioinformatician,
Psychiatric Genetics Group,
Medical Genetics,
Molecular Medicine Centre,
Western General Hospital,
Crewe Rd West,
Edinburgh UK
EH4 2XU

Tel: 44 131 651 1084
richard.adams@ed.ac.uk


From jason at cgt.duhs.duke.edu  Fri Oct 31 08:15:03 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Oct 31 08:12:24 2003
Subject: [Bioperl-l] SearchIO::Writer::HTMLResultWriter question
In-Reply-To: <3FA239D8.4040609@biologie.uni-freiburg.de>
References: <3FA239D8.4040609@biologie.uni-freiburg.de>
Message-ID: <Pine.LNX.4.50.0310310812370.30433-100000@tenero.duhs.duke.edu>

On Fri, 31 Oct 2003, Daniel Lang wrote:

> Hi,
> Is it possible, that currently the search function on the Archives page
> doesn�t work...I don�t get any search results at all, no matter what
> search term I use:(
> So mabye this has already been asked...

Use http://search.open-bio.org/ which works just fine for me  -- will get
Chris to look at the other page to see what is going on.
>

> Is there a "No Hits found" function in HTMLResultWriter?
> Or can I make the results-filter do something like this?
>
You mean, print out 'No Hits found' in the report?  Not currently - should
be an easy extension though.  You'd need edit HTMLResultWriter though as
there is currently no hook for this particular part - if there are no Hits
the hit_filter won't be called of course.

-jason

> Thanks in advance,
> Daniel
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu

From jason at cgt.duhs.duke.edu  Fri Oct 31 08:24:48 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Oct 31 08:21:49 2003
Subject: [Bioperl-l] gcg.pm
In-Reply-To: <5.2.1.1.1.20031031120319.00af7328@udcf.gla.ac.uk>
References: <5.2.1.1.1.20031031120319.00af7328@udcf.gla.ac.uk>
Message-ID: <Pine.LNX.4.50.0310310817320.30433-100000@tenero.duhs.duke.edu>

Go ahead and submit it as a bug at http://bugzilla.open-bio.org/ so that
we have a record of this and someone who wants to jump in and fix simple
bugs can do so.

You can also just change the line
 my $comment     = $seq->desc;
to
 $comment = '' unless defined $comment;
To get rid of the warning.

Note that the GCG output is still clean and usuable, you're just getting
warnings to STDERR.

The more sophisticated soln you propose would be a pretty simple
if / elsif based on $seq->alphabet

I would love to see people submitting code patches too when it
something pretty straight forward like this...

-jason
On Fri, 31 Oct 2003, Derek Gatherer wrote:

> Hi
>
> Please excuse if this is an already known bug.
>
> Seq.pm says you can do this:
>
>
> # make from strings in script
>      $seqobj = Bio::Seq->new( -display_id => 'my_id',
>                               -seq => $sequence_as_string);
>
> and then you can do:
>
>      $seq_out->write_seq($seqobj);
>
> which is fine, except if you previously defined a GCG-format IO, eg:
>
> my $seq_out = Bio::SeqIO->new('-file' => ">$outfile", '-format' => "GCG");
>
> in which case you have to do
>
> $seqobj = Bio::Seq->new( -display_id => 'my_id',
>                               -seq => $sequence_as_string,
>                               -desc => "a description");
>
> Otherwise, you get:
>
> Use of uninitialized value in concatenation (.) or string at
> /usr/local/lib/site_perl/5.8.0//Bio/SeqIO/gcg.pm line 197,
>
> The offending line in gcg.pm is:
>
> 197 push(@out,"$comment\n");
>
> which causes offence since:
>
> 178     my $comment     = $seq->desc;
>
> If the IO format is FASTA, none of this happens and the output is clean.
>
> I suggest that a default $comment should be used in gcg.pm, which is
> "!!AA_SEQUENCE" or "!!NA_SEQUENCE" depending on sequence type, as that is
> the normal first line in most CGC files.
>
> Cheers
> Derek
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From Daniel.Lang at biologie.uni-freiburg.de  Fri Oct 31 08:54:24 2003
From: Daniel.Lang at biologie.uni-freiburg.de (Daniel Lang)
Date: Fri Oct 31 08:51:16 2003
Subject: [Bioperl-l] allowgaps parameter in Bio::SearchIO
Message-ID: <3FA26990.5030909@biologie.uni-freiburg.de>

Hi,
I?m getting wrong values for the "allowgaps" parameter while parsing
NCBI-like BLAST results:
The parser does not detect if a the report was generated with -g F
switch(ungapped) and the allowgaps values is "yes", when it should be
"no"...
I guess the parameter is based on the occurence of "Gapped" after the
"Lambda"-part, right?

Here is an example BLAST output snipplet(-g F):

Searching..................................................done

  ***** No hits found ******

   Database: ppp_orfpep
     Posted date:  Sep 18, 2003  6:31 AM
   Number of letters in database: 2,367,445
   Number of sequences in database:  14,583

Lambda     K      H
    0.318    0.135    0.401


Matrix: BLOSUM62
Number of Hits to DB: 378,425

Or did I missinterpret the parameter?
Thanks ind advance,
Daniel:)


From John.Gray at stjude.org  Fri Oct 31 09:36:04 2003
From: John.Gray at stjude.org (Gray, John)
Date: Fri Oct 31 09:32:56 2003
Subject: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
Message-ID: <1E0CC447E59C974CA5C7160D2A2854ECC51A76@SJMEMXMB04.stjude.sjcrh.local>

After reading some of your comments about how the site recognition is functioning, I am concerned that there may be another problem.  It commonly occurs that restriction enzyme recognition sites will overlap, and I think this may cause your method to miss some sites.  I am wondering whether it may be necessary to separate the process of site mapping and cleavage. 

For example, BssH II cuts at G^CGCGC, and the sequence of GCGCGCGC theoretically has two cut sites within it.  Of course, your algorithm is similar to reality in that once the enzyme cuts the sequence once, it probably won't be able to recognize the other site.  However, in the test tube what you will actually get is a random distribution of cutting at the two sites.  Traditionally (at least in the software I have used), the site mapping algorithms have returned all possible cut sites.

I am thinking the only way around this would be to first map the sites into an array, and then use that array to either calculate fragment sizes or sequences.  With the possibility of overlapping sites in mind, I still can't think of any way to circumvent the problem of the origin on circular sequences without concatenating the sequence to simulate circularity.

John

-----Original Message-----
From: Rob Edwards [mailto:redwards@utmem.edu] 
Sent: Thursday, October 30, 2003 7:53 PM
To: bioperl-l@portal.open-bio.org
Subject: Re: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids

The following is a quick patch for Bio/Restriction/Analysis.pm so that 
it handles circular sequences correctly if there is another cut site in 
the region that has been linearized. At the moment it won't handle a 
single cut site at that point (e.g. pBR322 has a single EcoRI site at 
the point it is circularized). I am not sure how to deal with this and 
need to think about it (the fragments are right but the cut sites are 
not).

Can someone submit it for me?

I have submitted a Bugzilla report as #1548

120c120,121
< for further analysis. However, this will change the start of the
---
 > for further analysis. This fragment will also be checked for cuts
 > by the enzyme(s). However, this will change the start of the
737c738,749
<                 unshift (@re_frags, $last.$first);
---
 >               my $newfrag=$last.$first;
 >               my @cuts = split /($beforeseq)($afterseq)/i, $newfrag;
 >               my @newfrags;
 >               if ($#cuts) {
 >                # there is another cut
 >                for (my $i=0; $i<=$#cuts; $i+=2) {push (@newfrags, 
$cuts[$i].$cuts[$i+1])}
 >               }
 >               else {
 >                # there isn't another cut
 >                push (@newfrags, $newfrag);
 >               }
 >               push @re_frags, @newfrags;


Thanks

Rob

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From redwards at utmem.edu  Fri Oct 31 09:48:10 2003
From: redwards at utmem.edu (Rob Edwards)
Date: Fri Oct 31 09:45:00 2003
Subject: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
In-Reply-To: <1E0CC447E59C974CA5C7160D2A2854ECC51A76@SJMEMXMB04.stjude.sjcrh.local>
Message-ID: <3F2E5F98-0BB1-11D8-94D7-000A959E1622@utmem.edu>

Yes, you are quite right. Probably in the test tube one of the sites 
will be cut randomly - behavior we could mimic but that would be very 
confusing. I agree it may be better to map the sites rather than cut 
the sequences, and this may be the best way to do it (and actually 
could be faster).

Perhaps a better way to do it is to add a new function that returns 
mapped cut sites. I need to look more at the docs, but isn't there a 
bioperl feature already to look for occurrences of a single sequence in 
a sequence? This could actually be a lot faster for non-ambiguous 
sequences by using index rather than split, and use less memory.

At the moment the module actually calculates cut sites by fragmenting 
the sequence and then calculating the length of each of the sequences 
returned by the split. This is probably not optimal.

Maybe this should be a bugzilla bug report....

Rob


On Friday, October 31, 2003, at 08:36  AM, Gray, John wrote:

> After reading some of your comments about how the site recognition is 
> functioning, I am concerned that there may be another problem.  It 
> commonly occurs that restriction enzyme recognition sites will 
> overlap, and I think this may cause your method to miss some sites.  I 
> am wondering whether it may be necessary to separate the process of 
> site mapping and cleavage.
>
> For example, BssH II cuts at G^CGCGC, and the sequence of GCGCGCGC 
> theoretically has two cut sites within it.  Of course, your algorithm 
> is similar to reality in that once the enzyme cuts the sequence once, 
> it probably won't be able to recognize the other site.  However, in 
> the test tube what you will actually get is a random distribution of 
> cutting at the two sites.  Traditionally (at least in the software I 
> have used), the site mapping algorithms have returned all possible cut 
> sites.
>
> I am thinking the only way around this would be to first map the sites 
> into an array, and then use that array to either calculate fragment 
> sizes or sequences.  With the possibility of overlapping sites in 
> mind, I still can't think of any way to circumvent the problem of the 
> origin on circular sequences without concatenating the sequence to 
> simulate circularity.
>
> John
>
> -----Original Message-----
> From: Rob Edwards [mailto:redwards@utmem.edu]
> Sent: Thursday, October 30, 2003 7:53 PM
> To: bioperl-l@portal.open-bio.org
> Subject: Re: [Bioperl-l] Restriction Enzyme cuts on Circular plasmids
>
> The following is a quick patch for Bio/Restriction/Analysis.pm so that
> it handles circular sequences correctly if there is another cut site in
> the region that has been linearized. At the moment it won't handle a
> single cut site at that point (e.g. pBR322 has a single EcoRI site at
> the point it is circularized). I am not sure how to deal with this and
> need to think about it (the fragments are right but the cut sites are
> not).
>
> Can someone submit it for me?
>
> I have submitted a Bugzilla report as #1548
>
> 120c120,121
> < for further analysis. However, this will change the start of the
> ---
>> for further analysis. This fragment will also be checked for cuts
>> by the enzyme(s). However, this will change the start of the
> 737c738,749
> <                 unshift (@re_frags, $last.$first);
> ---
>>               my $newfrag=$last.$first;
>>               my @cuts = split /($beforeseq)($afterseq)/i, $newfrag;
>>               my @newfrags;
>>               if ($#cuts) {
>>                # there is another cut
>>                for (my $i=0; $i<=$#cuts; $i+=2) {push (@newfrags,
> $cuts[$i].$cuts[$i+1])}
>>               }
>>               else {
>>                # there isn't another cut
>>                push (@newfrags, $newfrag);
>>               }
>>               push @re_frags, @newfrags;
>
>
> Thanks
>
> Rob
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>

From jason at cgt.duhs.duke.edu  Fri Oct 31 10:11:13 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Oct 31 10:08:27 2003
Subject: [Bioperl-l] allowgaps parameter in Bio::SearchIO
In-Reply-To: <3FA26990.5030909@biologie.uni-freiburg.de>
References: <3FA26990.5030909@biologie.uni-freiburg.de>
Message-ID: <Pine.LNX.4.50.0310311001270.31011-100000@tenero.duhs.duke.edu>

It's not parsed separately probably.  What you're seeing is a default
parameter which is reset to no when parsing WU-BLAST output (which
conviently has a line which says 'nogaps' to make this determination
easily).

The logic of the parsing will have to be reconsidered to try and add this
sort of information in I guess.  Unfortunately NCBI-BLAST doesn't do a
particularly nice job of reporting what were all the input parameters to
the BLAST run so it a bit of work to try and infer all of them from the
output based on what is present/absent.

A bit more work and attention needs to be paid to the stats and param
parsing, as we are not really capturing all of them.  Something I'd love
for someone to try and help with, or I will try and get to it towards the
end of November when my schedule frees up a little bit.

-jason

On Fri, 31 Oct 2003, Daniel Lang wrote:

> Hi,
> I?m getting wrong values for the "allowgaps" parameter while parsing
> NCBI-like BLAST results:
> The parser does not detect if a the report was generated with -g F
> switch(ungapped) and the allowgaps values is "yes", when it should be
> "no"...
> I guess the parameter is based on the occurence of "Gapped" after the
> "Lambda"-part, right?
>
> Here is an example BLAST output snipplet(-g F):
>
> Searching..................................................done
>
>   ***** No hits found ******
>
>    Database: ppp_orfpep
>      Posted date:  Sep 18, 2003  6:31 AM
>    Number of letters in database: 2,367,445
>    Number of sequences in database:  14,583
>
> Lambda     K      H
>     0.318    0.135    0.401
>
>
> Matrix: BLOSUM62
> Number of Hits to DB: 378,425
>
> Or did I missinterpret the parameter?
> Thanks ind advance,
> Daniel:)
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu

From d.gatherer at vir.gla.ac.uk  Fri Oct 31 10:40:25 2003
From: d.gatherer at vir.gla.ac.uk (Derek Gatherer)
Date: Fri Oct 31 10:36:16 2003
Subject: [Bioperl-l] gcg.pm, another one
In-Reply-To: <200310311034.h9VAWjde004507@portal.open-bio.org>
Message-ID: <5.2.1.1.1.20031031153144.00b1f160@udcf.gla.ac.uk>

Hello again

Last bug submitted to bugzilla.  The following may be a bug, but I wonder 
if there is a problem with my GCG format.

Try this script:

#!/usr/bin/perl -w

use lib "/usr/local/lib/site_perl/5.8.0/";
use strict;
use Bio::Seq;
use Bio::SeqIO;

my $dnain = Bio::SeqIO->new( '-format' => 'GCG' , -file => "cds.gcg");

while((my $seqobj = $dnain->next_seq()))
{
     $seqobj->display_id;
}

on the file cds.gcg below.

to get:

------------- EXCEPTION  -------------
MSG: Looks like start of another sequence. See documentation.
STACK Bio::SeqIO::gcg::next_seq 
/usr/local/lib/site_perl/5.8.0//Bio/SeqIO/gcg.pm:124
STACK toplevel gcgtest.pl:10

The problem, I think, is that the SeqIO stream doesn't seem to recognise 
the change over from one sequence to another.  Or do I need some record 
separator between my sequences???  If this really is also a bug, I'll 
submit it too.

Offending line in gcg.pm is:

  124      if( /\.\.$/ ) {
         $self->throw("Looks like start of another sequence. See 
documentation. ");
        }

and here's the file cds.gcg, containing two sequences in GCG format

!!NA_SEQUENCE 1.0
  ASSEMBLE    October 27, 2003 15:32

Symbols:     1 to: 1269  from: merlin.seq /rev   ck: 8363, 55186 to: 56454

LOCUS       MERLIN                235645 bp    DNA     linear   VRL 14-AUG-2003
DEFINITION  Human herpesvirus 5 strain Merlin, complete genome.
ACCESSION   MERLIN
VERSION
KEYWORDS    .
SOURCE      Human herpesvirus 5 . . .

merlin_ul43.cds  Length: 1269  October 27, 2003 15:32  Type: N  Check: 9039  ..

        1  ATGGAGAAAA CGCCGGCGGA GACGACGGCG GTTTCAGCTG GCAACGTGCC

       51  ACGTGACTCA ATTCCGTGTA TAACTAACGT GTCCGCGGAC ACCCGCGGCC

      101  GTACCCGCCC CAGCAGACCA GCCACCGTCC CTCAGCGACG TCCCGCGCGG

      151  ATCGGACACT TTAGGCGGCG CAGCGCCAGC CTTAGCTTTC TTGACTGGCC

      201  GGACGACAGC GTCACAGAGG GCGTTCGGAC GACCTCCGCG TCGGTCGCCG

      251  CCTCCGCGGC CCGTTTCGAC GAAATCCGGC GGCGCCGCCA GAGCATCAAC

      301  GACGAGATGA AGGAACGCAC GCTGGAGGAC GCGCTGGCTG TCGAGCTGGT

      351  CAACGAGACC TTCCGCTGCT CTGTCACCTC CGACGCCCGC AAGGACTTGC

      401  AGAAGCTGGT TCGTCGCGTC AGCGGCACGG TGCTGCGTCT CAGCTGGCCA

      451  AACGGTTGGT TCTTCACCTA CTGCGACCTG TTACGCGTCG GCTACTTTGG

      501  ACATCTCAAT ATTAAAGGTT TGGAGAAGAC CTTCCTGTGC TGCGACAAGT

      551  TCTTGCTGCC GGTGGGCACT GTGAGTCGTT GCGAAGCCAT CGGCCGCCCA

      601  CCGCTACCCG TACTCATCGG CGAGGGCGGT CGCGTCTACG TCTACTCGCC

      651  TGTGGTGGAA TCGCTGTACC TGGTGTCGCG GTCCGGTTTC CGCGGCTTCG

      701  TGCAGGAGGG CCTGCGCAAC TACGCGCCGC TGCGCGAAGA ACTGGGCTAT

      751  GTCCGCTTCG AGACCGGCGG CGACGTGGGT CGCGAGTTCA TGTTGGCGCG

      801  CGACCTGCTG GCCCTGTGGC GCCTGTGCAT GAAGCGCGAG GGTTCTATCT

      851  TCAGCTGGCG AGACGGTAAC GAGGCGCTGA CGACGGTCGT CTTGAACGGG

      901  AGCCAGACTT ACGAGGATCC GGCCCACGGC AACTGGTTAA AAGAGACGTG

      951  CTCGCTGAAC GTGCTGCAGG TATTTGTGGT GCGGGCCGTG CCGGTGGAGT

     1001  CGCAGCAGCG CCTGGACATC TCCATACTGG TGAACGAGAG CGGCGCCGTC

     1051  TTCGGCGTGC ATCCCGATAC GCGGCAGGCG CACTTTCTGG CGCGCGGACT

     1101  CCTGGGCTTC TTTCGCGTCG GGTTCTTGCG GTTCTGCAAC AACTACTGCT

     1151  TCGCCCGCGA CTGTTTTACC CACCCTGAAA GCGTGGCACC CGCTTACCGC

     1201  GCCACCGGCT GTCCCAGAGA ACTGTTTTGT CGTCGTTTGC GCAAAAAGAA

     1251  GGGGCTCTTT GCTCGAAGG

!!NA_SEQUENCE 1.0
  ASSEMBLE    October 27, 2003 15:32

Symbols:     1 to: 2718  from: merlin.seq /rev   ck: 8363, 57946 to: 60663

LOCUS       MERLIN                235645 bp    DNA     linear   VRL 14-AUG-2003
DEFINITION  Human herpesvirus 5 strain Merlin, complete genome.
ACCESSION   MERLIN
VERSION
KEYWORDS    .
SOURCE      Human herpesvirus 5 . . .

merlin_ul45.cds  Length: 2718  October 27, 2003 15:32  Type: N  Check: 4998  ..

        1  ATGAATCCGG CTGACGCGGA CGAGGAACAG CGGGTGTCCT CGGTGCCCGC

       51  ACATCGGTGC CGGCCAGGTA GGATTCCAAG CCGCAGCGCG GAAACCGAGA

      101  CGGAGGAATC GTCGGCAGAG GTCGCCGCTG ATACTATCGG GGGAGATGAC

      151  AGCGAGCTCG AGGAGGGGCC GCTGCCCGGG GGTGACAAGG AAGCGTCCGC

      201  TGGAAATACC AACGTATCGA GCGGTGTAGC ATGTGTAGCG GGTTTTACGA

      251  GTGGTGGCGG CGTCGTCAGT TGGCGTCCCG AGTCGCCGTC TCCCGACGGC

      301  ACGCCGTCTG TGCTGTCGTT GACGCGTGAC AGCGGTCCCG CCGTGCCCAG

      351  TCGCGGTGGA CGCGTGAGTA GCGGTCTGAG CACCTTTAAT CCGGCCGGCG

      401  CGACCAGGAT GGAGCTGGAC AGTGTCGAGG AGGAGGACGA TTTCGGGGCT

      451  TCGCTCTGCA AAGTATCGCC GCCGATACAA GCTATGCGCA TGTTGATGGG

      501  CAAAAAGTGT CATTGTCACG GCTACTGGGG CAAGTTTCGC TTTTGCGGCG

      551  TACAGGAGCC GGCGCGGGAG CTGCCGTCCG ACAGGAACGC GCTGTGGCGC

      601  GAGATGGACA CCGTGTCGCG GCACAGTGCC GGTTTGGGCA GTTTCAGGCT

      651  ATTTCAGCTC ATTATGCGCC ACGGTCCCTG TCTGATTCGT CACTCGCCGC

      701  GTTGCGACCT GCTGTTGGGT CGCTTTTATT TCAAAGCCAA CTGGGCGCGT

      751  GAAAGCCGCA CGCCACTGTG TTACGCTTCG GAGCTGTGCG ATGAGTCGGT

      801  GCGCCGTTTT GTGCTGCGTC ACATGGAGGA TCTACCCAAG CTGGCCGAGG

      851  AGACGGCGCG TTTTGTGGAA TTGGCCGGTT GCTGGGGCTT GTACGCGGCC

      901  ATTTTGTGTT TGGATAAGGT GTGTCGCCAA CTGCACGGAC AGGACGAGAG

      951  CCCGGGCGGC GTGTTTTTGC GCATCGCCGT GGCGTTGACG GCCGCTATCG

     1001  AGAACAGTAG GCACTCGCGC ATCTATCGTT TCCATCTGGA TGCGCGTTTC

     1051  GAGGGCGAGG TGTTGGAATC GGTGTTGAAG CGCTGTCGCG ATGGGCAGCT

     1101  GTCGCTGTCC ACCTTCACCA TGTCTACCGT GGGTTTCGAT CGCGTGCCGC

     1151  AGTACGACTT TCTGATCTCG GCCGACCCTT TCTCGCGTGA CGCCAGTTGG

     1201  GCGGCCATGT GCAAGTGGAT GAGTACCTTG AGTTGCGGCG TTTCTGTGTC

     1251  GGTGAACGTA ACGCGACTTA ACGCCGATGT GAACAGCGTG ATTCGTTGCC

     1301  TGGGGGGATA CTGCGATTTG ATACGCGAGA AGGAGGTGCA TCGACCCGTG

     1351  GTACGTGTGT TTGTGGACAT GTGGGACGTG GCCGCTATCC GCGTGATTAA

     1401  CTTTATTCTC AAAGAAAGCA CGTCGGAGTT GACGGGGGTT TGCTACGCTT

     1451  TCAACGTGCC TAGCGTGTTA ATGAAGCGCT ACCGTGCGCG TGAGCAGCGC

     1501  TACTCGCTGT TTGGGCGGCC TGTCTCCCGG CGGCTCTCGG ACCTGGGTCA

     1551  GGAGTCGGCT TTCGAGAAAG AGTATTCGCG CTGCGAGCAA TCGTGCCCCA

     1601  AGGTGGTCGT GAACACGGAC GATTTTCTGA AAAAGATGTT GCTGTGCGCG

     1651  CTCAAGGGCC GTGCCTCGGT GGTCTTTGTC CATCACGTAG TCAAGTACTC

     1701  GATTATGGCC GACAGCGTGT GCCTGCCGCC GTGCTTGAGT CCCGATATGG

     1751  CGTCGTGCCA CTTTGGCGAG TGTGACATGC CGGTGCAGCG GCTGACGGTG

     1801  AACGTGGCTC GCTGCGTGTT TGCGCGTAGC GACGAGCAGA AGCTGCATCT

     1851  ACCCGACGTG GTTTTGGGGA ACACGCGACG TTACTTTGAT TTGAGCGTGC

     1901  TGCGCGAGTT GGTGACCGAG GCGGTGGTTT GGGGCAACGC GCGCTTGGAC

     1951  GCGCTAATGT CGGCGTCCGA ATGGTGGGTA GAGAGCGCGC TGGAAAAACT

     2001  GCGTCCGCTG CACATCGGCG TGGCTGGCTT GCACACGGCG CTCATGCGGT

     2051  TAGGGTTCAC GTACTTTGCC TCTTGGGACT TGATCGAGCG CATCTTTGAG

     2101  CACATGTACT TTGCCGCGGT GCGCGCTAGC GTCGATTTGT GCAAGTCGGG

     2151  TTTGCCGCGC TGCGAGTGGT TCGAACGCAC CATCTATCAA GAGGGCAAAT

     2201  TCATTTTCGA ATTGTATCGG TTGCCGCGGC TCTCCATCGC CAGCGCGCGC

     2251  TGGGAAGCGC TGCGCGCCGA CATGCTCGAG TTCGGATTGC GCAACTGTCA

     2301  GTTTCTGGCG GTGGGTCCCG ACGACGAGGT GGCGCATCTG TGGGGCGTGA

     2351  CGCCGTCAGT GTGGGCTTCG CGCGGCACCG TGTTCGAGGA GGAGACGGTG

     2401  TGGTCATTGT GCCCGCCCAA CCGTGAGTGT TACTTCCCCA CCGTGGTGCG

     2451  GAGGCCGCTG CGCGTGCCCG TGGTGAATTA CGCGTGGTTG GAGCAGCACC

     2501  AGGAGGAGGG CAAGGCGACG CAGTGTCTGT TCCAGGCGGC ACCGGCGATC

     2551  CAAAACGACG TGGAAATGGC GGCCGTGAAC CTGAGCGTGT TTGTGGACCA

     2601  GTGCGTGGCC CTGGTTTTCT ACTATGACTC GGGGATGACG CCCGACGTGC

     2651  TTCTGGCCAG GATGCTCAAG TGGTACCACT GGCGCTTTAA GGTCGGAGTA

     2701  TATAAGTACT GTGCCTCT

From michael.watson at bbsrc.ac.uk  Fri Oct 31 11:54:09 2003
From: michael.watson at bbsrc.ac.uk (michael watson (IAH-C))
Date: Fri Oct 31 11:53:08 2003
Subject: [Bioperl-l] Glimmer
Message-ID: <20B7EB075F2D4542AFFAF813E98ACD9302822399@cl-exsrv1.irad.bbsrc.ac.uk>

Hi

Does anyone have, or does there exist, any perl modules to parse and handle the output from glimmer?

Thanks

Michael Watson
From laurichj at bioinfo.ucr.edu  Fri Oct 31 12:15:19 2003
From: laurichj at bioinfo.ucr.edu (Josh Lauricha)
Date: Fri Oct 31 12:12:10 2003
Subject: [Bioperl-l] How do you create a Bio::Align::AlignI object?
In-Reply-To: <3FA1FA13.8070806@csiro.au>
References: <3FA1FA13.8070806@csiro.au>
Message-ID: <20031031171519.GA18009@bioinfo.ucr.edu>

On Fri 10/31/03 15:58, Wes Barris wrote:
> Hi,
> 
> I need to convert the contents of an ACE file into a series of MSF files
> (one for each contig).  Since BioPerl does not offer a straight-forward
> conversion process for this task, I have to manually read each contig
> section of the ACE file and manually create the structures for an
> AlignI object.  The only way I have found to create an AlignI object
> was by opening an input stream to an existing file.  As a result, I
> had to write the consensus to a file just so that I could open it again
> and gain access to an AlignI object.
> 
> Here is my code.  Is there a slicker way to do this?

Without reading through your code, the way to create and AlignI
implementing object is useing Bio::SimpleAlign.

> 
> acetomsf.pl:
> 
> #!/usr/local/bin/perl -w
> #
> #
> use strict;
> use Bio::Assembly::IO;
> use Bio::AlignIO;
> use Bio::SeqIO;
> #
> my $usage = "Usage: $0 <infile.ace>\n";
> my $infile = shift or die $usage;
> my $prefix = 'btcn';
> my $io = new Bio::Assembly::IO(-file=>$infile, -format=>'ace');
> my $assembly = $io->next_assembly;
> 
> foreach my $contig ($assembly->all_contigs()) {
>    my $contigName = $prefix.$contig->id;
> #  $contig->match();    # not yet implemented
> #
> # Write the consensus to a file.
> #
>    my $consensusSeq = new Bio::Seq(
>                         -seq=>$contig->get_consensus_sequence->seq,
>                         -id=>$prefix.$contig->id);
>    my $seqout = new Bio::SeqIO(-file=>">$contigName.fa", -format=>'fasta');
>    $seqout->write_seq($consensusSeq);
> #
> # Snarf the consensus back in so that we can create a Bio::Align::AlignI 
> object.
> #
>    my $instream  = new Bio::AlignIO(-format=>'fasta', 
>    -file=>"$contigName.fa");
>    my $outstream = new Bio::AlignIO(-format=>'msf', 
>    -file=>">$contigName.msf");
>    my $aln = $instream->next_aln();
> #
> # Loop through each sequence in the contig adding it to the new alignment.
> #
>    foreach my $seq ($contig->each_seq) {
> #
> # Some of the ACE starting locations are negative.
> #
>       if ($contig->get_seq_coord($seq)->start <= 0) {
>          my $offset = -$contig->get_seq_coord($seq)->start + 2;
>          $seq->seq($seq->subseq($offset,$seq->length));
>          print($seq->primary_id," was clipped at the beginning by 
>          $offset\n");
>          }
> #
> # Pad each sequence so it aligns with the consensus.
> #
>       my $before = $contig->get_seq_coord($seq)->start - 1;
>       my $after = $contig->get_consensus_length - 
>       $contig->get_seq_coord($seq)->end;
>       my $alignedSequence = '-' x $before . $seq->seq . '-' x $after;
> #
> # Some of the ACE ending locations go beyond the consensus.
> #
>       if (length($alignedSequence) > $contig->get_consensus_length) {
>          $alignedSequence = substr($alignedSequence, 1 
>          ,$contig->get_consensus_length);
>          print($seq->primary_id," was clipped at the end\n");
>          }
>       $seq->seq($alignedSequence);
>       $aln->add_seq($seq);
>       }
>    $aln->set_displayname_flat;
>    $outstream->write_aln($aln);
>    undef $outstream;
>    }
> 
> 
> -- 
> Wes Barris
> E-Mail: Wes.Barris@csiro.au
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

-- 

----------------------------
| Josh Lauricha            |
| laurichj@bioinfo.ucr.edu |
| Bioinformatics, UCR      |
|--------------------------|
From laurichj at bioinfo.ucr.edu  Fri Oct 31 12:54:11 2003
From: laurichj at bioinfo.ucr.edu (Josh Lauricha)
Date: Fri Oct 31 12:51:01 2003
Subject: [Bioperl-l] Bio::SeqIO::tigr
Message-ID: <20031031175411.GB18009@bioinfo.ucr.edu>

I've written a SeqIO parser for the tigr xml data format, and would like
to contribute it to BioPerl. However, there are a couple things I don't
really like about it but don't have the time to fix right now. Could I
get some feedback from the list regaurding each?

First, some background. Since each XML file is roughly 60MB, using the
XML parsers provided by TIGR (using XML::Simple and XML::Sax, IIRC)
takes around 7-10 minutes to parse (no including BioPerl object
creation) and occationally used more than ~2.5GB of memory, which an x86
can't handle.

To get around this, I took advantage of the fact that these are machine
generated and parsed the entire file using regexp, only storing what is
"relavent" to retrieve a sequence. This means, the ~75 lines of code
TIGR used is around 1280. However, it uses around 250MB of memory and
(converting from TIGR to GenBank) runs in around two to three and a half
minutes, 30-60% slower than GenBank -> GenBank convertion.

1) The code is pretty ugly. It was one of my first "large" perl projects
   and reflects that. The uglyness is partially due to my inexperiance
   at the time, and partially do to the ugliness of the problem.

2) Its not very well commented, ok its not commented. This isn't too big
   a problem, as everything acts basically the same way, and once
   someone understands that the rest is easy. (Its really just the same
   thing over and over). Its just fairly bad form.

3) The memory usage (and runtime) could be improved by one or more of:
   a) Storing everything directly into objects rather than a tree 
   b) Using arrays to store everything rather than hashes
   c) Ignoring any tags that aren't actually used.

4) The coding style is nothing like the rest of BioPerl's. Mainly
   because, I prefer this style (PERSONAL preference, no flames,
   everyone gets their own oppinion). This is bad for a project,
   but in all honesty if I need to drastically change my coding
   style I will probably never get around to fixing up this code.

5) There is quite a long delay before anything is actually accessible
   because the nucleotide data is given at the end of the files
   (actually, at the end of an ASSEMBLY tag) so everything before it
   needs to be parsed. This leads to the first ->next_seq() call taking
   a significant time.

Since I can't show you what the object looks like, I'll show you what
the GenBank file looks like. An example of the genbank file is at:

http://bioinfo.ucr.edu/cgi-bin/seqfetch.pl?database=all&accession=At1g03870

Thanks for your time,

-- 

----------------------------
| Josh Lauricha            |
| laurichj@bioinfo.ucr.edu |
| Bioinformatics, UCR      |
|--------------------------|
From jason at cgt.duhs.duke.edu  Fri Oct 31 13:41:08 2003
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Oct 31 13:37:48 2003
Subject: [Bioperl-l] Glimmer
In-Reply-To: <20B7EB075F2D4542AFFAF813E98ACD9302822399@cl-exsrv1.irad.bbsrc.ac.uk>
References: <20B7EB075F2D4542AFFAF813E98ACD9302822399@cl-exsrv1.irad.bbsrc.ac.uk>
Message-ID: <Pine.LNX.4.50.0310311340200.31767-100000@tenero.duhs.duke.edu>

Bio::Tools::Glimmer

Only in CVS or or 1.3.x releases NOT in 1.2.x series.

-jason

On Fri, 31 Oct 2003, michael watson (IAH-C) wrote:

> Hi
>
> Does anyone have, or does there exist, any perl modules to parse and handle the output from glimmer?
>
> Thanks
>
> Michael Watson
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From hlapp at gnf.org  Fri Oct 31 14:10:08 2003
From: hlapp at gnf.org (Hilmar Lapp)
Date: Fri Oct 31 14:06:58 2003
Subject: [Bioperl-l] gcg.pm, another one
In-Reply-To: <5.2.1.1.1.20031031153144.00b1f160@udcf.gla.ac.uk>
Message-ID: <BBC7F390.24DD%hlapp@gnf.org>

The SeqIO gcg parser was written for single-sequence gcg files.

In fact, the parser I think doesn't currently have a maintainer, so anyone
who is willing to become that is welcome. Since GCG has been commercialized
since many years, you'll not easily find people among the core who actively
use GCG, just to explain why there is no maintainer.

    -hilmar

On 10/31/03 7:40 AM, "Derek Gatherer" <d.gatherer@vir.gla.ac.uk> wrote:

> Hello again
> 
> Last bug submitted to bugzilla.  The following may be a bug, but I wonder
> if there is a problem with my GCG format.
> 
> Try this script:
> 
> #!/usr/bin/perl -w
> 
> use lib "/usr/local/lib/site_perl/5.8.0/";
> use strict;
> use Bio::Seq;
> use Bio::SeqIO;
> 
> my $dnain = Bio::SeqIO->new( '-format' => 'GCG' , -file => "cds.gcg");
> 
> while((my $seqobj = $dnain->next_seq()))
> {
>    $seqobj->display_id;
> }
> 
> on the file cds.gcg below.
> 
> to get:
> 
> ------------- EXCEPTION  -------------
> MSG: Looks like start of another sequence. See documentation.
> STACK Bio::SeqIO::gcg::next_seq
> /usr/local/lib/site_perl/5.8.0//Bio/SeqIO/gcg.pm:124
> STACK toplevel gcgtest.pl:10
> 
> The problem, I think, is that the SeqIO stream doesn't seem to recognise
> the change over from one sequence to another.  Or do I need some record
> separator between my sequences???  If this really is also a bug, I'll
> submit it too.
> 
> Offending line in gcg.pm is:
> 
> 124      if( /\.\.$/ ) {
>        $self->throw("Looks like start of another sequence. See
> documentation. ");
>       }
> 
> and here's the file cds.gcg, containing two sequences in GCG format
> 
> !!NA_SEQUENCE 1.0
> ASSEMBLE    October 27, 2003 15:32
> 
> Symbols:     1 to: 1269  from: merlin.seq /rev   ck: 8363, 55186 to: 56454
> 
> LOCUS       MERLIN                235645 bp    DNA     linear   VRL
> 14-AUG-2003
> DEFINITION  Human herpesvirus 5 strain Merlin, complete genome.
> ACCESSION   MERLIN
> VERSION
> KEYWORDS    .
> SOURCE      Human herpesvirus 5 . . .
> 
> merlin_ul43.cds  Length: 1269  October 27, 2003 15:32  Type: N  Check: 9039
> ..
> 
>       1  ATGGAGAAAA CGCCGGCGGA GACGACGGCG GTTTCAGCTG GCAACGTGCC
> 
>      51  ACGTGACTCA ATTCCGTGTA TAACTAACGT GTCCGCGGAC ACCCGCGGCC
> 
>     101  GTACCCGCCC CAGCAGACCA GCCACCGTCC CTCAGCGACG TCCCGCGCGG
> 
>     151  ATCGGACACT TTAGGCGGCG CAGCGCCAGC CTTAGCTTTC TTGACTGGCC
> 
>     201  GGACGACAGC GTCACAGAGG GCGTTCGGAC GACCTCCGCG TCGGTCGCCG
> 
>     251  CCTCCGCGGC CCGTTTCGAC GAAATCCGGC GGCGCCGCCA GAGCATCAAC
> 
>     301  GACGAGATGA AGGAACGCAC GCTGGAGGAC GCGCTGGCTG TCGAGCTGGT
> 
>     351  CAACGAGACC TTCCGCTGCT CTGTCACCTC CGACGCCCGC AAGGACTTGC
> 
>     401  AGAAGCTGGT TCGTCGCGTC AGCGGCACGG TGCTGCGTCT CAGCTGGCCA
> 
>     451  AACGGTTGGT TCTTCACCTA CTGCGACCTG TTACGCGTCG GCTACTTTGG
> 
>     501  ACATCTCAAT ATTAAAGGTT TGGAGAAGAC CTTCCTGTGC TGCGACAAGT
> 
>     551  TCTTGCTGCC GGTGGGCACT GTGAGTCGTT GCGAAGCCAT CGGCCGCCCA
> 
>     601  CCGCTACCCG TACTCATCGG CGAGGGCGGT CGCGTCTACG TCTACTCGCC
> 
>     651  TGTGGTGGAA TCGCTGTACC TGGTGTCGCG GTCCGGTTTC CGCGGCTTCG
> 
>     701  TGCAGGAGGG CCTGCGCAAC TACGCGCCGC TGCGCGAAGA ACTGGGCTAT
> 
>     751  GTCCGCTTCG AGACCGGCGG CGACGTGGGT CGCGAGTTCA TGTTGGCGCG
> 
>     801  CGACCTGCTG GCCCTGTGGC GCCTGTGCAT GAAGCGCGAG GGTTCTATCT
> 
>     851  TCAGCTGGCG AGACGGTAAC GAGGCGCTGA CGACGGTCGT CTTGAACGGG
> 
>     901  AGCCAGACTT ACGAGGATCC GGCCCACGGC AACTGGTTAA AAGAGACGTG
> 
>     951  CTCGCTGAAC GTGCTGCAGG TATTTGTGGT GCGGGCCGTG CCGGTGGAGT
> 
>    1001  CGCAGCAGCG CCTGGACATC TCCATACTGG TGAACGAGAG CGGCGCCGTC
> 
>    1051  TTCGGCGTGC ATCCCGATAC GCGGCAGGCG CACTTTCTGG CGCGCGGACT
> 
>    1101  CCTGGGCTTC TTTCGCGTCG GGTTCTTGCG GTTCTGCAAC AACTACTGCT
> 
>    1151  TCGCCCGCGA CTGTTTTACC CACCCTGAAA GCGTGGCACC CGCTTACCGC
> 
>    1201  GCCACCGGCT GTCCCAGAGA ACTGTTTTGT CGTCGTTTGC GCAAAAAGAA
> 
>    1251  GGGGCTCTTT GCTCGAAGG
> 
> !!NA_SEQUENCE 1.0
> ASSEMBLE    October 27, 2003 15:32
> 
> Symbols:     1 to: 2718  from: merlin.seq /rev   ck: 8363, 57946 to: 60663
> 
> LOCUS       MERLIN                235645 bp    DNA     linear   VRL
> 14-AUG-2003
> DEFINITION  Human herpesvirus 5 strain Merlin, complete genome.
> ACCESSION   MERLIN
> VERSION
> KEYWORDS    .
> SOURCE      Human herpesvirus 5 . . .
> 
> merlin_ul45.cds  Length: 2718  October 27, 2003 15:32  Type: N  Check: 4998
> ..
> 
>       1  ATGAATCCGG CTGACGCGGA CGAGGAACAG CGGGTGTCCT CGGTGCCCGC
> 
>      51  ACATCGGTGC CGGCCAGGTA GGATTCCAAG CCGCAGCGCG GAAACCGAGA
> 
>     101  CGGAGGAATC GTCGGCAGAG GTCGCCGCTG ATACTATCGG GGGAGATGAC
> 
>     151  AGCGAGCTCG AGGAGGGGCC GCTGCCCGGG GGTGACAAGG AAGCGTCCGC
> 
>     201  TGGAAATACC AACGTATCGA GCGGTGTAGC ATGTGTAGCG GGTTTTACGA
> 
>     251  GTGGTGGCGG CGTCGTCAGT TGGCGTCCCG AGTCGCCGTC TCCCGACGGC
> 
>     301  ACGCCGTCTG TGCTGTCGTT GACGCGTGAC AGCGGTCCCG CCGTGCCCAG
> 
>     351  TCGCGGTGGA CGCGTGAGTA GCGGTCTGAG CACCTTTAAT CCGGCCGGCG
> 
>     401  CGACCAGGAT GGAGCTGGAC AGTGTCGAGG AGGAGGACGA TTTCGGGGCT
> 
>     451  TCGCTCTGCA AAGTATCGCC GCCGATACAA GCTATGCGCA TGTTGATGGG
> 
>     501  CAAAAAGTGT CATTGTCACG GCTACTGGGG CAAGTTTCGC TTTTGCGGCG
> 
>     551  TACAGGAGCC GGCGCGGGAG CTGCCGTCCG ACAGGAACGC GCTGTGGCGC
> 
>     601  GAGATGGACA CCGTGTCGCG GCACAGTGCC GGTTTGGGCA GTTTCAGGCT
> 
>     651  ATTTCAGCTC ATTATGCGCC ACGGTCCCTG TCTGATTCGT CACTCGCCGC
> 
>     701  GTTGCGACCT GCTGTTGGGT CGCTTTTATT TCAAAGCCAA CTGGGCGCGT
> 
>     751  GAAAGCCGCA CGCCACTGTG TTACGCTTCG GAGCTGTGCG ATGAGTCGGT
> 
>     801  GCGCCGTTTT GTGCTGCGTC ACATGGAGGA TCTACCCAAG CTGGCCGAGG
> 
>     851  AGACGGCGCG TTTTGTGGAA TTGGCCGGTT GCTGGGGCTT GTACGCGGCC
> 
>     901  ATTTTGTGTT TGGATAAGGT GTGTCGCCAA CTGCACGGAC AGGACGAGAG
> 
>     951  CCCGGGCGGC GTGTTTTTGC GCATCGCCGT GGCGTTGACG GCCGCTATCG
> 
>    1001  AGAACAGTAG GCACTCGCGC ATCTATCGTT TCCATCTGGA TGCGCGTTTC
> 
>    1051  GAGGGCGAGG TGTTGGAATC GGTGTTGAAG CGCTGTCGCG ATGGGCAGCT
> 
>    1101  GTCGCTGTCC ACCTTCACCA TGTCTACCGT GGGTTTCGAT CGCGTGCCGC
> 
>    1151  AGTACGACTT TCTGATCTCG GCCGACCCTT TCTCGCGTGA CGCCAGTTGG
> 
>    1201  GCGGCCATGT GCAAGTGGAT GAGTACCTTG AGTTGCGGCG TTTCTGTGTC
> 
>    1251  GGTGAACGTA ACGCGACTTA ACGCCGATGT GAACAGCGTG ATTCGTTGCC
> 
>    1301  TGGGGGGATA CTGCGATTTG ATACGCGAGA AGGAGGTGCA TCGACCCGTG
> 
>    1351  GTACGTGTGT TTGTGGACAT GTGGGACGTG GCCGCTATCC GCGTGATTAA
> 
>    1401  CTTTATTCTC AAAGAAAGCA CGTCGGAGTT GACGGGGGTT TGCTACGCTT
> 
>    1451  TCAACGTGCC TAGCGTGTTA ATGAAGCGCT ACCGTGCGCG TGAGCAGCGC
> 
>    1501  TACTCGCTGT TTGGGCGGCC TGTCTCCCGG CGGCTCTCGG ACCTGGGTCA
> 
>    1551  GGAGTCGGCT TTCGAGAAAG AGTATTCGCG CTGCGAGCAA TCGTGCCCCA
> 
>    1601  AGGTGGTCGT GAACACGGAC GATTTTCTGA AAAAGATGTT GCTGTGCGCG
> 
>    1651  CTCAAGGGCC GTGCCTCGGT GGTCTTTGTC CATCACGTAG TCAAGTACTC
> 
>    1701  GATTATGGCC GACAGCGTGT GCCTGCCGCC GTGCTTGAGT CCCGATATGG
> 
>    1751  CGTCGTGCCA CTTTGGCGAG TGTGACATGC CGGTGCAGCG GCTGACGGTG
> 
>    1801  AACGTGGCTC GCTGCGTGTT TGCGCGTAGC GACGAGCAGA AGCTGCATCT
> 
>    1851  ACCCGACGTG GTTTTGGGGA ACACGCGACG TTACTTTGAT TTGAGCGTGC
> 
>    1901  TGCGCGAGTT GGTGACCGAG GCGGTGGTTT GGGGCAACGC GCGCTTGGAC
> 
>    1951  GCGCTAATGT CGGCGTCCGA ATGGTGGGTA GAGAGCGCGC TGGAAAAACT
> 
>    2001  GCGTCCGCTG CACATCGGCG TGGCTGGCTT GCACACGGCG CTCATGCGGT
> 
>    2051  TAGGGTTCAC GTACTTTGCC TCTTGGGACT TGATCGAGCG CATCTTTGAG
> 
>    2101  CACATGTACT TTGCCGCGGT GCGCGCTAGC GTCGATTTGT GCAAGTCGGG
> 
>    2151  TTTGCCGCGC TGCGAGTGGT TCGAACGCAC CATCTATCAA GAGGGCAAAT
> 
>    2201  TCATTTTCGA ATTGTATCGG TTGCCGCGGC TCTCCATCGC CAGCGCGCGC
> 
>    2251  TGGGAAGCGC TGCGCGCCGA CATGCTCGAG TTCGGATTGC GCAACTGTCA
> 
>    2301  GTTTCTGGCG GTGGGTCCCG ACGACGAGGT GGCGCATCTG TGGGGCGTGA
> 
>    2351  CGCCGTCAGT GTGGGCTTCG CGCGGCACCG TGTTCGAGGA GGAGACGGTG
> 
>    2401  TGGTCATTGT GCCCGCCCAA CCGTGAGTGT TACTTCCCCA CCGTGGTGCG
> 
>    2451  GAGGCCGCTG CGCGTGCCCG TGGTGAATTA CGCGTGGTTG GAGCAGCACC
> 
>    2501  AGGAGGAGGG CAAGGCGACG CAGTGTCTGT TCCAGGCGGC ACCGGCGATC
> 
>    2551  CAAAACGACG TGGAAATGGC GGCCGTGAAC CTGAGCGTGT TTGTGGACCA
> 
>    2601  GTGCGTGGCC CTGGTTTTCT ACTATGACTC GGGGATGACG CCCGACGTGC
> 
>    2651  TTCTGGCCAG GATGCTCAAG TGGTACCACT GGCGCTTTAA GGTCGGAGTA
> 
>    2701  TATAAGTACT GTGCCTCT
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From hlapp at gnf.org  Fri Oct 31 14:11:38 2003
From: hlapp at gnf.org (Hilmar Lapp)
Date: Fri Oct 31 14:08:26 2003
Subject: [Bioperl-l] gcg.pm
In-Reply-To: <5.2.1.1.1.20031031120319.00af7328@udcf.gla.ac.uk>
Message-ID: <BBC7F3EA.24DE%hlapp@gnf.org>

On 10/31/03 4:13 AM, "Derek Gatherer" <d.gatherer@vir.gla.ac.uk> wrote:

> Otherwise, you get:
> 
> Use of uninitialized value in concatenation (.) or string at
> /usr/local/lib/site_perl/5.8.0//Bio/SeqIO/gcg.pm line 197,
> 
> The offending line in gcg.pm is:

This is just a warning, and the line has no detrimental effects. So as for
your code, you can safely ignore it.

I still agree with Jason to file a bug report. Or better yet, submit a patch
along with it if you have one ...

    -hilmar
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From paul.boutros at utoronto.ca  Fri Oct 31 15:59:00 2003
From: paul.boutros at utoronto.ca (paul.boutros@utoronto.ca)
Date: Fri Oct 31 15:57:07 2003
Subject: [Bioperl-l] Re: RemoteBlast
Message-ID: <1067633940.3fa2cd147b189@webmail.utoronto.ca>

>I'd be willing to volunteer for maintaining RemoteBlast if there are no
>other takers, I've used it quite a bit so I guess should try and put
>something back .
>
>Just from first thoughts I'd plan to
>
>- incorporate Megablast / Blast2seqs /rpsblast functionality
>-separate out the form submission bits of code from the Blast specific
>code in the module, perhaps by inheriting
>    from Bio::WebAgent.
>
>Other than making sure that the output is parseable by Bio::SearchIO,
>are there any strong feelings for  how new features should be
>implemented?
>
>If anyone has requests/ideas fornew functionality please say so!
>

Hi,

I started doing some modifications myself, and I'd be happy to help you if I 
can.  I have a version with Megablast enabled, as well as with additional 
set/get functions for parameters.

One thing that would be really nice: to have a standardized interface for both 
StandAloneBlast and RemoteBlast.  Perhaps that could be done with a new class, 
BlastInterfaceI?  Not sure.  I'm attaching my version below.  I've meant to 
post this here, but I've been procrastinating on adding to the test suite....

Paul

###Begin Code
# $Id: RemoteBlast.pm,v 1.14.2.1 2003/03/25 12:32:17 heikki Exp $
#
# BioPerl module for Bio::Tools::Run::RemoteBlast
#
# Cared for by Jason Stajich, Mat Wiepert
#
# Copyright Jason Stajich
#
# You may distribute this module under the same terms as perl itself

# POD documentation - main docs before the code

=head1 NAME

Bio::Tools::Run::RemoteBlast - Object for remote execution of the NCBI Blast
via HTTP

=head1 SYNOPSIS

  #Remote-blast "factory object" creation and blast-parameter initialization

  use Bio::Tools::Run::RemoteBlast;
  use strict;
  my $prog   = 'blastp';
  my $db     = 'swissprot';
  my $e_val  = '1e-10';
  my $matrix = 'BLOSUM62';

  my @params = (
         '-prog'       => $prog,
         '-data'       => $db,
         '-expect'     => $e_val,
         '-readmethod' => 'SearchIO',
         '-matrix'     => $matrix
         );

  my $factory = Bio::Tools::Run::RemoteBlast->new(@params);

  # Methods exist to change many parameters.  A full listing of 
  # available parameters is at:
  # http://www.ncbi.nlm.nih.gov/BLAST/Doc/urlapi.html

  # For example, one interesting feature of QBlast is that allows filtering
  # of results by expectation value.  For instance:
  $factory->expect_low(1e-30);
  $factory->expect_high(1e-10);
  # retrieves only results where the expected number of hits by chance
  # in the database would be between 1 and 10.  Note that this filtering
  # is done after the BLAST is run, but before results are returned.

  # One can also limit the number of alignments returned:
  $factory->alignments(20);

  # Some CGI parameters can only be changed directly, for instance
  $Bio::Tools::Run::RemoteBlast::HEADER{'PERC_IDENT'} = 97;

  # Parameters can also be removed in this way:
  delete $Bio::Tools::Run::RemoteBlast::HEADER{'PERC_IDENT'};

  my $v = 1;
  #$v is just to turn on and off the messages

  my $str = Bio::SeqIO->new(-file=>'amino.fa' , '-format' => 'fasta' );

  while (my $input = $str->next_seq()){
    #Blast a sequence against a database:

    #Alternatively, you could  pass in a file with many
    #sequences rather than loop through sequence one at a time
    #Remove the loop starting 'while (my $input = $str->next_seq())'
    #and swap the two lines below for an example of that.
    my $r = $factory->submit_blast($input);
    #my $r = $factory->submit_blast('amino.fa');

    print STDERR "waiting..." if( $v > 0 );
    while ( my @rids = $factory->each_rid ) {
      foreach my $rid ( @rids ) {
        my $rc = $factory->retrieve_blast($rid);
        if( !ref($rc) ) {
          if( $rc < 0 ) {
            $factory->remove_rid($rid);
          }
          print STDERR "." if ( $v > 0 );
          sleep 5;
        } else {
          my $result = $rc->next_result();
          #save the output
          my $filename = $result->query_name()."\.out";
          $factory->save_output($filename);
          $factory->remove_rid($rid);
          print "\nQuery Name: ", $result->query_name(), "\n";
          while ( my $hit = $result->next_hit ) {
            next unless ( $v > 0);
            print "\thit name is ", $hit->name, "\n";
            while( my $hsp = $hit->next_hsp ) {
              print "\t\tscore is ", $hsp->score, "\n";
            }
          }
        }
      }
    }
  }


=head1 DESCRIPTION

Class for remote execution of the NCBI Blast via HTTP.

For a description of the many CGI parameters see:
http://www.ncbi.nlm.nih.gov/BLAST/Doc/urlapi.html

Various additional options and input formats are available.

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to one
of the Bioperl mailing lists.  Your participation is much appreciated.

  bioperl-l@bioperl.org               - General discussion
  http://bio.perl.org/MailList.html   - About the mailing lists

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
the bugs and their resolution.  Bug reports can be submitted via email
or the web:

  bioperl-bugs@bio.perl.org
  http://bio.perl.org/bioperl-bugs/

=head1 AUTHOR -  Jason Stajich

Email jason@bioperl.org

=head1 APPENDIX

The rest of the documentation details each of the object
methods. Internal methods are usually preceded with a _

=cut

package Bio::Tools::Run::RemoteBlast;

use vars qw($AUTOLOAD @ISA $URLBASE %HEADER %RETRIEVALHEADER $RIDLINE);
use strict;

use Bio::Root::Root;
use Bio::Root::IO;
use Bio::SeqIO;
use IO::String;
use Bio::Tools::BPlite;
use Bio::SearchIO;
use LWP;
use HTTP::Request::Common;
BEGIN {
    $URLBASE = 'http://www.ncbi.nlm.nih.gov/blast/Blast.cgi';
    %HEADER = (
               'CMD'                          => 'Put',
	       'PROGRAM'                      => 'blastp',
	       'DATABASE'                     => 'nr',
	       'FILTER'                       => 'L',
	       'EXPECT'                       => '1e-3',
	       'QUERY'                        =>  '',
	       'CDD_SEARCH'                   => 'off',
	       'COMPOSITION_BASED_STATISTICS' => 'no',
	       'SERVICE'                      => 'plain',
	       );

    %RETRIEVALHEADER = (
                 'CMD'            => 'Get',
		 'RID'            => '',
		 'ALIGNMENT_VIEW' => 'Pairwise',
		 'DESCRIPTIONS'   => 500,
		 'ALIGNMENTS'     => 500,
		 'FORMAT_TYPE'    => 'Text',
                 'FORMAT_OBJECT'  => 'Alignment',
                 'SERVICE'        => 'plain',
                  );

    $RIDLINE = 'RID\s+=\s+(\S+)';

}

@ISA = qw(Bio::Root::Root Bio::Root::IO);

sub new {
    my ($caller, @args) = @_;
    # chained new
    my $self = $caller->SUPER::new(@args);
    # so that tempfiles are cleaned up
    $self->_initialize_io();
    my ($prog, $program, $data, $database, $expect, $alignments, $service, 
$format_object, 
	$hitlist_size, $word_size, $auto_format, $matrix_name, $megablast, 
$filter,
        $descriptions, $expect_low, $expect_high, $entrez_query, $gapcosts, 
$nucl_penalty,
        $nucl_reward, $readmethod) =
            $self->_rearrange([
                               qw(PROG
                                  PROGRAM
                                  DATA
                                  DATABASE
				  EXPECT
                                  ALIGNMENTS
                                  SERVICE
                                  FORMAT_OBJECT
                                  HITLIST_SIZE
                                  WORD_SIZE
                                  AUTO_FORMAT
                                  MATRIX_NAME
                                  MEGABLAST
                                  FILTER
                                  DESCRIPTIONS
                                  EXPECT_LOW
                                  EXPECT_HIGH
                                  ENTREZ_QUERY
                                  GAPCOSTS
                                  NUCL_PENALTY
                                  NUCL_REWARD
				  READMETHOD)
                              ], @args);
	
    $readmethod = 'SearchIO' unless defined $readmethod;
    $self->readmethod($readmethod);
    $self->program($prog);
    $self->program($program);	# allow -program as alias for -prog
    $self->database($data);
    $self->database($database);	# allow -database as alias for -data
    $self->expect($expect);
    $self->alignments($alignments);
    $self->service($service);
    $self->format_object($format_object);
    $self->hitlist_size($hitlist_size);
    $self->word_size($word_size);
    $self->auto_format($auto_format);
    $self->matrix_name($matrix_name);
    $self->megablast($megablast);
    $self->filter($filter);
    $self->descriptions($descriptions);
    $self->expect_low($expect_low);
    $self->expect_high($expect_high);
    $self->entrez_query($entrez_query);
    $self->gapcosts($gapcosts);
    $self->nucl_penalty($nucl_penalty);
    $self->nucl_reward($nucl_reward);

    return $self;
}


=head2 header

 Title   : header
 Usage   : my $header = $self->header
 Function: Get/Set HTTP header for blast query
 Returns : string
 Args    : none

=cut

sub header {
    my ($self) = @_;
    my %h = %HEADER;
    $h{'PROGRAM'} = $self->program;
    $h{'DATABASE'} = $self->database;
    $h{'EXPECT'}  = $self->expect;
    return %h;
}


=head2 readmethod

 Title   : readmethod
 Usage   : my $readmethod = $self->readmethod
 Function: Get/Set the method to read the blast report
 Returns : string
 Args    : string [ Blast, BPlite ]

=cut

sub readmethod {
    my ($self, $val) = @_;
    if( defined $val ) {
	$self->{'_readmethod'} = $val;
    }
    return $self->{'_readmethod'};
}


=head2 program

 Title   : program
 Usage   : my $prog = $self->program
 Function: Get/Set the program to run
 Returns : string
 Args    : string [ blastp, blastn, blastx, tblastn, tblastx ]

=cut

sub program {
    my ($self, $val) = @_;
    if( defined $val ) {
	$val = lc $val;
	if( $val !~ /t?blast[pnx]/ ) {
	    $self->warn("trying to set program to an invalid program name 
($val) -- defaulting to blastp");
	    $val = 'blastp';
	}
#	$self->{'_program'} = $val;
	$HEADER{'PROGRAM'} = $val;
    }
    return $HEADER{'PROGRAM'};
}


=head2 database

 Title   : database
 Usage   : my $db = $self->database
 Function: Get/Set the database to search
 Returns : string
 Args    : string [ swissprot, nr, nt, etc... ]

=cut

sub database {
    my ($self, $val) = @_;
    if( defined $val ) {
#	$self->{'_database'} = $val;
 	$HEADER{'DATABASE'} = $val;
    }
    return $HEADER{'DATABASE'};
}


=head2 expect

 Title   : expect
 Usage   : my $expect = $self->expect
 Function: Get/Set the E value cutoff
 Returns : string
 Args    : string [ '1e-4' ]

=cut

sub expect {
    my ($self, $val) = @_;
    if( defined $val ) {
#	$self->{'_expect'} = $val;
 	$HEADER{'EXPECT'} = $val;
    }
    return $HEADER{'EXPECT'};
}


=head2 format_object

 Title   : format_object
 Usage   : my $format_object = $self->format_object
           $self->format_object($value)
 Function: Get/Set the object-type to be retrieved
 Returns : string
 Args    : string [Alignment, Neighbors, PSSM, etc.]

=cut

sub format_object {
	my ($self, $val) = @_;
	if (defined($val)) {
		$RETRIEVALHEADER{'FORMAT_OBJECT'} = $val;
		}
	return $RETRIEVALHEADER{'FORMAT_OBJECT'};
	}


=head2 service

 Title   : service
 Usage   : my $service = $self->service
           $self->service($value)
 Function: Get/Set the NCBI Blast service to be used
 Returns : string
 Args    : string [plain, psi, phi, rpsblast, megablast]

=cut

sub service {
	my ($self, $val) = @_;
	if (defined($val)) {
		$HEADER{'SERVICE'} = $val;
                $RETRIEVALHEADER{'SERVICE'} = $val;
		}
	return $HEADER{'SERVICE'};
	}


=head2 alignments

 Title   : alignments
 Usage   : my $alignments = $self->alignments
           $self->alignments($value)
 Function: Get/Set the number of alignments to be returned
 Returns : integer
 Args    : integer (default: 500)

=cut

sub alignments {
	my ($self, $val) = @_;
	if (defined($val) && $val =~ /^(\d+)$/) {
		$RETRIEVALHEADER{'ALIGNMENTS'} = $1;
		}
	return $RETRIEVALHEADER{'ALIGNMENTS'};
	}


=head2 hitlist_size

 Title   : hitlist_size
 Usage   : my $hitlist_size = $self->hitlist_size
 Function: Get/Set the number of hits to be returned
 Returns : integer
 Args    : integer (default: 500)

=cut

sub hitlist_size {
	my ($self, $val) = @_;
	if (defined($val) && $val =~ /^(\d+)$/) {
		$HEADER{'HITLIST_SIZE'} = $1;
		}
	return $HEADER{'HITLIST_SIZE'};
	}


=head2 word_size

 Title   : word_size
 Usage   : my $word_size = $self->word_size
           $self->word_size($value)
 Function: Get/Set the BLAST word-size for HSP generation
 Returns : integer
 Args    : integer (default: protein-3; nucleotide-11; megablast-28)

=cut

sub word_size {
	my ($self, $val) = @_;
	if (defined($val) && $val =~ /^(\d+)$/) {
		$HEADER{'WORD_SIZE'} = $1;
		}
	return $HEADER{'WORD_SIZE'};
	}


=head2 auto_format

 Title   : auto_format
 Usage   : my $auto_format = $self->auto_format
           $self->auto_format($value)
 Function: Get/Set the flag for automatic result formatting
 Returns : string
 Args    : string (Values: Off (default), Semiauto, Fullauto)

=cut

sub auto_format {
	my ($self, $val) = @_;
	if (defined($val)) {
		$HEADER{'AUTO_FORMAT'} = $val;
		}
	return $HEADER{'AUTO_FORMAT'};
	}


=head2 matrix_name

 Title   : matrix_name
 Usage   : my $matrix_name = $self->matrix_name
           $self->matrix_name($value)
 Function: Get/Set the name of the substitution matrix used (protein only)
 Returns : string
 Args    : string (Default: BLOSUM62)

=cut

sub matrix_name {
	my ($self, $val) = @_;
	if (defined($val)) {
		$HEADER{'MATRIX_NAME'} = $val;
		}
	return $HEADER{'MATRIX_NAME'};
	}


=head2 megablast

 Title   : megablast
 Usage   : my $megablast = $self->megablast
           $self->megablast($value)
 Function: Flag to indicate if the MegaBLAST algorithm should be used
 Returns : string
 Args    : string (Values: 'yes', 'no' (default))

=cut

sub megablast {
	my ($self, $val) = @_;
	if (defined($val)) {
		$HEADER{'MEGABLAST'} = $val;
		}
	return $HEADER{'MEGABLAST'};
	}


=head2 filter

 Title   : filter
 Usage   : my $filter = $self->filter
           $self->filter($value)
 Function: Sequence filter identifier
 Returns : string
 Args    : string
 Values  : 'L'  for low-complexity (default)
           'R'  for human repeats
           'm'  for Mask for Lookup
           'no' to turn off

=cut

sub filter {
	my ($self, $val) = @_;
	if (defined($val) && ($val eq 'L' || $val eq 'R' || $val eq 'm' || 
$val eq 'no')) {
                if ($val eq 'no') {
		      $HEADER{'FILTER'} = '';
                      }
                else {
                      $HEADER{'FILTER'} = $val;
                      }
		}
	return $HEADER{'FILTER'};
	}


=head2 descriptions

 Title   : descriptions
 Usage   : my $descriptions = $self->descriptions
           $self->descriptions($value)
 Function: Specify number of descriptions to be returned
 Returns : integer
 Args    : integer (Default: 500)

=cut

sub descriptions {
	my ($self, $val) = @_;
	if (defined($val) && $val =~ /^(\d+)$/) {
	      $RETRIEVALHEADER{'DESCRIPTIONS'} = $1;
		}
	return $RETRIEVALHEADER{'DESCRIPTIONS'};
	}


=head2 expect_low

 Title   : expect_low
 Usage   : my $expect_low = $self->expect_low
           $self->expect_low($value)
 Function: Minimum expectation value returned
 Returns : double-type value
 Args    : double-type value (Default: 0)

=cut

sub expect_low {
	my ($self, $val) = @_;
	if (defined($val)) {
	      $RETRIEVALHEADER{'EXPECT_LOW'} = $val;
		}
	return $RETRIEVALHEADER{'EXPECT_LOW'};
	}


=head2 expect_high

 Title   : expect_high
 Usage   : my $expect_high = $self->expect_high
           $self->expect_high($value)
 Function: Maximum expectation value returned
 Returns : double-type value
 Args    : double-type value (Default: 0)

=cut

sub expect_high {
	my ($self, $val) = @_;
	if (defined($val)) {
	      $RETRIEVALHEADER{'EXPECT_HIGH'} = $val;
		}
	return $RETRIEVALHEADER{'EXPECT_HIGH'};
	}


=head2 entrez_query

 Title   : entrez_query
 Usage   : my $entrez_query = $self->entrez_query
           $self->entrez_query($value)
 Function: An Entrez query to use in limiting searches
 Returns : string
 Args    : string (Default: empty)

=cut

sub entrez_query {
	my ($self, $val) = @_;
	if (defined($val)) {
	      $HEADER{'ENTREZ_QUERY'} = $val;
		}
	return $HEADER{'ENTREZ_QUERY'};
	}


=head2 gapcosts

 Title   : gapcosts
 Usage   : my $gapcosts = $self->gapcosts
           $self->gapcosts($value)
 Function: Scoring cost for opening and extending gaps in alignments
 Returns : Two space-separated floats (open, extend)
 Args    : Two space-separated floats (open, extend)
 Default : nucleotide   5 + 2x
           protein     11 +  x
           megablast   non-affine

=cut

sub gapcosts {
	my ($self, $val) = @_;

	if (defined($val)) {

		my $flag = 1;

		my @costs = split(/ /, $val);

		if (scalar(@costs) != 2) { # require only two values
			$flag = 0;
			}

		foreach my $cost (@costs) {
			$cost =~ s/^\s+//;
			$cost =~ s/\s+$//;

			# require +ve integers or floats
			if (($cost !~ /^\d+$/) && ($cost !~ /^\d+\.\d+$/)) {
				$flag = 0;
				}

			}

		if ($flag == 1) {
			$HEADER{'GAPCOSTS'} = join(' ', @costs);
			}

		}

	return $HEADER{'GAPCOSTS'};

	}


=head2 nucl_penalty

 Title   : nucl_penalty
 Usage   : my $nucl_penalty = $self->nucl_penalty
           $self->nucl_penalty($value)
 Function: Negative score assigned to nucleotide mismatches (BLASTN)
 Returns : -ve integer
 Args    : -ve integer

=cut

sub nucl_penalty {
	my ($self, $val) = @_;
	if (defined($val) && $val =~ /^-\d+$/) {
	      $HEADER{'NUCL_PENALTY'} = $val;
		}
	return $HEADER{'NUCL_PENALTY'};
	}


=head2 nucl_reward

 Title   : nucl_reward
 Usage   : my $nucl_reward = $self->nucl_reward
           $self->nucl_reward($value)
 Function: Positive score assigned to a nucleotide match (BLASTN)
 Returns : +ve integer
 Args    : +ve integer

=cut

sub nucl_reward {
	my ($self, $val) = @_;
	if (defined($val) && $val =~ /^(\d+)$/) {
              if ($val > 0) { $HEADER{'NUCL_REWARD'} = $val; }
              }
	return $HEADER{'NUCL_REWARD'};
	}


=head2 ua

 Title   : ua
 Usage   : my $ua = $self->ua or
           $self->ua($ua)
 Function: Get/Set a LWP::UserAgent for use
 Returns : reference to LWP::UserAgent Object
 Args    : none
 Comments: Will create a UserAgent if none has been requested before.

=cut

sub ua {
    my ($self, $value) = @_;
    if( ! defined $self->{'_ua'} ) {
	$self->{'_ua'} = new LWP::UserAgent;
    }
    return $self->{'_ua'};
}


=head2 proxy

 Title   : proxy
 Usage   : $httpproxy = $db->proxy('http')  or
           $db->proxy(['http','ftp'], 'http://myproxy' )
 Function: Get/Set a proxy for use of proxy
 Returns : a string indicating the proxy
 Args    : $protocol : an array ref of the protocol(s) to set/get
           $proxyurl : url of the proxy to use for the specified protocol

=cut

sub proxy {
    my ($self,$protocol,$proxy) = @_;
    return undef if ( !defined $self->ua || !defined $protocol
		      || !defined $proxy );
    return $self->ua->proxy($protocol,$proxy);
}


sub add_rid {
    my ($self, @vals) = @_;
    foreach ( @vals ) {
	$self->{'_rids'}->{$_} = 1;
    }
    return scalar keys %{$self->{'_rids'}};
}


sub remove_rid {
    my ($self, @vals) = @_;
    foreach ( @vals ) {
	delete $self->{'_rids'}->{$_};
    }
    return scalar keys %{$self->{'_rids'}};
}


sub each_rid {
    my ($self) = @_;
    return keys %{$self->{'_rids'}};
}


=head2 submit_blast

 Title   : submit_blast
 Usage   : $self->submit_blast([$seq1,$seq2]);
 Function: Submit blast jobs to ncbi blast queue on sequence(s)
 Returns : Blast report object as defined by $self->readmethod
 Args    : input can be:
           * sequence object
           * array ref of sequence objects
           * filename of file containing fasta formatted sequences

=cut

sub submit_blast {
    my ($self, $input) = @_;
    my @seqs = $self->_load_input($input);
    return 0 unless ( @seqs );
    my $tcount = 0;
    my %header = $self->header;
    foreach my $seq ( @seqs ) {
	#If query has a fasta header, the output has the query line.
	$header{'QUERY'} = ">".(defined $seq->display_id() ? $seq->display_id
() : "").
		" ".(defined $seq->desc() ? $seq->desc() : "")."\n".$seq->seq
();
	my $request = POST $URLBASE, [%header];
	$self->warn($request->as_string) if ( $self->verbose > 0);
	my $response = $self->ua->request( $request);

	if( $response->is_success ) {
	    if( $self->verbose > 0 ) {
		my ($tempfh) = $self->tempfile();
		# Hmm, what exactly are we trying to do here?
		print $tempfh $response->content;
		close($tempfh);
		undef $tempfh;
	    }
	    my @subdata = split(/\n/, $response->content );
	    my $count = 0;
	    foreach ( @subdata ) {
		if( /$RIDLINE/ ) {
		    $count++;
		    print STDERR $_ if( $self->verbose > 0);
		    $self->add_rid($1);		
		    last;
		}	
	    }
	    if( $count == 0 ) {
		$self->warn("req was ". $request->as_string() . "\n");
		$self->warn(join('', @subdata));
	    }    	
	    $tcount += $count;
	} else {
	    # should try and be a little more verbose here
	    $self->warn("req was ". $request->as_string() . "\n" .
			$response->error_as_HTML);
	    $tcount = -1;
	}
    }
    return $tcount;
}


=head2 retrieve_blast

 Title   : retrieve_blast
 Usage   : my $blastreport = $blastfactory->retrieve_blast($rid);
 Function: Attempts to retrieve a blast report from remote blast queue
 Returns : -1 on error,
           0 on 'job not finished',
           Bio::Tools::BPlite or Bio::Tools::Blast object
           (depending on how object was initialized) on success
 Args    : Remote Blast ID (RID)

=cut

sub retrieve_blast {
    my($self, $rid) = @_;
    my (undef,$tempfile) = $self->tempfile();
    my %hdr = %RETRIEVALHEADER;
    $hdr{'RID'} = $rid;
    my $req = POST $URLBASE, [%hdr];
    if( $self->verbose > 0 ) {
	$self->warn("retrieve request is " . $req->as_string());
    }
    my $response = $self->ua->request($req, $tempfile);
    if( $self->verbose > 0 ) {
	open(TMP, $tempfile) or $self->throw("cannot open $tempfile");
	while(<TMP>) { print $_; }
	close TMP;
    }
    if( $response->is_success ) {	
	my $size = -s $tempfile;
	if( $size > 1000 ) {
	    my $blastobj;
	    if( $self->readmethod =~ /BPlite/ ) {
		$blastobj = new Bio::Tools::BPlite(-file => $tempfile);
	    } else {
		$blastobj = new Bio::SearchIO(-file => $tempfile,
					      -format => 'blast');
	    }
	    #save tempfile
	    $self->file($tempfile);
	    return $blastobj;
	} elsif( $size < 500 ) { # search had a problem
	    open(ERR, "<$tempfile") or $self->throw("cannot open file 
$tempfile");
	    $self->warn(join("", <ERR>));
	    close ERR;
	    return -1;
	} else { # still working
	    return 0;
	}
    } else {
	$self->warn($response->error_as_HTML);
	return -1;
    }
}


=head2 save_output

 Title   : saveoutput
 Usage   : my $saveoutput = $self->save_output($filename)
 Function: Method to save the blast report
 Returns : 1 (throws error otherwise)
 Args    : string [rid, filename]

=cut

sub save_output {
    my ($self, $filename) = @_;
    if( ! defined $filename ) {
        	$self->throw("Can't save blast output.  You must specify a filename to 
save to.");
    }
    #should be set when retrieving blast
   	my $blastfile = $self->file;
   	#open temp file and output file, have to filter out some HTML
	open(TMP, $blastfile) or $self->throw("cannot open $blastfile");
	open(SAVEOUT, ">$filename") or $self->throw("cannot open $filename");
	my $seentop=0;
	while(<TMP>) {
		next if (/<pre>/);	
		if( /^(?:[T]?BLAST[NPX])\s*.+$/i ||
	   		/^RPS-BLAST\s*.+$/i ) {
	   		$seentop=1;
	   	}
	   	next if !$seentop;
	    if( $seentop ) {
			print SAVEOUT;
		}
	}
	close SAVEOUT;
	close TMP;
	return 1;	
}


sub _load_input {
    my ($self, $input) = @_;

    if( ! defined $input ) {
	$self->throw("Calling remote blast with no input");	
    }
    my @seqs;
    if( ! ref $input ) {
	if( -e $input ) {
	    my $seqio = new Bio::SeqIO(-format => 'fasta', -file => $input);
	    while( my $seq = $seqio->next_seq ) {
		push @seqs, $seq;
	    }
	} else {
	    $self->throw("Input $input was not a valid filename");
	}	
    } elsif( ref($input) =~ /ARRAY/i ) {
	foreach ( @$input ) {
	    if( ref($_) && $_->isa('Bio::PrimarySeqI') ) {
		push @seqs, $_;
	    } else {
		$self->warn("Trying to add a " . ref($_) .
			    " but expected a Bio::PrimarySeqI");
	    }
	}
	if( ! @seqs) {
	    $self->throw("Did not pass in valid input -- no sequence objects 
found");
	}
    } elsif( $input->isa('Bio::PrimarySeqI') ) {
	push @seqs, $input;
    }
    return @seqs;
}
1;
__END__

###End Code


From allenday at ucla.edu  Fri Oct 31 20:29:43 2003
From: allenday at ucla.edu (Allen Day)
Date: Fri Oct 31 20:26:33 2003
Subject: [Bioperl-l] OMIM tests failing
Message-ID: <Pine.LNX.4.44.0310311728360.28330-100000@sumo.genetics.ucla.edu>

any idea what's going on here?  i see some recent commits (yesterday, 
first in 7 months) by juguang.  it worked two days ago...

-allen

[5:27pm]allenday@sumo:/raid5a/allenday/cvsroot/bioperl-live> make 
test_OMIMentry
PERL_DL_NONLAZY=1 /usr/bin/perl -Iblib/arch -Iblib/lib 
-I/usr/lib/perl5/5.8.0/i386-linux-thread-multi -I/usr/lib/perl5/5.8.0 -e 
'use Test::Harness qw(&runtests $verbose); $verbose=0; runtests @ARGV;' 
t/OMIMentry.t
t/OMIMentry....ok 5/145                                                      
------------- EXCEPTION  -------------
MSG: a hash referenced needed
STACK Bio::Phenotype::OMIM::OMIMentry::clinical_symptoms 
blib/lib/Bio/Phenotype/OMIM/OMIMentry.pm:537
STACK toplevel t/OMIMentry.t:60

--------------------------------------
t/OMIMentry....dubious                                                       
        Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 17-145
        Failed 129/145 tests, 11.03% okay
Failed Test   Stat Wstat Total Fail  Failed  List of Failed
-------------------------------------------------------------------------------
t/OMIMentry.t  255 65280   145  129  88.97%  17-145
Failed 1/1 test scripts, 0.00% okay. 129/145 subtests failed, 11.03% okay.
make: *** [test_OMIMentry] Error 2
[5:27pm]allenday@sumo:/raid5a/allenday/cvsroot/bioperl-live> make 
test_OMIMentryAllelicVariant
PERL_DL_NONLAZY=1 /usr/bin/perl -Iblib/arch -Iblib/lib 
-I/usr/lib/perl5/5.8.0/i386-linux-thread-multi -I/usr/lib/perl5/5.8.0 -e 
'use Test::Harness qw(&runtests $verbose); $verbose=0; runtests @ARGV;' 
t/OMIMentryAllelicVariant.t
t/OMIMentryAllelicVariant....ok                                              
All tests successful.
Files=1, Tests=26,  0 wallclock secs ( 0.14 cusr +  0.01 csys =  0.15 CPU)
[5:28pm]allenday@sumo:/raid5a/allenday/cvsroot/bioperl-live> make 
test_OMIMparser
PERL_DL_NONLAZY=1 /usr/bin/perl -Iblib/arch -Iblib/lib 
-I/usr/lib/perl5/5.8.0/i386-linux-thread-multi -I/usr/lib/perl5/5.8.0 -e 
'use Test::Harness qw(&runtests $verbose); $verbose=0; runtests @ARGV;' 
t/OMIMparser.t
t/OMIMparser....ok 1/173                                                     
------------- EXCEPTION  -------------
MSG: a part/organism must be assigned
STACK Bio::Phenotype::OMIM::OMIMentry::add_clinical_symptoms 
blib/lib/Bio/Phenotype/OMIM/OMIMentry.pm:567
STACK Bio::Phenotype::OMIM::OMIMparser::_finer_parse_symptoms 
blib/lib/Bio/Phenotype/OMIM/OMIMparser.pm:550
STACK Bio::Phenotype::OMIM::OMIMparser::_createOMIMentry 
blib/lib/Bio/Phenotype/OMIM/OMIMparser.pm:531
STACK Bio::Phenotype::OMIM::OMIMparser::next_phenotype 
blib/lib/Bio/Phenotype/OMIM/OMIMparser.pm:272
STACK toplevel t/OMIMparser.t:31

--------------------------------------
t/OMIMparser....dubious                                                      
        Test returned status 25 (wstat 6400, 0x1900)
DIED. FAILED tests 2-173
        Failed 172/173 tests, 0.58% okay
Failed Test    Stat Wstat Total Fail  Failed  List of Failed
-------------------------------------------------------------------------------
t/OMIMparser.t   25  6400   173  172  99.42%  2-173
Failed 1/1 test scripts, 0.00% okay. 172/173 subtests failed, 0.58% okay.
make: *** [test_OMIMparser] Error 2
[5:28pm]allenday@sumo:/raid5a/allenday/cvsroot/bioperl-live>