From Postmaster at conviron.com  Mon Mar  1 06:12:12 2004
From: Postmaster at conviron.com (Postmaster@conviron.com)
Date: Mon Mar  1 06:21:14 2004
Subject: [Bioperl-l] MDaemon Notification -- Attachment Removed
Message-ID: <MDAEMON0735200403010512.AA1212187@conviron.com>

The following message contained restricted attachment(s) which have been removed:

>From      : bioperl-l@bioperl.org
To        : backup@conviron.com
Subject   : Re: Hello
Message-ID: 

Attachment(s) removed:
-----------------------------------------
your_picture.pif


From Postmaster at conviron.com  Mon Mar  1 06:12:12 2004
From: Postmaster at conviron.com (Postmaster@conviron.com)
Date: Mon Mar  1 06:21:15 2004
Subject: [Bioperl-l] MDaemon Notification -- Attachment Removed
Message-ID: <MDAEMON0729200403010512.AA1212137@conviron.com>

The following message contained restricted attachment(s) which have been removed:

>From      : bioperl-l@bioperl.org
To        : ed@conviron.com
Subject   : Re: Hello
Message-ID: 

Attachment(s) removed:
-----------------------------------------
your_picture.pif


From avilella at ebi.ac.uk  Mon Mar  1 07:53:55 2004
From: avilella at ebi.ac.uk (avilella)
Date: Mon Mar  1 08:00:16 2004
Subject: [Bioperl-l] Bio::PopGen::Statistics discrete calculations
Message-ID: <1078145634.5352.29.camel@localhost.localdomain>

Hi,

I'm interested in calculating some of the statistics in
Bio::PopGen::Statistics in a discrete, position-wise, way.

For example,

I've done a quick implementation of the method "pi" that returns the
value of the statistic for each polymorphic positions instead of the
mean of all of them. It's "pi_discrete" method:

http://www.ebi.ac.uk/cvs/cvspublic/wallace/bioperl-live/Bio/PopGen/Statistics.pm?rev=1.4&content-type=text/x-cvsweb-markup

I would also like to have methods to do Sliding Window calculations for
these statistics. Something like:

"Calculate tajima_D for windows of 100 polymorphisms that slide 10
positions"

Any comments about the logic, naming, interest of this methods?

Thanks,

    Albert.

-- 
Albert Vilella Bertran    avilella_at_ebi_ac_uk
EMBL Outstation, European Bioinformatics Institute
Wellcome Trust Genome Campus, Hinxton
Cambs. CB10 1SD, United Kingdom
Phone: +44 (0)1223 494 448   FAX: +44 (0)1223 494 468

From bazin at univ-montp2.fr  Mon Mar  1 07:57:06 2004
From: bazin at univ-montp2.fr (Eric Bazin)
Date: Mon Mar  1 08:02:01 2004
Subject: [Bioperl-l] load_seqdatabase.pl problem
References: <FE9E1DF1-662B-11D8-A55D-000A959EB4C4@gnf.org>
	<403B9762.1060007@univ-montp2.fr>
Message-ID: <40433322.2080403@univ-montp2.fr>

Hi.

I try to import embl sequence datas in a biosql schema using the perl
script "load_seqdatabase.pl". Previously I loaded without any problem
the taxon schema from the NCBI with load_ncbi_taxonomy.pl. Can anybody give
me a help for this?

This is what I received:

bash#./load_seqdatabase.pl --dbname polymorphix --format embl
/home/eric/polybase.dat

Could not store AB022832:
------------- EXCEPTION  -------------
MSG: Unique key query in Bio::DB::BioSQL::SpeciesAdaptor returned 3 rows
instead of 1. Query was [name_class="scie
ntific name",binomial="Aspergillus oryzae"]
STACK Bio::DB::BioSQL::BasePersistenceAdaptor::_find_by_unique_key
/usr/lib/perl5/site_perl/5.8.0/Bio/DB/BioSQL/Ba
sePersistenceAdaptor.pm:960
STACK Bio::DB::BioSQL::BasePersistenceAdaptor::find_by_unique_key
/usr/lib/perl5/site_perl/5.8.0/Bio/DB/BioSQL/Bas
ePersistenceAdaptor.pm:856
STACK Bio::DB::BioSQL::BasePersistenceAdaptor::create
/usr/lib/perl5/site_perl/5.8.0/Bio/DB/BioSQL/BasePersistence
Adaptor.pm:183
STACK Bio::DB::Persistent::PersistentObject::create
/usr/lib/perl5/site_perl/5.8.0/Bio/DB/Persistent/PersistentObj
ect.pm:243
STACK Bio::DB::BioSQL::BasePersistenceAdaptor::create
/usr/lib/perl5/site_perl/5.8.0/Bio/DB/BioSQL/BasePersistence
Adaptor.pm:170
[root@thaler biosql]# :BasePersistenceAdaptor::store
/usr/lib/perl5/site_perl/5.8.0/Bio/DB/BioSQL/BasePersistenceA
daptor.pm:253
STACK Bio::DB::Persistent::PersistentObject::store
/usr/lib/perl5/site_perl/5.8.0/Bio/DB/Persistent/PersistentObje
ct.pm:270
STACK (eval) ./load_seqdatabase.pl:517
STACK toplevel ./load_seqdatabase.pl:500

--------------------------------------

Thanks a lot

Eric bazin


From avilella at ebi.ac.uk  Mon Mar  1 08:33:33 2004
From: avilella at ebi.ac.uk (avilella)
Date: Mon Mar  1 08:39:58 2004
Subject: [Bioperl-l] new Bio::Installer modules in bioperl-run
Message-ID: <1078148013.5352.42.camel@localhost.localdomain>

Hi,

I've been playing with a set of modules to make the installation of
bioperl external programes easier.

These are modules with methods to basically download, decompress,
compile and install the program that is meant to be called by bioperl.

For example, for TCoffee, there would be a script in bioperl-run/scripts
like:

install_tcoffee.PLS
-------------------
use strict;
use Bio::Installer::TCoffee;

# Install in the directory where the script is called
use Cwd;
my $destination_install_dir = getcwd;
my $installer = new Bio::Installer::TCoffee(
                                            -destination_install_dir =>
$destination_install_dir);
$installer->download();
$installer->install();

1;
--------------------

that would download, decompress and compile TCoffee to the preferred
directory.

Feedback is welcomed,

Thanks,

    Albert.

From todd.harris at cshl.edu  Mon Mar  1 08:56:00 2004
From: todd.harris at cshl.edu (Todd Harris)
Date: Mon Mar  1 09:01:55 2004
Subject: [Bioperl-l] new Bio::Installer modules in bioperl-run
In-Reply-To: <1078148013.5352.42.camel@localhost.localdomain>
Message-ID: <BC689D10.C440%todd.harris@cshl.edu>

Hi Albert - 

This would be a nice convenience.

Do these handle the (potentially) many system-dependent issues as well?  Or
can you just generically pass configure/make/test/install paramaters to the
installer object? Also, a superclass would be nice that creates an installer
factory for those wishing to build lots of things from a single script.

todd

> On 3/1/04 7:33 AM, avilella wrote:

> Hi,
> 
> I've been playing with a set of modules to make the installation of
> bioperl external programes easier.
> 
> These are modules with methods to basically download, decompress,
> compile and install the program that is meant to be called by bioperl.
> 
> For example, for TCoffee, there would be a script in bioperl-run/scripts
> like:
> 
> install_tcoffee.PLS
> -------------------
> use strict;
> use Bio::Installer::TCoffee;
> 
> # Install in the directory where the script is called
> use Cwd;
> my $destination_install_dir = getcwd;
> my $installer = new Bio::Installer::TCoffee(
>                                           -destination_install_dir =>
> $destination_install_dir);
> $installer->download();
> $installer->install();
> 
> 1;
> --------------------
> 
> that would download, decompress and compile TCoffee to the preferred
> directory.
> 
> Feedback is welcomed,
> 
> Thanks,
> 
>   Albert.
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

From avilella at ebi.ac.uk  Mon Mar  1 09:14:53 2004
From: avilella at ebi.ac.uk (avilella)
Date: Mon Mar  1 09:21:22 2004
Subject: [Bioperl-l] new Bio::Installer modules in bioperl-run
In-Reply-To: <BC689D10.C440%todd.harris@cshl.edu>
References: <BC689D10.C440%todd.harris@cshl.edu>
Message-ID: <1078150493.5352.88.camel@localhost.localdomain>

On Mon, 2004-03-01 at 13:56, Todd Harris wrote:
> Hi Albert - 
> 
> This would be a nice convenience.
> 
> Do these handle the (potentially) many system-dependent issues as well?

Nope. It only checks for:

($^O =~ /dec_osf|linux|unix|bsd|solaris|darwin/i)

or strictly:

/linux/

at the points where it does system-dependent calls... for the moment.

Has bioperl an estimate of XX% of OSes where is mostly installed/used?

>   Or
> can yo

> u just generically pass configure/make/test/install paramaters to the
> installer object? Also, a superclass would be nice that creates an installer
> factory for those wishing to build lots of things from a single script.

There is a Bio::Installer::Generic for the common calls, like ("tar xzf
package.tar.gz")... But very "simplistic" right now...

http://www.ebi.ac.uk/cvs/cvspublic/wallace/bioperl-run/Bio/Installer/

Still tweaking and adding some bits,

    Albert.

> 
> todd
> 
> > On 3/1/04 7:33 AM, avilella wrote:
> 
> > Hi,
> > 
> > I've been playing with a set of modules to make the installation of
> > bioperl external programes easier.
> > 
> > These are modules with methods to basically download, decompress,
> > compile and install the program that is meant to be called by bioperl.
> > 
> > For example, for TCoffee, there would be a script in bioperl-run/scripts
> > like:
> > 
> > install_tcoffee.PLS
> > -------------------
> > use strict;
> > use Bio::Installer::TCoffee;
> > 
> > # Install in the directory where the script is called
> > use Cwd;
> > my $destination_install_dir = getcwd;
> > my $installer = new Bio::Installer::TCoffee(
> >                                           -destination_install_dir =>
> > $destination_install_dir);
> > $installer->download();
> > $installer->install();
> > 
> > 1;
> > --------------------
> > 
> > that would download, decompress and compile TCoffee to the preferred
> > directory.
> > 
> > Feedback is welcomed,
> > 
> > Thanks,
> > 
> >   Albert.
> > 
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> > 
> 

From crabtree at tigr.org  Mon Mar  1 10:02:10 2004
From: crabtree at tigr.org (Jonathan Crabtree)
Date: Mon Mar  1 10:08:53 2004
Subject: [Bioperl-l] bioperl graphics
In-Reply-To: <200402271638.37134.lstein@cshl.edu>
References: <1077720594.403cb61235be7@webmail.njit.edu>
	<403CD165.4090309@tigr.org> <200402271638.37134.lstein@cshl.edu>
Message-ID: <40435072.8050206@tigr.org>


Hi Lincoln-

Thanks for the quick response.  I thought there must have been a good 
reason for using colorAllocate(), but now I'm not sure that this is 
true.  I've been able to generate valid SVG documents using GD::SVG and 
a copy of Bio::Graphics::Panel modified in the way that I described to 
Haibo, so I don't think that my proposed change breaks SVG output 
outright.  I've been using GD-SVG-0.25, which already supports 
colorResolve().  Here are lines 292-297 of SVG.pm:

sub GD::SVG::Image::colorResolve {
  my ($self,$r,$g,$b) = @_;
  ###GD###my $index = $self->{gd}->colorResolve($r,$g,$b);
  my $index = $self->colorAllocate($r,$g,$b);
  return $index;
}

I haven't looked at any past versions of GD::SVG, however, so perhaps 
this is a recent change.

I do like your suggestion about using multiple GD::Image objects and 
then copying the various Panel GD::Images into a single target 
GD::Image.  However, this approach definitely won't work with GD::SVG, 
which does not yet support any of the GD image copying methods (AFAIK, 
again based on the GD-SVG-0.25 code that I grabbed from CPAN.)  I'm not 
crazy about my current method, which involves manipulating pad_left, 
pad_top, etc., to arrange the various Bio::Graphics::Panels in their 
appropriate locations.  On the other hand, the API for 
Bio::Graphics::Panel does explicitly support the use of an 
externally-supplied GD::Image object when calling gd(), so I don't feel 
as though I'm abusing the module too much.  Using a single GD::Image (or 
SVG::GD::Image) also makes for a (marginally!) faster and more 
space-efficient solution.  Regards,

Jonathan


Lincoln Stein wrote:

>-----BEGIN PGP SIGNED MESSAGE-----
>Hash: SHA1
>
>Sorry, but if you change colorAllocate() to colorResolve(), you will 
>break the ability to generate publication-quality images with 
>GD::SVG.  Perhaps Todd Harris will add colorResolve() to a future 
>version of GD::SVG, in which case I will make the suggested change to 
>Bio::Graphics.
>
>I would recommend instead making two Bio::Graphics::Panel objects, and 
>generating a pair of GD objects (using the Panel->gd() method).  Then 
>you can combine them onto a third GD object in whatever geometry you 
>want by using GD->copy()
>
>Lincoln
>
>On Wednesday 25 February 2004 06:46 pm, Jonathan Crabtree wrote:
>  
>
>>Haibo-
>>
>>hz5@njit.edu wrote:
>>    
>>
>>>Is there any way to render 2 Bio::Graphics::Panel into one png
>>>image? because I want 2 different arrows with different labeled
>>>coordinates on the same image and align to the left, but one
>>>Panel can only have one coordinates system.
>>>      
>>>
>>The answer is yes, with a couple of caveats.  The first is that you
>>will have to take care of the layout of the individual
>>Panel-generated images.  If you're left-justifying everything then
>>this should be easy enough.  The second is that I would recommend
>>making a one-line change to Bio/Graphics/Panel.pm, to prevent the
>>package from trying to allocate the same set of colors twice (when
>>you reuse the same GD object to draw the two different parts of the
>>image.)  Search for the following piece of code in Panel.pm (at
>>line 411 in bioperl-1.4, I think):
>>
>>  for my $name ('white','black',keys %COLORS) {
>>    my $idx = $gd->colorAllocate(@{$COLORS{$name}});
>>    $translation_table{$name} = $idx;
>>  }
>>
>>Change "colorAllocate" to "colorResolve"; this should have no
>>effect on any existing Bio::Graphics code (AFAIK) and will allow
>>you to do your two (or three or four)-Panel trick.  (As an aside,
>>I'd like to lobby for this one-line change to be made in a future
>>version of
>>Bio::Graphics::Panel, for precisely this reason.)  In any case,
>>once you've made that change and reinstalled your copy of Bioperl,
>>here is a rough outline of what you need to do:
>>
>>1. Set up your individual Bio::Graphics::Panel objects (e.g. $p1,
>>$p2, $p3, etc.) as desired to draw your images, but do *not* call
>>the gd method on any of them yet.
>>
>>2. Create a GD::Image object big enough to hold the images that
>>will be drawn by $p1, $p2, $p3, etc.:
>>    my $gdImg = GD::Image->new($fullWidth, $fullHeight);
>>(Note: use $p1->width(), $p1->height(), etc., to determine what
>>$fullWidth and $fullHeight should be, based on your desired Panel
>>layout algorithm.)
>>
>>4. Use a "dummy" Bio::Graphics::Panel object to allocate all your
>>colors (this is an optional step; I do this because my code does
>>some drawing that isn't handled by Bio::Graphics::Panel, and want
>>to make sure that the palette has been allocated before I start):
>>
>>    my $dummyPanel = Bio::Graphics::Panel->new(-length => 100,
>>-offset => 0, -width => $fullWidth);
>>    $dummyPanel->gd($gdImg); # forces color allocation
>>
>>5. Draw the individual panels and generate your png image:
>>
>>    $p1->gd($gdImg);
>>    $p2->gd($gdImg);
>>    my $pngData = $gdImg->png();
>>
>>I've glossed over some of the details here, for example the fact
>>that you may need to know the value of $p1->height() before you can
>>initialize $p2, but that's the basic idea.  I've been using this
>>method to generate some comparative sequence displays and while
>>it's definitely a bit of a hack, it works well in practice.  You
>>can also do the same thing with a GD::SVG::Image if you'd like to
>>generate SVG output.  Good luck,
>>
>>Jonathan
>>
>>
>>
>>_______________________________________________
>>Bioperl-l mailing list
>>Bioperl-l@portal.open-bio.org
>>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>    
>>
>
>- -- 
>Lincoln D. Stein
>Cold Spring Harbor Laboratory
>1 Bungtown Road
>Cold Spring Harbor, NY 11724
>-----BEGIN PGP SIGNATURE-----
>Version: GnuPG v1.2.1 (GNU/Linux)
>
>iD8DBQFAP1Zt0CIvUP7P+AkRAreyAJ0XIcjMDeT/Bw69OBOEhD8tsznP+QCfVLWo
>+RnQaijXxPlVWTbmjTkbHYw=
>=lN1U
>-----END PGP SIGNATURE-----
>  
>


From jegreenwood25 at hotmail.com  Mon Mar  1 10:28:13 2004
From: jegreenwood25 at hotmail.com (Jonathan Greenwood)
Date: Mon Mar  1 10:34:07 2004
Subject: [Bioperl-l] Question
Message-ID: <Law11-F33t7yiGh05YF000042b1@hotmail.com>

Hi there, I made a png file with clickable glyphs, now instead of having the 
description of the feature underneath the glyph, I would like to make it 
appear as those pop-up boxes that appear when you place the cursor over the 
glyph. So far the code works but it only takes the first assigned value and 
ignores the rest(which is a problem if there is more than one of the same 
feature). Please help!!

Sincerely,

Jonathan Greenwood
email: jegreenwood25@hotmail.com

Code:

#! /usr/local/bin/perl -w

use strict;
use Bio::Graphics;
use Bio::SeqIO;
use Bio::SeqFeature::Generic;
use CGI qw / :standard /;
use CGI::Pretty;

my $io = Bio::SeqIO->new(-file   => 'x65306.gb',
			 -format => 'Genbank');
my $seq = $io->next_seq;
my $wholeseq = Bio::SeqFeature::Generic->new(-start=>1,
					     -end=>$seq->length);
my @features = $seq->all_SeqFeatures;

# sort features by their primary tags
my %sorted_features;
for my $f (@features) {
  my $tag = $f->primary_tag;
  push @{$sorted_features{$tag}},$f;
}

my $panel = Bio::Graphics::Panel->new(-length       => $seq->length,
				      -width        => 1000,
				      -pad_left     => 10,
				      -pad_right    => 10,
				      -key_style    => 'none',
				      -spacing      => -0.25,
				      -box_subparts => 'true'
				      );

$panel->add_track($wholeseq,
		  -glyph  => 'arrow',
		  -bump   => +1,
		  -double => 1,
		  -tick   => 2
	          );

# special feature
if ($sorted_features{CDS}) {
  $panel->add_track($sorted_features{CDS},
		    -glyph          => 'transcript2',
		    -bgcolor        => 'orange',
		    -bump           =>  0,
		    -height         => 12,
		    -title          => cds_parsing(),
		    -link           => 
'http://mgcheo.med.uottawa.ca/cgi-bin/feature.pl#CDS/',
		    );
  delete $sorted_features{'CDS'};
}

# general case

my @colors = qw(wheat blue yellow green cyan purple chartreuse magenta 
gray);
my $idx    = 0;
for my $tag (sort keys %sorted_features) {
my $features = $sorted_features{$tag};
delete $sorted_features{source};
$panel->add_track($features,
		  -glyph        =>  'generic',
		  -bgcolor      =>  $colors[$idx++ % @colors],
		  -fgcolor      =>  'black',
		  -bump         => 0,
		  -height       => 12,
                  -title        => other_parsing(),
		  -link         => '/cgi-bin/feature.pl',
		 );
}

my ($url,$map,$mapname) = $panel->image_and_map(-url => 'tmpfiles');

print "Content-type: text/html\n\n";

print <<END_OF_SCRIPT;
<HTML>
<HEAD>
<H1 ALIGN="CENTER"> These links don't function in the way they are supposed 
to...</H1>
</HEAD>
<BODY>
<IMG SRC = "/$url" USEMAP = "#$mapname" ISMAP>
$map
</BODY>
</HTML>
END_OF_SCRIPT

exit 0;

sub cds_parsing {
my $seqio_object = Bio::SeqIO->new(-file => 'x65306.gb');
my $seq_object = $seqio_object->next_seq;

foreach my $feat_object ($seq_object->get_SeqFeatures) {
	  if ($feat_object->primary_tag eq "CDS") {
	    foreach my $val ($feat_object->get_tag_values('product')){
	      if ($feat_object->has_tag('product')) {
		return $val;
	      }
	    }
	  }
	}
}

sub other_parsing {
return "Still needs some work";
}

_________________________________________________________________
Protect your PC - get McAfee.com VirusScan Online  
http://clinic.mcafee.com/clinic/ibuy/campaign.asp?cid=3963

From todd.harris at cshl.edu  Mon Mar  1 10:43:37 2004
From: todd.harris at cshl.edu (Todd Harris)
Date: Mon Mar  1 10:49:44 2004
Subject: [Bioperl-l] bioperl graphics
In-Reply-To: <40435072.8050206@tigr.org>
Message-ID: <BC68B649.C458%todd.harris@cshl.edu>

> On 3/1/04 9:02 AM, Jonathan Crabtree wrote:

Hi Jason - 

Well, that's great news!  I couldn't remember if I had added colorResolve to
GD::SVG.  Whoops!  As you can see, it's something of a kludge, but I'm glad
to hear that it works for you.

Regarding Lincoln's suggestion of copying images, correct me if I'm wrong,
but I interpreted that to mean copying the images OUTSIDE of
Bio::Graphics::Panel.  That is, create your Panel, fetch it, then paste it
where appropriate into a new GD::Image with the appropriate dimensions. To
avoid the method-overriding aspects of GD::SVG, this might need to be done
within a separate package or eval block.

I'm planning a significant upgrade to GD::SVG over the next few weeks that
should 1) enable parallel creation of a GD object; 2) make it compatible
with more current GD methods, and 3) make it more robust for future methods.

t


> 
> Hi Lincoln-
> 
> Thanks for the quick response.  I thought there must have been a good
> reason for using colorAllocate(), but now I'm not sure that this is
> true.  I've been able to generate valid SVG documents using GD::SVG and
> a copy of Bio::Graphics::Panel modified in the way that I described to
> Haibo, so I don't think that my proposed change breaks SVG output
> outright.  I've been using GD-SVG-0.25, which already supports
> colorResolve().  Here are lines 292-297 of SVG.pm:
> 
> sub GD::SVG::Image::colorResolve {
> my ($self,$r,$g,$b) = @_;
> ###GD###my $index = $self->{gd}->colorResolve($r,$g,$b);
> my $index = $self->colorAllocate($r,$g,$b);
> return $index;
> }
> 
> I haven't looked at any past versions of GD::SVG, however, so perhaps
> this is a recent change.
> 
> I do like your suggestion about using multiple GD::Image objects and
> then copying the various Panel GD::Images into a single target
> GD::Image.  However, this approach definitely won't work with GD::SVG,
> which does not yet support any of the GD image copying methods (AFAIK,
> again based on the GD-SVG-0.25 code that I grabbed from CPAN.)  I'm not
> crazy about my current method, which involves manipulating pad_left,
> pad_top, etc., to arrange the various Bio::Graphics::Panels in their
> appropriate locations.  On the other hand, the API for
> Bio::Graphics::Panel does explicitly support the use of an
> externally-supplied GD::Image object when calling gd(), so I don't feel
> as though I'm abusing the module too much.  Using a single GD::Image (or
> SVG::GD::Image) also makes for a (marginally!) faster and more
> space-efficient solution.  Regards,
> 
> Jonathan
> 
> 
> Lincoln Stein wrote:
> 
>> -----BEGIN PGP SIGNED MESSAGE-----
>> Hash: SHA1
>> 
>> Sorry, but if you change colorAllocate() to colorResolve(), you will
>> break the ability to generate publication-quality images with
>> GD::SVG.  Perhaps Todd Harris will add colorResolve() to a future
>> version of GD::SVG, in which case I will make the suggested change to
>> Bio::Graphics.
>> 
>> I would recommend instead making two Bio::Graphics::Panel objects, and
>> generating a pair of GD objects (using the Panel->gd() method).  Then
>> you can combine them onto a third GD object in whatever geometry you
>> want by using GD->copy()
>> 
>> Lincoln
>> 
>> On Wednesday 25 February 2004 06:46 pm, Jonathan Crabtree wrote:
>>  
>> 
>>> Haibo-
>>> 
>>> hz5@njit.edu wrote:
>>>    
>>> 
>>>> Is there any way to render 2 Bio::Graphics::Panel into one png
>>>> image? because I want 2 different arrows with different labeled
>>>> coordinates on the same image and align to the left, but one
>>>> Panel can only have one coordinates system.
>>>>      
>>>> 
>>> The answer is yes, with a couple of caveats.  The first is that you
>>> will have to take care of the layout of the individual
>>> Panel-generated images.  If you're left-justifying everything then
>>> this should be easy enough.  The second is that I would recommend
>>> making a one-line change to Bio/Graphics/Panel.pm, to prevent the
>>> package from trying to allocate the same set of colors twice (when
>>> you reuse the same GD object to draw the two different parts of the
>>> image.)  Search for the following piece of code in Panel.pm (at
>>> line 411 in bioperl-1.4, I think):
>>> 
>>>  for my $name ('white','black',keys %COLORS) {
>>>    my $idx = $gd->colorAllocate(@{$COLORS{$name}});
>>>    $translation_table{$name} = $idx;
>>>  }
>>> 
>>> Change "colorAllocate" to "colorResolve"; this should have no
>>> effect on any existing Bio::Graphics code (AFAIK) and will allow
>>> you to do your two (or three or four)-Panel trick.  (As an aside,
>>> I'd like to lobby for this one-line change to be made in a future
>>> version of
>>> Bio::Graphics::Panel, for precisely this reason.)  In any case,
>>> once you've made that change and reinstalled your copy of Bioperl,
>>> here is a rough outline of what you need to do:
>>> 
>>> 1. Set up your individual Bio::Graphics::Panel objects (e.g. $p1,
>>> $p2, $p3, etc.) as desired to draw your images, but do *not* call
>>> the gd method on any of them yet.
>>> 
>>> 2. Create a GD::Image object big enough to hold the images that
>>> will be drawn by $p1, $p2, $p3, etc.:
>>>    my $gdImg = GD::Image->new($fullWidth, $fullHeight);
>>> (Note: use $p1->width(), $p1->height(), etc., to determine what
>>> $fullWidth and $fullHeight should be, based on your desired Panel
>>> layout algorithm.)
>>> 
>>> 4. Use a "dummy" Bio::Graphics::Panel object to allocate all your
>>> colors (this is an optional step; I do this because my code does
>>> some drawing that isn't handled by Bio::Graphics::Panel, and want
>>> to make sure that the palette has been allocated before I start):
>>> 
>>>    my $dummyPanel = Bio::Graphics::Panel->new(-length => 100,
>>> -offset => 0, -width => $fullWidth);
>>>    $dummyPanel->gd($gdImg); # forces color allocation
>>> 
>>> 5. Draw the individual panels and generate your png image:
>>> 
>>>    $p1->gd($gdImg);
>>>    $p2->gd($gdImg);
>>>    my $pngData = $gdImg->png();
>>> 
>>> I've glossed over some of the details here, for example the fact
>>> that you may need to know the value of $p1->height() before you can
>>> initialize $p2, but that's the basic idea.  I've been using this
>>> method to generate some comparative sequence displays and while
>>> it's definitely a bit of a hack, it works well in practice.  You
>>> can also do the same thing with a GD::SVG::Image if you'd like to
>>> generate SVG output.  Good luck,
>>> 
>>> Jonathan
>>> 
>>> 
>>> 
>>> _______________________________________________
>>> Bioperl-l mailing list
>>> Bioperl-l@portal.open-bio.org
>>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>>    
>>> 
>> 
>> - -- 
>> Lincoln D. Stein
>> Cold Spring Harbor Laboratory
>> 1 Bungtown Road
>> Cold Spring Harbor, NY 11724
>> -----BEGIN PGP SIGNATURE-----
>> Version: GnuPG v1.2.1 (GNU/Linux)
>> 
>> iD8DBQFAP1Zt0CIvUP7P+AkRAreyAJ0XIcjMDeT/Bw69OBOEhD8tsznP+QCfVLWo
>> +RnQaijXxPlVWTbmjTkbHYw=
>> =lN1U
>> -----END PGP SIGNATURE-----
>>  
>> 
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

From crabtree at tigr.org  Mon Mar  1 11:55:02 2004
From: crabtree at tigr.org (Jonathan Crabtree)
Date: Mon Mar  1 12:01:49 2004
Subject: [Bioperl-l] bioperl graphics
In-Reply-To: <BC68B649.C458%todd.harris@cshl.edu>
References: <BC68B649.C458%todd.harris@cshl.edu>
Message-ID: <40436AE6.7000303@tigr.org>


Hi Todd-

Todd Harris wrote:

>>On 3/1/04 9:02 AM, Jonathan Crabtree wrote:
>>    
>>
>
>Hi Jason - 
>
>Well, that's great news!  I couldn't remember if I had added colorResolve to
>GD::SVG.  Whoops!  As you can see, it's something of a kludge, but I'm glad
>to hear that it works for you.
>  
>

Not at all; it works just fine, and I think GD::SVG is a great way to 
generate SVG documents from GD-aware code.  The only problem I see with 
the current implementation of colorResolve() is that it doesn't try to 
determine whether the requested color has already been "allocated". 
 This breaks the documented API, although in a rather minor way; it's 
unlikely that anyone would write code that relies on subsequent 
identical calls to colorResolve() returning the same color index.  It 
also means that the 'colors_added' array will be larger than it needs 
to, and that colorsTotal() will return a misleading result, though again 
these are both very minor issues.  I just noticed that the man page does 
state that colorResolve is unimplemented, so perhaps this is what caused 
the confusion about the method's disposition:

     $index = $image->colorResolve(red,green,blue)
           NOT IMPLEMENTED
 

>Regarding Lincoln's suggestion of copying images, correct me if I'm wrong,
>but I interpreted that to mean copying the images OUTSIDE of
>Bio::Graphics::Panel.  That is, create your Panel, fetch it, then paste it
>where appropriate into a new GD::Image with the appropriate dimensions. To
>avoid the method-overriding aspects of GD::SVG, this might need to be done
>within a separate package or eval block.
>

Yes, I think my understanding of Lincoln's suggestion is the same as 
yours.  I'm not too familiar with SVG, but it seems as though it might 
be possible to do a pure SVG implementation of copy() using nothing more 
than a clipping path (if only a portion of the source image is to be 
copied), a transform, and an appropriate grouping of the source 
elements.  However, even if GD::SVG::Image supported copy(), I still 
think I'd lean towards keeping my current multi-Panel implementation, 
using only a single GD::Image or GD::SVG::Image, and relying on passing 
the same image object to the gd() method of multiple panels.  The 
copying seems like an unnecessary step when the current 
Bio::Graphics::Panel API supports reusing a single image object (and the 
corresponding code can be made to support it with a 1-line change.)

>I'm planning a significant upgrade to GD::SVG over the next few weeks that
>should 1) enable parallel creation of a GD object; 2) make it compatible
>with more current GD methods, and 3) make it more robust for future methods.
>  
>

That sounds great.  Thanks for the feedback,

Jonathan


From Anthony.Underwood at hpa.org.uk  Mon Mar  1 13:27:06 2004
From: Anthony.Underwood at hpa.org.uk (SRMD, Col - Underwood, Anthony)
Date: Mon Mar  1 13:20:18 2004
Subject: [Bioperl-l] Traces as X, Y coordinates
Message-ID: <427DA33A6101484BA7A72934AF90DF73051AA816@exchange1.phls.org.uk>

Hi there,

Please does anyone have experience with using the Bio::Seq::SequenceTrace
and Bio::SeqIO::scf modules to extract peak values for the different traces
(G,A,T and C) which can then be plotted using a graphics module along with
the bases themselves at the appropriate positions. I'm a little unclear as
to how the trace, peak indices, peak_index_at, base_at etc methods work.

If someone has exprience in how to go from an scf file to arrays of values
which represent x, y coords and bases, quality at each x coord then I'd
really appreciate your advice and insight.

Many thanks,

Anthony

Dr Anthony Underwood
Bioinformatics Group
Genomics, Proteomic and Bioinformatics Unit
Central Public Health Laboratory
Health Protection Agency
61 Colindale Avenue
London
NW9 5HT
t:    0208 2004400 ext. 3618
f:    0208 3583138
e: anthony.underwood@hpa.org.uk


From rnilsson at clarku.edu  Mon Mar  1 14:27:35 2004
From: rnilsson at clarku.edu (henrik nilsson)
Date: Mon Mar  1 14:34:04 2004
Subject: [Bioperl-l] Fetching > 500 sequences
Message-ID: <200403011427.35507.rnilsson@clarku.edu>

Hi,

It seems that I have problems with fetching more than 500 sequences from 
Genbank using Bioperl. It looks like the script (attached below) fetches all 
the 7000+ sequences, but only 500 make it to the output file. Is there any 
way to get all these 7000+ sequences written to the file - that is, is it 
possible to sidestep the 500 seq. limit?

Thanks for your time,

Rolf


Please find the script below. When I run it, I get

Writing accession number AJ406491
... etc ...
Writing accession number AJ406489
Writing accession number AJ406471
Writing accession number AJ406465
Writing accession number AJ406461
Total number of records found = 7053

but when I type 

[rolf@localhost dir]$ cat data.gb | grep 'BASE COUNT' | wc -l
    500
[rolf@localhost dir]$

It is clear that only 500 seq. were written to the file.

#!/usr/bin/perl -w
use strict;
use Bio::DB::GenBank;
use Bio::DB::Query::GenBank;
use IO::String;
use Bio::SeqIO;
use Bio::Seq::RichSeq;


my $query_string = 'Boletales';

my $query = Bio::DB::Query::GenBank->new(-db=>'nucleotide',
                                         -query=>$query_string);
my $out = Bio::SeqIO->new(-file=>">data.gb", -format=>'genbank');

my $count = $query->count;

my $gb = new Bio::DB::GenBank();

my $stream = $gb->get_Stream_by_query($query);

while (my $seq = $stream->next_seq) {
        print "Writing accession number ", $seq->accession_number,"\n";
        $out->write_seq($seq);
        }

print "Total number of records found = $count\n";

exit;


From hlapp at gnf.org  Mon Mar  1 14:57:51 2004
From: hlapp at gnf.org (Hilmar Lapp)
Date: Mon Mar  1 15:03:45 2004
Subject: [Bioperl-l] Re: undef strand and biosql
In-Reply-To: <BAY12-F26K8rBKXGA7z000033b5@hotmail.com>
Message-ID: <B8BB535E-6BBA-11D8-863F-000A959EB4C4@gnf.org>

Hm. Strand being undef is a soft violation, in the sense that there is  
a value defined for 'unknown strand' (namely 0).

Bottom line is that an undef strand should not lead to failure. I'll  
either fix bioperl-db to automatically substitute 0 for undef, or we  
could fix bioperl to return 0 if the strand hasn't been set yet.

The latter is risky because then it is difficult to impossible to ask  
whether the strand has already been set but is unknown, or whether it's  
never been set and therefore you want to set a default.

Any votes from the bioperl'ers out there on this question? I'm afraid  
my vote would go for not returning 0 by default in the location object.

Actually, we could also put a DEFAULT 0 in the column definition in the  
schema DDL. That doesn't necessarily fix it in all RDBMSs though if you  
directly assign NULL to the column (e.g., in Oracle). Any votes on that  
from the biosql community?

	-hilmar

On Monday, March 1, 2004, at 09:21  AM, matthieu CONTE wrote:

> I solve the problem of 'source_term_id'  by create -source_tag =>  
> 'tigr'...and it seems to be ok......
>
> "$seq->add_SeqFeature(new Bio::SeqFeature::Generic(
> 		-primary => 'RIGHT_UTR',
> 		-source_tag => 'tigr',
> 		-start   ..........
> "
> But it still have a  problem of undef column 'strand'
> Maybe in :
> $loc->add_sub_Location(new Bio::Location::Simple(
> 		-start => $e->{'start'},
> 		-end   => $e->{'end'},
>                                -strand => ????
> ));
> but I?m also confuse with the tag: ?strand? of Bio::SeqFeature::Generic
>
> I have this warning:
> -------------------- WARNING ---------------------
> MSG: insert in Bio::DB::BioSQL::LocationAdaptor (driver) failed,  
> values were ("1","318","","1") FKs (16957,<NULL>)
> Column 'strand' cannot be null
> ---------------------------------------------------
> Could not store 8356.t04040:
> ------------- EXCEPTION  -------------
> MSG: create: object (Bio::Location::Simple) failed to insert or to be  
> found by unique key
> STACK Bio::DB::BioSQL::BasePersistenceAdaptor::create  
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/DB/BioSQL/ 
> BasePersistenceAdaptor.pm:207
> STACK Bio::DB::BioSQL::BasePersistenceAdaptor::store  
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/DB/BioSQL/ 
> BasePersistenceAdaptor.pm:253
> STACK Bio::DB::Persistent::PersistentObject::store  
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/DB/Persistent/ 
> PersistentObject.pm:270
> STACK Bio::DB::BioSQL::SeqFeatureAdaptor::store_children  
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/DB/BioSQL/ 
> SeqFeatureAdaptor.pm:278
> STACK Bio::DB::BioSQL::BasePersistenceAdaptor::create  
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/DB/BioSQL/ 
> BasePersistenceAdaptor.pm:215
> STACK Bio::DB::BioSQL::BasePersistenceAdaptor::store  
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/DB/BioSQL/ 
> BasePersistenceAdaptor.pm:253
> STACK Bio::DB::Persistent::PersistentObject::store  
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/DB/Persistent/ 
> PersistentObject.pm:270
> STACK Bio::DB::BioSQL::SeqAdaptor::store_children  
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/DB/BioSQL/ 
> SeqAdaptor.pm:246
> STACK Bio::DB::BioSQL::BasePersistenceAdaptor::create  
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/DB/BioSQL/ 
> BasePersistenceAdaptor.pm:215
> STACK Bio::DB::BioSQL::BasePersistenceAdaptor::store  
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/DB/BioSQL/ 
> BasePersistenceAdaptor.pm:253
> STACK Bio::DB::Persistent::PersistentObject::store  
> /usr/local/ActivePerl-5.8/lib/site_perl/5.8.0/Bio/DB/Persistent/ 
> PersistentObject.pm:270
> STACK (eval) load_seqdatabase.pl:517
> STACK toplevel load_seqdatabase.pl:500
>
>
>
>
> -----------------------------------------------------------
> Matthieu CONTE
> M. Sc. in Bioinformatics from SIB
>
> CIRAD-Biotrop TA40/03
> Avenue Agropolis
> 34398 Montpellier Cedex 5
> FRANCE
>
> m_conte@hotmail.com
> tel: (33)04 67 61 60 21
> fax :(33) 4 67 61 56 05
>
> -----------------------------------------------------------
>
>
>
>
>
>> From: Hilmar Lapp <hlapp@gnf.org>
>> To: Josh Lauricha <laurichj@cs.ucr.edu>
>> CC: "matthieu CONTE" <m_conte@hotmail.com>, bioperl-l@bioperl.org
>> Subject: Re: [Bioperl-l] Re: Bio ::seqIO ::tigr
>> Date: Fri, 27 Feb 2004 16:45:13 -0800
>>
>>
>> On Thursday, February 26, 2004, at 09:45  AM, Josh Lauricha wrote:
>>
>>> Does the source_term_id refer to the source_tag()?
>>
>> Yes.
>>
>>>
>>> On Feb 26, 2004, at 9:08 AM, matthieu CONTE wrote:
>>>
>>>> [conte@bearn biosql]$ perl load_seqdatabase.pl --dbuser biosql  
>>>> --dbpass biosql --namespace orysa_tigr --format tigr  
>>>> /home/conte/pipeline_orthologues/data/orysa_tigr/chr07.xml
>>>> Loading /home/conte/pipeline_orthologues/data/orysa_tigr/chr07.xml  
>>>> ...
>>>>
>>>> -------------------- WARNING ---------------------
>>>> MSG: insert in Bio::DB::BioSQL::SeqFeatureAdaptor (driver) failed,  
>>>> values were ("","1") FKs (26216,37,<NULL>)
>>>> Column 'source_term_id' cannot be null
>>>> ---------------------------------------------------
>>>>
>>
>> What this means is that there was no $feat->source_tag set, and  
>> looking up undef resulted in undef for the foreign key :-)
>>
>> While bioperl doesn't enforce it, for biosql the source_tag() as well  
>> as the primary_tag() of a feature are mandatory and cannot be undef.  
>> Most SeqIO parsers set source_tag() to a static default if there is  
>> no value, e.g. 'EMBL/GenBank/SwissProt' if you're using FTHelper.pm.
>>
>> 	-hilmar
>> --
>> -------------------------------------------------------------
>> Hilmar Lapp                            email: lapp at gnf.org
>> GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
>> -------------------------------------------------------------
>>
>>
>
> _________________________________________________________________
> MSN Messenger : discutez en direct avec vos amis !  
> http://www.msn.fr/msger/default.asp
>
>
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From thies at itg.fzk.de  Mon Mar  1 11:40:29 2004
From: thies at itg.fzk.de (Wolf-Gerolf Thies)
Date: Mon Mar  1 15:18:13 2004
Subject: [Bioperl-l] Problems with BioPerl-1.4
Message-ID: <3.0.5.32.20040301174029.00926430@pop3.fzk.de>

Dear BioPerl experts,

I have tried to install BioPerl-1.4 on on True64UNIX (OSF-1) Version 5.1.
The standard make fails with the error:

 Make: line too long.  Stop.

Then I have tested the gnu make, but it also fails:

# /usr/local/bin/make
mkdir blib
mkdir blib/lib
mkdir blib/arch
mkdir blib/arch/auto
mkdir blib/arch/auto/Bio
mkdir blib/lib/auto
mkdir blib/lib/auto/Bio
mkdir blib/man1
mkdir blib/man3
make: execvp: /bin/sh: Arg list too long
make: *** [pm_to_blib] Error 127

Do you have any idea what might be my problem ? I join the makefile
to this mail.

With kind regards

Wolf-Gerolf Thies
-------------- next part --------------
# This Makefile is for the Bio extension to perl.
#
# It was generated automatically by MakeMaker version
# 5.45 (Revision: 1.222) from the contents of
# Makefile.PL. Don't edit this file, edit Makefile.PL instead.
#
#	ANY CHANGES MADE HERE WILL BE LOST!
#
#   MakeMaker ARGV: ()
#
#   MakeMaker Parameters:

#	ABSTRACT => q[Bioinformatics Toolkit]
#	AUTHOR => q[Bioperl Team (bioperl-l@bioperl.org)]
#	DIR => []
#	DISTNAME => q[bioperl]
#	EXE_FILES => [q[./scripts_temp/bp_load_gff.pl], q[./scripts_temp/bp_genbank2gff.pl], q[./scripts_temp/bp_bulk_load_gff.pl], q[./scripts_temp/bp_fast_load_gff.pl], q[./scripts_temp/bp_generate_histogram.pl], q[./scripts_temp/bp_pg_bulk_load_gff.pl], q[./scripts_temp/bp_process_gadfly.pl], q[./scripts_temp/bp_process_ncbi_human.pl], q[./scripts_temp/bp_process_sgd.pl], q[./scripts_temp/bp_process_wormbase.pl], q[./scripts_temp/bp_biofetch_genbank_proxy.pl], q[./scripts_temp/bp_bioflat_index.pl], q[./scripts_temp/bp_biogetseq.pl], q[./scripts_temp/bp_flanks.pl], q[./scripts_temp/bp_biblio.pl], q[./scripts_temp/bp_frend.pl], q[./scripts_temp/bp_feature_draw.pl], q[./scripts_temp/bp_search_overview.pl], q[./scripts_temp/bp_fetch.pl], q[./scripts_temp/bp_index.pl], q[./scripts_temp/bp_composite_LD.pl], q[./scripts_temp/bp_heterogeneity_test.pl], q[./scripts_temp/bp_filter_search.pl], q[./scripts_temp/bp_extract_feature_seq.pl], q[./scripts_temp/bp_seqconvert.pl], q[./scripts_temp/bp_split_seq.pl], q[./scripts_temp/bp_translate_seq.pl], q[./scripts_temp/bp_aacomp.pl], q[./scripts_temp/bp_chaos_plot.pl], q[./scripts_temp/bp_gccalc.pl], q[./scripts_temp/bp_oligo_count.pl], q[./scripts_temp/bp_local_taxonomydb_query.pl], q[./scripts_temp/bp_taxid4species.pl], q[./scripts_temp/bp_blast2tree.pl], q[./scripts_temp/bp_nrdb.pl], q[./scripts_temp/bp_mrtrans.pl], q[./scripts_temp/bp_sreformat.pl], q[./scripts_temp/bp_dbsplit.pl], q[./scripts_temp/bp_mask_by_search.pl], q[./scripts_temp/bp_mutate.pl], q[./scripts_temp/bp_pairwise_kaks.pl], q[./scripts_temp/bp_remote_blast.pl], q[./scripts_temp/bp_search2BSML.pl], q[./scripts_temp/bp_search2alnblocks.pl], q[./scripts_temp/bp_search2gff.pl], q[./scripts_temp/bp_search2tribe.pl], q[./scripts_temp/bp_seq_length.pl]]
#	NAME => q[Bio]
#	PL_FILES => { doc/makedoc.PL=>[q[bioscripts.pod], q[biodatabases.pod], q[biodesign.pod], q[bioperl.pod]] }
#	PREREQ_PM => { HTML::Entities=>q[0], File::Spec=>q[0], File::Temp=>q[0], DB_File=>q[0], IO::Scalar=>q[0], IO::String=>q[0] }
#	VERSION_FROM => q[Bio/Root/Version.pm]
#	dist => { DIST_DEFAULT=>q[all tardist], COMPRESS=>q[gzip -9f], SUFFIX=>q[.gz] }
#	realclean => { FILES=>q[./Seq.pm.bak] }

# --- MakeMaker post_initialize section:


# --- MakeMaker const_config section:

# These definitions are from config.sh (via /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/Config.pm)

# They may have been overridden via Makefile.PL or on the command line
AR = ar
CC = cc
CCCDLFLAGS =  
CCDLFLAGS =   -Wl,-rpath,/usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE
DLEXT = so
DLSRC = dl_dlopen.xs
LD = ld
LDDLFLAGS = -shared -expect_unresolved "*" -O4 -msym -std -s
LDFLAGS = 
LIBC = /usr/shlib/libc.so
LIB_EXT = .a
OBJ_EXT = .o
OSNAME = dec_osf
OSVERS = 5.1
RANLIB = :
SO = so
EXE_EXT = 
FULL_AR = /usr/bin/ar


# --- MakeMaker constants section:
AR_STATIC_ARGS = cr
NAME = Bio
DISTNAME = bioperl
NAME_SYM = Bio
VERSION = 1.4
VERSION_SYM = 1_4
XS_VERSION = 1.4
INST_BIN = blib/bin
INST_EXE = blib/script
INST_LIB = blib/lib
INST_ARCHLIB = blib/arch
INST_SCRIPT = blib/script
PREFIX = /usr/lib/perl-5.6.0
INSTALLDIRS = site
INSTALLPRIVLIB = $(PREFIX)/lib/5.6.0
INSTALLARCHLIB = $(PREFIX)/lib/5.6.0/alpha-dec_osf
INSTALLSITELIB = $(PREFIX)/lib/site_perl/5.6.0
INSTALLSITEARCH = $(PREFIX)/lib/site_perl/5.6.0/alpha-dec_osf
INSTALLBIN = $(PREFIX)/bin
INSTALLSCRIPT = $(PREFIX)/bin
PERL_LIB = /usr/lib/perl-5.6.0/lib/5.6.0
PERL_ARCHLIB = /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf
SITELIBEXP = /usr/lib/perl-5.6.0/lib/site_perl/5.6.0
SITEARCHEXP = /usr/lib/perl-5.6.0/lib/site_perl/5.6.0/alpha-dec_osf
LIBPERL_A = libperl.a
FIRST_MAKEFILE = Makefile
MAKE_APERL_FILE = Makefile.aperl
PERLMAINCC = $(CC)
PERL_INC = /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE
PERL = /usr/bin/perl
FULLPERL = /usr/bin/perl
FULL_AR = /usr/bin/ar

VERSION_MACRO = VERSION
DEFINE_VERSION = -D$(VERSION_MACRO)=\"$(VERSION)\"
XS_VERSION_MACRO = XS_VERSION
XS_DEFINE_VERSION = -D$(XS_VERSION_MACRO)=\"$(XS_VERSION)\"
PERL_MALLOC_DEF = -DPERL_EXTMALLOC_DEF -Dmalloc=Perl_malloc -Dfree=Perl_mfree -Drealloc=Perl_realloc -Dcalloc=Perl_calloc

MAKEMAKER = /usr/lib/perl-5.6.0/lib/5.6.0/ExtUtils/MakeMaker.pm
MM_VERSION = 5.45

# FULLEXT = Pathname for extension directory (eg Foo/Bar/Oracle).
# BASEEXT = Basename part of FULLEXT. May be just equal FULLEXT. (eg Oracle)
# ROOTEXT = Directory part of FULLEXT with leading slash (eg /DBD)  !!! Deprecated from MM 5.32  !!!
# PARENT_NAME = NAME without BASEEXT and no trailing :: (eg Foo::Bar)
# DLBASE  = Basename part of dynamic library. May be just equal BASEEXT.
FULLEXT = Bio
BASEEXT = Bio
DLBASE = $(BASEEXT)
VERSION_FROM = Bio/Root/Version.pm
OBJECT = 
LDFROM = $(OBJECT)
LINKTYPE = dynamic

# Handy lists of source code files:
XS_FILES= 
C_FILES = 
O_FILES = 
H_FILES = 
HTMLLIBPODS    = 
HTMLSCRIPTPODS = 
MAN1PODS = ./scripts_temp/bp_aacomp.pl \
	./scripts_temp/bp_biblio.pl \
	./scripts_temp/bp_biofetch_genbank_proxy.pl \
	./scripts_temp/bp_biogetseq.pl \
	./scripts_temp/bp_blast2tree.pl \
	./scripts_temp/bp_bulk_load_gff.pl \
	./scripts_temp/bp_chaos_plot.pl \
	./scripts_temp/bp_composite_LD.pl \
	./scripts_temp/bp_dbsplit.pl \
	./scripts_temp/bp_extract_feature_seq.pl \
	./scripts_temp/bp_fast_load_gff.pl \
	./scripts_temp/bp_feature_draw.pl \
	./scripts_temp/bp_fetch.pl \
	./scripts_temp/bp_filter_search.pl \
	./scripts_temp/bp_flanks.pl \
	./scripts_temp/bp_frend.pl \
	./scripts_temp/bp_gccalc.pl \
	./scripts_temp/bp_genbank2gff.pl \
	./scripts_temp/bp_generate_histogram.pl \
	./scripts_temp/bp_heterogeneity_test.pl \
	./scripts_temp/bp_index.pl \
	./scripts_temp/bp_load_gff.pl \
	./scripts_temp/bp_local_taxonomydb_query.pl \
	./scripts_temp/bp_mask_by_search.pl \
	./scripts_temp/bp_mrtrans.pl \
	./scripts_temp/bp_mutate.pl \
	./scripts_temp/bp_nrdb.pl \
	./scripts_temp/bp_oligo_count.pl \
	./scripts_temp/bp_pairwise_kaks.pl \
	./scripts_temp/bp_pg_bulk_load_gff.pl \
	./scripts_temp/bp_process_gadfly.pl \
	./scripts_temp/bp_process_ncbi_human.pl \
	./scripts_temp/bp_process_sgd.pl \
	./scripts_temp/bp_process_wormbase.pl \
	./scripts_temp/bp_remote_blast.pl \
	./scripts_temp/bp_search2BSML.pl \
	./scripts_temp/bp_search2alnblocks.pl \
	./scripts_temp/bp_search2gff.pl \
	./scripts_temp/bp_search2tribe.pl \
	./scripts_temp/bp_search_overview.pl \
	./scripts_temp/bp_seq_length.pl \
	./scripts_temp/bp_seqconvert.pl \
	./scripts_temp/bp_split_seq.pl \
	./scripts_temp/bp_sreformat.pl \
	./scripts_temp/bp_taxid4species.pl \
	./scripts_temp/bp_translate_seq.pl
MAN3PODS = Bio/Align/AlignI.pm \
	Bio/Align/DNAStatistics.pm \
	Bio/Align/PairwiseStatistics.pm \
	Bio/Align/StatisticsI.pm \
	Bio/Align/Utilities.pm \
	Bio/AlignIO.pm \
	Bio/AlignIO/bl2seq.pm \
	Bio/AlignIO/clustalw.pm \
	Bio/AlignIO/emboss.pm \
	Bio/AlignIO/fasta.pm \
	Bio/AlignIO/maf.pm \
	Bio/AlignIO/mase.pm \
	Bio/AlignIO/mega.pm \
	Bio/AlignIO/meme.pm \
	Bio/AlignIO/metafasta.pm \
	Bio/AlignIO/msf.pm \
	Bio/AlignIO/nexus.pm \
	Bio/AlignIO/pfam.pm \
	Bio/AlignIO/phylip.pm \
	Bio/AlignIO/prodom.pm \
	Bio/AlignIO/psi.pm \
	Bio/AlignIO/selex.pm \
	Bio/AlignIO/stockholm.pm \
	Bio/AnalysisI.pm \
	Bio/AnalysisParserI.pm \
	Bio/AnalysisResultI.pm \
	Bio/AnnotatableI.pm \
	Bio/Annotation/AnnotationFactory.pm \
	Bio/Annotation/Collection.pm \
	Bio/Annotation/Comment.pm \
	Bio/Annotation/DBLink.pm \
	Bio/Annotation/OntologyTerm.pm \
	Bio/Annotation/Reference.pm \
	Bio/Annotation/SimpleValue.pm \
	Bio/Annotation/StructuredValue.pm \
	Bio/Annotation/TypeManager.pm \
	Bio/AnnotationCollectionI.pm \
	Bio/AnnotationI.pm \
	Bio/Assembly/Contig.pm \
	Bio/Assembly/ContigAnalysis.pm \
	Bio/Assembly/IO.pm \
	Bio/Assembly/IO/ace.pm \
	Bio/Assembly/IO/phrap.pm \
	Bio/Assembly/Scaffold.pm \
	Bio/Assembly/ScaffoldI.pm \
	Bio/Biblio.pm \
	Bio/Biblio/Article.pm \
	Bio/Biblio/BiblioBase.pm \
	Bio/Biblio/Book.pm \
	Bio/Biblio/BookArticle.pm \
	Bio/Biblio/IO.pm \
	Bio/Biblio/IO/medline2ref.pm \
	Bio/Biblio/IO/medlinexml.pm \
	Bio/Biblio/IO/pubmed2ref.pm \
	Bio/Biblio/IO/pubmedxml.pm \
	Bio/Biblio/Journal.pm \
	Bio/Biblio/JournalArticle.pm \
	Bio/Biblio/MedlineArticle.pm \
	Bio/Biblio/MedlineBook.pm \
	Bio/Biblio/MedlineBookArticle.pm \
	Bio/Biblio/MedlineJournal.pm \
	Bio/Biblio/MedlineJournalArticle.pm \
	Bio/Biblio/Organisation.pm \
	Bio/Biblio/Patent.pm \
	Bio/Biblio/Person.pm \
	Bio/Biblio/Proceeding.pm \
	Bio/Biblio/Provider.pm \
	Bio/Biblio/PubmedArticle.pm \
	Bio/Biblio/PubmedBookArticle.pm \
	Bio/Biblio/PubmedJournalArticle.pm \
	Bio/Biblio/Ref.pm \
	Bio/Biblio/Service.pm \
	Bio/Biblio/TechReport.pm \
	Bio/Biblio/Thesis.pm \
	Bio/Biblio/WebResource.pm \
	Bio/Cluster/ClusterFactory.pm \
	Bio/Cluster/FamilyI.pm \
	Bio/Cluster/SequenceFamily.pm \
	Bio/Cluster/UniGene.pm \
	Bio/Cluster/UniGeneI.pm \
	Bio/ClusterI.pm \
	Bio/ClusterIO.pm \
	Bio/ClusterIO/dbsnp.pm \
	Bio/ClusterIO/unigene.pm \
	Bio/CodonUsage/IO.pm \
	Bio/CodonUsage/Table.pm \
	Bio/Coordinate/Chain.pm \
	Bio/Coordinate/Collection.pm \
	Bio/Coordinate/ExtrapolatingPair.pm \
	Bio/Coordinate/GeneMapper.pm \
	Bio/Coordinate/Graph.pm \
	Bio/Coordinate/MapperI.pm \
	Bio/Coordinate/Pair.pm \
	Bio/Coordinate/Result.pm \
	Bio/Coordinate/Result/Gap.pm \
	Bio/Coordinate/Result/Match.pm \
	Bio/Coordinate/ResultI.pm \
	Bio/Coordinate/Utils.pm \
	Bio/DB/Ace.pm \
	Bio/DB/Biblio/biofetch.pm \
	Bio/DB/Biblio/soap.pm \
	Bio/DB/BiblioI.pm \
	Bio/DB/BioFetch.pm \
	Bio/DB/CUTG.pm \
	Bio/DB/DBFetch.pm \
	Bio/DB/EMBL.pm \
	Bio/DB/Failover.pm \
	Bio/DB/Fasta.pm \
	Bio/DB/FileCache.pm \
	Bio/DB/Flat.pm \
	Bio/DB/Flat/BDB.pm \
	Bio/DB/Flat/BDB/embl.pm \
	Bio/DB/Flat/BDB/fasta.pm \
	Bio/DB/Flat/BDB/genbank.pm \
	Bio/DB/Flat/BDB/swiss.pm \
	Bio/DB/Flat/BDB/swissprot.pm \
	Bio/DB/Flat/BinarySearch.pm \
	Bio/DB/GDB.pm \
	Bio/DB/GFF.pm \
	Bio/DB/GFF/Adaptor/ace.pm \
	Bio/DB/GFF/Adaptor/biofetch.pm \
	Bio/DB/GFF/Adaptor/biofetch_oracle.pm \
	Bio/DB/GFF/Adaptor/dbi.pm \
	Bio/DB/GFF/Adaptor/dbi/caching_handle.pm \
	Bio/DB/GFF/Adaptor/dbi/iterator.pm \
	Bio/DB/GFF/Adaptor/dbi/mysql.pm \
	Bio/DB/GFF/Adaptor/dbi/mysqlace.pm \
	Bio/DB/GFF/Adaptor/dbi/mysqlopt.pm \
	Bio/DB/GFF/Adaptor/dbi/oracle.pm \
	Bio/DB/GFF/Adaptor/dbi/oracleace.pm \
	Bio/DB/GFF/Adaptor/dbi/pg.pm \
	Bio/DB/GFF/Adaptor/memory.pm \
	Bio/DB/GFF/Adaptor/memory_iterator.pm \
	Bio/DB/GFF/Aggregator.pm \
	Bio/DB/GFF/Aggregator/alignment.pm \
	Bio/DB/GFF/Aggregator/clone.pm \
	Bio/DB/GFF/Aggregator/coding.pm \
	Bio/DB/GFF/Aggregator/match.pm \
	Bio/DB/GFF/Aggregator/none.pm \
	Bio/DB/GFF/Aggregator/processed_transcript.pm \
	Bio/DB/GFF/Aggregator/transcript.pm \
	Bio/DB/GFF/Aggregator/ucsc_acembly.pm \
	Bio/DB/GFF/Aggregator/ucsc_ensgene.pm \
	Bio/DB/GFF/Aggregator/ucsc_genscan.pm \
	Bio/DB/GFF/Aggregator/ucsc_refgene.pm \
	Bio/DB/GFF/Aggregator/ucsc_sanger22.pm \
	Bio/DB/GFF/Aggregator/ucsc_sanger22pseudo.pm \
	Bio/DB/GFF/Aggregator/ucsc_softberry.pm \
	Bio/DB/GFF/Aggregator/ucsc_twinscan.pm \
	Bio/DB/GFF/Aggregator/ucsc_unigene.pm \
	Bio/DB/GFF/Featname.pm \
	Bio/DB/GFF/Feature.pm \
	Bio/DB/GFF/Homol.pm \
	Bio/DB/GFF/RelSegment.pm \
	Bio/DB/GFF/Segment.pm \
	Bio/DB/GFF/Typename.pm \
	Bio/DB/GFF/Util/Binning.pm \
	Bio/DB/GFF/Util/Rearrange.pm \
	Bio/DB/GenBank.pm \
	Bio/DB/GenPept.pm \
	Bio/DB/InMemoryCache.pm \
	Bio/DB/MeSH.pm \
	Bio/DB/NCBIHelper.pm \
	Bio/DB/Query/GenBank.pm \
	Bio/DB/Query/WebQuery.pm \
	Bio/DB/QueryI.pm \
	Bio/DB/RandomAccessI.pm \
	Bio/DB/RefSeq.pm \
	Bio/DB/Registry.pm \
	Bio/DB/SeqI.pm \
	Bio/DB/SwissProt.pm \
	Bio/DB/Taxonomy.pm \
	Bio/DB/Taxonomy/entrez.pm \
	Bio/DB/Taxonomy/flatfile.pm \
	Bio/DB/Universal.pm \
	Bio/DB/UpdateableSeqI.pm \
	Bio/DB/WebDBSeqI.pm \
	Bio/DB/XEMBL.pm \
	Bio/DB/XEMBLService.pm \
	Bio/DBLinkContainerI.pm \
	Bio/Das/FeatureTypeI.pm \
	Bio/Das/SegmentI.pm \
	Bio/DasI.pm \
	Bio/DescribableI.pm \
	Bio/Event/EventGeneratorI.pm \
	Bio/Event/EventHandlerI.pm \
	Bio/Expression/FeatureGroup.pm \
	Bio/Expression/FeatureI.pm \
	Bio/Factory/AnalysisI.pm \
	Bio/Factory/ApplicationFactoryI.pm \
	Bio/Factory/DriverFactory.pm \
	Bio/Factory/FTLocationFactory.pm \
	Bio/Factory/HitFactoryI.pm \
	Bio/Factory/LocationFactoryI.pm \
	Bio/Factory/MapFactoryI.pm \
	Bio/Factory/ObjectBuilderI.pm \
	Bio/Factory/ObjectFactory.pm \
	Bio/Factory/ObjectFactoryI.pm \
	Bio/Factory/ResultFactoryI.pm \
	Bio/Factory/SeqAnalysisParserFactory.pm \
	Bio/Factory/SeqAnalysisParserFactoryI.pm \
	Bio/Factory/SequenceFactoryI.pm \
	Bio/Factory/SequenceProcessorI.pm \
	Bio/Factory/SequenceStreamI.pm \
	Bio/Factory/TreeFactoryI.pm \
	Bio/FeatureHolderI.pm \
	Bio/Graphics.pm \
	Bio/Graphics/Feature.pm \
	Bio/Graphics/FeatureFile.pm \
	Bio/Graphics/FeatureFile/Iterator.pm \
	Bio/Graphics/Glyph.pm \
	Bio/Graphics/Glyph/Factory.pm \
	Bio/Graphics/Glyph/alignment.pm \
	Bio/Graphics/Glyph/anchored_arrow.pm \
	Bio/Graphics/Glyph/arrow.pm \
	Bio/Graphics/Glyph/box.pm \
	Bio/Graphics/Glyph/cds.pm \
	Bio/Graphics/Glyph/crossbox.pm \
	Bio/Graphics/Glyph/diamond.pm \
	Bio/Graphics/Glyph/dna.pm \
	Bio/Graphics/Glyph/dot.pm \
	Bio/Graphics/Glyph/ellipse.pm \
	Bio/Graphics/Glyph/ex.pm \
	Bio/Graphics/Glyph/extending_arrow.pm \
	Bio/Graphics/Glyph/generic.pm \
	Bio/Graphics/Glyph/graded_segments.pm \
	Bio/Graphics/Glyph/group.pm \
	Bio/Graphics/Glyph/heterogeneous_segments.pm \
	Bio/Graphics/Glyph/line.pm \
	Bio/Graphics/Glyph/minmax.pm \
	Bio/Graphics/Glyph/oval.pm \
	Bio/Graphics/Glyph/pinsertion.pm \
	Bio/Graphics/Glyph/primers.pm \
	Bio/Graphics/Glyph/processed_transcript.pm \
	Bio/Graphics/Glyph/redgreen_box.pm \
	Bio/Graphics/Glyph/redgreen_segment.pm \
	Bio/Graphics/Glyph/rndrect.pm \
	Bio/Graphics/Glyph/ruler_arrow.pm \
	Bio/Graphics/Glyph/segmented_keyglyph.pm \
	Bio/Graphics/Glyph/segments.pm \
	Bio/Graphics/Glyph/span.pm \
	Bio/Graphics/Glyph/splice_site.pm \
	Bio/Graphics/Glyph/toomany.pm \
	Bio/Graphics/Glyph/track.pm \
	Bio/Graphics/Glyph/transcript.pm \
	Bio/Graphics/Glyph/transcript2.pm \
	Bio/Graphics/Glyph/translation.pm \
	Bio/Graphics/Glyph/triangle.pm \
	Bio/Graphics/Glyph/xyplot.pm \
	Bio/Graphics/Panel.pm \
	Bio/Graphics/Pictogram.pm \
	Bio/Graphics/RendererI.pm \
	Bio/IdCollectionI.pm \
	Bio/IdentifiableI.pm \
	Bio/Index/Abstract.pm \
	Bio/Index/AbstractSeq.pm \
	Bio/Index/Blast.pm \
	Bio/Index/EMBL.pm \
	Bio/Index/Fasta.pm \
	Bio/Index/Fastq.pm \
	Bio/Index/GenBank.pm \
	Bio/Index/SwissPfam.pm \
	Bio/Index/Swissprot.pm \
	Bio/LiveSeq/AARange.pm \
	Bio/LiveSeq/Chain.pm \
	Bio/LiveSeq/ChainI.pm \
	Bio/LiveSeq/DNA.pm \
	Bio/LiveSeq/Exon.pm \
	Bio/LiveSeq/Gene.pm \
	Bio/LiveSeq/IO/BioPerl.pm \
	Bio/LiveSeq/IO/Loader.pm \
	Bio/LiveSeq/IO/SRS.pm \
	Bio/LiveSeq/Intron.pm \
	Bio/LiveSeq/Mutation.pm \
	Bio/LiveSeq/Mutator.pm \
	Bio/LiveSeq/Prim_Transcript.pm \
	Bio/LiveSeq/Range.pm \
	Bio/LiveSeq/Repeat_Region.pm \
	Bio/LiveSeq/Repeat_Unit.pm \
	Bio/LiveSeq/SeqI.pm \
	Bio/LiveSeq/Transcript.pm \
	Bio/LiveSeq/Translation.pm \
	Bio/LocatableSeq.pm \
	Bio/Location/Atomic.pm \
	Bio/Location/AvWithinCoordPolicy.pm \
	Bio/Location/CoordinatePolicyI.pm \
	Bio/Location/Fuzzy.pm \
	Bio/Location/FuzzyLocationI.pm \
	Bio/Location/NarrowestCoordPolicy.pm \
	Bio/Location/Simple.pm \
	Bio/Location/Split.pm \
	Bio/Location/SplitLocationI.pm \
	Bio/Location/WidestCoordPolicy.pm \
	Bio/LocationI.pm \
	Bio/Map/CytoMap.pm \
	Bio/Map/CytoMarker.pm \
	Bio/Map/CytoPosition.pm \
	Bio/Map/LinkageMap.pm \
	Bio/Map/LinkagePosition.pm \
	Bio/Map/MapI.pm \
	Bio/Map/MappableI.pm \
	Bio/Map/Marker.pm \
	Bio/Map/MarkerI.pm \
	Bio/Map/Microsatellite.pm \
	Bio/Map/OrderedPosition.pm \
	Bio/Map/OrderedPositionWithDistance.pm \
	Bio/Map/Position.pm \
	Bio/Map/PositionI.pm \
	Bio/Map/SimpleMap.pm \
	Bio/MapIO.pm \
	Bio/MapIO/mapmaker.pm \
	Bio/Matrix/Generic.pm \
	Bio/Matrix/IO.pm \
	Bio/Matrix/IO/phylip.pm \
	Bio/Matrix/IO/scoring.pm \
	Bio/Matrix/MatrixI.pm \
	Bio/Matrix/PSM/IO.pm \
	Bio/Matrix/PSM/IO/mast.pm \
	Bio/Matrix/PSM/IO/meme.pm \
	Bio/Matrix/PSM/IO/transfac.pm \
	Bio/Matrix/PSM/InstanceSite.pm \
	Bio/Matrix/PSM/InstanceSiteI.pm \
	Bio/Matrix/PSM/Psm.pm \
	Bio/Matrix/PSM/PsmHeader.pm \
	Bio/Matrix/PSM/PsmHeaderI.pm \
	Bio/Matrix/PSM/PsmI.pm \
	Bio/Matrix/PSM/SiteMatrix.pm \
	Bio/Matrix/PSM/SiteMatrixI.pm \
	Bio/Matrix/PhylipDist.pm \
	Bio/Matrix/Scoring.pm \
	Bio/Ontology/GOterm.pm \
	Bio/Ontology/InterProTerm.pm \
	Bio/Ontology/Ontology.pm \
	Bio/Ontology/OntologyEngineI.pm \
	Bio/Ontology/OntologyI.pm \
	Bio/Ontology/OntologyStore.pm \
	Bio/Ontology/Path.pm \
	Bio/Ontology/PathI.pm \
	Bio/Ontology/Relationship.pm \
	Bio/Ontology/RelationshipFactory.pm \
	Bio/Ontology/RelationshipI.pm \
	Bio/Ontology/RelationshipType.pm \
	Bio/Ontology/SimpleGOEngine.pm \
	Bio/Ontology/SimpleOntologyEngine.pm \
	Bio/Ontology/Term.pm \
	Bio/Ontology/TermFactory.pm \
	Bio/Ontology/TermI.pm \
	Bio/OntologyIO.pm \
	Bio/OntologyIO/Handlers/BaseSAXHandler.pm \
	Bio/OntologyIO/Handlers/InterProHandler.pm \
	Bio/OntologyIO/Handlers/InterPro_BioSQL_Handler.pm \
	Bio/OntologyIO/InterProParser.pm \
	Bio/OntologyIO/dagflat.pm \
	Bio/OntologyIO/goflat.pm \
	Bio/OntologyIO/simplehierarchy.pm \
	Bio/OntologyIO/soflat.pm \
	Bio/Perl.pm \
	Bio/Phenotype/Correlate.pm \
	Bio/Phenotype/MeSH/Term.pm \
	Bio/Phenotype/MeSH/Twig.pm \
	Bio/Phenotype/Measure.pm \
	Bio/Phenotype/OMIM/MiniMIMentry.pm \
	Bio/Phenotype/OMIM/OMIMentry.pm \
	Bio/Phenotype/OMIM/OMIMentryAllelicVariant.pm \
	Bio/Phenotype/OMIM/OMIMparser.pm \
	Bio/Phenotype/Phenotype.pm \
	Bio/Phenotype/PhenotypeI.pm \
	Bio/PopGen/Genotype.pm \
	Bio/PopGen/GenotypeI.pm \
	Bio/PopGen/IO.pm \
	Bio/PopGen/IO/csv.pm \
	Bio/PopGen/IO/prettybase.pm \
	Bio/PopGen/Individual.pm \
	Bio/PopGen/IndividualI.pm \
	Bio/PopGen/Marker.pm \
	Bio/PopGen/MarkerI.pm \
	Bio/PopGen/PopStats.pm \
	Bio/PopGen/Population.pm \
	Bio/PopGen/PopulationI.pm \
	Bio/PopGen/Simulation/Coalescent.pm \
	Bio/PopGen/Simulation/GeneticDrift.pm \
	Bio/PopGen/Statistics.pm \
	Bio/PrimarySeq.pm \
	Bio/PrimarySeqI.pm \
	Bio/Range.pm \
	Bio/RangeI.pm \
	Bio/Restriction/Analysis.pm \
	Bio/Restriction/Enzyme.pm \
	Bio/Restriction/Enzyme/MultiCut.pm \
	Bio/Restriction/Enzyme/MultiSite.pm \
	Bio/Restriction/EnzymeCollection.pm \
	Bio/Restriction/EnzymeI.pm \
	Bio/Restriction/IO.pm \
	Bio/Restriction/IO/bairoch.pm \
	Bio/Restriction/IO/base.pm \
	Bio/Restriction/IO/itype2.pm \
	Bio/Restriction/IO/withrefm.pm \
	Bio/Root/Err.pm \
	Bio/Root/Exception.pm \
	Bio/Root/Global.pm \
	Bio/Root/HTTPget.pm \
	Bio/Root/IO.pm \
	Bio/Root/IOManager.pm \
	Bio/Root/Object.pm \
	Bio/Root/Root.pm \
	Bio/Root/RootI.pm \
	Bio/Root/Storable.pm \
	Bio/Root/Utilities.pm \
	Bio/Root/Vector.pm \
	Bio/Root/Version.pm \
	Bio/Root/Xref.pm \
	Bio/Search/BlastUtils.pm \
	Bio/Search/DatabaseI.pm \
	Bio/Search/GenericDatabase.pm \
	Bio/Search/HSP/BlastHSP.pm \
	Bio/Search/HSP/FastaHSP.pm \
	Bio/Search/HSP/GenericHSP.pm \
	Bio/Search/HSP/HMMERHSP.pm \
	Bio/Search/HSP/HSPFactory.pm \
	Bio/Search/HSP/HSPI.pm \
	Bio/Search/HSP/PSLHSP.pm \
	Bio/Search/HSP/PsiBlastHSP.pm \
	Bio/Search/HSP/WABAHSP.pm \
	Bio/Search/Hit/BlastHit.pm \
	Bio/Search/Hit/Fasta.pm \
	Bio/Search/Hit/GenericHit.pm \
	Bio/Search/Hit/HMMERHit.pm \
	Bio/Search/Hit/HitFactory.pm \
	Bio/Search/Hit/HitI.pm \
	Bio/Search/Hit/PsiBlastHit.pm \
	Bio/Search/Iteration/GenericIteration.pm \
	Bio/Search/Iteration/IterationI.pm \
	Bio/Search/Processor.pm \
	Bio/Search/Result/BlastResult.pm \
	Bio/Search/Result/GenericResult.pm \
	Bio/Search/Result/HMMERResult.pm \
	Bio/Search/Result/ResultFactory.pm \
	Bio/Search/Result/ResultI.pm \
	Bio/Search/Result/WABAResult.pm \
	Bio/Search/SearchUtils.pm \
	Bio/SearchDist.pm \
	Bio/SearchIO.pm \
	Bio/SearchIO/EventHandlerI.pm \
	Bio/SearchIO/FastHitEventBuilder.pm \
	Bio/SearchIO/IteratedSearchResultEventBuilder.pm \
	Bio/SearchIO/SearchResultEventBuilder.pm \
	Bio/SearchIO/SearchWriterI.pm \
	Bio/SearchIO/Writer/BSMLResultWriter.pm \
	Bio/SearchIO/Writer/GbrowseGFF.pm \
	Bio/SearchIO/Writer/HSPTableWriter.pm \
	Bio/SearchIO/Writer/HTMLResultWriter.pm \
	Bio/SearchIO/Writer/HitTableWriter.pm \
	Bio/SearchIO/Writer/ResultTableWriter.pm \
	Bio/SearchIO/Writer/TextResultWriter.pm \
	Bio/SearchIO/axt.pm \
	Bio/SearchIO/blast.pm \
	Bio/SearchIO/blasttable.pm \
	Bio/SearchIO/blastxml.pm \
	Bio/SearchIO/exonerate.pm \
	Bio/SearchIO/fasta.pm \
	Bio/SearchIO/hmmer.pm \
	Bio/SearchIO/megablast.pm \
	Bio/SearchIO/psl.pm \
	Bio/SearchIO/sim4.pm \
	Bio/SearchIO/waba.pm \
	Bio/SearchIO/wise.pm \
	Bio/Seq.pm \
	Bio/Seq/BaseSeqProcessor.pm \
	Bio/Seq/EncodedSeq.pm \
	Bio/Seq/LargePrimarySeq.pm \
	Bio/Seq/LargeSeq.pm \
	Bio/Seq/Meta.pm \
	Bio/Seq/Meta/Array.pm \
	Bio/Seq/MetaI.pm \
	Bio/Seq/PrimaryQual.pm \
	Bio/Seq/PrimedSeq.pm \
	Bio/Seq/QualI.pm \
	Bio/Seq/RichSeq.pm \
	Bio/Seq/RichSeqI.pm \
	Bio/Seq/SeqBuilder.pm \
	Bio/Seq/SeqFactory.pm \
	Bio/Seq/SeqFastaSpeedFactory.pm \
	Bio/Seq/SeqWithQuality.pm \
	Bio/Seq/SequenceTrace.pm \
	Bio/Seq/TraceI.pm \
	Bio/SeqAnalysisParserI.pm \
	Bio/SeqFeature/AnnotationAdaptor.pm \
	Bio/SeqFeature/Collection.pm \
	Bio/SeqFeature/CollectionI.pm \
	Bio/SeqFeature/Computation.pm \
	Bio/SeqFeature/FeaturePair.pm \
	Bio/SeqFeature/Gene/Exon.pm \
	Bio/SeqFeature/Gene/ExonI.pm \
	Bio/SeqFeature/Gene/GeneStructure.pm \
	Bio/SeqFeature/Gene/GeneStructureI.pm \
	Bio/SeqFeature/Gene/Intron.pm \
	Bio/SeqFeature/Gene/NC_Feature.pm \
	Bio/SeqFeature/Gene/Poly_A_site.pm \
	Bio/SeqFeature/Gene/Promoter.pm \
	Bio/SeqFeature/Gene/Transcript.pm \
	Bio/SeqFeature/Gene/TranscriptI.pm \
	Bio/SeqFeature/Gene/UTR.pm \
	Bio/SeqFeature/Generic.pm \
	Bio/SeqFeature/PositionProxy.pm \
	Bio/SeqFeature/Primer.pm \
	Bio/SeqFeature/SiRNA/Oligo.pm \
	Bio/SeqFeature/SiRNA/Pair.pm \
	Bio/SeqFeature/Similarity.pm \
	Bio/SeqFeature/SimilarityPair.pm \
	Bio/SeqFeature/Tools/TypeMapper.pm \
	Bio/SeqFeature/Tools/Unflattener.pm \
	Bio/SeqFeatureI.pm \
	Bio/SeqI.pm \
	Bio/SeqIO.pm \
	Bio/SeqIO/FTHelper.pm \
	Bio/SeqIO/MultiFile.pm \
	Bio/SeqIO/abi.pm \
	Bio/SeqIO/ace.pm \
	Bio/SeqIO/alf.pm \
	Bio/SeqIO/asciitree.pm \
	Bio/SeqIO/bsml.pm \
	Bio/SeqIO/chadoxml.pm \
	Bio/SeqIO/ctf.pm \
	Bio/SeqIO/embl.pm \
	Bio/SeqIO/exp.pm \
	Bio/SeqIO/fasta.pm \
	Bio/SeqIO/fastq.pm \
	Bio/SeqIO/game.pm \
	Bio/SeqIO/game/featHandler.pm \
	Bio/SeqIO/game/gameHandler.pm \
	Bio/SeqIO/game/gameSubs.pm \
	Bio/SeqIO/game/gameWriter.pm \
	Bio/SeqIO/game/seqHandler.pm \
	Bio/SeqIO/gcg.pm \
	Bio/SeqIO/genbank.pm \
	Bio/SeqIO/kegg.pm \
	Bio/SeqIO/largefasta.pm \
	Bio/SeqIO/locuslink.pm \
	Bio/SeqIO/metafasta.pm \
	Bio/SeqIO/phd.pm \
	Bio/SeqIO/pir.pm \
	Bio/SeqIO/pln.pm \
	Bio/SeqIO/qual.pm \
	Bio/SeqIO/raw.pm \
	Bio/SeqIO/scf.pm \
	Bio/SeqIO/swiss.pm \
	Bio/SeqIO/tab.pm \
	Bio/SeqIO/tigr.pm \
	Bio/SeqIO/ztr.pm \
	Bio/SeqUtils.pm \
	Bio/SimpleAlign.pm \
	Bio/SimpleAnalysisI.pm \
	Bio/Species.pm \
	Bio/Structure/Atom.pm \
	Bio/Structure/Chain.pm \
	Bio/Structure/Entry.pm \
	Bio/Structure/IO.pm \
	Bio/Structure/IO/pdb.pm \
	Bio/Structure/Model.pm \
	Bio/Structure/Residue.pm \
	Bio/Structure/SecStr/DSSP/Res.pm \
	Bio/Structure/SecStr/STRIDE/Res.pm \
	Bio/Structure/StructureI.pm \
	Bio/Symbol/Alphabet.pm \
	Bio/Symbol/AlphabetI.pm \
	Bio/Symbol/DNAAlphabet.pm \
	Bio/Symbol/ProteinAlphabet.pm \
	Bio/Symbol/Symbol.pm \
	Bio/Symbol/SymbolI.pm \
	Bio/Taxonomy.pm \
	Bio/Taxonomy/FactoryI.pm \
	Bio/Taxonomy/Node.pm \
	Bio/Taxonomy/Taxon.pm \
	Bio/Taxonomy/Tree.pm \
	Bio/Tools/AlignFactory.pm \
	Bio/Tools/Alignment/Consed.pm \
	Bio/Tools/Alignment/Trim.pm \
	Bio/Tools/Analysis/DNA/ESEfinder.pm \
	Bio/Tools/Analysis/Protein/Domcut.pm \
	Bio/Tools/Analysis/Protein/ELM.pm \
	Bio/Tools/Analysis/Protein/GOR4.pm \
	Bio/Tools/Analysis/Protein/HNN.pm \
	Bio/Tools/Analysis/Protein/Mitoprot.pm \
	Bio/Tools/Analysis/Protein/NetPhos.pm \
	Bio/Tools/Analysis/Protein/Scansite.pm \
	Bio/Tools/Analysis/Protein/Sopma.pm \
	Bio/Tools/Analysis/SimpleAnalysisBase.pm \
	Bio/Tools/AnalysisResult.pm \
	Bio/Tools/BPbl2seq.pm \
	Bio/Tools/BPlite.pm \
	Bio/Tools/BPlite/HSP.pm \
	Bio/Tools/BPlite/Iteration.pm \
	Bio/Tools/BPlite/Sbjct.pm \
	Bio/Tools/BPpsilite.pm \
	Bio/Tools/Blast.pm \
	Bio/Tools/Blast/HSP.pm \
	Bio/Tools/Blast/HTML.pm \
	Bio/Tools/Blast/Sbjct.pm \
	Bio/Tools/Blat.pm \
	Bio/Tools/CodonTable.pm \
	Bio/Tools/Coil.pm \
	Bio/Tools/ECnumber.pm \
	Bio/Tools/EMBOSS/Palindrome.pm \
	Bio/Tools/EPCR.pm \
	Bio/Tools/ESTScan.pm \
	Bio/Tools/Eponine.pm \
	Bio/Tools/Est2Genome.pm \
	Bio/Tools/FootPrinter.pm \
	Bio/Tools/GFF.pm \
	Bio/Tools/Gel.pm \
	Bio/Tools/Geneid.pm \
	Bio/Tools/Genemark.pm \
	Bio/Tools/Genewise.pm \
	Bio/Tools/Genomewise.pm \
	Bio/Tools/Genscan.pm \
	Bio/Tools/Glimmer.pm \
	Bio/Tools/Grail.pm \
	Bio/Tools/GuessSeqFormat.pm \
	Bio/Tools/HMMER/Domain.pm \
	Bio/Tools/HMMER/Results.pm \
	Bio/Tools/HMMER/Set.pm \
	Bio/Tools/Hmmpfam.pm \
	Bio/Tools/IUPAC.pm \
	Bio/Tools/Lucy.pm \
	Bio/Tools/MZEF.pm \
	Bio/Tools/OddCodes.pm \
	Bio/Tools/Phylo/Molphy.pm \
	Bio/Tools/Phylo/Molphy/Result.pm \
	Bio/Tools/Phylo/PAML.pm \
	Bio/Tools/Phylo/PAML/ModelResult.pm \
	Bio/Tools/Phylo/PAML/Result.pm \
	Bio/Tools/Phylo/Phylip/ProtDist.pm \
	Bio/Tools/Prediction/Exon.pm \
	Bio/Tools/Prediction/Gene.pm \
	Bio/Tools/Primer/Assessor/Base.pm \
	Bio/Tools/Primer/AssessorI.pm \
	Bio/Tools/Primer/Feature.pm \
	Bio/Tools/Primer/Pair.pm \
	Bio/Tools/Primer3.pm \
	Bio/Tools/Prints.pm \
	Bio/Tools/Profile.pm \
	Bio/Tools/Promoterwise.pm \
	Bio/Tools/PrositeScan.pm \
	Bio/Tools/Pseudowise.pm \
	Bio/Tools/QRNA.pm \
	Bio/Tools/RandomDistFunctions.pm \
	Bio/Tools/RepeatMasker.pm \
	Bio/Tools/RestrictionEnzyme.pm \
	Bio/Tools/Run/RemoteBlast.pm \
	Bio/Tools/Run/StandAloneBlast.pm \
	Bio/Tools/Run/WrapperBase.pm \
	Bio/Tools/Seg.pm \
	Bio/Tools/SeqAnal.pm \
	Bio/Tools/SeqPattern.pm \
	Bio/Tools/SeqStats.pm \
	Bio/Tools/SeqWords.pm \
	Bio/Tools/SiRNA.pm \
	Bio/Tools/Sigcleave.pm \
	Bio/Tools/Signalp.pm \
	Bio/Tools/Sim4/Exon.pm \
	Bio/Tools/Sim4/Results.pm \
	Bio/Tools/Tmhmm.pm \
	Bio/Tools/WWW.pm \
	Bio/Tools/dpAlign.pm \
	Bio/Tools/pICalculator.pm \
	Bio/Tools/pSW.pm \
	Bio/Tree/AlleleNode.pm \
	Bio/Tree/Node.pm \
	Bio/Tree/NodeI.pm \
	Bio/Tree/NodeNHX.pm \
	Bio/Tree/RandomFactory.pm \
	Bio/Tree/Statistics.pm \
	Bio/Tree/Tree.pm \
	Bio/Tree/TreeFunctionsI.pm \
	Bio/Tree/TreeI.pm \
	Bio/TreeIO.pm \
	Bio/TreeIO/TreeEventBuilder.pm \
	Bio/TreeIO/lintree.pm \
	Bio/TreeIO/newick.pm \
	Bio/TreeIO/nexus.pm \
	Bio/TreeIO/nhx.pm \
	Bio/TreeIO/svggraph.pm \
	Bio/TreeIO/tabtree.pm \
	Bio/UpdateableSeqI.pm \
	Bio/Variation/AAChange.pm \
	Bio/Variation/AAReverseMutate.pm \
	Bio/Variation/Allele.pm \
	Bio/Variation/DNAMutation.pm \
	Bio/Variation/IO.pm \
	Bio/Variation/IO/flat.pm \
	Bio/Variation/IO/xml.pm \
	Bio/Variation/RNAChange.pm \
	Bio/Variation/SNP.pm \
	Bio/Variation/SeqDiff.pm \
	Bio/Variation/VariantI.pm \
	Bio/WebAgent.pm \
	biodatabases.pod \
	biodesign.pod \
	bioperl.pod \
	bioscripts.pod \
	bptutorial.pl
HTMLEXT = html
INST_MAN1DIR = blib/man1
INSTALLMAN1DIR = /usr/share/doclib/annex/man/man1
MAN1EXT = 1
INST_MAN3DIR = blib/man3
INSTALLMAN3DIR = /usr/share/doclib/annex/man/man3
MAN3EXT = 3
PERM_RW = 644
PERM_RWX = 755

# work around a famous dec-osf make(1) feature(?):
makemakerdflt: all

.SUFFIXES: .xs .c .C .cpp .cxx .cc $(OBJ_EXT)

# Nick wanted to get rid of .PRECIOUS. I don't remember why. I seem to recall, that
# some make implementations will delete the Makefile when we rebuild it. Because
# we call false(1) when we rebuild it. So make(1) is not completely wrong when it
# does so. Our milage may vary.
# .PRECIOUS: Makefile    # seems to be not necessary anymore

.PHONY: all config static dynamic test linkext manifest

# Where is the Config information that we are using/depend on
CONFIGDEP = $(PERL_ARCHLIB)/Config.pm $(PERL_INC)/config.h

# Where to put things:
INST_LIBDIR      = $(INST_LIB)
INST_ARCHLIBDIR  = $(INST_ARCHLIB)

INST_AUTODIR     = $(INST_LIB)/auto/$(FULLEXT)
INST_ARCHAUTODIR = $(INST_ARCHLIB)/auto/$(FULLEXT)

INST_STATIC  =
INST_DYNAMIC =
INST_BOOT    =

EXPORT_LIST = 

PERL_ARCHIVE = 

TO_INST_PM = Bio/Align/AlignI.pm \
	Bio/Align/DNAStatistics.pm \
	Bio/Align/PairwiseStatistics.pm \
	Bio/Align/StatisticsI.pm \
	Bio/Align/Utilities.pm \
	Bio/AlignIO.pm \
	Bio/AlignIO/bl2seq.pm \
	Bio/AlignIO/clustalw.pm \
	Bio/AlignIO/emboss.pm \
	Bio/AlignIO/fasta.pm \
	Bio/AlignIO/maf.pm \
	Bio/AlignIO/mase.pm \
	Bio/AlignIO/mega.pm \
	Bio/AlignIO/meme.pm \
	Bio/AlignIO/metafasta.pm \
	Bio/AlignIO/msf.pm \
	Bio/AlignIO/nexus.pm \
	Bio/AlignIO/pfam.pm \
	Bio/AlignIO/phylip.pm \
	Bio/AlignIO/prodom.pm \
	Bio/AlignIO/psi.pm \
	Bio/AlignIO/selex.pm \
	Bio/AlignIO/stockholm.pm \
	Bio/AnalysisI.pm \
	Bio/AnalysisParserI.pm \
	Bio/AnalysisResultI.pm \
	Bio/AnnotatableI.pm \
	Bio/Annotation/AnnotationFactory.pm \
	Bio/Annotation/Collection.pm \
	Bio/Annotation/Comment.pm \
	Bio/Annotation/DBLink.pm \
	Bio/Annotation/OntologyTerm.pm \
	Bio/Annotation/Reference.pm \
	Bio/Annotation/SimpleValue.pm \
	Bio/Annotation/StructuredValue.pm \
	Bio/Annotation/TypeManager.pm \
	Bio/AnnotationCollectionI.pm \
	Bio/AnnotationI.pm \
	Bio/Assembly/Contig.pm \
	Bio/Assembly/ContigAnalysis.pm \
	Bio/Assembly/IO.pm \
	Bio/Assembly/IO/ace.pm \
	Bio/Assembly/IO/phrap.pm \
	Bio/Assembly/Scaffold.pm \
	Bio/Assembly/ScaffoldI.pm \
	Bio/Biblio.pm \
	Bio/Biblio/Article.pm \
	Bio/Biblio/BiblioBase.pm \
	Bio/Biblio/Book.pm \
	Bio/Biblio/BookArticle.pm \
	Bio/Biblio/IO.pm \
	Bio/Biblio/IO/medline2ref.pm \
	Bio/Biblio/IO/medlinexml.pm \
	Bio/Biblio/IO/pubmed2ref.pm \
	Bio/Biblio/IO/pubmedxml.pm \
	Bio/Biblio/Journal.pm \
	Bio/Biblio/JournalArticle.pm \
	Bio/Biblio/MedlineArticle.pm \
	Bio/Biblio/MedlineBook.pm \
	Bio/Biblio/MedlineBookArticle.pm \
	Bio/Biblio/MedlineJournal.pm \
	Bio/Biblio/MedlineJournalArticle.pm \
	Bio/Biblio/Organisation.pm \
	Bio/Biblio/Patent.pm \
	Bio/Biblio/Person.pm \
	Bio/Biblio/Proceeding.pm \
	Bio/Biblio/Provider.pm \
	Bio/Biblio/PubmedArticle.pm \
	Bio/Biblio/PubmedBookArticle.pm \
	Bio/Biblio/PubmedJournalArticle.pm \
	Bio/Biblio/Ref.pm \
	Bio/Biblio/Service.pm \
	Bio/Biblio/TechReport.pm \
	Bio/Biblio/Thesis.pm \
	Bio/Biblio/WebResource.pm \
	Bio/Cluster/ClusterFactory.pm \
	Bio/Cluster/FamilyI.pm \
	Bio/Cluster/SequenceFamily.pm \
	Bio/Cluster/UniGene.pm \
	Bio/Cluster/UniGeneI.pm \
	Bio/ClusterI.pm \
	Bio/ClusterIO.pm \
	Bio/ClusterIO/dbsnp.pm \
	Bio/ClusterIO/unigene.pm \
	Bio/CodonUsage/IO.pm \
	Bio/CodonUsage/Table.pm \
	Bio/Coordinate/Chain.pm \
	Bio/Coordinate/Collection.pm \
	Bio/Coordinate/ExtrapolatingPair.pm \
	Bio/Coordinate/GeneMapper.pm \
	Bio/Coordinate/Graph.pm \
	Bio/Coordinate/MapperI.pm \
	Bio/Coordinate/Pair.pm \
	Bio/Coordinate/Result.pm \
	Bio/Coordinate/Result/Gap.pm \
	Bio/Coordinate/Result/Match.pm \
	Bio/Coordinate/ResultI.pm \
	Bio/Coordinate/Utils.pm \
	Bio/DB/Ace.pm \
	Bio/DB/Biblio/biofetch.pm \
	Bio/DB/Biblio/soap.pm \
	Bio/DB/BiblioI.pm \
	Bio/DB/BioFetch.pm \
	Bio/DB/CUTG.pm \
	Bio/DB/DBFetch.pm \
	Bio/DB/EMBL.pm \
	Bio/DB/Failover.pm \
	Bio/DB/Fasta.pm \
	Bio/DB/FileCache.pm \
	Bio/DB/Flat.pm \
	Bio/DB/Flat/BDB.pm \
	Bio/DB/Flat/BDB/embl.pm \
	Bio/DB/Flat/BDB/fasta.pm \
	Bio/DB/Flat/BDB/genbank.pm \
	Bio/DB/Flat/BDB/swiss.pm \
	Bio/DB/Flat/BDB/swissprot.pm \
	Bio/DB/Flat/BinarySearch.pm \
	Bio/DB/GDB.pm \
	Bio/DB/GFF.pm \
	Bio/DB/GFF/Adaptor/ace.pm \
	Bio/DB/GFF/Adaptor/biofetch.pm \
	Bio/DB/GFF/Adaptor/biofetch_oracle.pm \
	Bio/DB/GFF/Adaptor/dbi.pm \
	Bio/DB/GFF/Adaptor/dbi/caching_handle.pm \
	Bio/DB/GFF/Adaptor/dbi/iterator.pm \
	Bio/DB/GFF/Adaptor/dbi/mysql.pm \
	Bio/DB/GFF/Adaptor/dbi/mysqlace.pm \
	Bio/DB/GFF/Adaptor/dbi/mysqlopt.pm \
	Bio/DB/GFF/Adaptor/dbi/oracle.pm \
	Bio/DB/GFF/Adaptor/dbi/oracleace.pm \
	Bio/DB/GFF/Adaptor/dbi/pg.pm \
	Bio/DB/GFF/Adaptor/memory.pm \
	Bio/DB/GFF/Adaptor/memory_iterator.pm \
	Bio/DB/GFF/Aggregator.pm \
	Bio/DB/GFF/Aggregator/alignment.pm \
	Bio/DB/GFF/Aggregator/clone.pm \
	Bio/DB/GFF/Aggregator/coding.pm \
	Bio/DB/GFF/Aggregator/match.pm \
	Bio/DB/GFF/Aggregator/none.pm \
	Bio/DB/GFF/Aggregator/processed_transcript.pm \
	Bio/DB/GFF/Aggregator/transcript.pm \
	Bio/DB/GFF/Aggregator/ucsc_acembly.pm \
	Bio/DB/GFF/Aggregator/ucsc_ensgene.pm \
	Bio/DB/GFF/Aggregator/ucsc_genscan.pm \
	Bio/DB/GFF/Aggregator/ucsc_refgene.pm \
	Bio/DB/GFF/Aggregator/ucsc_sanger22.pm \
	Bio/DB/GFF/Aggregator/ucsc_sanger22pseudo.pm \
	Bio/DB/GFF/Aggregator/ucsc_softberry.pm \
	Bio/DB/GFF/Aggregator/ucsc_twinscan.pm \
	Bio/DB/GFF/Aggregator/ucsc_unigene.pm \
	Bio/DB/GFF/Featname.pm \
	Bio/DB/GFF/Feature.pm \
	Bio/DB/GFF/Homol.pm \
	Bio/DB/GFF/RelSegment.pm \
	Bio/DB/GFF/Segment.pm \
	Bio/DB/GFF/Typename.pm \
	Bio/DB/GFF/Util/Binning.pm \
	Bio/DB/GFF/Util/Rearrange.pm \
	Bio/DB/GenBank.pm \
	Bio/DB/GenPept.pm \
	Bio/DB/InMemoryCache.pm \
	Bio/DB/MANIFEST \
	Bio/DB/Makefile.PL \
	Bio/DB/MeSH.pm \
	Bio/DB/NCBIHelper.pm \
	Bio/DB/Query/GenBank.pm \
	Bio/DB/Query/WebQuery.pm \
	Bio/DB/QueryI.pm \
	Bio/DB/RandomAccessI.pm \
	Bio/DB/RefSeq.pm \
	Bio/DB/Registry.pm \
	Bio/DB/SeqI.pm \
	Bio/DB/SwissProt.pm \
	Bio/DB/Taxonomy.pm \
	Bio/DB/Taxonomy/entrez.pm \
	Bio/DB/Taxonomy/flatfile.pm \
	Bio/DB/Universal.pm \
	Bio/DB/UpdateableSeqI.pm \
	Bio/DB/WebDBSeqI.pm \
	Bio/DB/XEMBL.pm \
	Bio/DB/XEMBLService.pm \
	Bio/DBLinkContainerI.pm \
	Bio/Das/FeatureTypeI.pm \
	Bio/Das/SegmentI.pm \
	Bio/DasI.pm \
	Bio/DescribableI.pm \
	Bio/Event/EventGeneratorI.pm \
	Bio/Event/EventHandlerI.pm \
	Bio/Expression/FeatureGroup.pm \
	Bio/Expression/FeatureGroup/FeatureGroupMas50.pm \
	Bio/Expression/FeatureI.pm \
	Bio/Expression/FeatureSet/FeatureSetMas50.pm \
	Bio/Factory/AnalysisI.pm \
	Bio/Factory/ApplicationFactoryI.pm \
	Bio/Factory/DriverFactory.pm \
	Bio/Factory/FTLocationFactory.pm \
	Bio/Factory/HitFactoryI.pm \
	Bio/Factory/LocationFactoryI.pm \
	Bio/Factory/MapFactoryI.pm \
	Bio/Factory/ObjectBuilderI.pm \
	Bio/Factory/ObjectFactory.pm \
	Bio/Factory/ObjectFactoryI.pm \
	Bio/Factory/ResultFactoryI.pm \
	Bio/Factory/SeqAnalysisParserFactory.pm \
	Bio/Factory/SeqAnalysisParserFactoryI.pm \
	Bio/Factory/SequenceFactoryI.pm \
	Bio/Factory/SequenceProcessorI.pm \
	Bio/Factory/SequenceStreamI.pm \
	Bio/Factory/TreeFactoryI.pm \
	Bio/FeatureHolderI.pm \
	Bio/Graphics.pm \
	Bio/Graphics/ConfiguratorI.pm \
	Bio/Graphics/Feature.pm \
	Bio/Graphics/FeatureFile.pm \
	Bio/Graphics/FeatureFile/Iterator.pm \
	Bio/Graphics/Glyph.pm \
	Bio/Graphics/Glyph/Factory.pm \
	Bio/Graphics/Glyph/alignment.pm \
	Bio/Graphics/Glyph/anchored_arrow.pm \
	Bio/Graphics/Glyph/arrow.pm \
	Bio/Graphics/Glyph/box.pm \
	Bio/Graphics/Glyph/cds.pm \
	Bio/Graphics/Glyph/crossbox.pm \
	Bio/Graphics/Glyph/diamond.pm \
	Bio/Graphics/Glyph/dna.pm \
	Bio/Graphics/Glyph/dot.pm \
	Bio/Graphics/Glyph/ellipse.pm \
	Bio/Graphics/Glyph/ex.pm \
	Bio/Graphics/Glyph/extending_arrow.pm \
	Bio/Graphics/Glyph/generic.pm \
	Bio/Graphics/Glyph/graded_segments.pm \
	Bio/Graphics/Glyph/group.pm \
	Bio/Graphics/Glyph/heterogeneous_segments.pm \
	Bio/Graphics/Glyph/line.pm \
	Bio/Graphics/Glyph/minmax.pm \
	Bio/Graphics/Glyph/oval.pm \
	Bio/Graphics/Glyph/pinsertion.pm \
	Bio/Graphics/Glyph/primers.pm \
	Bio/Graphics/Glyph/processed_transcript.pm \
	Bio/Graphics/Glyph/redgreen_box.pm \
	Bio/Graphics/Glyph/redgreen_segment.pm \
	Bio/Graphics/Glyph/rndrect.pm \
	Bio/Graphics/Glyph/ruler_arrow.pm \
	Bio/Graphics/Glyph/segmented_keyglyph.pm \
	Bio/Graphics/Glyph/segments.pm \
	Bio/Graphics/Glyph/span.pm \
	Bio/Graphics/Glyph/splice_site.pm \
	Bio/Graphics/Glyph/toomany.pm \
	Bio/Graphics/Glyph/track.pm \
	Bio/Graphics/Glyph/transcript.pm \
	Bio/Graphics/Glyph/transcript2.pm \
	Bio/Graphics/Glyph/translation.pm \
	Bio/Graphics/Glyph/triangle.pm \
	Bio/Graphics/Glyph/xyplot.pm \
	Bio/Graphics/Panel.pm \
	Bio/Graphics/Pictogram.pm \
	Bio/Graphics/RendererI.pm \
	Bio/Graphics/Util.pm \
	Bio/IdCollectionI.pm \
	Bio/IdentifiableI.pm \
	Bio/Index/Abstract.pm \
	Bio/Index/AbstractSeq.pm \
	Bio/Index/Blast.pm \
	Bio/Index/EMBL.pm \
	Bio/Index/Fasta.pm \
	Bio/Index/Fastq.pm \
	Bio/Index/GenBank.pm \
	Bio/Index/SwissPfam.pm \
	Bio/Index/Swissprot.pm \
	Bio/LiveSeq/AARange.pm \
	Bio/LiveSeq/Chain.pm \
	Bio/LiveSeq/ChainI.pm \
	Bio/LiveSeq/DNA.pm \
	Bio/LiveSeq/Exon.pm \
	Bio/LiveSeq/Gene.pm \
	Bio/LiveSeq/IO/BioPerl.pm \
	Bio/LiveSeq/IO/Loader.pm \
	Bio/LiveSeq/IO/README \
	Bio/LiveSeq/IO/SRS.pm \
	Bio/LiveSeq/Intron.pm \
	Bio/LiveSeq/Mutation.pm \
	Bio/LiveSeq/Mutator.pm \
	Bio/LiveSeq/Prim_Transcript.pm \
	Bio/LiveSeq/Range.pm \
	Bio/LiveSeq/Repeat_Region.pm \
	Bio/LiveSeq/Repeat_Unit.pm \
	Bio/LiveSeq/SeqI.pm \
	Bio/LiveSeq/Transcript.pm \
	Bio/LiveSeq/Translation.pm \
	Bio/LocatableSeq.pm \
	Bio/Location/Atomic.pm \
	Bio/Location/AvWithinCoordPolicy.pm \
	Bio/Location/CoordinatePolicyI.pm \
	Bio/Location/Fuzzy.pm \
	Bio/Location/FuzzyLocationI.pm \
	Bio/Location/NarrowestCoordPolicy.pm \
	Bio/Location/Simple.pm \
	Bio/Location/Split.pm \
	Bio/Location/SplitLocationI.pm \
	Bio/Location/WidestCoordPolicy.pm \
	Bio/LocationI.pm \
	Bio/Map/CytoMap.pm \
	Bio/Map/CytoMarker.pm \
	Bio/Map/CytoPosition.pm \
	Bio/Map/LinkageMap.pm \
	Bio/Map/LinkagePosition.pm \
	Bio/Map/MapI.pm \
	Bio/Map/MappableI.pm \
	Bio/Map/Marker.pm \
	Bio/Map/MarkerI.pm \
	Bio/Map/Microsatellite.pm \
	Bio/Map/OrderedPosition.pm \
	Bio/Map/OrderedPositionWithDistance.pm \
	Bio/Map/Position.pm \
	Bio/Map/PositionI.pm \
	Bio/Map/SimpleMap.pm \
	Bio/MapIO.pm \
	Bio/MapIO/mapmaker.pm \
	Bio/Matrix/Generic.pm \
	Bio/Matrix/IO.pm \
	Bio/Matrix/IO/phylip.pm \
	Bio/Matrix/IO/scoring.pm \
	Bio/Matrix/MatrixI.pm \
	Bio/Matrix/PSM/IO.pm \
	Bio/Matrix/PSM/IO/mast.pm \
	Bio/Matrix/PSM/IO/meme.pm \
	Bio/Matrix/PSM/IO/transfac.pm \
	Bio/Matrix/PSM/InstanceSite.pm \
	Bio/Matrix/PSM/InstanceSiteI.pm \
	Bio/Matrix/PSM/Psm.pm \
	Bio/Matrix/PSM/PsmHeader.pm \
	Bio/Matrix/PSM/PsmHeaderI.pm \
	Bio/Matrix/PSM/PsmI.pm \
	Bio/Matrix/PSM/SiteMatrix.pm \
	Bio/Matrix/PSM/SiteMatrixI.pm \
	Bio/Matrix/PhylipDist.pm \
	Bio/Matrix/Scoring.pm \
	Bio/Ontology/GOterm.pm \
	Bio/Ontology/InterProTerm.pm \
	Bio/Ontology/Ontology.pm \
	Bio/Ontology/OntologyEngineI.pm \
	Bio/Ontology/OntologyI.pm \
	Bio/Ontology/OntologyStore.pm \
	Bio/Ontology/Path.pm \
	Bio/Ontology/PathI.pm \
	Bio/Ontology/Relationship.pm \
	Bio/Ontology/RelationshipFactory.pm \
	Bio/Ontology/RelationshipI.pm \
	Bio/Ontology/RelationshipType.pm \
	Bio/Ontology/SimpleGOEngine.pm \
	Bio/Ontology/SimpleOntologyEngine.pm \
	Bio/Ontology/Term.pm \
	Bio/Ontology/TermFactory.pm \
	Bio/Ontology/TermI.pm \
	Bio/OntologyIO.pm \
	Bio/OntologyIO/Handlers/BaseSAXHandler.pm \
	Bio/OntologyIO/Handlers/InterProHandler.pm \
	Bio/OntologyIO/Handlers/InterPro_BioSQL_Handler.pm \
	Bio/OntologyIO/InterProParser.pm \
	Bio/OntologyIO/dagflat.pm \
	Bio/OntologyIO/goflat.pm \
	Bio/OntologyIO/simplehierarchy.pm \
	Bio/OntologyIO/soflat.pm \
	Bio/Perl.pm \
	Bio/Phenotype/Correlate.pm \
	Bio/Phenotype/MeSH/Term.pm \
	Bio/Phenotype/MeSH/Twig.pm \
	Bio/Phenotype/Measure.pm \
	Bio/Phenotype/OMIM/MiniMIMentry.pm \
	Bio/Phenotype/OMIM/OMIMentry.pm \
	Bio/Phenotype/OMIM/OMIMentryAllelicVariant.pm \
	Bio/Phenotype/OMIM/OMIMparser.pm \
	Bio/Phenotype/Phenotype.pm \
	Bio/Phenotype/PhenotypeI.pm \
	Bio/PopGen/Genotype.pm \
	Bio/PopGen/GenotypeI.pm \
	Bio/PopGen/IO.pm \
	Bio/PopGen/IO/csv.pm \
	Bio/PopGen/IO/prettybase.pm \
	Bio/PopGen/Individual.pm \
	Bio/PopGen/IndividualI.pm \
	Bio/PopGen/Marker.pm \
	Bio/PopGen/MarkerI.pm \
	Bio/PopGen/PopStats.pm \
	Bio/PopGen/Population.pm \
	Bio/PopGen/PopulationI.pm \
	Bio/PopGen/Simulation/Coalescent.pm \
	Bio/PopGen/Simulation/GeneticDrift.pm \
	Bio/PopGen/Statistics.pm \
	Bio/PrimarySeq.pm \
	Bio/PrimarySeqI.pm \
	Bio/Range.pm \
	Bio/RangeI.pm \
	Bio/Restriction/Analysis.pm \
	Bio/Restriction/Enzyme.pm \
	Bio/Restriction/Enzyme/MultiCut.pm \
	Bio/Restriction/Enzyme/MultiSite.pm \
	Bio/Restriction/EnzymeCollection.pm \
	Bio/Restriction/EnzymeI.pm \
	Bio/Restriction/IO.pm \
	Bio/Restriction/IO/bairoch.pm \
	Bio/Restriction/IO/base.pm \
	Bio/Restriction/IO/itype2.pm \
	Bio/Restriction/IO/withrefm.pm \
	Bio/Root/Err.pm \
	Bio/Root/Exception.pm \
	Bio/Root/Global.pm \
	Bio/Root/HTTPget.pm \
	Bio/Root/IO.pm \
	Bio/Root/IOManager.pm \
	Bio/Root/Object.pm \
	Bio/Root/Root.pm \
	Bio/Root/RootI.pm \
	Bio/Root/Storable.pm \
	Bio/Root/Utilities.pm \
	Bio/Root/Vector.pm \
	Bio/Root/Version.pm \
	Bio/Root/Xref.pm \
	Bio/Search/BlastUtils.pm \
	Bio/Search/DatabaseI.pm \
	Bio/Search/GenericDatabase.pm \
	Bio/Search/HSP/BlastHSP.pm \
	Bio/Search/HSP/FastaHSP.pm \
	Bio/Search/HSP/GenericHSP.pm \
	Bio/Search/HSP/HMMERHSP.pm \
	Bio/Search/HSP/HSPFactory.pm \
	Bio/Search/HSP/HSPI.pm \
	Bio/Search/HSP/PSLHSP.pm \
	Bio/Search/HSP/PsiBlastHSP.pm \
	Bio/Search/HSP/WABAHSP.pm \
	Bio/Search/Hit/BlastHit.pm \
	Bio/Search/Hit/Fasta.pm \
	Bio/Search/Hit/GenericHit.pm \
	Bio/Search/Hit/HMMERHit.pm \
	Bio/Search/Hit/HitFactory.pm \
	Bio/Search/Hit/HitI.pm \
	Bio/Search/Hit/PsiBlastHit.pm \
	Bio/Search/Iteration/GenericIteration.pm \
	Bio/Search/Iteration/IterationI.pm \
	Bio/Search/Processor.pm \
	Bio/Search/Result/BlastResult.pm \
	Bio/Search/Result/GenericResult.pm \
	Bio/Search/Result/HMMERResult.pm \
	Bio/Search/Result/ResultFactory.pm \
	Bio/Search/Result/ResultI.pm \
	Bio/Search/Result/WABAResult.pm \
	Bio/Search/SearchUtils.pm \
	Bio/SearchDist.pm \
	Bio/SearchIO.pm \
	Bio/SearchIO/EventHandlerI.pm \
	Bio/SearchIO/FastHitEventBuilder.pm \
	Bio/SearchIO/IteratedSearchResultEventBuilder.pm \
	Bio/SearchIO/SearchResultEventBuilder.pm \
	Bio/SearchIO/SearchWriterI.pm \
	Bio/SearchIO/Writer/BSMLResultWriter.pm \
	Bio/SearchIO/Writer/GbrowseGFF.pm \
	Bio/SearchIO/Writer/HSPTableWriter.pm \
	Bio/SearchIO/Writer/HTMLResultWriter.pm \
	Bio/SearchIO/Writer/HitTableWriter.pm \
	Bio/SearchIO/Writer/ResultTableWriter.pm \
	Bio/SearchIO/Writer/TextResultWriter.pm \
	Bio/SearchIO/axt.pm \
	Bio/SearchIO/blast.pm \
	Bio/SearchIO/blasttable.pm \
	Bio/SearchIO/blastxml.pm \
	Bio/SearchIO/exonerate.pm \
	Bio/SearchIO/fasta.pm \
	Bio/SearchIO/hmmer.pm \
	Bio/SearchIO/megablast.pm \
	Bio/SearchIO/psl.pm \
	Bio/SearchIO/sim4.pm \
	Bio/SearchIO/waba.pm \
	Bio/SearchIO/wise.pm \
	Bio/Seq.pm \
	Bio/Seq/BaseSeqProcessor.pm \
	Bio/Seq/EncodedSeq.pm \
	Bio/Seq/LargePrimarySeq.pm \
	Bio/Seq/LargeSeq.pm \
	Bio/Seq/Meta.pm \
	Bio/Seq/Meta/Array.pm \
	Bio/Seq/MetaI.pm \
	Bio/Seq/PrimaryQual.pm \
	Bio/Seq/PrimedSeq.pm \
	Bio/Seq/QualI.pm \
	Bio/Seq/RichSeq.pm \
	Bio/Seq/RichSeqI.pm \
	Bio/Seq/SeqBuilder.pm \
	Bio/Seq/SeqFactory.pm \
	Bio/Seq/SeqFastaSpeedFactory.pm \
	Bio/Seq/SeqWithQuality.pm \
	Bio/Seq/SequenceTrace.pm \
	Bio/Seq/TraceI.pm \
	Bio/SeqAnalysisParserI.pm \
	Bio/SeqFeature/AnnotationAdaptor.pm \
	Bio/SeqFeature/Collection.pm \
	Bio/SeqFeature/CollectionI.pm \
	Bio/SeqFeature/Computation.pm \
	Bio/SeqFeature/FeaturePair.pm \
	Bio/SeqFeature/Gene/Exon.pm \
	Bio/SeqFeature/Gene/ExonI.pm \
	Bio/SeqFeature/Gene/GeneStructure.pm \
	Bio/SeqFeature/Gene/GeneStructureI.pm \
	Bio/SeqFeature/Gene/Intron.pm \
	Bio/SeqFeature/Gene/NC_Feature.pm \
	Bio/SeqFeature/Gene/Poly_A_site.pm \
	Bio/SeqFeature/Gene/Promoter.pm \
	Bio/SeqFeature/Gene/Transcript.pm \
	Bio/SeqFeature/Gene/TranscriptI.pm \
	Bio/SeqFeature/Gene/UTR.pm \
	Bio/SeqFeature/Generic.pm \
	Bio/SeqFeature/PositionProxy.pm \
	Bio/SeqFeature/Primer.pm \
	Bio/SeqFeature/SiRNA/Oligo.pm \
	Bio/SeqFeature/SiRNA/Pair.pm \
	Bio/SeqFeature/Similarity.pm \
	Bio/SeqFeature/SimilarityPair.pm \
	Bio/SeqFeature/Tools/TypeMapper.pm \
	Bio/SeqFeature/Tools/Unflattener.pm \
	Bio/SeqFeatureI.pm \
	Bio/SeqI.pm \
	Bio/SeqIO.pm \
	Bio/SeqIO/FTHelper.pm \
	Bio/SeqIO/MultiFile.pm \
	Bio/SeqIO/abi.pm \
	Bio/SeqIO/ace.pm \
	Bio/SeqIO/alf.pm \
	Bio/SeqIO/asciitree.pm \
	Bio/SeqIO/bsml.pm \
	Bio/SeqIO/chadoxml.pm \
	Bio/SeqIO/ctf.pm \
	Bio/SeqIO/embl.pm \
	Bio/SeqIO/exp.pm \
	Bio/SeqIO/fasta.pm \
	Bio/SeqIO/fastq.pm \
	Bio/SeqIO/game.pm \
	Bio/SeqIO/game/featHandler.pm \
	Bio/SeqIO/game/gameHandler.pm \
	Bio/SeqIO/game/gameSubs.pm \
	Bio/SeqIO/game/gameWriter.pm \
	Bio/SeqIO/game/seqHandler.pm \
	Bio/SeqIO/gcg.pm \
	Bio/SeqIO/genbank.pm \
	Bio/SeqIO/kegg.pm \
	Bio/SeqIO/largefasta.pm \
	Bio/SeqIO/locuslink.pm \
	Bio/SeqIO/metafasta.pm \
	Bio/SeqIO/phd.pm \
	Bio/SeqIO/pir.pm \
	Bio/SeqIO/pln.pm \
	Bio/SeqIO/qual.pm \
	Bio/SeqIO/raw.pm \
	Bio/SeqIO/scf.pm \
	Bio/SeqIO/swiss.pm \
	Bio/SeqIO/tab.pm \
	Bio/SeqIO/tigr.pm \
	Bio/SeqIO/ztr.pm \
	Bio/SeqUtils.pm \
	Bio/SimpleAlign.pm \
	Bio/SimpleAnalysisI.pm \
	Bio/Species.pm \
	Bio/Structure/Atom.pm \
	Bio/Structure/Chain.pm \
	Bio/Structure/Entry.pm \
	Bio/Structure/IO.pm \
	Bio/Structure/IO/pdb.pm \
	Bio/Structure/Model.pm \
	Bio/Structure/Residue.pm \
	Bio/Structure/SecStr/DSSP/Res.pm \
	Bio/Structure/SecStr/STRIDE/Res.pm \
	Bio/Structure/StructureI.pm \
	Bio/Symbol/Alphabet.pm \
	Bio/Symbol/AlphabetI.pm \
	Bio/Symbol/DNAAlphabet.pm \
	Bio/Symbol/ProteinAlphabet.pm \
	Bio/Symbol/README.Symbol \
	Bio/Symbol/Symbol.pm \
	Bio/Symbol/SymbolI.pm \
	Bio/Taxonomy.pm \
	Bio/Taxonomy/FactoryI.pm \
	Bio/Taxonomy/Node.pm \
	Bio/Taxonomy/Taxon.pm \
	Bio/Taxonomy/Tree.pm \
	Bio/Tools/AlignFactory.pm \
	Bio/Tools/Alignment/Consed.pm \
	Bio/Tools/Alignment/Trim.pm \
	Bio/Tools/Analysis/DNA/ESEfinder.pm \
	Bio/Tools/Analysis/Protein/Domcut.pm \
	Bio/Tools/Analysis/Protein/ELM.pm \
	Bio/Tools/Analysis/Protein/GOR4.pm \
	Bio/Tools/Analysis/Protein/HNN.pm \
	Bio/Tools/Analysis/Protein/Mitoprot.pm \
	Bio/Tools/Analysis/Protein/NetPhos.pm \
	Bio/Tools/Analysis/Protein/Scansite.pm \
	Bio/Tools/Analysis/Protein/Sopma.pm \
	Bio/Tools/Analysis/SimpleAnalysisBase.pm \
	Bio/Tools/AnalysisResult.pm \
	Bio/Tools/BPbl2seq.pm \
	Bio/Tools/BPlite.pm \
	Bio/Tools/BPlite/HSP.pm \
	Bio/Tools/BPlite/Iteration.pm \
	Bio/Tools/BPlite/Sbjct.pm \
	Bio/Tools/BPpsilite.pm \
	Bio/Tools/Blast.pm \
	Bio/Tools/Blast/CHANGES \
	Bio/Tools/Blast/HSP.pm \
	Bio/Tools/Blast/HTML.pm \
	Bio/Tools/Blast/README \
	Bio/Tools/Blast/Sbjct.pm \
	Bio/Tools/Blat.pm \
	Bio/Tools/CodonTable.pm \
	Bio/Tools/Coil.pm \
	Bio/Tools/ECnumber.pm \
	Bio/Tools/EMBOSS/Palindrome.pm \
	Bio/Tools/EPCR.pm \
	Bio/Tools/ESTScan.pm \
	Bio/Tools/Eponine.pm \
	Bio/Tools/Est2Genome.pm \
	Bio/Tools/FootPrinter.pm \
	Bio/Tools/GFF.pm \
	Bio/Tools/Gel.pm \
	Bio/Tools/Geneid.pm \
	Bio/Tools/Genemark.pm \
	Bio/Tools/Genewise.pm \
	Bio/Tools/Genomewise.pm \
	Bio/Tools/Genscan.pm \
	Bio/Tools/Glimmer.pm \
	Bio/Tools/Grail.pm \
	Bio/Tools/GuessSeqFormat.pm \
	Bio/Tools/HMMER/Domain.pm \
	Bio/Tools/HMMER/Results.pm \
	Bio/Tools/HMMER/Set.pm \
	Bio/Tools/Hmmpfam.pm \
	Bio/Tools/IUPAC.pm \
	Bio/Tools/Lucy.pm \
	Bio/Tools/MZEF.pm \
	Bio/Tools/OddCodes.pm \
	Bio/Tools/Phylo/Molphy.pm \
	Bio/Tools/Phylo/Molphy/Result.pm \
	Bio/Tools/Phylo/PAML.pm \
	Bio/Tools/Phylo/PAML/ModelResult.pm \
	Bio/Tools/Phylo/PAML/Result.pm \
	Bio/Tools/Phylo/Phylip/ProtDist.pm \
	Bio/Tools/Prediction/Exon.pm \
	Bio/Tools/Prediction/Gene.pm \
	Bio/Tools/Primer/Assessor/Base.pm \
	Bio/Tools/Primer/AssessorI.pm \
	Bio/Tools/Primer/Feature.pm \
	Bio/Tools/Primer/Pair.pm \
	Bio/Tools/Primer3.pm \
	Bio/Tools/Prints.pm \
	Bio/Tools/Profile.pm \
	Bio/Tools/Promoterwise.pm \
	Bio/Tools/PrositeScan.pm \
	Bio/Tools/Pseudowise.pm \
	Bio/Tools/QRNA.pm \
	Bio/Tools/RandomDistFunctions.pm \
	Bio/Tools/RepeatMasker.pm \
	Bio/Tools/RestrictionEnzyme.pm \
	Bio/Tools/Run/README \
	Bio/Tools/Run/RemoteBlast.pm \
	Bio/Tools/Run/StandAloneBlast.pm \
	Bio/Tools/Run/WrapperBase.pm \
	Bio/Tools/Seg.pm \
	Bio/Tools/SeqAnal.pm \
	Bio/Tools/SeqPattern.pm \
	Bio/Tools/SeqStats.pm \
	Bio/Tools/SeqWords.pm \
	Bio/Tools/SiRNA.pm \
	Bio/Tools/Sigcleave.pm \
	Bio/Tools/Signalp.pm \
	Bio/Tools/Sim4/Exon.pm \
	Bio/Tools/Sim4/Results.pm \
	Bio/Tools/Tmhmm.pm \
	Bio/Tools/WWW.pm \
	Bio/Tools/dpAlign.pm \
	Bio/Tools/pICalculator.pm \
	Bio/Tools/pSW.pm \
	Bio/Tree/AlleleNode.pm \
	Bio/Tree/Node.pm \
	Bio/Tree/NodeI.pm \
	Bio/Tree/NodeNHX.pm \
	Bio/Tree/RandomFactory.pm \
	Bio/Tree/Statistics.pm \
	Bio/Tree/Tree.pm \
	Bio/Tree/TreeFunctionsI.pm \
	Bio/Tree/TreeI.pm \
	Bio/TreeIO.pm \
	Bio/TreeIO/TreeEventBuilder.pm \
	Bio/TreeIO/lintree.pm \
	Bio/TreeIO/newick.pm \
	Bio/TreeIO/nexus.pm \
	Bio/TreeIO/nhx.pm \
	Bio/TreeIO/svggraph.pm \
	Bio/TreeIO/tabtree.pm \
	Bio/UpdateableSeqI.pm \
	Bio/Variation/AAChange.pm \
	Bio/Variation/AAReverseMutate.pm \
	Bio/Variation/Allele.pm \
	Bio/Variation/DNAMutation.pm \
	Bio/Variation/IO.pm \
	Bio/Variation/IO/flat.pm \
	Bio/Variation/IO/xml.pm \
	Bio/Variation/README \
	Bio/Variation/RNAChange.pm \
	Bio/Variation/SNP.pm \
	Bio/Variation/SeqDiff.pm \
	Bio/Variation/VariantI.pm \
	Bio/WebAgent.pm \
	biodatabases.pod \
	biodesign.pod \
	bioperl.pod \
	bioscripts.pod \
	bptutorial.pl

PM_TO_BLIB = Bio/SeqIO/bsml.pm \
	$(INST_LIBDIR)/Bio/SeqIO/bsml.pm \
	Bio/Location/NarrowestCoordPolicy.pm \
	$(INST_LIBDIR)/Bio/Location/NarrowestCoordPolicy.pm \
	Bio/MapIO.pm \
	$(INST_LIBDIR)/Bio/MapIO.pm \
	Bio/SeqIO/abi.pm \
	$(INST_LIBDIR)/Bio/SeqIO/abi.pm \
	Bio/OntologyIO/goflat.pm \
	$(INST_LIBDIR)/Bio/OntologyIO/goflat.pm \
	Bio/Biblio/Organisation.pm \
	$(INST_LIBDIR)/Bio/Biblio/Organisation.pm \
	Bio/DB/GFF/Aggregator/ucsc_sanger22.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_sanger22.pm \
	Bio/Taxonomy/Taxon.pm \
	$(INST_LIBDIR)/Bio/Taxonomy/Taxon.pm \
	Bio/DB/GFF/Adaptor/memory.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/memory.pm \
	Bio/Graphics/Glyph/redgreen_box.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/redgreen_box.pm \
	Bio/Tools/PrositeScan.pm \
	$(INST_LIBDIR)/Bio/Tools/PrositeScan.pm \
	Bio/Variation/IO.pm \
	$(INST_LIBDIR)/Bio/Variation/IO.pm \
	Bio/Factory/FTLocationFactory.pm \
	$(INST_LIBDIR)/Bio/Factory/FTLocationFactory.pm \
	Bio/Tools/ECnumber.pm \
	$(INST_LIBDIR)/Bio/Tools/ECnumber.pm \
	Bio/SearchIO/Writer/HTMLResultWriter.pm \
	$(INST_LIBDIR)/Bio/SearchIO/Writer/HTMLResultWriter.pm \
	Bio/Variation/AAChange.pm \
	$(INST_LIBDIR)/Bio/Variation/AAChange.pm \
	Bio/Biblio/Book.pm \
	$(INST_LIBDIR)/Bio/Biblio/Book.pm \
	Bio/DB/GFF/Aggregator/match.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/match.pm \
	Bio/Event/EventHandlerI.pm \
	$(INST_LIBDIR)/Bio/Event/EventHandlerI.pm \
	Bio/DB/GFF/Util/Rearrange.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Util/Rearrange.pm \
	Bio/Tools/OddCodes.pm \
	$(INST_LIBDIR)/Bio/Tools/OddCodes.pm \
	Bio/Factory/ObjectFactoryI.pm \
	$(INST_LIBDIR)/Bio/Factory/ObjectFactoryI.pm \
	Bio/Search/Hit/HitFactory.pm \
	$(INST_LIBDIR)/Bio/Search/Hit/HitFactory.pm \
	Bio/DB/GFF/Aggregator/clone.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/clone.pm \
	Bio/Root/Global.pm \
	$(INST_LIBDIR)/Bio/Root/Global.pm \
	Bio/Seq.pm \
	$(INST_LIBDIR)/Bio/Seq.pm \
	Bio/Root/Vector.pm \
	$(INST_LIBDIR)/Bio/Root/Vector.pm \
	Bio/SearchIO/Writer/TextResultWriter.pm \
	$(INST_LIBDIR)/Bio/SearchIO/Writer/TextResultWriter.pm \
	Bio/Tools/HMMER/Set.pm \
	$(INST_LIBDIR)/Bio/Tools/HMMER/Set.pm \
	Bio/SeqIO/FTHelper.pm \
	$(INST_LIBDIR)/Bio/SeqIO/FTHelper.pm \
	Bio/LiveSeq/IO/SRS.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/IO/SRS.pm \
	Bio/SearchIO/IteratedSearchResultEventBuilder.pm \
	$(INST_LIBDIR)/Bio/SearchIO/IteratedSearchResultEventBuilder.pm \
	Bio/Tools/Analysis/Protein/HNN.pm \
	$(INST_LIBDIR)/Bio/Tools/Analysis/Protein/HNN.pm \
	Bio/Tools/BPlite/HSP.pm \
	$(INST_LIBDIR)/Bio/Tools/BPlite/HSP.pm \
	Bio/Tools/Sigcleave.pm \
	$(INST_LIBDIR)/Bio/Tools/Sigcleave.pm \
	Bio/SearchIO/Writer/BSMLResultWriter.pm \
	$(INST_LIBDIR)/Bio/SearchIO/Writer/BSMLResultWriter.pm \
	Bio/DB/GFF/Adaptor/dbi/pg.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/pg.pm \
	Bio/Graphics/Glyph/triangle.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/triangle.pm \
	Bio/Assembly/IO/ace.pm \
	$(INST_LIBDIR)/Bio/Assembly/IO/ace.pm \
	Bio/Map/CytoMap.pm \
	$(INST_LIBDIR)/Bio/Map/CytoMap.pm \
	Bio/Restriction/IO/bairoch.pm \
	$(INST_LIBDIR)/Bio/Restriction/IO/bairoch.pm \
	Bio/Graphics/ConfiguratorI.pm \
	$(INST_LIBDIR)/Bio/Graphics/ConfiguratorI.pm \
	Bio/Structure/IO.pm \
	$(INST_LIBDIR)/Bio/Structure/IO.pm \
	Bio/Tree/RandomFactory.pm \
	$(INST_LIBDIR)/Bio/Tree/RandomFactory.pm \
	Bio/Factory/ObjectFactory.pm \
	$(INST_LIBDIR)/Bio/Factory/ObjectFactory.pm \
	Bio/Matrix/PSM/SiteMatrix.pm \
	$(INST_LIBDIR)/Bio/Matrix/PSM/SiteMatrix.pm \
	Bio/DasI.pm \
	$(INST_LIBDIR)/Bio/DasI.pm \
	Bio/Matrix/PSM/InstanceSiteI.pm \
	$(INST_LIBDIR)/Bio/Matrix/PSM/InstanceSiteI.pm \
	Bio/Coordinate/ExtrapolatingPair.pm \
	$(INST_LIBDIR)/Bio/Coordinate/ExtrapolatingPair.pm \
	Bio/SeqIO/pir.pm \
	$(INST_LIBDIR)/Bio/SeqIO/pir.pm \
	Bio/Tools/SeqAnal.pm \
	$(INST_LIBDIR)/Bio/Tools/SeqAnal.pm \
	Bio/Annotation/StructuredValue.pm \
	$(INST_LIBDIR)/Bio/Annotation/StructuredValue.pm \
	Bio/Biblio/Ref.pm \
	$(INST_LIBDIR)/Bio/Biblio/Ref.pm \
	Bio/SeqIO/phd.pm \
	$(INST_LIBDIR)/Bio/SeqIO/phd.pm \
	Bio/Tools/Glimmer.pm \
	$(INST_LIBDIR)/Bio/Tools/Glimmer.pm \
	Bio/CodonUsage/IO.pm \
	$(INST_LIBDIR)/Bio/CodonUsage/IO.pm \
	Bio/Location/Simple.pm \
	$(INST_LIBDIR)/Bio/Location/Simple.pm \
	Bio/Map/OrderedPositionWithDistance.pm \
	$(INST_LIBDIR)/Bio/Map/OrderedPositionWithDistance.pm \
	Bio/Tools/Phylo/PAML/Result.pm \
	$(INST_LIBDIR)/Bio/Tools/Phylo/PAML/Result.pm \
	Bio/AnnotationCollectionI.pm \
	$(INST_LIBDIR)/Bio/AnnotationCollectionI.pm \
	Bio/SeqAnalysisParserI.pm \
	$(INST_LIBDIR)/Bio/SeqAnalysisParserI.pm \
	Bio/Tools/GFF.pm \
	$(INST_LIBDIR)/Bio/Tools/GFF.pm \
	Bio/Variation/AAReverseMutate.pm \
	$(INST_LIBDIR)/Bio/Variation/AAReverseMutate.pm \
	Bio/Cluster/ClusterFactory.pm \
	$(INST_LIBDIR)/Bio/Cluster/ClusterFactory.pm \
	Bio/Tools/SiRNA.pm \
	$(INST_LIBDIR)/Bio/Tools/SiRNA.pm \
	Bio/SeqIO/qual.pm \
	$(INST_LIBDIR)/Bio/SeqIO/qual.pm \
	Bio/DB/EMBL.pm \
	$(INST_LIBDIR)/Bio/DB/EMBL.pm \
	Bio/Tools/SeqWords.pm \
	$(INST_LIBDIR)/Bio/Tools/SeqWords.pm \
	Bio/DB/UpdateableSeqI.pm \
	$(INST_LIBDIR)/Bio/DB/UpdateableSeqI.pm \
	Bio/SeqFeature/Computation.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Computation.pm \
	Bio/Matrix/IO.pm \
	$(INST_LIBDIR)/Bio/Matrix/IO.pm \
	Bio/SeqFeature/CollectionI.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/CollectionI.pm \
	Bio/DB/GFF/Homol.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Homol.pm \
	Bio/SeqIO/locuslink.pm \
	$(INST_LIBDIR)/Bio/SeqIO/locuslink.pm \
	Bio/SeqIO/ctf.pm \
	$(INST_LIBDIR)/Bio/SeqIO/ctf.pm \
	Bio/SeqUtils.pm \
	$(INST_LIBDIR)/Bio/SeqUtils.pm \
	Bio/DB/Flat/BDB/embl.pm \
	$(INST_LIBDIR)/Bio/DB/Flat/BDB/embl.pm \
	Bio/Tools/CodonTable.pm \
	$(INST_LIBDIR)/Bio/Tools/CodonTable.pm \
	Bio/Variation/README \
	$(INST_LIBDIR)/Bio/Variation/README \
	Bio/Annotation/Comment.pm \
	$(INST_LIBDIR)/Bio/Annotation/Comment.pm \
	Bio/Seq/BaseSeqProcessor.pm \
	$(INST_LIBDIR)/Bio/Seq/BaseSeqProcessor.pm \
	Bio/DB/GFF/Adaptor/dbi.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi.pm \
	Bio/Tools/QRNA.pm \
	$(INST_LIBDIR)/Bio/Tools/QRNA.pm \
	Bio/SeqFeature/Primer.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Primer.pm \
	Bio/PopGen/PopStats.pm \
	$(INST_LIBDIR)/Bio/PopGen/PopStats.pm \
	Bio/DB/SeqI.pm \
	$(INST_LIBDIR)/Bio/DB/SeqI.pm \
	Bio/Tools/Pseudowise.pm \
	$(INST_LIBDIR)/Bio/Tools/Pseudowise.pm \
	Bio/Tools/Profile.pm \
	$(INST_LIBDIR)/Bio/Tools/Profile.pm \
	Bio/ClusterIO/unigene.pm \
	$(INST_LIBDIR)/Bio/ClusterIO/unigene.pm \
	Bio/LiveSeq/Repeat_Region.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/Repeat_Region.pm \
	Bio/SeqFeature/Tools/TypeMapper.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Tools/TypeMapper.pm \
	Bio/DB/GFF/Aggregator.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator.pm \
	Bio/PopGen/PopulationI.pm \
	$(INST_LIBDIR)/Bio/PopGen/PopulationI.pm \
	Bio/SeqIO/game/seqHandler.pm \
	$(INST_LIBDIR)/Bio/SeqIO/game/seqHandler.pm \
	Bio/AnalysisResultI.pm \
	$(INST_LIBDIR)/Bio/AnalysisResultI.pm \
	Bio/Phenotype/MeSH/Term.pm \
	$(INST_LIBDIR)/Bio/Phenotype/MeSH/Term.pm \
	Bio/Tools/Sim4/Results.pm \
	$(INST_LIBDIR)/Bio/Tools/Sim4/Results.pm \
	Bio/Seq/PrimedSeq.pm \
	$(INST_LIBDIR)/Bio/Seq/PrimedSeq.pm \
	Bio/Biblio/MedlineJournalArticle.pm \
	$(INST_LIBDIR)/Bio/Biblio/MedlineJournalArticle.pm \
	Bio/SeqIO/swiss.pm \
	$(INST_LIBDIR)/Bio/SeqIO/swiss.pm \
	Bio/SeqFeatureI.pm \
	$(INST_LIBDIR)/Bio/SeqFeatureI.pm \
	Bio/Biblio/BiblioBase.pm \
	$(INST_LIBDIR)/Bio/Biblio/BiblioBase.pm \
	Bio/Location/Split.pm \
	$(INST_LIBDIR)/Bio/Location/Split.pm \
	Bio/DB/GenPept.pm \
	$(INST_LIBDIR)/Bio/DB/GenPept.pm \
	Bio/DB/GFF/Adaptor/biofetch.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/biofetch.pm \
	Bio/Ontology/SimpleGOEngine.pm \
	$(INST_LIBDIR)/Bio/Ontology/SimpleGOEngine.pm \
	Bio/Graphics/Glyph.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph.pm \
	Bio/Graphics/FeatureFile.pm \
	$(INST_LIBDIR)/Bio/Graphics/FeatureFile.pm \
	Bio/Tree/Node.pm \
	$(INST_LIBDIR)/Bio/Tree/Node.pm \
	Bio/Symbol/ProteinAlphabet.pm \
	$(INST_LIBDIR)/Bio/Symbol/ProteinAlphabet.pm \
	Bio/Tools/Prediction/Gene.pm \
	$(INST_LIBDIR)/Bio/Tools/Prediction/Gene.pm \
	Bio/DB/GFF/Adaptor/dbi/oracleace.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/oracleace.pm \
	Bio/Seq/MetaI.pm \
	$(INST_LIBDIR)/Bio/Seq/MetaI.pm \
	Bio/Tools/RepeatMasker.pm \
	$(INST_LIBDIR)/Bio/Tools/RepeatMasker.pm \
	Bio/LiveSeq/Repeat_Unit.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/Repeat_Unit.pm \
	Bio/Matrix/PSM/SiteMatrixI.pm \
	$(INST_LIBDIR)/Bio/Matrix/PSM/SiteMatrixI.pm \
	Bio/Tools/BPlite/Sbjct.pm \
	$(INST_LIBDIR)/Bio/Tools/BPlite/Sbjct.pm \
	Bio/AnnotationI.pm \
	$(INST_LIBDIR)/Bio/AnnotationI.pm \
	Bio/DB/XEMBL.pm \
	$(INST_LIBDIR)/Bio/DB/XEMBL.pm \
	Bio/Tools/Geneid.pm \
	$(INST_LIBDIR)/Bio/Tools/Geneid.pm \
	Bio/Tools/Prints.pm \
	$(INST_LIBDIR)/Bio/Tools/Prints.pm \
	Bio/Phenotype/Phenotype.pm \
	$(INST_LIBDIR)/Bio/Phenotype/Phenotype.pm \
	Bio/Tools/GuessSeqFormat.pm \
	$(INST_LIBDIR)/Bio/Tools/GuessSeqFormat.pm \
	Bio/Coordinate/Chain.pm \
	$(INST_LIBDIR)/Bio/Coordinate/Chain.pm \
	Bio/SeqIO/chadoxml.pm \
	$(INST_LIBDIR)/Bio/SeqIO/chadoxml.pm \
	Bio/Matrix/PSM/IO/mast.pm \
	$(INST_LIBDIR)/Bio/Matrix/PSM/IO/mast.pm \
	Bio/Root/IO.pm \
	$(INST_LIBDIR)/Bio/Root/IO.pm \
	Bio/Graphics/Glyph/transcript2.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/transcript2.pm \
	Bio/Index/Swissprot.pm \
	$(INST_LIBDIR)/Bio/Index/Swissprot.pm \
	Bio/PopGen/GenotypeI.pm \
	$(INST_LIBDIR)/Bio/PopGen/GenotypeI.pm \
	Bio/Search/HSP/PSLHSP.pm \
	$(INST_LIBDIR)/Bio/Search/HSP/PSLHSP.pm \
	Bio/DB/Flat/BDB/genbank.pm \
	$(INST_LIBDIR)/Bio/DB/Flat/BDB/genbank.pm \
	Bio/DB/GFF/Util/Binning.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Util/Binning.pm \
	Bio/Seq/LargePrimarySeq.pm \
	$(INST_LIBDIR)/Bio/Seq/LargePrimarySeq.pm \
	Bio/DB/GFF.pm \
	$(INST_LIBDIR)/Bio/DB/GFF.pm \
	Bio/SeqFeature/Gene/NC_Feature.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Gene/NC_Feature.pm \
	Bio/SimpleAlign.pm \
	$(INST_LIBDIR)/Bio/SimpleAlign.pm \
	Bio/SearchDist.pm \
	$(INST_LIBDIR)/Bio/SearchDist.pm \
	Bio/DB/GFF/Aggregator/alignment.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/alignment.pm \
	Bio/Symbol/Alphabet.pm \
	$(INST_LIBDIR)/Bio/Symbol/Alphabet.pm \
	Bio/Tools/Blast/HTML.pm \
	$(INST_LIBDIR)/Bio/Tools/Blast/HTML.pm \
	Bio/Tools/Gel.pm \
	$(INST_LIBDIR)/Bio/Tools/Gel.pm \
	Bio/OntologyIO/Handlers/BaseSAXHandler.pm \
	$(INST_LIBDIR)/Bio/OntologyIO/Handlers/BaseSAXHandler.pm \
	Bio/Graphics/RendererI.pm \
	$(INST_LIBDIR)/Bio/Graphics/RendererI.pm \
	Bio/Tools/BPlite.pm \
	$(INST_LIBDIR)/Bio/Tools/BPlite.pm \
	Bio/AlignIO/selex.pm \
	$(INST_LIBDIR)/Bio/AlignIO/selex.pm \
	Bio/Factory/TreeFactoryI.pm \
	$(INST_LIBDIR)/Bio/Factory/TreeFactoryI.pm \
	Bio/Range.pm \
	$(INST_LIBDIR)/Bio/Range.pm \
	Bio/Matrix/Scoring.pm \
	$(INST_LIBDIR)/Bio/Matrix/Scoring.pm \
	Bio/Annotation/AnnotationFactory.pm \
	$(INST_LIBDIR)/Bio/Annotation/AnnotationFactory.pm \
	Bio/Tools/Eponine.pm \
	$(INST_LIBDIR)/Bio/Tools/Eponine.pm \
	Bio/Map/Position.pm \
	$(INST_LIBDIR)/Bio/Map/Position.pm \
	Bio/Graphics/Util.pm \
	$(INST_LIBDIR)/Bio/Graphics/Util.pm \
	Bio/Biblio/MedlineBook.pm \
	$(INST_LIBDIR)/Bio/Biblio/MedlineBook.pm \
	Bio/Graphics/Glyph/diamond.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/diamond.pm \
	Bio/Search/Iteration/IterationI.pm \
	$(INST_LIBDIR)/Bio/Search/Iteration/IterationI.pm \
	Bio/Search/Result/GenericResult.pm \
	$(INST_LIBDIR)/Bio/Search/Result/GenericResult.pm \
	Bio/Factory/SequenceProcessorI.pm \
	$(INST_LIBDIR)/Bio/Factory/SequenceProcessorI.pm \
	Bio/Location/SplitLocationI.pm \
	$(INST_LIBDIR)/Bio/Location/SplitLocationI.pm \
	Bio/SeqIO/exp.pm \
	$(INST_LIBDIR)/Bio/SeqIO/exp.pm \
	Bio/Assembly/ContigAnalysis.pm \
	$(INST_LIBDIR)/Bio/Assembly/ContigAnalysis.pm \
	Bio/LiveSeq/DNA.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/DNA.pm \
	Bio/DB/GFF/Adaptor/ace.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/ace.pm \
	Bio/Graphics/Glyph/translation.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/translation.pm \
	Bio/Graphics/Glyph/ex.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/ex.pm \
	Bio/Seq/LargeSeq.pm \
	$(INST_LIBDIR)/Bio/Seq/LargeSeq.pm \
	Bio/Tools/Blast/README \
	$(INST_LIBDIR)/Bio/Tools/Blast/README \
	Bio/SeqIO/metafasta.pm \
	$(INST_LIBDIR)/Bio/SeqIO/metafasta.pm \
	Bio/Symbol/SymbolI.pm \
	$(INST_LIBDIR)/Bio/Symbol/SymbolI.pm \
	Bio/Matrix/PSM/Psm.pm \
	$(INST_LIBDIR)/Bio/Matrix/PSM/Psm.pm \
	Bio/Biblio/Thesis.pm \
	$(INST_LIBDIR)/Bio/Biblio/Thesis.pm \
	Bio/Search/GenericDatabase.pm \
	$(INST_LIBDIR)/Bio/Search/GenericDatabase.pm \
	Bio/Event/EventGeneratorI.pm \
	$(INST_LIBDIR)/Bio/Event/EventGeneratorI.pm \
	Bio/Taxonomy/Node.pm \
	$(INST_LIBDIR)/Bio/Taxonomy/Node.pm \
	Bio/AlignIO/meme.pm \
	$(INST_LIBDIR)/Bio/AlignIO/meme.pm \
	Bio/DB/GFF/Typename.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Typename.pm \
	Bio/Coordinate/Result/Gap.pm \
	$(INST_LIBDIR)/Bio/Coordinate/Result/Gap.pm \
	Bio/LiveSeq/Intron.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/Intron.pm \
	Bio/AlignIO/mase.pm \
	$(INST_LIBDIR)/Bio/AlignIO/mase.pm \
	Bio/DB/GFF/Aggregator/ucsc_refgene.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_refgene.pm \
	Bio/SeqIO/pln.pm \
	$(INST_LIBDIR)/Bio/SeqIO/pln.pm \
	Bio/Structure/Atom.pm \
	$(INST_LIBDIR)/Bio/Structure/Atom.pm \
	Bio/DB/BioFetch.pm \
	$(INST_LIBDIR)/Bio/DB/BioFetch.pm \
	Bio/LiveSeq/Gene.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/Gene.pm \
	Bio/Ontology/TermI.pm \
	$(INST_LIBDIR)/Bio/Ontology/TermI.pm \
	Bio/DB/GFF/Adaptor/dbi/mysql.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/mysql.pm \
	Bio/Biblio/PubmedBookArticle.pm \
	$(INST_LIBDIR)/Bio/Biblio/PubmedBookArticle.pm \
	Bio/SeqIO/game/featHandler.pm \
	$(INST_LIBDIR)/Bio/SeqIO/game/featHandler.pm \
	Bio/Tools/Analysis/Protein/NetPhos.pm \
	$(INST_LIBDIR)/Bio/Tools/Analysis/Protein/NetPhos.pm \
	Bio/SeqIO/game/gameHandler.pm \
	$(INST_LIBDIR)/Bio/SeqIO/game/gameHandler.pm \
	Bio/Ontology/InterProTerm.pm \
	$(INST_LIBDIR)/Bio/Ontology/InterProTerm.pm \
	Bio/Phenotype/PhenotypeI.pm \
	$(INST_LIBDIR)/Bio/Phenotype/PhenotypeI.pm \
	Bio/Search/Iteration/GenericIteration.pm \
	$(INST_LIBDIR)/Bio/Search/Iteration/GenericIteration.pm \
	Bio/PopGen/Genotype.pm \
	$(INST_LIBDIR)/Bio/PopGen/Genotype.pm \
	Bio/Assembly/IO.pm \
	$(INST_LIBDIR)/Bio/Assembly/IO.pm \
	Bio/DB/GFF/RelSegment.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/RelSegment.pm \
	Bio/DB/XEMBLService.pm \
	$(INST_LIBDIR)/Bio/DB/XEMBLService.pm \
	Bio/SearchIO/EventHandlerI.pm \
	$(INST_LIBDIR)/Bio/SearchIO/EventHandlerI.pm \
	Bio/TreeIO/nhx.pm \
	$(INST_LIBDIR)/Bio/TreeIO/nhx.pm \
	Bio/Graphics/Glyph/processed_transcript.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/processed_transcript.pm \
	Bio/Ontology/TermFactory.pm \
	$(INST_LIBDIR)/Bio/Ontology/TermFactory.pm \
	Bio/SeqI.pm \
	$(INST_LIBDIR)/Bio/SeqI.pm \
	Bio/Align/StatisticsI.pm \
	$(INST_LIBDIR)/Bio/Align/StatisticsI.pm \
	Bio/Matrix/PSM/IO/meme.pm \
	$(INST_LIBDIR)/Bio/Matrix/PSM/IO/meme.pm \
	Bio/Matrix/MatrixI.pm \
	$(INST_LIBDIR)/Bio/Matrix/MatrixI.pm \
	Bio/Graphics/Glyph/graded_segments.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/graded_segments.pm \
	Bio/Matrix/PhylipDist.pm \
	$(INST_LIBDIR)/Bio/Matrix/PhylipDist.pm \
	Bio/Coordinate/GeneMapper.pm \
	$(INST_LIBDIR)/Bio/Coordinate/GeneMapper.pm \
	Bio/IdCollectionI.pm \
	$(INST_LIBDIR)/Bio/IdCollectionI.pm \
	Bio/Cluster/UniGene.pm \
	$(INST_LIBDIR)/Bio/Cluster/UniGene.pm \
	Bio/Coordinate/ResultI.pm \
	$(INST_LIBDIR)/Bio/Coordinate/ResultI.pm \
	Bio/Ontology/Ontology.pm \
	$(INST_LIBDIR)/Bio/Ontology/Ontology.pm \
	Bio/Tools/Blat.pm \
	$(INST_LIBDIR)/Bio/Tools/Blat.pm \
	Bio/SeqIO/scf.pm \
	$(INST_LIBDIR)/Bio/SeqIO/scf.pm \
	Bio/LiveSeq/Translation.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/Translation.pm \
	Bio/LiveSeq/Chain.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/Chain.pm \
	Bio/Seq/SeqBuilder.pm \
	$(INST_LIBDIR)/Bio/Seq/SeqBuilder.pm \
	Bio/Align/Utilities.pm \
	$(INST_LIBDIR)/Bio/Align/Utilities.pm \
	Bio/Root/Version.pm \
	$(INST_LIBDIR)/Bio/Root/Version.pm \
	Bio/PopGen/Marker.pm \
	$(INST_LIBDIR)/Bio/PopGen/Marker.pm \
	Bio/Biblio/IO/pubmed2ref.pm \
	$(INST_LIBDIR)/Bio/Biblio/IO/pubmed2ref.pm \
	Bio/LiveSeq/Mutation.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/Mutation.pm \
	Bio/Location/AvWithinCoordPolicy.pm \
	$(INST_LIBDIR)/Bio/Location/AvWithinCoordPolicy.pm \
	Bio/Tools/IUPAC.pm \
	$(INST_LIBDIR)/Bio/Tools/IUPAC.pm \
	Bio/DB/GFF/Aggregator/processed_transcript.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/processed_transcript.pm \
	Bio/Structure/Entry.pm \
	$(INST_LIBDIR)/Bio/Structure/Entry.pm \
	Bio/Perl.pm \
	$(INST_LIBDIR)/Bio/Perl.pm \
	Bio/Align/DNAStatistics.pm \
	$(INST_LIBDIR)/Bio/Align/DNAStatistics.pm \
	Bio/Ontology/OntologyEngineI.pm \
	$(INST_LIBDIR)/Bio/Ontology/OntologyEngineI.pm \
	Bio/SeqIO/ztr.pm \
	$(INST_LIBDIR)/Bio/SeqIO/ztr.pm \
	Bio/Tools/SeqPattern.pm \
	$(INST_LIBDIR)/Bio/Tools/SeqPattern.pm \
	Bio/Factory/LocationFactoryI.pm \
	$(INST_LIBDIR)/Bio/Factory/LocationFactoryI.pm \
	Bio/DB/CUTG.pm \
	$(INST_LIBDIR)/Bio/DB/CUTG.pm \
	Bio/SearchIO/SearchResultEventBuilder.pm \
	$(INST_LIBDIR)/Bio/SearchIO/SearchResultEventBuilder.pm \
	Bio/Restriction/EnzymeI.pm \
	$(INST_LIBDIR)/Bio/Restriction/EnzymeI.pm \
	Bio/SeqFeature/Tools/Unflattener.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Tools/Unflattener.pm \
	Bio/ClusterI.pm \
	$(INST_LIBDIR)/Bio/ClusterI.pm \
	Bio/SeqFeature/Gene/Transcript.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Gene/Transcript.pm \
	Bio/Align/PairwiseStatistics.pm \
	$(INST_LIBDIR)/Bio/Align/PairwiseStatistics.pm \
	Bio/Tools/WWW.pm \
	$(INST_LIBDIR)/Bio/Tools/WWW.pm \
	Bio/Tools/Tmhmm.pm \
	$(INST_LIBDIR)/Bio/Tools/Tmhmm.pm \
	Bio/Tools/Alignment/Trim.pm \
	$(INST_LIBDIR)/Bio/Tools/Alignment/Trim.pm \
	Bio/Tools/ESTScan.pm \
	$(INST_LIBDIR)/Bio/Tools/ESTScan.pm \
	Bio/SeqIO/asciitree.pm \
	$(INST_LIBDIR)/Bio/SeqIO/asciitree.pm \
	Bio/Graphics/Glyph/oval.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/oval.pm \
	Bio/SearchIO/blasttable.pm \
	$(INST_LIBDIR)/Bio/SearchIO/blasttable.pm \
	Bio/Tools/Primer/AssessorI.pm \
	$(INST_LIBDIR)/Bio/Tools/Primer/AssessorI.pm \
	Bio/Variation/IO/xml.pm \
	$(INST_LIBDIR)/Bio/Variation/IO/xml.pm \
	Bio/Taxonomy/FactoryI.pm \
	$(INST_LIBDIR)/Bio/Taxonomy/FactoryI.pm \
	Bio/Tools/RandomDistFunctions.pm \
	$(INST_LIBDIR)/Bio/Tools/RandomDistFunctions.pm \
	Bio/Seq/SeqWithQuality.pm \
	$(INST_LIBDIR)/Bio/Seq/SeqWithQuality.pm \
	Bio/Graphics/Glyph/ellipse.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/ellipse.pm \
	Bio/DB/GFF/Aggregator/ucsc_genscan.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_genscan.pm \
	Bio/Biblio/MedlineArticle.pm \
	$(INST_LIBDIR)/Bio/Biblio/MedlineArticle.pm \
	Bio/LiveSeq/Transcript.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/Transcript.pm \
	Bio/Tools/MZEF.pm \
	$(INST_LIBDIR)/Bio/Tools/MZEF.pm \
	Bio/SearchIO/hmmer.pm \
	$(INST_LIBDIR)/Bio/SearchIO/hmmer.pm \
	Bio/Biblio/Article.pm \
	$(INST_LIBDIR)/Bio/Biblio/Article.pm \
	Bio/DB/GFF/Aggregator/ucsc_unigene.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_unigene.pm \
	Bio/AlignIO/prodom.pm \
	$(INST_LIBDIR)/Bio/AlignIO/prodom.pm \
	Bio/Graphics/Glyph/Factory.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/Factory.pm \
	Bio/Annotation/Collection.pm \
	$(INST_LIBDIR)/Bio/Annotation/Collection.pm \
	Bio/Restriction/IO/itype2.pm \
	$(INST_LIBDIR)/Bio/Restriction/IO/itype2.pm \
	biodatabases.pod \
	$(INST_LIBDIR)/biodatabases.pod \
	Bio/Restriction/EnzymeCollection.pm \
	$(INST_LIBDIR)/Bio/Restriction/EnzymeCollection.pm \
	Bio/SeqIO/tab.pm \
	$(INST_LIBDIR)/Bio/SeqIO/tab.pm \
	Bio/SeqFeature/Gene/Promoter.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Gene/Promoter.pm \
	Bio/DB/GFF/Adaptor/dbi/mysqlopt.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/mysqlopt.pm \
	Bio/Factory/SeqAnalysisParserFactoryI.pm \
	$(INST_LIBDIR)/Bio/Factory/SeqAnalysisParserFactoryI.pm \
	Bio/Graphics/Feature.pm \
	$(INST_LIBDIR)/Bio/Graphics/Feature.pm \
	Bio/Restriction/IO/withrefm.pm \
	$(INST_LIBDIR)/Bio/Restriction/IO/withrefm.pm \
	Bio/DB/Universal.pm \
	$(INST_LIBDIR)/Bio/DB/Universal.pm \
	Bio/Graphics/Glyph/anchored_arrow.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/anchored_arrow.pm \
	Bio/Tools/Grail.pm \
	$(INST_LIBDIR)/Bio/Tools/Grail.pm \
	Bio/Variation/SNP.pm \
	$(INST_LIBDIR)/Bio/Variation/SNP.pm \
	Bio/Taxonomy.pm \
	$(INST_LIBDIR)/Bio/Taxonomy.pm \
	Bio/Variation/DNAMutation.pm \
	$(INST_LIBDIR)/Bio/Variation/DNAMutation.pm \
	Bio/PrimarySeqI.pm \
	$(INST_LIBDIR)/Bio/PrimarySeqI.pm \
	Bio/DescribableI.pm \
	$(INST_LIBDIR)/Bio/DescribableI.pm \
	Bio/Graphics/Glyph/group.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/group.pm \
	Bio/Search/Result/ResultI.pm \
	$(INST_LIBDIR)/Bio/Search/Result/ResultI.pm \
	Bio/Tools/Coil.pm \
	$(INST_LIBDIR)/Bio/Tools/Coil.pm \
	Bio/Expression/FeatureGroup.pm \
	$(INST_LIBDIR)/Bio/Expression/FeatureGroup.pm \
	Bio/TreeIO/tabtree.pm \
	$(INST_LIBDIR)/Bio/TreeIO/tabtree.pm \
	Bio/Coordinate/Graph.pm \
	$(INST_LIBDIR)/Bio/Coordinate/Graph.pm \
	Bio/DB/Registry.pm \
	$(INST_LIBDIR)/Bio/DB/Registry.pm \
	Bio/AnnotatableI.pm \
	$(INST_LIBDIR)/Bio/AnnotatableI.pm \
	biodesign.pod \
	$(INST_LIBDIR)/biodesign.pod \
	Bio/AlignIO/psi.pm \
	$(INST_LIBDIR)/Bio/AlignIO/psi.pm \
	Bio/Ontology/Term.pm \
	$(INST_LIBDIR)/Bio/Ontology/Term.pm \
	Bio/AlignIO/msf.pm \
	$(INST_LIBDIR)/Bio/AlignIO/msf.pm \
	Bio/Restriction/Enzyme.pm \
	$(INST_LIBDIR)/Bio/Restriction/Enzyme.pm \
	Bio/Coordinate/Collection.pm \
	$(INST_LIBDIR)/Bio/Coordinate/Collection.pm \
	Bio/Map/OrderedPosition.pm \
	$(INST_LIBDIR)/Bio/Map/OrderedPosition.pm \
	Bio/SearchIO/megablast.pm \
	$(INST_LIBDIR)/Bio/SearchIO/megablast.pm \
	Bio/Search/HSP/GenericHSP.pm \
	$(INST_LIBDIR)/Bio/Search/HSP/GenericHSP.pm \
	Bio/Tools/Genewise.pm \
	$(INST_LIBDIR)/Bio/Tools/Genewise.pm \
	Bio/Graphics/Glyph/toomany.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/toomany.pm \
	Bio/TreeIO.pm \
	$(INST_LIBDIR)/Bio/TreeIO.pm \
	Bio/DB/GFF/Adaptor/memory_iterator.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/memory_iterator.pm \
	Bio/Tools/Phylo/PAML.pm \
	$(INST_LIBDIR)/Bio/Tools/Phylo/PAML.pm \
	Bio/DB/MeSH.pm \
	$(INST_LIBDIR)/Bio/DB/MeSH.pm \
	Bio/DB/Makefile.PL \
	$(INST_LIBDIR)/Bio/DB/Makefile.PL \
	Bio/Factory/HitFactoryI.pm \
	$(INST_LIBDIR)/Bio/Factory/HitFactoryI.pm \
	Bio/Map/CytoMarker.pm \
	$(INST_LIBDIR)/Bio/Map/CytoMarker.pm \
	Bio/Tree/TreeI.pm \
	$(INST_LIBDIR)/Bio/Tree/TreeI.pm \
	Bio/Assembly/ScaffoldI.pm \
	$(INST_LIBDIR)/Bio/Assembly/ScaffoldI.pm \
	Bio/Tree/Statistics.pm \
	$(INST_LIBDIR)/Bio/Tree/Statistics.pm \
	Bio/LiveSeq/Mutator.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/Mutator.pm \
	Bio/Search/BlastUtils.pm \
	$(INST_LIBDIR)/Bio/Search/BlastUtils.pm \
	Bio/Tools/BPlite/Iteration.pm \
	$(INST_LIBDIR)/Bio/Tools/BPlite/Iteration.pm \
	Bio/OntologyIO/InterProParser.pm \
	$(INST_LIBDIR)/Bio/OntologyIO/InterProParser.pm \
	Bio/Phenotype/OMIM/OMIMentryAllelicVariant.pm \
	$(INST_LIBDIR)/Bio/Phenotype/OMIM/OMIMentryAllelicVariant.pm \
	Bio/SeqIO.pm \
	$(INST_LIBDIR)/Bio/SeqIO.pm \
	Bio/Index/Fasta.pm \
	$(INST_LIBDIR)/Bio/Index/Fasta.pm \
	Bio/DB/GFF/Aggregator/transcript.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/transcript.pm \
	bptutorial.pl \
	$(INST_LIBDIR)/bptutorial.pl \
	Bio/Symbol/AlphabetI.pm \
	$(INST_LIBDIR)/Bio/Symbol/AlphabetI.pm \
	Bio/SeqIO/game.pm \
	$(INST_LIBDIR)/Bio/SeqIO/game.pm \
	Bio/Biblio/MedlineBookArticle.pm \
	$(INST_LIBDIR)/Bio/Biblio/MedlineBookArticle.pm \
	Bio/OntologyIO.pm \
	$(INST_LIBDIR)/Bio/OntologyIO.pm \
	Bio/SeqIO/tigr.pm \
	$(INST_LIBDIR)/Bio/SeqIO/tigr.pm \
	Bio/AlignIO/metafasta.pm \
	$(INST_LIBDIR)/Bio/AlignIO/metafasta.pm \
	Bio/AlignIO/pfam.pm \
	$(INST_LIBDIR)/Bio/AlignIO/pfam.pm \
	Bio/Map/LinkagePosition.pm \
	$(INST_LIBDIR)/Bio/Map/LinkagePosition.pm \
	Bio/Ontology/PathI.pm \
	$(INST_LIBDIR)/Bio/Ontology/PathI.pm \
	Bio/SeqIO/raw.pm \
	$(INST_LIBDIR)/Bio/SeqIO/raw.pm \
	Bio/Tools/BPbl2seq.pm \
	$(INST_LIBDIR)/Bio/Tools/BPbl2seq.pm \
	Bio/Search/HSP/PsiBlastHSP.pm \
	$(INST_LIBDIR)/Bio/Search/HSP/PsiBlastHSP.pm \
	Bio/SeqFeature/Gene/GeneStructureI.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Gene/GeneStructureI.pm \
	Bio/AlignIO/bl2seq.pm \
	$(INST_LIBDIR)/Bio/AlignIO/bl2seq.pm \
	Bio/Tools/Primer/Feature.pm \
	$(INST_LIBDIR)/Bio/Tools/Primer/Feature.pm \
	Bio/Tools/Phylo/Molphy/Result.pm \
	$(INST_LIBDIR)/Bio/Tools/Phylo/Molphy/Result.pm \
	Bio/Tools/Analysis/SimpleAnalysisBase.pm \
	$(INST_LIBDIR)/Bio/Tools/Analysis/SimpleAnalysisBase.pm \
	Bio/SeqFeature/Gene/UTR.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Gene/UTR.pm \
	Bio/DBLinkContainerI.pm \
	$(INST_LIBDIR)/Bio/DBLinkContainerI.pm \
	Bio/SearchIO/Writer/HSPTableWriter.pm \
	$(INST_LIBDIR)/Bio/SearchIO/Writer/HSPTableWriter.pm \
	Bio/SearchIO/axt.pm \
	$(INST_LIBDIR)/Bio/SearchIO/axt.pm \
	Bio/SeqIO/kegg.pm \
	$(INST_LIBDIR)/Bio/SeqIO/kegg.pm \
	Bio/Tools/Genomewise.pm \
	$(INST_LIBDIR)/Bio/Tools/Genomewise.pm \
	Bio/DB/Ace.pm \
	$(INST_LIBDIR)/Bio/DB/Ace.pm \
	Bio/SearchIO/psl.pm \
	$(INST_LIBDIR)/Bio/SearchIO/psl.pm \
	Bio/Map/CytoPosition.pm \
	$(INST_LIBDIR)/Bio/Map/CytoPosition.pm \
	Bio/Index/SwissPfam.pm \
	$(INST_LIBDIR)/Bio/Index/SwissPfam.pm \
	Bio/Restriction/IO/base.pm \
	$(INST_LIBDIR)/Bio/Restriction/IO/base.pm \
	Bio/DB/Taxonomy.pm \
	$(INST_LIBDIR)/Bio/DB/Taxonomy.pm \
	Bio/Graphics/Glyph/splice_site.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/splice_site.pm \
	Bio/Factory/ObjectBuilderI.pm \
	$(INST_LIBDIR)/Bio/Factory/ObjectBuilderI.pm \
	Bio/Tools/Est2Genome.pm \
	$(INST_LIBDIR)/Bio/Tools/Est2Genome.pm \
	Bio/AlignIO/emboss.pm \
	$(INST_LIBDIR)/Bio/AlignIO/emboss.pm \
	Bio/Index/Fastq.pm \
	$(INST_LIBDIR)/Bio/Index/Fastq.pm \
	Bio/DB/Flat/BDB/fasta.pm \
	$(INST_LIBDIR)/Bio/DB/Flat/BDB/fasta.pm \
	Bio/Tools/Run/WrapperBase.pm \
	$(INST_LIBDIR)/Bio/Tools/Run/WrapperBase.pm \
	Bio/Annotation/Reference.pm \
	$(INST_LIBDIR)/Bio/Annotation/Reference.pm \
	Bio/Biblio/PubmedArticle.pm \
	$(INST_LIBDIR)/Bio/Biblio/PubmedArticle.pm \
	Bio/Root/Err.pm \
	$(INST_LIBDIR)/Bio/Root/Err.pm \
	Bio/Matrix/PSM/PsmHeader.pm \
	$(INST_LIBDIR)/Bio/Matrix/PSM/PsmHeader.pm \
	Bio/Graphics/Glyph/rndrect.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/rndrect.pm \
	Bio/Ontology/SimpleOntologyEngine.pm \
	$(INST_LIBDIR)/Bio/Ontology/SimpleOntologyEngine.pm \
	Bio/AnalysisI.pm \
	$(INST_LIBDIR)/Bio/AnalysisI.pm \
	Bio/Annotation/DBLink.pm \
	$(INST_LIBDIR)/Bio/Annotation/DBLink.pm \
	Bio/Seq/TraceI.pm \
	$(INST_LIBDIR)/Bio/Seq/TraceI.pm \
	Bio/PopGen/Individual.pm \
	$(INST_LIBDIR)/Bio/PopGen/Individual.pm \
	Bio/DB/Flat/BDB.pm \
	$(INST_LIBDIR)/Bio/DB/Flat/BDB.pm \
	Bio/DB/DBFetch.pm \
	$(INST_LIBDIR)/Bio/DB/DBFetch.pm \
	Bio/Seq/SequenceTrace.pm \
	$(INST_LIBDIR)/Bio/Seq/SequenceTrace.pm \
	Bio/Graphics/Glyph/box.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/box.pm \
	Bio/SearchIO/SearchWriterI.pm \
	$(INST_LIBDIR)/Bio/SearchIO/SearchWriterI.pm \
	Bio/Graphics/Glyph/redgreen_segment.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/redgreen_segment.pm \
	Bio/Graphics/Glyph/span.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/span.pm \
	Bio/SearchIO/blast.pm \
	$(INST_LIBDIR)/Bio/SearchIO/blast.pm \
	Bio/Biblio/IO/medlinexml.pm \
	$(INST_LIBDIR)/Bio/Biblio/IO/medlinexml.pm \
	Bio/DB/Flat/BDB/swissprot.pm \
	$(INST_LIBDIR)/Bio/DB/Flat/BDB/swissprot.pm \
	Bio/Tools/Alignment/Consed.pm \
	$(INST_LIBDIR)/Bio/Tools/Alignment/Consed.pm \
	Bio/SeqFeature/Generic.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Generic.pm \
	Bio/SeqFeature/Gene/GeneStructure.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Gene/GeneStructure.pm \
	Bio/Ontology/Path.pm \
	$(INST_LIBDIR)/Bio/Ontology/Path.pm \
	Bio/Tools/Run/RemoteBlast.pm \
	$(INST_LIBDIR)/Bio/Tools/Run/RemoteBlast.pm \
	Bio/Tools/EMBOSS/Palindrome.pm \
	$(INST_LIBDIR)/Bio/Tools/EMBOSS/Palindrome.pm \
	Bio/DB/GFF/Aggregator/ucsc_ensgene.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_ensgene.pm \
	Bio/Search/SearchUtils.pm \
	$(INST_LIBDIR)/Bio/Search/SearchUtils.pm \
	Bio/Root/HTTPget.pm \
	$(INST_LIBDIR)/Bio/Root/HTTPget.pm \
	Bio/Structure/SecStr/STRIDE/Res.pm \
	$(INST_LIBDIR)/Bio/Structure/SecStr/STRIDE/Res.pm \
	Bio/Map/MappableI.pm \
	$(INST_LIBDIR)/Bio/Map/MappableI.pm \
	Bio/Root/Xref.pm \
	$(INST_LIBDIR)/Bio/Root/Xref.pm \
	Bio/Search/Hit/HMMERHit.pm \
	$(INST_LIBDIR)/Bio/Search/Hit/HMMERHit.pm \
	Bio/Graphics/Glyph/minmax.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/minmax.pm \
	Bio/Symbol/README.Symbol \
	$(INST_LIBDIR)/Bio/Symbol/README.Symbol \
	Bio/Graphics/Glyph/cds.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/cds.pm \
	Bio/SearchIO/Writer/GbrowseGFF.pm \
	$(INST_LIBDIR)/Bio/SearchIO/Writer/GbrowseGFF.pm \
	Bio/Biblio/IO/pubmedxml.pm \
	$(INST_LIBDIR)/Bio/Biblio/IO/pubmedxml.pm \
	Bio/Coordinate/Result/Match.pm \
	$(INST_LIBDIR)/Bio/Coordinate/Result/Match.pm \
	Bio/Graphics/Glyph/track.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/track.pm \
	Bio/Root/Object.pm \
	$(INST_LIBDIR)/Bio/Root/Object.pm \
	Bio/DB/GFF/Adaptor/dbi/oracle.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/oracle.pm \
	Bio/Search/DatabaseI.pm \
	$(INST_LIBDIR)/Bio/Search/DatabaseI.pm \
	Bio/SeqIO/ace.pm \
	$(INST_LIBDIR)/Bio/SeqIO/ace.pm \
	Bio/Location/FuzzyLocationI.pm \
	$(INST_LIBDIR)/Bio/Location/FuzzyLocationI.pm \
	Bio/OntologyIO/Handlers/InterProHandler.pm \
	$(INST_LIBDIR)/Bio/OntologyIO/Handlers/InterProHandler.pm \
	Bio/Search/Result/ResultFactory.pm \
	$(INST_LIBDIR)/Bio/Search/Result/ResultFactory.pm \
	Bio/Factory/ResultFactoryI.pm \
	$(INST_LIBDIR)/Bio/Factory/ResultFactoryI.pm \
	Bio/FeatureHolderI.pm \
	$(INST_LIBDIR)/Bio/FeatureHolderI.pm \
	Bio/DB/QueryI.pm \
	$(INST_LIBDIR)/Bio/DB/QueryI.pm \
	Bio/Tools/AnalysisResult.pm \
	$(INST_LIBDIR)/Bio/Tools/AnalysisResult.pm \
	Bio/Biblio/BookArticle.pm \
	$(INST_LIBDIR)/Bio/Biblio/BookArticle.pm \
	Bio/DB/MANIFEST \
	$(INST_LIBDIR)/Bio/DB/MANIFEST \
	Bio/Matrix/PSM/PsmHeaderI.pm \
	$(INST_LIBDIR)/Bio/Matrix/PSM/PsmHeaderI.pm \
	Bio/Seq/RichSeqI.pm \
	$(INST_LIBDIR)/Bio/Seq/RichSeqI.pm \
	Bio/ClusterIO/dbsnp.pm \
	$(INST_LIBDIR)/Bio/ClusterIO/dbsnp.pm \
	Bio/LiveSeq/IO/Loader.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/IO/Loader.pm \
	Bio/Variation/IO/flat.pm \
	$(INST_LIBDIR)/Bio/Variation/IO/flat.pm \
	Bio/CodonUsage/Table.pm \
	$(INST_LIBDIR)/Bio/CodonUsage/Table.pm \
	Bio/Structure/Chain.pm \
	$(INST_LIBDIR)/Bio/Structure/Chain.pm \
	Bio/LiveSeq/AARange.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/AARange.pm \
	Bio/AlignIO/mega.pm \
	$(INST_LIBDIR)/Bio/AlignIO/mega.pm \
	Bio/DB/FileCache.pm \
	$(INST_LIBDIR)/Bio/DB/FileCache.pm \
	Bio/Assembly/Contig.pm \
	$(INST_LIBDIR)/Bio/Assembly/Contig.pm \
	Bio/Root/RootI.pm \
	$(INST_LIBDIR)/Bio/Root/RootI.pm \
	Bio/AnalysisParserI.pm \
	$(INST_LIBDIR)/Bio/AnalysisParserI.pm \
	Bio/DB/Taxonomy/flatfile.pm \
	$(INST_LIBDIR)/Bio/DB/Taxonomy/flatfile.pm \
	Bio/Search/HSP/HMMERHSP.pm \
	$(INST_LIBDIR)/Bio/Search/HSP/HMMERHSP.pm \
	Bio/Tools/BPpsilite.pm \
	$(INST_LIBDIR)/Bio/Tools/BPpsilite.pm \
	Bio/Ontology/Relationship.pm \
	$(INST_LIBDIR)/Bio/Ontology/Relationship.pm \
	Bio/DB/GenBank.pm \
	$(INST_LIBDIR)/Bio/DB/GenBank.pm \
	Bio/Tools/Hmmpfam.pm \
	$(INST_LIBDIR)/Bio/Tools/Hmmpfam.pm \
	Bio/Tools/Phylo/Molphy.pm \
	$(INST_LIBDIR)/Bio/Tools/Phylo/Molphy.pm \
	Bio/Variation/VariantI.pm \
	$(INST_LIBDIR)/Bio/Variation/VariantI.pm \
	Bio/LocatableSeq.pm \
	$(INST_LIBDIR)/Bio/LocatableSeq.pm \
	Bio/Tools/Sim4/Exon.pm \
	$(INST_LIBDIR)/Bio/Tools/Sim4/Exon.pm \
	Bio/OntologyIO/dagflat.pm \
	$(INST_LIBDIR)/Bio/OntologyIO/dagflat.pm \
	Bio/Graphics/Glyph/heterogeneous_segments.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/heterogeneous_segments.pm \
	Bio/AlignIO/nexus.pm \
	$(INST_LIBDIR)/Bio/AlignIO/nexus.pm \
	Bio/Biblio/Patent.pm \
	$(INST_LIBDIR)/Bio/Biblio/Patent.pm \
	Bio/Tools/Phylo/Phylip/ProtDist.pm \
	$(INST_LIBDIR)/Bio/Tools/Phylo/Phylip/ProtDist.pm \
	Bio/SeqIO/alf.pm \
	$(INST_LIBDIR)/Bio/SeqIO/alf.pm \
	Bio/Location/WidestCoordPolicy.pm \
	$(INST_LIBDIR)/Bio/Location/WidestCoordPolicy.pm \
	Bio/DB/Query/GenBank.pm \
	$(INST_LIBDIR)/Bio/DB/Query/GenBank.pm \
	Bio/Annotation/SimpleValue.pm \
	$(INST_LIBDIR)/Bio/Annotation/SimpleValue.pm \
	Bio/Biblio/MedlineJournal.pm \
	$(INST_LIBDIR)/Bio/Biblio/MedlineJournal.pm \
	Bio/Root/IOManager.pm \
	$(INST_LIBDIR)/Bio/Root/IOManager.pm \
	Bio/OntologyIO/soflat.pm \
	$(INST_LIBDIR)/Bio/OntologyIO/soflat.pm \
	Bio/Restriction/Enzyme/MultiCut.pm \
	$(INST_LIBDIR)/Bio/Restriction/Enzyme/MultiCut.pm \
	Bio/DB/Biblio/biofetch.pm \
	$(INST_LIBDIR)/Bio/DB/Biblio/biofetch.pm \
	Bio/Biblio/Proceeding.pm \
	$(INST_LIBDIR)/Bio/Biblio/Proceeding.pm \
	Bio/Graphics/Glyph/alignment.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/alignment.pm \
	Bio/Biblio/Journal.pm \
	$(INST_LIBDIR)/Bio/Biblio/Journal.pm \
	Bio/Graphics/Glyph/segmented_keyglyph.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/segmented_keyglyph.pm \
	Bio/Graphics/Glyph/transcript.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/transcript.pm \
	Bio/Cluster/SequenceFamily.pm \
	$(INST_LIBDIR)/Bio/Cluster/SequenceFamily.pm \
	Bio/Tree/TreeFunctionsI.pm \
	$(INST_LIBDIR)/Bio/Tree/TreeFunctionsI.pm \
	Bio/LocationI.pm \
	$(INST_LIBDIR)/Bio/LocationI.pm \
	Bio/Biblio/Provider.pm \
	$(INST_LIBDIR)/Bio/Biblio/Provider.pm \
	Bio/Biblio/Person.pm \
	$(INST_LIBDIR)/Bio/Biblio/Person.pm \
	Bio/AlignIO/phylip.pm \
	$(INST_LIBDIR)/Bio/AlignIO/phylip.pm \
	Bio/DB/WebDBSeqI.pm \
	$(INST_LIBDIR)/Bio/DB/WebDBSeqI.pm \
	Bio/DB/Flat/BinarySearch.pm \
	$(INST_LIBDIR)/Bio/DB/Flat/BinarySearch.pm \
	Bio/Graphics.pm \
	$(INST_LIBDIR)/Bio/Graphics.pm \
	Bio/Search/Hit/Fasta.pm \
	$(INST_LIBDIR)/Bio/Search/Hit/Fasta.pm \
	Bio/Graphics/Glyph/ruler_arrow.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/ruler_arrow.pm \
	Bio/DB/GFF/Adaptor/dbi/caching_handle.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/caching_handle.pm \
	Bio/Ontology/RelationshipI.pm \
	$(INST_LIBDIR)/Bio/Ontology/RelationshipI.pm \
	Bio/Search/HSP/HSPFactory.pm \
	$(INST_LIBDIR)/Bio/Search/HSP/HSPFactory.pm \
	Bio/Root/Exception.pm \
	$(INST_LIBDIR)/Bio/Root/Exception.pm \
	Bio/Ontology/OntologyStore.pm \
	$(INST_LIBDIR)/Bio/Ontology/OntologyStore.pm \
	Bio/Search/Processor.pm \
	$(INST_LIBDIR)/Bio/Search/Processor.pm \
	Bio/Variation/Allele.pm \
	$(INST_LIBDIR)/Bio/Variation/Allele.pm \
	Bio/SeqFeature/FeaturePair.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/FeaturePair.pm \
	Bio/Annotation/OntologyTerm.pm \
	$(INST_LIBDIR)/Bio/Annotation/OntologyTerm.pm \
	Bio/Biblio/PubmedJournalArticle.pm \
	$(INST_LIBDIR)/Bio/Biblio/PubmedJournalArticle.pm \
	Bio/TreeIO/TreeEventBuilder.pm \
	$(INST_LIBDIR)/Bio/TreeIO/TreeEventBuilder.pm \
	Bio/SeqIO/embl.pm \
	$(INST_LIBDIR)/Bio/SeqIO/embl.pm \
	Bio/WebAgent.pm \
	$(INST_LIBDIR)/Bio/WebAgent.pm \
	Bio/Biblio/IO/medline2ref.pm \
	$(INST_LIBDIR)/Bio/Biblio/IO/medline2ref.pm \
	Bio/SearchIO/FastHitEventBuilder.pm \
	$(INST_LIBDIR)/Bio/SearchIO/FastHitEventBuilder.pm \
	Bio/Tree/NodeI.pm \
	$(INST_LIBDIR)/Bio/Tree/NodeI.pm \
	Bio/Search/Hit/HitI.pm \
	$(INST_LIBDIR)/Bio/Search/Hit/HitI.pm \
	Bio/Phenotype/MeSH/Twig.pm \
	$(INST_LIBDIR)/Bio/Phenotype/MeSH/Twig.pm \
	Bio/Seq/PrimaryQual.pm \
	$(INST_LIBDIR)/Bio/Seq/PrimaryQual.pm \
	Bio/DB/NCBIHelper.pm \
	$(INST_LIBDIR)/Bio/DB/NCBIHelper.pm \
	Bio/Graphics/Glyph/xyplot.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/xyplot.pm \
	Bio/Annotation/TypeManager.pm \
	$(INST_LIBDIR)/Bio/Annotation/TypeManager.pm \
	Bio/Tools/Analysis/Protein/Sopma.pm \
	$(INST_LIBDIR)/Bio/Tools/Analysis/Protein/Sopma.pm \
	Bio/Structure/StructureI.pm \
	$(INST_LIBDIR)/Bio/Structure/StructureI.pm \
	Bio/SeqIO/largefasta.pm \
	$(INST_LIBDIR)/Bio/SeqIO/largefasta.pm \
	Bio/Matrix/PSM/InstanceSite.pm \
	$(INST_LIBDIR)/Bio/Matrix/PSM/InstanceSite.pm \
	Bio/Tools/dpAlign.pm \
	$(INST_LIBDIR)/Bio/Tools/dpAlign.pm \
	Bio/Align/AlignI.pm \
	$(INST_LIBDIR)/Bio/Align/AlignI.pm \
	Bio/SeqFeature/PositionProxy.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/PositionProxy.pm \
	Bio/Index/Abstract.pm \
	$(INST_LIBDIR)/Bio/Index/Abstract.pm \
	Bio/AlignIO.pm \
	$(INST_LIBDIR)/Bio/AlignIO.pm \
	Bio/Factory/SeqAnalysisParserFactory.pm \
	$(INST_LIBDIR)/Bio/Factory/SeqAnalysisParserFactory.pm \
	Bio/Biblio.pm \
	$(INST_LIBDIR)/Bio/Biblio.pm \
	Bio/Restriction/Analysis.pm \
	$(INST_LIBDIR)/Bio/Restriction/Analysis.pm \
	Bio/Matrix/PSM/IO.pm \
	$(INST_LIBDIR)/Bio/Matrix/PSM/IO.pm \
	Bio/Seq/SeqFastaSpeedFactory.pm \
	$(INST_LIBDIR)/Bio/Seq/SeqFastaSpeedFactory.pm \
	Bio/Factory/SequenceFactoryI.pm \
	$(INST_LIBDIR)/Bio/Factory/SequenceFactoryI.pm \
	Bio/Structure/Residue.pm \
	$(INST_LIBDIR)/Bio/Structure/Residue.pm \
	Bio/SeqIO/fasta.pm \
	$(INST_LIBDIR)/Bio/SeqIO/fasta.pm \
	Bio/PopGen/IO/prettybase.pm \
	$(INST_LIBDIR)/Bio/PopGen/IO/prettybase.pm \
	Bio/AlignIO/maf.pm \
	$(INST_LIBDIR)/Bio/AlignIO/maf.pm \
	Bio/Restriction/Enzyme/MultiSite.pm \
	$(INST_LIBDIR)/Bio/Restriction/Enzyme/MultiSite.pm \
	Bio/Search/Result/HMMERResult.pm \
	$(INST_LIBDIR)/Bio/Search/Result/HMMERResult.pm \
	Bio/Tools/FootPrinter.pm \
	$(INST_LIBDIR)/Bio/Tools/FootPrinter.pm \
	Bio/Tools/Analysis/Protein/GOR4.pm \
	$(INST_LIBDIR)/Bio/Tools/Analysis/Protein/GOR4.pm \
	Bio/Search/Result/BlastResult.pm \
	$(INST_LIBDIR)/Bio/Search/Result/BlastResult.pm \
	Bio/MapIO/mapmaker.pm \
	$(INST_LIBDIR)/Bio/MapIO/mapmaker.pm \
	Bio/Tools/Blast.pm \
	$(INST_LIBDIR)/Bio/Tools/Blast.pm \
	Bio/Graphics/Glyph/dot.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/dot.pm \
	Bio/Seq/Meta/Array.pm \
	$(INST_LIBDIR)/Bio/Seq/Meta/Array.pm \
	bioperl.pod \
	$(INST_LIBDIR)/bioperl.pod \
	Bio/Structure/IO/pdb.pm \
	$(INST_LIBDIR)/Bio/Structure/IO/pdb.pm \
	Bio/Map/LinkageMap.pm \
	$(INST_LIBDIR)/Bio/Map/LinkageMap.pm \
	Bio/DB/SwissProt.pm \
	$(INST_LIBDIR)/Bio/DB/SwissProt.pm \
	Bio/Matrix/PSM/IO/transfac.pm \
	$(INST_LIBDIR)/Bio/Matrix/PSM/IO/transfac.pm \
	Bio/Tools/Signalp.pm \
	$(INST_LIBDIR)/Bio/Tools/Signalp.pm \
	Bio/Phenotype/Measure.pm \
	$(INST_LIBDIR)/Bio/Phenotype/Measure.pm \
	Bio/Factory/ApplicationFactoryI.pm \
	$(INST_LIBDIR)/Bio/Factory/ApplicationFactoryI.pm \
	Bio/SeqFeature/Gene/TranscriptI.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Gene/TranscriptI.pm \
	Bio/LiveSeq/Range.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/Range.pm \
	Bio/Coordinate/Utils.pm \
	$(INST_LIBDIR)/Bio/Coordinate/Utils.pm \
	Bio/Search/Result/WABAResult.pm \
	$(INST_LIBDIR)/Bio/Search/Result/WABAResult.pm \
	Bio/DB/RandomAccessI.pm \
	$(INST_LIBDIR)/Bio/DB/RandomAccessI.pm \
	Bio/PopGen/Simulation/GeneticDrift.pm \
	$(INST_LIBDIR)/Bio/PopGen/Simulation/GeneticDrift.pm \
	Bio/DB/GFF/Feature.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Feature.pm \
	Bio/Tools/Blast/CHANGES \
	$(INST_LIBDIR)/Bio/Tools/Blast/CHANGES \
	Bio/DB/Biblio/soap.pm \
	$(INST_LIBDIR)/Bio/DB/Biblio/soap.pm \
	Bio/DB/Failover.pm \
	$(INST_LIBDIR)/Bio/DB/Failover.pm \
	Bio/Tools/HMMER/Results.pm \
	$(INST_LIBDIR)/Bio/Tools/HMMER/Results.pm \
	Bio/DB/GFF/Adaptor/dbi/mysqlace.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/mysqlace.pm \
	Bio/Variation/RNAChange.pm \
	$(INST_LIBDIR)/Bio/Variation/RNAChange.pm \
	Bio/Tools/Run/README \
	$(INST_LIBDIR)/Bio/Tools/Run/README \
	Bio/TreeIO/lintree.pm \
	$(INST_LIBDIR)/Bio/TreeIO/lintree.pm \
	Bio/Graphics/Pictogram.pm \
	$(INST_LIBDIR)/Bio/Graphics/Pictogram.pm \
	Bio/Factory/AnalysisI.pm \
	$(INST_LIBDIR)/Bio/Factory/AnalysisI.pm \
	Bio/Expression/FeatureGroup/FeatureGroupMas50.pm \
	$(INST_LIBDIR)/Bio/Expression/FeatureGroup/FeatureGroupMas50.pm \
	Bio/Graphics/Glyph/dna.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/dna.pm \
	Bio/Phenotype/OMIM/OMIMparser.pm \
	$(INST_LIBDIR)/Bio/Phenotype/OMIM/OMIMparser.pm \
	Bio/SeqFeature/SiRNA/Pair.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/SiRNA/Pair.pm \
	Bio/SeqFeature/Similarity.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Similarity.pm \
	Bio/Location/CoordinatePolicyI.pm \
	$(INST_LIBDIR)/Bio/Location/CoordinatePolicyI.pm \
	Bio/Expression/FeatureI.pm \
	$(INST_LIBDIR)/Bio/Expression/FeatureI.pm \
	Bio/SeqIO/fastq.pm \
	$(INST_LIBDIR)/Bio/SeqIO/fastq.pm \
	Bio/Tools/Genscan.pm \
	$(INST_LIBDIR)/Bio/Tools/Genscan.pm \
	Bio/Graphics/Glyph/generic.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/generic.pm \
	Bio/Map/SimpleMap.pm \
	$(INST_LIBDIR)/Bio/Map/SimpleMap.pm \
	Bio/Search/Hit/PsiBlastHit.pm \
	$(INST_LIBDIR)/Bio/Search/Hit/PsiBlastHit.pm \
	Bio/Search/HSP/WABAHSP.pm \
	$(INST_LIBDIR)/Bio/Search/HSP/WABAHSP.pm \
	Bio/SearchIO/Writer/ResultTableWriter.pm \
	$(INST_LIBDIR)/Bio/SearchIO/Writer/ResultTableWriter.pm \
	Bio/AlignIO/fasta.pm \
	$(INST_LIBDIR)/Bio/AlignIO/fasta.pm \
	Bio/Map/Marker.pm \
	$(INST_LIBDIR)/Bio/Map/Marker.pm \
	Bio/Factory/SequenceStreamI.pm \
	$(INST_LIBDIR)/Bio/Factory/SequenceStreamI.pm \
	Bio/AlignIO/clustalw.pm \
	$(INST_LIBDIR)/Bio/AlignIO/clustalw.pm \
	Bio/Coordinate/MapperI.pm \
	$(INST_LIBDIR)/Bio/Coordinate/MapperI.pm \
	Bio/DB/Fasta.pm \
	$(INST_LIBDIR)/Bio/DB/Fasta.pm \
	Bio/DB/GFF/Adaptor/dbi/iterator.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/iterator.pm \
	Bio/Phenotype/OMIM/OMIMentry.pm \
	$(INST_LIBDIR)/Bio/Phenotype/OMIM/OMIMentry.pm \
	Bio/Seq/QualI.pm \
	$(INST_LIBDIR)/Bio/Seq/QualI.pm \
	Bio/Tools/Analysis/Protein/Scansite.pm \
	$(INST_LIBDIR)/Bio/Tools/Analysis/Protein/Scansite.pm \
	Bio/Tools/HMMER/Domain.pm \
	$(INST_LIBDIR)/Bio/Tools/HMMER/Domain.pm \
	Bio/Graphics/FeatureFile/Iterator.pm \
	$(INST_LIBDIR)/Bio/Graphics/FeatureFile/Iterator.pm \
	Bio/Das/FeatureTypeI.pm \
	$(INST_LIBDIR)/Bio/Das/FeatureTypeI.pm \
	Bio/DB/GDB.pm \
	$(INST_LIBDIR)/Bio/DB/GDB.pm \
	Bio/Cluster/UniGeneI.pm \
	$(INST_LIBDIR)/Bio/Cluster/UniGeneI.pm \
	Bio/Biblio/JournalArticle.pm \
	$(INST_LIBDIR)/Bio/Biblio/JournalArticle.pm \
	Bio/Tools/Blast/Sbjct.pm \
	$(INST_LIBDIR)/Bio/Tools/Blast/Sbjct.pm \
	Bio/Index/EMBL.pm \
	$(INST_LIBDIR)/Bio/Index/EMBL.pm \
	Bio/Das/SegmentI.pm \
	$(INST_LIBDIR)/Bio/Das/SegmentI.pm \
	Bio/Search/Hit/GenericHit.pm \
	$(INST_LIBDIR)/Bio/Search/Hit/GenericHit.pm \
	Bio/DB/GFF/Aggregator/ucsc_acembly.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_acembly.pm \
	Bio/Ontology/OntologyI.pm \
	$(INST_LIBDIR)/Bio/Ontology/OntologyI.pm \
	Bio/DB/GFF/Segment.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Segment.pm \
	Bio/SeqIO/gcg.pm \
	$(INST_LIBDIR)/Bio/SeqIO/gcg.pm \
	Bio/Tools/Lucy.pm \
	$(INST_LIBDIR)/Bio/Tools/Lucy.pm \
	Bio/Biblio/WebResource.pm \
	$(INST_LIBDIR)/Bio/Biblio/WebResource.pm \
	Bio/TreeIO/svggraph.pm \
	$(INST_LIBDIR)/Bio/TreeIO/svggraph.pm \
	Bio/DB/GFF/Aggregator/ucsc_softberry.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_softberry.pm \
	Bio/SeqFeature/Gene/Poly_A_site.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Gene/Poly_A_site.pm \
	Bio/RangeI.pm \
	$(INST_LIBDIR)/Bio/RangeI.pm \
	Bio/Symbol/DNAAlphabet.pm \
	$(INST_LIBDIR)/Bio/Symbol/DNAAlphabet.pm \
	Bio/Tools/Analysis/Protein/ELM.pm \
	$(INST_LIBDIR)/Bio/Tools/Analysis/Protein/ELM.pm \
	Bio/PrimarySeq.pm \
	$(INST_LIBDIR)/Bio/PrimarySeq.pm \
	Bio/Biblio/TechReport.pm \
	$(INST_LIBDIR)/Bio/Biblio/TechReport.pm \
	Bio/Seq/EncodedSeq.pm \
	$(INST_LIBDIR)/Bio/Seq/EncodedSeq.pm \
	Bio/PopGen/IndividualI.pm \
	$(INST_LIBDIR)/Bio/PopGen/IndividualI.pm \
	Bio/Factory/MapFactoryI.pm \
	$(INST_LIBDIR)/Bio/Factory/MapFactoryI.pm \
	Bio/Ontology/RelationshipFactory.pm \
	$(INST_LIBDIR)/Bio/Ontology/RelationshipFactory.pm \
	Bio/Tree/NodeNHX.pm \
	$(INST_LIBDIR)/Bio/Tree/NodeNHX.pm \
	Bio/SearchIO.pm \
	$(INST_LIBDIR)/Bio/SearchIO.pm \
	Bio/LiveSeq/SeqI.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/SeqI.pm \
	Bio/Tools/Genemark.pm \
	$(INST_LIBDIR)/Bio/Tools/Genemark.pm \
	Bio/Graphics/Glyph/extending_arrow.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/extending_arrow.pm \
	Bio/DB/GFF/Aggregator/coding.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/coding.pm \
	Bio/SearchIO/exonerate.pm \
	$(INST_LIBDIR)/Bio/SearchIO/exonerate.pm \
	Bio/Graphics/Glyph/segments.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/segments.pm \
	Bio/DB/GFF/Aggregator/none.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/none.pm \
	Bio/SeqFeature/SimilarityPair.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/SimilarityPair.pm \
	Bio/Ontology/RelationshipType.pm \
	$(INST_LIBDIR)/Bio/Ontology/RelationshipType.pm \
	Bio/Structure/SecStr/DSSP/Res.pm \
	$(INST_LIBDIR)/Bio/Structure/SecStr/DSSP/Res.pm \
	Bio/PopGen/Population.pm \
	$(INST_LIBDIR)/Bio/PopGen/Population.pm \
	Bio/PopGen/Simulation/Coalescent.pm \
	$(INST_LIBDIR)/Bio/PopGen/Simulation/Coalescent.pm \
	Bio/PopGen/Statistics.pm \
	$(INST_LIBDIR)/Bio/PopGen/Statistics.pm \
	Bio/DB/GFF/Adaptor/biofetch_oracle.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Adaptor/biofetch_oracle.pm \
	Bio/Structure/Model.pm \
	$(INST_LIBDIR)/Bio/Structure/Model.pm \
	Bio/Search/HSP/HSPI.pm \
	$(INST_LIBDIR)/Bio/Search/HSP/HSPI.pm \
	Bio/DB/GFF/Aggregator/ucsc_sanger22pseudo.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_sanger22pseudo.pm \
	Bio/Index/AbstractSeq.pm \
	$(INST_LIBDIR)/Bio/Index/AbstractSeq.pm \
	Bio/Seq/RichSeq.pm \
	$(INST_LIBDIR)/Bio/Seq/RichSeq.pm \
	Bio/Expression/FeatureSet/FeatureSetMas50.pm \
	$(INST_LIBDIR)/Bio/Expression/FeatureSet/FeatureSetMas50.pm \
	Bio/Tree/AlleleNode.pm \
	$(INST_LIBDIR)/Bio/Tree/AlleleNode.pm \
	Bio/DB/RefSeq.pm \
	$(INST_LIBDIR)/Bio/DB/RefSeq.pm \
	bioscripts.pod \
	$(INST_LIBDIR)/bioscripts.pod \
	Bio/DB/InMemoryCache.pm \
	$(INST_LIBDIR)/Bio/DB/InMemoryCache.pm \
	Bio/ClusterIO.pm \
	$(INST_LIBDIR)/Bio/ClusterIO.pm \
	Bio/TreeIO/nexus.pm \
	$(INST_LIBDIR)/Bio/TreeIO/nexus.pm \
	Bio/Graphics/Panel.pm \
	$(INST_LIBDIR)/Bio/Graphics/Panel.pm \
	Bio/Cluster/FamilyI.pm \
	$(INST_LIBDIR)/Bio/Cluster/FamilyI.pm \
	Bio/LiveSeq/Prim_Transcript.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/Prim_Transcript.pm \
	Bio/SeqIO/game/gameSubs.pm \
	$(INST_LIBDIR)/Bio/SeqIO/game/gameSubs.pm \
	Bio/Tools/Run/StandAloneBlast.pm \
	$(INST_LIBDIR)/Bio/Tools/Run/StandAloneBlast.pm \
	Bio/Symbol/Symbol.pm \
	$(INST_LIBDIR)/Bio/Symbol/Symbol.pm \
	Bio/Biblio/IO.pm \
	$(INST_LIBDIR)/Bio/Biblio/IO.pm \
	Bio/Factory/DriverFactory.pm \
	$(INST_LIBDIR)/Bio/Factory/DriverFactory.pm \
	Bio/PopGen/IO/csv.pm \
	$(INST_LIBDIR)/Bio/PopGen/IO/csv.pm \
	Bio/Tree/Tree.pm \
	$(INST_LIBDIR)/Bio/Tree/Tree.pm \
	Bio/OntologyIO/Handlers/InterPro_BioSQL_Handler.pm \
	$(INST_LIBDIR)/Bio/OntologyIO/Handlers/InterPro_BioSQL_Handler.pm \
	Bio/Tools/Analysis/Protein/Domcut.pm \
	$(INST_LIBDIR)/Bio/Tools/Analysis/Protein/Domcut.pm \
	Bio/Graphics/Glyph/primers.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/primers.pm \
	Bio/Tools/Analysis/Protein/Mitoprot.pm \
	$(INST_LIBDIR)/Bio/Tools/Analysis/Protein/Mitoprot.pm \
	Bio/SeqIO/game/gameWriter.pm \
	$(INST_LIBDIR)/Bio/SeqIO/game/gameWriter.pm \
	Bio/Tools/Prediction/Exon.pm \
	$(INST_LIBDIR)/Bio/Tools/Prediction/Exon.pm \
	Bio/SearchIO/wise.pm \
	$(INST_LIBDIR)/Bio/SearchIO/wise.pm \
	Bio/Phenotype/OMIM/MiniMIMentry.pm \
	$(INST_LIBDIR)/Bio/Phenotype/OMIM/MiniMIMentry.pm \
	Bio/SimpleAnalysisI.pm \
	$(INST_LIBDIR)/Bio/SimpleAnalysisI.pm \
	Bio/Graphics/Glyph/crossbox.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/crossbox.pm \
	Bio/Tools/Blast/HSP.pm \
	$(INST_LIBDIR)/Bio/Tools/Blast/HSP.pm \
	Bio/Seq/Meta.pm \
	$(INST_LIBDIR)/Bio/Seq/Meta.pm \
	Bio/SeqFeature/AnnotationAdaptor.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/AnnotationAdaptor.pm \
	Bio/Tools/SeqStats.pm \
	$(INST_LIBDIR)/Bio/Tools/SeqStats.pm \
	Bio/LiveSeq/ChainI.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/ChainI.pm \
	Bio/Variation/SeqDiff.pm \
	$(INST_LIBDIR)/Bio/Variation/SeqDiff.pm \
	Bio/SeqFeature/SiRNA/Oligo.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/SiRNA/Oligo.pm \
	Bio/Search/Hit/BlastHit.pm \
	$(INST_LIBDIR)/Bio/Search/Hit/BlastHit.pm \
	Bio/Assembly/Scaffold.pm \
	$(INST_LIBDIR)/Bio/Assembly/Scaffold.pm \
	Bio/Tools/Primer3.pm \
	$(INST_LIBDIR)/Bio/Tools/Primer3.pm \
	Bio/Matrix/Generic.pm \
	$(INST_LIBDIR)/Bio/Matrix/Generic.pm \
	Bio/SearchIO/blastxml.pm \
	$(INST_LIBDIR)/Bio/SearchIO/blastxml.pm \
	Bio/UpdateableSeqI.pm \
	$(INST_LIBDIR)/Bio/UpdateableSeqI.pm \
	Bio/Restriction/IO.pm \
	$(INST_LIBDIR)/Bio/Restriction/IO.pm \
	Bio/SeqFeature/Gene/Exon.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Gene/Exon.pm \
	Bio/Map/MarkerI.pm \
	$(INST_LIBDIR)/Bio/Map/MarkerI.pm \
	Bio/Matrix/IO/phylip.pm \
	$(INST_LIBDIR)/Bio/Matrix/IO/phylip.pm \
	Bio/DB/BiblioI.pm \
	$(INST_LIBDIR)/Bio/DB/BiblioI.pm \
	Bio/SearchIO/waba.pm \
	$(INST_LIBDIR)/Bio/SearchIO/waba.pm \
	Bio/Species.pm \
	$(INST_LIBDIR)/Bio/Species.pm \
	Bio/Tools/pSW.pm \
	$(INST_LIBDIR)/Bio/Tools/pSW.pm \
	Bio/Coordinate/Pair.pm \
	$(INST_LIBDIR)/Bio/Coordinate/Pair.pm \
	Bio/IdentifiableI.pm \
	$(INST_LIBDIR)/Bio/IdentifiableI.pm \
	Bio/LiveSeq/IO/README \
	$(INST_LIBDIR)/Bio/LiveSeq/IO/README \
	Bio/Index/GenBank.pm \
	$(INST_LIBDIR)/Bio/Index/GenBank.pm \
	Bio/AlignIO/stockholm.pm \
	$(INST_LIBDIR)/Bio/AlignIO/stockholm.pm \
	Bio/LiveSeq/IO/BioPerl.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/IO/BioPerl.pm \
	Bio/Ontology/GOterm.pm \
	$(INST_LIBDIR)/Bio/Ontology/GOterm.pm \
	Bio/Tools/Seg.pm \
	$(INST_LIBDIR)/Bio/Tools/Seg.pm \
	Bio/Tools/AlignFactory.pm \
	$(INST_LIBDIR)/Bio/Tools/AlignFactory.pm \
	Bio/DB/Taxonomy/entrez.pm \
	$(INST_LIBDIR)/Bio/DB/Taxonomy/entrez.pm \
	Bio/SearchIO/sim4.pm \
	$(INST_LIBDIR)/Bio/SearchIO/sim4.pm \
	Bio/OntologyIO/simplehierarchy.pm \
	$(INST_LIBDIR)/Bio/OntologyIO/simplehierarchy.pm \
	Bio/SeqFeature/Gene/Intron.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Gene/Intron.pm \
	Bio/Search/HSP/BlastHSP.pm \
	$(INST_LIBDIR)/Bio/Search/HSP/BlastHSP.pm \
	Bio/DB/Flat/BDB/swiss.pm \
	$(INST_LIBDIR)/Bio/DB/Flat/BDB/swiss.pm \
	Bio/Tools/Promoterwise.pm \
	$(INST_LIBDIR)/Bio/Tools/Promoterwise.pm \
	Bio/Map/MapI.pm \
	$(INST_LIBDIR)/Bio/Map/MapI.pm \
	Bio/Tools/Phylo/PAML/ModelResult.pm \
	$(INST_LIBDIR)/Bio/Tools/Phylo/PAML/ModelResult.pm \
	Bio/Location/Atomic.pm \
	$(INST_LIBDIR)/Bio/Location/Atomic.pm \
	Bio/Root/Utilities.pm \
	$(INST_LIBDIR)/Bio/Root/Utilities.pm \
	Bio/Graphics/Glyph/pinsertion.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/pinsertion.pm \
	Bio/SeqIO/genbank.pm \
	$(INST_LIBDIR)/Bio/SeqIO/genbank.pm \
	Bio/PopGen/IO.pm \
	$(INST_LIBDIR)/Bio/PopGen/IO.pm \
	Bio/Matrix/IO/scoring.pm \
	$(INST_LIBDIR)/Bio/Matrix/IO/scoring.pm \
	Bio/Graphics/Glyph/arrow.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/arrow.pm \
	Bio/Biblio/Service.pm \
	$(INST_LIBDIR)/Bio/Biblio/Service.pm \
	Bio/Map/Microsatellite.pm \
	$(INST_LIBDIR)/Bio/Map/Microsatellite.pm \
	Bio/Tools/Primer/Pair.pm \
	$(INST_LIBDIR)/Bio/Tools/Primer/Pair.pm \
	Bio/DB/GFF/Featname.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Featname.pm \
	Bio/SearchIO/Writer/HitTableWriter.pm \
	$(INST_LIBDIR)/Bio/SearchIO/Writer/HitTableWriter.pm \
	Bio/Tools/EPCR.pm \
	$(INST_LIBDIR)/Bio/Tools/EPCR.pm \
	Bio/Graphics/Glyph/line.pm \
	$(INST_LIBDIR)/Bio/Graphics/Glyph/line.pm \
	Bio/SeqFeature/Gene/ExonI.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Gene/ExonI.pm \
	Bio/DB/Flat.pm \
	$(INST_LIBDIR)/Bio/DB/Flat.pm \
	Bio/Assembly/IO/phrap.pm \
	$(INST_LIBDIR)/Bio/Assembly/IO/phrap.pm \
	Bio/Taxonomy/Tree.pm \
	$(INST_LIBDIR)/Bio/Taxonomy/Tree.pm \
	Bio/Location/Fuzzy.pm \
	$(INST_LIBDIR)/Bio/Location/Fuzzy.pm \
	Bio/Index/Blast.pm \
	$(INST_LIBDIR)/Bio/Index/Blast.pm \
	Bio/SeqFeature/Collection.pm \
	$(INST_LIBDIR)/Bio/SeqFeature/Collection.pm \
	Bio/Tools/Analysis/DNA/ESEfinder.pm \
	$(INST_LIBDIR)/Bio/Tools/Analysis/DNA/ESEfinder.pm \
	Bio/TreeIO/newick.pm \
	$(INST_LIBDIR)/Bio/TreeIO/newick.pm \
	Bio/SeqIO/MultiFile.pm \
	$(INST_LIBDIR)/Bio/SeqIO/MultiFile.pm \
	Bio/Tools/RestrictionEnzyme.pm \
	$(INST_LIBDIR)/Bio/Tools/RestrictionEnzyme.pm \
	Bio/Tools/Primer/Assessor/Base.pm \
	$(INST_LIBDIR)/Bio/Tools/Primer/Assessor/Base.pm \
	Bio/Search/HSP/FastaHSP.pm \
	$(INST_LIBDIR)/Bio/Search/HSP/FastaHSP.pm \
	Bio/Map/PositionI.pm \
	$(INST_LIBDIR)/Bio/Map/PositionI.pm \
	Bio/Coordinate/Result.pm \
	$(INST_LIBDIR)/Bio/Coordinate/Result.pm \
	Bio/DB/GFF/Aggregator/ucsc_twinscan.pm \
	$(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_twinscan.pm \
	Bio/DB/Query/WebQuery.pm \
	$(INST_LIBDIR)/Bio/DB/Query/WebQuery.pm \
	Bio/Seq/SeqFactory.pm \
	$(INST_LIBDIR)/Bio/Seq/SeqFactory.pm \
	Bio/Root/Storable.pm \
	$(INST_LIBDIR)/Bio/Root/Storable.pm \
	Bio/SearchIO/fasta.pm \
	$(INST_LIBDIR)/Bio/SearchIO/fasta.pm \
	Bio/Root/Root.pm \
	$(INST_LIBDIR)/Bio/Root/Root.pm \
	Bio/Phenotype/Correlate.pm \
	$(INST_LIBDIR)/Bio/Phenotype/Correlate.pm \
	Bio/Tools/pICalculator.pm \
	$(INST_LIBDIR)/Bio/Tools/pICalculator.pm \
	Bio/Matrix/PSM/PsmI.pm \
	$(INST_LIBDIR)/Bio/Matrix/PSM/PsmI.pm \
	Bio/LiveSeq/Exon.pm \
	$(INST_LIBDIR)/Bio/LiveSeq/Exon.pm \
	Bio/PopGen/MarkerI.pm \
	$(INST_LIBDIR)/Bio/PopGen/MarkerI.pm


# --- MakeMaker tool_autosplit section:

# Usage: $(AUTOSPLITFILE) FileToSplit AutoDirToSplitInto
AUTOSPLITFILE = $(PERL) "-I$(PERL_ARCHLIB)" "-I$(PERL_LIB)" -e 'use AutoSplit;autosplit($$ARGV[0], $$ARGV[1], 0, 1, 1) ;'


# --- MakeMaker tool_xsubpp section:


# --- MakeMaker tools_other section:

SHELL = /bin/sh
CHMOD = chmod
CP = cp
LD = ld
MV = mv
NOOP = $(SHELL) -c true
RM_F = rm -f
RM_RF = rm -rf
TEST_F = test -f
TOUCH = touch
UMASK_NULL = umask 0
DEV_NULL = > /dev/null 2>&1

# The following is a portable way to say mkdir -p
# To see which directories are created, change the if 0 to if 1
MKPATH = $(PERL) -I$(PERL_ARCHLIB) -I$(PERL_LIB) -MExtUtils::Command -e mkpath

# This helps us to minimize the effect of the .exists files A yet
# better solution would be to have a stable file in the perl
# distribution with a timestamp of zero. But this solution doesn't
# need any changes to the core distribution and works with older perls
EQUALIZE_TIMESTAMP = $(PERL) -I$(PERL_ARCHLIB) -I$(PERL_LIB) -MExtUtils::Command -e eqtime

# Here we warn users that an old packlist file was found somewhere,
# and that they should call some uninstall routine
WARN_IF_OLD_PACKLIST = $(PERL) -we 'exit unless -f $$ARGV[0];' \
-e 'print "WARNING: I have found an old package in\n";' \
-e 'print "\t$$ARGV[0].\n";' \
-e 'print "Please make sure the two installations are not conflicting\n";'

UNINST=0
VERBINST=1

MOD_INSTALL = $(PERL) -I$(INST_LIB) -I$(PERL_LIB) -MExtUtils::Install \
-e "install({@ARGV},'$(VERBINST)',0,'$(UNINST)');"

DOC_INSTALL = $(PERL) -e '$$\="\n\n";' \
-e 'print "=head2 ", scalar(localtime), ": C<", shift, ">", " L<", shift, ">";' \
-e 'print "=over 4";' \
-e 'while (defined($$key = shift) and defined($$val = shift)){print "=item *";print "C<$$key: $$val>";}' \
-e 'print "=back";'

UNINSTALL =   $(PERL) -MExtUtils::Install \
-e 'uninstall($$ARGV[0],1,1); print "\nUninstall is deprecated. Please check the";' \
-e 'print " packlist above carefully.\n  There may be errors. Remove the";' \
-e 'print " appropriate files manually.\n  Sorry for the inconveniences.\n"'


# --- MakeMaker dist section:

DISTVNAME = $(DISTNAME)-$(VERSION)
TAR  = tar
TARFLAGS = cvf
ZIP  = zip
ZIPFLAGS = -r
COMPRESS = gzip -9f
SUFFIX = .gz
SHAR = shar
PREOP = @$(NOOP)
POSTOP = @$(NOOP)
TO_UNIX = @$(NOOP)
CI = ci -u
RCS_LABEL = rcs -Nv$(VERSION_SYM): -q
DIST_CP = best
DIST_DEFAULT = all tardist


# --- MakeMaker macro section:


# --- MakeMaker depend section:


# --- MakeMaker cflags section:


# --- MakeMaker const_loadlibs section:


# --- MakeMaker const_cccmd section:


# --- MakeMaker post_constants section:


# --- MakeMaker pasthru section:

PASTHRU = LIB="$(LIB)"\
	LIBPERL_A="$(LIBPERL_A)"\
	LINKTYPE="$(LINKTYPE)"\
	PREFIX="$(PREFIX)"\
	OPTIMIZE="$(OPTIMIZE)"


# --- MakeMaker c_o section:


# --- MakeMaker xs_c section:


# --- MakeMaker xs_o section:


# --- MakeMaker top_targets section:

#all ::	config $(INST_PM) subdirs linkext manifypods

all :: pure_all htmlifypods manifypods
	@$(NOOP)

pure_all :: config pm_to_blib subdirs linkext
	@$(NOOP)

subdirs :: $(MYEXTLIB)
	@$(NOOP)

config :: Makefile $(INST_LIBDIR)/.exists
	@$(NOOP)

config :: $(INST_ARCHAUTODIR)/.exists
	@$(NOOP)

config :: $(INST_AUTODIR)/.exists
	@$(NOOP)

$(INST_AUTODIR)/.exists :: /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE/perl.h
	@$(MKPATH) $(INST_AUTODIR)
	@$(EQUALIZE_TIMESTAMP) /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE/perl.h $(INST_AUTODIR)/.exists

	-@$(CHMOD) $(PERM_RWX) $(INST_AUTODIR)

$(INST_LIBDIR)/.exists :: /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE/perl.h
	@$(MKPATH) $(INST_LIBDIR)
	@$(EQUALIZE_TIMESTAMP) /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE/perl.h $(INST_LIBDIR)/.exists

	-@$(CHMOD) $(PERM_RWX) $(INST_LIBDIR)

$(INST_ARCHAUTODIR)/.exists :: /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE/perl.h
	@$(MKPATH) $(INST_ARCHAUTODIR)
	@$(EQUALIZE_TIMESTAMP) /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE/perl.h $(INST_ARCHAUTODIR)/.exists

	-@$(CHMOD) $(PERM_RWX) $(INST_ARCHAUTODIR)

config :: $(INST_MAN1DIR)/.exists
	@$(NOOP)


$(INST_MAN1DIR)/.exists :: /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE/perl.h
	@$(MKPATH) $(INST_MAN1DIR)
	@$(EQUALIZE_TIMESTAMP) /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE/perl.h $(INST_MAN1DIR)/.exists

	-@$(CHMOD) $(PERM_RWX) $(INST_MAN1DIR)

config :: $(INST_MAN3DIR)/.exists
	@$(NOOP)


$(INST_MAN3DIR)/.exists :: /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE/perl.h
	@$(MKPATH) $(INST_MAN3DIR)
	@$(EQUALIZE_TIMESTAMP) /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE/perl.h $(INST_MAN3DIR)/.exists

	-@$(CHMOD) $(PERM_RWX) $(INST_MAN3DIR)

help:
	perldoc ExtUtils::MakeMaker

Version_check:
	@$(PERL) -I$(PERL_ARCHLIB) -I$(PERL_LIB) \
		-MExtUtils::MakeMaker=Version_check \
		-e "Version_check('$(MM_VERSION)')"


# --- MakeMaker linkext section:

linkext :: $(LINKTYPE)
	@$(NOOP)


# --- MakeMaker dlsyms section:


# --- MakeMaker dynamic section:

## $(INST_PM) has been moved to the all: target.
## It remains here for awhile to allow for old usage: "make dynamic"
#dynamic :: Makefile $(INST_DYNAMIC) $(INST_BOOT) $(INST_PM)
dynamic :: Makefile $(INST_DYNAMIC) $(INST_BOOT)
	@$(NOOP)


# --- MakeMaker dynamic_bs section:

BOOTSTRAP =


# --- MakeMaker dynamic_lib section:


# --- MakeMaker static section:

## $(INST_PM) has been moved to the all: target.
## It remains here for awhile to allow for old usage: "make static"
#static :: Makefile $(INST_STATIC) $(INST_PM)
static :: Makefile $(INST_STATIC)
	@$(NOOP)


# --- MakeMaker static_lib section:


# --- MakeMaker htmlifypods section:

htmlifypods : pure_all
	@$(NOOP)


# --- MakeMaker manifypods section:

manifypods : pure_all
	@$(NOOP)


# --- MakeMaker processPL section:

all :: bioscripts.pod
	@$(NOOP)

bioscripts.pod :: doc/makedoc.PL
	$(PERL) -I$(INST_ARCHLIB) -I$(INST_LIB) -I$(PERL_ARCHLIB) -I$(PERL_LIB) doc/makedoc.PL bioscripts.pod

all :: biodatabases.pod
	@$(NOOP)

biodatabases.pod :: doc/makedoc.PL
	$(PERL) -I$(INST_ARCHLIB) -I$(INST_LIB) -I$(PERL_ARCHLIB) -I$(PERL_LIB) doc/makedoc.PL biodatabases.pod

all :: biodesign.pod
	@$(NOOP)

biodesign.pod :: doc/makedoc.PL
	$(PERL) -I$(INST_ARCHLIB) -I$(INST_LIB) -I$(PERL_ARCHLIB) -I$(PERL_LIB) doc/makedoc.PL biodesign.pod

all :: bioperl.pod
	@$(NOOP)

bioperl.pod :: doc/makedoc.PL
	$(PERL) -I$(INST_ARCHLIB) -I$(INST_LIB) -I$(PERL_ARCHLIB) -I$(PERL_LIB) doc/makedoc.PL bioperl.pod


# --- MakeMaker installbin section:

$(INST_SCRIPT)/.exists :: /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE/perl.h
	@$(MKPATH) $(INST_SCRIPT)
	@$(EQUALIZE_TIMESTAMP) /usr/lib/perl-5.6.0/lib/5.6.0/alpha-dec_osf/CORE/perl.h $(INST_SCRIPT)/.exists

	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)

EXE_FILES = ./scripts_temp/bp_load_gff.pl ./scripts_temp/bp_genbank2gff.pl ./scripts_temp/bp_bulk_load_gff.pl ./scripts_temp/bp_fast_load_gff.pl ./scripts_temp/bp_generate_histogram.pl ./scripts_temp/bp_pg_bulk_load_gff.pl ./scripts_temp/bp_process_gadfly.pl ./scripts_temp/bp_process_ncbi_human.pl ./scripts_temp/bp_process_sgd.pl ./scripts_temp/bp_process_wormbase.pl ./scripts_temp/bp_biofetch_genbank_proxy.pl ./scripts_temp/bp_bioflat_index.pl ./scripts_temp/bp_biogetseq.pl ./scripts_temp/bp_flanks.pl ./scripts_temp/bp_biblio.pl ./scripts_temp/bp_frend.pl ./scripts_temp/bp_feature_draw.pl ./scripts_temp/bp_search_overview.pl ./scripts_temp/bp_fetch.pl ./scripts_temp/bp_index.pl ./scripts_temp/bp_composite_LD.pl ./scripts_temp/bp_heterogeneity_test.pl ./scripts_temp/bp_filter_search.pl ./scripts_temp/bp_extract_feature_seq.pl ./scripts_temp/bp_seqconvert.pl ./scripts_temp/bp_split_seq.pl ./scripts_temp/bp_translate_seq.pl ./scripts_temp/bp_aacomp.pl ./scripts_temp/bp_chaos_plot.pl ./scripts_temp/bp_gccalc.pl ./scripts_temp/bp_oligo_count.pl ./scripts_temp/bp_local_taxonomydb_query.pl ./scripts_temp/bp_taxid4species.pl ./scripts_temp/bp_blast2tree.pl ./scripts_temp/bp_nrdb.pl ./scripts_temp/bp_mrtrans.pl ./scripts_temp/bp_sreformat.pl ./scripts_temp/bp_dbsplit.pl ./scripts_temp/bp_mask_by_search.pl ./scripts_temp/bp_mutate.pl ./scripts_temp/bp_pairwise_kaks.pl ./scripts_temp/bp_remote_blast.pl ./scripts_temp/bp_search2BSML.pl ./scripts_temp/bp_search2alnblocks.pl ./scripts_temp/bp_search2gff.pl ./scripts_temp/bp_search2tribe.pl ./scripts_temp/bp_seq_length.pl

FIXIN = $(PERL) -I$(PERL_ARCHLIB) -I$(PERL_LIB) -MExtUtils::MakeMaker \
    -e "MY->fixin(shift)"

pure_all :: $(INST_SCRIPT)/bp_chaos_plot.pl $(INST_SCRIPT)/bp_composite_LD.pl $(INST_SCRIPT)/bp_pairwise_kaks.pl $(INST_SCRIPT)/bp_flanks.pl $(INST_SCRIPT)/bp_translate_seq.pl $(INST_SCRIPT)/bp_feature_draw.pl $(INST_SCRIPT)/bp_load_gff.pl $(INST_SCRIPT)/bp_biblio.pl $(INST_SCRIPT)/bp_fast_load_gff.pl $(INST_SCRIPT)/bp_search_overview.pl $(INST_SCRIPT)/bp_search2tribe.pl $(INST_SCRIPT)/bp_fetch.pl $(INST_SCRIPT)/bp_seq_length.pl $(INST_SCRIPT)/bp_search2gff.pl $(INST_SCRIPT)/bp_extract_feature_seq.pl $(INST_SCRIPT)/bp_pg_bulk_load_gff.pl $(INST_SCRIPT)/bp_sreformat.pl $(INST_SCRIPT)/bp_dbsplit.pl $(INST_SCRIPT)/bp_biofetch_genbank_proxy.pl $(INST_SCRIPT)/bp_bioflat_index.pl $(INST_SCRIPT)/bp_search2alnblocks.pl $(INST_SCRIPT)/bp_local_taxonomydb_query.pl $(INST_SCRIPT)/bp_biogetseq.pl $(INST_SCRIPT)/bp_process_wormbase.pl $(INST_SCRIPT)/bp_search2BSML.pl $(INST_SCRIPT)/bp_taxid4species.pl $(INST_SCRIPT)/bp_mask_by_search.pl $(INST_SCRIPT)/bp_mrtrans.pl $(INST_SCRIPT)/bp_process_gadfly.pl $(INST_SCRIPT)/bp_gccalc.pl $(INST_SCRIPT)/bp_seqconvert.pl $(INST_SCRIPT)/bp_oligo_count.pl $(INST_SCRIPT)/bp_process_ncbi_human.pl $(INST_SCRIPT)/bp_index.pl $(INST_SCRIPT)/bp_split_seq.pl $(INST_SCRIPT)/bp_remote_blast.pl $(INST_SCRIPT)/bp_aacomp.pl $(INST_SCRIPT)/bp_heterogeneity_test.pl $(INST_SCRIPT)/bp_frend.pl $(INST_SCRIPT)/bp_generate_histogram.pl $(INST_SCRIPT)/bp_process_sgd.pl $(INST_SCRIPT)/bp_mutate.pl $(INST_SCRIPT)/bp_filter_search.pl $(INST_SCRIPT)/bp_genbank2gff.pl $(INST_SCRIPT)/bp_bulk_load_gff.pl $(INST_SCRIPT)/bp_nrdb.pl $(INST_SCRIPT)/bp_blast2tree.pl
	@$(NOOP)

realclean ::
	rm -f $(INST_SCRIPT)/bp_chaos_plot.pl $(INST_SCRIPT)/bp_composite_LD.pl $(INST_SCRIPT)/bp_pairwise_kaks.pl $(INST_SCRIPT)/bp_flanks.pl $(INST_SCRIPT)/bp_translate_seq.pl $(INST_SCRIPT)/bp_feature_draw.pl $(INST_SCRIPT)/bp_load_gff.pl $(INST_SCRIPT)/bp_biblio.pl $(INST_SCRIPT)/bp_fast_load_gff.pl $(INST_SCRIPT)/bp_search_overview.pl $(INST_SCRIPT)/bp_search2tribe.pl $(INST_SCRIPT)/bp_fetch.pl $(INST_SCRIPT)/bp_seq_length.pl $(INST_SCRIPT)/bp_search2gff.pl $(INST_SCRIPT)/bp_extract_feature_seq.pl $(INST_SCRIPT)/bp_pg_bulk_load_gff.pl $(INST_SCRIPT)/bp_sreformat.pl $(INST_SCRIPT)/bp_dbsplit.pl $(INST_SCRIPT)/bp_biofetch_genbank_proxy.pl $(INST_SCRIPT)/bp_bioflat_index.pl $(INST_SCRIPT)/bp_search2alnblocks.pl $(INST_SCRIPT)/bp_local_taxonomydb_query.pl $(INST_SCRIPT)/bp_biogetseq.pl $(INST_SCRIPT)/bp_process_wormbase.pl $(INST_SCRIPT)/bp_search2BSML.pl $(INST_SCRIPT)/bp_taxid4species.pl $(INST_SCRIPT)/bp_mask_by_search.pl $(INST_SCRIPT)/bp_mrtrans.pl $(INST_SCRIPT)/bp_process_gadfly.pl $(INST_SCRIPT)/bp_gccalc.pl $(INST_SCRIPT)/bp_seqconvert.pl $(INST_SCRIPT)/bp_oligo_count.pl $(INST_SCRIPT)/bp_process_ncbi_human.pl $(INST_SCRIPT)/bp_index.pl $(INST_SCRIPT)/bp_split_seq.pl $(INST_SCRIPT)/bp_remote_blast.pl $(INST_SCRIPT)/bp_aacomp.pl $(INST_SCRIPT)/bp_heterogeneity_test.pl $(INST_SCRIPT)/bp_frend.pl $(INST_SCRIPT)/bp_generate_histogram.pl $(INST_SCRIPT)/bp_process_sgd.pl $(INST_SCRIPT)/bp_mutate.pl $(INST_SCRIPT)/bp_filter_search.pl $(INST_SCRIPT)/bp_genbank2gff.pl $(INST_SCRIPT)/bp_bulk_load_gff.pl $(INST_SCRIPT)/bp_nrdb.pl $(INST_SCRIPT)/bp_blast2tree.pl

$(INST_SCRIPT)/bp_chaos_plot.pl: ./scripts_temp/bp_chaos_plot.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_chaos_plot.pl
	cp ./scripts_temp/bp_chaos_plot.pl $(INST_SCRIPT)/bp_chaos_plot.pl
	$(FIXIN) $(INST_SCRIPT)/bp_chaos_plot.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_chaos_plot.pl

$(INST_SCRIPT)/bp_composite_LD.pl: ./scripts_temp/bp_composite_LD.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_composite_LD.pl
	cp ./scripts_temp/bp_composite_LD.pl $(INST_SCRIPT)/bp_composite_LD.pl
	$(FIXIN) $(INST_SCRIPT)/bp_composite_LD.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_composite_LD.pl

$(INST_SCRIPT)/bp_pairwise_kaks.pl: ./scripts_temp/bp_pairwise_kaks.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_pairwise_kaks.pl
	cp ./scripts_temp/bp_pairwise_kaks.pl $(INST_SCRIPT)/bp_pairwise_kaks.pl
	$(FIXIN) $(INST_SCRIPT)/bp_pairwise_kaks.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_pairwise_kaks.pl

$(INST_SCRIPT)/bp_flanks.pl: ./scripts_temp/bp_flanks.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_flanks.pl
	cp ./scripts_temp/bp_flanks.pl $(INST_SCRIPT)/bp_flanks.pl
	$(FIXIN) $(INST_SCRIPT)/bp_flanks.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_flanks.pl

$(INST_SCRIPT)/bp_translate_seq.pl: ./scripts_temp/bp_translate_seq.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_translate_seq.pl
	cp ./scripts_temp/bp_translate_seq.pl $(INST_SCRIPT)/bp_translate_seq.pl
	$(FIXIN) $(INST_SCRIPT)/bp_translate_seq.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_translate_seq.pl

$(INST_SCRIPT)/bp_feature_draw.pl: ./scripts_temp/bp_feature_draw.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_feature_draw.pl
	cp ./scripts_temp/bp_feature_draw.pl $(INST_SCRIPT)/bp_feature_draw.pl
	$(FIXIN) $(INST_SCRIPT)/bp_feature_draw.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_feature_draw.pl

$(INST_SCRIPT)/bp_load_gff.pl: ./scripts_temp/bp_load_gff.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_load_gff.pl
	cp ./scripts_temp/bp_load_gff.pl $(INST_SCRIPT)/bp_load_gff.pl
	$(FIXIN) $(INST_SCRIPT)/bp_load_gff.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_load_gff.pl

$(INST_SCRIPT)/bp_biblio.pl: ./scripts_temp/bp_biblio.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_biblio.pl
	cp ./scripts_temp/bp_biblio.pl $(INST_SCRIPT)/bp_biblio.pl
	$(FIXIN) $(INST_SCRIPT)/bp_biblio.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_biblio.pl

$(INST_SCRIPT)/bp_fast_load_gff.pl: ./scripts_temp/bp_fast_load_gff.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_fast_load_gff.pl
	cp ./scripts_temp/bp_fast_load_gff.pl $(INST_SCRIPT)/bp_fast_load_gff.pl
	$(FIXIN) $(INST_SCRIPT)/bp_fast_load_gff.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_fast_load_gff.pl

$(INST_SCRIPT)/bp_search_overview.pl: ./scripts_temp/bp_search_overview.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_search_overview.pl
	cp ./scripts_temp/bp_search_overview.pl $(INST_SCRIPT)/bp_search_overview.pl
	$(FIXIN) $(INST_SCRIPT)/bp_search_overview.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_search_overview.pl

$(INST_SCRIPT)/bp_search2tribe.pl: ./scripts_temp/bp_search2tribe.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_search2tribe.pl
	cp ./scripts_temp/bp_search2tribe.pl $(INST_SCRIPT)/bp_search2tribe.pl
	$(FIXIN) $(INST_SCRIPT)/bp_search2tribe.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_search2tribe.pl

$(INST_SCRIPT)/bp_fetch.pl: ./scripts_temp/bp_fetch.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_fetch.pl
	cp ./scripts_temp/bp_fetch.pl $(INST_SCRIPT)/bp_fetch.pl
	$(FIXIN) $(INST_SCRIPT)/bp_fetch.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_fetch.pl

$(INST_SCRIPT)/bp_seq_length.pl: ./scripts_temp/bp_seq_length.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_seq_length.pl
	cp ./scripts_temp/bp_seq_length.pl $(INST_SCRIPT)/bp_seq_length.pl
	$(FIXIN) $(INST_SCRIPT)/bp_seq_length.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_seq_length.pl

$(INST_SCRIPT)/bp_search2gff.pl: ./scripts_temp/bp_search2gff.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_search2gff.pl
	cp ./scripts_temp/bp_search2gff.pl $(INST_SCRIPT)/bp_search2gff.pl
	$(FIXIN) $(INST_SCRIPT)/bp_search2gff.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_search2gff.pl

$(INST_SCRIPT)/bp_extract_feature_seq.pl: ./scripts_temp/bp_extract_feature_seq.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_extract_feature_seq.pl
	cp ./scripts_temp/bp_extract_feature_seq.pl $(INST_SCRIPT)/bp_extract_feature_seq.pl
	$(FIXIN) $(INST_SCRIPT)/bp_extract_feature_seq.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_extract_feature_seq.pl

$(INST_SCRIPT)/bp_pg_bulk_load_gff.pl: ./scripts_temp/bp_pg_bulk_load_gff.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_pg_bulk_load_gff.pl
	cp ./scripts_temp/bp_pg_bulk_load_gff.pl $(INST_SCRIPT)/bp_pg_bulk_load_gff.pl
	$(FIXIN) $(INST_SCRIPT)/bp_pg_bulk_load_gff.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_pg_bulk_load_gff.pl

$(INST_SCRIPT)/bp_sreformat.pl: ./scripts_temp/bp_sreformat.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_sreformat.pl
	cp ./scripts_temp/bp_sreformat.pl $(INST_SCRIPT)/bp_sreformat.pl
	$(FIXIN) $(INST_SCRIPT)/bp_sreformat.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_sreformat.pl

$(INST_SCRIPT)/bp_dbsplit.pl: ./scripts_temp/bp_dbsplit.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_dbsplit.pl
	cp ./scripts_temp/bp_dbsplit.pl $(INST_SCRIPT)/bp_dbsplit.pl
	$(FIXIN) $(INST_SCRIPT)/bp_dbsplit.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_dbsplit.pl

$(INST_SCRIPT)/bp_biofetch_genbank_proxy.pl: ./scripts_temp/bp_biofetch_genbank_proxy.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_biofetch_genbank_proxy.pl
	cp ./scripts_temp/bp_biofetch_genbank_proxy.pl $(INST_SCRIPT)/bp_biofetch_genbank_proxy.pl
	$(FIXIN) $(INST_SCRIPT)/bp_biofetch_genbank_proxy.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_biofetch_genbank_proxy.pl

$(INST_SCRIPT)/bp_bioflat_index.pl: ./scripts_temp/bp_bioflat_index.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_bioflat_index.pl
	cp ./scripts_temp/bp_bioflat_index.pl $(INST_SCRIPT)/bp_bioflat_index.pl
	$(FIXIN) $(INST_SCRIPT)/bp_bioflat_index.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_bioflat_index.pl

$(INST_SCRIPT)/bp_search2alnblocks.pl: ./scripts_temp/bp_search2alnblocks.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_search2alnblocks.pl
	cp ./scripts_temp/bp_search2alnblocks.pl $(INST_SCRIPT)/bp_search2alnblocks.pl
	$(FIXIN) $(INST_SCRIPT)/bp_search2alnblocks.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_search2alnblocks.pl

$(INST_SCRIPT)/bp_local_taxonomydb_query.pl: ./scripts_temp/bp_local_taxonomydb_query.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_local_taxonomydb_query.pl
	cp ./scripts_temp/bp_local_taxonomydb_query.pl $(INST_SCRIPT)/bp_local_taxonomydb_query.pl
	$(FIXIN) $(INST_SCRIPT)/bp_local_taxonomydb_query.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_local_taxonomydb_query.pl

$(INST_SCRIPT)/bp_biogetseq.pl: ./scripts_temp/bp_biogetseq.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_biogetseq.pl
	cp ./scripts_temp/bp_biogetseq.pl $(INST_SCRIPT)/bp_biogetseq.pl
	$(FIXIN) $(INST_SCRIPT)/bp_biogetseq.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_biogetseq.pl

$(INST_SCRIPT)/bp_process_wormbase.pl: ./scripts_temp/bp_process_wormbase.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_process_wormbase.pl
	cp ./scripts_temp/bp_process_wormbase.pl $(INST_SCRIPT)/bp_process_wormbase.pl
	$(FIXIN) $(INST_SCRIPT)/bp_process_wormbase.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_process_wormbase.pl

$(INST_SCRIPT)/bp_search2BSML.pl: ./scripts_temp/bp_search2BSML.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_search2BSML.pl
	cp ./scripts_temp/bp_search2BSML.pl $(INST_SCRIPT)/bp_search2BSML.pl
	$(FIXIN) $(INST_SCRIPT)/bp_search2BSML.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_search2BSML.pl

$(INST_SCRIPT)/bp_taxid4species.pl: ./scripts_temp/bp_taxid4species.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_taxid4species.pl
	cp ./scripts_temp/bp_taxid4species.pl $(INST_SCRIPT)/bp_taxid4species.pl
	$(FIXIN) $(INST_SCRIPT)/bp_taxid4species.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_taxid4species.pl

$(INST_SCRIPT)/bp_mask_by_search.pl: ./scripts_temp/bp_mask_by_search.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_mask_by_search.pl
	cp ./scripts_temp/bp_mask_by_search.pl $(INST_SCRIPT)/bp_mask_by_search.pl
	$(FIXIN) $(INST_SCRIPT)/bp_mask_by_search.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_mask_by_search.pl

$(INST_SCRIPT)/bp_mrtrans.pl: ./scripts_temp/bp_mrtrans.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_mrtrans.pl
	cp ./scripts_temp/bp_mrtrans.pl $(INST_SCRIPT)/bp_mrtrans.pl
	$(FIXIN) $(INST_SCRIPT)/bp_mrtrans.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_mrtrans.pl

$(INST_SCRIPT)/bp_process_gadfly.pl: ./scripts_temp/bp_process_gadfly.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_process_gadfly.pl
	cp ./scripts_temp/bp_process_gadfly.pl $(INST_SCRIPT)/bp_process_gadfly.pl
	$(FIXIN) $(INST_SCRIPT)/bp_process_gadfly.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_process_gadfly.pl

$(INST_SCRIPT)/bp_gccalc.pl: ./scripts_temp/bp_gccalc.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_gccalc.pl
	cp ./scripts_temp/bp_gccalc.pl $(INST_SCRIPT)/bp_gccalc.pl
	$(FIXIN) $(INST_SCRIPT)/bp_gccalc.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_gccalc.pl

$(INST_SCRIPT)/bp_seqconvert.pl: ./scripts_temp/bp_seqconvert.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_seqconvert.pl
	cp ./scripts_temp/bp_seqconvert.pl $(INST_SCRIPT)/bp_seqconvert.pl
	$(FIXIN) $(INST_SCRIPT)/bp_seqconvert.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_seqconvert.pl

$(INST_SCRIPT)/bp_oligo_count.pl: ./scripts_temp/bp_oligo_count.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_oligo_count.pl
	cp ./scripts_temp/bp_oligo_count.pl $(INST_SCRIPT)/bp_oligo_count.pl
	$(FIXIN) $(INST_SCRIPT)/bp_oligo_count.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_oligo_count.pl

$(INST_SCRIPT)/bp_process_ncbi_human.pl: ./scripts_temp/bp_process_ncbi_human.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_process_ncbi_human.pl
	cp ./scripts_temp/bp_process_ncbi_human.pl $(INST_SCRIPT)/bp_process_ncbi_human.pl
	$(FIXIN) $(INST_SCRIPT)/bp_process_ncbi_human.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_process_ncbi_human.pl

$(INST_SCRIPT)/bp_index.pl: ./scripts_temp/bp_index.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_index.pl
	cp ./scripts_temp/bp_index.pl $(INST_SCRIPT)/bp_index.pl
	$(FIXIN) $(INST_SCRIPT)/bp_index.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_index.pl

$(INST_SCRIPT)/bp_split_seq.pl: ./scripts_temp/bp_split_seq.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_split_seq.pl
	cp ./scripts_temp/bp_split_seq.pl $(INST_SCRIPT)/bp_split_seq.pl
	$(FIXIN) $(INST_SCRIPT)/bp_split_seq.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_split_seq.pl

$(INST_SCRIPT)/bp_remote_blast.pl: ./scripts_temp/bp_remote_blast.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_remote_blast.pl
	cp ./scripts_temp/bp_remote_blast.pl $(INST_SCRIPT)/bp_remote_blast.pl
	$(FIXIN) $(INST_SCRIPT)/bp_remote_blast.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_remote_blast.pl

$(INST_SCRIPT)/bp_aacomp.pl: ./scripts_temp/bp_aacomp.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_aacomp.pl
	cp ./scripts_temp/bp_aacomp.pl $(INST_SCRIPT)/bp_aacomp.pl
	$(FIXIN) $(INST_SCRIPT)/bp_aacomp.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_aacomp.pl

$(INST_SCRIPT)/bp_heterogeneity_test.pl: ./scripts_temp/bp_heterogeneity_test.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_heterogeneity_test.pl
	cp ./scripts_temp/bp_heterogeneity_test.pl $(INST_SCRIPT)/bp_heterogeneity_test.pl
	$(FIXIN) $(INST_SCRIPT)/bp_heterogeneity_test.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_heterogeneity_test.pl

$(INST_SCRIPT)/bp_frend.pl: ./scripts_temp/bp_frend.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_frend.pl
	cp ./scripts_temp/bp_frend.pl $(INST_SCRIPT)/bp_frend.pl
	$(FIXIN) $(INST_SCRIPT)/bp_frend.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_frend.pl

$(INST_SCRIPT)/bp_generate_histogram.pl: ./scripts_temp/bp_generate_histogram.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_generate_histogram.pl
	cp ./scripts_temp/bp_generate_histogram.pl $(INST_SCRIPT)/bp_generate_histogram.pl
	$(FIXIN) $(INST_SCRIPT)/bp_generate_histogram.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_generate_histogram.pl

$(INST_SCRIPT)/bp_process_sgd.pl: ./scripts_temp/bp_process_sgd.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_process_sgd.pl
	cp ./scripts_temp/bp_process_sgd.pl $(INST_SCRIPT)/bp_process_sgd.pl
	$(FIXIN) $(INST_SCRIPT)/bp_process_sgd.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_process_sgd.pl

$(INST_SCRIPT)/bp_mutate.pl: ./scripts_temp/bp_mutate.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_mutate.pl
	cp ./scripts_temp/bp_mutate.pl $(INST_SCRIPT)/bp_mutate.pl
	$(FIXIN) $(INST_SCRIPT)/bp_mutate.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_mutate.pl

$(INST_SCRIPT)/bp_filter_search.pl: ./scripts_temp/bp_filter_search.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_filter_search.pl
	cp ./scripts_temp/bp_filter_search.pl $(INST_SCRIPT)/bp_filter_search.pl
	$(FIXIN) $(INST_SCRIPT)/bp_filter_search.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_filter_search.pl

$(INST_SCRIPT)/bp_genbank2gff.pl: ./scripts_temp/bp_genbank2gff.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_genbank2gff.pl
	cp ./scripts_temp/bp_genbank2gff.pl $(INST_SCRIPT)/bp_genbank2gff.pl
	$(FIXIN) $(INST_SCRIPT)/bp_genbank2gff.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_genbank2gff.pl

$(INST_SCRIPT)/bp_bulk_load_gff.pl: ./scripts_temp/bp_bulk_load_gff.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_bulk_load_gff.pl
	cp ./scripts_temp/bp_bulk_load_gff.pl $(INST_SCRIPT)/bp_bulk_load_gff.pl
	$(FIXIN) $(INST_SCRIPT)/bp_bulk_load_gff.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_bulk_load_gff.pl

$(INST_SCRIPT)/bp_nrdb.pl: ./scripts_temp/bp_nrdb.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_nrdb.pl
	cp ./scripts_temp/bp_nrdb.pl $(INST_SCRIPT)/bp_nrdb.pl
	$(FIXIN) $(INST_SCRIPT)/bp_nrdb.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_nrdb.pl

$(INST_SCRIPT)/bp_blast2tree.pl: ./scripts_temp/bp_blast2tree.pl Makefile $(INST_SCRIPT)/.exists
	@rm -f $(INST_SCRIPT)/bp_blast2tree.pl
	cp ./scripts_temp/bp_blast2tree.pl $(INST_SCRIPT)/bp_blast2tree.pl
	$(FIXIN) $(INST_SCRIPT)/bp_blast2tree.pl
	-@$(CHMOD) $(PERM_RWX) $(INST_SCRIPT)/bp_blast2tree.pl


# --- MakeMaker subdirs section:

# none

# --- MakeMaker clean section:

# Delete temporary files but do not touch installed files. We don't delete
# the Makefile here so a later make realclean still has a makefile to use.

clean ::
	-rm -rf ./blib $(MAKE_APERL_FILE) $(INST_ARCHAUTODIR)/extralibs.all perlmain.c mon.out core core.*perl.*.? *perl.core so_locations pm_to_blib *~ */*~ */*/*~ *$(OBJ_EXT) *$(LIB_EXT) perl.exe $(BOOTSTRAP) $(BASEEXT).bso $(BASEEXT).def $(BASEEXT).exp
	-mv Makefile Makefile.old $(DEV_NULL)
	-rm -rf scripts_temp


# --- MakeMaker realclean section:

# Delete temporary files (via clean) and also delete installed files
realclean purge ::  clean
	rm -rf $(INST_AUTODIR) $(INST_ARCHAUTODIR)
	rm -f $(INST_LIBDIR)/Bio/SeqIO/bsml.pm $(INST_LIBDIR)/Bio/Location/NarrowestCoordPolicy.pm $(INST_LIBDIR)/Bio/MapIO.pm $(INST_LIBDIR)/Bio/SeqIO/abi.pm $(INST_LIBDIR)/Bio/OntologyIO/goflat.pm $(INST_LIBDIR)/Bio/Biblio/Organisation.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_sanger22.pm $(INST_LIBDIR)/Bio/Taxonomy/Taxon.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/memory.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/redgreen_box.pm $(INST_LIBDIR)/Bio/Tools/PrositeScan.pm $(INST_LIBDIR)/Bio/Variation/IO.pm $(INST_LIBDIR)/Bio/Factory/FTLocationFactory.pm $(INST_LIBDIR)/Bio/Tools/ECnumber.pm $(INST_LIBDIR)/Bio/SearchIO/Writer/HTMLResultWriter.pm $(INST_LIBDIR)/Bio/Variation/AAChange.pm $(INST_LIBDIR)/Bio/Biblio/Book.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/match.pm $(INST_LIBDIR)/Bio/Event/EventHandlerI.pm $(INST_LIBDIR)/Bio/DB/GFF/Util/Rearrange.pm $(INST_LIBDIR)/Bio/Tools/OddCodes.pm $(INST_LIBDIR)/Bio/Factory/ObjectFactoryI.pm $(INST_LIBDIR)/Bio/Search/Hit/HitFactory.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/clone.pm $(INST_LIBDIR)/Bio/Root/Global.pm $(INST_LIBDIR)/Bio/Seq.pm $(INST_LIBDIR)/Bio/Root/Vector.pm $(INST_LIBDIR)/Bio/SearchIO/Writer/TextResultWriter.pm $(INST_LIBDIR)/Bio/Tools/HMMER/Set.pm $(INST_LIBDIR)/Bio/SeqIO/FTHelper.pm $(INST_LIBDIR)/Bio/LiveSeq/IO/SRS.pm $(INST_LIBDIR)/Bio/SearchIO/IteratedSearchResultEventBuilder.pm $(INST_LIBDIR)/Bio/Tools/Analysis/Protein/HNN.pm $(INST_LIBDIR)/Bio/Tools/BPlite/HSP.pm $(INST_LIBDIR)/Bio/Tools/Sigcleave.pm $(INST_LIBDIR)/Bio/SearchIO/Writer/BSMLResultWriter.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/pg.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/triangle.pm $(INST_LIBDIR)/Bio/Assembly/IO/ace.pm $(INST_LIBDIR)/Bio/Map/CytoMap.pm $(INST_LIBDIR)/Bio/Restriction/IO/bairoch.pm $(INST_LIBDIR)/Bio/Graphics/ConfiguratorI.pm $(INST_LIBDIR)/Bio/Structure/IO.pm $(INST_LIBDIR)/Bio/Tree/RandomFactory.pm $(INST_LIBDIR)/Bio/Factory/ObjectFactory.pm $(INST_LIBDIR)/Bio/Matrix/PSM/SiteMatrix.pm $(INST_LIBDIR)/Bio/DasI.pm $(INST_LIBDIR)/Bio/Matrix/PSM/InstanceSiteI.pm $(INST_LIBDIR)/Bio/Coordinate/ExtrapolatingPair.pm $(INST_LIBDIR)/Bio/SeqIO/pir.pm $(INST_LIBDIR)/Bio/Tools/SeqAnal.pm $(INST_LIBDIR)/Bio/Annotation/StructuredValue.pm $(INST_LIBDIR)/Bio/Biblio/Ref.pm $(INST_LIBDIR)/Bio/SeqIO/phd.pm $(INST_LIBDIR)/Bio/Tools/Glimmer.pm $(INST_LIBDIR)/Bio/CodonUsage/IO.pm $(INST_LIBDIR)/Bio/Location/Simple.pm $(INST_LIBDIR)/Bio/Map/OrderedPositionWithDistance.pm $(INST_LIBDIR)/Bio/Tools/Phylo/PAML/Result.pm $(INST_LIBDIR)/Bio/AnnotationCollectionI.pm $(INST_LIBDIR)/Bio/SeqAnalysisParserI.pm $(INST_LIBDIR)/Bio/Tools/GFF.pm $(INST_LIBDIR)/Bio/Variation/AAReverseMutate.pm $(INST_LIBDIR)/Bio/Cluster/ClusterFactory.pm $(INST_LIBDIR)/Bio/Tools/SiRNA.pm $(INST_LIBDIR)/Bio/SeqIO/qual.pm $(INST_LIBDIR)/Bio/DB/EMBL.pm $(INST_LIBDIR)/Bio/Tools/SeqWords.pm $(INST_LIBDIR)/Bio/DB/UpdateableSeqI.pm $(INST_LIBDIR)/Bio/SeqFeature/Computation.pm $(INST_LIBDIR)/Bio/Matrix/IO.pm $(INST_LIBDIR)/Bio/SeqFeature/CollectionI.pm $(INST_LIBDIR)/Bio/DB/GFF/Homol.pm $(INST_LIBDIR)/Bio/SeqIO/locuslink.pm $(INST_LIBDIR)/Bio/SeqIO/ctf.pm $(INST_LIBDIR)/Bio/SeqUtils.pm $(INST_LIBDIR)/Bio/DB/Flat/BDB/embl.pm $(INST_LIBDIR)/Bio/Tools/CodonTable.pm $(INST_LIBDIR)/Bio/Variation/README $(INST_LIBDIR)/Bio/Annotation/Comment.pm $(INST_LIBDIR)/Bio/Seq/BaseSeqProcessor.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi.pm $(INST_LIBDIR)/Bio/Tools/QRNA.pm $(INST_LIBDIR)/Bio/SeqFeature/Primer.pm $(INST_LIBDIR)/Bio/PopGen/PopStats.pm $(INST_LIBDIR)/Bio/DB/SeqI.pm $(INST_LIBDIR)/Bio/Tools/Pseudowise.pm $(INST_LIBDIR)/Bio/Tools/Profile.pm $(INST_LIBDIR)/Bio/ClusterIO/unigene.pm $(INST_LIBDIR)/Bio/LiveSeq/Repeat_Region.pm $(INST_LIBDIR)/Bio/SeqFeature/Tools/TypeMapper.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator.pm $(INST_LIBDIR)/Bio/PopGen/PopulationI.pm $(INST_LIBDIR)/Bio/SeqIO/game/seqHandler.pm $(INST_LIBDIR)/Bio/AnalysisResultI.pm $(INST_LIBDIR)/Bio/Phenotype/MeSH/Term.pm $(INST_LIBDIR)/Bio/Tools/Sim4/Results.pm $(INST_LIBDIR)/Bio/Seq/PrimedSeq.pm $(INST_LIBDIR)/Bio/Biblio/MedlineJournalArticle.pm $(INST_LIBDIR)/Bio/SeqIO/swiss.pm $(INST_LIBDIR)/Bio/SeqFeatureI.pm $(INST_LIBDIR)/Bio/Biblio/BiblioBase.pm $(INST_LIBDIR)/Bio/Location/Split.pm $(INST_LIBDIR)/Bio/DB/GenPept.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/biofetch.pm $(INST_LIBDIR)/Bio/Ontology/SimpleGOEngine.pm $(INST_LIBDIR)/Bio/Graphics/Glyph.pm $(INST_LIBDIR)/Bio/Graphics/FeatureFile.pm $(INST_LIBDIR)/Bio/Tree/Node.pm $(INST_LIBDIR)/Bio/Symbol/ProteinAlphabet.pm $(INST_LIBDIR)/Bio/Tools/Prediction/Gene.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/oracleace.pm $(INST_LIBDIR)/Bio/Seq/MetaI.pm $(INST_LIBDIR)/Bio/Tools/RepeatMasker.pm $(INST_LIBDIR)/Bio/LiveSeq/Repeat_Unit.pm $(INST_LIBDIR)/Bio/Matrix/PSM/SiteMatrixI.pm $(INST_LIBDIR)/Bio/Tools/BPlite/Sbjct.pm $(INST_LIBDIR)/Bio/AnnotationI.pm $(INST_LIBDIR)/Bio/DB/XEMBL.pm $(INST_LIBDIR)/Bio/Tools/Geneid.pm $(INST_LIBDIR)/Bio/Tools/Prints.pm $(INST_LIBDIR)/Bio/Phenotype/Phenotype.pm $(INST_LIBDIR)/Bio/Tools/GuessSeqFormat.pm $(INST_LIBDIR)/Bio/Coordinate/Chain.pm $(INST_LIBDIR)/Bio/SeqIO/chadoxml.pm $(INST_LIBDIR)/Bio/Matrix/PSM/IO/mast.pm $(INST_LIBDIR)/Bio/Root/IO.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/transcript2.pm $(INST_LIBDIR)/Bio/Index/Swissprot.pm $(INST_LIBDIR)/Bio/PopGen/GenotypeI.pm $(INST_LIBDIR)/Bio/Search/HSP/PSLHSP.pm $(INST_LIBDIR)/Bio/DB/Flat/BDB/genbank.pm $(INST_LIBDIR)/Bio/DB/GFF/Util/Binning.pm $(INST_LIBDIR)/Bio/Seq/LargePrimarySeq.pm $(INST_LIBDIR)/Bio/DB/GFF.pm $(INST_LIBDIR)/Bio/SeqFeature/Gene/NC_Feature.pm $(INST_LIBDIR)/Bio/SimpleAlign.pm $(INST_LIBDIR)/Bio/SearchDist.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/alignment.pm $(INST_LIBDIR)/Bio/Symbol/Alphabet.pm $(INST_LIBDIR)/Bio/Tools/Blast/HTML.pm $(INST_LIBDIR)/Bio/Tools/Gel.pm $(INST_LIBDIR)/Bio/OntologyIO/Handlers/BaseSAXHandler.pm $(INST_LIBDIR)/Bio/Graphics/RendererI.pm $(INST_LIBDIR)/Bio/Tools/BPlite.pm $(INST_LIBDIR)/Bio/AlignIO/selex.pm $(INST_LIBDIR)/Bio/Factory/TreeFactoryI.pm $(INST_LIBDIR)/Bio/Range.pm $(INST_LIBDIR)/Bio/Matrix/Scoring.pm $(INST_LIBDIR)/Bio/Annotation/AnnotationFactory.pm $(INST_LIBDIR)/Bio/Tools/Eponine.pm $(INST_LIBDIR)/Bio/Map/Position.pm $(INST_LIBDIR)/Bio/Graphics/Util.pm $(INST_LIBDIR)/Bio/Biblio/MedlineBook.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/diamond.pm $(INST_LIBDIR)/Bio/Search/Iteration/IterationI.pm $(INST_LIBDIR)/Bio/Search/Result/GenericResult.pm $(INST_LIBDIR)/Bio/Factory/SequenceProcessorI.pm $(INST_LIBDIR)/Bio/Location/SplitLocationI.pm $(INST_LIBDIR)/Bio/SeqIO/exp.pm $(INST_LIBDIR)/Bio/Assembly/ContigAnalysis.pm $(INST_LIBDIR)/Bio/LiveSeq/DNA.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/ace.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/translation.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/ex.pm $(INST_LIBDIR)/Bio/Seq/LargeSeq.pm $(INST_LIBDIR)/Bio/Tools/Blast/README $(INST_LIBDIR)/Bio/SeqIO/metafasta.pm $(INST_LIBDIR)/Bio/Symbol/SymbolI.pm $(INST_LIBDIR)/Bio/Matrix/PSM/Psm.pm $(INST_LIBDIR)/Bio/Biblio/Thesis.pm $(INST_LIBDIR)/Bio/Search/GenericDatabase.pm $(INST_LIBDIR)/Bio/Event/EventGeneratorI.pm $(INST_LIBDIR)/Bio/Taxonomy/Node.pm $(INST_LIBDIR)/Bio/AlignIO/meme.pm $(INST_LIBDIR)/Bio/DB/GFF/Typename.pm $(INST_LIBDIR)/Bio/Coordinate/Result/Gap.pm $(INST_LIBDIR)/Bio/LiveSeq/Intron.pm $(INST_LIBDIR)/Bio/AlignIO/mase.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_refgene.pm $(INST_LIBDIR)/Bio/SeqIO/pln.pm $(INST_LIBDIR)/Bio/Structure/Atom.pm $(INST_LIBDIR)/Bio/DB/BioFetch.pm $(INST_LIBDIR)/Bio/LiveSeq/Gene.pm $(INST_LIBDIR)/Bio/Ontology/TermI.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/mysql.pm $(INST_LIBDIR)/Bio/Biblio/PubmedBookArticle.pm $(INST_LIBDIR)/Bio/SeqIO/game/featHandler.pm $(INST_LIBDIR)/Bio/Tools/Analysis/Protein/NetPhos.pm $(INST_LIBDIR)/Bio/SeqIO/game/gameHandler.pm $(INST_LIBDIR)/Bio/Ontology/InterProTerm.pm $(INST_LIBDIR)/Bio/Phenotype/PhenotypeI.pm $(INST_LIBDIR)/Bio/Search/Iteration/GenericIteration.pm $(INST_LIBDIR)/Bio/PopGen/Genotype.pm $(INST_LIBDIR)/Bio/Assembly/IO.pm $(INST_LIBDIR)/Bio/DB/GFF/RelSegment.pm $(INST_LIBDIR)/Bio/DB/XEMBLService.pm $(INST_LIBDIR)/Bio/SearchIO/EventHandlerI.pm $(INST_LIBDIR)/Bio/TreeIO/nhx.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/processed_transcript.pm $(INST_LIBDIR)/Bio/Ontology/TermFactory.pm $(INST_LIBDIR)/Bio/SeqI.pm $(INST_LIBDIR)/Bio/Align/StatisticsI.pm $(INST_LIBDIR)/Bio/Matrix/PSM/IO/meme.pm $(INST_LIBDIR)/Bio/Matrix/MatrixI.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/graded_segments.pm $(INST_LIBDIR)/Bio/Matrix/PhylipDist.pm $(INST_LIBDIR)/Bio/Coordinate/GeneMapper.pm $(INST_LIBDIR)/Bio/IdCollectionI.pm $(INST_LIBDIR)/Bio/Cluster/UniGene.pm $(INST_LIBDIR)/Bio/Coordinate/ResultI.pm $(INST_LIBDIR)/Bio/Ontology/Ontology.pm $(INST_LIBDIR)/Bio/Tools/Blat.pm $(INST_LIBDIR)/Bio/SeqIO/scf.pm $(INST_LIBDIR)/Bio/LiveSeq/Translation.pm $(INST_LIBDIR)/Bio/LiveSeq/Chain.pm $(INST_LIBDIR)/Bio/Seq/SeqBuilder.pm $(INST_LIBDIR)/Bio/Align/Utilities.pm $(INST_LIBDIR)/Bio/Root/Version.pm $(INST_LIBDIR)/Bio/PopGen/Marker.pm $(INST_LIBDIR)/Bio/Biblio/IO/pubmed2ref.pm $(INST_LIBDIR)/Bio/LiveSeq/Mutation.pm $(INST_LIBDIR)/Bio/Location/AvWithinCoordPolicy.pm $(INST_LIBDIR)/Bio/Tools/IUPAC.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/processed_transcript.pm $(INST_LIBDIR)/Bio/Structure/Entry.pm $(INST_LIBDIR)/Bio/Perl.pm $(INST_LIBDIR)/Bio/Align/DNAStatistics.pm $(INST_LIBDIR)/Bio/Ontology/OntologyEngineI.pm $(INST_LIBDIR)/Bio/SeqIO/ztr.pm $(INST_LIBDIR)/Bio/Tools/SeqPattern.pm $(INST_LIBDIR)/Bio/Factory/LocationFactoryI.pm $(INST_LIBDIR)/Bio/DB/CUTG.pm $(INST_LIBDIR)/Bio/SearchIO/SearchResultEventBuilder.pm $(INST_LIBDIR)/Bio/Restriction/EnzymeI.pm $(INST_LIBDIR)/Bio/SeqFeature/Tools/Unflattener.pm $(INST_LIBDIR)/Bio/ClusterI.pm $(INST_LIBDIR)/Bio/SeqFeature/Gene/Transcript.pm $(INST_LIBDIR)/Bio/Align/PairwiseStatistics.pm $(INST_LIBDIR)/Bio/Tools/WWW.pm $(INST_LIBDIR)/Bio/Tools/Tmhmm.pm $(INST_LIBDIR)/Bio/Tools/Alignment/Trim.pm $(INST_LIBDIR)/Bio/Tools/ESTScan.pm $(INST_LIBDIR)/Bio/SeqIO/asciitree.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/oval.pm $(INST_LIBDIR)/Bio/SearchIO/blasttable.pm $(INST_LIBDIR)/Bio/Tools/Primer/AssessorI.pm $(INST_LIBDIR)/Bio/Variation/IO/xml.pm $(INST_LIBDIR)/Bio/Taxonomy/FactoryI.pm $(INST_LIBDIR)/Bio/Tools/RandomDistFunctions.pm $(INST_LIBDIR)/Bio/Seq/SeqWithQuality.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/ellipse.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_genscan.pm $(INST_LIBDIR)/Bio/Biblio/MedlineArticle.pm $(INST_LIBDIR)/Bio/LiveSeq/Transcript.pm $(INST_LIBDIR)/Bio/Tools/MZEF.pm $(INST_LIBDIR)/Bio/SearchIO/hmmer.pm $(INST_LIBDIR)/Bio/Biblio/Article.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_unigene.pm $(INST_LIBDIR)/Bio/AlignIO/prodom.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/Factory.pm $(INST_LIBDIR)/Bio/Annotation/Collection.pm $(INST_LIBDIR)/Bio/Restriction/IO/itype2.pm $(INST_LIBDIR)/biodatabases.pod $(INST_LIBDIR)/Bio/Restriction/EnzymeCollection.pm $(INST_LIBDIR)/Bio/SeqIO/tab.pm $(INST_LIBDIR)/Bio/SeqFeature/Gene/Promoter.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/mysqlopt.pm $(INST_LIBDIR)/Bio/Factory/SeqAnalysisParserFactoryI.pm $(INST_LIBDIR)/Bio/Graphics/Feature.pm $(INST_LIBDIR)/Bio/Restriction/IO/withrefm.pm $(INST_LIBDIR)/Bio/DB/Universal.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/anchored_arrow.pm $(INST_LIBDIR)/Bio/Tools/Grail.pm $(INST_LIBDIR)/Bio/Variation/SNP.pm $(INST_LIBDIR)/Bio/Taxonomy.pm $(INST_LIBDIR)/Bio/Variation/DNAMutation.pm $(INST_LIBDIR)/Bio/PrimarySeqI.pm $(INST_LIBDIR)/Bio/DescribableI.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/group.pm $(INST_LIBDIR)/Bio/Search/Result/ResultI.pm $(INST_LIBDIR)/Bio/Tools/Coil.pm $(INST_LIBDIR)/Bio/Expression/FeatureGroup.pm $(INST_LIBDIR)/Bio/TreeIO/tabtree.pm $(INST_LIBDIR)/Bio/Coordinate/Graph.pm $(INST_LIBDIR)/Bio/DB/Registry.pm $(INST_LIBDIR)/Bio/AnnotatableI.pm $(INST_LIBDIR)/biodesign.pod $(INST_LIBDIR)/Bio/AlignIO/psi.pm $(INST_LIBDIR)/Bio/Ontology/Term.pm $(INST_LIBDIR)/Bio/AlignIO/msf.pm $(INST_LIBDIR)/Bio/Restriction/Enzyme.pm $(INST_LIBDIR)/Bio/Coordinate/Collection.pm $(INST_LIBDIR)/Bio/Map/OrderedPosition.pm $(INST_LIBDIR)/Bio/SearchIO/megablast.pm $(INST_LIBDIR)/Bio/Search/HSP/GenericHSP.pm $(INST_LIBDIR)/Bio/Tools/Genewise.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/toomany.pm $(INST_LIBDIR)/Bio/TreeIO.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/memory_iterator.pm $(INST_LIBDIR)/Bio/Tools/Phylo/PAML.pm $(INST_LIBDIR)/Bio/DB/MeSH.pm $(INST_LIBDIR)/Bio/DB/Makefile.PL $(INST_LIBDIR)/Bio/Factory/HitFactoryI.pm $(INST_LIBDIR)/Bio/Map/CytoMarker.pm $(INST_LIBDIR)/Bio/Tree/TreeI.pm $(INST_LIBDIR)/Bio/Assembly/ScaffoldI.pm $(INST_LIBDIR)/Bio/Tree/Statistics.pm $(INST_LIBDIR)/Bio/LiveSeq/Mutator.pm $(INST_LIBDIR)/Bio/Search/BlastUtils.pm $(INST_LIBDIR)/Bio/Tools/BPlite/Iteration.pm $(INST_LIBDIR)/Bio/OntologyIO/InterProParser.pm $(INST_LIBDIR)/Bio/Phenotype/OMIM/OMIMentryAllelicVariant.pm $(INST_LIBDIR)/Bio/SeqIO.pm $(INST_LIBDIR)/Bio/Index/Fasta.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/transcript.pm $(INST_LIBDIR)/bptutorial.pl $(INST_LIBDIR)/Bio/Symbol/AlphabetI.pm $(INST_LIBDIR)/Bio/SeqIO/game.pm $(INST_LIBDIR)/Bio/Biblio/MedlineBookArticle.pm $(INST_LIBDIR)/Bio/OntologyIO.pm $(INST_LIBDIR)/Bio/SeqIO/tigr.pm $(INST_LIBDIR)/Bio/AlignIO/metafasta.pm $(INST_LIBDIR)/Bio/AlignIO/pfam.pm $(INST_LIBDIR)/Bio/Map/LinkagePosition.pm $(INST_LIBDIR)/Bio/Ontology/PathI.pm $(INST_LIBDIR)/Bio/SeqIO/raw.pm $(INST_LIBDIR)/Bio/Tools/BPbl2seq.pm $(INST_LIBDIR)/Bio/Search/HSP/PsiBlastHSP.pm $(INST_LIBDIR)/Bio/SeqFeature/Gene/GeneStructureI.pm $(INST_LIBDIR)/Bio/AlignIO/bl2seq.pm $(INST_LIBDIR)/Bio/Tools/Primer/Feature.pm $(INST_LIBDIR)/Bio/Tools/Phylo/Molphy/Result.pm $(INST_LIBDIR)/Bio/Tools/Analysis/SimpleAnalysisBase.pm $(INST_LIBDIR)/Bio/SeqFeature/Gene/UTR.pm $(INST_LIBDIR)/Bio/DBLinkContainerI.pm $(INST_LIBDIR)/Bio/SearchIO/Writer/HSPTableWriter.pm $(INST_LIBDIR)/Bio/SearchIO/axt.pm $(INST_LIBDIR)/Bio/SeqIO/kegg.pm $(INST_LIBDIR)/Bio/Tools/Genomewise.pm $(INST_LIBDIR)/Bio/DB/Ace.pm $(INST_LIBDIR)/Bio/SearchIO/psl.pm $(INST_LIBDIR)/Bio/Map/CytoPosition.pm $(INST_LIBDIR)/Bio/Index/SwissPfam.pm $(INST_LIBDIR)/Bio/Restriction/IO/base.pm $(INST_LIBDIR)/Bio/DB/Taxonomy.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/splice_site.pm $(INST_LIBDIR)/Bio/Factory/ObjectBuilderI.pm $(INST_LIBDIR)/Bio/Tools/Est2Genome.pm $(INST_LIBDIR)/Bio/AlignIO/emboss.pm $(INST_LIBDIR)/Bio/Index/Fastq.pm $(INST_LIBDIR)/Bio/DB/Flat/BDB/fasta.pm $(INST_LIBDIR)/Bio/Tools/Run/WrapperBase.pm $(INST_LIBDIR)/Bio/Annotation/Reference.pm $(INST_LIBDIR)/Bio/Biblio/PubmedArticle.pm $(INST_LIBDIR)/Bio/Root/Err.pm $(INST_LIBDIR)/Bio/Matrix/PSM/PsmHeader.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/rndrect.pm $(INST_LIBDIR)/Bio/Ontology/SimpleOntologyEngine.pm $(INST_LIBDIR)/Bio/AnalysisI.pm $(INST_LIBDIR)/Bio/Annotation/DBLink.pm $(INST_LIBDIR)/Bio/Seq/TraceI.pm $(INST_LIBDIR)/Bio/PopGen/Individual.pm $(INST_LIBDIR)/Bio/DB/Flat/BDB.pm $(INST_LIBDIR)/Bio/DB/DBFetch.pm $(INST_LIBDIR)/Bio/Seq/SequenceTrace.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/box.pm $(INST_LIBDIR)/Bio/SearchIO/SearchWriterI.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/redgreen_segment.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/span.pm $(INST_LIBDIR)/Bio/SearchIO/blast.pm $(INST_LIBDIR)/Bio/Biblio/IO/medlinexml.pm $(INST_LIBDIR)/Bio/DB/Flat/BDB/swissprot.pm $(INST_LIBDIR)/Bio/Tools/Alignment/Consed.pm $(INST_LIBDIR)/Bio/SeqFeature/Generic.pm $(INST_LIBDIR)/Bio/SeqFeature/Gene/GeneStructure.pm $(INST_LIBDIR)/Bio/Ontology/Path.pm $(INST_LIBDIR)/Bio/Tools/Run/RemoteBlast.pm $(INST_LIBDIR)/Bio/Tools/EMBOSS/Palindrome.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_ensgene.pm $(INST_LIBDIR)/Bio/Search/SearchUtils.pm $(INST_LIBDIR)/Bio/Root/HTTPget.pm $(INST_LIBDIR)/Bio/Structure/SecStr/STRIDE/Res.pm $(INST_LIBDIR)/Bio/Map/MappableI.pm $(INST_LIBDIR)/Bio/Root/Xref.pm $(INST_LIBDIR)/Bio/Search/Hit/HMMERHit.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/minmax.pm $(INST_LIBDIR)/Bio/Symbol/README.Symbol $(INST_LIBDIR)/Bio/Graphics/Glyph/cds.pm $(INST_LIBDIR)/Bio/SearchIO/Writer/GbrowseGFF.pm $(INST_LIBDIR)/Bio/Biblio/IO/pubmedxml.pm $(INST_LIBDIR)/Bio/Coordinate/Result/Match.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/track.pm $(INST_LIBDIR)/Bio/Root/Object.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/oracle.pm $(INST_LIBDIR)/Bio/Search/DatabaseI.pm $(INST_LIBDIR)/Bio/SeqIO/ace.pm $(INST_LIBDIR)/Bio/Location/FuzzyLocationI.pm $(INST_LIBDIR)/Bio/OntologyIO/Handlers/InterProHandler.pm $(INST_LIBDIR)/Bio/Search/Result/ResultFactory.pm $(INST_LIBDIR)/Bio/Factory/ResultFactoryI.pm $(INST_LIBDIR)/Bio/FeatureHolderI.pm $(INST_LIBDIR)/Bio/DB/QueryI.pm $(INST_LIBDIR)/Bio/Tools/AnalysisResult.pm $(INST_LIBDIR)/Bio/Biblio/BookArticle.pm $(INST_LIBDIR)/Bio/DB/MANIFEST $(INST_LIBDIR)/Bio/Matrix/PSM/PsmHeaderI.pm $(INST_LIBDIR)/Bio/Seq/RichSeqI.pm $(INST_LIBDIR)/Bio/ClusterIO/dbsnp.pm $(INST_LIBDIR)/Bio/LiveSeq/IO/Loader.pm $(INST_LIBDIR)/Bio/Variation/IO/flat.pm $(INST_LIBDIR)/Bio/CodonUsage/Table.pm $(INST_LIBDIR)/Bio/Structure/Chain.pm $(INST_LIBDIR)/Bio/LiveSeq/AARange.pm $(INST_LIBDIR)/Bio/AlignIO/mega.pm $(INST_LIBDIR)/Bio/DB/FileCache.pm $(INST_LIBDIR)/Bio/Assembly/Contig.pm $(INST_LIBDIR)/Bio/Root/RootI.pm $(INST_LIBDIR)/Bio/AnalysisParserI.pm $(INST_LIBDIR)/Bio/DB/Taxonomy/flatfile.pm $(INST_LIBDIR)/Bio/Search/HSP/HMMERHSP.pm $(INST_LIBDIR)/Bio/Tools/BPpsilite.pm $(INST_LIBDIR)/Bio/Ontology/Relationship.pm $(INST_LIBDIR)/Bio/DB/GenBank.pm $(INST_LIBDIR)/Bio/Tools/Hmmpfam.pm $(INST_LIBDIR)/Bio/Tools/Phylo/Molphy.pm $(INST_LIBDIR)/Bio/Variation/VariantI.pm $(INST_LIBDIR)/Bio/LocatableSeq.pm $(INST_LIBDIR)/Bio/Tools/Sim4/Exon.pm $(INST_LIBDIR)/Bio/OntologyIO/dagflat.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/heterogeneous_segments.pm $(INST_LIBDIR)/Bio/AlignIO/nexus.pm $(INST_LIBDIR)/Bio/Biblio/Patent.pm $(INST_LIBDIR)/Bio/Tools/Phylo/Phylip/ProtDist.pm $(INST_LIBDIR)/Bio/SeqIO/alf.pm $(INST_LIBDIR)/Bio/Location/WidestCoordPolicy.pm $(INST_LIBDIR)/Bio/DB/Query/GenBank.pm $(INST_LIBDIR)/Bio/Annotation/SimpleValue.pm $(INST_LIBDIR)/Bio/Biblio/MedlineJournal.pm $(INST_LIBDIR)/Bio/Root/IOManager.pm $(INST_LIBDIR)/Bio/OntologyIO/soflat.pm $(INST_LIBDIR)/Bio/Restriction/Enzyme/MultiCut.pm $(INST_LIBDIR)/Bio/DB/Biblio/biofetch.pm $(INST_LIBDIR)/Bio/Biblio/Proceeding.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/alignment.pm $(INST_LIBDIR)/Bio/Biblio/Journal.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/segmented_keyglyph.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/transcript.pm $(INST_LIBDIR)/Bio/Cluster/SequenceFamily.pm $(INST_LIBDIR)/Bio/Tree/TreeFunctionsI.pm $(INST_LIBDIR)/Bio/LocationI.pm $(INST_LIBDIR)/Bio/Biblio/Provider.pm $(INST_LIBDIR)/Bio/Biblio/Person.pm $(INST_LIBDIR)/Bio/AlignIO/phylip.pm $(INST_LIBDIR)/Bio/DB/WebDBSeqI.pm $(INST_LIBDIR)/Bio/DB/Flat/BinarySearch.pm $(INST_LIBDIR)/Bio/Graphics.pm $(INST_LIBDIR)/Bio/Search/Hit/Fasta.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/ruler_arrow.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/caching_handle.pm $(INST_LIBDIR)/Bio/Ontology/RelationshipI.pm $(INST_LIBDIR)/Bio/Search/HSP/HSPFactory.pm $(INST_LIBDIR)/Bio/Root/Exception.pm $(INST_LIBDIR)/Bio/Ontology/OntologyStore.pm $(INST_LIBDIR)/Bio/Search/Processor.pm $(INST_LIBDIR)/Bio/Variation/Allele.pm $(INST_LIBDIR)/Bio/SeqFeature/FeaturePair.pm $(INST_LIBDIR)/Bio/Annotation/OntologyTerm.pm $(INST_LIBDIR)/Bio/Biblio/PubmedJournalArticle.pm $(INST_LIBDIR)/Bio/TreeIO/TreeEventBuilder.pm $(INST_LIBDIR)/Bio/SeqIO/embl.pm $(INST_LIBDIR)/Bio/WebAgent.pm $(INST_LIBDIR)/Bio/Biblio/IO/medline2ref.pm $(INST_LIBDIR)/Bio/SearchIO/FastHitEventBuilder.pm $(INST_LIBDIR)/Bio/Tree/NodeI.pm $(INST_LIBDIR)/Bio/Search/Hit/HitI.pm $(INST_LIBDIR)/Bio/Phenotype/MeSH/Twig.pm $(INST_LIBDIR)/Bio/Seq/PrimaryQual.pm $(INST_LIBDIR)/Bio/DB/NCBIHelper.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/xyplot.pm $(INST_LIBDIR)/Bio/Annotation/TypeManager.pm $(INST_LIBDIR)/Bio/Tools/Analysis/Protein/Sopma.pm $(INST_LIBDIR)/Bio/Structure/StructureI.pm $(INST_LIBDIR)/Bio/SeqIO/largefasta.pm $(INST_LIBDIR)/Bio/Matrix/PSM/InstanceSite.pm $(INST_LIBDIR)/Bio/Tools/dpAlign.pm $(INST_LIBDIR)/Bio/Align/AlignI.pm $(INST_LIBDIR)/Bio/SeqFeature/PositionProxy.pm $(INST_LIBDIR)/Bio/Index/Abstract.pm $(INST_LIBDIR)/Bio/AlignIO.pm $(INST_LIBDIR)/Bio/Factory/SeqAnalysisParserFactory.pm $(INST_LIBDIR)/Bio/Biblio.pm $(INST_LIBDIR)/Bio/Restriction/Analysis.pm $(INST_LIBDIR)/Bio/Matrix/PSM/IO.pm $(INST_LIBDIR)/Bio/Seq/SeqFastaSpeedFactory.pm $(INST_LIBDIR)/Bio/Factory/SequenceFactoryI.pm $(INST_LIBDIR)/Bio/Structure/Residue.pm $(INST_LIBDIR)/Bio/SeqIO/fasta.pm $(INST_LIBDIR)/Bio/PopGen/IO/prettybase.pm $(INST_LIBDIR)/Bio/AlignIO/maf.pm $(INST_LIBDIR)/Bio/Restriction/Enzyme/MultiSite.pm $(INST_LIBDIR)/Bio/Search/Result/HMMERResult.pm $(INST_LIBDIR)/Bio/Tools/FootPrinter.pm $(INST_LIBDIR)/Bio/Tools/Analysis/Protein/GOR4.pm $(INST_LIBDIR)/Bio/Search/Result/BlastResult.pm $(INST_LIBDIR)/Bio/MapIO/mapmaker.pm $(INST_LIBDIR)/Bio/Tools/Blast.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/dot.pm $(INST_LIBDIR)/Bio/Seq/Meta/Array.pm $(INST_LIBDIR)/bioperl.pod $(INST_LIBDIR)/Bio/Structure/IO/pdb.pm $(INST_LIBDIR)/Bio/Map/LinkageMap.pm $(INST_LIBDIR)/Bio/DB/SwissProt.pm $(INST_LIBDIR)/Bio/Matrix/PSM/IO/transfac.pm $(INST_LIBDIR)/Bio/Tools/Signalp.pm $(INST_LIBDIR)/Bio/Phenotype/Measure.pm $(INST_LIBDIR)/Bio/Factory/ApplicationFactoryI.pm $(INST_LIBDIR)/Bio/SeqFeature/Gene/TranscriptI.pm $(INST_LIBDIR)/Bio/LiveSeq/Range.pm $(INST_LIBDIR)/Bio/Coordinate/Utils.pm $(INST_LIBDIR)/Bio/Search/Result/WABAResult.pm $(INST_LIBDIR)/Bio/DB/RandomAccessI.pm $(INST_LIBDIR)/Bio/PopGen/Simulation/GeneticDrift.pm $(INST_LIBDIR)/Bio/DB/GFF/Feature.pm $(INST_LIBDIR)/Bio/Tools/Blast/CHANGES $(INST_LIBDIR)/Bio/DB/Biblio/soap.pm $(INST_LIBDIR)/Bio/DB/Failover.pm $(INST_LIBDIR)/Bio/Tools/HMMER/Results.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/mysqlace.pm $(INST_LIBDIR)/Bio/Variation/RNAChange.pm $(INST_LIBDIR)/Bio/Tools/Run/README $(INST_LIBDIR)/Bio/TreeIO/lintree.pm $(INST_LIBDIR)/Bio/Graphics/Pictogram.pm $(INST_LIBDIR)/Bio/Factory/AnalysisI.pm $(INST_LIBDIR)/Bio/Expression/FeatureGroup/FeatureGroupMas50.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/dna.pm $(INST_LIBDIR)/Bio/Phenotype/OMIM/OMIMparser.pm $(INST_LIBDIR)/Bio/SeqFeature/SiRNA/Pair.pm $(INST_LIBDIR)/Bio/SeqFeature/Similarity.pm $(INST_LIBDIR)/Bio/Location/CoordinatePolicyI.pm $(INST_LIBDIR)/Bio/Expression/FeatureI.pm $(INST_LIBDIR)/Bio/SeqIO/fastq.pm $(INST_LIBDIR)/Bio/Tools/Genscan.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/generic.pm $(INST_LIBDIR)/Bio/Map/SimpleMap.pm $(INST_LIBDIR)/Bio/Search/Hit/PsiBlastHit.pm $(INST_LIBDIR)/Bio/Search/HSP/WABAHSP.pm $(INST_LIBDIR)/Bio/SearchIO/Writer/ResultTableWriter.pm $(INST_LIBDIR)/Bio/AlignIO/fasta.pm $(INST_LIBDIR)/Bio/Map/Marker.pm $(INST_LIBDIR)/Bio/Factory/SequenceStreamI.pm $(INST_LIBDIR)/Bio/AlignIO/clustalw.pm $(INST_LIBDIR)/Bio/Coordinate/MapperI.pm $(INST_LIBDIR)/Bio/DB/Fasta.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/dbi/iterator.pm $(INST_LIBDIR)/Bio/Phenotype/OMIM/OMIMentry.pm $(INST_LIBDIR)/Bio/Seq/QualI.pm $(INST_LIBDIR)/Bio/Tools/Analysis/Protein/Scansite.pm $(INST_LIBDIR)/Bio/Tools/HMMER/Domain.pm $(INST_LIBDIR)/Bio/Graphics/FeatureFile/Iterator.pm $(INST_LIBDIR)/Bio/Das/FeatureTypeI.pm $(INST_LIBDIR)/Bio/DB/GDB.pm $(INST_LIBDIR)/Bio/Cluster/UniGeneI.pm $(INST_LIBDIR)/Bio/Biblio/JournalArticle.pm $(INST_LIBDIR)/Bio/Tools/Blast/Sbjct.pm $(INST_LIBDIR)/Bio/Index/EMBL.pm $(INST_LIBDIR)/Bio/Das/SegmentI.pm $(INST_LIBDIR)/Bio/Search/Hit/GenericHit.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_acembly.pm $(INST_LIBDIR)/Bio/Ontology/OntologyI.pm $(INST_LIBDIR)/Bio/DB/GFF/Segment.pm $(INST_LIBDIR)/Bio/SeqIO/gcg.pm $(INST_LIBDIR)/Bio/Tools/Lucy.pm $(INST_LIBDIR)/Bio/Biblio/WebResource.pm $(INST_LIBDIR)/Bio/TreeIO/svggraph.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_softberry.pm $(INST_LIBDIR)/Bio/SeqFeature/Gene/Poly_A_site.pm $(INST_LIBDIR)/Bio/RangeI.pm $(INST_LIBDIR)/Bio/Symbol/DNAAlphabet.pm $(INST_LIBDIR)/Bio/Tools/Analysis/Protein/ELM.pm $(INST_LIBDIR)/Bio/PrimarySeq.pm $(INST_LIBDIR)/Bio/Biblio/TechReport.pm $(INST_LIBDIR)/Bio/Seq/EncodedSeq.pm $(INST_LIBDIR)/Bio/PopGen/IndividualI.pm $(INST_LIBDIR)/Bio/Factory/MapFactoryI.pm $(INST_LIBDIR)/Bio/Ontology/RelationshipFactory.pm $(INST_LIBDIR)/Bio/Tree/NodeNHX.pm $(INST_LIBDIR)/Bio/SearchIO.pm $(INST_LIBDIR)/Bio/LiveSeq/SeqI.pm $(INST_LIBDIR)/Bio/Tools/Genemark.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/extending_arrow.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/coding.pm $(INST_LIBDIR)/Bio/SearchIO/exonerate.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/segments.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/none.pm $(INST_LIBDIR)/Bio/SeqFeature/SimilarityPair.pm $(INST_LIBDIR)/Bio/Ontology/RelationshipType.pm $(INST_LIBDIR)/Bio/Structure/SecStr/DSSP/Res.pm $(INST_LIBDIR)/Bio/PopGen/Population.pm $(INST_LIBDIR)/Bio/PopGen/Simulation/Coalescent.pm $(INST_LIBDIR)/Bio/PopGen/Statistics.pm $(INST_LIBDIR)/Bio/DB/GFF/Adaptor/biofetch_oracle.pm $(INST_LIBDIR)/Bio/Structure/Model.pm $(INST_LIBDIR)/Bio/Search/HSP/HSPI.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_sanger22pseudo.pm $(INST_LIBDIR)/Bio/Index/AbstractSeq.pm $(INST_LIBDIR)/Bio/Seq/RichSeq.pm $(INST_LIBDIR)/Bio/Expression/FeatureSet/FeatureSetMas50.pm $(INST_LIBDIR)/Bio/Tree/AlleleNode.pm $(INST_LIBDIR)/Bio/DB/RefSeq.pm $(INST_LIBDIR)/bioscripts.pod $(INST_LIBDIR)/Bio/DB/InMemoryCache.pm $(INST_LIBDIR)/Bio/ClusterIO.pm $(INST_LIBDIR)/Bio/TreeIO/nexus.pm $(INST_LIBDIR)/Bio/Graphics/Panel.pm $(INST_LIBDIR)/Bio/Cluster/FamilyI.pm $(INST_LIBDIR)/Bio/LiveSeq/Prim_Transcript.pm $(INST_LIBDIR)/Bio/SeqIO/game/gameSubs.pm $(INST_LIBDIR)/Bio/Tools/Run/StandAloneBlast.pm $(INST_LIBDIR)/Bio/Symbol/Symbol.pm $(INST_LIBDIR)/Bio/Biblio/IO.pm $(INST_LIBDIR)/Bio/Factory/DriverFactory.pm $(INST_LIBDIR)/Bio/PopGen/IO/csv.pm $(INST_LIBDIR)/Bio/Tree/Tree.pm $(INST_LIBDIR)/Bio/OntologyIO/Handlers/InterPro_BioSQL_Handler.pm $(INST_LIBDIR)/Bio/Tools/Analysis/Protein/Domcut.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/primers.pm $(INST_LIBDIR)/Bio/Tools/Analysis/Protein/Mitoprot.pm $(INST_LIBDIR)/Bio/SeqIO/game/gameWriter.pm $(INST_LIBDIR)/Bio/Tools/Prediction/Exon.pm $(INST_LIBDIR)/Bio/SearchIO/wise.pm $(INST_LIBDIR)/Bio/Phenotype/OMIM/MiniMIMentry.pm $(INST_LIBDIR)/Bio/SimpleAnalysisI.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/crossbox.pm $(INST_LIBDIR)/Bio/Tools/Blast/HSP.pm $(INST_LIBDIR)/Bio/Seq/Meta.pm $(INST_LIBDIR)/Bio/SeqFeature/AnnotationAdaptor.pm $(INST_LIBDIR)/Bio/Tools/SeqStats.pm $(INST_LIBDIR)/Bio/LiveSeq/ChainI.pm $(INST_LIBDIR)/Bio/Variation/SeqDiff.pm $(INST_LIBDIR)/Bio/SeqFeature/SiRNA/Oligo.pm $(INST_LIBDIR)/Bio/Search/Hit/BlastHit.pm $(INST_LIBDIR)/Bio/Assembly/Scaffold.pm $(INST_LIBDIR)/Bio/Tools/Primer3.pm $(INST_LIBDIR)/Bio/Matrix/Generic.pm $(INST_LIBDIR)/Bio/SearchIO/blastxml.pm $(INST_LIBDIR)/Bio/UpdateableSeqI.pm $(INST_LIBDIR)/Bio/Restriction/IO.pm $(INST_LIBDIR)/Bio/SeqFeature/Gene/Exon.pm $(INST_LIBDIR)/Bio/Map/MarkerI.pm $(INST_LIBDIR)/Bio/Matrix/IO/phylip.pm $(INST_LIBDIR)/Bio/DB/BiblioI.pm $(INST_LIBDIR)/Bio/SearchIO/waba.pm $(INST_LIBDIR)/Bio/Species.pm $(INST_LIBDIR)/Bio/Tools/pSW.pm $(INST_LIBDIR)/Bio/Coordinate/Pair.pm $(INST_LIBDIR)/Bio/IdentifiableI.pm $(INST_LIBDIR)/Bio/LiveSeq/IO/README $(INST_LIBDIR)/Bio/Index/GenBank.pm $(INST_LIBDIR)/Bio/AlignIO/stockholm.pm $(INST_LIBDIR)/Bio/LiveSeq/IO/BioPerl.pm $(INST_LIBDIR)/Bio/Ontology/GOterm.pm $(INST_LIBDIR)/Bio/Tools/Seg.pm $(INST_LIBDIR)/Bio/Tools/AlignFactory.pm $(INST_LIBDIR)/Bio/DB/Taxonomy/entrez.pm $(INST_LIBDIR)/Bio/SearchIO/sim4.pm $(INST_LIBDIR)/Bio/OntologyIO/simplehierarchy.pm $(INST_LIBDIR)/Bio/SeqFeature/Gene/Intron.pm $(INST_LIBDIR)/Bio/Search/HSP/BlastHSP.pm $(INST_LIBDIR)/Bio/DB/Flat/BDB/swiss.pm $(INST_LIBDIR)/Bio/Tools/Promoterwise.pm $(INST_LIBDIR)/Bio/Map/MapI.pm $(INST_LIBDIR)/Bio/Tools/Phylo/PAML/ModelResult.pm $(INST_LIBDIR)/Bio/Location/Atomic.pm $(INST_LIBDIR)/Bio/Root/Utilities.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/pinsertion.pm $(INST_LIBDIR)/Bio/SeqIO/genbank.pm $(INST_LIBDIR)/Bio/PopGen/IO.pm $(INST_LIBDIR)/Bio/Matrix/IO/scoring.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/arrow.pm $(INST_LIBDIR)/Bio/Biblio/Service.pm $(INST_LIBDIR)/Bio/Map/Microsatellite.pm $(INST_LIBDIR)/Bio/Tools/Primer/Pair.pm $(INST_LIBDIR)/Bio/DB/GFF/Featname.pm $(INST_LIBDIR)/Bio/SearchIO/Writer/HitTableWriter.pm $(INST_LIBDIR)/Bio/Tools/EPCR.pm $(INST_LIBDIR)/Bio/Graphics/Glyph/line.pm $(INST_LIBDIR)/Bio/SeqFeature/Gene/ExonI.pm $(INST_LIBDIR)/Bio/DB/Flat.pm $(INST_LIBDIR)/Bio/Assembly/IO/phrap.pm $(INST_LIBDIR)/Bio/Taxonomy/Tree.pm $(INST_LIBDIR)/Bio/Location/Fuzzy.pm $(INST_LIBDIR)/Bio/Index/Blast.pm $(INST_LIBDIR)/Bio/SeqFeature/Collection.pm $(INST_LIBDIR)/Bio/Tools/Analysis/DNA/ESEfinder.pm $(INST_LIBDIR)/Bio/TreeIO/newick.pm $(INST_LIBDIR)/Bio/SeqIO/MultiFile.pm $(INST_LIBDIR)/Bio/Tools/RestrictionEnzyme.pm $(INST_LIBDIR)/Bio/Tools/Primer/Assessor/Base.pm $(INST_LIBDIR)/Bio/Search/HSP/FastaHSP.pm $(INST_LIBDIR)/Bio/Map/PositionI.pm $(INST_LIBDIR)/Bio/Coordinate/Result.pm $(INST_LIBDIR)/Bio/DB/GFF/Aggregator/ucsc_twinscan.pm $(INST_LIBDIR)/Bio/DB/Query/WebQuery.pm $(INST_LIBDIR)/Bio/Seq/SeqFactory.pm $(INST_LIBDIR)/Bio/Root/Storable.pm $(INST_LIBDIR)/Bio/SearchIO/fasta.pm $(INST_LIBDIR)/Bio/Root/Root.pm $(INST_LIBDIR)/Bio/Phenotype/Correlate.pm $(INST_LIBDIR)/Bio/Tools/pICalculator.pm $(INST_LIBDIR)/Bio/Matrix/PSM/PsmI.pm $(INST_LIBDIR)/Bio/LiveSeq/Exon.pm $(INST_LIBDIR)/Bio/PopGen/MarkerI.pm
	rm -rf Makefile Makefile.old ./Seq.pm.bak


# --- MakeMaker dist_basics section:

distclean :: realclean distcheck

distcheck :
	$(PERL) -I$(PERL_ARCHLIB) -I$(PERL_LIB) -MExtUtils::Manifest=fullcheck \
		-e fullcheck

skipcheck :
	$(PERL) -I$(PERL_ARCHLIB) -I$(PERL_LIB) -MExtUtils::Manifest=skipcheck \
		-e skipcheck

manifest :
	$(PERL) -I$(PERL_ARCHLIB) -I$(PERL_LIB) -MExtUtils::Manifest=mkmanifest \
		-e mkmanifest


# --- MakeMaker dist_core section:

dist : $(DIST_DEFAULT)
	@$(PERL) -le 'print "Warning: Makefile possibly out of date with $$vf" if ' \
	    -e '-e ($$vf="$(VERSION_FROM)") and -M $$vf < -M "Makefile";'

tardist : $(DISTVNAME).tar$(SUFFIX)

zipdist : $(DISTVNAME).zip

$(DISTVNAME).tar$(SUFFIX) : distdir
	$(PREOP)
	$(TO_UNIX)
	$(TAR) $(TARFLAGS) $(DISTVNAME).tar $(DISTVNAME)
	$(RM_RF) $(DISTVNAME)
	$(COMPRESS) $(DISTVNAME).tar
	$(POSTOP)

$(DISTVNAME).zip : distdir
	$(PREOP)
	$(ZIP) $(ZIPFLAGS) $(DISTVNAME).zip $(DISTVNAME)
	$(RM_RF) $(DISTVNAME)
	$(POSTOP)

uutardist : $(DISTVNAME).tar$(SUFFIX)
	uuencode $(DISTVNAME).tar$(SUFFIX) \
		$(DISTVNAME).tar$(SUFFIX) > \
		$(DISTVNAME).tar$(SUFFIX)_uu

shdist : distdir
	$(PREOP)
	$(SHAR) $(DISTVNAME) > $(DISTVNAME).shar
	$(RM_RF) $(DISTVNAME)
	$(POSTOP)


# --- MakeMaker dist_dir section:

distdir :
	$(RM_RF) $(DISTVNAME)
	$(PERL) -I$(PERL_ARCHLIB) -I$(PERL_LIB) -MExtUtils::Manifest=manicopy,maniread \
		-e "manicopy(maniread(),'$(DISTVNAME)', '$(DIST_CP)');"


# --- MakeMaker dist_test section:

disttest : distdir
	cd $(DISTVNAME) && $(PERL) -I$(PERL_ARCHLIB) -I$(PERL_LIB) Makefile.PL
	cd $(DISTVNAME) && $(MAKE)
	cd $(DISTVNAME) && $(MAKE) test


# --- MakeMaker dist_ci section:

ci :
	$(PERL) -I$(PERL_ARCHLIB) -I$(PERL_LIB) -MExtUtils::Manifest=maniread \
		-e "@all = keys %{ maniread() };" \
		-e 'print("Executing $(CI) @all\n"); system("$(CI) @all");' \
		-e 'print("Executing $(RCS_LABEL) ...\n"); system("$(RCS_LABEL) @all");'


# --- MakeMaker install section:

install :: all pure_install doc_install

install_perl :: all pure_perl_install doc_perl_install

install_site :: all pure_site_install doc_site_install

install_ :: install_site
	@echo INSTALLDIRS not defined, defaulting to INSTALLDIRS=site

pure_install :: pure_$(INSTALLDIRS)_install

doc_install :: doc_$(INSTALLDIRS)_install
	@echo Appending installation info to $(INSTALLARCHLIB)/perllocal.pod

pure__install : pure_site_install
	@echo INSTALLDIRS not defined, defaulting to INSTALLDIRS=site

doc__install : doc_site_install
	@echo INSTALLDIRS not defined, defaulting to INSTALLDIRS=site

pure_perl_install ::
	@$(MOD_INSTALL) \
		read $(PERL_ARCHLIB)/auto/$(FULLEXT)/.packlist \
		write $(INSTALLARCHLIB)/auto/$(FULLEXT)/.packlist \
		$(INST_LIB) $(INSTALLPRIVLIB) \
		$(INST_ARCHLIB) $(INSTALLARCHLIB) \
		$(INST_BIN) $(INSTALLBIN) \
		$(INST_SCRIPT) $(INSTALLSCRIPT) \
		$(INST_HTMLLIBDIR) $(INSTALLHTMLPRIVLIBDIR) \
		$(INST_HTMLSCRIPTDIR) $(INSTALLHTMLSCRIPTDIR) \
		$(INST_MAN1DIR) $(INSTALLMAN1DIR) \
		$(INST_MAN3DIR) $(INSTALLMAN3DIR)
	@$(WARN_IF_OLD_PACKLIST) \
		$(SITEARCHEXP)/auto/$(FULLEXT)


pure_site_install ::
	@$(MOD_INSTALL) \
		read $(SITEARCHEXP)/auto/$(FULLEXT)/.packlist \
		write $(INSTALLSITEARCH)/auto/$(FULLEXT)/.packlist \
		$(INST_LIB) $(INSTALLSITELIB) \
		$(INST_ARCHLIB) $(INSTALLSITEARCH) \
		$(INST_BIN) $(INSTALLBIN) \
		$(INST_SCRIPT) $(INSTALLSCRIPT) \
		$(INST_HTMLLIBDIR) $(INSTALLHTMLSITELIBDIR) \
		$(INST_HTMLSCRIPTDIR) $(INSTALLHTMLSCRIPTDIR) \
		$(INST_MAN1DIR) $(INSTALLMAN1DIR) \
		$(INST_MAN3DIR) $(INSTALLMAN3DIR)
	@$(WARN_IF_OLD_PACKLIST) \
		$(PERL_ARCHLIB)/auto/$(FULLEXT)

doc_perl_install ::
	-@$(MKPATH) $(INSTALLARCHLIB)
	-@$(DOC_INSTALL) \
		"Module" "$(NAME)" \
		"installed into" "$(INSTALLPRIVLIB)" \
		LINKTYPE "$(LINKTYPE)" \
		VERSION "$(VERSION)" \
		EXE_FILES "$(EXE_FILES)" \
		>> $(INSTALLARCHLIB)/perllocal.pod

doc_site_install ::
	-@$(MKPATH) $(INSTALLARCHLIB)
	-@$(DOC_INSTALL) \
		"Module" "$(NAME)" \
		"installed into" "$(INSTALLSITELIB)" \
		LINKTYPE "$(LINKTYPE)" \
		VERSION "$(VERSION)" \
		EXE_FILES "$(EXE_FILES)" \
		>> $(INSTALLARCHLIB)/perllocal.pod


uninstall :: uninstall_from_$(INSTALLDIRS)dirs

uninstall_from_perldirs ::
	@$(UNINSTALL) $(PERL_ARCHLIB)/auto/$(FULLEXT)/.packlist

uninstall_from_sitedirs ::
	@$(UNINSTALL) $(SITEARCHEXP)/auto/$(FULLEXT)/.packlist


# --- MakeMaker force section:
# Phony target to force checking subdirectories.
FORCE:
	@$(NOOP)


# --- MakeMaker perldepend section:


# --- MakeMaker makefile section:

# We take a very conservative approach here, but it\'s worth it.
# We move Makefile to Makefile.old here to avoid gnu make looping.
Makefile : Makefile.PL $(CONFIGDEP)
	@echo "Makefile out-of-date with respect to $?"
	@echo "Cleaning current config before rebuilding Makefile..."
	-@$(RM_F) Makefile.old
	-@$(MV) Makefile Makefile.old
	-$(MAKE) -f Makefile.old clean $(DEV_NULL) || $(NOOP)
	$(PERL) "-I$(PERL_ARCHLIB)" "-I$(PERL_LIB)" Makefile.PL 
	@echo "==> Your Makefile has been rebuilt. <=="
	@echo "==> Please rerun the make command.  <=="
	false

# To change behavior to :: would be nice, but would break Tk b9.02
# so you find such a warning below the dist target.
#Makefile :: $(VERSION_FROM)
#	@echo "Warning: Makefile possibly out of date with $(VERSION_FROM)"


# --- MakeMaker staticmake section:

# --- MakeMaker makeaperl section ---
MAP_TARGET    = perl
FULLPERL      = /usr/bin/perl

$(MAP_TARGET) :: static $(MAKE_APERL_FILE)
	$(MAKE) -f $(MAKE_APERL_FILE) $@

$(MAKE_APERL_FILE) : $(FIRST_MAKEFILE)
	@echo Writing \"$(MAKE_APERL_FILE)\" for this $(MAP_TARGET)
	@$(PERL) -I$(INST_ARCHLIB) -I$(INST_LIB) -I$(PERL_ARCHLIB) -I$(PERL_LIB) \
		Makefile.PL DIR= \
		MAKEFILE=$(MAKE_APERL_FILE) LINKTYPE=static \
		MAKEAPERL=1 NORECURS=1 CCCDLFLAGS=


# --- MakeMaker test section:

TEST_VERBOSE=0
TEST_TYPE=test_$(LINKTYPE)
TEST_FILE = test.pl
TEST_FILES = 
TESTDB_SW = -d

testdb :: testdb_$(LINKTYPE)

test :: $(TEST_TYPE)
	@echo 'No tests defined for $(NAME) extension.'

test_dynamic :: pure_all

testdb_dynamic :: pure_all
	PERL_DL_NONLAZY=1 $(FULLPERL) $(TESTDB_SW) -I$(INST_ARCHLIB) -I$(INST_LIB) -I$(PERL_ARCHLIB) -I$(PERL_LIB) $(TEST_FILE)

test_ : test_dynamic

test_static :: test_dynamic
testdb_static :: testdb_dynamic


# --- MakeMaker ppd section:
# Creates a PPD (Perl Package Description) for a binary distribution.
ppd:
	@$(PERL) -e "print qq{<SOFTPKG NAME=\"bioperl\" VERSION=\"1,4,0,0\">\n}. qq{\t<TITLE>bioperl</TITLE>\n}. qq{\t<ABSTRACT>Bioinformatics Toolkit</ABSTRACT>\n}. qq{\t<AUTHOR>Bioperl Team (bioperl-l\@bioperl.org)</AUTHOR>\n}. qq{\t<IMPLEMENTATION>\n}. qq{\t\t<DEPENDENCY NAME=\"DB_File\" VERSION=\"0,0,0,0\" />\n}. qq{\t\t<DEPENDENCY NAME=\"File-Spec\" VERSION=\"0,0,0,0\" />\n}. qq{\t\t<DEPENDENCY NAME=\"File-Temp\" VERSION=\"0,0,0,0\" />\n}. qq{\t\t<DEPENDENCY NAME=\"HTML-Entities\" VERSION=\"0,0,0,0\" />\n}. qq{\t\t<DEPENDENCY NAME=\"IO-Scalar\" VERSION=\"0,0,0,0\" />\n}. qq{\t\t<DEPENDENCY NAME=\"IO-String\" VERSION=\"0,0,0,0\" />\n}. qq{\t\t<OS NAME=\"$(OSNAME)\" />\n}. qq{\t\t<ARCHITECTURE NAME=\"alpha-dec_osf\" />\n}. qq{\t\t<CODEBASE HREF=\"\" />\n}. qq{\t</IMPLEMENTATION>\n}. qq{</SOFTPKG>\n}" > bioperl.ppd

# --- MakeMaker pm_to_blib section:

pm_to_blib: $(TO_INST_PM)
	@$(PERL) "-I$(INST_ARCHLIB)" "-I$(INST_LIB)" \
	"-I$(PERL_ARCHLIB)" "-I$(PERL_LIB)" -MExtUtils::Install \
        -e "pm_to_blib({qw{$(PM_TO_BLIB)}},'$(INST_LIB)/auto')"
	@$(TOUCH) $@


# --- MakeMaker selfdocument section:


# --- MakeMaker postamble section:
show_tests :
	@echo 'type make test_<subtest> to run'
	@echo ''


# End.
-------------- next part --------------


-------------------------------
  Dr. Wolf-Gerolf THIES        
  ITG - Tox, Bau 439, Zi 507   
  Forschungszentrum Karlsruhe  
  Postfach 3640                
  D-76021 Karlsruhe            
  Tel: +49 7247 82-3139/-3547  
  Fax: +49 7247 82 3557        
  E-mail: thies@itg.fzk.de     
--------------------------------
From mwhahn at ucdavis.edu  Mon Mar  1 16:13:29 2004
From: mwhahn at ucdavis.edu (Matthew Hahn)
Date: Mon Mar  1 16:19:24 2004
Subject: Fwd: [Bioperl-l] Bio::PopGen::Statistics discrete calculations
In-Reply-To: <1078175195.4043a5dbad588@webmail.duke.edu>
References: <1078175195.4043a5dbad588@webmail.duke.edu>
Message-ID: <a05200f01bc6956edebfe@[169.237.66.77]>

>----- Forwarded message from avilella <avilella@ebi.ac.uk> -----
>     Date: Mon, 01 Mar 2004 12:53:55 +0000
>     From: avilella <avilella@ebi.ac.uk>
>Reply-To: avilella <avilella@ebi.ac.uk>
>  Subject: [Bioperl-l] Bio::PopGen::Statistics discrete calculations
>       To: bioperl-ml <bioperl-l@bioperl.org>
>
>Hi,
>
>I'm interested in calculating some of the statistics in
>Bio::PopGen::Statistics in a discrete, position-wise, way.
>
>For example,
>
>I've done a quick implementation of the method "pi" that returns the
>value of the statistic for each polymorphic positions instead of the
>mean of all of them. It's "pi_discrete" method:
>
>http://www.ebi.ac.uk/cvs/cvspublic/wallace/bioperl-live/Bio/PopGen/Statistics.pm?rev=1.4&content-type=text/x-cvsweb-markup
>


Do you want to just calculate each site's heterozygosity?  Otherwise, 
I'm not quite sure what 'pi' means in this context.  The pi 
calculation currently implemented simply sums up each site's 
heterozygosity.


>I would also like to have methods to do Sliding Window calculations for
>these statistics. Something like:
>
>"Calculate tajima_D for windows of 100 polymorphisms that slide 10
>positions"
>
>Any comments about the logic, naming, interest of this methods?


Sounds great--I'm sure it would get used.  The only precaution I 
would give is that some of the formats for inputting the polymorphism 
data (i.e. prettybase files) do not have to include monomorphic 
sites.  This would really mess with any kind of sliding window.


cheers,
Matt


>Thanks,
>
>     Albert.
>
>--
>Albert Vilella Bertran    avilella_at_ebi_ac_uk
>EMBL Outstation, European Bioinformatics Institute
>Wellcome Trust Genome Campus, Hinxton
>Cambs. CB10 1SD, United Kingdom
>Phone: +44 (0)1223 494 448   FAX: +44 (0)1223 494 468
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>----- End forwarded message -----


-- 
Matthew W. Hahn
Center for Population Biology
2320 Storer Hall
University of California
Davis, CA 95616
From hlapp at gnf.org  Mon Mar  1 23:16:31 2004
From: hlapp at gnf.org (Hilmar Lapp)
Date: Mon Mar  1 23:22:21 2004
Subject: [Bioperl-l] load_seqdatabase.pl problem
In-Reply-To: <40433322.2080403@univ-montp2.fr>
Message-ID: <62B81678-6C00-11D8-B838-000A959EB4C4@gnf.org>

I suspect the database wasn't fresh at the time you loaded the NCBI 
taxonomy. Specifically, the offending species probably was already in 
there through some sequence load.

Alternatively, the EMBL SeqIO parser fails to parse out the NCBI 
taxonID and the species name slightly mismatches with the one in the 
NCBI taxonomy database.

What is the result if you issue the following query:

	select t.* from taxon t, taxon_name tn
	where tn.taxon_id = t.taxon_id
	and tn.name = 'Aspergillus oryzae'
	and tn.name_class = 'scientific name';

-hilmar

On Monday, March 1, 2004, at 04:57  AM, Eric Bazin wrote:

> Hi.
>
> I try to import embl sequence datas in a biosql schema using the perl
> script "load_seqdatabase.pl". Previously I loaded without any problem
> the taxon schema from the NCBI with load_ncbi_taxonomy.pl. Can anybody 
> give
> me a help for this?
>
> This is what I received:
>
> bash#./load_seqdatabase.pl --dbname polymorphix --format embl
> /home/eric/polybase.dat
>
> Could not store AB022832:
> ------------- EXCEPTION  -------------
> MSG: Unique key query in Bio::DB::BioSQL::SpeciesAdaptor returned 3 
> rows
> instead of 1. Query was [name_class="scie
> ntific name",binomial="Aspergillus oryzae"]
> STACK Bio::DB::BioSQL::BasePersistenceAdaptor::_find_by_unique_key
> /usr/lib/perl5/site_perl/5.8.0/Bio/DB/BioSQL/Ba
> sePersistenceAdaptor.pm:960
> STACK Bio::DB::BioSQL::BasePersistenceAdaptor::find_by_unique_key
> /usr/lib/perl5/site_perl/5.8.0/Bio/DB/BioSQL/Bas
> ePersistenceAdaptor.pm:856
> STACK Bio::DB::BioSQL::BasePersistenceAdaptor::create
> /usr/lib/perl5/site_perl/5.8.0/Bio/DB/BioSQL/BasePersistence
> Adaptor.pm:183
> STACK Bio::DB::Persistent::PersistentObject::create
> /usr/lib/perl5/site_perl/5.8.0/Bio/DB/Persistent/PersistentObj
> ect.pm:243
> STACK Bio::DB::BioSQL::BasePersistenceAdaptor::create
> /usr/lib/perl5/site_perl/5.8.0/Bio/DB/BioSQL/BasePersistence
> Adaptor.pm:170
> [root@thaler biosql]# :BasePersistenceAdaptor::store
> /usr/lib/perl5/site_perl/5.8.0/Bio/DB/BioSQL/BasePersistenceA
> daptor.pm:253
> STACK Bio::DB::Persistent::PersistentObject::store
> /usr/lib/perl5/site_perl/5.8.0/Bio/DB/Persistent/PersistentObje
> ct.pm:270
> STACK (eval) ./load_seqdatabase.pl:517
> STACK toplevel ./load_seqdatabase.pl:500
>
> --------------------------------------
>
> Thanks a lot
>
> Eric bazin
>
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From cuddles21sapphire at hotmail.com  Tue Mar  2 06:24:09 2004
From: cuddles21sapphire at hotmail.com (mauro)
Date: Tue Mar  2 04:35:42 2004
Subject: [Bioperl-l] 8 times longer than V_I A_G R_A??
Message-ID: <1078226649-20222@excite.com>

Find out more about the new drug that puts V I A G R A to shame!!

http://drugsbusiness.com/sv/index.php?pid=eph9106


niki kathytacobell tanya blondie gibson concept promethe
joel chinaqwerty12 larry spain 
angels dan utopia 

Get off this list by writing to http://drugsbusiness.com/sv/applepie.php

From avilella at ebi.ac.uk  Tue Mar  2 04:33:13 2004
From: avilella at ebi.ac.uk (avilella)
Date: Tue Mar  2 04:39:37 2004
Subject: Fwd: [Bioperl-l] Bio::PopGen::Statistics discrete calculations
In-Reply-To: <a05200f01bc6956edebfe@[169.237.66.77]>
References: <1078175195.4043a5dbad588@webmail.duke.edu>
	<a05200f01bc6956edebfe@[169.237.66.77]>
Message-ID: <1078219993.2607.24.camel@localhost.localdomain>

> >I'm interested in calculating some of the statistics in
> >Bio::PopGen::Statistics in a discrete, position-wise, way.
> >
> >For example,
> >
> >I've done a quick implementation of the method "pi" that returns the
> >value of the statistic for each polymorphic positions instead of the
> >mean of all of them. It's "pi_discrete" method:
> >
> >http://www.ebi.ac.uk/cvs/cvspublic/wallace/bioperl-live/Bio/PopGen/Statistics.pm?rev=1.4&content-type=text/x-cvsweb-markup
> >
> 
> 
> Do you want to just calculate each site's heterozygosity?  Otherwise, 
> I'm not quite sure what 'pi' means in this context.  The pi 
> calculation currently implemented simply sums up each site's 
> heterozygosity.

Well, in pi_discrete I save the name of the marker in the key of the
hash, which for MSA files, using
Bio::PopGen::Utilities::aln_to_population is the position in the
alignment:

       for( my $i = 0; $i < $aln->length; $i++ ) {
# 	   my $nm = "Site-$i";
 	   my $nm = sprintf "Site-%09d", $i;


Apart from that, yes*, it's just heterozygosity in sliding windows of
width=1 sliding with jump=1 position each time.

*pi being a bad example of the statistic_discrete methods.

> >I would also like to have methods to do Sliding Window calculations for
> >these statistics. Something like:
> >
> >"Calculate tajima_D for windows of 100 polymorphisms that slide 10
> >positions"
> >
> >Any comments about the logic, naming, interest of this methods?
> 
> 
> Sounds great--I'm sure it would get used.  The only precaution I 
> would give is that some of the formats for inputting the polymorphism 
> data (i.e. prettybase files) do not have to include monomorphic 
> sites.  This would really mess with any kind of sliding window.

I would say that there are two kinds of windows concerning to that:

"Real position windows": where the window refers to the position in the
alignment (either, mono- or polymorphic)

"Polymorphic pos. windows": where the window refers to, e.g., the first
10 polymorphisms in the prettybase file. Obviously, with this second
option, the "real" width of the window is going to be different each
time. But if the name of the marker in the prettybase file informs about
the position, like "Site-$i", then one can, roughly, calculate the
midpoint for those polymorphisms.

And about the prettybase format, I would like to have a conversor in
bioperl from Bio::AlignIO aligments to prettybase "polymorphisms" or
markers.

Which would be the best place for that? Maybe in Bio::PopGen::Utilities?

Feedback is welcomed,

Thanks in advance,

    Albert.

> 
> 
> 
> 
> cheers,
> Matt
> 
> 
> 
> >Thanks,
> >
> >     Albert.
> >
> >--
> >Albert Vilella Bertran    avilella_at_ebi_ac_uk
> >EMBL Outstation, European Bioinformatics Institute
> >Wellcome Trust Genome Campus, Hinxton
> >Cambs. CB10 1SD, United Kingdom
> >Phone: +44 (0)1223 494 448   FAX: +44 (0)1223 494 468
> >
> >_______________________________________________
> >Bioperl-l mailing list
> >Bioperl-l@portal.open-bio.org
> >http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
> >----- End forwarded message -----
> 

From khoueiry at ibsm.cnrs-mrs.fr  Tue Mar  2 06:00:34 2004
From: khoueiry at ibsm.cnrs-mrs.fr (KHOUEIRY pierre)
Date: Tue Mar  2 06:06:28 2004
Subject: [Bioperl-l] Sequence search
Message-ID: <40446952.508@ibsm.cnrs-mrs.fr>

Hi all,
I'm searching in bioperl for methods that can detect in a protein 
sequence a subseq rich in a special amino acids. In other way, I want to 
find _per example_ if there is subsequence of 12 aa (sliding window)  
that contains at least 7 (valin + leucine) in a given sequence of 400 
aa. I need to progress amino acid by amino acid using my sliding window
I appreciate any help,
Thanks

-- 

---------------------------------
Pierre Khoueiry	
khoueiry@ibsm.cnrs-mrs.fr
LCB - CNRS
31, Chemin Joseph Aiguier,
13402 Marseille CEDEX 20, France
---------------------------------


From heikki at ebi.ac.uk  Tue Mar  2 08:04:59 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Tue Mar  2 08:10:51 2004
Subject: [Bioperl-l] Sequence search
In-Reply-To: <40446952.508@ibsm.cnrs-mrs.fr>
References: <40446952.508@ibsm.cnrs-mrs.fr>
Message-ID: <200403021304.59378.heikki@ebi.ac.uk>

Pierre,

The closest we have is a Bio::Tools::SeqPattern, but it does not do the 
pattern matching. That could be added if we decide what exactly and how 
generic we want to implement.  Ideas anyone?


The code below shows one simple approach to the problem. More advanced 
algorthms exist, but this one is simple to work with if you want to go a head 
and solve your immediate problem.

Yours,
		-Heikki

##############################################
#!/usr/bin/perl -w

use strict;

my $s= "abjabjabjbkakjkjbkajkjanakjnakjnakjnakjnakjaankjankajakjna";

my $verbose = 0;

my $length = 7;
my $pattern = '[ak]';
my $threshold = 5;


my $offset =0;
while (1) {
    my $subs = substr $s, $offset, $length;
    last unless length($subs) eq $length;

    my $subsori = $subs;
    my $c = $subs =~ tr/$pattern/1/;

    print "\t", $subsori, " $c\n" if $verbose;
    print $subsori, " $c\n" if $c >= $threshold;

    $offset++;
}
##############################################

On Tuesday 02 Mar 2004 11:00, KHOUEIRY pierre wrote:
> Hi all,
> I'm searching in bioperl for methods that can detect in a protein
> sequence a subseq rich in a special amino acids. In other way, I want to
> find _per example_ if there is subsequence of 12 aa (sliding window)
> that contains at least 7 (valin + leucine) in a given sequence of 400
> aa. I need to progress amino acid by amino acid using my sliding window
> I appreciate any help,
> Thanks

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From Anthony.Underwood at hpa.org.uk  Tue Mar  2 08:30:06 2004
From: Anthony.Underwood at hpa.org.uk (SRMD, Col - Underwood, Anthony)
Date: Tue Mar  2 08:20:55 2004
Subject: [Bioperl-l] Trace coordinates from scf files
Message-ID: <427DA33A6101484BA7A72934AF90DF73051AA817@exchange1.phls.org.uk>

Hi Bioperlers and Chad

As a follow up to my last message on the bioperl-mailing list:

I can see that the follwing methods do the following
$seq->length() returns the length in bases of the sequence held within the
scf file
$seq->trace_length() this returns the number of points for the trace data
print $seq->baseat("64") returns the base at position 64 in the sequence
print $seq->qualat("64") returns the quality at position 64 in the sequence
$seq->peak_indices() returns the peaks heights at each of the sequence
positions


however I find the $seq->trace("G") method a bit mysterious
the array returned has $seq->trace_length() elements and the values are
cumulative ( I think I may be dealing with a version 2 scf file). I
understand that the trace values should be calculated based on delta values.

something like $current_delta=$current_value-$previous_value
		$delta_delta=$current_delta-$previous_delta

This doesn't quite work. Please could anybody enlighten me on how the trace
values can be convrted into points that plotted as sequential "Y" values
would represent points on  a curve that resemble the traces when viewed in a
an application such as Chromas.

Many thanks,

Anthony

Dr Anthony Underwood
Bioinformatics Group
Genomics, Proteomic and Bioinformatics Unit
Central Public Health Laboratory
Health Protection Agency
61 Colindale Avenue
London
NW9 5HT
t:    0208 2004400 ext. 3618
f:    0208 3583138
e: anthony.underwood@hpa.org.uk


From john.herbert at clinical-pharmacology.oxford.ac.uk  Tue Mar  2 09:45:06 2004
From: john.herbert at clinical-pharmacology.oxford.ac.uk (john herbert)
Date: Tue Mar  2 09:51:03 2004
Subject: [Bioperl-l] Partial BioPerl installations
Message-ID: <s0449df7.088@gwmail.jr2.ox.ac.uk>

Hello.
Is there away to only install specific parts of BioPerl on to your
system. My problem is I have limited space on my machine and therefore
don't want to install the whole of BioPerl if I am only going to use
certain objects.

I realise some classes will have super classes and all will have to be
installed but I am assuming this will not need a full BioPerl install.

Any information on how I could install just specific classes from
BioPerl would be much appreciated, thank you.

Kind regards,

John Herbert,
From heikki at ebi.ac.uk  Tue Mar  2 11:11:26 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Tue Mar  2 11:17:17 2004
Subject: [Bioperl-l] Partial BioPerl installations
In-Reply-To: <s0449df7.088@gwmail.jr2.ox.ac.uk>
References: <s0449df7.088@gwmail.jr2.ox.ac.uk>
Message-ID: <200403021611.26589.heikki@ebi.ac.uk>

John,

This s something that have been discussed but no definite action has been 
taken. Bioperl core is still one monolithic package. However, if you really 
are tight with space there are some things you can do.

I am assuming you are using anonymous CVS access and setting PERL5LIB to point 
your perl to the bioperl modules. The whole bioperl core takes ~25M. 

1. Remove everything else that CVS and Bio subdirectories. This takes the disk 
space down to 12M. Run 'cvs update' within the Bio directory and you can keep 
up to date.  If you need to run tests, examples or read documentation, you 
can do 'cvs update' in the main directory.

2. If all you need is the basic sequence manipulation classes, you can, 
possibly, remove a number of subdirectories under Bio. For example, if you do 
not directly need  these, could try removing them:

CodonUsage
Coordinate
Das
Graphics
LiveSeq
Map
MapIO
Matrix
Ontology
Phenotype
PopGen
Tree
TreeIO
Variation

Note that I am only guessing here. If you try removing these, keep a log and 
be prepared to restore modules from cvs. If you really try this, please 
report back to the list. We'd be interested in knowing if and what works.

Yours,

	-Heikki

On Tuesday 02 Mar 2004 14:45, john herbert wrote:
> Hello.
> Is there away to only install specific parts of BioPerl on to your
> system. My problem is I have limited space on my machine and therefore
> don't want to install the whole of BioPerl if I am only going to use
> certain objects.
>
> I realise some classes will have super classes and all will have to be
> installed but I am assuming this will not need a full BioPerl install.
>
> Any information on how I could install just specific classes from
> BioPerl would be much appreciated, thank you.
>
> Kind regards,
>
> John Herbert,
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From john.herbert at clinical-pharmacology.oxford.ac.uk  Tue Mar  2 11:24:22 2004
From: john.herbert at clinical-pharmacology.oxford.ac.uk (john herbert)
Date: Tue Mar  2 11:30:47 2004
Subject: [Bioperl-l] Partial BioPerl installations
Message-ID: <s044b558.075@gwmail.jr2.ox.ac.uk>

Thanks for the info Heikki, very helpful.
One question arises, could I change the Perl library location to my own
custom location instead of PERL5LIB?

Thanks,

John.

>>> Heikki Lehvaslaiho <heikki@ebi.ac.uk> 02/03/2004 16:11:26 >>>
John,

This s something that have been discussed but no definite action has
been 
taken. Bioperl core is still one monolithic package. However, if you
really 
are tight with space there are some things you can do.

I am assuming you are using anonymous CVS access and setting PERL5LIB
to point 
your perl to the bioperl modules. The whole bioperl core takes ~25M. 

1. Remove everything else that CVS and Bio subdirectories. This takes
the disk 
space down to 12M. Run 'cvs update' within the Bio directory and you
can keep 
up to date.  If you need to run tests, examples or read documentation,
you 
can do 'cvs update' in the main directory.

2. If all you need is the basic sequence manipulation classes, you can,

possibly, remove a number of subdirectories under Bio. For example, if
you do 
not directly need  these, could try removing them:

CodonUsage
Coordinate
Das
Graphics
LiveSeq
Map
MapIO
Matrix
Ontology
Phenotype
PopGen
Tree
TreeIO
Variation

Note that I am only guessing here. If you try removing these, keep a
log and 
be prepared to restore modules from cvs. If you really try this, please

report back to the list. We'd be interested in knowing if and what
works.

Yours,

	-Heikki

On Tuesday 02 Mar 2004 14:45, john herbert wrote:
> Hello.
> Is there away to only install specific parts of BioPerl on to your
> system. My problem is I have limited space on my machine and
therefore
> don't want to install the whole of BioPerl if I am only going to use
> certain objects.
>
> I realise some classes will have super classes and all will have to
be
> installed but I am assuming this will not need a full BioPerl
install.
>
> Any information on how I could install just specific classes from
> BioPerl would be much appreciated, thank you.
>
> Kind regards,
>
> John Herbert,
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org 
> http://portal.open-bio.org/mailman/listinfo/bioperl-l 

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/ 
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org 
http://portal.open-bio.org/mailman/listinfo/bioperl-l
From john.herbert at clinical-pharmacology.oxford.ac.uk  Tue Mar  2 11:31:39 2004
From: john.herbert at clinical-pharmacology.oxford.ac.uk (john herbert)
Date: Tue Mar  2 11:37:41 2004
Subject: [Bioperl-l] Partial BioPerl installations
Message-ID: <s044b6f4.081@gwmail.jr2.ox.ac.uk>

Please ignore my last question, I looked up what the PERL5LIB variable
does. Sorry, John.

>>> "john herbert" <john.herbert@clinical-pharmacology.oxford.ac.uk>
02/03/2004 16:24:22 >>>
Thanks for the info Heikki, very helpful.
One question arises, could I change the Perl library location to my
own
custom location instead of PERL5LIB?

Thanks,

John.

>>> Heikki Lehvaslaiho <heikki@ebi.ac.uk> 02/03/2004 16:11:26 >>>
John,

This s something that have been discussed but no definite action has
been 
taken. Bioperl core is still one monolithic package. However, if you
really 
are tight with space there are some things you can do.

I am assuming you are using anonymous CVS access and setting PERL5LIB
to point 
your perl to the bioperl modules. The whole bioperl core takes ~25M. 

1. Remove everything else that CVS and Bio subdirectories. This takes
the disk 
space down to 12M. Run 'cvs update' within the Bio directory and you
can keep 
up to date.  If you need to run tests, examples or read documentation,
you 
can do 'cvs update' in the main directory.

2. If all you need is the basic sequence manipulation classes, you
can,

possibly, remove a number of subdirectories under Bio. For example, if
you do 
not directly need  these, could try removing them:

CodonUsage
Coordinate
Das
Graphics
LiveSeq
Map
MapIO
Matrix
Ontology
Phenotype
PopGen
Tree
TreeIO
Variation

Note that I am only guessing here. If you try removing these, keep a
log and 
be prepared to restore modules from cvs. If you really try this,
please

report back to the list. We'd be interested in knowing if and what
works.

Yours,

	-Heikki

On Tuesday 02 Mar 2004 14:45, john herbert wrote:
> Hello.
> Is there away to only install specific parts of BioPerl on to your
> system. My problem is I have limited space on my machine and
therefore
> don't want to install the whole of BioPerl if I am only going to use
> certain objects.
>
> I realise some classes will have super classes and all will have to
be
> installed but I am assuming this will not need a full BioPerl
install.
>
> Any information on how I could install just specific classes from
> BioPerl would be much appreciated, thank you.
>
> Kind regards,
>
> John Herbert,
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org 
> http://portal.open-bio.org/mailman/listinfo/bioperl-l 

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/ 
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org 
http://portal.open-bio.org/mailman/listinfo/bioperl-l 
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org 
http://portal.open-bio.org/mailman/listinfo/bioperl-l
From chad at dieselwurks.com  Tue Mar  2 15:40:05 2004
From: chad at dieselwurks.com (Chad Matsalla)
Date: Tue Mar  2 15:46:00 2004
Subject: [Bioperl-l] Re: Trace coordinates from scf files
In-Reply-To: <427DA33A6101484BA7A72934AF90DF73051AA817@exchange1.phls.org.uk>
References: <427DA33A6101484BA7A72934AF90DF73051AA817@exchange1.phls.org.uk>
Message-ID: <Pine.LNX.4.50.0403021104140.17512-100000@sausage.usask.ca>


> Please does anyone have experience with using the
> Bio::Seq::SequenceTrace and Bio::SeqIO::scf modules to extract peak
> values for the different traces (G,A,T and C) which can then be
> plotted using a graphics module along with the bases themselves at the
> appropriate positions. I'm a little unclear as to how the trace, peak
> indices, peak_index_at, base_at etc methods work.

Hi All,

You can do that very easily.

Check the method Bio::Seq::SequenceTrace->scf_dump(). It demonstrates
how this can be done. There is a line in t/sequencetrace that calls that
method but in the interest of brevity iof output it is commented out.

<clickety click>

Remove the pound for line 119 and you will get output like this.[1]

See also the internal method Bio::Seq::SequenceTrace->_dump_traces($transformed):

'     - if a scalar is included as an argument (any scalar), this
     procedure will dump the _delta'd trace. If you don't know what
     that means you should not be using this.'

Note that scf_dump calls _dump_traces to produce output like this.[1]

Hope that helps,

Chad Matsalla

[1]
$ make test_sequencetrace TEST_VERBOSE=1
<snip>
Count   a       c       g       t
0       0       0       0       0
1       0       0       7       0
2       0       0       28      0
3       0       0       91      0
4       0       0       203     0
5       0       0       357     0
6       0       0       525     0
7       0       0       560     0
8       0       0       525     0
9       0       0       357     0
10      0       0       203     0
11      0       0       91      0
12      0       0       28      0
13      7       0       7       0
14      28      0       0       0
15      91      0       0       0
16      203     0       0       0
17      357     0       0       0
18      525     0       0       0
19      560     0       0       0
20      525     0       0       0
21      357     0       0       0
22      203     0       0       0
23      91      0       0       0
24      28      0       0       0
25      7       0       0       7
26      0       0       0       28
27      0       0       0       91
28      0       0       0       203
29      0       0       0       357
30      0       0       0       525
31      0       0       0       560
32      0       0       0       525
33      0       0       0       357
34      0       0       0       203
35      0       0       0       91
36      0       0       0       28
...

From facemann at yahoo.com  Tue Mar  2 19:07:48 2004
From: facemann at yahoo.com (Andy Hammer)
Date: Tue Mar  2 19:13:38 2004
Subject: [Bioperl-l] RemoteBlast VS Traditional Browser Blast
Message-ID: <20040303000748.14298.qmail@web13426.mail.yahoo.com>

My RemoteBlast results are differing from my web page
blast.  Is there a way to view the HEADER values from
NCBI's Blast page so I can duplicate them with
RemoteBlast?


__________________________________
Do you Yahoo!?
Yahoo! Search - Find what you�re looking for faster
http://search.yahoo.com
From chauser at duke.edu  Tue Mar  2 19:13:06 2004
From: chauser at duke.edu (Charles Hauser)
Date: Tue Mar  2 19:18:58 2004
Subject: [Bioperl-l] Root::IO tempfile error
Message-ID: <1078272786.2378.61.camel@pandorina>

All,

I'm working on a web blast script and it appears that Bio::SeqIO is not
writing the temp file for the input sequence.
(I'm using CVS version of bioperl updated this am)

If I output $tempfile, Bio::Root::IO appears to be creating it:

	/scratch/B3ltaJkxr4

but if I look in /scratch nada, and if I let the script execute blast
and attempt to open the results($tempfile.bls.txt) I get the server
error below.

ideas?

thanks Charles


relevant snippet of code:

    use constant TEMPOUTDIR  => '/scratch';

    my $io = new Bio::Root::IO;
    my ($tfh,$tempfile) = $io->tempfile(-DIR => TEMPOUTDIR);
    my $out = new Bio::SeqIO(-fh => $tfh, -format => 'fasta');
    $out->write_seq($seq);
    $out->close();

    [snip set up blast params  ]

    my $blastexe = "$prog -i $tempfile -o $tempfile.bls.txt -d ".BLASTDIR."/$db E=$evalue $options";
    `$blastexe`;

    my $parser = new Bio::SearchIO(-file   => "$tempfile.bls.txt",
				   -format => 'blast');


------------- EXCEPTION  -------------
MSG: Could not open /scratch/B3ltaJkxr4.bls.txt: No such file or directory
STACK Bio::Root::IO::_initialize_io /usr/local/src/bioperl/core/Bio/Root/IO.pm:273
STACK Bio::Root::IO::new /usr/local/src/bioperl/core/Bio/Root/IO.pm:213
STACK Bio::SearchIO::new /usr/local/src/bioperl/core/Bio/SearchIO.pm:135
STACK Bio::SearchIO::new /usr/local/src/bioperl/core/Bio/SearchIO.pm:167
STACK toplevel /var/www/cgi-bin/BLAST/foo.pl:217
 

From Marc.Logghe at devgen.com  Wed Mar  3 03:23:33 2004
From: Marc.Logghe at devgen.com (Marc Logghe)
Date: Wed Mar  3 03:30:01 2004
Subject: [Bioperl-l] Root::IO tempfile error
Message-ID: <BEE28BF86078B6429D6C780635718E21904BE3@morelia.be.devgen.com>

Hi Charles,
You have to pass the UNLINK argument, saying you don't want to delete the file as soon as the filehandle is closed. By default it is 1.
This should work:
my ($tfh,$tempfile) = $io->tempfile(-DIR => TEMPOUTDIR, -UNLINK => 0);

HTH,
Marc

> -----Original Message-----
> From: Charles Hauser [mailto:chauser@duke.edu]
> Sent: woensdag 3 maart 2004 1:13
> To: BioPerl-List
> Subject: [Bioperl-l] Root::IO tempfile error
> 
> 
> All,
> 
> I'm working on a web blast script and it appears that 
> Bio::SeqIO is not
> writing the temp file for the input sequence.
> (I'm using CVS version of bioperl updated this am)
> 
> If I output $tempfile, Bio::Root::IO appears to be creating it:
> 
> 	/scratch/B3ltaJkxr4
> 
> but if I look in /scratch nada, and if I let the script execute blast
> and attempt to open the results($tempfile.bls.txt) I get the server
> error below.
> 
> ideas?
> 
> thanks Charles
> 
> 
> relevant snippet of code:
> 
>     use constant TEMPOUTDIR  => '/scratch';
> 
>     my $io = new Bio::Root::IO;
>     my ($tfh,$tempfile) = $io->tempfile(-DIR => TEMPOUTDIR);
>     my $out = new Bio::SeqIO(-fh => $tfh, -format => 'fasta');
>     $out->write_seq($seq);
>     $out->close();
> 
>     [snip set up blast params  ]
> 
>     my $blastexe = "$prog -i $tempfile -o $tempfile.bls.txt 
> -d ".BLASTDIR."/$db E=$evalue $options";
>     `$blastexe`;
> 
>     my $parser = new Bio::SearchIO(-file   => "$tempfile.bls.txt",
> 				   -format => 'blast');
> 
> 
> 
> 
> ------------- EXCEPTION  -------------
> MSG: Could not open /scratch/B3ltaJkxr4.bls.txt: No such file 
> or directory
> STACK Bio::Root::IO::_initialize_io 
> /usr/local/src/bioperl/core/Bio/Root/IO.pm:273
> STACK Bio::Root::IO::new 
> /usr/local/src/bioperl/core/Bio/Root/IO.pm:213
> STACK Bio::SearchIO::new 
> /usr/local/src/bioperl/core/Bio/SearchIO.pm:135
> STACK Bio::SearchIO::new 
> /usr/local/src/bioperl/core/Bio/SearchIO.pm:167
> STACK toplevel /var/www/cgi-bin/BLAST/foo.pl:217
>  
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

From michael.watson at bbsrc.ac.uk  Wed Mar  3 05:16:04 2004
From: michael.watson at bbsrc.ac.uk (michael watson (IAH-C))
Date: Wed Mar  3 05:23:59 2004
Subject: [Bioperl-l] Validate Fasta
Message-ID: <20B7EB075F2D4542AFFAF813E98ACD9302822751@cl-exsrv1.irad.bbsrc.ac.uk>

Hi

I have searched the archives and only come up with one answer, and it didn't work - I want to validate a FASTA sequence (DNA).  What I mean is that if I am given a perfect FASTA sequence, then thats ok, but if there are ANY whitespace characters, or any other characters that really shouldn't be there, I want it to throw an error.  The script below was suggested by Jason in 2002:

use Bio::SeqIO;

my $in = Bio::SeqIO->new(-file => "rubbish.fasta");
my $out = Bio::SeqIO->new(-file => ">rubbish2.fasta");

eval {
	LOOP: while( my $seq = $in->next_seq ) {
		$out->write_seq($seq);
	}

};
if( $@) {
	print "There's an Error!\n";
	goto LOOP;
}

I actually fired this at one of my scripts, a perl script that clearly wasn't a fasta sequence - it has #'s, \ts, \ns and all sorts of non DNA sequence characters.  Here is the result:

>#!/usr/bin/perl
my$backups={'mysql'="/mick/mysql/",'apache'="/res/upity/apac
he",'mwatson'="/res/upity/mwatson",'www'="/www/Docs",'ensemb
l'="/too/fools/ensembl",'cgi'="/www/cgi-bin/"};my$location="
/mick/backups";my$date=`date`;my@date=split(/\s+/,$date);my$
date=join("_",@date[0..2],$date[$#date]);print"$date\n";#whi
le(my($name,$dir)=each%{$backups}){foreach$name(qw(apachemys
qlmwatsonwwwensemblcgi)){$dir=$backups-{$name};print"tarzipp
ing$dir\n";system("/bin/tar-c$dir$location/$name.$date.tar")
;system("/bin/gzip$location/$name.$date.tar");}

This is undoubtedly a wonderfully FASTA formatted perl script, but...

Anyone?  Any ideas?

Thanks in advance for the help!

Mick
From john.herbert at clinical-pharmacology.oxford.ac.uk  Wed Mar  3 05:45:16 2004
From: john.herbert at clinical-pharmacology.oxford.ac.uk (john herbert)
Date: Wed Mar  3 05:51:25 2004
Subject: [Bioperl-l] Validate Fasta
Message-ID: <s045b74d.066@gwmail.jr2.ox.ac.uk>

Hello Michael.
Im not a BioPerl extra-ordinaire programmer (so anyone correct me if I
am wrong) but I think the -format flag should help here. 

Try 

my $in = Bio::SeqIO->new(-file => "rubbish.fasta", -format =>
'Fasta');
my $out = Bio::SeqIO->new(-file => ">rubbish2.fasta", -format =>
'Fasta');

I am pretty sure if you put this change in your code and run it on your
very nice Perl fasta sequence, it will complain. 

Kind regards,

JOhn.


>>> "michael watson (IAH-C)" <michael.watson@bbsrc.ac.uk> 03/03/2004
10:16:04 >>>
Hi

I have searched the archives and only come up with one answer, and it
didn't work - I want to validate a FASTA sequence (DNA).  What I mean is
that if I am given a perfect FASTA sequence, then thats ok, but if there
are ANY whitespace characters, or any other characters that really
shouldn't be there, I want it to throw an error.  The script below was
suggested by Jason in 2002:

use Bio::SeqIO;

my $in = Bio::SeqIO->new(-file => "rubbish.fasta");
my $out = Bio::SeqIO->new(-file => ">rubbish2.fasta");

eval {
	LOOP: while( my $seq = $in->next_seq ) {
		$out->write_seq($seq);
	}

};
if( $@) {
	print "There's an Error!\n";
	goto LOOP;
}

I actually fired this at one of my scripts, a perl script that clearly
wasn't a fasta sequence - it has #'s, \ts, \ns and all sorts of non DNA
sequence characters.  Here is the result:

>#!/usr/bin/perl
my$backups={'mysql'="/mick/mysql/",'apache'="/res/upity/apac
he",'mwatson'="/res/upity/mwatson",'www'="/www/Docs",'ensemb
l'="/too/fools/ensembl",'cgi'="/www/cgi-bin/"};my$location="
/mick/backups";my$date=`date`;my@date=split(/\s+/,$date);my$
date=join("_",@date[0..2],$date[$#date]);print"$date\n";#whi 
le(my($name,$dir)=each%{$backups}){foreach$name(qw(apachemys
qlmwatsonwwwensemblcgi)){$dir=$backups-{$name};print"tarzipp
ing$dir\n";system("/bin/tar-c$dir$location/$name.$date.tar")
;system("/bin/gzip$location/$name.$date.tar");}

This is undoubtedly a wonderfully FASTA formatted perl script, but...

Anyone?  Any ideas?

Thanks in advance for the help!

Mick
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org 
http://portal.open-bio.org/mailman/listinfo/bioperl-l
From michael.watson at bbsrc.ac.uk  Wed Mar  3 05:52:58 2004
From: michael.watson at bbsrc.ac.uk (michael watson (IAH-C))
Date: Wed Mar  3 06:02:48 2004
Subject: [Bioperl-l] Validate Fasta
Message-ID: <20B7EB075F2D4542AFFAF813E98ACD9302822754@cl-exsrv1.irad.bbsrc.ac.uk>

Thanks for youe help, but I am afraid not....

-----Original Message-----
From: john herbert
[mailto:john.herbert@clinical-pharmacology.oxford.ac.uk]
Sent: 03 March 2004 10:45
To: michael.watson@bbsrc.ac.uk; bioperl-l@portal.open-bio.org
Subject: Re: [Bioperl-l] Validate Fasta


Hello Michael.
Im not a BioPerl extra-ordinaire programmer (so anyone correct me if I
am wrong) but I think the -format flag should help here. 

Try 

my $in = Bio::SeqIO->new(-file => "rubbish.fasta", -format =>
'Fasta');
my $out = Bio::SeqIO->new(-file => ">rubbish2.fasta", -format =>
'Fasta');

I am pretty sure if you put this change in your code and run it on your
very nice Perl fasta sequence, it will complain. 

Kind regards,

JOhn.


>>> "michael watson (IAH-C)" <michael.watson@bbsrc.ac.uk> 03/03/2004
10:16:04 >>>
Hi

I have searched the archives and only come up with one answer, and it
didn't work - I want to validate a FASTA sequence (DNA).  What I mean is
that if I am given a perfect FASTA sequence, then thats ok, but if there
are ANY whitespace characters, or any other characters that really
shouldn't be there, I want it to throw an error.  The script below was
suggested by Jason in 2002:

use Bio::SeqIO;

my $in = Bio::SeqIO->new(-file => "rubbish.fasta");
my $out = Bio::SeqIO->new(-file => ">rubbish2.fasta");

eval {
	LOOP: while( my $seq = $in->next_seq ) {
		$out->write_seq($seq);
	}

};
if( $@) {
	print "There's an Error!\n";
	goto LOOP;
}

I actually fired this at one of my scripts, a perl script that clearly
wasn't a fasta sequence - it has #'s, \ts, \ns and all sorts of non DNA
sequence characters.  Here is the result:

>#!/usr/bin/perl
my$backups={'mysql'="/mick/mysql/",'apache'="/res/upity/apac
he",'mwatson'="/res/upity/mwatson",'www'="/www/Docs",'ensemb
l'="/too/fools/ensembl",'cgi'="/www/cgi-bin/"};my$location="
/mick/backups";my$date=`date`;my@date=split(/\s+/,$date);my$
date=join("_",@date[0..2],$date[$#date]);print"$date\n";#whi 
le(my($name,$dir)=each%{$backups}){foreach$name(qw(apachemys
qlmwatsonwwwensemblcgi)){$dir=$backups-{$name};print"tarzipp
ing$dir\n";system("/bin/tar-c$dir$location/$name.$date.tar")
;system("/bin/gzip$location/$name.$date.tar");}

This is undoubtedly a wonderfully FASTA formatted perl script, but...

Anyone?  Any ideas?

Thanks in advance for the help!

Mick
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org 
http://portal.open-bio.org/mailman/listinfo/bioperl-l
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l
From john.herbert at clinical-pharmacology.oxford.ac.uk  Wed Mar  3 06:10:11 2004
From: john.herbert at clinical-pharmacology.oxford.ac.uk (john herbert)
Date: Wed Mar  3 06:16:20 2004
Subject: [Bioperl-l] Validate Fasta
Message-ID: <s045bd25.080@gwmail.jr2.ox.ac.uk>

Interestingly, it also does not complain if you convert the fasta Perl
to EMBL format either :-)

ID   #!/usr/bin/perlstandard; AA; UNK; 527 BP.
XX
AC   unknown;
XX
DE   
XX
FH   Key             Location/Qualifiers
FH
XX
SQ   Sequence 527 BP; 38 A; 19 C; 5 G; 29 T; 436 other;
     my$backups ={'mysql'= "/mick/mys ql/",'apac he'="/res/ upity/apac 
      60
     he",'mwats on'="/res/ upity/mwat son",'www' ="/www/doc s",'ensemb 
     120
     l'="/too/f ools/ensem bl",'cgi'= "/www/cgi- bin/"};my$ location=" 
     180
     /mick/back ups";my$da te=`date`; my@date=sp lit(/\s+/, $date);my$ 
     240
     date=join( "_",@date[ 0..2],$dat e[$#date]) ;print"$da te\n";#whi 
     300
     le(my($nam e,$dir)=ea ch%{$backu ps}){forea ch$name(qw (apachemys 
     360
     qlmwatsonw wwensemblc gi)){$dir= $backups-{ $name};pri nt"tarzipp 
     420
     ing$dir\n" ;system("/ bin/tar-c$ dir$locati on/$name.$ date.tar") 
     480
     ;system("/ bin/gzip$l ocation/$n ame.$date. tar");}               
     527
//


>>> "michael watson (IAH-C)" <michael.watson@bbsrc.ac.uk> 03/03/2004
10:52:58 >>>
Thanks for youe help, but I am afraid not....

-----Original Message-----
From: john herbert
[mailto:john.herbert@clinical-pharmacology.oxford.ac.uk] 
Sent: 03 March 2004 10:45
To: michael.watson@bbsrc.ac.uk; bioperl-l@portal.open-bio.org 
Subject: Re: [Bioperl-l] Validate Fasta


Hello Michael.
Im not a BioPerl extra-ordinaire programmer (so anyone correct me if I
am wrong) but I think the -format flag should help here. 

Try 

my $in = Bio::SeqIO->new(-file => "rubbish.fasta", -format =>
'Fasta');
my $out = Bio::SeqIO->new(-file => ">rubbish2.fasta", -format =>
'Fasta');

I am pretty sure if you put this change in your code and run it on
your
very nice Perl fasta sequence, it will complain. 

Kind regards,

JOhn.


>>> "michael watson (IAH-C)" <michael.watson@bbsrc.ac.uk> 03/03/2004
10:16:04 >>>
Hi

I have searched the archives and only come up with one answer, and it
didn't work - I want to validate a FASTA sequence (DNA).  What I mean
is
that if I am given a perfect FASTA sequence, then thats ok, but if
there
are ANY whitespace characters, or any other characters that really
shouldn't be there, I want it to throw an error.  The script below was
suggested by Jason in 2002:

use Bio::SeqIO;

my $in = Bio::SeqIO->new(-file => "rubbish.fasta");
my $out = Bio::SeqIO->new(-file => ">rubbish2.fasta");

eval {
	LOOP: while( my $seq = $in->next_seq ) {
		$out->write_seq($seq);
	}

};
if( $@) {
	print "There's an Error!\n";
	goto LOOP;
}

I actually fired this at one of my scripts, a perl script that clearly
wasn't a fasta sequence - it has #'s, \ts, \ns and all sorts of non
DNA
sequence characters.  Here is the result:

>#!/usr/bin/perl
my$backups={'mysql'="/mick/mysql/",'apache'="/res/upity/apac
he",'mwatson'="/res/upity/mwatson",'www'="/www/Docs",'ensemb
l'="/too/fools/ensembl",'cgi'="/www/cgi-bin/"};my$location="
/mick/backups";my$date=`date`;my@date=split(/\s+/,$date);my$
date=join("_",@date[0..2],$date[$#date]);print"$date\n";#whi 
le(my($name,$dir)=each%{$backups}){foreach$name(qw(apachemys
qlmwatsonwwwensemblcgi)){$dir=$backups-{$name};print"tarzipp
ing$dir\n";system("/bin/tar-c$dir$location/$name.$date.tar")
;system("/bin/gzip$location/$name.$date.tar");}

This is undoubtedly a wonderfully FASTA formatted perl script, but...

Anyone?  Any ideas?

Thanks in advance for the help!

Mick
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org 
http://portal.open-bio.org/mailman/listinfo/bioperl-l 
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org 
http://portal.open-bio.org/mailman/listinfo/bioperl-l
From ak at ebi.ac.uk  Wed Mar  3 06:26:21 2004
From: ak at ebi.ac.uk (Andreas Kahari)
Date: Wed Mar  3 06:32:17 2004
Subject: [Bioperl-l] Validate Fasta
In-Reply-To: <s045b74d.066@gwmail.jr2.ox.ac.uk>
References: <s045b74d.066@gwmail.jr2.ox.ac.uk>
Message-ID: <20040303112621.GA601@ebi.ac.uk>

The only difference is (I think) that Michales code will use my
Bio::Tools::GuessSeqFormat module to guess that the format is
FastA (using BioPerl 1.4).

Currently, the guesser will say "fasta" if the first line of the
file matches /^>\w/ (and no other format matches for the first
line), or if any other line matches /^[A-IK-NP-Z]+$/i (and no
other format matches the same line).

I think that sequence file format validation ought to be an
optional part of Bio::SeqIO or a separate module.  I haven't
looked into Bio::SeqIO to see what goes on in there though...
My module just provides a rough guess.

Andreas

On Wed, Mar 03, 2004 at 10:45:16AM +0000, john herbert wrote:
> Hello Michael.
> Im not a BioPerl extra-ordinaire programmer (so anyone correct me if I
> am wrong) but I think the -format flag should help here. 
> 
> Try 
> 
> my $in = Bio::SeqIO->new(-file => "rubbish.fasta", -format =>
> 'Fasta');
> my $out = Bio::SeqIO->new(-file => ">rubbish2.fasta", -format =>
> 'Fasta');
> 
> I am pretty sure if you put this change in your code and run it on your
> very nice Perl fasta sequence, it will complain. 
> 
> Kind regards,
> 
> JOhn.
> 
> 
> >>> "michael watson (IAH-C)" <michael.watson@bbsrc.ac.uk> 03/03/2004
> 10:16:04 >>>
> Hi
> 
> I have searched the archives and only come up with one answer, and it
> didn't work - I want to validate a FASTA sequence (DNA).  What I mean is
> that if I am given a perfect FASTA sequence, then thats ok, but if there
> are ANY whitespace characters, or any other characters that really
> shouldn't be there, I want it to throw an error.  The script below was
> suggested by Jason in 2002:
> 
> use Bio::SeqIO;
> 
> my $in = Bio::SeqIO->new(-file => "rubbish.fasta");
> my $out = Bio::SeqIO->new(-file => ">rubbish2.fasta");
> 
> eval {
> 	LOOP: while( my $seq = $in->next_seq ) {
> 		$out->write_seq($seq);
> 	}
> 
> };
> if( $@) {
> 	print "There's an Error!\n";
> 	goto LOOP;
> }
> 
> I actually fired this at one of my scripts, a perl script that clearly
> wasn't a fasta sequence - it has #'s, \ts, \ns and all sorts of non DNA
> sequence characters.  Here is the result:
> 
> >#!/usr/bin/perl
> my$backups={'mysql'="/mick/mysql/",'apache'="/res/upity/apac
> he",'mwatson'="/res/upity/mwatson",'www'="/www/Docs",'ensemb
> l'="/too/fools/ensembl",'cgi'="/www/cgi-bin/"};my$location="
> /mick/backups";my$date=`date`;my@date=split(/\s+/,$date);my$
> date=join("_",@date[0..2],$date[$#date]);print"$date\n";#whi 
> le(my($name,$dir)=each%{$backups}){foreach$name(qw(apachemys
> qlmwatsonwwwensemblcgi)){$dir=$backups-{$name};print"tarzipp
> ing$dir\n";system("/bin/tar-c$dir$location/$name.$date.tar")
> ;system("/bin/gzip$location/$name.$date.tar");}
> 
> This is undoubtedly a wonderfully FASTA formatted perl script, but...
> 
> Anyone?  Any ideas?
> 
> Thanks in advance for the help!
> 
> Mick
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org 
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
|-][-|      Andreas K?h?ri                                |[==]|
|[--]|      EMBL, European Bioinformatics Institute       |=][=|
|-][-|      Wellcome Trust Genome Campus                  |[==]|
|[--]|      Hinxton, Cambridgeshire, CB10 1SD             |=][=|
|-][-|      United Kingdom                                |[==]|
From Richard.Adams at ed.ac.uk  Wed Mar  3 06:26:44 2004
From: Richard.Adams at ed.ac.uk (Richard Adams)
Date: Wed Mar  3 06:32:34 2004
Subject: [Bioperl-l] RemoteBlast VS Traditional Browser Blast
Message-ID: <4045C0F4.3090709@ed.ac.uk>

1. Submit your sequence to blast
2. Wait till you get the "Your request has been successfulyy submitted" 
page.
3.     Go back to the submission page- all the options are now in the URL.
I'd be interested to know what the differences are...
Is it the raw report that's different or after you've parsed them 
through SearchIO?

Richard

-- 
Dr Richard Adams
Psychiatric Genetics Group,
Medical Genetics,
Molecular Medicine Centre,
Western General Hospital,
Crewe Rd West,
Edinburgh UK
EH4 2XU

Tel: 44 131 651 1084
richard.adams@ed.ac.uk


From james.wasmuth at ed.ac.uk  Wed Mar  3 06:23:01 2004
From: james.wasmuth at ed.ac.uk (James Wasmuth)
Date: Wed Mar  3 06:33:14 2004
Subject: [Bioperl-l] Validate Fasta
In-Reply-To: <4045BFD8.50103@ed.ac.uk>
References: <s045bd25.080@gwmail.jr2.ox.ac.uk> <4045BFD8.50103@ed.ac.uk>
Message-ID: <4045C015.3070104@ed.ac.uk>

from my understanding the validator was built for speed, rather than 
complete accuracy.

It WILL fail if there is no header, but when checking the actual 
sequence it uses (or used) a number of general rules.

They were something along the lines of:

if 80% G T C A then DNA
something with U's in it is RNA
and anything else is protein.

Forgive me if I got that wrong, it was a while back that I remember this 
from.

If you want to check the sequence then write one...

for DNA:  unless ($seq=~/^[GCTA\n]+$/)   {
    print "Error!\n";
}

james


>
>
> john herbert wrote:
>
>>Interestingly, it also does not complain if you convert the fasta Perl
>>to EMBL format either :-)
>>
>>ID   #!/usr/bin/perlstandard; AA; UNK; 527 BP.
>>XX
>>AC   unknown;
>>XX
>>DE   
>>XX
>>FH   Key             Location/Qualifiers
>>FH
>>XX
>>SQ   Sequence 527 BP; 38 A; 19 C; 5 G; 29 T; 436 other;
>>     my$backups ={'mysql'= "/mick/mys ql/",'apac he'="/res/ upity/apac 
>>      60
>>     he",'mwats on'="/res/ upity/mwat son",'www' ="/www/doc s",'ensemb 
>>     120
>>     l'="/too/f ools/ensem bl",'cgi'= "/www/cgi- bin/"};my$ location=" 
>>     180
>>     /mick/back ups";my$da te=`date`; my@date=sp lit(/\s+/, $date);my$ 
>>     240
>>     date=join( "_",@date[ 0..2],$dat e[$#date]) ;print"$da te\n";#whi 
>>     300
>>     le(my($nam e,$dir)=ea ch%{$backu ps}){forea ch$name(qw (apachemys 
>>     360
>>     qlmwatsonw wwensemblc gi)){$dir= $backups-{ $name};pri nt"tarzipp 
>>     420
>>     ing$dir\n" ;system("/ bin/tar-c$ dir$locati on/$name.$ date.tar") 
>>     480
>>     ;system("/ bin/gzip$l ocation/$n ame.$date. tar");}               
>>     527
>>//
>>
>>
>>
>>  
>>
>>>>>"michael watson (IAH-C)" <michael.watson@bbsrc.ac.uk> 03/03/2004
>>>>>        
>>>>>
>>10:52:58 >>>
>>Thanks for youe help, but I am afraid not....
>>
>>-----Original Message-----
>>From: john herbert
>>[mailto:john.herbert@clinical-pharmacology.oxford.ac.uk] 
>>Sent: 03 March 2004 10:45
>>To: michael.watson@bbsrc.ac.uk; bioperl-l@portal.open-bio.org 
>>Subject: Re: [Bioperl-l] Validate Fasta
>>
>>
>>Hello Michael.
>>Im not a BioPerl extra-ordinaire programmer (so anyone correct me if I
>>am wrong) but I think the -format flag should help here. 
>>
>>Try 
>>
>>my $in = Bio::SeqIO->new(-file => "rubbish.fasta", -format =>
>>'Fasta');
>>my $out = Bio::SeqIO->new(-file => ">rubbish2.fasta", -format =>
>>'Fasta');
>>
>>I am pretty sure if you put this change in your code and run it on
>>your
>>very nice Perl fasta sequence, it will complain. 
>>
>>Kind regards,
>>
>>JOhn.
>>
>>
>>  
>>
>>>>>"michael watson (IAH-C)" <michael.watson@bbsrc.ac.uk> 03/03/2004
>>>>>        
>>>>>
>>10:16:04 >>>
>>Hi
>>
>>I have searched the archives and only come up with one answer, and it
>>didn't work - I want to validate a FASTA sequence (DNA).  What I mean
>>is
>>that if I am given a perfect FASTA sequence, then thats ok, but if
>>there
>>are ANY whitespace characters, or any other characters that really
>>shouldn't be there, I want it to throw an error.  The script below was
>>suggested by Jason in 2002:
>>
>>use Bio::SeqIO;
>>
>>my $in = Bio::SeqIO->new(-file => "rubbish.fasta");
>>my $out = Bio::SeqIO->new(-file => ">rubbish2.fasta");
>>
>>eval {
>>	LOOP: while( my $seq = $in->next_seq ) {
>>		$out->write_seq($seq);
>>	}
>>
>>};
>>if( $@) {
>>	print "There's an Error!\n";
>>	goto LOOP;
>>}
>>
>>I actually fired this at one of my scripts, a perl script that clearly
>>wasn't a fasta sequence - it has #'s, \ts, \ns and all sorts of non
>>DNA
>>sequence characters.  Here is the result:
>>
>>  
>>
>>>#!/usr/bin/perl
>>>    
>>>
>>my$backups={'mysql'="/mick/mysql/",'apache'="/res/upity/apac
>>he",'mwatson'="/res/upity/mwatson",'www'="/www/Docs",'ensemb
>>l'="/too/fools/ensembl",'cgi'="/www/cgi-bin/"};my$location="
>>/mick/backups";my$date=`date`;my@date=split(/\s+/,$date);my$
>>date=join("_",@date[0..2],$date[$#date]);print"$date\n";#whi 
>>le(my($name,$dir)=each%{$backups}){foreach$name(qw(apachemys
>>qlmwatsonwwwensemblcgi)){$dir=$backups-{$name};print"tarzipp
>>ing$dir\n";system("/bin/tar-c$dir$location/$name.$date.tar")
>>;system("/bin/gzip$location/$name.$date.tar");}
>>
>>This is undoubtedly a wonderfully FASTA formatted perl script, but...
>>
>>Anyone?  Any ideas?
>>
>>Thanks in advance for the help!
>>
>>Mick
>>_______________________________________________
>>Bioperl-l mailing list
>>Bioperl-l@portal.open-bio.org 
>>http://portal.open-bio.org/mailman/listinfo/bioperl-l 
>>_______________________________________________
>>Bioperl-l mailing list
>>Bioperl-l@portal.open-bio.org 
>>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>_______________________________________________
>>Bioperl-l mailing list
>>Bioperl-l@portal.open-bio.org
>>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>  
>>
>
>-- 
>Nematode Bioinformatics           ||
>Blaxter Nematode Genomics Group   ||
>School of Biological Sciences     ||
>Ashworth Laboratories             ||	
>King's Buildings                  ||    tel: +44 131 650 7403
>University of Edinburgh           ||    web: www.nematodes.org
>Edinburgh                         ||
>EH9 3JT                           ||
>UK                                ||	
>
>"I have not failed. I've just found 10,000 ways that don't work."
>               --- Thomas Edison
>  
>

-- 
Nematode Bioinformatics           ||
Blaxter Nematode Genomics Group   ||
School of Biological Sciences     ||
Ashworth Laboratories             ||	
King's Buildings                  ||    tel: +44 131 650 7403
University of Edinburgh           ||    web: www.nematodes.org
Edinburgh                         ||
EH9 3JT                           ||
UK                                ||	

"I have not failed. I've just found 10,000 ways that don't work."
               --- Thomas Edison

From 9904982 at sms.ed.ac.uk  Wed Mar  3 08:39:12 2004
From: 9904982 at sms.ed.ac.uk (martin)
Date: Wed Mar  3 08:48:31 2004
Subject: [Bioperl-l] Fetching > 500 sequences
In-Reply-To: <200403011427.35507.rnilsson@clarku.edu>
References: <200403011427.35507.rnilsson@clarku.edu>
Message-ID: <1078321151.9199.11.camel@adorus.cap.ed.ac.uk>

Hi, 

I've experienced a similar problem with the 500 sequence limit.  I've
not found a way round it; my solution has been to download the sequences
you want in GenBank format using SRS from the ncbi website, then open it
thus:

my $stream=Bio::SeqIO->new(-file => 'filename.genbank', -format =>
'GenBank');

#process each record in turn..

while (my $seq=$stream->next_seq()){
do_something
}

hope this helps.

Martin
On Mon, 2004-03-01 at 19:27, henrik nilsson wrote:
> Hi,
> 
> It seems that I have problems with fetching more than 500 sequences from 
> Genbank using Bioperl. It looks like the script (attached below) fetches all 
> the 7000+ sequences, but only 500 make it to the output file. Is there any 
> way to get all these 7000+ sequences written to the file - that is, is it 
> possible to sidestep the 500 seq. limit?
> 
> Thanks for your time,
> 
> Rolf
> 
> 
> 
> 
> Please find the script below. When I run it, I get
> 
> Writing accession number AJ406491
> ... etc ...
> Writing accession number AJ406489
> Writing accession number AJ406471
> Writing accession number AJ406465
> Writing accession number AJ406461
> Total number of records found = 7053
> 
> but when I type 
> 
> [rolf@localhost dir]$ cat data.gb | grep 'BASE COUNT' | wc -l
>     500
> [rolf@localhost dir]$
> 
> It is clear that only 500 seq. were written to the file.
> 
> #!/usr/bin/perl -w
> use strict;
> use Bio::DB::GenBank;
> use Bio::DB::Query::GenBank;
> use IO::String;
> use Bio::SeqIO;
> use Bio::Seq::RichSeq;
> 
> 
> my $query_string = 'Boletales';
> 
> my $query = Bio::DB::Query::GenBank->new(-db=>'nucleotide',
>                                          -query=>$query_string);
> my $out = Bio::SeqIO->new(-file=>">data.gb", -format=>'genbank');
> 
> my $count = $query->count;
> 
> my $gb = new Bio::DB::GenBank();
> 
> my $stream = $gb->get_Stream_by_query($query);
> 
> while (my $seq = $stream->next_seq) {
>         print "Writing accession number ", $seq->accession_number,"\n";
>         $out->write_seq($seq);
>         }
> 
> print "Total number of records found = $count\n";
> 
> exit;
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
Martin Jones
Blaxter Nematode Genomics Lab
ICAPB
Ashworth Labs
Kings Buildings
University of Edinburgh
Edinburgh

0131 650 6761
9904982@sms.ed.ac.uk

From tintin21public at hotmail.com  Wed Mar  3 11:16:59 2004
From: tintin21public at hotmail.com (emory)
Date: Wed Mar  3 09:28:02 2004
Subject: [Bioperl-l] 8 times longer than V_I A_G R_A??
Message-ID: <1078330619-15212@excite.com>

Find out more about the new drug that puts V I A G R A to shame!!

http://drugsbusiness.com/sv/index.php?pid=eph9106


church imagelulu dan alison cookies robinhoo olivier
dickhead barrykingdom dougie aylmer 
chevy cyrano boogie 

Get off this list by writing to http://drugsbusiness.com/sv/applepie.php

From off2w0rk at yahoo.com  Tue Mar  2 20:45:12 2004
From: off2w0rk at yahoo.com (facemann)
Date: Wed Mar  3 11:07:05 2004
Subject: [Bioperl-l] Sequence search
Message-ID: <20040303014512.67475.qmail@web41303.mail.yahoo.com>

Its not BioPerl, but you might consider PatScan.  Its a good search program written in C and I think it uses perl.  
Check it out at http://www-unix.mcs.anl.gov/compbio/PatScan/HTML/


---------------------------------
Do you Yahoo!?
Yahoo! Search - Find what you�re looking for faster.
From off2w0rk at yahoo.com  Tue Mar  2 21:03:02 2004
From: off2w0rk at yahoo.com (facemann)
Date: Wed Mar  3 11:07:06 2004
Subject: [Bioperl-l] RemoteBlast VS Traditional Browser Blast
Message-ID: <20040303020302.79270.qmail@web41310.mail.yahoo.com>

Ah... Solved the header problem.
Merely check all the option you desire on the NCBI web page and hit the "Get URL" button at the bottom of the page.   Now all the choices are shown.


---------------------------------
Do you Yahoo!?
Yahoo! Search - Find what you�re looking for faster.
From off2w0rk at yahoo.com  Tue Mar  2 23:10:15 2004
From: off2w0rk at yahoo.com (facemann)
Date: Wed Mar  3 11:07:10 2004
Subject: [Bioperl-l] RemoteBlast VS Traditional Browser Blast
Message-ID: <20040303041015.26947.qmail@web41314.mail.yahoo.com>

I have found the HEADER values and matched them but my scores still differ.  This is causing a bit of head scratching...
I have matched everything that I thought could cause a difference and a few that shouldn't make the slightest difference.
My HEADER code:
 
my @params = ( '-prog' => $prog,
         '-data' => $db,
         '-expect' => $e_val,
         '-readmethod' => 'SearchIO' );
my $factory = Bio::Tools::Run::RemoteBlast->new(@params);
$Bio::Tools::Run::RemoteBlast::HEADER{'ALIGNMENT_VIEW'} = 'Pairwise';
$Bio::Tools::Run::RemoteBlast::HEADER{'AUTO_FORMAT'} = 'Semiauto';
$Bio::Tools::Run::RemoteBlast::HEADER{'DATABASE'} = 'nr';
$Bio::Tools::Run::RemoteBlast::HEADER{'HITLIST_SIZE'} = '3';
$Bio::Tools::Run::RemoteBlast::HEADER{'EXPECT'} = '10';
$Bio::Tools::Run::RemoteBlast::HEADER{'WORD_SIZE'} = '11';
$Bio::Tools::Run::RemoteBlast::HEADER{'FORMAT_OBJECT'} = 'Alignment';
$Bio::Tools::Run::RemoteBlast::HEADER{'SERVICE'} = 'plain';
# the following line locks up the query
#$Bio::Tools::Run::RemoteBlast::HEADER{'ENTREZ_QUERY'} = 'all';
$Bio::Tools::Run::RemoteBlast::HEADER{'GET_SEQUENCE'} = 'on';
$Bio::Tools::Run::RemoteBlast::HEADER{'NCBI_GI'} = 'on';
$Bio::Tools::Run::RemoteBlast::HEADER{'PAGE'} = 'Nucleotides';
delete $Bio::Tools::Run::RemoteBlast::HEADER{'FILTER'};

print $factory->header, "\n";


---------------------------------
Do you Yahoo!?
Yahoo! Search - Find what you�re looking for faster.
From chapmanb at uga.edu  Wed Mar  3 10:51:17 2004
From: chapmanb at uga.edu (Brad Chapman)
Date: Wed Mar  3 11:07:11 2004
Subject: [Bioperl-l] Fetching > 500 sequences
In-Reply-To: <1078321151.9199.11.camel@adorus.cap.ed.ac.uk>
References: <200403011427.35507.rnilsson@clarku.edu>
	<1078321151.9199.11.camel@adorus.cap.ed.ac.uk>
Message-ID: <20040303155117.GA51845@evostick.agtec.uga.edu>

Hi Rolf, Martin;

> > It seems that I have problems with fetching more than 500 sequences from 
> > Genbank using Bioperl. It looks like the script (attached below) fetches all 
> > the 7000+ sequences, but only 500 make it to the output file. Is there any 
> > way to get all these 7000+ sequences written to the file - that is, is it 
> > possible to sidestep the 500 seq. limit?

I actually debugged and fixed this problem recently for Biopython --
it looks like a change in the way EUtils works. If you pass 'retmax'
to the eutils URL then it will only give you back at max 500
sequences, no matter what you pass for this parameter. The fix I
found that worked was to not pass 'retmax'.

The attached patch to Bio/DB/Query/GenBank.pm should fix the
problem, if similar symptoms equal similar fixes in this case. An
actual Perl/BioPerl person should look at this, though, as I'm not
to be trusted for coding Perl :-).

Hope this helps.
Brad
-------------- next part --------------
*** Bio/DB/Query/GenBank.pm.orig	Tue Sep  9 17:29:00 2003
--- Bio/DB/Query/GenBank.pm	Wed Mar  3 10:44:21 2004
***************
*** 197,203 ****
    $method = 'get';
    $base   = ESEARCH;
    push @params,('term'   => $self->query);
!   push @params,('retmax' => $self->{'_count'} || MAXENTRY);
    ($method,$base,@params);
  }
  
--- 197,204 ----
    $method = 'get';
    $base   = ESEARCH;
    push @params,('term'   => $self->query);
!   # Providing 'retmax' limits queries to 500 sequences
!   # push @params,('retmax' => $self->{'_count'} || MAXENTRY);
    ($method,$base,@params);
  }
  
From tan10 at psu.edu  Wed Mar  3 16:03:45 2004
From: tan10 at psu.edu (Dr. Todd Naumann)
Date: Wed Mar  3 11:09:45 2004
Subject: [Bioperl-l] Bio::Graphics::Glyph::translation.pm
Message-ID: <1078347825.4592.39.camel@gentiBk>

Hi, 

I am trying to use the translation glyph and I am running into problems.
When I run the program below with glyph=>'dna' I get a picture of double
stranded DNA sequence as I expected. When I try to use
glyph=>'translation' I get the following error message:

Can't locate object method "translate" via package
"Bio::SeqFeature::Generic" at
/usr/lib/perl5/site_perl/5.8.2/Bio/Graphics/Glyph/translation.pm line
134, <GEN0> line 118.

I tried to find another SeqFeature object that has a translate method
but did not have any luck. If someone could point me in the right
direction I would really appreciate it.

-Todd

#!/usr/bin/perl

use strict;
use warnings;

use Bio::Graphics;
use Bio::SeqFeature::Generic;
use Bio::SeqIO;

# initialize the graphics panel
my $panel = Bio::Graphics::Panel->new(-length => 80,
-width => 800,
-pad_left => 10,
-pad_right => 10,
);

# make the feature object
my $feature = Bio::SeqFeature::Generic->new(-start=>1,
-end=>60,
);

# create a sequence object
my $seqIO_object = Bio::SeqIO->new(-file => 'test_sequence.gb',
-format => 'genbank',
);

my $seq_object = $seqIO_object->next_seq();

# attach sequence object to feature
$feature->attach_seq($seq_object);

# add track to panel
$panel->add_track($feature,
-glyph => 'translation',
-fgcolor => 'black',
);

print $panel->png;

exit;
============================
Todd Naumann
Dept. of Chemistry
Penn State University
University Park, PA  16802

From brian_osborne at cognia.com  Wed Mar  3 12:04:19 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Wed Mar  3 12:10:13 2004
Subject: [Bioperl-l] Fetching > 500 sequences
In-Reply-To: <20040303155117.GA51845@evostick.agtec.uga.edu>
Message-ID: <GAEDKMGOKFBLJPKCLKCCOEBADIAA.brian_osborne@cognia.com>

Brad,

Thank you, I'll apply the patch in bioperl-live.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Brad Chapman
Sent: Wednesday, March 03, 2004 10:51 AM
To: bioperl
Subject: Re: [Bioperl-l] Fetching > 500 sequences

Hi Rolf, Martin;

> > It seems that I have problems with fetching more than 500 sequences from
> > Genbank using Bioperl. It looks like the script (attached below) fetches
all
> > the 7000+ sequences, but only 500 make it to the output file. Is there
any
> > way to get all these 7000+ sequences written to the file - that is, is
it
> > possible to sidestep the 500 seq. limit?

I actually debugged and fixed this problem recently for Biopython --
it looks like a change in the way EUtils works. If you pass 'retmax'
to the eutils URL then it will only give you back at max 500
sequences, no matter what you pass for this parameter. The fix I
found that worked was to not pass 'retmax'.

The attached patch to Bio/DB/Query/GenBank.pm should fix the
problem, if similar symptoms equal similar fixes in this case. An
actual Perl/BioPerl person should look at this, though, as I'm not
to be trusted for coding Perl :-).

Hope this helps.
Brad


From paeruginosa at hotmail.com  Wed Mar  3 14:51:59 2004
From: paeruginosa at hotmail.com (Andrew Walsh)
Date: Wed Mar  3 14:57:50 2004
Subject: [Bioperl-l] Small word sizes with BLAST (WU, NCBI)
Message-ID: <LAW11-F40IwCcfjW2BN0003dd23@hotmail.com>

Hello,

My question is not really related to a specific Bioperl library, so I 
apologize.  If there is a specific 'BLAST' newsgroup, I will be happy to 
post there.  But I was hoping somebody on the Bioperl list had some 
experience doing nucleic acid searches with small word sizes.

I would like to search for  small (5-7) bp matches between an oligo sequence 
and a ~100,000 mRNA database.  I've tried doing this with WU-BLAST and 
NCBI-BLAST.  NCBI-BLAST does not allow word sizes below 7, so I've tried 
lots of different command line parameters for WU-BLAST.

I've tried these searches with versions 2.0a19 (alpha) and 2.0 of WU-BLAST.

I get quite strange results when I start lowering the word size below the 
default (11).  For example, with the alpha version, I get more hits with a 
word size of 10 than I do with a word size of 7.  With the beta version, I 
get the same number of hits with word sizes 10 and 7.  I've checked this by 
hand, and the 'missing' hits do in fact have stretches of 7 continuous bps 
matching.

Here is an example of one of the command lines I've tried running:
blastn human_refseq.fasta seq3.fasta W=5 S=5 M=1 V=100000 B=100000

I've tried adjusting every parameter I thought would affect the search 
results, but still cannot  recover the 'missing' hits.

Maybe BLAST is the wrong tool for this.  I'd just like something that's 
fast.  If anyone has some advice, it would be greatly appreciated.

Thanks a lot,

Andrew

_________________________________________________________________
Add photos to your messages with MSN 8. Get 2 months FREE*.  
http://join.msn.com/?page=dept/features&pgmarket=en-ca&RU=http%3a%2f%2fjoin.msn.com%2f%3fpage%3dmisc%2fspecialoffers%26pgmarket%3den-ca

From jbedell at oriongenomics.com  Wed Mar  3 15:59:16 2004
From: jbedell at oriongenomics.com (Joseph Bedell)
Date: Wed Mar  3 16:05:21 2004
Subject: [Bioperl-l] Small word sizes with BLAST (WU, NCBI)
Message-ID: <434AF352F9D03C4C896782B8CC78BC7636B4CB@VADER.oriongenomics.com>

Hi Andrew,

I'm cross-posting your question to the Sequence Search Mailing List
(SSML). This should be a good place for a discussion of your problem.

https://bioinformatics.org/mailman/listinfo/ssml-general

Are you looking for only 5-7bp matches with no extension? How big is
your oligo? One parameter that would need adjustment is E which should
probably be set outrageously high (1e-10?). Can you share the seq3.fasta
sequence? I could try blasting against refseq too or against some
sequence that you know it should hit.

Regards,
Joey

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Joseph A Bedell, Ph.D.
Director, Bioinformatics
Orion Genomics, LLC
4041 Forest Park Ave.
St. Louis, MO 63108
Office:(314)615-6979; Fax:(314)615-6975
Mobile:(314)518-1343
http://www.oriongenomics.com
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 

>-----Original Message-----
>From: bioperl-l-bounces@portal.open-bio.org [mailto:bioperl-l-
>bounces@portal.open-bio.org] On Behalf Of Andrew Walsh
>Sent: Wednesday, March 03, 2004 1:52 PM
>To: bioperl-l@portal.open-bio.org
>Subject: [Bioperl-l] Small word sizes with BLAST (WU, NCBI)
>
>Hello,
>
>My question is not really related to a specific Bioperl library, so I
>apologize.  If there is a specific 'BLAST' newsgroup, I will be happy
to
>post there.  But I was hoping somebody on the Bioperl list had some
>experience doing nucleic acid searches with small word sizes.
>
>I would like to search for  small (5-7) bp matches between an oligo
>sequence
>and a ~100,000 mRNA database.  I've tried doing this with WU-BLAST and
>NCBI-BLAST.  NCBI-BLAST does not allow word sizes below 7, so I've
tried
>lots of different command line parameters for WU-BLAST.
>
>I've tried these searches with versions 2.0a19 (alpha) and 2.0 of
WU-BLAST.
>
>I get quite strange results when I start lowering the word size below
the
>default (11).  For example, with the alpha version, I get more hits
with a
>word size of 10 than I do with a word size of 7.  With the beta
version, I
>get the same number of hits with word sizes 10 and 7.  I've checked
this by
>hand, and the 'missing' hits do in fact have stretches of 7 continuous
bps
>matching.
>
>Here is an example of one of the command lines I've tried running:
>blastn human_refseq.fasta seq3.fasta W=5 S=5 M=1 V=100000 B=100000
>
>I've tried adjusting every parameter I thought would affect the search
>results, but still cannot  recover the 'missing' hits.
>
>Maybe BLAST is the wrong tool for this.  I'd just like something that's
>fast.  If anyone has some advice, it would be greatly appreciated.
>
>Thanks a lot,
>
>Andrew
>
>_________________________________________________________________
>Add photos to your messages with MSN 8. Get 2 months FREE*.
>http://join.msn.com/?page=dept/features&pgmarket=en-
>ca&RU=http%3a%2f%2fjoin.msn.com%2f%3fpage%3dmisc%2fspecialoffers%26pgma
rket
>%3den-ca
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l


From karplus at soe.ucsc.edu  Wed Mar  3 16:14:47 2004
From: karplus at soe.ucsc.edu (Kevin Karplus)
Date: Wed Mar  3 16:20:47 2004
Subject: [ssml] RE: [Bioperl-l] Small word sizes with BLAST (WU, NCBI)
In-Reply-To: <434AF352F9D03C4C896782B8CC78BC7636B4CB@VADER.oriongenomics.com>
	(jbedell@oriongenomics.com)
References: <434AF352F9D03C4C896782B8CC78BC7636B4CB@VADER.oriongenomics.com>
Message-ID: <200403032114.i23LElJ1014377@cheep.cse.ucsc.edu>

If searching for an exact match to a 5-mer, the approximate-match
tools are a poor choice.  You're much better off just reading the flat
file and scanning the sequence with a standard string-match algorithm.
You can probably even use the built-in regular-expression search in perl.
That should be reasonably fast for a single search.  The Bioperl
wrappers for reading the files should make this a pretty trivial
program to write, though they might make things a little too slow for
heavy-duty use.

If you need more speed, you could write a c or c++ program to do the
i/o and use the gnu regular-expression package to do the searching.

If you have many different 5-mers to search for, you could build an
index, listing for each 5-mer all the sequences that contain that 5-mer.
Building the index would take only one pass over the data and would
allow very fast lookup.  Again, one could build a prototype quickly in
perl, and reimplement in a faster language if it turns out to be
necessary. 

Kevin Karplus 	karplus@soe.ucsc.edu	http://www.soe.ucsc.edu/~karplus
life member (LAB, Adventure Cycling, American Youth Hostels)
Effective Cycling Instructor #218-ck (lapsed)
Professor of Biomolecular Engineering, University of California, Santa Cruz
Undergraduate and Graduate Director, Bioinformatics
Affiliations for identification only.


From basu at pharm.sunysb.edu  Wed Mar  3 17:45:44 2004
From: basu at pharm.sunysb.edu (Siddhartha Basu)
Date: Wed Mar  3 17:51:28 2004
Subject: [Bioperl-l] Functionality of Bio::DB::Flat module
Message-ID: <40466018.7050007@pharm.sunysb.edu>

Hi,
I am trying to create some obda compliant indexed fasta sequence file 
using the Bio::DB::Flat module. Earlier i have used the 
Bio::Index::Fasta module and provided my own id_parser using the 
callback subroutine. However, i could'nt find any way to use the same 
with Bio::DB::Flat module.
So, does the Bio::DB::Flat module lacks this functionality. Is there is 
any way to get around of this problem.


Thanks in advance.

-siddhartha

From dmb at mrc-dunn.cam.ac.uk  Wed Mar  3 17:49:11 2004
From: dmb at mrc-dunn.cam.ac.uk (Dan Bolser)
Date: Wed Mar  3 17:53:14 2004
Subject: [ssml] RE: [Bioperl-l] Small word sizes with BLAST (WU, NCBI)
In-Reply-To: <434AF352F9D03C4C896782B8CC78BC7636B4CB@VADER.oriongenomics.com>
Message-ID: <Pine.LNX.4.21.0403032244340.20036-100000@mail.mrc-dunn.cam.ac.uk>


One solution which springs to mind (but isn't yet 'off the shelf') is to
customize cd-hit a bit. 

By default cd-hit clusters sequences, but it does this using a 'words in
common' heuristic to filter sequences which are likely to be below a
certain identity threshold.

If you need heavy duty calculation, and you are OK with c / c++, modifing
cd-hit would be the best bet (and the cd-hit project is trying to attract 
developers!).

Cheers,
Dan.

On Wed, 3 Mar 2004, Joseph Bedell wrote:

> Hi Andrew,
> 
> I'm cross-posting your question to the Sequence Search Mailing List
> (SSML). This should be a good place for a discussion of your problem.
> 
> https://bioinformatics.org/mailman/listinfo/ssml-general
> 
> Are you looking for only 5-7bp matches with no extension? How big is
> your oligo? One parameter that would need adjustment is E which should
> probably be set outrageously high (1e-10?). Can you share the seq3.fasta
> sequence? I could try blasting against refseq too or against some
> sequence that you know it should hit.
> 
> Regards,
> Joey
> 
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> Joseph A Bedell, Ph.D.
> Director, Bioinformatics
> Orion Genomics, LLC
> 4041 Forest Park Ave.
> St. Louis, MO 63108
> Office:(314)615-6979; Fax:(314)615-6975
> Mobile:(314)518-1343
> http://www.oriongenomics.com
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>  
> 
> >-----Original Message-----
> >From: bioperl-l-bounces@portal.open-bio.org [mailto:bioperl-l-
> >bounces@portal.open-bio.org] On Behalf Of Andrew Walsh
> >Sent: Wednesday, March 03, 2004 1:52 PM
> >To: bioperl-l@portal.open-bio.org
> >Subject: [Bioperl-l] Small word sizes with BLAST (WU, NCBI)
> >
> >Hello,
> >
> >My question is not really related to a specific Bioperl library, so I
> >apologize.  If there is a specific 'BLAST' newsgroup, I will be happy
> to
> >post there.  But I was hoping somebody on the Bioperl list had some
> >experience doing nucleic acid searches with small word sizes.
> >
> >I would like to search for  small (5-7) bp matches between an oligo
> >sequence
> >and a ~100,000 mRNA database.  I've tried doing this with WU-BLAST and
> >NCBI-BLAST.  NCBI-BLAST does not allow word sizes below 7, so I've
> tried
> >lots of different command line parameters for WU-BLAST.
> >
> >I've tried these searches with versions 2.0a19 (alpha) and 2.0 of
> WU-BLAST.
> >
> >I get quite strange results when I start lowering the word size below
> the
> >default (11).  For example, with the alpha version, I get more hits
> with a
> >word size of 10 than I do with a word size of 7.  With the beta
> version, I
> >get the same number of hits with word sizes 10 and 7.  I've checked
> this by
> >hand, and the 'missing' hits do in fact have stretches of 7 continuous
> bps
> >matching.
> >
> >Here is an example of one of the command lines I've tried running:
> >blastn human_refseq.fasta seq3.fasta W=5 S=5 M=1 V=100000 B=100000
> >
> >I've tried adjusting every parameter I thought would affect the search
> >results, but still cannot  recover the 'missing' hits.
> >
> >Maybe BLAST is the wrong tool for this.  I'd just like something that's
> >fast.  If anyone has some advice, it would be greatly appreciated.
> >
> >Thanks a lot,
> >
> >Andrew
> >
> >_________________________________________________________________
> >Add photos to your messages with MSN 8. Get 2 months FREE*.
> >http://join.msn.com/?page=dept/features&pgmarket=en-
> >ca&RU=http%3a%2f%2fjoin.msn.com%2f%3fpage%3dmisc%2fspecialoffers%26pgma
> rket
> >%3den-ca
> >
> >_______________________________________________
> >Bioperl-l mailing list
> >Bioperl-l@portal.open-bio.org
> >http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
> _______________________________________________
> ssml-general mailing list
> ssml-general@bioinformatics.org
> https://bioinformatics.org/mailman/listinfo/ssml-general
> 

From lawso1n98 at hotmail.com  Thu Mar  4 01:15:17 2004
From: lawso1n98 at hotmail.com (dean)
Date: Thu Mar  4 01:21:09 2004
Subject: [Bioperl-l] Save up to 80% on your medications!
Message-ID: <1078380917-23422@excite.com>


Buy the drugs you need at huge discounts from Offshore Pharmacy!

http://bigdiscounts.selcydc.com/s95/index.php?id=s95


This weeks specials include great deals on:

- V i a g r a Soft Tabs - (Takes effect in a quarter of the time!)
- Super V i a g r a - (Lasts 8 times longer!)
- U l t r a m - (Pain Relief)
- S o m a - (Muscle Relaxants)
- Z y b a n - (Stop Smoking)
- P r o z a c - (Anti-Depressant)
- X a n a x - (Anti-Depressant)
- V a l i u m - (Anxiety Relief)

http://bigdiscounts.selcydc.com/s95/index.php?id=s95


pollard deliver indiana  jtwalton sstberna tulip mjkuzia thesis Golferklopfer drojek  shells  njashley sillars rock adrian  schwarz   vicky3 cvdors 
Get off this list by writing to getmeoff731@excitemail.com
From ildhdbu at ezagenda.com  Thu Mar  4 03:04:38 2004
From: ildhdbu at ezagenda.com (Franklin Wilkerson)
Date: Thu Mar  4 03:10:19 2004
Subject: [Bioperl-l] RE[2]: earn .
Message-ID: <200403038929.HMS94564@gate7.ezagenda.com>

Top quality software stripped from it's Expensive.
Extra's gives you the lowest Possible price - Guaran1eed!
http://www.goodsoftwarenow.biz/lnow/?showman
The 10 All-Time Best Software Products at Rock Bottom Prices!

-s-t-o-p-
http://goodsoftwarenow.biz/re/clean.php?seaside


----0125611139935041814--

From Richard.Adams at ed.ac.uk  Thu Mar  4 03:28:24 2004
From: Richard.Adams at ed.ac.uk (Richard Adams)
Date: Thu Mar  4 03:34:13 2004
Subject: [Bioperl-l] RemoteBlast VS Traditional Browser Blast
Message-ID: <4046E8A8.3050700@ed.ac.uk>

Can you send the output from web and scripted versions and your input 
sequence? It's harder to fix otherwise.

Richard

-- 
Dr Richard Adams
Psychiatric Genetics Group,
Medical Genetics,
Molecular Medicine Centre,
Western General Hospital,
Crewe Rd West,
Edinburgh UK
EH4 2XU

Tel: 44 131 651 1084
richard.adams@ed.ac.uk


-- 
Dr Richard Adams
Psychiatric Genetics Group,
Medical Genetics,
Molecular Medicine Centre,
Western General Hospital,
Crewe Rd West,
Edinburgh UK
EH4 2XU

Tel: 44 131 651 1084
richard.adams@ed.ac.uk


From mbatesalann at netscape.net  Thu Mar  4 06:35:36 2004
From: mbatesalann at netscape.net (mbatesalann@netscape.net)
Date: Thu Mar  4 06:34:32 2004
Subject: [Bioperl-l] REPLY SOON
Message-ID: <SERVERA1qYarBQXpXry0001613d@servera1.danis.com.cn>

Dear Friend,

As you read this, I don't want you to feel sorry for 
me, because, I believe everyone will die someday. 
My name is BATES ALAN a merchant in Dubai, in the 
U.A.E.I have been diagnosed with Esophageal cancer.
It has defiled all forms of medical treatment, and right now 
I have only about a few months to live, according to medical experts. 
I have not particularly lived my life so well, as I 
never really cared for anyone(not even myself)but my 
business. Though I am very rich, I was never 
generous, I was always hostile to people and only 
focused on my business as that was the only thing I 
cared for. But now I regret all this as I now know 
that there is more to life than just wanting to have 
or make all the money in the world. 
I believe when God gives me a second chance to come 
to this world I would live my life a different way 
from how I have lived it. Now that God has called 
me, I have willed and given most of my property 
and assets to my immediate and extended family 
members as well as a few close friends. 
I want God to be merciful to me and accept my soul 
so, I have decided to give alms to charity 
organizations, as I want this to be one of the last 
good deeds I do on earth. So far, I have distributed 
money to some charity organizations in the U.A.E, 
Algeria and Malaysia. Now that my health has 
deteriorated so badly, I cannot do this myself 
anymore. I once asked members of my family to close one 
of my accounts and distribute the money which I have 
there to charity organization in Bulgaria and 
Pakistan, they refused and kept the money to 
themselves. Hence, I do not trust them anymore, as 
they seem not to be contended with what I have left 
for them. 
The last of my money which no one knows of is the 
huge cash deposit of eighteen million dollars 
$18,000,000,00 that I have with a finance/Security Company 
abroad. I will want you to help me collect this deposit 
and dispatched it to charity organizations.
I have set aside 10% for you and for your time.
God be with you. 
BATES ALAN

From t.goetz at dkfz-heidelberg.de  Thu Mar  4 08:58:59 2004
From: t.goetz at dkfz-heidelberg.de (Thomas =?iso-8859-1?q?G=F6tz?=)
Date: Thu Mar  4 08:56:48 2004
Subject: [Bioperl-l] PubMed and UTF8
Message-ID: <200403041458.59119.t.goetz@dkfz-heidelberg.de>

a little bit offtopic maybe, but I thought this would be the best place to 
ask ;)

I want to automatically download abstracts from PubMed for database storage. 
As many author's names contain umlauts (e.g. german Umlaute ?,?,? etc.) I 
also want to catch them. Right now I'm using LWP::Simple and get($URL) to 
download articles, but umlauts aren't displayed correctly that way. 

Anybody could give me a hint on how to solve that?
Tia,
Tom

-- 
Thomas G?tz
German Cancer Research Center
Central Spectroscopic Department - B090
Im Neuenheimer Feld 280
D-69120 Heidelberg
Tel. +49-6221-424670
Fax  +49-6221-42524670
Email: t.goetz@dkfz-heidelberg.de
http://www.dkfz.de/spec/

From Nathan.Agrin at umassmed.edu  Thu Mar  4 09:54:12 2004
From: Nathan.Agrin at umassmed.edu (Agrin, Nathan)
Date: Thu Mar  4 09:59:51 2004
Subject: [Bioperl-l] Clickable Glyphs...
Message-ID: <89AA811FD79DC94788093B23DA79E71FD9AC0A@edunivmail02.ad.umassmed.edu>

I finally got Bio::Graphics to start generating some images for me.

2 quick questions I can't seem to resolve.

Is there anyway to have the description of the track placed to the right
of the track, instead of under it?

Is there a way to keep tracks from aligning on the same vertical plane?
I am trying to visualize some interpro hits and I want each type of
conserved region to exist on its own track in box form with a straight
line connecting and/or taking up the remainder of the track.

I'm hoping these are just some basic options I've over looked.

Thanks,
Nate


-----Original Message-----
From: Lincoln Stein [mailto:lstein@cshl.edu] 
Sent: Thursday, February 19, 2004 5:09 AM
To: Jonathan Greenwood; bioperl-l@bioperl.org
Subject: Re: [Bioperl-l] Clickable Glyphs...

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Get the latest CVS version of bioperl-live and read the section of the 
Bio::Graphics::Panel manual page labeled "Creating Imagemaps."  
Essentially what you need to do is to replace the section after you 
create the panel with this:

	my ($url,$map,$mapname) - $panel->image_and_map(
						-root =>
'/var/www/html',
						-url   => '/tmpimages');
	print $q->header(),$q->start_html('A Bitmap Rendering');
	print $q->img({-src=>$url,-usemap=>"#$mapname");
	print $map;
	print $q->end_html;

I'm frankly more fond of the function-oriented CGI calls, so I would 
bring in the standard functions and then:

	print header(),
		start_html('A Bitmap Rendering'),
		img({-src=>$url,-usemap=>"#$mapname"),
		$map,
		end_html();

Lincoln

On Wednesday 18 February 2004 05:53 pm, Jonathan Greenwood wrote:
> Hi, I've submitted my code with the email, what I'm trying to do is
> to render a Genbank file as a png file, I need to make each glyph
> clickable(I'm also displaying this page online)...any help with the
> new changes to Bio::Graphics::Panel would be appreciated...many
> thanks...
>
> Sincerely,
>
> Jonathan Greenwood
> email: jonathon@mgcheo.med.uottawa.ca
>
> code:
> #! /usr/local/bin/perl -wT
>
> use strict;
> use Bio::Graphics;
> use Bio::SeqIO;
> use Bio::SeqFeature::Generic;
> use CGI;
> use CGI::Pretty;
>
> my $file = 'x65306.gb';
> my $io = Bio::SeqIO->new(-file=>$file);
> my $seq = $io->next_seq;
> my $wholeseq = Bio::SeqFeature::Generic->new(-start=>1,
>
> -end=>$seq->length);
> my @features = $seq->all_SeqFeatures;
> my $q = new CGI;
>
> # sort features by their primary tags
> my %sorted_features;
> for my $f (@features) {
>   my $tag = $f->primary_tag;
>   push @{$sorted_features{$tag}},$f;
> }
>
> print $q->header( 'text/html' );
> print $q->start_html('A Vector Rendering');
>
> my $panel = Bio::Graphics::Panel->new(-length      => $seq->length,
> 				      -width       => 1000,
> 				      -pad_left    => 10,
> 				      -pad_right   => 10,
> 				      -key_color   => 'white',
> 				      -key_spacing => 15,
> 				      -key_style   => 'bottom',
> 				      -spacing     => -0.25,
> 				      -box_subparts => 'true'
> 				      );
>
> my ($url,$map,$mapname) = $panel->image_and_map(-root =>
> '/webfiles/cgi-bin',
> 						-url  => '/tmpimages',
> 					       );
>
> $panel->add_track($wholeseq,
> 		  -glyph  => 'arrow',
> 		  -bump   => +1,
> 		  -double => 1,
> 		  -tick   => 2
> 	          );
>
> $panel->add_track($wholeseq,
> 		  -glyph   => 'generic',
> 		  -bgcolor => 'purple',
> 		  -height  => 12,
> 		  -key     => 'Whole Sequence',
> 		  -title   => 'Whole Sequence'
> 		  );
>
> # special feature
> if ($sorted_features{CDS}) {
>   $panel->add_track($sorted_features{CDS},
> 		    -glyph          => 'transcript2',
> 		    -bgcolor        => 'orange',
> 		    -bump           =>  +1,
> 		    -height         => 12,
> 		    -key            => 'CDS',
> 		    -label          => \&gene_label,
> 		    -title          => 'CDS',
> 		    -link           => 'feature1.html#CDS'
> 		    );
>   delete $sorted_features{'CDS'};
> }
>
> #general case
> my @colors = qw(wheat blue yellow green cyan chartreuse magenta
> gray); my $idx    = 0;
> for my $tag (sort keys %sorted_features) {
> my $features = $sorted_features{$tag};
> $panel->add_track($features,
> 		  -glyph        =>  'generic',
> 		  -bgcolor      =>  $colors[$idx++ % @colors],
> 		  -fgcolor      =>  'black',
> 		  -font2color   => 'red',
> 		  -key          => "${tag}s",
> 		  -bump         => +1,
> 		  -height       => 12,
>                   -label        => \&gene_label,
> 		  -description  => \&generic_description,
> 		  -title        => \&gene_label,
> 		  -link         => 'feature1.html#$tag',
> 		  );
> }
>
> print $q->img({-src=>$url,-usemap=>"#$mapname"});
> print $q->$map;
> print $q->($panel->png);
>
> print $q->exit_html;
>
> exit;
>
>   sub gene_label {
>      my $feature = shift;
>      my @notes;
>      foreach (qw(product gene)) {
>        next unless $feature->has_tag($_);
>        @notes = $feature->each_tag_value($_);
>        last;
>     }
>     $notes[0];
>   }
>
>   sub generic_description {
>     my $feature = shift;
>     my $description;
>     foreach ($feature->all_tags) {
>       my @values = $feature->each_tag_value($_);
>       $description .= $_ eq 'note' ? "@values" : "$_=@values; ";
>     }
>     $description =~ s/; $//; # get rid of last
>     $description;
>   }
>
> _________________________________________________________________
> The new MSN 8: smart spam protection and 2 months FREE*
> http://join.msn.com/?page=features/junkmail
> http://join.msn.com/?page=dept/bcomm&pgmarket=en-ca&RU=http%3a%2f%2
>fjoin.msn.com%2f%3fpage%3dmisc%2fspecialoffers%26pgmarket%3den-ca
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

- -- 
Lincoln D. Stein
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.1 (GNU/Linux)

iD8DBQFANIss0CIvUP7P+AkRAmh/AJ9SaY4MIZPS5vW5gE5xzaw7AzrjaQCdHJdE
S+2+MS2vScLrVTd+C3V4mME=
=MBei
-----END PGP SIGNATURE-----
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l

From parrot21orchid at hotmail.com  Thu Mar  4 23:41:27 2004
From: parrot21orchid at hotmail.com (johnnie)
Date: Thu Mar  4 10:46:22 2004
Subject: [Bioperl-l] This Drug puts VlAGRA to shame!!
Message-ID: <1078461687-28857@excite.com>

The Biggest New Drug since V1agra! Many times as powerful.

http://newmedformula.com/sv/index.php?pid=eph9106

-A quicker more stable erection
-More enjoyable sex for both
-Longer sex
-Known to add length to you erection
-Lasts up to 36 hours (not a thrity-six hour erection, but enhancement for thirty-six)

We have it at a discounted savings. Save when you go through our site on all your orders.

http://newmedformula.com/sv/index.php?pid=eph9106


playboy dashatrumpet pearl bmw playboy paula cuddles 
ladybug cosmosmookie sting1 surf
asdfghjk hornet abcdef 

Get off this list go to http://newmedformula.com/sv/applepie.php
From allenday at ucla.edu  Thu Mar  4 12:50:00 2004
From: allenday at ucla.edu (Allen Day)
Date: Thu Mar  4 12:55:51 2004
Subject: [Bioperl-l] PubMed and UTF8
In-Reply-To: <200403041458.59119.t.goetz@dkfz-heidelberg.de>
Message-ID: <Pine.LNX.4.44.0403040948300.12368-100000@sumo.genetics.ucla.edu>

have you tried using Bio::DB::Biblio::eutils in bioperl-live?  that pulls
pubmed info from NCBI's eutils utilities directly.  i'm not sure if it
handles umlautes any better than the ebi's soap biblio server, but it may
be worth a look...

-allen

On Thu, 4 Mar 2004, Thomas G?tz wrote:

> a little bit offtopic maybe, but I thought this would be the best place to 
> ask ;)
> 
> I want to automatically download abstracts from PubMed for database storage. 
> As many author's names contain umlauts (e.g. german Umlaute ?,?,? etc.) I 
> also want to catch them. Right now I'm using LWP::Simple and get($URL) to 
> download articles, but umlauts aren't displayed correctly that way. 
> 
> Anybody could give me a hint on how to solve that?
> Tia,
> Tom
> 
> 

From ahammer at genetics.utah.edu  Thu Mar  4 13:00:26 2004
From: ahammer at genetics.utah.edu (andy hammer)
Date: Thu Mar  4 13:06:14 2004
Subject: [Bioperl-l] RemoteBlast VS Traditional Browser Blast
Message-ID: <001501c40212$93304ea0$4696659b@WSP>

SOLVED.
I am submitting no code because the issue was with the "keyboard to floor
interface".  :-(
Once my head was on straight both BLAST and RemoteBlast.pm work perfectly.
Thank you for your time and suggestions.

Andy Hammer
University of Utah
Dept. of Human Genetics
Gesteland & Atkins Lab

From sjmiller at email.arizona.edu  Thu Mar  4 13:13:21 2004
From: sjmiller at email.arizona.edu (Susan J. Miller)
Date: Thu Mar  4 13:19:09 2004
Subject: [Bioperl-l] Broken link in bptutorial
Message-ID: <404771C1.8060302@email.arizona.edu>

In the BioPerl Tutorial Introduction Overview section 
http://www.bioperl.org/Core/Latest/bptutorial.html#i._introduction the 
link http://bioperl.org/Core/Latest/bioscripts.html does not work 
because there is punctuation included in the URL.

-- 
Regards,
-susan

Susan J. Miller
Biotechnology Computing Facility
Arizona Research Laboratories
Bio West 228
University of Arizona
Tucson, AZ  85721
(520) 626-2597

From Annie.Law at nrc-cnrc.gc.ca  Thu Mar  4 15:16:37 2004
From: Annie.Law at nrc-cnrc.gc.ca (Law, Annie)
Date: Thu Mar  4 15:22:36 2004
Subject: [Bioperl-l] Bioperl-DB error
Message-ID: <10C94843061E094A98C02EB77CFC328722FE0F@nrcmrdex1d.imsb.nrc.ca>

Hi Hilmar,

I would appreciate help on the following.  Previously, I had errors with the
make tests of both Bioperl-DB And Bioperl-1.4.  I went to CVS bioperl-live
and dowloaded again the Head branch of bioperl-db and bioperl-schema. As
well from bioperl-live I downloaded the bioperl-1.4 from branch 1.4. 

I have bioperl installed in two places.  I think it should be okay since the
Errors that I am receiving have reduced.  I not sure how to get rid of the
initial install or if it matters.

1) When I performed the make test for bioperl-db all of the tests passed but
there seems to be one warning and I am concerned That this will affect what
I want to do which is to load unigene information into the annotation
database that I am creating With the perl script load_seqdatabase.pl. Should
I be concerned about this warning?


[root@ bioperl-db]# make test PERL_DL_NONLAZY=1 /usr/bin/perl
"-MExtUtils::Command::MM" "-e" "test_harness(0, 'blib/lib', 'blib/arch')"
t/*.t t/cluster.......
 

t/cluster.......ok 1/160
 

t/cluster.......ok 2/160
 

t/cluster.......ok 3/160
 

t/cluster.......ok 4/160
 

t/cluster.......ok 5/160
 

t/cluster.......ok 6/160
 

t/cluster.......ok 7/160
 

etc...
 

t/cluster.......ok 154/160
 

t/cluster.......ok 155/160
-------------------- WARNING ---------------------
MSG: failed to store one or more child objects for an instance of class
Bio::Cluster::UniGene (PK=1)
---------------------------------------------------

 
t/cluster.......ok 160/160
 

t/cluster.......ok
t/comment.......
 

All tests successful.
Files=15, Tests=930, 93 wallclock secs (31.64 cusr +  1.26 csys = 32.90 CPU)

2) I started with an empty database and I loaded the NCBI taxonomy into the
database, then the GO information then locuslink (I used
The --safe and --lookup options and did not get any errors).  Loading
locuslink information to half a day to a day.

Next I loaded unigene information and it is taking three days and is still
not finished?! (I meant to use the option --mergeobjs but forgot when I ran
the script but I don't think that this will make a difference in terms of
execution time) Something strange that is happening is that when I use
mysqlcc to refresh what the current state of the database is the number of
bioentries increases and then decreases.  However the general overall trend
is that the number of bioentries is increasing. Is this correct or is
something wrong happening.
I am wondering if loading of unigene information is affected by the warning
I got from the make test in bioperl-db
MSG: failed to store one or more child objects for an instance of class
Bio::Cluster::UniGene (PK=1).  If so, I am not sure how to
Resolve the problem.

3) On a related note every time including the initial time that I load the
database with load_ontology.pl, or load_seqdatbase.pl I use the option
lookup.  I want to create annotation database so I am loading first with
NCBI taxonomy databse, then GO, then locuslink, then unigene.  I want find
out the annotation for some clones from ESTs.  I think I only need to use
the mergobjs option when I load the unigene information?
I read the documentation of mergobjs in the load_seqdatbase.pl script but
would appreciate an example of how this works.

4) Also after I intalled Bioperl-1.4 again I decreased the number of failed
tests but still there Where 3 test failures and I am not sure if I should be
concerned about these.

t/AlignIO....................ok 75/80Bio::AlignIO: largemultifasta cannot be
found Exception 
------------- EXCEPTION  -------------
MSG: Failed to load module Bio::AlignIO::largemultifasta. Can't locate
Bio/Seq/LargeLocatableSeq.pm in @INC (@INC contains: t
/usr/lib/bioperl-live/blib/lib /usr/lib/bioperl-live/blib/arch
/usr/lib/perl5/5.8.0/i386-linux-thread-multi /usr/lib/perl5/5.8.0
/usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.0 /usr/lib/perl5/site_perl
/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.0 /usr/lib/perl5/vendor_perl .) at
/usr/lib/bioperl-live/blib/lib/Bio/AlignIO/largemultifasta.pm line 69,
<GEN32> line 28. BEGIN failed--compilation aborted at
/usr/lib/bioperl-live/blib/lib/Bio/AlignIO/largemultifasta.pm line 69,
<GEN32> line 28. Compilation failed in require at
/usr/lib/bioperl-live/blib/lib/Bio/Root/Root.pm line 394, <GEN32> line 28.

STACK Bio::Root::Root::_load_module
/usr/lib/bioperl-live/blib/lib/Bio/Root/Root.pm:396
STACK (eval) /usr/lib/bioperl-live/blib/lib/Bio/AlignIO.pm:420
STACK Bio::AlignIO::_load_format_module
/usr/lib/bioperl-live/blib/lib/Bio/AlignIO.pm:419
STACK Bio::AlignIO::new /usr/lib/bioperl-live/blib/lib/Bio/AlignIO.pm:347
STACK toplevel t/AlignIO.t:283

--------------------------------------

For more information about the AlignIO system please see the AlignIO docs.
This includes ways of checking for formats at compile time, not run time
Can't call method "next_aln" on an undefined value at t/AlignIO.t line 285,
<GEN32> line 28.

 
t/AlignIO....................dubious
	Test returned status 2 (wstat 512, 0x200)
DIED. FAILED tests 76-80
	Failed 5/80 tests, 93.75% okay
t/AlignStats.................


-the other error I got was
 
t/GeneCoordinateMap
per.......ok 70/113
 

t/GeneCoordinateMapper.......ok 71/113
-------------------- WARNING ---------------------
MSG: sorted sublocation array requested but root location doesn't define
seq_id (at least one sublocation does!)
---------------------------------------------------

-------------------- WARNING ---------------------
MSG: sorted sublocation array requested but root location doesn't define
seq_id (at least one sublocation does!)
---------------------------------------------------

 
t/GeneCoordinateMapper.......ok 72/113
 
t/GeneCoordinateMapper.......ok 73/113
 

t/GeneCoordinateMapper.......ok 74/113
 

---the next error was

                                      
t/tutorial...................ok 9/21
 

t/tutorial...................ok 10/21
------------- EXCEPTION  -------------
MSG: WebDBSeqI Request Error:
HTTP/1.1 500 Internal Server Error
Connection: close
Date: Mon, 23 Feb 2004 15:59:50 GMT
Via: 1.1 eutils.ncbi.nih.gov
Server: Apache
Content-Type: text/html
Client-Date: Mon, 23 Feb 2004 16:05:33 GMT
Client-Response-Num: 1
Client-Transfer-Encoding: chunked
X-Cache: MISS from eutils.ncbi.nih.gov


<HTML>
<HEADER>
<TITLE>NCBI/xpubmed3 - WWW Error 500 Diagnostic</TITLE> </HEADER> <BODY>
<H1>Server Error</H1>


Your request could not be processed due to a problem on
our Web server.  This could be a transient problem, please
try the query again.  If it doesn't clear up within a reasonable period of
time, e-mail a short description of your query and the diagnostic
information shown below to:<BR> <BR> <CENTER><DT>pubmed@nlm.nih.gov - for
problems with PubMed<BR> info@ncbi.nlm.nih.gov - for problems with other
services<BR> </CENTER> <BR> Thank you for your assistance.  We will try to
fix the problem as soon as possible. <HR> Diagnostic Information: <UL>
<DT>Error: 500

<DT>URL:
http://eutils.ncbi.nlm.nih.gov/entrez/viewer.fcgi?WebEnvRq=1&save=efetch&db=
nucleotide&val=gb%7CAF303112&retmode=text&dopt=gb&tmpl=__reset&tool=bioperl&
usehistory=n

<DT>User agent: bioperl-Bio_DB_GenBank/1.4

<DT>Client: 130.14.29.13

<DT>Server: xpubmed3

<DT>Time: Mon Feb 23 11:00:20 EST 2004

 </UL>
<P>
NOTE: The above is an internal URL which may differ from the one you used to
address the page. <HR> Rev. 05/09/00

</BODY>
</HTML>


STACK Bio::DB::WebDBSeqI::_stream_request Bio/DB/WebDBSeqI.pm:728 STACK
Bio::DB::WebDBSeqI::get_seq_stream Bio/DB/WebDBSeqI.pm:460 STACK
Bio::DB::NCBIHelper::get_Stream_by_acc Bio/DB/NCBIHelper.pm:415 STACK
Bio::DB::WebDBSeqI::get_Seq_by_acc Bio/DB/WebDBSeqI.pm:181 STACK
Bio::DB::GenBank::get_Seq_by_acc Bio/DB/GenBank.pm:216 STACK main::__ANON__
/usr/lib/bioperl-live/blib/lib/bptutorial.pl:2920
STACK main::run_examples /usr/lib/bioperl-live/blib/lib/bptutorial.pl:4144
STACK toplevel t/tutorial.t:23

--------------------------------------

-------------------- WARNING ---------------------
MSG: acc (gb|AF303112) does not exist
---------------------------------------------------
Can't call method "display_id" on an undefined value at
/usr/lib/bioperl-live/blib/lib/bptutorial.pl line 2921.

 
t/tutorial...................dubious
	Test returned status 255 (wstat 65280, 0xff00)
DIED. FAILED tests 11-21
	Failed 11/21 tests, 47.62% okay

Failed Test  Stat Wstat Total Fail  Failed  List of Failed
----------------------------------------------------------------------------
---
t/AlignIO.t     2   512    80    5   6.25%  76-80
t/DB.t                     78   ??       %  ??
t/tutorial.t  255 65280    21   11  52.38%  11-21
122 subtests skipped.
Failed 3/180 test scripts, 98.33% okay. 7/8305 subtests failed, 99.92% okay.


5) I have been loading the database with the following command. I would like
to know if the format option should be SeqIO:locuslink or should it simply
be locuslink? 

Perl /root/bioperl-db/scripts/biosql/load_seqdatabase.pl --dbuser=root
--dbpass=ms22a --dbname=annotatedata --namespace LocusLink 
--format SeqIO::locuslink /var/lib/mysql/LL_tmpl


thanks very much,
Annie.

From barry.moore at genetics.utah.edu  Thu Mar  4 15:30:49 2004
From: barry.moore at genetics.utah.edu (Barry Moore)
Date: Thu Mar  4 15:36:39 2004
Subject: [Bioperl-l] Getting values from a GenBank File
Message-ID: <404791F9.2030103@genetics.utah.edu>

While learning BioPerl, I've spent a good bit of time trying to figure 
out where some bit of information in a GenBank file might be tucked away 
in that RichSeq object, and how the hell to get at it.  I finally 
decided it was time to sift through that object and find out what's in 
there and where it is.  Along the way I wrote down  what I found, and 
then added a bit to it to try and turn it into something that I would 
like to have found when I first started learning Bioperl.  The result is 
a "GeneBank-file-centric" crib sheet for Bioperl.  The document is 
centered around a "standard" GenBank file where each seperate item of 
information is marked-up with  numbers and codes that refer to 
information later in the document about how to get at that particular 
value in using BioPerl.  It also includes a script at the end that can 
be modified easily to get at all the major stuff in the GenBank file.  
This document is aimed at beginners to BioPerl, but assumes a working 
knowledge of Perl.  The file is, of course, free to be used by anyone in 
anyway that they wish.  I'd appreciate any corrections or improvements 
if you find errors or shortcomings.
 
Barry
-- 

Barry Moore
Dept. of Human Genetics
University of Utah
Salt Lake City, UT

-----------------------------------------------------------------------------------

                 How to get values in a GenBank file with BioPerl

This is a guide to accessing the value of each item in a GenBank file 
with BioPerl.
Underneath each value in the sample GenBank file there is a number and a 
3-letter
code.  The 3-letter code represents the object where the associated 
value is stored.
Table 1 at the end of this file gives the object for each code.  By 
looking up the
numbers in Table 2 at the end of this file you will find the object 
where this
information can be called from, and the method to get (and usually set) 
the value.
Some values may be stored by, and accessible by methods in more than one 
object.
There is perl code at the end that you can use as a starting point to 
get at all of
the objects and their methods.
-----------------------------------------------------------------------------------
LOCUS       NM_079543               1821 bp    mRNA    linear   INV 
10-DEC-2003
            1-BSR                   2-BPS      3-BSR   4-BSR  5-BSR 6-BSR
DEFINITION  Drosophila melanogaster alpha-Esterase-3 CG1257-PA (alpha-Est3)
            mRNA, complete cds.
            7-BPS
ACCESSION   NM_079543 XX_123456
            8-BPS     9-BSR
VERSION     NM_079543.2  GI:24644853
                      |     |
                    10-BPS  11-BSR
KEYWORDS    Esterases
            12-BSR
SOURCE      Drosophila melanogaster (fruit fly)
            13-BSC
  ORGANISM  Drosophila melanogaster
            14-BSC     15-BSC
            Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;
            Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
            Ephydroidea; Drosophilidae; Drosophila.
            16-BSC
REFERENCE   1  (bases 1 to 1821)
                    17-BAR 18-BAR
  AUTHORS   Adams,M.D. et al.(additional authors deleted for brevity)
            19-BAR
  TITLE     The genome sequence of Drosophila melanogaster
            20-BAR
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
            21-BAR
  MEDLINE   20196006
            22-BAR
   PUBMED   10731132
            23-BAR
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to 
final
            NCBI review. This record is derived from an annotated genomic
            sequence (NT_033777). The reference sequence was derived from
            CG1257-RA.
            On Nov 6, 2002 this sequence version replaced gi:17737826.
            24-BAC
FEATURES             Location/Qualifiers
     source          1..1821
                25-BLS  26-BLS
                     /organism="Drosophila melanogaster"
                     27-BSG
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="3R"
                     /note="genotype: y[1]; cn[1] bw[1] sp[1]; Rh6[1]"
     gene            1..1821
                     /gene="alpha-Est3"
                     /locus_tag="CG1257"
                     /note="alpha-Esterase-3; synonyms: B, aE3, CG1257,
                     alphaE3, alpha-Ests, fragment B;
                     go_function: carboxylesterase activity [goid 0004091]
                     [evidence NAS]"
                     /map="84D9-84D9"
                     /db_xref="FLYBASE:FBgn0015571"
                     /db_xref="GeneID:40907"
                     /db_xref="LocusID:40907"
     CDS             119..1750
                     /gene="alpha-Est3"
                     /locus_tag="CG1257"
                     /EC_number="3.1.1.1"
                     /note="alpha-Est3 gene product"
                     /codon_start=1
                     /product="alpha-Esterase-3 CG1257-PA"
                     /protein_id="NP_524267.2"
                     /db_xref="GI:24644854"
                     /db_xref="FLYBASE:FBgn0015571"
                     /db_xref="GeneID:40907"
                     /db_xref="LocusID:40907"
                     
/translation="MESLQVNTTSGPVLGKQCTGVYGDEYVSFERIPYAQPPVGHLRF
                     
MAPLPVEPWSQPLDCTKPGQKPLQFNHYSKQLEGVEDCLYLNVYAKELDSPRPLPLIV
                     
FFFGGGFEKGDPTKELHSPDYFMMRDVVVVTVSYRVGPLGFLSLNDPAVGVPGNAGLK
                     
DQLLAMEWIKENAERFNGDPKNVTAFGESAGAASVHYLMLNPKAEGLFHKAILQSGNV
                     
LCSWALCTIKNLPHRLAVNLGMESAEHVTDAMVLDFLQKLPGEKLVRPYLLSAEEHLD
                     
DCVFQFGPMVEPYKTEHCALPNHPQELLDKAWGNRIPVLMSGTSFEGLLMYARVQMAP
                     
YLLTSLKKEPEHMLPLDVKRNLPQALARHLGQRLQETHFGGNDPSAMSPESLKAYCEY
                     
ASYKVFWHPILKTLRSRVKSSSASTYLYRFDFDSPTFNHQRLKYCGDKLRGVAHVDDH
                     
SYLWYGDFSWKLDKHTPEFLTIERMIDMLTSFARTSNPNCKLIQDQLPRAKEWKPLNS
                     KSALECLNISENIKMMELPELQKLRVWESVCQSTG"
ORIGIN
        1 gtactatggc aaatggtact atgggcagtc gcgataataa taataaatga ggctttaaat
       61 gtttttccac tacaagataa aaagttacga gtgcgcaccg agcatttagt agacgaacat
          28-BPS
... (additional sequence deleted for brevity)
//
-----------------------------------------------------------------------------------
Table 1. Object Key
BPS - Bio::PrimarySeq
BSR - Bio::Seq::RichSeq
BSC - Bio::Species
BAC - Bio::Annotation::Comment
BAR - Bio::Annotation::Reference
BLS - Bio::Location::Simple
BSG - Bio::SeqFeature::Generic
-----------------------------------------------------------------------------------
Table 2. Usage key
1.   Bio::Seq::RichSeq           $scalar = $self->display_name() or 
->display_id()
2.   Bio::PrimarySeq             $scalar = $self->length()
3.   Bio::Seq::RichSeq           $scalar = $self->molecule()
4.   Bio::PrimarySeq             $scalar = $self->is_circular
5.   Bio::Seq::RichSeq           $scalar = $self->division()
6.   Bio::seq::Richseq           @array  = $self->get_dates
7.   Bio::PrimarySeq             $scalar = $self->description()
8.   Bio::PrimarySeq             $scalar = $self->accession_number()
     Bio::Seq::RichSeq           $scalar = $self->accession()
9.   Bio::Seq::RichSeq           @array  = $self->get_secondary_accessions
10.  Bio::PrimarySeq             $scalar = $self->version()
     Bio::Seq::RichSeq           $scalar = $self->seq_version()
11.  Bio::PrimarySeq             $scalar = $self->primary_id()
12.  Bio::Seq::RichSeq           @array  = $self->get_keywords
13.  Bio::Species                $scalar = $self->common_name()
14.  Bio::Species                $scalar = $self->genus()
15.  Bio::Species                $scalar = $self->species()
16.  Bio::Species                @array  = $self->classification()
17.  Bio::Annotation::Reference  $scalar = $self->start()
18.  Bio::Annotation::Reference  $scalar = $self->end()
19.  Bio::Annotation::Reference  $scalar = $self->authors()
20.  Bio::Annotation::Reference  $scalar = $self->title()
21.  Bio::Annotation::Reference  $scalar = $self->location()
22.  Bio::Annotation::Reference  $scalar = $self->medline()
23.  Bio::Annotation::Reference  $scalar = $self->pubmed()
24.  Bio::Annotation::Comment    $scalar = $self->text() or ->as_text
25.  Bio::SeqFeature::Generic    $scalar = $self->start()
26.  Bio::SeqFeature::Generic    $scalar = $self->end()
27.  Bio::SeqFeature::Generic

Get all features from the features table using a variation on the code 
below.
You will need to vary the primary_tag (e.g. source, gene, CDS) and the
get_tag_values (e.g. organism, db_xref, chromosome, note, gene, locus_tag,
go_function, map, EC_number, codon_start, protein_id, etc.).  If there 
is more
than one primary tag associated with a give value (e.g. more than one 
gene) then
you will have to step through the @source_feats array to find what you 
want rather
than simply shifting the first scalar off the top.

my @source_feats = grep { $_->primary_tag eq 'source' } 
$seq->get_SeqFeatures();
my $source_feat = shift @source_feats;
my @mol_type = $source_feat->get_tag_values('mol_type');

28.    Bio::PrimarySeq    $scalar =    $self->seq()
-----------------------------------------------------------------------------------
#!/usr/bin/perl

use strict;
use warnings;
use Bio::SeqIO;
use Bio::DB::GenBank; #use Bio::DB::GenPept or Bio::DB::RefSeq if needed

#Get some sequence IDs either like below, or read in from a file.  Note that
#this sample script works with the accession numbers below (at least at 
the time
#it was written).  If you add different accession numbers, and you get 
errors,
#you may be calling for something that the sequence doesn't have.  
You'll have
#to add your own error trapping code to handle that.
my @ids = ('U59228', 'AB039327', 'BC035972');

#Create the GenBank database object to read from the database.
my $gb = new Bio::DB::GenBank();

#Create a sequence stream to pass the sequences from the database to the 
program.
my $seqio = $gb->get_Stream_by_id(\@ids);

#Loop over all of the sequences that you requested.
while (my $seq = $seqio->next_seq) {

  #Here is how you get methods directly from the RichSeq object.  Replace
  #'display_name' with any other method in Table 2. that can be called on
  #either the RichSeq object directly, or the PrimarySeq object which it has
  #inherited.
  print $seq->display_name,"\n";

  #Here is how to access the classification data from the species object.
  my $species = $seq->species;
  print $species->common_name,"\n";
  my @class = $species->classification;
  print "@class\n";

  #Here is a general way to call things that are stored as a 
Bio::SeqFeature::
  #Generic object.  Replace 'source' with any other of the "major" 
headings in
  #the feature table (e.g gene, CDS, etc.) and replace 'organism' with 
any of
  #the tag values found under that heading (mol_type, locus_tag, gene, etc.)
  my @source_feats = grep { $_->primary_tag eq 'source' } 
$seq->get_SeqFeatures();
  my $source_feat = shift @source_feats;
  my @mol_type = $source_feat->get_tag_values('mol_type');
  print "@mol_type\n";

  #Here is a general way to call things that are stored as some type of a
  #Bio::Annotation oject.  This includes reference information, and 
comments.
  #Replace reference with 'comment' to get the comment, and replace
  #$ref->authors with $ref->title (or location, medline, etc.) to get other
  #reference categories
  my $ann = $seq->annotation();
  my @references = ($ann->get_Annotations('reference'));
  my $ref = shift @references;
  my ($title, $authors, $location, $pubmed, $reference);
  if (defined $ref) {
    $authors = $ref->authors;
    print "$authors\n";
  }
  print "\n";
}
-----------------------------------------------------------------------------------
This file is without copyright.  It may be reproduced, edited, and 
redistributed at
will without notice to the author, however I would appreciate it if any 
corrections
or improvements were sent to barry.moore@genetics.utah.edu


From hlapp at gnf.org  Thu Mar  4 16:10:06 2004
From: hlapp at gnf.org (Hilmar Lapp)
Date: Thu Mar  4 16:15:49 2004
Subject: [Bioperl-l] Re: Bioperl-DB error
In-Reply-To: <10C94843061E094A98C02EB77CFC328722FE0F@nrcmrdex1d.imsb.nrc.ca>
Message-ID: <4F9FD8AC-6E20-11D8-A997-000A959EB4C4@gnf.org>


On Thursday, March 4, 2004, at 12:16  PM, Law, Annie wrote:

>
> I have bioperl installed in two places.  I think it should be okay  
> since the
> Errors that I am receiving have reduced.  I not sure how to get rid of  
> the
> initial install or if it matters.

Brian may be the better person to answer this, but if you just remove  
the Bio/ directory where it got installed then you should have purged  
most of bioperl (and bioperl-db too). There's a few files more that get  
installed into the root directory for site libraries/modules in perl,  
but if you're uncomfortable picking them then there's no harm really.

It'd actually be useful to have an 'uninstall' command. Any comments or  
ideas anybody has had on that already? Maybe there already is and I'm  
just missing it?

As for having two bioperls around and one of them is installed, be  
careful with that. I just recently found myself duped by believing that  
in this situation if you set PERL5LIB to point to your uninstalled  
bioperl perl will pick up the uninstalled version instead of the  
installed. This is not true - it will pick up the installed one if it  
can find one. To discern from where it is going to take the bioperl  
modules you'll have to use the -I option to perl.

Unless somebody else has a better suggestion ...

>
>  Should
> I be concerned about this warning?
>
> t/cluster.......ok 155/160
> -------------------- WARNING ---------------------
> MSG: failed to store one or more child objects for an instance of class
> Bio::Cluster::UniGene (PK=1)
> ---------------------------------------------------
>
>

No, don't worry. The warning reflects the fact that one of the  
associations failed to insert because it was already there. I turned  
off the DBI warnings on failed statements, so you don't see that  
precede a bioperl-style warning anymore.

>
> All tests successful.
> Files=15, Tests=930, 93 wallclock secs (31.64 cusr +  1.26 csys =  
> 32.90 CPU)
>

That's the important message to watch out for.


> 2) I started with an empty database and I loaded the NCBI taxonomy  
> into the
> database, then the GO information then locuslink (I used
> The --safe and --lookup options and did not get any errors).  Loading
> locuslink information to half a day to a day.

Sounds about what I get. The script can report progress every n entries  
using the --logchunk option, check out the POD. It will report some  
speed statistics along.

>
> Next I loaded unigene information and it is taking three days and is  
> still
> not finished?! (I meant to use the option --mergeobjs but forgot when  
> I ran
> the script but I don't think that this will make a difference in terms  
> of
> execution time)

Human and mouse unigene releases take a *very* long time to load, each  
takes about 2-2.5 days for me. The thing with unigene that makes it  
very heavy is that every cluster member becomes its own bioentry (as it  
is a sequence - but it won't have an actual sequence associated as the  
unigene SeqIO parser currently doesn't use the file that would contain  
them). So, when it finishes you will find several million bioentries,  
even though there are already 120k clusters. If you turn on progress  
reporting you'll see that the speed also varies widely depending on  
where in the file the script is. The reason is that some cluster are  
*huge* with tens of thousands of members. Such a single cluster will  
lead to tens of thousands of bioentries being created for it alone.

When you load unigene the first time it doesn't really matter which  
combination you use for merging or look-up since none will be found  
anyway. Once you update though, and you don't purge Unigene from the  
database before, this is critical, because otherwise a large cluster  
will not just take a long time to insert, but also to look up. A very  
useful combination of options for unigene is --flatlookup (see the POD)  
and --mergeobjs="del-assocs-sql.pl". The latter will delete the  
associations partially using SQL instead of traversing object trees and  
deleting one association at a time. (It is therefore also specific to  
the schema version I wrote it for, which is Oracle, but can be ported  
easily to work with the mysql and pg versions. Let me know once you are  
there.)

Also, what to keep in mind is that sequence loading can be trivially  
parallelized by firing up different load_seqdatabase's in parallel on  
different targets simultaneously. How many parallel processes are  
beneficial depends on your number of CPUs and what your RDBMS can  
handle. (e.g. I always run human and mouse unigene in parallel)

Also, you can run Unigene and LL in parallel, if you've got the  
necessary CPU power on the db-server end.

>  Something strange that is happening is that when I use
> mysqlcc to refresh what the current state of the database is the  
> number of
> bioentries increases and then decreases.

I don't know about mysqlcc and hence can't comment ...


> I am wondering if loading of unigene information is affected by the  
> warning
> I got from the make test in bioperl-db

No, it's not, don't worry.

> 3) On a related note every time including the initial time that I load  
> the
> database with load_ontology.pl, or load_seqdatbase.pl I use the option
> lookup.  I want to create annotation database so I am loading first  
> with
> NCBI taxonomy databse, then GO, then locuslink, then unigene.  I want  
> find
> out the annotation for some clones from ESTs.  I think I only need to  
> use
> the mergobjs option when I load the unigene information?

If the database is empty or doesn't contain yet the datasource you're  
trying to load then there is no point in using --lookup (or  
--mergeobjs). In fact, it'll hurt, although really not much -- what  
will happen is that for every bioentry a look-up (SELECT query) is  
generated which you already know won't return anything. The lookup is  
fast though - should be at or below 0.01 seconds, but again that may  
depend on the load on the database.

As for --mergeobjs, there are a couple implementations provided in the  
repository. Check those out to see how it works, e.g.  
update-on-new-date.pl, update-on-new-version.pl, freshen-annot.pl, and  
merge-unique-ann.pl. It's not difficult to write your own if you have  
other requirements.

> I read the documentation of mergobjs in the load_seqdatbase.pl script  
> but
> would appreciate an example of how this works.
>
> 4) Also after I intalled Bioperl-1.4 again I decreased the number of  
> failed
> tests but still there Where 3 test failures and I am not sure if I  
> should be
> concerned about these.
>
> Failed Test  Stat Wstat Total Fail  Failed  List of Failed
> ----------------------------------------------------------------------- 
> -----
> ---
> t/AlignIO.t     2   512    80    5   6.25%  76-80
> t/DB.t                     78   ??       %  ??
> t/tutorial.t  255 65280    21   11  52.38%  11-21
> 122 subtests skipped.
> Failed 3/180 test scripts, 98.33% okay. 7/8305 subtests failed, 99.92%  
> okay.
>

I'll leave this one to Brian, Heikki, Jason, or whoever can provide  
insight.

>
> 5) I have been loading the database with the following command. I  
> would like
> to know if the format option should be SeqIO:locuslink or should it  
> simply
> be locuslink?
>

Either will work. SeqIO::locuslink is more precise, but since SeqIO is  
(and will remain) the default, locuslink will work too. Only for  
ClusterIO formats you have to prepend the ClusterIO:: to the name of  
the format (e.g. unigene).

Hth,

	-hilmar

> Perl /root/bioperl-db/scripts/biosql/load_seqdatabase.pl --dbuser=root
> --dbpass=ms22a --dbname=annotatedata --namespace LocusLink
> --format SeqIO::locuslink /var/lib/mysql/LL_tmpl
>
>
> thanks very much,
> Annie.
>
>
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From jason at cgt.duhs.duke.edu  Thu Mar  4 16:15:22 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Thu Mar  4 16:21:14 2004
Subject: [Bioperl-l] getting a subtree
In-Reply-To: <4172.152.14.14.253.1078434268.squirrel@webmail.ncsu.edu>
References: <4172.152.14.14.253.1078434268.squirrel@webmail.ncsu.edu>
Message-ID: <Pine.LNX.4.50.0403041609260.18707-100000@tenero.duhs.duke.edu>

Deng - here is some code to answer your question.

use Bio::Tree::Tree;
use Bio::TreeIO;

# get a tree somehow
# choose the subnodes for finding the lca somehow

# get the least common ancestor
my $node = $tree->get_lca( -nodes => \@nodes );

# build a new tree with its root at $node
my $subtree = Bio::Tree::Tree->new(-root => $node, -nodelete => 1);

my $out = Bio::TreeIO->new(-format=>'newick');
$out->write_tree($subtree);

-jason

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From brian_osborne at cognia.com  Thu Mar  4 16:46:59 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Thu Mar  4 16:52:59 2004
Subject: [Bioperl-l] Broken link in bptutorial
In-Reply-To: <404771C1.8060302@email.arizona.edu>
Message-ID: <GAEDKMGOKFBLJPKCLKCCKECHDIAA.brian_osborne@cognia.com>

Susan,

I thought I'd gotten all of those!

Thanks for the tip.

Brian O.


-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Susan J. Miller
Sent: Thursday, March 04, 2004 1:13 PM
To: bioperl-l@bioperl.org
Subject: [Bioperl-l] Broken link in bptutorial

In the BioPerl Tutorial Introduction Overview section
http://www.bioperl.org/Core/Latest/bptutorial.html#i._introduction the
link http://bioperl.org/Core/Latest/bioscripts.html does not work
because there is punctuation included in the URL.

--
Regards,
-susan

Susan J. Miller
Biotechnology Computing Facility
Arizona Research Laboratories
Bio West 228
University of Arizona
Tucson, AZ  85721
(520) 626-2597

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From morten at binf.ku.dk  Thu Mar  4 10:57:11 2004
From: morten at binf.ku.dk (Morten Lindow)
Date: Thu Mar  4 21:30:04 2004
Subject: [Bioperl-l] Unusual behaviour of SeqIO::tigr
In-Reply-To: <Pine.LNX.4.50.0312281604220.29220-100000@tenero.duhs.duke.edu>
References: <Pine.LNX.4.50.0312281604220.29220-100000@tenero.duhs.duke.edu>
Message-ID: <404751D7.6030406@binf.ku.dk>

I am trying to build a table of genomic features from the tigrxml-format 
of a rice (pseudo)chromosome.

However the tigr-parser seem to behave differently from genbank/embl: 
tigr.pm considers every TU/etc an  individual sequence, and hence resets 
its coordinate system every time it starts on a new TU.

My question is:
Is there a bioperl-way to get to the global coordinates, like when I am 
parsing a genbankfile of a whole chromosome?

my $tigrin = Bio::SeqIO->new( -format => 'tigr', -file => 'chr01.xml');

while (my $seq = $tigrin->next_seq){ #I would expect this loop to run 
only once
    foreach my $feat ( $seq->get_SeqFeatures ) {
        foreach my $location ($feat->location->each_Location){
                print $location->to_FTstring, "\n";
                #blablabla
                }
      }
  }

cheers,
Morten

From off2w0rk at yahoo.com  Wed Mar  3 21:37:12 2004
From: off2w0rk at yahoo.com (facemann)
Date: Thu Mar  4 21:30:37 2004
Subject: [Bioperl-l] RemoteBlast VS Traditional Browser Blast
Message-ID: <20040304023712.83186.qmail@web41306.mail.yahoo.com>

SOLVED.
I had a problem with my "keyboard to floor interface".
BLAST & RemoteBlast.pm work just fine.
Thank you for you time.


---------------------------------
Do you Yahoo!?
Yahoo! Search - Find what you�re looking for faster.
From advil21dundee at hotmail.com  Fri Mar  5 04:25:50 2004
From: advil21dundee at hotmail.com (berry)
Date: Fri Mar  5 04:31:39 2004
Subject: [Bioperl-l] 36 H0UR V1=AG-RA
Message-ID: <1078478750-5189@excite.com>


Did you know That the normal cost for Super V i a g r a is $20, per dose?

We are running a hot special!! T0DAY Its only an amazing $3.00

Shipped world wide!

DISC0UNT 0RDER: http://healthypolicy.com/sv/index.php?pid=eph9106


laura biologygretchen dirk percy diana praise corrado 
vicky bensongray abcdef molly1
mantra rufus joel 

Get off this list by going to http://healthypolicy.com/sv/applepie.php
From heikki at ebi.ac.uk  Fri Mar  5 05:39:44 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Fri Mar  5 05:45:34 2004
Subject: [Bioperl-l] Getting values from a GenBank File
In-Reply-To: <404791F9.2030103@genetics.utah.edu>
References: <404791F9.2030103@genetics.utah.edu>
Message-ID: <200403051039.45640.heikki@ebi.ac.uk>

Barry,

I like your RichSeq breakdown! It would make a great addition into BioPerl 
howtos. Would you be interested in converting it into DocBook yourself? 

	-Heikki

On Thursday 04 Mar 2004 20:30, Barry Moore wrote:
> While learning BioPerl, I've spent a good bit of time trying to figure
> out where some bit of information in a GenBank file might be tucked away
> in that RichSeq object, and how the hell to get at it.  I finally
> decided it was time to sift through that object and find out what's in
> there and where it is.  Along the way I wrote down  what I found, and
> then added a bit to it to try and turn it into something that I would
> like to have found when I first started learning Bioperl.  The result is
> a "GeneBank-file-centric" crib sheet for Bioperl.  The document is
> centered around a "standard" GenBank file where each seperate item of
> information is marked-up with  numbers and codes that refer to
> information later in the document about how to get at that particular
> value in using BioPerl.  It also includes a script at the end that can
> be modified easily to get at all the major stuff in the GenBank file.
> This document is aimed at beginners to BioPerl, but assumes a working
> knowledge of Perl.  The file is, of course, free to be used by anyone in
> anyway that they wish.  I'd appreciate any corrections or improvements
> if you find errors or shortcomings.
>
> Barry

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From Alexandre.Irrthum at icr.ac.uk  Fri Mar  5 06:26:28 2004
From: Alexandre.Irrthum at icr.ac.uk (Alexandre Irrthum)
Date: Fri Mar  5 06:32:44 2004
Subject: [Bioperl-l] bl2seq overflow
Message-ID: <s0486400.024@icr.ac.uk>

Hello bioperl people,

I have a strange problem here. I am trying to blast (bl2seq) EST
sequences against a reference cDNA. In this test, the EST file comprises
> 2000 sequences. Here is the skeleton of the program (sorry if it's
formatted improperly):

#! /usr/bin/perl

use warnings;
use strict;
use Bio::Tools::Run::StandAloneBlast;
use Bio::SeqIO;

my $factory = Bio::Tools::Run::StandAloneBlast->new(program =>'blastn');
my $refseqio = Bio::SeqIO->new(-file => 'refseq.fasta',                 
       -format => 'Fasta');
my $refseq = $refseqio->next_seq();
my $estseqio = Bio::SeqIO->new(-file => 'ests.fasta',              
-format => 'Fasta');
my $count = 0;
while (my $estseq = $estseqio->next_seq()) {              $count++;
my $report = $factory->bl2seq($estseq, $refseq);
if (my $hsp = $report->next_feature()) {
my $query_seq = $hsp->querySeq();
my $sbjct_seq = $hsp->sbjctSeq();
print "COUNT: $count\n";
print "QUERY: $query_seq\n";
print "SBJCT: $sbjct_seq\n";
}
}

The program crashes with the following message, after successfully
blasting 1020 sequences. Always after 1020 blasts, even when I
reorganize the EST input file. 

--------------program output-------------------
Can't locate Carp/Heavy.pm in @INC (@INC
contains:/usr/lib/perl5/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/5.8.0/usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi/usr/lib/perl5/site_perl/5.8.0
/usr/lib/perl5/site_perl/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/usr/lib/perl5/vendor_perl/5.8.0
/usr/lib/perl5/vendor_perl/usr/lib/perl5/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/5.8.0 .) at/usr/lib/perl5/5.8.0/Carp.pm line 178, <GEN1>
line 1021.

I don't know why bioperl cannot find Carp and Carp::heavy, since they
are in /usr/lib/perl5/5.8.0/. Anyhow, when I add "use Carp;" and
"useCarp::Heavy;", I get this:

---------------program output---------------
Error in tempfile() using /tmp/XXXXXXXXXX: Could not create temp
file/tmp/C5b2L4SloO: Too many open files
at/usr/lib/perl5/site_perl/5.8.0/Bio/Root/IO.pm line 659

There are no blast temporary files in /tmp. I'm clueless.

Many thanks to the bioperl developers for the fantastic tool they've
built us.

alex
From matthias.wahl at gsf.de  Fri Mar  5 07:16:10 2004
From: matthias.wahl at gsf.de (Matthias Wahl)
Date: Fri Mar  5 07:16:36 2004
Subject: [Bioperl-l] bl2seq overflow
References: <s0486400.024@icr.ac.uk>
Message-ID: <40486F8A.7020601@gsf.de>

One possible solution is to use the old bioperl 0.7 version. This one 
works with unlimited number of sequences. But you might have to change 
your code to be compatible with the old version.

Matthias

Alexandre Irrthum wrote:

>Hello bioperl people,
>
>I have a strange problem here. I am trying to blast (bl2seq) EST
>sequences against a reference cDNA. In this test, the EST file comprises
>  
>
>>2000 sequences. Here is the skeleton of the program (sorry if it's
>>    
>>
>formatted improperly):
>
>#! /usr/bin/perl
>
>use warnings;
>use strict;
>use Bio::Tools::Run::StandAloneBlast;
>use Bio::SeqIO;
>
>my $factory = Bio::Tools::Run::StandAloneBlast->new(program =>'blastn');
>my $refseqio = Bio::SeqIO->new(-file => 'refseq.fasta',                 
>       -format => 'Fasta');
>my $refseq = $refseqio->next_seq();
>my $estseqio = Bio::SeqIO->new(-file => 'ests.fasta',              
>-format => 'Fasta');
>my $count = 0;
>while (my $estseq = $estseqio->next_seq()) {              $count++;
>my $report = $factory->bl2seq($estseq, $refseq);
>if (my $hsp = $report->next_feature()) {
>my $query_seq = $hsp->querySeq();
>my $sbjct_seq = $hsp->sbjctSeq();
>print "COUNT: $count\n";
>print "QUERY: $query_seq\n";
>print "SBJCT: $sbjct_seq\n";
>}
>}
>
>The program crashes with the following message, after successfully
>blasting 1020 sequences. Always after 1020 blasts, even when I
>reorganize the EST input file. 
>
>--------------program output-------------------
>Can't locate Carp/Heavy.pm in @INC (@INC
>contains:/usr/lib/perl5/5.8.0/i386-linux-thread-multi
>/usr/lib/perl5/5.8.0/usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi/usr/lib/perl5/site_perl/5.8.0
>/usr/lib/perl5/site_perl/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/usr/lib/perl5/vendor_perl/5.8.0
>/usr/lib/perl5/vendor_perl/usr/lib/perl5/5.8.0/i386-linux-thread-multi
>/usr/lib/perl5/5.8.0 .) at/usr/lib/perl5/5.8.0/Carp.pm line 178, <GEN1>
>line 1021.
>
>I don't know why bioperl cannot find Carp and Carp::heavy, since they
>are in /usr/lib/perl5/5.8.0/. Anyhow, when I add "use Carp;" and
>"useCarp::Heavy;", I get this:
>
>---------------program output---------------
>Error in tempfile() using /tmp/XXXXXXXXXX: Could not create temp
>file/tmp/C5b2L4SloO: Too many open files
>at/usr/lib/perl5/site_perl/5.8.0/Bio/Root/IO.pm line 659
>
>There are no blast temporary files in /tmp. I'm clueless.
>
>Many thanks to the bioperl developers for the fantastic tool they've
>built us.
>
>alex
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
>  
>


-- 
Matthias Wahl
GSF-National Research Center for Environment and Health
Institute of Developmental Genetics
Ingolstaedter Landstrasse 1
D-85764 Neuherberg
Germany
TEL: ++49 89 3187-4117,-2638
FAX: ++49 89 3187-3099
E-mail: matthias.wahl@gsf.de
WWW: http://www.gsf.de/idg


From brian_osborne at cognia.com  Fri Mar  5 07:52:56 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Fri Mar  5 07:58:48 2004
Subject: [Bioperl-l] Getting values from a GenBank File
In-Reply-To: <404791F9.2030103@genetics.utah.edu>
Message-ID: <GAEDKMGOKFBLJPKCLKCCAECMDIAA.brian_osborne@cognia.com>

Barry,

This is good. I've also been talking to another Bioperl-er who's writing a
script that creates an output very much like your Tables, where the input is
any type of format (Genbank, Locuslink, and so on). So you could write your
own HOWTO, yes, but in my opinion the better idea is to marry the existing
HOWTO, your tables, and this incoming script somehow.

I'm convinced that having too much on a single topic ("you could read the
FAQ 2.6 or the HOWTO  or the module documentation for Bio::That or Bio::This
or the bptutorial section 3.4...") is not good. Bioperl is under-documented,
generally, but you'll also find places where there's documentation scattered
all over the place on a given topic. The advice is inconsistent, one section
gets outdated, the authors move on, these sorts of things happen.

My suggestion would be to wait a moment until my colleague's script appears,
then let's write something coherent that's easy to maintain.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Barry Moore
Sent: Thursday, March 04, 2004 3:31 PM
To: bioperl
Subject: [Bioperl-l] Getting values from a GenBank File

While learning BioPerl, I've spent a good bit of time trying to figure
out where some bit of information in a GenBank file might be tucked away
in that RichSeq object, and how the hell to get at it.  I finally
decided it was time to sift through that object and find out what's in
there and where it is.  Along the way I wrote down  what I found, and
then added a bit to it to try and turn it into something that I would
like to have found when I first started learning Bioperl.  The result is
a "GeneBank-file-centric" crib sheet for Bioperl.  The document is
centered around a "standard" GenBank file where each seperate item of
information is marked-up with  numbers and codes that refer to
information later in the document about how to get at that particular
value in using BioPerl.  It also includes a script at the end that can
be modified easily to get at all the major stuff in the GenBank file.
This document is aimed at beginners to BioPerl, but assumes a working
knowledge of Perl.  The file is, of course, free to be used by anyone in
anyway that they wish.  I'd appreciate any corrections or improvements
if you find errors or shortcomings.

Barry
--

Barry Moore
Dept. of Human Genetics
University of Utah
Salt Lake City, UT

----------------------------------------------------------------------------
-------

                 How to get values in a GenBank file with BioPerl

This is a guide to accessing the value of each item in a GenBank file
with BioPerl.
Underneath each value in the sample GenBank file there is a number and a
3-letter
code.  The 3-letter code represents the object where the associated
value is stored.
Table 1 at the end of this file gives the object for each code.  By
looking up the
numbers in Table 2 at the end of this file you will find the object
where this
information can be called from, and the method to get (and usually set)
the value.
Some values may be stored by, and accessible by methods in more than one
object.
There is perl code at the end that you can use as a starting point to
get at all of
the objects and their methods.
----------------------------------------------------------------------------
-------
LOCUS       NM_079543               1821 bp    mRNA    linear   INV
10-DEC-2003
            1-BSR                   2-BPS      3-BSR   4-BSR  5-BSR 6-BSR
DEFINITION  Drosophila melanogaster alpha-Esterase-3 CG1257-PA (alpha-Est3)
            mRNA, complete cds.
            7-BPS
ACCESSION   NM_079543 XX_123456
            8-BPS     9-BSR
VERSION     NM_079543.2  GI:24644853
                      |     |
                    10-BPS  11-BSR
KEYWORDS    Esterases
            12-BSR
SOURCE      Drosophila melanogaster (fruit fly)
            13-BSC
  ORGANISM  Drosophila melanogaster
            14-BSC     15-BSC
            Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;
            Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
            Ephydroidea; Drosophilidae; Drosophila.
            16-BSC
REFERENCE   1  (bases 1 to 1821)
                    17-BAR 18-BAR
  AUTHORS   Adams,M.D. et al.(additional authors deleted for brevity)
            19-BAR
  TITLE     The genome sequence of Drosophila melanogaster
            20-BAR
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
            21-BAR
  MEDLINE   20196006
            22-BAR
   PUBMED   10731132
            23-BAR
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to
final
            NCBI review. This record is derived from an annotated genomic
            sequence (NT_033777). The reference sequence was derived from
            CG1257-RA.
            On Nov 6, 2002 this sequence version replaced gi:17737826.
            24-BAC
FEATURES             Location/Qualifiers
     source          1..1821
                25-BLS  26-BLS
                     /organism="Drosophila melanogaster"
                     27-BSG
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="3R"
                     /note="genotype: y[1]; cn[1] bw[1] sp[1]; Rh6[1]"
     gene            1..1821
                     /gene="alpha-Est3"
                     /locus_tag="CG1257"
                     /note="alpha-Esterase-3; synonyms: B, aE3, CG1257,
                     alphaE3, alpha-Ests, fragment B;
                     go_function: carboxylesterase activity [goid 0004091]
                     [evidence NAS]"
                     /map="84D9-84D9"
                     /db_xref="FLYBASE:FBgn0015571"
                     /db_xref="GeneID:40907"
                     /db_xref="LocusID:40907"
     CDS             119..1750
                     /gene="alpha-Est3"
                     /locus_tag="CG1257"
                     /EC_number="3.1.1.1"
                     /note="alpha-Est3 gene product"
                     /codon_start=1
                     /product="alpha-Esterase-3 CG1257-PA"
                     /protein_id="NP_524267.2"
                     /db_xref="GI:24644854"
                     /db_xref="FLYBASE:FBgn0015571"
                     /db_xref="GeneID:40907"
                     /db_xref="LocusID:40907"

/translation="MESLQVNTTSGPVLGKQCTGVYGDEYVSFERIPYAQPPVGHLRF

MAPLPVEPWSQPLDCTKPGQKPLQFNHYSKQLEGVEDCLYLNVYAKELDSPRPLPLIV

FFFGGGFEKGDPTKELHSPDYFMMRDVVVVTVSYRVGPLGFLSLNDPAVGVPGNAGLK

DQLLAMEWIKENAERFNGDPKNVTAFGESAGAASVHYLMLNPKAEGLFHKAILQSGNV

LCSWALCTIKNLPHRLAVNLGMESAEHVTDAMVLDFLQKLPGEKLVRPYLLSAEEHLD

DCVFQFGPMVEPYKTEHCALPNHPQELLDKAWGNRIPVLMSGTSFEGLLMYARVQMAP

YLLTSLKKEPEHMLPLDVKRNLPQALARHLGQRLQETHFGGNDPSAMSPESLKAYCEY

ASYKVFWHPILKTLRSRVKSSSASTYLYRFDFDSPTFNHQRLKYCGDKLRGVAHVDDH

SYLWYGDFSWKLDKHTPEFLTIERMIDMLTSFARTSNPNCKLIQDQLPRAKEWKPLNS
                     KSALECLNISENIKMMELPELQKLRVWESVCQSTG"
ORIGIN
        1 gtactatggc aaatggtact atgggcagtc gcgataataa taataaatga ggctttaaat
       61 gtttttccac tacaagataa aaagttacga gtgcgcaccg agcatttagt agacgaacat
          28-BPS
... (additional sequence deleted for brevity)
//
----------------------------------------------------------------------------
-------
Table 1. Object Key
BPS - Bio::PrimarySeq
BSR - Bio::Seq::RichSeq
BSC - Bio::Species
BAC - Bio::Annotation::Comment
BAR - Bio::Annotation::Reference
BLS - Bio::Location::Simple
BSG - Bio::SeqFeature::Generic
----------------------------------------------------------------------------
-------
Table 2. Usage key
1.   Bio::Seq::RichSeq           $scalar = $self->display_name() or
->display_id()
2.   Bio::PrimarySeq             $scalar = $self->length()
3.   Bio::Seq::RichSeq           $scalar = $self->molecule()
4.   Bio::PrimarySeq             $scalar = $self->is_circular
5.   Bio::Seq::RichSeq           $scalar = $self->division()
6.   Bio::seq::Richseq           @array  = $self->get_dates
7.   Bio::PrimarySeq             $scalar = $self->description()
8.   Bio::PrimarySeq             $scalar = $self->accession_number()
     Bio::Seq::RichSeq           $scalar = $self->accession()
9.   Bio::Seq::RichSeq           @array  = $self->get_secondary_accessions
10.  Bio::PrimarySeq             $scalar = $self->version()
     Bio::Seq::RichSeq           $scalar = $self->seq_version()
11.  Bio::PrimarySeq             $scalar = $self->primary_id()
12.  Bio::Seq::RichSeq           @array  = $self->get_keywords
13.  Bio::Species                $scalar = $self->common_name()
14.  Bio::Species                $scalar = $self->genus()
15.  Bio::Species                $scalar = $self->species()
16.  Bio::Species                @array  = $self->classification()
17.  Bio::Annotation::Reference  $scalar = $self->start()
18.  Bio::Annotation::Reference  $scalar = $self->end()
19.  Bio::Annotation::Reference  $scalar = $self->authors()
20.  Bio::Annotation::Reference  $scalar = $self->title()
21.  Bio::Annotation::Reference  $scalar = $self->location()
22.  Bio::Annotation::Reference  $scalar = $self->medline()
23.  Bio::Annotation::Reference  $scalar = $self->pubmed()
24.  Bio::Annotation::Comment    $scalar = $self->text() or ->as_text
25.  Bio::SeqFeature::Generic    $scalar = $self->start()
26.  Bio::SeqFeature::Generic    $scalar = $self->end()
27.  Bio::SeqFeature::Generic

Get all features from the features table using a variation on the code
below.
You will need to vary the primary_tag (e.g. source, gene, CDS) and the
get_tag_values (e.g. organism, db_xref, chromosome, note, gene, locus_tag,
go_function, map, EC_number, codon_start, protein_id, etc.).  If there
is more
than one primary tag associated with a give value (e.g. more than one
gene) then
you will have to step through the @source_feats array to find what you
want rather
than simply shifting the first scalar off the top.

my @source_feats = grep { $_->primary_tag eq 'source' }
$seq->get_SeqFeatures();
my $source_feat = shift @source_feats;
my @mol_type = $source_feat->get_tag_values('mol_type');

28.    Bio::PrimarySeq    $scalar =    $self->seq()
----------------------------------------------------------------------------
-------
#!/usr/bin/perl

use strict;
use warnings;
use Bio::SeqIO;
use Bio::DB::GenBank; #use Bio::DB::GenPept or Bio::DB::RefSeq if needed

#Get some sequence IDs either like below, or read in from a file.  Note that
#this sample script works with the accession numbers below (at least at
the time
#it was written).  If you add different accession numbers, and you get
errors,
#you may be calling for something that the sequence doesn't have.
You'll have
#to add your own error trapping code to handle that.
my @ids = ('U59228', 'AB039327', 'BC035972');

#Create the GenBank database object to read from the database.
my $gb = new Bio::DB::GenBank();

#Create a sequence stream to pass the sequences from the database to the
program.
my $seqio = $gb->get_Stream_by_id(\@ids);

#Loop over all of the sequences that you requested.
while (my $seq = $seqio->next_seq) {

  #Here is how you get methods directly from the RichSeq object.  Replace
  #'display_name' with any other method in Table 2. that can be called on
  #either the RichSeq object directly, or the PrimarySeq object which it has
  #inherited.
  print $seq->display_name,"\n";

  #Here is how to access the classification data from the species object.
  my $species = $seq->species;
  print $species->common_name,"\n";
  my @class = $species->classification;
  print "@class\n";

  #Here is a general way to call things that are stored as a
Bio::SeqFeature::
  #Generic object.  Replace 'source' with any other of the "major"
headings in
  #the feature table (e.g gene, CDS, etc.) and replace 'organism' with
any of
  #the tag values found under that heading (mol_type, locus_tag, gene, etc.)
  my @source_feats = grep { $_->primary_tag eq 'source' }
$seq->get_SeqFeatures();
  my $source_feat = shift @source_feats;
  my @mol_type = $source_feat->get_tag_values('mol_type');
  print "@mol_type\n";

  #Here is a general way to call things that are stored as some type of a
  #Bio::Annotation oject.  This includes reference information, and
comments.
  #Replace reference with 'comment' to get the comment, and replace
  #$ref->authors with $ref->title (or location, medline, etc.) to get other
  #reference categories
  my $ann = $seq->annotation();
  my @references = ($ann->get_Annotations('reference'));
  my $ref = shift @references;
  my ($title, $authors, $location, $pubmed, $reference);
  if (defined $ref) {
    $authors = $ref->authors;
    print "$authors\n";
  }
  print "\n";
}
----------------------------------------------------------------------------
-------
This file is without copyright.  It may be reproduced, edited, and
redistributed at
will without notice to the author, however I would appreciate it if any
corrections
or improvements were sent to barry.moore@genetics.utah.edu


_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From jason at cgt.duhs.duke.edu  Fri Mar  5 08:18:57 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Mar  5 08:24:47 2004
Subject: [Bioperl-l] bl2seq overflow
In-Reply-To: <s0486400.024@icr.ac.uk>
References: <s0486400.024@icr.ac.uk>
Message-ID: <Pine.LNX.4.50.0403050813220.24188-100000@tenero.duhs.duke.edu>

What version of bioperl - this problem of not cleaning up tempfiles was
supposed to be resolved in 1.4

You might add
  $factory->io->_io_cleanup();

At the beginning of your sequence loop
while ( my $estseq = $estseqio->next_seq ) {

}


On Fri, 5 Mar 2004, Alexandre Irrthum wrote:

> Hello bioperl people,
>
> I have a strange problem here. I am trying to blast (bl2seq) EST
> sequences against a reference cDNA. In this test, the EST file comprises
> > 2000 sequences. Here is the skeleton of the program (sorry if it's
> formatted improperly):
>
> #! /usr/bin/perl
>
> use warnings;
> use strict;
> use Bio::Tools::Run::StandAloneBlast;
> use Bio::SeqIO;
>
> my $factory = Bio::Tools::Run::StandAloneBlast->new(program =>'blastn');
> my $refseqio = Bio::SeqIO->new(-file => 'refseq.fasta',
>        -format => 'Fasta');
> my $refseq = $refseqio->next_seq();
> my $estseqio = Bio::SeqIO->new(-file => 'ests.fasta',
> -format => 'Fasta');
> my $count = 0;
> while (my $estseq = $estseqio->next_seq()) {              $count++;
> my $report = $factory->bl2seq($estseq, $refseq);
> if (my $hsp = $report->next_feature()) {
> my $query_seq = $hsp->querySeq();
> my $sbjct_seq = $hsp->sbjctSeq();
> print "COUNT: $count\n";
> print "QUERY: $query_seq\n";
> print "SBJCT: $sbjct_seq\n";
> }
> }
>
> The program crashes with the following message, after successfully
> blasting 1020 sequences. Always after 1020 blasts, even when I
> reorganize the EST input file.
>
> --------------program output-------------------
> Can't locate Carp/Heavy.pm in @INC (@INC
> contains:/usr/lib/perl5/5.8.0/i386-linux-thread-multi
> /usr/lib/perl5/5.8.0/usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi/usr/lib/perl5/site_perl/5.8.0
> /usr/lib/perl5/site_perl/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/usr/lib/perl5/vendor_perl/5.8.0
> /usr/lib/perl5/vendor_perl/usr/lib/perl5/5.8.0/i386-linux-thread-multi
> /usr/lib/perl5/5.8.0 .) at/usr/lib/perl5/5.8.0/Carp.pm line 178, <GEN1>
> line 1021.
>
> I don't know why bioperl cannot find Carp and Carp::heavy, since they
> are in /usr/lib/perl5/5.8.0/. Anyhow, when I add "use Carp;" and
> "useCarp::Heavy;", I get this:
>
> ---------------program output---------------
> Error in tempfile() using /tmp/XXXXXXXXXX: Could not create temp
> file/tmp/C5b2L4SloO: Too many open files
> at/usr/lib/perl5/site_perl/5.8.0/Bio/Root/IO.pm line 659
>
> There are no blast temporary files in /tmp. I'm clueless.
>
> Many thanks to the bioperl developers for the fantastic tool they've
> built us.
>
> alex
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From jason at cgt.duhs.duke.edu  Fri Mar  5 08:20:08 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Mar  5 08:26:35 2004
Subject: [Bioperl-l] bl2seq overflow
In-Reply-To: <40486F8A.7020601@gsf.de>
References: <s0486400.024@icr.ac.uk> <40486F8A.7020601@gsf.de>
Message-ID: <Pine.LNX.4.50.0403050819060.24188-100000@tenero.duhs.duke.edu>

I doubt this will fix it - there is no coded limit in bioperl - it has to
do with the creation of tempfiles and not being cleaned up properly until
perl exits instead of within a loop of factory->bl2seq calls.


On Fri, 5 Mar 2004, Matthias Wahl wrote:

> One possible solution is to use the old bioperl 0.7 version. This one
> works with unlimited number of sequences. But you might have to change
> your code to be compatible with the old version.
>
> Matthias
>
> Alexandre Irrthum wrote:
>
> >Hello bioperl people,
> >
> >I have a strange problem here. I am trying to blast (bl2seq) EST
> >sequences against a reference cDNA. In this test, the EST file comprises
> >
> >
> >>2000 sequences. Here is the skeleton of the program (sorry if it's
> >>
> >>
> >formatted improperly):
> >
> >#! /usr/bin/perl
> >
> >use warnings;
> >use strict;
> >use Bio::Tools::Run::StandAloneBlast;
> >use Bio::SeqIO;
> >
> >my $factory = Bio::Tools::Run::StandAloneBlast->new(program =>'blastn');
> >my $refseqio = Bio::SeqIO->new(-file => 'refseq.fasta',
> >       -format => 'Fasta');
> >my $refseq = $refseqio->next_seq();
> >my $estseqio = Bio::SeqIO->new(-file => 'ests.fasta',
> >-format => 'Fasta');
> >my $count = 0;
> >while (my $estseq = $estseqio->next_seq()) {              $count++;
> >my $report = $factory->bl2seq($estseq, $refseq);
> >if (my $hsp = $report->next_feature()) {
> >my $query_seq = $hsp->querySeq();
> >my $sbjct_seq = $hsp->sbjctSeq();
> >print "COUNT: $count\n";
> >print "QUERY: $query_seq\n";
> >print "SBJCT: $sbjct_seq\n";
> >}
> >}
> >
> >The program crashes with the following message, after successfully
> >blasting 1020 sequences. Always after 1020 blasts, even when I
> >reorganize the EST input file.
> >
> >--------------program output-------------------
> >Can't locate Carp/Heavy.pm in @INC (@INC
> >contains:/usr/lib/perl5/5.8.0/i386-linux-thread-multi
> >/usr/lib/perl5/5.8.0/usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi/usr/lib/perl5/site_perl/5.8.0
> >/usr/lib/perl5/site_perl/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/usr/lib/perl5/vendor_perl/5.8.0
> >/usr/lib/perl5/vendor_perl/usr/lib/perl5/5.8.0/i386-linux-thread-multi
> >/usr/lib/perl5/5.8.0 .) at/usr/lib/perl5/5.8.0/Carp.pm line 178, <GEN1>
> >line 1021.
> >
> >I don't know why bioperl cannot find Carp and Carp::heavy, since they
> >are in /usr/lib/perl5/5.8.0/. Anyhow, when I add "use Carp;" and
> >"useCarp::Heavy;", I get this:
> >
> >---------------program output---------------
> >Error in tempfile() using /tmp/XXXXXXXXXX: Could not create temp
> >file/tmp/C5b2L4SloO: Too many open files
> >at/usr/lib/perl5/site_perl/5.8.0/Bio/Root/IO.pm line 659
> >
> >There are no blast temporary files in /tmp. I'm clueless.
> >
> >Many thanks to the bioperl developers for the fantastic tool they've
> >built us.
> >
> >alex
> >_______________________________________________
> >Bioperl-l mailing list
> >Bioperl-l@portal.open-bio.org
> >http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
> >
> >
> >
>
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From matthias.wahl at gsf.de  Fri Mar  5 08:47:23 2004
From: matthias.wahl at gsf.de (Matthias Wahl)
Date: Fri Mar  5 08:47:48 2004
Subject: [Bioperl-l] bl2seq overflow
References: <s0486400.024@icr.ac.uk> <40486F8A.7020601@gsf.de>
	<Pine.LNX.4.50.0403050819060.24188-100000@tenero.duhs.duke.edu>
Message-ID: <404884EB.90006@gsf.de>

At least it worked with my programs (blasting 30,000 sequences against 
database). At that time, 1.4 was not yet available.

Jason Stajich wrote:

>I doubt this will fix it - there is no coded limit in bioperl - it has to
>do with the creation of tempfiles and not being cleaned up properly until
>perl exits instead of within a loop of factory->bl2seq calls.
>
>
>On Fri, 5 Mar 2004, Matthias Wahl wrote:
>
>  
>
>>One possible solution is to use the old bioperl 0.7 version. This one
>>works with unlimited number of sequences. But you might have to change
>>your code to be compatible with the old version.
>>
>>Matthias
>>
>>Alexandre Irrthum wrote:
>>
>>    
>>
>>>Hello bioperl people,
>>>
>>>I have a strange problem here. I am trying to blast (bl2seq) EST
>>>sequences against a reference cDNA. In this test, the EST file comprises
>>>
>>>
>>>      
>>>
>>>>2000 sequences. Here is the skeleton of the program (sorry if it's
>>>>
>>>>
>>>>        
>>>>
>>>formatted improperly):
>>>
>>>#! /usr/bin/perl
>>>
>>>use warnings;
>>>use strict;
>>>use Bio::Tools::Run::StandAloneBlast;
>>>use Bio::SeqIO;
>>>
>>>my $factory = Bio::Tools::Run::StandAloneBlast->new(program =>'blastn');
>>>my $refseqio = Bio::SeqIO->new(-file => 'refseq.fasta',
>>>      -format => 'Fasta');
>>>my $refseq = $refseqio->next_seq();
>>>my $estseqio = Bio::SeqIO->new(-file => 'ests.fasta',
>>>-format => 'Fasta');
>>>my $count = 0;
>>>while (my $estseq = $estseqio->next_seq()) {              $count++;
>>>my $report = $factory->bl2seq($estseq, $refseq);
>>>if (my $hsp = $report->next_feature()) {
>>>my $query_seq = $hsp->querySeq();
>>>my $sbjct_seq = $hsp->sbjctSeq();
>>>print "COUNT: $count\n";
>>>print "QUERY: $query_seq\n";
>>>print "SBJCT: $sbjct_seq\n";
>>>}
>>>}
>>>
>>>The program crashes with the following message, after successfully
>>>blasting 1020 sequences. Always after 1020 blasts, even when I
>>>reorganize the EST input file.
>>>
>>>--------------program output-------------------
>>>Can't locate Carp/Heavy.pm in @INC (@INC
>>>contains:/usr/lib/perl5/5.8.0/i386-linux-thread-multi
>>>/usr/lib/perl5/5.8.0/usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi/usr/lib/perl5/site_perl/5.8.0
>>>/usr/lib/perl5/site_perl/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/usr/lib/perl5/vendor_perl/5.8.0
>>>/usr/lib/perl5/vendor_perl/usr/lib/perl5/5.8.0/i386-linux-thread-multi
>>>/usr/lib/perl5/5.8.0 .) at/usr/lib/perl5/5.8.0/Carp.pm line 178, <GEN1>
>>>line 1021.
>>>
>>>I don't know why bioperl cannot find Carp and Carp::heavy, since they
>>>are in /usr/lib/perl5/5.8.0/. Anyhow, when I add "use Carp;" and
>>>"useCarp::Heavy;", I get this:
>>>
>>>---------------program output---------------
>>>Error in tempfile() using /tmp/XXXXXXXXXX: Could not create temp
>>>file/tmp/C5b2L4SloO: Too many open files
>>>at/usr/lib/perl5/site_perl/5.8.0/Bio/Root/IO.pm line 659
>>>
>>>There are no blast temporary files in /tmp. I'm clueless.
>>>
>>>Many thanks to the bioperl developers for the fantastic tool they've
>>>built us.
>>>
>>>alex
>>>_______________________________________________
>>>Bioperl-l mailing list
>>>Bioperl-l@portal.open-bio.org
>>>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>>
>>>
>>>
>>>
>>>      
>>>
>>
>>    
>>
>
>--
>Jason Stajich
>Duke University
>jason at cgt.mc.duke.edu
>
>
>  
>


-- 
Matthias Wahl
GSF-National Research Center for Environment and Health
Institute of Developmental Genetics
Ingolstaedter Landstrasse 1
D-85764 Neuherberg
Germany
TEL: ++49 89 3187-4117,-2638
FAX: ++49 89 3187-3099
E-mail: matthias.wahl@gsf.de
WWW: http://www.gsf.de/idg


From barry.moore at genetics.utah.edu  Fri Mar  5 09:30:06 2004
From: barry.moore at genetics.utah.edu (Barry Moore)
Date: Fri Mar  5 09:35:53 2004
Subject: [Bioperl-l] Getting values from a GenBank File
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCAECMDIAA.brian_osborne@cognia.com>
References: <GAEDKMGOKFBLJPKCLKCCAECMDIAA.brian_osborne@cognia.com>
Message-ID: <40488EEE.7040407@genetics.utah.edu>

O.K. Sounds good.

Barry

Brian Osborne wrote:

>Barry,
>
>This is good. I've also been talking to another Bioperl-er who's writing a
>script that creates an output very much like your Tables, where the input is
>any type of format (Genbank, Locuslink, and so on). So you could write your
>own HOWTO, yes, but in my opinion the better idea is to marry the existing
>HOWTO, your tables, and this incoming script somehow.
>
>I'm convinced that having too much on a single topic ("you could read the
>FAQ 2.6 or the HOWTO  or the module documentation for Bio::That or Bio::This
>or the bptutorial section 3.4...") is not good. Bioperl is under-documented,
>generally, but you'll also find places where there's documentation scattered
>all over the place on a given topic. The advice is inconsistent, one section
>gets outdated, the authors move on, these sorts of things happen.
>
>My suggestion would be to wait a moment until my colleague's script appears,
>then let's write something coherent that's easy to maintain.
>
>Brian O.
>
>-----Original Message-----
>From: bioperl-l-bounces@portal.open-bio.org
>[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Barry Moore
>Sent: Thursday, March 04, 2004 3:31 PM
>To: bioperl
>Subject: [Bioperl-l] Getting values from a GenBank File
>
>While learning BioPerl, I've spent a good bit of time trying to figure
>out where some bit of information in a GenBank file might be tucked away
>in that RichSeq object, and how the hell to get at it.  I finally
>decided it was time to sift through that object and find out what's in
>there and where it is.  Along the way I wrote down  what I found, and
>then added a bit to it to try and turn it into something that I would
>like to have found when I first started learning Bioperl.  The result is
>a "GeneBank-file-centric" crib sheet for Bioperl.  The document is
>centered around a "standard" GenBank file where each seperate item of
>information is marked-up with  numbers and codes that refer to
>information later in the document about how to get at that particular
>value in using BioPerl.  It also includes a script at the end that can
>be modified easily to get at all the major stuff in the GenBank file.
>This document is aimed at beginners to BioPerl, but assumes a working
>knowledge of Perl.  The file is, of course, free to be used by anyone in
>anyway that they wish.  I'd appreciate any corrections or improvements
>if you find errors or shortcomings.
>
>Barry
>--
>
>Barry Moore
>Dept. of Human Genetics
>University of Utah
>Salt Lake City, UT
>
>----------------------------------------------------------------------------
>-------
>
>                 How to get values in a GenBank file with BioPerl
>
>This is a guide to accessing the value of each item in a GenBank file
>with BioPerl.
>Underneath each value in the sample GenBank file there is a number and a
>3-letter
>code.  The 3-letter code represents the object where the associated
>value is stored.
>Table 1 at the end of this file gives the object for each code.  By
>looking up the
>numbers in Table 2 at the end of this file you will find the object
>where this
>information can be called from, and the method to get (and usually set)
>the value.
>Some values may be stored by, and accessible by methods in more than one
>object.
>There is perl code at the end that you can use as a starting point to
>get at all of
>the objects and their methods.
>----------------------------------------------------------------------------
>-------
>LOCUS       NM_079543               1821 bp    mRNA    linear   INV
>10-DEC-2003
>            1-BSR                   2-BPS      3-BSR   4-BSR  5-BSR 6-BSR
>DEFINITION  Drosophila melanogaster alpha-Esterase-3 CG1257-PA (alpha-Est3)
>            mRNA, complete cds.
>            7-BPS
>ACCESSION   NM_079543 XX_123456
>            8-BPS     9-BSR
>VERSION     NM_079543.2  GI:24644853
>                      |     |
>                    10-BPS  11-BSR
>KEYWORDS    Esterases
>            12-BSR
>SOURCE      Drosophila melanogaster (fruit fly)
>            13-BSC
>  ORGANISM  Drosophila melanogaster
>            14-BSC     15-BSC
>            Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;
>            Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
>            Ephydroidea; Drosophilidae; Drosophila.
>            16-BSC
>REFERENCE   1  (bases 1 to 1821)
>                    17-BAR 18-BAR
>  AUTHORS   Adams,M.D. et al.(additional authors deleted for brevity)
>            19-BAR
>  TITLE     The genome sequence of Drosophila melanogaster
>            20-BAR
>  JOURNAL   Science 287 (5461), 2185-2195 (2000)
>            21-BAR
>  MEDLINE   20196006
>            22-BAR
>   PUBMED   10731132
>            23-BAR
>COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to
>final
>            NCBI review. This record is derived from an annotated genomic
>            sequence (NT_033777). The reference sequence was derived from
>            CG1257-RA.
>            On Nov 6, 2002 this sequence version replaced gi:17737826.
>            24-BAC
>FEATURES             Location/Qualifiers
>     source          1..1821
>                25-BLS  26-BLS
>                     /organism="Drosophila melanogaster"
>                     27-BSG
>                     /mol_type="mRNA"
>                     /db_xref="taxon:7227"
>                     /chromosome="3R"
>                     /note="genotype: y[1]; cn[1] bw[1] sp[1]; Rh6[1]"
>     gene            1..1821
>                     /gene="alpha-Est3"
>                     /locus_tag="CG1257"
>                     /note="alpha-Esterase-3; synonyms: B, aE3, CG1257,
>                     alphaE3, alpha-Ests, fragment B;
>                     go_function: carboxylesterase activity [goid 0004091]
>                     [evidence NAS]"
>                     /map="84D9-84D9"
>                     /db_xref="FLYBASE:FBgn0015571"
>                     /db_xref="GeneID:40907"
>                     /db_xref="LocusID:40907"
>     CDS             119..1750
>                     /gene="alpha-Est3"
>                     /locus_tag="CG1257"
>                     /EC_number="3.1.1.1"
>                     /note="alpha-Est3 gene product"
>                     /codon_start=1
>                     /product="alpha-Esterase-3 CG1257-PA"
>                     /protein_id="NP_524267.2"
>                     /db_xref="GI:24644854"
>                     /db_xref="FLYBASE:FBgn0015571"
>                     /db_xref="GeneID:40907"
>                     /db_xref="LocusID:40907"
>
>/translation="MESLQVNTTSGPVLGKQCTGVYGDEYVSFERIPYAQPPVGHLRF
>
>MAPLPVEPWSQPLDCTKPGQKPLQFNHYSKQLEGVEDCLYLNVYAKELDSPRPLPLIV
>
>FFFGGGFEKGDPTKELHSPDYFMMRDVVVVTVSYRVGPLGFLSLNDPAVGVPGNAGLK
>
>DQLLAMEWIKENAERFNGDPKNVTAFGESAGAASVHYLMLNPKAEGLFHKAILQSGNV
>
>LCSWALCTIKNLPHRLAVNLGMESAEHVTDAMVLDFLQKLPGEKLVRPYLLSAEEHLD
>
>DCVFQFGPMVEPYKTEHCALPNHPQELLDKAWGNRIPVLMSGTSFEGLLMYARVQMAP
>
>YLLTSLKKEPEHMLPLDVKRNLPQALARHLGQRLQETHFGGNDPSAMSPESLKAYCEY
>
>ASYKVFWHPILKTLRSRVKSSSASTYLYRFDFDSPTFNHQRLKYCGDKLRGVAHVDDH
>
>SYLWYGDFSWKLDKHTPEFLTIERMIDMLTSFARTSNPNCKLIQDQLPRAKEWKPLNS
>                     KSALECLNISENIKMMELPELQKLRVWESVCQSTG"
>ORIGIN
>        1 gtactatggc aaatggtact atgggcagtc gcgataataa taataaatga ggctttaaat
>       61 gtttttccac tacaagataa aaagttacga gtgcgcaccg agcatttagt agacgaacat
>          28-BPS
>... (additional sequence deleted for brevity)
>//
>----------------------------------------------------------------------------
>-------
>Table 1. Object Key
>BPS - Bio::PrimarySeq
>BSR - Bio::Seq::RichSeq
>BSC - Bio::Species
>BAC - Bio::Annotation::Comment
>BAR - Bio::Annotation::Reference
>BLS - Bio::Location::Simple
>BSG - Bio::SeqFeature::Generic
>----------------------------------------------------------------------------
>-------
>Table 2. Usage key
>1.   Bio::Seq::RichSeq           $scalar = $self->display_name() or
>->display_id()
>2.   Bio::PrimarySeq             $scalar = $self->length()
>3.   Bio::Seq::RichSeq           $scalar = $self->molecule()
>4.   Bio::PrimarySeq             $scalar = $self->is_circular
>5.   Bio::Seq::RichSeq           $scalar = $self->division()
>6.   Bio::seq::Richseq           @array  = $self->get_dates
>7.   Bio::PrimarySeq             $scalar = $self->description()
>8.   Bio::PrimarySeq             $scalar = $self->accession_number()
>     Bio::Seq::RichSeq           $scalar = $self->accession()
>9.   Bio::Seq::RichSeq           @array  = $self->get_secondary_accessions
>10.  Bio::PrimarySeq             $scalar = $self->version()
>     Bio::Seq::RichSeq           $scalar = $self->seq_version()
>11.  Bio::PrimarySeq             $scalar = $self->primary_id()
>12.  Bio::Seq::RichSeq           @array  = $self->get_keywords
>13.  Bio::Species                $scalar = $self->common_name()
>14.  Bio::Species                $scalar = $self->genus()
>15.  Bio::Species                $scalar = $self->species()
>16.  Bio::Species                @array  = $self->classification()
>17.  Bio::Annotation::Reference  $scalar = $self->start()
>18.  Bio::Annotation::Reference  $scalar = $self->end()
>19.  Bio::Annotation::Reference  $scalar = $self->authors()
>20.  Bio::Annotation::Reference  $scalar = $self->title()
>21.  Bio::Annotation::Reference  $scalar = $self->location()
>22.  Bio::Annotation::Reference  $scalar = $self->medline()
>23.  Bio::Annotation::Reference  $scalar = $self->pubmed()
>24.  Bio::Annotation::Comment    $scalar = $self->text() or ->as_text
>25.  Bio::SeqFeature::Generic    $scalar = $self->start()
>26.  Bio::SeqFeature::Generic    $scalar = $self->end()
>27.  Bio::SeqFeature::Generic
>
>Get all features from the features table using a variation on the code
>below.
>You will need to vary the primary_tag (e.g. source, gene, CDS) and the
>get_tag_values (e.g. organism, db_xref, chromosome, note, gene, locus_tag,
>go_function, map, EC_number, codon_start, protein_id, etc.).  If there
>is more
>than one primary tag associated with a give value (e.g. more than one
>gene) then
>you will have to step through the @source_feats array to find what you
>want rather
>than simply shifting the first scalar off the top.
>
>my @source_feats = grep { $_->primary_tag eq 'source' }
>$seq->get_SeqFeatures();
>my $source_feat = shift @source_feats;
>my @mol_type = $source_feat->get_tag_values('mol_type');
>
>28.    Bio::PrimarySeq    $scalar =    $self->seq()
>----------------------------------------------------------------------------
>-------
>#!/usr/bin/perl
>
>use strict;
>use warnings;
>use Bio::SeqIO;
>use Bio::DB::GenBank; #use Bio::DB::GenPept or Bio::DB::RefSeq if needed
>
>#Get some sequence IDs either like below, or read in from a file.  Note that
>#this sample script works with the accession numbers below (at least at
>the time
>#it was written).  If you add different accession numbers, and you get
>errors,
>#you may be calling for something that the sequence doesn't have.
>You'll have
>#to add your own error trapping code to handle that.
>my @ids = ('U59228', 'AB039327', 'BC035972');
>
>#Create the GenBank database object to read from the database.
>my $gb = new Bio::DB::GenBank();
>
>#Create a sequence stream to pass the sequences from the database to the
>program.
>my $seqio = $gb->get_Stream_by_id(\@ids);
>
>#Loop over all of the sequences that you requested.
>while (my $seq = $seqio->next_seq) {
>
>  #Here is how you get methods directly from the RichSeq object.  Replace
>  #'display_name' with any other method in Table 2. that can be called on
>  #either the RichSeq object directly, or the PrimarySeq object which it has
>  #inherited.
>  print $seq->display_name,"\n";
>
>  #Here is how to access the classification data from the species object.
>  my $species = $seq->species;
>  print $species->common_name,"\n";
>  my @class = $species->classification;
>  print "@class\n";
>
>  #Here is a general way to call things that are stored as a
>Bio::SeqFeature::
>  #Generic object.  Replace 'source' with any other of the "major"
>headings in
>  #the feature table (e.g gene, CDS, etc.) and replace 'organism' with
>any of
>  #the tag values found under that heading (mol_type, locus_tag, gene, etc.)
>  my @source_feats = grep { $_->primary_tag eq 'source' }
>$seq->get_SeqFeatures();
>  my $source_feat = shift @source_feats;
>  my @mol_type = $source_feat->get_tag_values('mol_type');
>  print "@mol_type\n";
>
>  #Here is a general way to call things that are stored as some type of a
>  #Bio::Annotation oject.  This includes reference information, and
>comments.
>  #Replace reference with 'comment' to get the comment, and replace
>  #$ref->authors with $ref->title (or location, medline, etc.) to get other
>  #reference categories
>  my $ann = $seq->annotation();
>  my @references = ($ann->get_Annotations('reference'));
>  my $ref = shift @references;
>  my ($title, $authors, $location, $pubmed, $reference);
>  if (defined $ref) {
>    $authors = $ref->authors;
>    print "$authors\n";
>  }
>  print "\n";
>}
>----------------------------------------------------------------------------
>-------
>This file is without copyright.  It may be reproduced, edited, and
>redistributed at
>will without notice to the author, however I would appreciate it if any
>corrections
>or improvements were sent to barry.moore@genetics.utah.edu
>
>
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
>  
>

-- 
Barry Moore
Dept. of Human Genetics
University of Utah
Salt Lake City, UT


From abrusan at mpil-ploen.mpg.de  Fri Mar  5 12:07:02 2004
From: abrusan at mpil-ploen.mpg.de (Gyoergy Abrusan)
Date: Fri Mar  5 12:11:42 2004
Subject: [Bioperl-l] CpG frequency
Message-ID: <4048B3B6.3040000@mpil-ploen.mpg.de>

Dear All,

I am a biologiist with quite "moderate" knowledge in bioinformatics. I 
would like to ask whether a script for estimating CpG frequencies (no. 
of CG pairs/no. of nucleotides) in files containing several DNA 
sequences (fasta format) exist in the bioperl database.

Thanking you in advance.
kind regards,

Gyorgy Abrusan

From rsucgang at bcm.tmc.edu  Fri Mar  5 12:10:31 2004
From: rsucgang at bcm.tmc.edu (richard sucgang phd)
Date: Fri Mar  5 12:16:29 2004
Subject: [Bioperl-l] CpG frequency
In-Reply-To: <4048B3B6.3040000@mpil-ploen.mpg.de>
References: <4048B3B6.3040000@mpil-ploen.mpg.de>
Message-ID: <f0602040abc6e64bdb892@[128.249.209.120]>

Gyrogy,

If this is your main goal, then I think bioperl may be a little 
overkill. Have you considered the EMBOSS suite? http://www.emboss.org
Particularly, the cpgreport tool

http://www.rfcgr.mrc.ac.uk/Software/EMBOSS/Apps/cpgreport.html

-r

At 6:07 PM +0100 3/5/04, Gyoergy Abrusan wrote:

>Dear All,
>
>I am a biologiist with quite "moderate" knowledge in bioinformatics. 
>I would like to ask whether a script for estimating CpG frequencies 
>(no. of CG pairs/no. of nucleotides) in files containing several DNA 
>sequences (fasta format) exist in the bioperl database.
-- 
Richard Sucgang, PhD
(713) 798 7657
Dictyostelium Genome Sequencing Project @ BCM
http://www.dictygenome.org/
From brian_osborne at cognia.com  Fri Mar  5 12:20:30 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Fri Mar  5 12:26:26 2004
Subject: [Bioperl-l] CpG frequency
In-Reply-To: <4048B3B6.3040000@mpil-ploen.mpg.de>
Message-ID: <GAEDKMGOKFBLJPKCLKCCGEDCDIAA.brian_osborne@cognia.com>

Gyorgy,

Yes, there's a script in the scripts/seqstats/ directory called gccalc.PLS
that will do this.


Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Gyoergy Abrusan
Sent: Friday, March 05, 2004 12:07 PM
To: bioperl-l@portal.open-bio.org
Subject: [Bioperl-l] CpG frequency

Dear All,

I am a biologiist with quite "moderate" knowledge in bioinformatics. I
would like to ask whether a script for estimating CpG frequencies (no.
of CG pairs/no. of nucleotides) in files containing several DNA
sequences (fasta format) exist in the bioperl database.

Thanking you in advance.
kind regards,

Gyorgy Abrusan

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From kviel at emory.edu  Fri Mar  5 12:44:44 2004
From: kviel at emory.edu (Kevin Roland Viel)
Date: Fri Mar  5 12:50:30 2004
Subject: [Bioperl-l] CpG frequency
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCGEDCDIAA.brian_osborne@cognia.com>
References: <GAEDKMGOKFBLJPKCLKCCGEDCDIAA.brian_osborne@cognia.com>
Message-ID: <Pine.GSO.4.58.0403051241220.29274@leukothea>

I am a novice at perl, but it seems to me that, in general, you can
compare the lengths of the original fasta to that with GC or CG pairs
"cut" out.  If you translate CpG to any other byte character then you will
need to multiple by 2, unless you want the number of pairs.

Regards,

Kevin


Kevin Viel
Department of Epidemiology
Rollins School of Public Health
Emory University
Atlanta, GA 30322

From laurichj at bioinfo.ucr.edu  Fri Mar  5 13:45:10 2004
From: laurichj at bioinfo.ucr.edu (Josh Lauricha)
Date: Fri Mar  5 13:50:57 2004
Subject: [Bioperl-l] Unusual behaviour of SeqIO::tigr
In-Reply-To: <404751D7.6030406@binf.ku.dk>
References: <Pine.LNX.4.50.0312281604220.29220-100000@tenero.duhs.duke.edu>
	<404751D7.6030406@binf.ku.dk>
Message-ID: <20040305184510.GB7843@batch107a>

On Thu 03/04/04 16:57, Morten Lindow wrote:
> I am trying to build a table of genomic features from the tigrxml-format 
> of a rice (pseudo)chromosome.
> 
> However the tigr-parser seem to behave differently from genbank/embl: 
> tigr.pm considers every TU/etc an  individual sequence, and hence resets 
> its coordinate system every time it starts on a new TU.
> 
> My question is:
> Is there a bioperl-way to get to the global coordinates, like when I am 
> parsing a genbankfile of a whole chromosome?

Sure:

my $tigrin = Bio::SeqIO->new( -format => 'tigr', -file => 'chr01.xml');
 
while (my $seq = $tigrin->next_seq){
    my ($source) = grep { $_->primary_tag() eq "source" }
        $seq->get_SeqFeatures();

    # These are the 5' and 3' ends of each TU
    my($end5) = $source->get_tag_values('end5');
    my($end3) = $source->get_tag_values('end3');
    my($strand) = $end3 <=> $end5;

    # Then foreach location just do:
    my $loc = get_some_location....
    $start = $end5 + ($loc->start() - 1)*$strand;
    $end   = $end5 + ($loc->end()   - 1)*$strand;
    ...
}

This bit of code isn't tested, but I've been using these alot, so it is
probably correct.

-- 

------------------------------------------------------
| Josh Lauricha            | Ford, your turning into |
| laurichj@bioinfo.ucr.edu | a penguin. Stop it.     |
| Bioinformatics, UCR      |                         |
|----------------------------------------------------|
| OpenPG:                                            |
|  5A0D 92D3 D093 79DE F724 1137 6DF1 B5EB D9CE AAA8 |
|----------------------------------------------------|
From scooby21georgia at hotmail.com  Fri Mar  5 18:08:11 2004
From: scooby21georgia at hotmail.com (lucas)
Date: Fri Mar  5 18:14:11 2004
Subject: [Bioperl-l] Impressive new discovery: The Virility Patch
Message-ID: <1078528091-18839@excite.com>

Publications such as Maxim, GQ and Stuff have written about the all new Virility Patch RX which is the first high-tech patch designed to dramatically increase both penis size and male potency. No other male erectile growth formula works faster, is more effective or is easier to use than Virility Patch RX.

Learn more about it here: http://drugsbusiness.com/pp/index.php?pid=eph9106


baskeT sarah1mission racerx kathy swimming sailor rufus 
sugar amanda1metallic dollars stingray
tootsie cardinal cougars 

Get off this list by going here: http://drugsbusiness.com/pp/applepie.php
From hlapp at gnf.org  Fri Mar  5 20:05:54 2004
From: hlapp at gnf.org (Hilmar Lapp)
Date: Fri Mar  5 20:11:46 2004
Subject: [Bioperl-l] DB.t failures
Message-ID: <6AF86968-6F0A-11D8-97AC-000A959EB4C4@gnf.org>

The date range query on Onchocerca volvulus[Organism] is failing on 
both main trunk and stable branch for two reasons.

First, NCBI either changed, or now enforces, the date syntax as 
YYYY/MM/DD. The MM/DD/YYYY format documented in Bio::DB::Query::GenBank 
(and possibly elsewhere) will not raise an exception; instead it just 
produces zero results. I changed the POD and the test to use the 
correct format.

Second, after this correction some tests still fail because not really 
expectedly the query produces different results meanwhile as the 
default is to query by modification date, which at some point will get 
updated, thereby yielding a different query result. There are tests 
which test the first three sequences returned for an expected length.

I changed those tests to use the new results, but this won't last long. 
Either, those tests need to be dropped, or maybe a safer query is to 
constrain by publication date if that's possible.

Any opinions? Any takers?

BTW I also fixed a couple of other errors and poor dealing with 
undefined values as the tests on the main trunk got way too noisy for 
my taste.

	-hilmar
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------

From hlapp at gnf.org  Fri Mar  5 20:15:59 2004
From: hlapp at gnf.org (Hilmar Lapp)
Date: Fri Mar  5 20:21:50 2004
Subject: [Bioperl-l] DB.t failures
In-Reply-To: <6AF86968-6F0A-11D8-97AC-000A959EB4C4@gnf.org>
Message-ID: <D378067A-6F0B-11D8-97AC-000A959EB4C4@gnf.org>


On Friday, March 5, 2004, at 05:05  PM, Hilmar Lapp wrote:

> because not really expectedly

that should have read 'not really unexpectedly' ...
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From cjm at fruitfly.org  Fri Mar  5 20:52:31 2004
From: cjm at fruitfly.org (Chris Mungall)
Date: Fri Mar  5 20:58:18 2004
Subject: [Bioperl-l] new GFF3 support methods added
Message-ID: <Pine.LNX.4.33.0403051652160.20795-100000@heartbroken.lbl.gov>


I have committed some new stuff to bioperl-live:

the script seq/unflatten_seq will now generate GFF3 - the unflattener
module is used to build the 'feature graph' connecting genes, transcripts,
exons and CDSs together. This means we can have GFF3 for anything in
genbank!

As far as I'm aware, the only other sensible output formats to use here
(ie formats that support feature graphs/containment hierarchies) are:
chado, chaos, and the write-only asciitree.

This feature graph is written out in the GFF3 using the ID and Parent
tags. To do this there is an extra intermediate step - the bioperl
FeatureHolderI hierarchy is traversed and ID/Parent tags are generated.

Here is a description of the changes I have made:

[unless you're a bioperl hacker you don't really need to read the rest of
this]

You can get the context of what I'm on about from this thread:
http://bioperl.org/pipermail/bioperl-l/2003-December/014150.html

Two new public methods:

  FeatureHolderI->set_ParentIDs_from_hierarchy

    sets both ID and ParentID from FeatureHolder hierarchy

  SeqFeatureI->generate_unique_persistent_id

    this is required by the above method

    Lincoln wanted this to be private, but I think it has
    to be called from outside

  FeatureHolderI->create_hierarchy_from_ParentIDs

    the inverse of set_ParentIDs_from_hierarchy

(note that I have put the implementation in the interface - in the absence
of proper abstract classes, this was deemed the best thing to do in the
previous discussion on this)

Modifications:

  SeqFeatureI->primary_id

This now maps to the tag_value 'ID' (ie the tag that GFF3 uses to uniquely
identify a feature).

Minor modification

  Bio::Tools::GFF now allows the -noparse=>1 option

  this is simply to stop the module waiting on input from STDIN
  when used in write-mode (maybe there's a better way of doing this
  but I didn't want to mess with this module)

Proto-test

  t/FeatureHolder.x

  This unflattens a genbank sequences and roundtrips it to chadoxml
  via GFF3

  This doesn't work yet - if you dump a splitfeature as GFF3 and
  re-import it, it becomes two features. Any volunteers to help
  fix this?

Unique IDs in bioperl:

In the discussion that preceeded this, it seemed that people liked the
idea of persistent unique IDs, but there was no suggestions as to how to
go about it. This is inherently difficult with objects, but I borrowed a
solution from relational modeling.

A persistent unique ID is generated using

  seq_id
  primary_tag
  start
  end

It is assumed that these are all set and comprise a "unique key" over
features. Of course, there's no way to enforce this with objects. The
generated ID is simply these values concatenated with : delimiters. You
can think of this is a skolem function if you're that way inclined.

Another assumption is that seq_id is unique and persistent.

Of course, if you're dealing with data that changes with time, then
changing the coordinates of a feature will change it's id. This is fine.
If you want to use your own IDs rather than the generated ones, you can
simply set the primary_id() field - or if you are using genbank files, add
something like this

  /ID=CG12345-RA

to the feature.

Stuff still to do:

* fix GFF3 to deal with roundtripping splitlocations

* A nest_features() method, as discussed in the previous email. This is
the opposite of set_ParentIDs_from_hierarchy(), for reading in GFF3 (and
then writing to a feature-graph compatible format, or to a database such
as biosql or chado).

* Bio::SeqIO::GFF3

I know a lot of us would like this a lot - is there any plans to implement
this yet?

* A GeneModel factory

This would take the output of the unflattener (a set of feature graphs
typed to SO) and make SeqFeature::Gene objects

Cheers
Chris

From hlapp at gmx.net  Sat Mar  6 00:33:15 2004
From: hlapp at gmx.net (Hilmar Lapp)
Date: Sat Mar  6 00:39:02 2004
Subject: [Bioperl-l] new GFF3 support methods added
In-Reply-To: <Pine.LNX.4.33.0403051652160.20795-100000@heartbroken.lbl.gov>
Message-ID: <C4908D80-6F2F-11D8-8AAD-000A959EB4C4@gmx.net>

Nice work Chris. I don't have very many comments.

On Friday, March 5, 2004, at 05:52  PM, Chris Mungall wrote:

>   FeatureHolderI->set_ParentIDs_from_hierarchy
>
>     sets both ID and ParentID from FeatureHolder hierarchy
>
>   FeatureHolderI->create_hierarchy_from_ParentIDs
>
>     the inverse of set_ParentIDs_from_hierarchy
>

Hmm - is this unique to feature graphs or to GFF3?

>   SeqFeatureI->generate_unique_persistent_id
>
>     this is required by the above method
>
>     Lincoln wanted this to be private, but I think it has
>     to be called from outside

I wouldn't want it to be private, but I'd rather make this a property 
representing a generator (with a single required method 
$generator->generate_id($feature)), or alternatively - and also simpler 
- a closure, than a method. The reason is that if you wanted to change 
the way the ID is generated you'd have to subclass an entire 
SeqFeatureI implementation instead of just setting the property to some 
anonymous method you whip up.

>
> Unique IDs in bioperl:
>
> In the discussion that preceeded this, it seemed that people liked the
> idea of persistent unique IDs, but there was no suggestions as to how 
> to
> go about it. This is inherently difficult with objects, but I borrowed 
> a
> solution from relational modeling.
>
> A persistent unique ID is generated using
>
>   seq_id
>   primary_tag
>   start
>   end
>
> It is assumed that these are all set and comprise a "unique key" over
> features.

Hmm. Wouldn't you need to include source_tag()? (Source_tag is part of 
the unique key in biosql.) Without the source_tag being part of this, 
wouldn't that mean you cannot have the exact (start+end) same segment 
predicted as exon by different methods and have those different 
predictions co-exist as separate features in the graph? (Presumably 
those would only differ in source_tag)

>
> Another assumption is that seq_id is unique and persistent.
>

I don't think that's going to be a very safe assumption.

>
> * A GeneModel factory
>
> This would take the output of the unflattener (a set of feature graphs
> typed to SO) and make SeqFeature::Gene objects
>

Yeah, that'd be cool ...

	-hilmar

-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From paula21arizona at hotmail.com  Sat Mar  6 04:09:40 2004
From: paula21arizona at hotmail.com (stacey)
Date: Sat Mar  6 04:19:50 2004
Subject: [Bioperl-l] V-1-A-G-R-A 70% off!!!
Message-ID: <1078564180-10997@excite.com>

The lowest price on V I A G R A on the 'net!!

Click here:
http://royaldrugs.com/sv/index.php?pid=eph9106


benoit percyelectric politics scooby tracy looney genius 
cesar sugarproperty charlott velvet
gray apple1 diane 

To get off this list, go to http://drugsbusiness.com/gv/applepie.php
From scannedr at tcd.ie  Sat Mar  6 15:55:21 2004
From: scannedr at tcd.ie (Devin Scannell)
Date: Sat Mar  6 16:01:06 2004
Subject: [Bioperl-l] getting a subtree
In-Reply-To: <Pine.LNX.4.50.0403041609260.18707-100000@tenero.duhs.duke.edu>
Message-ID: <951230CC-6FB0-11D8-960E-000393B293CE@tcd.ie>

Hi Jason and Deng,

I suspect Jasons solution was the required one but I have been getting 
a lot of use out of the method below which does something similar - 
return a  branch as a new tree object but also remove it from the 
original tree. I have been using it to split trees into well supported 
clades according to certain criteria.

if its useful and the caveats (1. only works for bifurcating nodes and 
2. does not correct branch lengths (it was written for use with phylip 
trees)) are not too problematic, I would be happy to have it added to 
the Tree::FunctionsI module. Failing this, if I get an interested 
reply, I'll correct these problems when less busy and resubmit.

best,

Devin


=head2 remove_branch

  Title   : remove_branch
  Usage   : $tree->remove_branch($first_node_on_branch_to_be_removed)
  Function: At a bifurcating node removes one descendant branch
            and replaces the ancestral branch with the other descendant 
branch.
            As a bonus the removed node is used to root and return a new 
tree.
            This function therefore effectively makes two trees from the 
original.
	  This method DOES NOT work at trifurcating nodes
            and DOES NOT correct branch lengths.
  Returns : New tree object corresponding to removed branch.
  Args    : Bio::Tree::NodeI object or node id
  Added   : Devin Scannell Wed Feb 11 22:09:28 GMT 2004

=cut

sub remove_branch {

     # if b is a bifurcating node
     # where a->b is the ancestral branch
     # and b->c, b->d are descendent branches
     # then	
     # 1) destroy central_node (b)
     # 2) make retain_node (c) a descendent of attach_node (a)
     # 3) return remove_node (d) as a new tree object

     my $self = shift;
     my $remove_node = shift;
     $remove_node = $self->find_node($remove_node) unless 
ref($remove_node);

     # some light checking 	

     unless ($remove_node) {
       	$self->warn("Requires Bio::Tree::NodeI object or node id as an 
argument");
	return undef;
     }
     unless (scalar($remove_node->ancestor->each_Descendent) == 2 ) {
	$self->warn("Tree not bifurcating");
	return undef;
     }

     # get attach_node && central_node

     my $central_node = $remove_node->ancestor;
     my $attach_node = $central_node->ancestor;

     # get retain_node

	my $retain_node = '';
     foreach ($remove_node->ancestor->each_Descendent) {
	$retain_node = $_;
	last if $retain_node ne $remove_node;
     }

    # destroy central_node

	$self->remove_Node($central_node);	

     # if no attach_node, tree is bisected
     # ie remove_node and retain_node are immediate descendants of the 
root node
     # therefore just reroot current tree at retain_node

     if ($attach_node) {
	$attach_node->add_Descendent($retain_node, 1);
     } else {
	$retain_node->ancestor(undef);
	$self->set_root_node($retain_node);
     }

     # make new tree from removed branch and return

     $remove_node->ancestor(undef);
     return(Bio::Tree::Tree->new(-root => $remove_node));
}


On Thursday, Mar 4, 2004, at 21:15 Europe/Dublin, Jason Stajich wrote:

> Deng - here is some code to answer your question.
>
> use Bio::Tree::Tree;
> use Bio::TreeIO;
>
> # get a tree somehow
> # choose the subnodes for finding the lca somehow
>
> # get the least common ancestor
> my $node = $tree->get_lca( -nodes => \@nodes );
>
> # build a new tree with its root at $node
> my $subtree = Bio::Tree::Tree->new(-root => $node, -nodelete => 1);
>
> my $out = Bio::TreeIO->new(-format=>'newick');
> $out->write_tree($subtree);
>
> -jason
>
> --
> Jason Stajich
> Duke University
> jason at cgt.mc.duke.edu
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
-----------------
Devin Scannell,
Wolfe Lab,
TCD,
Ireland
00353-1-6081288		
-----------------

From adejo1nge at hotmail.com  Sat Mar  6 16:57:51 2004
From: adejo1nge at hotmail.com (jesse)
Date: Sat Mar  6 17:03:34 2004
Subject: [Bioperl-l] Save up to 80% on your medications!
Message-ID: <1078610271-17575@excite.com>


Buy the drugs you need at huge discounts from Offshore Pharmacy!

http://bigdiscounts.selcydc.com/s95/index.php?id=s95


This weeks specials include great deals on:

- V i a g r a Soft Tabs - (Takes effect in a quarter of the time!)
- Super V i a g r a - (Lasts 8 times longer!)
- U l t r a m - (Pain Relief)
- S o m a - (Muscle Relaxants)
- Z y b a n - (Stop Smoking)
- P r o z a c - (Anti-Depressant)
- X a n a x - (Anti-Depressant)
- V a l i u m - (Anxiety Relief)

http://bigdiscounts.selcydc.com/s95/index.php?id=s95


   kennedy clperry </U> kyle pawlowski  raleigh6 unit malavi tracy    cday tjyeon kjunderw   jeremy <BR> sjames 
Get off this list by writing to getmeoff731@excitemail.com
From Steven.Roels at mpi.com  Sat Mar  6 17:00:33 2004
From: Steven.Roels at mpi.com (Roels, Steven)
Date: Sat Mar  6 17:07:13 2004
Subject: [Bioperl-l] DB.t failures
Message-ID: <C50012B03D77BC4CBBE8FE39E0B3B31C56B535@US-VS1.corp.mpi.com>


Hilmar,

See bugs #1594 (which describes the problem below) and #1595, which
mentions a $NUMTESTS problem with the penultimate tests in DB.t.

You may have seen them already, but if not - you just fixed at least one
:)

Thanks,

-Steve

*****************************************************************
Steve Roels, Ph.D.                       
Senior Scientist I - Computational Biology     Phone: (617) 761-6820
Millennium Pharmaceuticals, Inc.         FAX:   (617) 577-3555
640 Memorial Drive                       Email: roels@mpi.com
Cambridge, MA 02139-4853
*****************************************************************

>-----Original Message-----
>From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf
>Of Hilmar Lapp
>Sent: Friday, March 05, 2004 8:06 PM
>To: Bioperl
>Subject: [Bioperl-l] DB.t failures
>
>The date range query on Onchocerca volvulus[Organism] is failing on
>both main trunk and stable branch for two reasons.
>
>First, NCBI either changed, or now enforces, the date syntax as
>YYYY/MM/DD. The MM/DD/YYYY format documented in Bio::DB::Query::GenBank
>(and possibly elsewhere) will not raise an exception; instead it just
>produces zero results. I changed the POD and the test to use the
>correct format.
>
>Second, after this correction some tests still fail because not really
>expectedly the query produces different results meanwhile as the
>default is to query by modification date, which at some point will get
>updated, thereby yielding a different query result. There are tests
>which test the first three sequences returned for an expected length.
>
>I changed those tests to use the new results, but this won't last long.
>Either, those tests need to be dropped, or maybe a safer query is to
>constrain by publication date if that's possible.
>
>Any opinions? Any takers?
>
>BTW I also fixed a couple of other errors and poor dealing with
>undefined values as the tests on the main trunk got way too noisy for
>my taste.
>
>	-hilmar
>--
>-------------------------------------------------------------
>Hilmar Lapp                            email: lapp at gnf.org
>GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
>-------------------------------------------------------------
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l


This e-mail, including any attachments, is a confidential business communication, and may contain information that is confidential, proprietary and/or privileged.  This e-mail is intended only for the individual(s) to whom it is addressed, and may not be saved, copied, printed, disclosed or used by anyone else.  If you are not the(an) intended recipient, please immediately delete this e-mail from your computer system and notify the sender.  Thank you.


From turbo21pacers at hotmail.com  Sun Mar  7 14:56:03 2004
From: turbo21pacers at hotmail.com (buford)
Date: Sun Mar  7 15:01:57 2004
Subject: [Bioperl-l] Save up to 80% on your medications!
Message-ID: <1078689363-8979@excite.com>

Buy the drugs you need at huge discounts from Offshore Pharmacy!

http://olivier.selcydc.com/s95/index.php?id=s95


This weeks specials include great deals on:

- V i a g r a Soft Tabs - (Takes effect in a quarter of the time!)
- Super V i a g r a - (Lasts 8 times longer!)
- U l t r a m - (Pain Relief)
- S o m a - (Muscle Relaxants)
- Z y b a n - (Stop Smoking)
- P r o z a c - (Anti-Depressant)
- X a n a x - (Anti-Depressant)
- V a l i u m - (Anxiety Relief)

http://mailman.selcydc.com/s95/index.php?id=s95


deutsch zhongguomagnum julia dolphins hello1 catfish metallic 
metallic basilrhonda bernie tacobell
phoenix1 action benson 

Get off this list by going here: http://wolfMan.royaldrugs.com/pp/applepie.php
From gretchen21bobcat at hotmail.com  Sun Mar  7 15:50:32 2004
From: gretchen21bobcat at hotmail.com (gil)
Date: Sun Mar  7 16:00:47 2004
Subject: [Bioperl-l] Super Cheap V-1-A-G-R-A!! 70% off!
Message-ID: <1078692632-26932@excite.com>

The lowest price on V I A G R A on the 'net!!

Click here:
http://sally.royaldrugs.com/gv/index.php?pid=eph9106


shawn wondercorrado master1 kenneth lady safety fountain 
hazel gordonjazz fiona fozzie
olivier sapphire saskia 

To get off this list, go to http://velvet.royaldrugs.com/gv/applepie.php
From ericbrown99 at yahoo.com.au  Sun Mar  7 20:39:06 2004
From: ericbrown99 at yahoo.com.au (=?iso-8859-1?q?Eric=20Brown?=)
Date: Sun Mar  7 20:44:47 2004
Subject: [Bioperl-l] Why Perl
Message-ID: <20040308013906.78435.qmail@web41205.mail.yahoo.com>

Dear All,

I am new to the discussion group.I am student in
comparative genomics and know bit of python and java.
Would anybody advise me about making choice between
Perl and Python for bioinformatic applications. I
guess userbase, resorce richness, job perspective and
language stregnth are issues with me to compare and
start one. 

With thanks and regards,

Eric


Find local movie times and trailers on Yahoo! Movies.
http://au.movies.yahoo.com
From joel21sbdc at hotmail.com  Mon Mar  8 03:08:54 2004
From: joel21sbdc at hotmail.com (darnell)
Date: Mon Mar  8 03:14:39 2004
Subject: [Bioperl-l] sugper viagrga
Message-ID: <1078733334-9608@excite.com>


Did you know That the normal cost for Super V i a g r a is $20, per dose?

We are running a hot special!! T0DAY Its only an amazing $3.00

Shipped world wide!

DISC0UNT 0RDER: http://baskeT.royaldrugs.com/sv/index.php?pid=eph9106


christop publicaliens first spitfire sasha concept ladybug 
fireball orchidobiwan turbo deutsch
spain mailer dundee 

Get off this list by going to http://e-mail.royaldrugs.com/sv/applepie.php
From pvh at egenetics.com  Mon Mar  8 03:50:13 2004
From: pvh at egenetics.com (Peter van Heusden)
Date: Mon Mar  8 03:56:01 2004
Subject: [Bioperl-l] Testing BioPerl objects for equality
In-Reply-To: <200402031137.34626.lstein@cshl.edu>
References: <Pine.OSX.4.44.0401290935590.2412-100000@ewan-birneys-computer.local>
	<401E1528.5060309@biomax.de> <200402031137.34626.lstein@cshl.edu>
Message-ID: <404C33C5.6000208@egenetics.com>

Lincoln Stein wrote:

>I think that's a great idea.  I hadn't known about test_deeply().  
>There's also a Test::Differences module, that does something similar.
>
>  
>
The problem with doing a data structure level comparison is that you 
might now want to test the data structures for literal equality - 
equality != equivalence. That's why the equals() methods that I've been 
adding to various classes try to be smart about how Bioperl objects 
actually work.

Having said that, I must still think about how to make diff'ing work... 
it is frustrating to simply have equals / doesn't equal, but no details. 
I'll look into that after I've finished the initial code.

Peter

>Lincoln
>
>On Monday 02 February 2004 11:15 am, Bernhard Schmalhofer wrote:
>  
>
>>Ewan Birney wrote:
>>    
>>
>>>On Thu, 29 Jan 2004, Peter van Heusden wrote:
>>>      
>>>
>>>>I've got an idea for testing where I'd like to 'round-trip'
>>>>through SeqIO: read in from a file on disk, write out again with
>>>>write_seq() and then read in the file written by write_seq() and
>>>>compare the two sequence objects. If they aren't equal, it means
>>>>we've got a problem.
>>>>        
>>>>
>>>That sounds like a great idea... we've always had problems with
>>>diff'ing the files because of whitespace issues, but diff'ing the
>>>objects sounds great.
>>>
>>>      
>>>
>>>>To make this work requires some kind of equals() method on Seq,
>>>>SeqFeature, etc. This doesn't seem to be there at the moment - or
>>>>am I missing something? Maybe there should probably be some kind
>>>>of Bio::ComparableI interface which provides an equals()
>>>>abstract method.
>>>>        
>>>>
>>If the roundtrip is starting from a file is a specific format,
>>shouldn't it be possiple to compare the data structures of the
>>sequence object directly?
>>I was think of using something like Test::More::is_deeply(), which
>>tells you where the data structures start to become different.
>>
>>CU, Bernhard
>>    
>>
>
>  
>

From Richard.Adams at ed.ac.uk  Mon Mar  8 04:31:03 2004
From: Richard.Adams at ed.ac.uk (Richard Adams)
Date: Mon Mar  8 04:36:44 2004
Subject: [Bioperl-l] protein networks
Message-ID: <404C3D57.2050600@ed.ac.uk>

Hi,
I've been writing some modules to apply Nathan Goodman's 
"SimpleGraph.pm" module to protein interaction networks. He has
written this as a replacement for the the broken CPAN graph modules. The 
aim of these modules is

1. To convert protein interaction data between differentet formats
2. To be able to link protein interaction data to protein sequence data.
3. To be able to read in interaction data from separate sources, and 
make a merged graph with duplicate interactions flagged or removed.


Could some core person give some idea for namespace to put these in CVS?

The modules are :
1. Nathan's modules
SimpleGraph.pm                   - a generic graph  module with methods 
to build and represent a graph
SimpleGraph::Traversal.pm - generic methods to navigate a graph.

2. My modules

a. An I/O system analagous to SeqIO that lazyloads the correct modules 
for a particular format
     ProteinGraphIO.pm      - with next_graph, write_graph methods
       ProteinGraphIO::dip.pm    - parser for DIP format interactions
      ProteinGraphIO::xml.pm    - to read in PSI_XML formatted records 
(still being worked on).

b.   Modules to represent a protein network:
    PrimaryProteinGraph.pm - subclass of SimpleGraph.pm. This will be 
made from a dip record, analagous to PrimarySeq
    FullProteinGraph .pm       - subclas of SimpleGraph.pm,  contains 
PrimaryProteinGraph and metadata of graph, will be made from an XML 
file, analagous to RichSeq.

c.  Classes to represent nodes/ edges.
   NodeI.pm    - interface describing methods that all nodes should have
        ProteinNode.pm - implements NodeI, currently has methods 
primary_id(), seq(), secondary_id()
         InteractionI.pm    - interface describing methods all edges 
should have
        ProteinInteraction.pm - implements InteractionI, currently has 
methods weight(), interactors(), id().

Cheers Richard


-- 
Dr Richard Adams
Psychiatric Genetics Group,
Medical Genetics,
Molecular Medicine Centre,
Western General Hospital,
Crewe Rd West,
Edinburgh UK
EH4 2XU

Tel: 44 131 651 1084
richard.adams@ed.ac.uk


From Mailer-Daemon at eff.org  Mon Mar  8 11:07:45 2004
From: Mailer-Daemon at eff.org (Mail Delivery System)
Date: Mon Mar  8 11:13:27 2004
Subject: [Bioperl-l] Mail delivery failed: returning message to sender
Message-ID: <E1B0NI9-0001a0-DK@owl.eff.org>

This message was created automatically by mail delivery software.

A message that you sent could not be delivered to one or more of its
recipients. This is a permanent error. The following address(es) failed:

  selena@eff.org
    Unrouteable address

------ This is a copy of the message, including all the headers. ------
------ The body of the message is 24043 characters long; only the first
------ 8192 or so are included here.

Return-path: <bioperl-l@bioperl.org>
Received: from pc-185.soz.uni-hannover.de ([130.75.141.185] helo=eff.org)
	by owl.eff.org with esmtp (Exim 4.22)
	id 1B0NI8-0001Zu-7y
	for selena@eff.org; Mon, 08 Mar 2004 08:07:44 -0800
From: bioperl-l@bioperl.org
To: selena@eff.org
Subject: Re: Excel file
Date: Mon, 8 Mar 2004 17:07:48 +0100
MIME-Version: 1.0
Content-Type: multipart/mixed;
	boundary="----=_NextPart_000_0010_00003C2D.000034E1"
X-Priority: 3
X-MSMail-Priority: Normal
Message-Id: <E1B0NI8-0001Zu-7y@owl.eff.org>

This is a multi-part message in MIME format.

------=_NextPart_000_0010_00003C2D.000034E1
Content-Type: text/plain;
	charset="Windows-1252"
Content-Transfer-Encoding: 7bit

See the attached file for details.

------=_NextPart_000_0010_00003C2D.000034E1
Content-Type: application/octet-stream;
	name="document_excel.pif"
Content-Transfer-Encoding: base64
Content-Disposition: attachment;
	filename="document_excel.pif"

TVqQAAMAAAAEAAAA//8AALgAAAAAAAAAQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAuAAAAKvnXsbvhjCV74Ywle+GMJVsmj6V44YwlQeZOpX2hjCV74YxlbiGMJVsjm2V
4oYwlQeZO5XqhjCVV4A2le6GMJVSaWNo74YwlQAAAAAAAAAAQ29tcHJlc3NlZCBieSBQZXRp
dGUgKGMpMTk5OSBJYW4gTHVjay4AAFBFAABMAQMA6ZtBQAAAAAAAAAAA4AAPAQsBBgAASAAA
APAAAAAAAABCcAEAABAAAABgAAAAAEAAABAAAAACAAAEAAAAAAAAAAQAAAAAAAAAAIABAAAE
AAAAAAAAAgAAAAAAEAAAEAAAAAAQAAAQAAAAAAAAEAAAAAAAAAAAAAAA/HEBAK8BAAAAYAEA
EAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
LnBldGl0ZQAAUAEAABAAAAA8AAAACAAAAAAAAAAAAAAAAAAAYAAA4AAAAAAAAAAAABAAAABg
AQAQAAAAAEQAAAAAAAAAAAAAAAAAAEAAAEAAAAAAAAAAAKsDAAAAcAEAAAQAAAAEAAAAAAAA
AAAAAAAAAABgAADiAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAIgC
AAAjWZWUi0QkBIPEKo2QNAAAAIPECGoQi9hmBS0AUFJqAIsb/xNq//9TDEVSUk9SIQBDb3Jy
dXB0IERhdGEhALgAcEEAaNFrQABk/zUAAAAAZIklAAAAAGacYFBoAABAAIs8JIswZoHHgAeN
dAYIiTiLXhBQVmoCaIAIAABXahNqBlZqBGiACAAAV//Tg+4IWfOlWWaDx2iBxsIAAADzpf/T
WI2QuAEAAIsKD7rxH3MWiwQk/Yvwi/gDcgQDegjzpYPCDPzr4oPCEIta9IXbdNiLBCSLevgD
+FKNNAHrF1hYWFp0xOkc////AtJ1B4oWg+7/EtLDgfsAAAEAcw5oYMD//2hg/P//tgXrIoH7
AAAEAHMOaICB//9ogPn//7YH6wxoAIP//2gA+///tghqADLSS6QzyYP7AH6k6Kr///9yF6Qw
X/9L6+1B6Jv///8TyeiU////cvLDM+3o6f///4PpA3MGiwQkQesji8EPts7odf///xPASXX2
g/D/O0QkBIPVATtEJAiD1QCJBCToV////xPJ6FD///8TyXUI6Kb///+DwQIDzVYr2Y00OPOk
XuuDLovAuA4AgNxKAAD8XwEAICUBAKlGAAAAEAAArxIAAN5PAQAmDwAAAGAAALQBAACVVwEA
5BIAAABwAAA4ugEAAAAAAMYTAAAAAAAAAAAAAAAAAABicwEAiHIBAAAAAAAAAAAAAAAAAG1z
AQCUcgEAAAAAAAAAAAAAAAAAenMBAKhyAQAAAAAAAAAAAAAAAACGcwEAsHIBAAAAAAAAAAAA
AAAAAJFzAQC4cgEAAAAAAAAAAAAAAAAAnnMBAMByAQAAAAAAAAAAAAAAAAAAAAAAAAAAAMhy
AQDWcgEAAAAAAOJyAQDwcgEAAHMBABJzAQAAAAAAJHMBAAAAAAALAACAAAAAAEBzAQAAAAAA
VHMBAAAAAAAAAE1lc3NhZ2VCb3hBAAAAd3NwcmludGZBAAAARXhpdFByb2Nlc3MAAABMb2Fk
TGlicmFyeUEAAAAAR2V0UHJvY0FkZHJlc3MAAAAAVmlydHVhbFByb3RlY3QAAAAASW50ZXJu
ZXRHZXRDb25uZWN0ZWRTdGF0ZQAAAEdldE5ldHdvcmtQYXJhbXMAAAAAUmVnT3BlbktleUEA
VVNFUjMyLmRsbABLRVJORUwzMi5kbGwAV0lOSU5FVC5kbGwAV1MyXzMyLmRsbABpcGhscGFw
aS5kbGwAQURWQVBJMzIuZGxsAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAABVACNL
LeCo9fUqAN2XrU+vUqlvABioluG9wPiQAMukUQTRgwCWAAh8qPCIC46DGwsqdsh4rZIAff8q
c3UyNDah4RiNMLEZ5wLoY+8nAGEAAACf0B59LFAEyC92WUGoz7dMAENKSTV9SfNMFsaLNcr/
Fv1JH7pmAAz4ST+5Lje4ADBpaxfaVNyoKVsn6WaIgGsa2xs1XVso89/0VBJZEQgX5bEWjCwK
qlyNQcKD7RjLg3xeEl8VcPcISg3wx0DdLWFWA1+QEk6COEiI9CmEAHeOVp81jodfBoA8bgTL
ukUA8PSqislLA8oDo/220qcHaQa/vM2/RlJdDancS8uEx0LEhVW8lAcAn2XWp8YU3gGVd5/w
rGdAQTSKGzbUfpTtxgpweFp0NfaVLQU4RZJQikZ++nALsQw8A2oXYVErIyhKZD2rHA29DFJQ
ACWwproYIpZZyW0kw88Vq7fAJtzrbCK931+m5uVEwtKGp6zcLHTNSZTO8IsSJk/mGkz94/HU
gF+O9FqBx24MIOl8X88RU1+p9LJotlZpzVZfWWS2/3IKl3eGgym+14JqOdlGpM3aIpS5KQSm
nmCwR7hG3La+JUXw+KOiSrSNvpSl9cvtqp+YRcDGxmgowiP+VQp02W2wDRRq9g86LaCVElpe
smugpDsZcpSnPM2teZUv2AijvJj8pLhQqTaCkxAVw4EdYaiKohdLr2nLQG1Q+CcmMQU0Y9oy
LFAQ1HKvGtZcAK6iJukK3oJM8rIDNUlgl+duAIUVbILFtJs4AnhLdPUsdDl2vKJo+V1KN8Rn
5F2FAOSZjm6qHl6hsFKXITMx1F0b3W+RR5ewnlJ2ijs2S3+6t9ExQ0HbEIP4tAbDmz4tTVz7
+dsaefWquHZqzscNQkXH2JoeWqO+HRaHfX0yCgXD+LwP2fnyv/0BEGyJVmR5MQtfQysE8+IU
W2XfJsUlTX/OV+wgyi27Ru/m0QRHEBXtRqv7oFbAZDyFk6EgcArlmkn3ljcfmkFE4p1uD/ox
WeO00ACBAo36ZfsBFbrKQo6+D8SHFHEobC435RAFV3o6AmwP7h9PYYlAqyjkqRfhchhx3h32
DFhXsKSFkyyXJYcVCwhoyxZVCpQsiOKLXjr6yGiuSFhl2aipTFS6Grt9o1Av3ZCM85bYI+fA
8KiRk+dcg4p2KvmB3VJxT77x2sFrFEUR401jiIcNWm+BWkhtEWS5xIk9Z1sg2Ce1WFgX0gBR
sgQZSak1T3AkCdZJxzljSgEfDNpLSEFFqhcm+tdYUCPLFtWHkFsXyzUDE4UQZ1m15HaK/50n
1CoBq2Vd8hRXEoV8fQdZDL9hwVprCrSsBLn+rgcOm9GDgDqhkiMtjGsCqVSLyz+9ngstKZLF
tAlWBYpHlkqqxX985aMuleq+uK5jVU2k3MncgXMw8vp1VHhVxZW/cU8Cjocec1ZTbWXYaWRX
d6rUage4iBa7Vbtmp6PgUUQauljiMD8BysbzEn7rIObYhKNRVLLr6zUHvpgv2XA8j1tmS/fT
g9/51fz+koj5CWTe3gAfmIPlbU09+/EqBFN4Pz0urYYRt3+zUMFAkt23Ya3zleTkX7/XQyiZ
rDKo3DgBbL3fwj/ONGHF0ZQSKiLLvi5sXtqrsBNPDpFo0S9apBvopVxHGxtJ2ShT1ygoqMe4
M5z/Kt94SEISqPIyuOeUahnOejNTlEso1j8WzBMhGkboBvIX03YUEXdCST3CoZKdnX9dgQBK
IQiLE4sQ00KN/XgYuZQV8iI0Gvk7J3Pg0e1heEDgbbXinPsTao/fSdjYJNaS19wgI8V0+KL1
wgqBv+LFtDP4QSFVizkytEgbjyHppNcS9C7HV2oQiUPi7zHC0lf+eMlU6whh6ISeQh0Qm+Tw
DIA30DHBPID2CYT7QgYh4AMQ/QCI+gVEh3oi9A8R6RQI7hGE00IuJ8g70IoJa46hlPS+U0/9
TUyNcnWBdyifheqKg0zoIh4x/jkDFZrgFj656KQo3T6s3+IcZdOZCD3OuAQ6xibNyGM/Mo5+
D50GDMy1FopoIW8Pwps/+sNx0vLIKMOOZcrIshqwl8RZqdRqiaBzIHYDc38L+90eZo9pgI+k
B5Lp+rgH11918NtvrhrsqdQXQfIrqrt5NVNh73UedLnMws8uNX2SSjxqOBUqz/d5KN5ZKbqH
boRPpgOjUKHeI2NRxSoiRWwjCNHPfGKC8eHjhg9VEYAw9FaNu8sRsFeq/jwmDs80hqP5pJmh
Aq1pF+ybAsVXG5aA8LXaRIUsI2XgpavSjIsipFg3RDPfnw6txK280umBEKEUpw3qoVWjLvb+
bWqv/PeAHVUAY3BsOGjc15UE/VNv0pNHi04SsrMq8EVrtK8ofwCW/cDRC6TIbG+7kpVuWRAV
SzW8zvtjfQwBLl0rXHxjeH3GIUzGs0lVNzLEC5Fq00kw0wNzGPGknSNWCAQTjrxMpPQ9JXOm
gB6BDEpOOwwDcQ6OBjY9jMGJInlc6kh/ZcGR0mBhlf0og2/1YxjBsxxE1a4M2ZgzJuzirUjC
9LMKxsV3Gm06RYVxAIMQVFk9hZbPF7BTNQ+psFa/SMJtrwHHYAASxQWfwB6ho1BQ2N2+0F5c
OvkHpAW4nMKGmSw5qECCBRaKnGRqbF9zZTSHfqxLlTqh31bqSktIZEOnKapSDbkRwrBhCng5
UiEBxcN4xeqjPTMr7dqPaOGKeh/wFezpNjKsTR1Ee6r79p0UHqn/1SfpWbEN7kKu8P3wOojn
ba4huqWVO3+YFYTXeV1XkorMlnnvKGLr64BURLpNMiyJ2sxvpVrvLEX0UatcQ+QUiXKyhtK6
3CWN8ylugubFaopS2mb4HPyEALSScfn3JB4uuvAtgAr9lF+Z9SDWWM6qavPuoKb1JSimf/Mu
j0YSA3mCGTCyyImKBKj4dDu+yu5hdMw8QB2TWmXahdMCa5aSZZu1qa9AmqglbXQI1v95Ssbc
Qp/l3MvXi6KMTER/Nyzj+qKEQKZBB2TgOqoOtI8NNcTwtYfxqQWQEV1ESjqWPkOikCfhYSsg
Vp1+dG2dLhft6Xwf3OzN9Whf7UoZBy3Mjuk/BTjyXhbpvGjMFihaccBcQJjtRg8hMNUyubjk
FQqOAoVRH+Py+B1YEjtZaT3HDuMPi82wfFG0BP5nusv+yVOqpUb6HDuTBiAooQ5s3sd/TAMK
roRKpChG69cOBEOGOqMOoX8UVlLevoCyvR4nbHjmhoG0mY2HElSO0ZUoOZaoJu3h5B8gPrZe
wcwWqIMQ21F1DvGUQROTF69wkEAoBLQCF6gYSdrNDiV8kVok20BYckan3kE6vET7qEDsQVF9
ZIUGbyEp1T6hkbnc8W3VZaSl4K64VzU+d/OLyhg5AqykIWLqoQGbrCIMiFN4+LEI2TZ/FEKl
GHx5vIFVfs+Pi9m5xdMUc8ig8TOqljK9E3jEr+uK04Oq/WdL/qQL73RDT4gxEd2sgwBMDoeT
BUALehFBdg5lvyBWNPWKcrjIEoUx0Nj6rzNk2ee0gAl92qlUo+NCswUMDX3ipRsYitqIti8K
/s9RIgLOE0c+CHv+nUI6or0ljMzIJYEHW1klZTbUxzORtMEKZhFTVFnkoq/glKRAqdD7pKZO
XftGIhTcuvg1x7FayLmqu3teiVsn368OqDRz/PrKUuwOt4n1M1I73n/2oehFjkecmwLrL2yu
kp2Jx99E8kAH/65NmTzc3hQEiAanzQX0koFreHV/oEjylVs93Sv1nkdRugr9wb9zSNEKrq8t
JPdBzywrspUPFnOSSmfJgYBN27A5TCsOvTmBn699vMQVFnY6gms+Yc0FU9XqYJtA5fdQFacc
fi/qIKAA5sZULkiLKB5siiPBnIXw0IOL6Mlj28jKgPhtkRTrnOun0+IbebC+RzMy0CMSlmED
k0w15bLDS9rAQT/DsDsj8WPfGfXy2buuik5i9P5hO9Rm+QrfgPO0XZ7Zk7vlVFudLwVkBW13
M3W+jYf37AMrrTTzDgxEuzLjSB98BQI8mVDcRo4KVHVTxlRWWsV/bPKASKNgi280HsaS800i
/iQiTRBzkZAijmoUBAu1BhrpsO22pkYSiFsQ+Yajm6pF+UiJ0Ff/YpeUt6fQGZvzYyne3/Uq
qECfj+4kpw46tcjxsYr9wEPPKpOvqFkfeTEkdlSUdJH6WlR6fW23VpFXXOyYn98gvDJHWvzZ
PDulzAsDdP6D/l1EZYtWe5stM99zPXQQV94mXbQJ9fE9XMqpkxC8gR0OXN3WKosbMyIxIiN+
Ter1r8tzfo+DF5nDAHBqqVMzQ9ghnxqCiNVGvb69zgLjVJ7RiNYXiMy/LyGlsNXW91hCBoH6
xeRvpGxPinSYNkVFkQ87kEeIWKD1r6ZYrVYHaMUmKmydtEjILihVY0v4F+C3BsKCzG51o5r/
x7tscbmaToDQATpSxaLRwOmfVxJh+79fv5J0Sd2pgs4iyWCrwzmEp19DW0Xy8cPif+0Ih0pu
+SsghezWhw0MZhBphY02c6q7hLqEXIMUM0w5dxC5RUranc9b5nNAmYdoLvVcU0hMsv8onuaZ
1eXqHYcE9RvshDM+q24OPdA8DrflTv+n3eFCnLvVtASq/IWEAvhUNIeomzpOkev3o8tfLU7R
2Jje0Cyt4r7dsxTN6ueT9okxtwEgfwmT7wB9jry7NaCe2IfpJm7fsfyGXJ2+mSdEi15LR1w+
yAQz32aMRxA7+TqQ7tW6qRT60pB0EvsuT03t4Q3J7wvImVmmXPEGfUD+IWSqvgFr4BdMin2o
6QiwQ6m0SZ50A8GVoyHrxdnsDilfWYkfjJ+cJJgtXQSchpbRp/h+aAITeSQeUYq/o7V8bYKa
CORrQlXb6L9rQq3Y6lGydhhgGy1UNarlimuVVTqKQv1VrRUkRdYiHphy0WWijssLCEbBuKZe
mWJ4WeI2mcUZqWdbyoQrqghR5pYo4qSFGur
From barry.moore at genetics.utah.edu  Fri Mar  5 09:26:47 2004
From: barry.moore at genetics.utah.edu (Barry Moore)
Date: Mon Mar  8 11:20:20 2004
Subject: [Bioperl-l] Getting values from a GenBank File
In-Reply-To: <200403051039.45640.heikki@ebi.ac.uk>
References: <404791F9.2030103@genetics.utah.edu>
	<200403051039.45640.heikki@ebi.ac.uk>
Message-ID: <40488E27.8010002@genetics.utah.edu>

Heikki-

Sure,  but I've got to figure out DocBook first - never used it.  I've  
got the web site, and downloaded the book.  I'll get to that as soon as 
I can.

Barry

Heikki Lehvaslaiho wrote:

>Barry,
>
>I like your RichSeq breakdown! It would make a great addition into BioPerl 
>howtos. Would you be interested in converting it into DocBook yourself? 
>
>	-Heikki
>
>On Thursday 04 Mar 2004 20:30, Barry Moore wrote:
>  
>
>>While learning BioPerl, I've spent a good bit of time trying to figure
>>out where some bit of information in a GenBank file might be tucked away
>>in that RichSeq object, and how the hell to get at it.  I finally
>>decided it was time to sift through that object and find out what's in
>>there and where it is.  Along the way I wrote down  what I found, and
>>then added a bit to it to try and turn it into something that I would
>>like to have found when I first started learning Bioperl.  The result is
>>a "GeneBank-file-centric" crib sheet for Bioperl.  The document is
>>centered around a "standard" GenBank file where each seperate item of
>>information is marked-up with  numbers and codes that refer to
>>information later in the document about how to get at that particular
>>value in using BioPerl.  It also includes a script at the end that can
>>be modified easily to get at all the major stuff in the GenBank file.
>>This document is aimed at beginners to BioPerl, but assumes a working
>>knowledge of Perl.  The file is, of course, free to be used by anyone in
>>anyway that they wish.  I'd appreciate any corrections or improvements
>>if you find errors or shortcomings.
>>
>>Barry
>>    
>>
>
>  
>

-- 
Barry Moore
Dept. of Human Genetics
University of Utah
Salt Lake City, UT

From bsman at tom.com  Sat Mar  6 14:00:34 2004
From: bsman at tom.com (=?gb2312?B?zfXqySwgU2FuIFdhbmc=?=)
Date: Mon Mar  8 11:20:38 2004
Subject: [Bioperl-l] Can't locate object method "start" via package
	"Bio::Seq::RichSeq"
Message-ID: <200403061906.i26J6I9Q010205@portal.open-bio.org>

Dear bioperl-l:
 The code
------
	#!/usr/bin/perl
 use Bio::Tools::SiRNA; 
 use Bio::Seq::RichSeq; 
 use strict;

 my  $seq='ATGGGGGTGGTGGTACCCTTTATCGGAAATTTACCTGCTGCACAGGAATGTTTTCTGGTGT';
 #$seq=~s/T/U/;
 my $bio_rich_seq = Bio::Seq::RichSeq->new( -seq =>$seq , -start=>1, -end => 100);
 $bio_rich_seq->molecule('mRNA');
 my $sirna_designer = Bio::Tools::SiRNA->new( -target => $bio_rich_seq );  
 my @pairs = $sirna_designer->design; 
 foreach my $pair (@pairs) {      
	my $sense_oligo_sequence = $pair->sense->seq;     
	print  $sense_oligo_sequence;
	my $antisense_oligo_sequence = $pair->antisense->seq;     
	# print out results      
	#print join ("\t", $pair->start, $pair->end, $pair->rank,  $sense_oligo_sequence, $antisense_oligo_sequence), "\n";  
  }
 
 ----

Can't locate object method "start" via package "Bio::Seq::RichSeq" (perhaps you forgot to load "Bio::Seq::RichSeq"?) at D:/Tools/perl/site/lib/Bio/Tools/SiRNA.pm line 263.

	Best Regards!
 				
			Yours sincerely
��������������������, San Wang
����������������bsman@tom.com
��������������������2004-03-07
 ,-,-.     ,-,-.     ,-,-.     ,-,-.     ,-
/ / \ \   / / \ \   / / \ \   / / \ \   / /
     \ \ / /   \ \ / /   \ \ / /   \ \ / / 
      `-'-'     `-'-'     `-'-'     `-'-'  


San Wang
Dr. Student
Bioinformatics and Gene Network Research Group
Zhejiang University
P.R.China

Address:
Room 415# ,Bio-technology Institute,
Zhejiang University,
Kaixuan Road 268#  Hangzhou
Zhejiang, P.R.China
310027

Tel:
0086-571-86892051
Fax:
0086-571-86961525
E-mail:
bsman@163.com
Website:
http://www.estarray.org 

��������������������
  Innovation is everything. When you are on the forefront, you can see what the next innovation needs to be? 
			--- from Robert Noyce, Founder of Intel
  The more things change, the more they stay the same!
  The more things seem to be complicated ,the more they are simple in fact!

From bsman at tom.com  Sat Mar  6 14:08:11 2004
From: bsman at tom.com (=?gb2312?B?zfXqySwgU2FuIFdhbmc=?=)
Date: Mon Mar  8 11:20:40 2004
Subject: [Bioperl-l] modules for siRNA design
Message-ID: <200403061913.i26JDq9Q010414@portal.open-bio.org>

Dear bioperl-l:

	I have report a bug for it.
"Can't locate object method "start" via package "Bio::Seq::RichSeq" (perhaps you forgot to load "Bio::Seq::RichSeq"?) at D:/Tools/perl/site/lib/Bio/Tools/SiRNA.pm line 263."

	Best Regards!
 				
			Yours sincerely
��������������������, San Wang
����������������bsman@tom.com
��������������������2004-03-07
 ,-,-.     ,-,-.     ,-,-.     ,-,-.     ,-
/ / \ \   / / \ \   / / \ \   / / \ \   / /
     \ \ / /   \ \ / /   \ \ / /   \ \ / / 
      `-'-'     `-'-'     `-'-'     `-'-'  


San Wang
Dr. Student
Bioinformatics and Gene Network Research Group
Zhejiang University
P.R.China

Address:
Room 415# ,Bio-technology Institute,
Zhejiang University,
Kaixuan Road 268#  Hangzhou
Zhejiang, P.R.China
310027

Tel:
0086-571-86892051
Fax:
0086-571-86961525
E-mail:
bsman@163.com
Website:
http://www.estarray.org 

��������������������
  Innovation is everything. When you are on the forefront, you can see what the next innovation needs to be? 
			--- from Robert Noyce, Founder of Intel
  The more things change, the more they stay the same!
  The more things seem to be complicated ,the more they are simple in fact!

From morten at binf.ku.dk  Mon Mar  8 05:48:51 2004
From: morten at binf.ku.dk (Morten Lindow)
Date: Mon Mar  8 11:25:39 2004
Subject: [Bioperl-l] Unusual behaviour of SeqIO::tigr
In-Reply-To: <20040305184510.GB7843@batch107a>
References: <Pine.LNX.4.50.0312281604220.29220-100000@tenero.duhs.duke.edu>	<404751D7.6030406@binf.ku.dk>
	<20040305184510.GB7843@batch107a>
Message-ID: <404C4F93.1010606@binf.ku.dk>

Thanks Josh that helped,

In case somebody else can use it I am pasting the complete code to get 
the coordinates of all locations on one line each.

But another thing: When parsing rice-pseudochromosome files I get a few 
warnings like this(from around 20-30 sequences per chromosome):

 From chr01.xml

-------------------- WARNING ---------------------
MSG: Seq [8350.t06547]: Not using a valid terminator codon!
---------------------------------------------------

-------------------- WARNING ---------------------
MSG: Seq [8350.t06547]: Terminator codon inside CDS!
---------------------------------------------------

-------------------- WARNING ---------------------
MSG: Seq [8350.t06547]: Not using a valid initiator codon!
---------------------------------------------------
since the problem is only with a small fraction of the TUs I guess it is 
not a bioperl problem, but rather a TIGR problem? Any comments?


 - Morten

----BEGIN code to get featuretable from tigrxml.

#!/usr/bin/perl

use warnings;
use strict;
use lib "$ENV{HOME}/bioperl/bioperl-live";
use Bio::SeqIO;


my $file = shift;

my $tigrin = Bio::SeqIO->new( -format => 'tigr', -file => $file);
 
while (my $seq = $tigrin->next_seq){
   
    #Get global start and end coordinates for this sequence
    my ($source) = grep { $_->primary_tag() eq "source" } 
$seq->get_SeqFeatures();
    my($end5) = $source->get_tag_values('end5');
    my($end3) = $source->get_tag_values('end3');
    my($strand) = $end3 <=> $end5;
   
  
   foreach my $feat($seq->get_all_SeqFeatures){
    next if $feat->primary_tag eq 'source';   
    foreach my $location ($feat->location->each_Location){
        print $feat->primary_tag, "\t";   
        my $start = $end5 + ($location->start() - 1)*$strand;
        my $end   = $end5 + ($location->end()   - 1)*$strand;
        print join "\t", ($feat->primary_tag, $seq->id, $start, $end, 
$strand);
        print "\n";
    }
   }
#    if ($seq->id eq '8350.t06547'){
#    print $seq->seq;
#    die $!
#    }
}


Josh Lauricha wrote:

>On Thu 03/04/04 16:57, Morten Lindow wrote:
>  
>
>>I am trying to build a table of genomic features from the tigrxml-format 
>>of a rice (pseudo)chromosome.
>>
>>However the tigr-parser seem to behave differently from genbank/embl: 
>>tigr.pm considers every TU/etc an  individual sequence, and hence resets 
>>its coordinate system every time it starts on a new TU.
>>
>>My question is:
>>Is there a bioperl-way to get to the global coordinates, like when I am 
>>parsing a genbankfile of a whole chromosome?
>>    
>>
>
>Sure:
>
>my $tigrin = Bio::SeqIO->new( -format => 'tigr', -file => 'chr01.xml');
> 
>while (my $seq = $tigrin->next_seq){
>    my ($source) = grep { $_->primary_tag() eq "source" }
>        $seq->get_SeqFeatures();
>
>    # These are the 5' and 3' ends of each TU
>    my($end5) = $source->get_tag_values('end5');
>    my($end3) = $source->get_tag_values('end3');
>    my($strand) = $end3 <=> $end5;
>
>    # Then foreach location just do:
>    my $loc = get_some_location....
>    $start = $end5 + ($loc->start() - 1)*$strand;
>    $end   = $end5 + ($loc->end()   - 1)*$strand;
>    ...
>}
>
>This bit of code isn't tested, but I've been using these alot, so it is
>probably correct.
>
>  
>


From bioperl-guts-l-bounces at portal.open-bio.org  Mon Mar  8 11:43:20 2004
From: bioperl-guts-l-bounces at portal.open-bio.org (bioperl-guts-l-bounces@portal.open-bio.org)
Date: Mon Mar  8 11:43:33 2004
Subject: [Bioperl-l] Forward of moderated message
Message-ID: <mailman.0.1078764200.1565.bioperl-guts-l@portal.open-bio.org>

An embedded message was scrubbed...
From: Ruifang <zhangrf@xysm.net>
Subject: a problem of RestrictionEnzyme
Date: Mon, 08 Mar 2004 15:28:13 +0800
Size: 1941
Url: http://portal.open-bio.org/pipermail/bioperl-l/attachments/20040308/b9b3c189/attachment.eml
From iain.wallace at ucd.ie  Mon Mar  8 11:57:34 2004
From: iain.wallace at ucd.ie (Iain Wallace)
Date: Mon Mar  8 12:03:15 2004
Subject: [Bioperl-l] Locatable sequence with numbers
Message-ID: <1078765054.4741.7.camel@bioinf10>

Hi 

I am trying to create a file for clustalw that is of that includes the
secondary structure mask

!GM_id 1111144444411
id     GVPINVSCTGSPQ

My problem is that the code below doesn't work as $gap_mask is a string
of numbers, and not a proper sequence.

 my $newseq1 = new Bio::LocatableSeq(-seq => $gap_mask,
                                        -id => "!GM_".$seq->id,
                                        -start => 1,
                                        -end => $seq->length());
        print "adding id:\t",$seq->id,"\n";
        $Align->add_seq($newseq1);

The error that gets generated is 
 
-------------------- WARNING ---------------------
MSG: seq doesn't validate, mismatch is 1
---------------------------------------------------
 
------------- EXCEPTION  -------------
MSG: Attempting to set the sequence to
[12211111114444444444412222222112222221] which does not look healthy
STACK Bio::PrimarySeq::seq
/usr/lib/perl5/site_perl/5.8.0/Bio/PrimarySeq.pm:268
STACK Bio::PrimarySeq::new
/usr/lib/perl5/site_perl/5.8.0/Bio/PrimarySeq.pm:217
STACK Bio::LocatableSeq::new
/usr/lib/perl5/site_perl/5.8.0/Bio/LocatableSeq.pm:100
STACK toplevel ./dssp.pl:61
 
--------------------------------------


Does anyone know how I could get around this ? 

Thanks 

Iain 


From heikki at ebi.ac.uk  Mon Mar  8 12:03:40 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Mon Mar  8 12:09:21 2004
Subject: [Bioperl-l] modules for siRNA design
In-Reply-To: <200403061913.i26JDq9Q010414@portal.open-bio.org>
References: <200403061913.i26JDq9Q010414@portal.open-bio.org>
Message-ID: <200403081703.40515.heikki@ebi.ac.uk>

San Wang,

The code in SiRNA.pm around line 263 looks like this:

    if ($cds) {
	$left = $cds->start + $self->end_pad;
	$right = $cds->end - $self->start_pad;	
    }
    else {
	$left = $target->start + $self->end_pad;    # line #263
	$right = $target->end - $self->start_pad;
    }

Start and end calls are fine on a SeqFeature object like a CDS, but are not 
defined for a sequence. These lines  should be replaced by something like the 
following:

	$left = 1 + $self->end_pad;
	$right = $target->length - $self->start_pad;

Your example sequence is not too realistic, so you'll get an other error 
caused by the shortness of the sequece. Call 

$sirna_designer->start_pad(0);
$sirna_designer->end_pad(0);

before calling design().

Yours,	
		-Heikki

On Saturday 06 Mar 2004 19:08, ����, San Wang wrote:
> Dear bioperl-l:
>
> 	I have report a bug for it.
> "Can't locate object method "start" via package "Bio::Seq::RichSeq"
> (perhaps you forgot to load "Bio::Seq::RichSeq"?) at
> D:/Tools/perl/site/lib/Bio/Tools/SiRNA.pm line 263."
>
> 	Best Regards!
>
> 			Yours sincerely
> ��������������������, San Wang
> ����������������bsman@tom.com
> ��������������������2004-03-07
>  ,-,-.     ,-,-.     ,-,-.     ,-,-.     ,-
> / / \ \   / / \ \   / / \ \   / / \ \   / /
>      \ \ / /   \ \ / /   \ \ / /   \ \ / /
>       `-'-'     `-'-'     `-'-'     `-'-'
>
>
> San Wang
> Dr. Student
> Bioinformatics and Gene Network Research Group
> Zhejiang University
> P.R.China
>
> Address:
> Room 415# ,Bio-technology Institute,
> Zhejiang University,
> Kaixuan Road 268#  Hangzhou
> Zhejiang, P.R.China
> 310027
>
> Tel:
> 0086-571-86892051
> Fax:
> 0086-571-86961525
> E-mail:
> bsman@163.com
> Website:
> http://www.estarray.org
>
> ��������������������
>   Innovation is everything. When you are on the forefront, you can see what
> the next innovation needs to be? --- from Robert Noyce, Founder of Intel
>   The more things change, the more they stay the same!
>   The more things seem to be complicated ,the more they are simple in fact!
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________

From redwards at utmem.edu  Mon Mar  8 12:09:02 2004
From: redwards at utmem.edu (Rob Edwards)
Date: Mon Mar  8 12:14:57 2004
Subject: [Bioperl-l] Forward of moderated message
In-Reply-To: <mailman.0.1078764200.1565.bioperl-guts-l@portal.open-bio.org>
References: <mailman.0.1078764200.1565.bioperl-guts-l@portal.open-bio.org>
Message-ID: <4C3BE857-7123-11D8-8461-000A959E1622@utmem.edu>

Are you sure that you are using the right module. This has been fixed 
in Bio::Restriction (ver. 1.4)

e.g.


use Bio::Seq;
use Bio::Restriction::Enzyme;
use Bio::Restriction::Analysis;
use strict;

my $seq=Bio::Seq->new(-seq=>'GTACTTTTAGAGGATGAAACCT');
my $renz=Bio::Restriction::Enzyme->new(-enzyme=>'MnII', -seq=>'^CCTC');
my $ranal=Bio::Restriction::Analysis->new(-seq=>$seq, -enzymes=>$renz);
my $enz='MnII';
my @frags=$ranal->fragments($enz);

print join "\n", @frags, "\n";


gives

GTACTTTTA
GAGGATGAAACCT

Rob


On Mar 8, 2004, at 10:43 AM, bioperl-guts-l-bounces@portal.open-bio.org 
wrote:

>
> From: Ruifang <zhangrf@xysm.net>
> Date: March 8, 2004 1:28:13 AM CST
> To: bioperl-guts-l@portal.open-bio.org
> Subject: a problem of RestrictionEnzyme
>
>
> Dear,
>
> I am a graduate student in National Lab of Medical Genetics of China.
>
> I am using Bioperl Module: RestrictionEnzyme.pm and I have encountered 
> a question about this code.
>
> I found  the restriction enzymes could not recognize the DAN sequence 
> which is reverse complement to the enzymes sequence.
> For example, the enzyme: MnlI, the recognize sequence :CCTC, the DNA 
> sequence: GTACTTTTAGAGGATGAAACCT
> the DAN sequence include  GAGG , which is reverse complement to CCTC,  
> and the MnlI should cut this sequence, but the code could not give 
> this result.
>
> May I could have answer from you soon!
>
> Thanks!
> Best Wishes!
>
> Ruifang
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From heikki at ebi.ac.uk  Mon Mar  8 12:21:59 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Mon Mar  8 12:27:42 2004
Subject: [Bioperl-l] Forward of moderated message
In-Reply-To: <mailman.0.1078764200.1565.bioperl-guts-l@portal.open-bio.org>
References: <mailman.0.1078764200.1565.bioperl-guts-l@portal.open-bio.org>
Message-ID: <200403081721.59780.heikki@ebi.ac.uk>

Ruifang,

The code works. See this example:
 
######################################
use  Bio::Tools::RestrictionEnzyme;
use Bio::PrimarySeq;

$seqobj = new Bio::PrimarySeq(-seq=>'GTACTTTTAGAGGATGAAACCT');
$re2 = new Bio::Tools::RestrictionEnzyme(-NAME =>'MnlI--CC^TC', 
                                         -MAKE =>'custom');
@fragments = $re2->cut_seq($seqobj);
print scalar @fragments, "\n";
######################################

However, you should notice that Bio:Tools::RestrictionEnzyme class is 
deprecated and not really supported any more. You should have a look at 
Bio::Restriction classes.

Yours,
		-Heikki

On Monday 08 Mar 2004 16:43, bioperl-guts-l-bounces@portal.open-bio.org wrote:
> Dear,
>
> I am a graduate student in National Lab of Medical Genetics of China.
>
> I am using Bioperl Module: RestrictionEnzyme.pm and I have encountered a
> question about this code.
>
> I found  the restriction enzymes could not recognize the DAN sequence
> which is reverse complement to the enzymes sequence.
> For example, the enzyme: MnlI, the recognize sequence :CCTC, the DNA
> sequence: GTACTTTTAGAGGATGAAACCT
> the DAN sequence include  GAGG , which is reverse complement to CCTC,
> and the MnlI should cut this sequence, but the code could not give this
> result.
>
> May I could have answer from you soon!
>
> Thanks!
> Best Wishes!
>
> Ruifang

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From heikki at ebi.ac.uk  Mon Mar  8 12:30:34 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Mon Mar  8 12:36:15 2004
Subject: [Bioperl-l] Locatable sequence with numbers
In-Reply-To: <1078765054.4741.7.camel@bioinf10>
References: <1078765054.4741.7.camel@bioinf10>
Message-ID: <200403081730.34719.heikki@ebi.ac.uk>

Ian,

You could use Bio::Seq::Meta to store the mask, but  
Bio::Tools::Run::Alignment::Clustalw would not know about your mask encoded 
into the sequence object, so you'd have to modify its code to actually pass 
the mask to clustalw.

	-Heikki


On Monday 08 Mar 2004 16:57, Iain Wallace wrote:
> Hi
>
> I am trying to create a file for clustalw that is of that includes the
> secondary structure mask
>
> !GM_id 1111144444411
> id     GVPINVSCTGSPQ
>
> My problem is that the code below doesn't work as $gap_mask is a string
> of numbers, and not a proper sequence.
>
>  my $newseq1 = new Bio::LocatableSeq(-seq => $gap_mask,
>                                         -id => "!GM_".$seq->id,
>                                         -start => 1,
>                                         -end => $seq->length());
>         print "adding id:\t",$seq->id,"\n";
>         $Align->add_seq($newseq1);
>
> The error that gets generated is
>
> -------------------- WARNING ---------------------
> MSG: seq doesn't validate, mismatch is 1
> ---------------------------------------------------
>
> ------------- EXCEPTION  -------------
> MSG: Attempting to set the sequence to
> [12211111114444444444412222222112222221] which does not look healthy
> STACK Bio::PrimarySeq::seq
> /usr/lib/perl5/site_perl/5.8.0/Bio/PrimarySeq.pm:268
> STACK Bio::PrimarySeq::new
> /usr/lib/perl5/site_perl/5.8.0/Bio/PrimarySeq.pm:217
> STACK Bio::LocatableSeq::new
> /usr/lib/perl5/site_perl/5.8.0/Bio/LocatableSeq.pm:100
> STACK toplevel ./dssp.pl:61
>
> --------------------------------------
>
>
> Does anyone know how I could get around this ?
>
> Thanks
>
> Iain
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From lstein at cshl.edu  Mon Mar  8 13:01:11 2004
From: lstein at cshl.edu (Lincoln Stein)
Date: Mon Mar  8 13:07:15 2004
Subject: [Bioperl-l] Bio::Graphics::Glyph::translation.pm
In-Reply-To: <1078347825.4592.39.camel@gentiBk>
References: <1078347825.4592.39.camel@gentiBk>
Message-ID: <200403081301.17539.lstein@cshl.edu>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

You need to attach a Bio::PrimarySeqI to the Bio::SeqFeature::Generic, 
not a Bio::SeqI.  I know this is confusing, but I think that you can 
get the effect you want by changing this line to read:

	$feature->attach_seq($seq_object->seq);

Lincoln

On Wednesday 03 March 2004 04:03 pm, Dr. Todd Naumann wrote:
> Hi,
>
> I am trying to use the translation glyph and I am running into
> problems. When I run the program below with glyph=>'dna' I get a
> picture of double stranded DNA sequence as I expected. When I try
> to use
> glyph=>'translation' I get the following error message:
>
> Can't locate object method "translate" via package
> "Bio::SeqFeature::Generic" at
> /usr/lib/perl5/site_perl/5.8.2/Bio/Graphics/Glyph/translation.pm
> line 134, <GEN0> line 118.
>
> I tried to find another SeqFeature object that has a translate
> method but did not have any luck. If someone could point me in the
> right direction I would really appreciate it.
>
> -Todd
>
> #!/usr/bin/perl
>
> use strict;
> use warnings;
>
> use Bio::Graphics;
> use Bio::SeqFeature::Generic;
> use Bio::SeqIO;
>
> # initialize the graphics panel
> my $panel = Bio::Graphics::Panel->new(-length => 80,
> -width => 800,
> -pad_left => 10,
> -pad_right => 10,
> );
>
> # make the feature object
> my $feature = Bio::SeqFeature::Generic->new(-start=>1,
> -end=>60,
> );
>
> # create a sequence object
> my $seqIO_object = Bio::SeqIO->new(-file => 'test_sequence.gb',
> -format => 'genbank',
> );
>
> my $seq_object = $seqIO_object->next_seq();
>
> # attach sequence object to feature
> $feature->attach_seq($seq_object);
>
> # add track to panel
> $panel->add_track($feature,
> -glyph => 'translation',
> -fgcolor => 'black',
> );
>
> print $panel->png;
>
> exit;
> ============================
> Todd Naumann
> Dept. of Chemistry
> Penn State University
> University Park, PA  16802
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

- -- 
Lincoln D. Stein
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.1 (GNU/Linux)

iD8DBQFATLTt0CIvUP7P+AkRAgrqAJ9YLCuetr50r704+MyFo1+H7Py+4wCeOXeM
YiFetRZeCf+misFV4omy+f0=
=sLUm
-----END PGP SIGNATURE-----
From jason at cgt.duhs.duke.edu  Mon Mar  8 13:12:30 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Mon Mar  8 13:18:21 2004
Subject: [Bioperl-l] Bio::Graphics::Glyph::translation.pm
In-Reply-To: <200403081301.17539.lstein@cshl.edu>
References: <1078347825.4592.39.camel@gentiBk>
	<200403081301.17539.lstein@cshl.edu>
Message-ID: <Pine.LNX.4.50.0403081310460.23605-100000@tenero.duhs.duke.edu>

err - I don't think you should be calling attach_seq really, but rather
add_feature on the seq object

 $seq_object->add_SeqFeature($feature);

This is really the right way to do this - it will properly update the
seqfeature and call attach_feature for you.

-jason

On Mon, 8 Mar 2004, Lincoln Stein wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> You need to attach a Bio::PrimarySeqI to the Bio::SeqFeature::Generic,
> not a Bio::SeqI.  I know this is confusing, but I think that you can
> get the effect you want by changing this line to read:
>
> 	$feature->attach_seq($seq_object->seq);
>
> Lincoln
>
> On Wednesday 03 March 2004 04:03 pm, Dr. Todd Naumann wrote:
> > Hi,
> >
> > I am trying to use the translation glyph and I am running into
> > problems. When I run the program below with glyph=>'dna' I get a
> > picture of double stranded DNA sequence as I expected. When I try
> > to use
> > glyph=>'translation' I get the following error message:
> >
> > Can't locate object method "translate" via package
> > "Bio::SeqFeature::Generic" at
> > /usr/lib/perl5/site_perl/5.8.2/Bio/Graphics/Glyph/translation.pm
> > line 134, <GEN0> line 118.
> >
> > I tried to find another SeqFeature object that has a translate
> > method but did not have any luck. If someone could point me in the
> > right direction I would really appreciate it.
> >
> > -Todd
> >
> > #!/usr/bin/perl
> >
> > use strict;
> > use warnings;
> >
> > use Bio::Graphics;
> > use Bio::SeqFeature::Generic;
> > use Bio::SeqIO;
> >
> > # initialize the graphics panel
> > my $panel = Bio::Graphics::Panel->new(-length => 80,
> > -width => 800,
> > -pad_left => 10,
> > -pad_right => 10,
> > );
> >
> > # make the feature object
> > my $feature = Bio::SeqFeature::Generic->new(-start=>1,
> > -end=>60,
> > );
> >
> > # create a sequence object
> > my $seqIO_object = Bio::SeqIO->new(-file => 'test_sequence.gb',
> > -format => 'genbank',
> > );
> >
> > my $seq_object = $seqIO_object->next_seq();
> >
> > # attach sequence object to feature
> > $feature->attach_seq($seq_object);
> >
> > # add track to panel
> > $panel->add_track($feature,
> > -glyph => 'translation',
> > -fgcolor => 'black',
> > );
> >
> > print $panel->png;
> >
> > exit;
> > ============================
> > Todd Naumann
> > Dept. of Chemistry
> > Penn State University
> > University Park, PA  16802
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
> - --
> Lincoln D. Stein
> Cold Spring Harbor Laboratory
> 1 Bungtown Road
> Cold Spring Harbor, NY 11724
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.2.1 (GNU/Linux)
>
> iD8DBQFATLTt0CIvUP7P+AkRAgrqAJ9YLCuetr50r704+MyFo1+H7Py+4wCeOXeM
> YiFetRZeCf+misFV4omy+f0=
> =sLUm
> -----END PGP SIGNATURE-----
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From lstein at cshl.edu  Mon Mar  8 13:17:26 2004
From: lstein at cshl.edu (Lincoln Stein)
Date: Mon Mar  8 13:23:15 2004
Subject: [Bioperl-l] new GFF3 support methods added
In-Reply-To: <Pine.LNX.4.33.0403051652160.20795-100000@heartbroken.lbl.gov>
References: <Pine.LNX.4.33.0403051652160.20795-100000@heartbroken.lbl.gov>
Message-ID: <200403081317.26261.lstein@cshl.edu>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi Chris,

Nice job.  My only comment is that there's been a great deal of 
consternation over the role of whitespace in GFF3 recently and I am 
thinking changing the column delimiter back to strict tabs and 
allowing spaces (but no tabs or other unescaped whitespace) in the 
fields.  I don't think this will affect your methods at all, but just 
a heads up.

Lincoln

On Friday 05 March 2004 08:52 pm, Chris Mungall wrote:
> I have committed some new stuff to bioperl-live:
>
> the script seq/unflatten_seq will now generate GFF3 - the
> unflattener module is used to build the 'feature graph' connecting
> genes, transcripts, exons and CDSs together. This means we can have
> GFF3 for anything in genbank!
>
> As far as I'm aware, the only other sensible output formats to use
> here (ie formats that support feature graphs/containment
> hierarchies) are: chado, chaos, and the write-only asciitree.
>
> This feature graph is written out in the GFF3 using the ID and
> Parent tags. To do this there is an extra intermediate step - the
> bioperl FeatureHolderI hierarchy is traversed and ID/Parent tags
> are generated.
>
> Here is a description of the changes I have made:
>
> [unless you're a bioperl hacker you don't really need to read the
> rest of this]
>
> You can get the context of what I'm on about from this thread:
> http://bioperl.org/pipermail/bioperl-l/2003-December/014150.html
>
> Two new public methods:
>
>   FeatureHolderI->set_ParentIDs_from_hierarchy
>
>     sets both ID and ParentID from FeatureHolder hierarchy
>
>   SeqFeatureI->generate_unique_persistent_id
>
>     this is required by the above method
>
>     Lincoln wanted this to be private, but I think it has
>     to be called from outside
>
>   FeatureHolderI->create_hierarchy_from_ParentIDs
>
>     the inverse of set_ParentIDs_from_hierarchy
>
> (note that I have put the implementation in the interface - in the
> absence of proper abstract classes, this was deemed the best thing
> to do in the previous discussion on this)
>
> Modifications:
>
>   SeqFeatureI->primary_id
>
> This now maps to the tag_value 'ID' (ie the tag that GFF3 uses to
> uniquely identify a feature).
>
> Minor modification
>
>   Bio::Tools::GFF now allows the -noparse=>1 option
>
>   this is simply to stop the module waiting on input from STDIN
>   when used in write-mode (maybe there's a better way of doing this
>   but I didn't want to mess with this module)
>
> Proto-test
>
>   t/FeatureHolder.x
>
>   This unflattens a genbank sequences and roundtrips it to chadoxml
>   via GFF3
>
>   This doesn't work yet - if you dump a splitfeature as GFF3 and
>   re-import it, it becomes two features. Any volunteers to help
>   fix this?
>
> Unique IDs in bioperl:
>
> In the discussion that preceeded this, it seemed that people liked
> the idea of persistent unique IDs, but there was no suggestions as
> to how to go about it. This is inherently difficult with objects,
> but I borrowed a solution from relational modeling.
>
> A persistent unique ID is generated using
>
>   seq_id
>   primary_tag
>   start
>   end
>
> It is assumed that these are all set and comprise a "unique key"
> over features. Of course, there's no way to enforce this with
> objects. The generated ID is simply these values concatenated with
> : delimiters. You can think of this is a skolem function if you're
> that way inclined.
>
> Another assumption is that seq_id is unique and persistent.
>
> Of course, if you're dealing with data that changes with time, then
> changing the coordinates of a feature will change it's id. This is
> fine. If you want to use your own IDs rather than the generated
> ones, you can simply set the primary_id() field - or if you are
> using genbank files, add something like this
>
>   /ID=CG12345-RA
>
> to the feature.
>
> Stuff still to do:
>
> * fix GFF3 to deal with roundtripping splitlocations
>
> * A nest_features() method, as discussed in the previous email.
> This is the opposite of set_ParentIDs_from_hierarchy(), for reading
> in GFF3 (and then writing to a feature-graph compatible format, or
> to a database such as biosql or chado).
>
> * Bio::SeqIO::GFF3
>
> I know a lot of us would like this a lot - is there any plans to
> implement this yet?
>
> * A GeneModel factory
>
> This would take the output of the unflattener (a set of feature
> graphs typed to SO) and make SeqFeature::Gene objects
>
> Cheers
> Chris
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

- -- 
Lincoln D. Stein
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.1 (GNU/Linux)

iD8DBQFATLi20CIvUP7P+AkRAuErAKCc4iNS3cnVLpbkLAfpba176o29aQCgndia
SZzP/ANnxz7kvKmg+5Ovq9c=
=FM1X
-----END PGP SIGNATURE-----
From birney at ebi.ac.uk  Mon Mar  8 13:17:50 2004
From: birney at ebi.ac.uk (Ewan Birney)
Date: Mon Mar  8 13:23:45 2004
Subject: [Bioperl-l] Bio::Graphics::Glyph::translation.pm
In-Reply-To: <200403081301.17539.lstein@cshl.edu>
Message-ID: <Pine.LNX.4.44.0403081816250.23156-100000@pigeon.ebi.ac.uk>

On Mon, 8 Mar 2004, Lincoln Stein wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> You need to attach a Bio::PrimarySeqI to the Bio::SeqFeature::Generic, 
> not a Bio::SeqI.  I know this is confusing, but I think that you can 
> get the effect you want by changing this line to read:
> 
> 	$feature->attach_seq($seq_object->seq);
> 


We should probably get $feature->attach_seq(xxx) to do something magical 
with Bio::SeqI's...


This has always been unsatisfactory, but roll on parrot with its 
mark-and-sweep style GC so we can do something more sensible here 
(circular references...)


From lstein at cshl.edu  Mon Mar  8 13:47:13 2004
From: lstein at cshl.edu (Lincoln Stein)
Date: Mon Mar  8 13:53:03 2004
Subject: [Bioperl-l] Bio::Graphics::Glyph::translation.pm
In-Reply-To: <Pine.LNX.4.50.0403081310460.23605-100000@tenero.duhs.duke.edu>
References: <1078347825.4592.39.camel@gentiBk>
	<200403081301.17539.lstein@cshl.edu>
	<Pine.LNX.4.50.0403081310460.23605-100000@tenero.duhs.duke.edu>
Message-ID: <200403081347.13182.lstein@cshl.edu>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Todd,

Maybe I've made a mistake.  Let me know which way works.

Lincoln

On Monday 08 March 2004 01:12 pm, Jason Stajich wrote:
> err - I don't think you should be calling attach_seq really, but
> rather add_feature on the seq object
>
>  $seq_object->add_SeqFeature($feature);
>
> This is really the right way to do this - it will properly update
> the seqfeature and call attach_feature for you.
>
> -jason
>
> On Mon, 8 Mar 2004, Lincoln Stein wrote:
> > -----BEGIN PGP SIGNED MESSAGE-----
> > Hash: SHA1
> >
> > You need to attach a Bio::PrimarySeqI to the
> > Bio::SeqFeature::Generic, not a Bio::SeqI.  I know this is
> > confusing, but I think that you can get the effect you want by
> > changing this line to read:
> >
> > 	$feature->attach_seq($seq_object->seq);
> >
> > Lincoln
> >
> > On Wednesday 03 March 2004 04:03 pm, Dr. Todd Naumann wrote:
> > > Hi,
> > >
> > > I am trying to use the translation glyph and I am running into
> > > problems. When I run the program below with glyph=>'dna' I get
> > > a picture of double stranded DNA sequence as I expected. When I
> > > try to use
> > > glyph=>'translation' I get the following error message:
> > >
> > > Can't locate object method "translate" via package
> > > "Bio::SeqFeature::Generic" at
> > > /usr/lib/perl5/site_perl/5.8.2/Bio/Graphics/Glyph/translation.p
> > >m line 134, <GEN0> line 118.
> > >
> > > I tried to find another SeqFeature object that has a translate
> > > method but did not have any luck. If someone could point me in
> > > the right direction I would really appreciate it.
> > >
> > > -Todd
> > >
> > > #!/usr/bin/perl
> > >
> > > use strict;
> > > use warnings;
> > >
> > > use Bio::Graphics;
> > > use Bio::SeqFeature::Generic;
> > > use Bio::SeqIO;
> > >
> > > # initialize the graphics panel
> > > my $panel = Bio::Graphics::Panel->new(-length => 80,
> > > -width => 800,
> > > -pad_left => 10,
> > > -pad_right => 10,
> > > );
> > >
> > > # make the feature object
> > > my $feature = Bio::SeqFeature::Generic->new(-start=>1,
> > > -end=>60,
> > > );
> > >
> > > # create a sequence object
> > > my $seqIO_object = Bio::SeqIO->new(-file => 'test_sequence.gb',
> > > -format => 'genbank',
> > > );
> > >
> > > my $seq_object = $seqIO_object->next_seq();
> > >
> > > # attach sequence object to feature
> > > $feature->attach_seq($seq_object);
> > >
> > > # add track to panel
> > > $panel->add_track($feature,
> > > -glyph => 'translation',
> > > -fgcolor => 'black',
> > > );
> > >
> > > print $panel->png;
> > >
> > > exit;
> > > ============================
> > > Todd Naumann
> > > Dept. of Chemistry
> > > Penn State University
> > > University Park, PA  16802
> > >
> > > _______________________________________________
> > > Bioperl-l mailing list
> > > Bioperl-l@portal.open-bio.org
> > > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
> > - --
> > Lincoln D. Stein
> > Cold Spring Harbor Laboratory
> > 1 Bungtown Road
> > Cold Spring Harbor, NY 11724
> > -----BEGIN PGP SIGNATURE-----
> > Version: GnuPG v1.2.1 (GNU/Linux)
> >
> > iD8DBQFATLTt0CIvUP7P+AkRAgrqAJ9YLCuetr50r704+MyFo1+H7Py+4wCeOXeM
> > YiFetRZeCf+misFV4omy+f0=
> > =sLUm
> > -----END PGP SIGNATURE-----
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
> --
> Jason Stajich
> Duke University
> jason at cgt.mc.duke.edu
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

- -- 
Lincoln D. Stein
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.1 (GNU/Linux)

iD8DBQFATL+x0CIvUP7P+AkRAh2bAJwIEXstWUJYdwAYuJ/O0nG87I7hyACfZWxR
mGai9ckUJFQb6sAozxuxJBk=
=OPAW
-----END PGP SIGNATURE-----
From adegjp at hotmail.com  Mon Mar  8 14:26:48 2004
From: adegjp at hotmail.com (marlin)
Date: Mon Mar  8 14:32:29 2004
Subject: [Bioperl-l] Use offshore phar-macies to save $$$ on your meds!
Message-ID: <1078774008-32193@excite.com>


We created an online medical consultation to get you the medications you need without the hassle, embarrassment, and cost of the doctor's office and pharmacy.

When you need them fast: 
X a n a x - S o m a - V a l i u m - V i a g r a - M e r i d i a - MORE!

We don't require any prescriptions.

Because you can add more to your life. 
Shop Now.  

http://selcydc.com/s95/index.php?id=s95


brooks <LI> ssurdi nlee  zgao mekdecic nnguyen  marrowmontreal a1b2c3 mulcahy &nbsp; Bigdog vandam foster dmiller1  jcheydwe parrot  stan mahartna <U> 
Get off this list by writing to getmeoff731@excitemail.com
From mcipriano at mbl.edu  Mon Mar  8 14:48:54 2004
From: mcipriano at mbl.edu (Michael Cipriano)
Date: Mon Mar  8 14:54:35 2004
Subject: [Bioperl-l] phd 2 abi file
Message-ID: <000801c40546$6371f9c0$8fae8080@Ripley>

Is there a way to create an abi sequence file(abi, ab1) from a phred
(phd) file within bioperl? If not within bioperl, does anyone know of
any way to do this? 

Thanks in advance

Michael Cipriano (mcipriano@mbl.edu)
Research Assistant I, Global Infectious Diseases
Josephine Bay Paul Center, Marine Biological Laboratory
7 MBL Street, Woods Hole, MA 02543-1015


From cjm at fruitfly.org  Mon Mar  8 16:07:16 2004
From: cjm at fruitfly.org (Chris Mungall)
Date: Mon Mar  8 16:12:58 2004
Subject: [Bioperl-l] new GFF3 support methods added
In-Reply-To: <C4908D80-6F2F-11D8-8AAD-000A959EB4C4@gmx.net>
Message-ID: <Pine.LNX.4.33.0403081248510.20795-100000@heartbroken.lbl.gov>


On Fri, 5 Mar 2004, Hilmar Lapp wrote:

> Nice work Chris. I don't have very many comments.
>
> On Friday, March 5, 2004, at 05:52  PM, Chris Mungall wrote:
>
> >   FeatureHolderI->set_ParentIDs_from_hierarchy
> >
> >     sets both ID and ParentID from FeatureHolder hierarchy
> >
> >   FeatureHolderI->create_hierarchy_from_ParentIDs
> >
> >     the inverse of set_ParentIDs_from_hierarchy
> >
>
> Hmm - is this unique to feature graphs or to GFF3?

I think it's useful for any time you want to capture the hierarchical
knowledge from the bioperl FeatureHolder hierarchy in any kind of format
that does not explicitly capture this kind of info.

You could use this with any flavour of GFF - the difference with GFF3 is
that the ID and Parent tags are part of the format specification

If you wanted to persist your bioperl data through genbank files, this
would also be useful, as features could use /ID and /Parent tags (although
this is actually an extension of the genbank format)

It is useful in a variety of contexts, but it does 'clutter up'
FeatureHolder a little

> >   SeqFeatureI->generate_unique_persistent_id
> >
> >     this is required by the above method
> >
> >     Lincoln wanted this to be private, but I think it has
> >     to be called from outside
>
> I wouldn't want it to be private, but I'd rather make this a property
> representing a generator (with a single required method
> $generator->generate_id($feature)), or alternatively - and also simpler
> - a closure, than a method. The reason is that if you wanted to change
> the way the ID is generated you'd have to subclass an entire
> SeqFeatureI implementation instead of just setting the property to some
> anonymous method you whip up.

Much as I like closures, I think there should be a standard sanctioned way
of generating unique/persistent IDs

> > Unique IDs in bioperl:
> >
> > In the discussion that preceeded this, it seemed that people liked the
> > idea of persistent unique IDs, but there was no suggestions as to how
> > to
> > go about it. This is inherently difficult with objects, but I borrowed
> > a
> > solution from relational modeling.
> >
> > A persistent unique ID is generated using
> >
> >   seq_id
> >   primary_tag
> >   start
> >   end
> >
> > It is assumed that these are all set and comprise a "unique key" over
> > features.
>
> Hmm. Wouldn't you need to include source_tag()? (Source_tag is part of
> the unique key in biosql.) Without the source_tag being part of this,
> wouldn't that mean you cannot have the exact (start+end) same segment
> predicted as exon by different methods and have those different
> predictions co-exist as separate features in the graph? (Presumably
> those would only differ in source_tag)

Good point! I've added this

Ok, so are you saying that some of these methods don't really belong in
the class-interfaces they are in?

Perhaps it would be better to have a Bio::SeqFeature::Tools::IDHandler
class? This would contain methods

  generate_unique_persistent_id($feat) # uses $feat->seq_id
  generate_unique_persistent_id($feat, $seq_id)

  create_hierarchy_from_ParentIDs($featholder);
  set_ParentIDs_from_hierarchy($featholder);

Any preference? I'm leaving them as is for now if that's ok, but I have no
objections to moving everything to a seperate class if that's prefered.

> >
> > Another assumption is that seq_id is unique and persistent.
> >
>
> I don't think that's going to be a very safe assumption.

Fair enough, but I think this is the responsibility of the user.

This is also partially the fault of the bioperl model - $feat->seq_id()
generally refers to the versionless accession no. There isn't really a way
to get to the versioned seq_id from the feature. I think it would cause
too much churn to add at this late stage...

> >
> > * A GeneModel factory
> >
> > This would take the output of the unflattener (a set of feature graphs
> > typed to SO) and make SeqFeature::Gene objects
> >
>
> Yeah, that'd be cool ...

Sometime..

> 	-hilmar
>
>


From stefan.weckx at ua.ac.be  Mon Mar  8 17:26:56 2004
From: stefan.weckx at ua.ac.be (Stefan Weckx)
Date: Mon Mar  8 17:32:21 2004
Subject: [Bioperl-l] phd 2 abi file
References: <000801c40546$6371f9c0$8fae8080@Ripley>
Message-ID: <001d01c4055c$8cd5f9e0$df8f0b51@desktop>

Michael,

have look at http://bioperl.org/pipermail/bioperl-l/2001-August/006176.html,
You can also try this: translate your phd file to a fasta using phd2fasta,
then run mktrace. This will create an scf file. mktrace comes with Consed
(http://www.phrap.org/consed/distributions/README.13.0.txt).
other possibility: try "sudophred" which comes with the newest release of
PolyPhred (see at http://droog.mbt.washington.edu/poly_doc.html).

Stefan

-- 
 Stefan Weckx
Bioinformatics Unit
Department of Molecular Genetics VIB8
University of Antwerp
Belgium

----- Original Message ----- 
From: "Michael Cipriano" <mcipriano@mbl.edu>
To: <bioperl-l@bioperl.org>
Sent: Monday, March 08, 2004 8:48 PM
Subject: [Bioperl-l] phd 2 abi file


> Is there a way to create an abi sequence file(abi, ab1) from a phred
> (phd) file within bioperl? If not within bioperl, does anyone know of
> any way to do this?
>
> Thanks in advance
>
> Michael Cipriano (mcipriano@mbl.edu)
> Research Assistant I, Global Infectious Diseases
> Josephine Bay Paul Center, Marine Biological Laboratory
> 7 MBL Street, Woods Hole, MA 02543-1015
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

From mcipriano at mbl.edu  Mon Mar  8 17:29:34 2004
From: mcipriano at mbl.edu (Michael Cipriano)
Date: Mon Mar  8 17:35:14 2004
Subject: [Bioperl-l] phd 2 abi file
In-Reply-To: <001d01c4055c$8cd5f9e0$df8f0b51@desktop>
Message-ID: <000d01c4055c$d58352a0$8fae8080@Ripley>

Thank you, I will look into those solutions.

Michael Cipriano (mcipriano@mbl.edu)
Research Assistant I, Global Infectious Diseases
Josephine Bay Paul Center, Marine Biological Laboratory
7 MBL Street, Woods Hole, MA 02543-1015


> -----Original Message-----
> From: Stefan Weckx [mailto:stefan.weckx@ua.ac.be]
> Sent: Monday, March 08, 2004 5:27 PM
> To: Michael Cipriano
> Cc: bioperl-l@bioperl.org
> Subject: Re: [Bioperl-l] phd 2 abi file
> 
> Michael,
> 
> have look at http://bioperl.org/pipermail/bioperl-l/2001-
> August/006176.html,
> You can also try this: translate your phd file to a fasta using
phd2fasta,
> then run mktrace. This will create an scf file. mktrace comes with
Consed
> (http://www.phrap.org/consed/distributions/README.13.0.txt).
> other possibility: try "sudophred" which comes with the newest release
of
> PolyPhred (see at http://droog.mbt.washington.edu/poly_doc.html).
> 
> Stefan
> 
> --
>  Stefan Weckx
> Bioinformatics Unit
> Department of Molecular Genetics VIB8
> University of Antwerp
> Belgium
> 
> ----- Original Message -----
> From: "Michael Cipriano" <mcipriano@mbl.edu>
> To: <bioperl-l@bioperl.org>
> Sent: Monday, March 08, 2004 8:48 PM
> Subject: [Bioperl-l] phd 2 abi file
> 
> 
> > Is there a way to create an abi sequence file(abi, ab1) from a phred
> > (phd) file within bioperl? If not within bioperl, does anyone know
of
> > any way to do this?
> >
> > Thanks in advance
> >
> > Michael Cipriano (mcipriano@mbl.edu)
> > Research Assistant I, Global Infectious Diseases
> > Josephine Bay Paul Center, Marine Biological Laboratory
> > 7 MBL Street, Woods Hole, MA 02543-1015
> >
> >
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
> 


From hz5 at njit.edu  Mon Mar  8 17:48:57 2004
From: hz5 at njit.edu (hz5@njit.edu)
Date: Mon Mar  8 17:54:39 2004
Subject: [Bioperl-l] GD::SVG
In-Reply-To: <200403081347.13182.lstein@cshl.edu>
References: <1078347825.4592.39.camel@gentiBk>
	<200403081301.17539.lstein@cshl.edu>
	<Pine.LNX.4.50.0403081310460.23605-100000@tenero.duhs.duke.edu>
	<200403081347.13182.lstein@cshl.edu>
Message-ID: <1078786137.404cf859816f7@webmail.njit.edu>

Dear friends,

I've successfully rendered multiple panels into png image using GD module and 
bioperl panel object. Now I am trying to make the picture quality higher. So I 
tried to use GD::SVG like this:

        my $panel = Bio::Graphics::Panel->new(
                                      #-segment    => $whole_seq,
                                      -start      => $s-1000,
                                      -end        => $t+1000,
                                      #-grid       => 'true',
                                      -key_style  => 'bottom',
                                      -key_font => GD::Font->Large(),
                                      -width      => $width,
                                      -pad_left   => 10,
                                      -pad_right  => 10,
                                      -pad_bottom => 10,
                                      -image_class=> 'GD::SVG',
                                      -spacing    => 1*$ratio,
                                     );

          ......

        open (OUT, ">my.svg");
        binmode OUT;
        print OUT $panel->svg();

Now I get the following message:
Can't locate object method "formatting" via package "GD::Font" (perhaps you 
forgot to load "GD::Font"?) at /usr/lib/perl5/site_perl/5.6.1/GD/SVG.pm line 
702, <DATA> line 191.

don't know why, thanks in advance for your help!
haibo
From harris at cshl.edu  Mon Mar  8 20:14:54 2004
From: harris at cshl.edu (Todd Harris)
Date: Mon Mar  8 20:21:46 2004
Subject: [Bioperl-l] GD::SVG
In-Reply-To: <1078786137.404cf859816f7@webmail.njit.edu>
Message-ID: <BC7276AE.C97A%harris@cshl.org>

Hi Haibo - 

The problem you are seeing is due to how GD::SVG mimics the interface of GD
in order to generate SVG.  The "Can't locate object method "formatting" via
package "GD::Font" (perhaps you..." error is because you aren't working with
GD (or GD::Font) objects anymore.  Instead, try passing a string for your
key_font:

      -key_font => 'gdLargeFont',

It's admittedly not an ideal solution, but GD::SVG will know what to do.
Thanks for bringing this up, and I'll make a note in the documentation to
address this issue.

Todd


> On 3/8/04 4:48 PM, hz5@njit.edu wrote:

> Dear friends,
> 
> I've successfully rendered multiple panels into png image using GD module and
> bioperl panel object. Now I am trying to make the picture quality higher. So I
> tried to use GD::SVG like this:
> 
>       my $panel = Bio::Graphics::Panel->new(
>                                     #-segment    => $whole_seq,
>                                     -start      => $s-1000,
>                                     -end        => $t+1000,
>                                     #-grid       => 'true',
>                                     -key_style  => 'bottom',
>                                     -key_font => GD::Font->Large(),
>                                     -width      => $width,
>                                     -pad_left   => 10,
>                                     -pad_right  => 10,
>                                     -pad_bottom => 10,
>                                     -image_class=> 'GD::SVG',
>                                     -spacing    => 1*$ratio,
>                                    );
> 
>         ......
> 
>       open (OUT, ">my.svg");
>       binmode OUT;
>       print OUT $panel->svg();
> 
> Now I get the following message:
> Can't locate object method "formatting" via package "GD::Font" (perhaps you
> forgot to load "GD::Font"?) at /usr/lib/perl5/site_perl/5.6.1/GD/SVG.pm line
> 702, <DATA> line 191.
> 
> don't know why, thanks in advance for your help!
> haibo
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

From donald.jackson at bms.com  Mon Mar  8 21:46:19 2004
From: donald.jackson at bms.com (Donald Jackson)
Date: Mon Mar  8 21:51:58 2004
Subject: [Bioperl-l] modules for siRNA design
Message-ID: <8a74589334.893348a745@bms.com>

San and Heikki,

I've modified the SiRNA.pm module to fixed the bug per Heikki's suggestion.  I also added a sanity check in case the sequence is so short that the start_pad and end_pad values don't leave room for an oligo, and I added a feature to allow oligo design in the 3 prime UTR.  The mods just went into CVS as version 1.7

San - I strongly suggest that you give the module a longer target sequence if at all possible!

Don Jackson

----- Original Message -----
From: Heikki Lehvaslaiho <heikki@ebi.ac.uk>
Date: Monday, March 8, 2004 12:03 pm
Subject: Re: [Bioperl-l] modules for siRNA design

> San Wang,
> 
> The code in SiRNA.pm around line 263 looks like this:
> 
>    if ($cds) {
> 	$left = $cds->start + $self->end_pad;
> 	$right = $cds->end - $self->start_pad;	
>    }
>    else {
> 	$left = $target->start + $self->end_pad;    # line #263
> 	$right = $target->end - $self->start_pad;
>    }
> 
> Start and end calls are fine on a SeqFeature object like a CDS, but 
> are not 
> defined for a sequence. These lines  should be replaced by 
> something like the 
> following:
> 
> 	$left = 1 + $self->end_pad;
> 	$right = $target->length - $self->start_pad;
> 
> Your example sequence is not too realistic, so you'll get an other 
> error 
> caused by the shortness of the sequece. Call 
> 
> $sirna_designer->start_pad(0);
> $sirna_designer->end_pad(0);
> 
> before calling design().
> 
> Yours,	
>        	-Heikki
> 
> On Saturday 06 Mar 2004 19:08, ??, San Wang wrote:
> > Dear bioperl-l:
> >
> > 	I have report a bug for it.
> > "Can't locate object method "start" via package "Bio::Seq::RichSeq"
> > (perhaps you forgot to load "Bio::Seq::RichSeq"?) at
> > D:/Tools/perl/site/lib/Bio/Tools/SiRNA.pm line 263."
> >
> > 	Best Regards!
> >
> >                 	Yours sincerely
> > ??????????, San Wang
> > ????????bsman@tom.com
> > ??????????2004-03-07
> >  ,-,-.     ,-,-.     ,-,-.     ,-,-.     ,-
> > / / \ \   / / \ \   / / \ \   / / \ \   / /
> >      \ \ / /   \ \ / /   \ \ / /   \ \ / /
> >       `-'-'     `-'-'     `-'-'     `-'-'
> >
> >
> > San Wang
> > Dr. Student
> > Bioinformatics and Gene Network Research Group
> > Zhejiang University
> > P.R.China
> >
> > Address:
> > Room 415# ,Bio-technology Institute,
> > Zhejiang University,
> > Kaixuan Road 268#  Hangzhou
> > Zhejiang, P.R.China
> > 310027
> >
> > Tel:
> > 0086-571-86892051
> > Fax:
> > 0086-571-86961525
> > E-mail:
> > bsman@163.com
> > Website:
> > http://www.estarray.org
> >
> > ??????????
> >   Innovation is everything. When you are on the forefront, you 
> can see what
> > the next innovation needs to be? --- from Robert Noyce, Founder 
> of Intel
> >   The more things change, the more they stay the same!
> >   The more things seem to be complicated ,the more they are 
> simple in fact!
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
> -- 
> ______ _/      _/_____________________________________________________
>      _/      _/                      http://www.ebi.ac.uk/mutations/
>     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
>    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
>   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
>  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
>     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
> ___ _/_/_/_/_/________________________________________________________
> 

From donald.jackson at bms.com  Mon Mar  8 15:10:35 2004
From: donald.jackson at bms.com (Donald Jackson)
Date: Mon Mar  8 22:03:10 2004
Subject: [Bioperl-l] modules for siRNA design
In-Reply-To: <200403081703.40515.heikki@ebi.ac.uk>
References: <200403061913.i26JDq9Q010414@portal.open-bio.org>
	<200403081703.40515.heikki@ebi.ac.uk>
Message-ID: <404CD33B.2080405@bms.com>

Thanks Heikki,
I thought I'd fixed all my code to handle that change from 1.0 to 1.2,
but clearly I messed one.
I'll get a patch into CVS later today and will also try to get some
better error checking on sequence lengths and start/end pad.

Don

Heikki Lehvaslaiho wrote:

>San Wang,
>
>The code in SiRNA.pm around line 263 looks like this:
>
>    if ($cds) {
>	$left = $cds->start + $self->end_pad;
>	$right = $cds->end - $self->start_pad;	
>    }
>    else {
>	$left = $target->start + $self->end_pad;    # line #263
>	$right = $target->end - $self->start_pad;
>    }
>
>Start and end calls are fine on a SeqFeature object like a CDS, but are not 
>defined for a sequence. These lines  should be replaced by something like the 
>following:
>
>	$left = 1 + $self->end_pad;
>	$right = $target->length - $self->start_pad;
>
>Your example sequence is not too realistic, so you'll get an other error 
>caused by the shortness of the sequece. Call 
>
>$sirna_designer->start_pad(0);
>$sirna_designer->end_pad(0);
>
>before calling design().
>
>Yours,	
>		-Heikki
>
>On Saturday 06 Mar 2004 19:08, ����, San Wang wrote:
>  
>
>>Dear bioperl-l:
>>
>>	I have report a bug for it.
>>"Can't locate object method "start" via package "Bio::Seq::RichSeq"
>>(perhaps you forgot to load "Bio::Seq::RichSeq"?) at
>>D:/Tools/perl/site/lib/Bio/Tools/SiRNA.pm line 263."
>>
>>	Best Regards!
>>
>>			Yours sincerely
>>��������������������, San Wang
>>����������������bsman@tom.com
>>��������������������2004-03-07
>> ,-,-.     ,-,-.     ,-,-.     ,-,-.     ,-
>>/ / \ \   / / \ \   / / \ \   / / \ \   / /
>>     \ \ / /   \ \ / /   \ \ / /   \ \ / /
>>      `-'-'     `-'-'     `-'-'     `-'-'
>>
>>
>>San Wang
>>Dr. Student
>>Bioinformatics and Gene Network Research Group
>>Zhejiang University
>>P.R.China
>>
>>Address:
>>Room 415# ,Bio-technology Institute,
>>Zhejiang University,
>>Kaixuan Road 268#  Hangzhou
>>Zhejiang, P.R.China
>>310027
>>
>>Tel:
>>0086-571-86892051
>>Fax:
>>0086-571-86961525
>>E-mail:
>>bsman@163.com
>>Website:
>>http://www.estarray.org
>>
>>��������������������
>>  Innovation is everything. When you are on the forefront, you can see what
>>the next innovation needs to be? --- from Robert Noyce, Founder of Intel
>>  The more things change, the more they stay the same!
>>  The more things seem to be complicated ,the more they are simple in fact!
>>
>>_______________________________________________
>>Bioperl-l mailing list
>>Bioperl-l@portal.open-bio.org
>>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>    
>>
>
>  
>
From laurichj at bioinfo.ucr.edu  Mon Mar  8 14:38:33 2004
From: laurichj at bioinfo.ucr.edu (Josh Lauricha)
Date: Mon Mar  8 22:03:17 2004
Subject: [Bioperl-l] Unusual behaviour of SeqIO::tigr
In-Reply-To: <404C4F93.1010606@binf.ku.dk>
References: <Pine.LNX.4.50.0312281604220.29220-100000@tenero.duhs.duke.edu>
	<404751D7.6030406@binf.ku.dk> <20040305184510.GB7843@batch107a>
	<404C4F93.1010606@binf.ku.dk>
Message-ID: <20040308193832.GA10347@batch107a>

On Mon 03/08/04 11:48, Morten Lindow wrote:
> Thanks Josh that helped,
> 
> In case somebody else can use it I am pasting the complete code to get 
> the coordinates of all locations on one line each.
> 
> But another thing: When parsing rice-pseudochromosome files I get a few 
> warnings like this(from around 20-30 sequences per chromosome):

Thats because not all of the proteins annotated are valid proteins.

> since the problem is only with a small fraction of the TUs I guess it is 
> not a bioperl problem, but rather a TIGR problem? Any comments?

TIGRs rice publications have some flaws that are slowly getting worked out.
The invalid proteins are one and there are still some examples where the
exon's are annotated to be outside of the TU their enclosed in.

-- 

------------------------------------------------------
| Josh Lauricha            | Ford, your turning into |
| laurichj@bioinfo.ucr.edu | a penguin. Stop it.     |
| Bioinformatics, UCR      |                         |
|----------------------------------------------------|
| OpenPG:                                            |
|  5A0D 92D3 D093 79DE F724 1137 6DF1 B5EB D9CE AAA8 |
|----------------------------------------------------|
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
Url : http://portal.open-bio.org/pipermail/bioperl-l/attachments/20040308/7e9f40d8/attachment.bin
From hlapp at gmx.net  Tue Mar  9 02:51:57 2004
From: hlapp at gmx.net (Hilmar Lapp)
Date: Tue Mar  9 02:57:35 2004
Subject: [Bioperl-l] new GFF3 support methods added
In-Reply-To: <Pine.LNX.4.33.0403081248510.20795-100000@heartbroken.lbl.gov>
Message-ID: <A3999380-719E-11D8-910D-000A959EB4C4@gmx.net>


On Monday, March 8, 2004, at 01:07  PM, Chris Mungall wrote:

> Perhaps it would be better to have a Bio::SeqFeature::Tools::IDHandler
> class? This would contain methods
>
>   generate_unique_persistent_id($feat) # uses $feat->seq_id
>   generate_unique_persistent_id($feat, $seq_id)
>
>   create_hierarchy_from_ParentIDs($featholder);
>   set_ParentIDs_from_hierarchy($featholder);
>
> Any preference? I'm leaving them as is for now if that's ok, but I 
> have no
> objections to moving everything to a seperate class if that's prefered.
>

I like your suggestion. Keeps FeatureHolderI lighter, and is easier to 
change behaviour. But maybe it's just me who likes this better.

	-hilmar

-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From allenday at ucla.edu  Tue Mar  9 04:04:52 2004
From: allenday at ucla.edu (Allen Day)
Date: Tue Mar  9 04:10:36 2004
Subject: [Bioperl-l] Unique IDs in bioperl (Was: new GFF3 support methods
	added)
In-Reply-To: <Pine.LNX.4.33.0403081248510.20795-100000@heartbroken.lbl.gov>
Message-ID: <Pine.LNX.4.44.0403090059110.9003-100000@sumo.genetics.ucla.edu>

I'd be interested in having this discussion and/or method documented as a
set of guidelines or rules for creating unique IDs.  We've had to deal
with a similar problem to generate unique IDs for our chado GFF3 loader
and UCSC datafile parsers and the solutions are essentially bad hacks.

-allen

> Much as I like closures, I think there should be a standard sanctioned way
> of generating unique/persistent IDs
> 
> > > Unique IDs in bioperl:
> > >
> > > In the discussion that preceeded this, it seemed that people liked the
> > > idea of persistent unique IDs, but there was no suggestions as to how
> > > to
> > > go about it. This is inherently difficult with objects, but I borrowed
> > > a
> > > solution from relational modeling.
> > >
> > > A persistent unique ID is generated using
> > >
> > >   seq_id
> > >   primary_tag
> > >   start
> > >   end
> > >
> > > It is assumed that these are all set and comprise a "unique key" over
> > > features.
> >
> > Hmm. Wouldn't you need to include source_tag()? (Source_tag is part of
> > the unique key in biosql.) Without the source_tag being part of this,
> > wouldn't that mean you cannot have the exact (start+end) same segment
> > predicted as exon by different methods and have those different
> > predictions co-exist as separate features in the graph? (Presumably
> > those would only differ in source_tag)
> 
> Good point! I've added this
> 
> Ok, so are you saying that some of these methods don't really belong in
> the class-interfaces they are in?
> 
> Perhaps it would be better to have a Bio::SeqFeature::Tools::IDHandler
> class? This would contain methods
> 
>   generate_unique_persistent_id($feat) # uses $feat->seq_id
>   generate_unique_persistent_id($feat, $seq_id)
> 
>   create_hierarchy_from_ParentIDs($featholder);
>   set_ParentIDs_from_hierarchy($featholder);
> 
> Any preference? I'm leaving them as is for now if that's ok, but I have no
> objections to moving everything to a seperate class if that's prefered.

From guinness21quebec at hotmail.com  Tue Mar  9 06:46:52 2004
From: guinness21quebec at hotmail.com (chris)
Date: Tue Mar  9 06:52:37 2004
Subject: [Bioperl-l] Fight off the signs of aging!
Message-ID: <1078832812-15658@excite.com>


Tired of looking at your wrinkled face in the mirror as you pluck yet another grey hair and watch the 
pounds pile on? Is the "spark" missing from your love life? If you're over 40, chances are it is. 
Wouldn't you enjoy a longer, healthier and happier life? 

http://improvedpills.com/hgh/index.php?pid=eph9106


Human Growth Hormone can repair the physiology of the old cell, and rejuvenating the body, and reversing 
years of damage!

http://doctorspill.com/hgh/index.php?pid=eph9106


Human Growth Hormone (HGH) increases: 

Energy 
Weight Loss 
Muscle Gain and Endurance  
Increase Immune Function
Smoother Skin - More Elasticity
Quality of Deep Sleep 

http://doctorspill.com/hgh/index.php?pid=eph9106


nirvana1 rambo1jamaica olivier medical speedo christop ariane 
river kittysurf tootsie redrum
valentin rugby money1 

Get off this list by going to http://doctorspill.com/hgh/applepie.php
From theking21play at hotmail.com  Tue Mar  9 21:11:46 2004
From: theking21play at hotmail.com (brooks)
Date: Tue Mar  9 08:16:35 2004
Subject: [Bioperl-l] This drug puts V I A G R A to shame!
Message-ID: <1078884706-20840@excite.com>

The Biggest New Drug since V1agra! Many times as powerful.

http://skidoo.royaldrugs.com/sv/index.php?pid=eph9106

-A quicker more stable erection
-More enjoyable sex for both
-Longer sex
-Known to add length to you erection
-Lasts up to 36 hours (not a thrity-six hour erection, but enhancement for thirty-six)

We have it at a discounted savings. Save when you go through our site on all your orders.

http://river.royaldrugs.com/sv/index.php?pid=eph9106


image taffycanela space dundee cobra mission liverpoo 
nikita dgjbeanie boogie arizona
first young bird 

Get off this list go to http://active.royaldrugs.com/sv/applepie.php
From chauser at duke.edu  Tue Mar  9 10:52:44 2004
From: chauser at duke.edu (Charles Hauser)
Date: Tue Mar  9 10:58:22 2004
Subject: [Bioperl-l] passing HTML code to start_report (HTMLResultWriter.pm)
Message-ID: <1078847563.17733.19.camel@pandorina>

All,

Looking at HTMLResultWriter.pm it looks as though I can specify pass
info I want to appear in the <head> by:

    my $processor = new Bio::SearchIO::Writer::HTMLResultWriter(@args);
    $processor->start_report(\&my_start_report);

sub my_start_report {
    my ($result) = @_;
    return sprintf(
    qq{<HTML>
      <HEAD><CENTER><TITLE> whatever </TITLE></CENTER></HEAD>
      <BODY BGCOLOR="WHITE">
    },$result->algorithm);    
}


If I try this I get back the html returned from the method 'default_start_report'???
Specifically I want to add some js links(head) and a logo(body) to the HTML report for blast.


Charles

From mbatealann at netscape.net  Tue Mar  9 11:23:25 2004
From: mbatealann at netscape.net (mbatealann@netscape.net)
Date: Tue Mar  9 11:25:32 2004
Subject: [Bioperl-l] REPLY SOON
Message-ID: <EPSBPDCTzvb5YI9IJsa000005e1@EPSBPDC.epsb.com>

Dear Friend,
As you read this, I don't want you to feel sorry for 
me, because, I believe everyone will die someday. 
My name is BATES ALAN a merchant in Dubai, in the 
U.A.E.I have been diagnosed with Esophageal cancer.
It has defiled all forms of medical treatment, and right now 
I have only about a few months to live, according to medical experts. 
I have not particularly lived my life so well, as I 
never really cared for anyone(not even myself)but my 
business. Though I am very rich, I was never 
generous, I was always hostile to people and only 
focused on my business as that was the only thing I 
cared for. But now I regret all this as I now know 
that there is more to life than just wanting to have 
or make all the money in the world. 
I believe when God gives me a second chance to come 
to this world I would live my life a different way 
from how I have lived it. Now that God has called 
me, I have willed and given most of my property 
and assets to my immediate and extended family 
members as well as a few close friends. 
I want God to be merciful to me and accept my soul 
so, I have decided to give alms to charity 
organizations, as I want this to be one of the last 
good deeds I do on earth. So far, I have distributed 
money to some charity organizations in the U.A.E, 
Algeria and Malaysia. Now that my health has 
deteriorated so badly, I cannot do this myself 
anymore. I once asked members of my family to close one 
of my accounts and distribute the money which I have 
there to charity organization in Bulgaria and 
Pakistan, they refused and kept the money to 
themselves. Hence, I do not trust them anymore, as 
they seem not to be contended with what I have left 
for them. 
The last of my money which no one knows of is the 
huge cash deposit of eighteen million dollars 
$18,000,000,00 that I have with a finance/Security Company 
abroad. I will want you to help me collect this deposit 
and dispatched it to charity organizations.
I have set aside 10% for you and for your time.
God be with you. 
BATES ALAN

From barry.moore at genetics.utah.edu  Tue Mar  9 13:17:39 2004
From: barry.moore at genetics.utah.edu (Barry Moore)
Date: Tue Mar  9 13:23:37 2004
Subject: [Bioperl-l] Bio::Annotation::Reference
Message-ID: <404E0A43.3000303@genetics.utah.edu>

The value for the PUBMED line of a GenBank file seems to be ending up in 
the wrong place.  When I call  $ref->location on a 
Bio::Annotation::Reference object, I get a line such as, "Mol. Biol. 
Cell 12 (12), 3839-3851 (2001) PUBMED   11739784".  When I call 
$ref->pubmed on the same object it returns undefined.  It seems to me 
that $ref->location should return "Mol. Biol. Cell 12 (12), 3839-3851 
(2001)" and $ref->pubmed should return "11739784".  Is this a bug or is 
this the intended behaviour?  I didn't put this in Bugzilla since I 
wasn't entirely sure it was really a bug, should it go there?

Here's a bit of code that should repeat this situation.  It does the 
same thing for me on half a dozen different RefSeq accessions that I 
have tried.

use Bio::DB::GenBank;
use Bio::SeqIO;

my @ids = (NM_013366);
my $gb = new Bio::DB::GenBank();
my $seqio = $gb->get_Stream_by_id(@ids);

while (my $seq = $seqio->next_seq) {
  my $ann = $seq->annotation();
  my @refs = ($ann->get_Annotations('reference'));
  my $ref = shift @refs;
  print "LOCATION:\t",$ref->location, "\n";
  print "PUBMED:\t", $ref->pubmed, "\n";
}

I am running BioPerl 1.4 and ActiveState Perl 5.8 on Windows XP. 

Barry

-- 
Barry Moore
Dept. of Human Genetics
University of Utah
Salt Lake City, UT


From jason at cgt.duhs.duke.edu  Tue Mar  9 13:39:59 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar  9 13:45:41 2004
Subject: [Bioperl-l] Bio::Annotation::Reference
In-Reply-To: <404E0A43.3000303@genetics.utah.edu>
References: <404E0A43.3000303@genetics.utah.edu>
Message-ID: <Pine.LNX.4.50.0403091321380.31725-100000@tenero.duhs.duke.edu>

I think I may have already fixed this since the 1.4 release.  I'll test it
out when I get a chance or you can test on the CVS version of the code.  I
don't remember if I migrated the fix to the 1.4 branch yet though.

-jason

On Tue, 9 Mar 2004, Barry Moore wrote:

> The value for the PUBMED line of a GenBank file seems to be ending up in
> the wrong place.  When I call  $ref->location on a
> Bio::Annotation::Reference object, I get a line such as, "Mol. Biol.
> Cell 12 (12), 3839-3851 (2001) PUBMED   11739784".  When I call
> $ref->pubmed on the same object it returns undefined.  It seems to me
> that $ref->location should return "Mol. Biol. Cell 12 (12), 3839-3851
> (2001)" and $ref->pubmed should return "11739784".  Is this a bug or is
> this the intended behaviour?  I didn't put this in Bugzilla since I
> wasn't entirely sure it was really a bug, should it go there?
>
> Here's a bit of code that should repeat this situation.  It does the
> same thing for me on half a dozen different RefSeq accessions that I
> have tried.
>
> use Bio::DB::GenBank;
> use Bio::SeqIO;
>
> my @ids = (NM_013366);
> my $gb = new Bio::DB::GenBank();
> my $seqio = $gb->get_Stream_by_id(@ids);
>
> while (my $seq = $seqio->next_seq) {
>   my $ann = $seq->annotation();
>   my @refs = ($ann->get_Annotations('reference'));
>   my $ref = shift @refs;
>   print "LOCATION:\t",$ref->location, "\n";
>   print "PUBMED:\t", $ref->pubmed, "\n";
> }
>
> I am running BioPerl 1.4 and ActiveState Perl 5.8 on Windows XP.
>
> Barry
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From ksdoctor at burnham.org  Tue Mar  9 13:46:12 2004
From: ksdoctor at burnham.org (Kutbuddin Doctor)
Date: Tue Mar  9 14:03:16 2004
Subject: [Bioperl-l] Bioperl 1.4 on Debian
Message-ID: <404E10F4.5010300@burnham.org>


I am about to switch a debian machine from the stable debian package 
"bioperl 1.0-1".
The test package "bioperl 1.2.1-2" is listed as unstable.

Why is this ~1yr old source still listed as "unstable"?

I wish to go directly to bioperl 1.4 . I see my options as CPAN for 1.4 
or the test debian package "bioperl 1.2.1-2". Our happy-go-lucky system 
admin prefers debian packages :)

Are there any plans for updating the debian packages in the near future?

thanks.
-- 
-<>--<>--<>--<>--<>--<>--<>--<>-
Kutbuddin S. Doctor, PhD
Informatics & Data Management,
The Burnham Institute,
La Jolla, CA 92037
ksdoctor (AT)burnham.org
http://burnham.org/
-<>--<>--<>--<>--<>--<>--<>--<>- 


From laurichj at bioinfo.ucr.edu  Tue Mar  9 14:06:12 2004
From: laurichj at bioinfo.ucr.edu (Josh Lauricha)
Date: Tue Mar  9 14:11:50 2004
Subject: [Bioperl-l] Bioperl 1.4 on Debian
In-Reply-To: <404E10F4.5010300@burnham.org>
References: <404E10F4.5010300@burnham.org>
Message-ID: <20040309190612.GB9004@bioinfo.ucr.edu>

On Tue 03/09/04 10:46, Kutbuddin Doctor wrote:
> 
> I am about to switch a debian machine from the stable debian package 
> "bioperl 1.0-1".
> The test package "bioperl 1.2.1-2" is listed as unstable.
> 
> Why is this ~1yr old source still listed as "unstable"?

The package is not unstable, it is in the unstable distro. If I recall
correctly, its also in testing. This just means that it was released
after woody was. When sarge is released, it will be in stable.

> I wish to go directly to bioperl 1.4 . I see my options as CPAN for 1.4 
> or the test debian package "bioperl 1.2.1-2". Our happy-go-lucky system 
> admin prefers debian packages :)

I've got an unofficial 1.4 package that I could make availible, but its
not likely to ever get into debian (I'm not a maintainer) and when 1.4
is in, its not likely to be structured like mine (which isn't really a
big deal.)

-- 

------------------------------------------------------
| Josh Lauricha            | Ford, your turning into |
| laurichj@bioinfo.ucr.edu | a penguin. Stop it.     |
| Bioinformatics, UCR      |                         |
|----------------------------------------------------|
| OpenPG:                                            |
|  5A0D 92D3 D093 79DE F724 1137 6DF1 B5EB D9CE AAA8 |
|----------------------------------------------------|
From jhpark98 at dreamwiz.com  Mon Mar  8 23:40:37 2004
From: jhpark98 at dreamwiz.com (=?ks_c_5601-1987?B?udrB2Mf8?=)
Date: Tue Mar  9 14:23:34 2004
Subject: [Bioperl-l] biosql error
Message-ID: <!~!UENERkVCMDkAAQACAAAAAAAAAAAAAAAAABgAAAAAAAAAQyA9uEDQJEe0abyT1RfADsKAAAAQAAAAY+/yMNpL9EyodJoxoatw9wEAAAAA@dreamwiz.com>

Dear Bioperl Experts

I have some problem to perform biosql database update like this.

Until upgrade perl 5.8.3, I had been used load_seqdatabase.pl script to
update genbank file into biosql database.

I don?t know what is the problem. 

Is there some method to fix these bugs?

 
------------- EXCEPTION  -------------

MSG: failed to open connection: 

STACK Bio::DB::DBI::base::new_connection /usr/local/lib/perl5/site_perl/5.8.
3//Bio/DB/DBI/base.pm:249

STACK Bio::DB::DBI::base::get_connection /usr/local/lib/perl5/site_perl/5.8.
3//Bio/DB/DBI/base.pm:209

STACK Bio::DB::BioSQL::BasePersistenceAdaptor::dbh
/usr/local/lib/perl5/site_perl/5.8.3//Bio/DB/BioSQL/BasePersistenceAdaptor.p
m:1443

STACK Bio::DB::BioSQL::BasePersistenceAdaptor::rollback
/usr/local/lib/perl5/site_perl/5.8.3//Bio/DB/BioSQL/BasePersistenceAdaptor.p
m:1362

STACK toplevel load_seqdatabase.pl:515

From hlapp at gnf.org  Tue Mar  9 15:05:44 2004
From: hlapp at gnf.org (Hilmar Lapp)
Date: Tue Mar  9 15:11:24 2004
Subject: [Bioperl-l] biosql error
In-Reply-To: <!~!UENERkVCMDkAAQACAAAAAAAAAAAAAAAAABgAAAAAAAAAQyA9uEDQJEe0abyT1RfADsKAAAAQAAAAY+/yMNpL9EyodJoxoatw9wEAAAAA@dreamwiz.com>
Message-ID: <25C41E56-7205-11D8-A93D-000A959EB4C4@gnf.org>

If you upgraded perl you will need to recompile and re-install DBI as  
well as your DBD driver(s) of choice. What you're seeing is most likely  
not due to a bug - it rather looks like an installation issue on your  
machine.

Also, to see the DBI error messages, which in this case I'm pretty sure  
will conclusively point at the problem, supply --printerror to the  
script. (If your version doesn't recognize that option, you need to  
update from bioperl-db cvs.)

	-hilmar

On Monday, March 8, 2004, at 08:40  PM, ??? wrote:

> Dear Bioperl Experts
>
> I have some problem to perform biosql database update like this.
>
> Until upgrade perl 5.8.3, I had been used load_seqdatabase.pl script to
> update genbank file into biosql database.
>
> I don?t know what is the problem.
>
> Is there some method to fix these bugs?
>
>
>
> ------------- EXCEPTION  -------------
>
> MSG: failed to open connection:
>
> STACK Bio::DB::DBI::base::new_connection  
> /usr/local/lib/perl5/site_perl/5.8.
> 3//Bio/DB/DBI/base.pm:249
>
> STACK Bio::DB::DBI::base::get_connection  
> /usr/local/lib/perl5/site_perl/5.8.
> 3//Bio/DB/DBI/base.pm:209
>
> STACK Bio::DB::BioSQL::BasePersistenceAdaptor::dbh
> /usr/local/lib/perl5/site_perl/5.8.3//Bio/DB/BioSQL/ 
> BasePersistenceAdaptor.p
> m:1443
>
> STACK Bio::DB::BioSQL::BasePersistenceAdaptor::rollback
> /usr/local/lib/perl5/site_perl/5.8.3//Bio/DB/BioSQL/ 
> BasePersistenceAdaptor.p
> m:1362
>
> STACK toplevel load_seqdatabase.pl:515
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From cjm at fruitfly.org  Tue Mar  9 20:54:30 2004
From: cjm at fruitfly.org (Chris Mungall)
Date: Tue Mar  9 21:00:13 2004
Subject: [Bioperl-l] new GFF3 support methods added
In-Reply-To: <A3999380-719E-11D8-910D-000A959EB4C4@gmx.net>
Message-ID: <Pine.LNX.4.33.0403091614060.20795-100000@heartbroken.lbl.gov>


On Mon, 8 Mar 2004, Hilmar Lapp wrote:

>
> On Monday, March 8, 2004, at 01:07  PM, Chris Mungall wrote:
>
> > Perhaps it would be better to have a Bio::SeqFeature::Tools::IDHandler
> > class? This would contain methods
> >
> >   generate_unique_persistent_id($feat) # uses $feat->seq_id
> >   generate_unique_persistent_id($feat, $seq_id)
> >
> >   create_hierarchy_from_ParentIDs($featholder);
> >   set_ParentIDs_from_hierarchy($featholder);
> >
> > Any preference? I'm leaving them as is for now if that's ok, but I
> > have no
> > objections to moving everything to a seperate class if that's prefered.
> >
>
> I like your suggestion. Keeps FeatureHolderI lighter, and is easier to
> change behaviour. But maybe it's just me who likes this better.

Ok, I'll go ahead and add this

We will now have 3 classes in Bio::SeqFeature::Tools

  Unflattener
  TypeMapper
  IDHandler

Personally I like this pattern, whereby the object model is kept simple
and lightweight, and complex behaviour is punted to seperate classes. It
does force the question - how do we decide which methods goes in the model
class and which goes in a behaviour class? Is it just an intuitive thing?
I'm happy for the bioperl czar to weigh in (who is that these days?) and
make individual judgement calls, I just want what I'm doing to be
consistent with some overall plan.

Cheers
Chris

> 	-hilmar
>
>


From jason at cgt.duhs.duke.edu  Tue Mar  9 21:25:53 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar  9 21:31:33 2004
Subject: [Bioperl-l] OSX, NCBI blast 2.2.8, blast parsers
Message-ID: <Pine.LNX.4.50.0403092117170.2958-100000@tenero.duhs.duke.edu>

Strict BLAST parsers are going to fall over on ncbi blast 2.2.8 produced
reports.

You can see what happens on you system by doing this is the bioperl
directory:

$ blastpgp -d  $BLASTDB/swissprot  -i t/data/amino.fa -j 2 | grep "total letters"
  U79 sequences; E27 total letters
$ blastall -p blastn -i t/data/test.txt -d $BLASTDB/ecoli.nt | grep "total letters"
 400 sequences; N39 total letters

The parser expects numbers in those spots not a combination.  This is
using
ftp://ftp.ncbi.nih.gov/blast/executables/blast-2.2.8-powerpc-macosx.tar.gz

This does not appear on linux ia32 distribution so I assume it is an OSX
specific issue or compilation related issue.

This was found per bug #1598
http://bugzilla.open-bio.org/show_bug.cgi?id=1598

-jason
--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From skchan at cs.usask.ca  Wed Mar 10 00:56:24 2004
From: skchan at cs.usask.ca (Simon K. Chan)
Date: Wed Mar 10 01:02:46 2004
Subject: [Bioperl-l] TreeIO question/problem
Message-ID: <1078898184.404eae08173ae@webmail.usask.ca>

Hi,

I wish to parse a tree in newick format.

I attempted to use the TreeIO module to do so:

My problem is that $treeio->next_tree does not return anything, thus I cannot
call the get_nodes method to get all the nodes in my tree.

My code follows.  Many thanks for your time.

use lib '/home/skchan/bioperl/bioperl-1.4';
use Bio::Tree::Node;
use Bio::TreeIO;
use Bio::Tree::Tree;
use Bio::Tree::TreeI;
                     
# obtained code from bioperl documentation:                                    
                      
my $treeio = new Bio::TreeIO(-format=>"newick", -file=>"./newick_tree");
my $tree = $treeio->next_tree;
                                                                                
print "tree: $tree\n";
                                                                                
my @nodes = $treeIO->get_nodes;
                                                                                
print "nodes: @nodes\n";


From skchan at cs.usask.ca  Wed Mar 10 01:05:03 2004
From: skchan at cs.usask.ca (Simon K. Chan)
Date: Wed Mar 10 01:10:54 2004
Subject: Part 2: [Bioperl-l] TreeIO question/problem
Message-ID: <1078898703.404eb00f5833d@webmail.usask.ca>

My apologies, but there was a mistake in my first email.  My code is actually
the following:  


use lib '/home/skchan/bioperl/bioperl-1.4';
use Bio::Tree::Node;
use Bio::TreeIO;
use Bio::Tree::Tree;
use Bio::Tree::TreeI;
                     
# obtained code from bioperl documentation:                                    
                      
my $treeio = new Bio::TreeIO(-format=>"newick", -file=>"./newick_tree");
my $tree = $treeio->next_tree;
                                                                                
print "tree: $tree\n";

$treeio->next_tree does not return anything, thus, I cannot run get_nodes.

Many thanks again.

Simon


----- Forwarded message from "Simon K. Chan" <skchan@cs.usask.ca> -----
    Date: Tue, 09 Mar 2004 23:56:24 -0600
    From: "Simon K. Chan" <skchan@cs.usask.ca>
Reply-To: "Simon K. Chan" <skchan@cs.usask.ca>
 Subject: [Bioperl-l] TreeIO question/problem
      To: bioperl-l@bioperl.org

Hi,

I wish to parse a tree in newick format.

I attempted to use the TreeIO module to do so:

My problem is that $treeio->next_tree does not return anything, thus I cannot
call the get_nodes method to get all the nodes in my tree.

My code follows.  Many thanks for your time.

use lib '/home/skchan/bioperl/bioperl-1.4';
use Bio::Tree::Node;
use Bio::TreeIO;
use Bio::Tree::Tree;
use Bio::Tree::TreeI;
                     
# obtained code from bioperl documentation:                                    
                      
my $treeio = new Bio::TreeIO(-format=>"newick", -file=>"./newick_tree");
my $tree = $treeio->next_tree;
                                                                                
print "tree: $tree\n";
                                                                                
my @nodes = $treeIO->get_nodes;
                                                                                
print "nodes: @nodes\n";


_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l

----- End forwarded message -----


From abrusan at mpil-ploen.mpg.de  Wed Mar 10 07:32:25 2004
From: abrusan at mpil-ploen.mpg.de (Gyoergy Abrusan)
Date: Wed Mar 10 07:36:53 2004
Subject: [Bioperl-l] Printing string fragments?
Message-ID: <404F0AD9.2040809@mpil-ploen.mpg.de>

Dear All,

I would like to ask how can I print out certain fragments of strings.
my data are in the format:

 >seqname1#uninteresting_information
ATCGTACGTGCGAGACGT......
 >seqname2#.....
ATGTCATG....

I wrote a script that makes some statistics on the nucleotide sequences.
I would like to print the results out as:

seqname1    results
seqname2    results

at the moment I can print them out only as:

 >seqname1#uninteresting_information     results
 >seqname2#uninteresting_information     results

Because the scriprt prints out the entire seqname-sting.
How can I print out parts of a string, say from symbol '>' to
symbol '#'?

Thanking you in advance.
Kind regards,

Gyorgy Abrusan,

Max-Planck Institute for Limnology,
Ploen, Germany


From ak at ebi.ac.uk  Wed Mar 10 07:52:04 2004
From: ak at ebi.ac.uk (Andreas Kahari)
Date: Wed Mar 10 07:57:42 2004
Subject: [Bioperl-l] Printing string fragments?
In-Reply-To: <404F0AD9.2040809@mpil-ploen.mpg.de>
References: <404F0AD9.2040809@mpil-ploen.mpg.de>
Message-ID: <20040310125204.GA27239@ebi.ac.uk>

On Wed, Mar 10, 2004 at 01:32:25PM +0100, Gyoergy Abrusan wrote:
> Dear All,
> 
> I would like to ask how can I print out certain fragments of strings.
> my data are in the format:
> 
> >seqname1#uninteresting_information
> ATCGTACGTGCGAGACGT......
> >seqname2#.....
> ATGTCATG....
> 
> I wrote a script that makes some statistics on the nucleotide sequences.
> I would like to print the results out as:
> 
> seqname1    results
> seqname2    results
> 
> at the moment I can print them out only as:
> 
> >seqname1#uninteresting_information     results
> >seqname2#uninteresting_information     results
> 
> Because the scriprt prints out the entire seqname-sting.
> How can I print out parts of a string, say from symbol '>' to
> symbol '#'?

my $result = 'some result';
my $string = '>seqname1#uninteresting_information';

$string =~ /^>([^#]+)#/;    # See the perlre manual ("perldoc perlre")
my $seqname = $1;

print "$seqname\t$result\n";


Cheers,
Andreas

-- 
| {} |      Andreas K?h?ri                                |][][|
|{}{}|      EMBL, European Bioinformatics Institute       |[][]|
| {} |      Wellcome Trust Genome Campus                  |][][|
|{}{}|      Hinxton, Cambridgeshire, CB10 1SD             |[][]|
| {} |      United Kingdom                                |][][|
From brian_osborne at cognia.com  Wed Mar 10 08:35:21 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Wed Mar 10 08:41:02 2004
Subject: [Bioperl-l] Web site notes
Message-ID: <GAEDKMGOKFBLJPKCLKCCMEFMDIAA.brian_osborne@cognia.com>

Bioperl-l,

Just writing to tell you that over the past few weeks I've been editing many
of the Web pages. Changes include:

- standardizing font sizes and faces over the pages
- removing links to outdated pages
- editing outdated text
- making the "highlights" in the left navigation bar work on every page,
giving every page a navigation bar
- fixing some broken links

Much of this work is returning the site to its initial "graphic" state, as
it existed before it was editable in CVS.

Brian O.


From brian_osborne at cognia.com  Wed Mar 10 09:02:06 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Wed Mar 10 09:07:50 2004
Subject: [Bioperl-l] Printing string fragments?
In-Reply-To: <404F0AD9.2040809@mpil-ploen.mpg.de>
Message-ID: <GAEDKMGOKFBLJPKCLKCCOEFNDIAA.brian_osborne@cognia.com>

Gyoergy,

If you are actually creating this fasta file yourself then you could change
the format of the header a bit by adding a space, to:

 >seqname1 #uninteresting_information

Then, $seq_obj->display_id will give "seqname1", not
"seqname1#uninteresting_information".

Brian O.


-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Gyoergy Abrusan
Sent: Wednesday, March 10, 2004 7:32 AM
To: bioperl-l@bioperl.org
Subject: [Bioperl-l] Printing string fragments?

Dear All,

I would like to ask how can I print out certain fragments of strings.
my data are in the format:

 >seqname1#uninteresting_information
ATCGTACGTGCGAGACGT......
 >seqname2#.....
ATGTCATG....

I wrote a script that makes some statistics on the nucleotide sequences.
I would like to print the results out as:

seqname1    results
seqname2    results

at the moment I can print them out only as:

 >seqname1#uninteresting_information     results
 >seqname2#uninteresting_information     results

Because the scriprt prints out the entire seqname-sting.
How can I print out parts of a string, say from symbol '>' to
symbol '#'?

Thanking you in advance.
Kind regards,

Gyorgy Abrusan,

Max-Planck Institute for Limnology,
Ploen, Germany


_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From jason at cgt.duhs.duke.edu  Wed Mar 10 09:17:36 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Mar 10 09:23:15 2004
Subject: Part 2: [Bioperl-l] TreeIO question/problem
In-Reply-To: <1078898703.404eb00f5833d@webmail.usask.ca>
References: <1078898703.404eb00f5833d@webmail.usask.ca>
Message-ID: <Pine.LNX.4.50.0403100916390.7337-100000@tenero.duhs.duke.edu>

Did you try other tree files like the examples in t/data?

I cannot do much to help without a copy of the offending tree file
in newick_tree.

-jason
On Wed, 10 Mar 2004, Simon K. Chan wrote:

> My apologies, but there was a mistake in my first email.  My code is actually
> the following:
>
>
> use lib '/home/skchan/bioperl/bioperl-1.4';
> use Bio::Tree::Node;
> use Bio::TreeIO;
> use Bio::Tree::Tree;
> use Bio::Tree::TreeI;
>
> # obtained code from bioperl documentation:
>
> my $treeio = new Bio::TreeIO(-format=>"newick", -file=>"./newick_tree");
> my $tree = $treeio->next_tree;
>
> print "tree: $tree\n";
>
> $treeio->next_tree does not return anything, thus, I cannot run get_nodes.
>
> Many thanks again.
>
> Simon
>
>
> ----- Forwarded message from "Simon K. Chan" <skchan@cs.usask.ca> -----
>     Date: Tue, 09 Mar 2004 23:56:24 -0600
>     From: "Simon K. Chan" <skchan@cs.usask.ca>
> Reply-To: "Simon K. Chan" <skchan@cs.usask.ca>
>  Subject: [Bioperl-l] TreeIO question/problem
>       To: bioperl-l@bioperl.org
>
> Hi,
>
> I wish to parse a tree in newick format.
>
> I attempted to use the TreeIO module to do so:
>
> My problem is that $treeio->next_tree does not return anything, thus I cannot
> call the get_nodes method to get all the nodes in my tree.
>
> My code follows.  Many thanks for your time.
>
> use lib '/home/skchan/bioperl/bioperl-1.4';
> use Bio::Tree::Node;
> use Bio::TreeIO;
> use Bio::Tree::Tree;
> use Bio::Tree::TreeI;
>
> # obtained code from bioperl documentation:
>
> my $treeio = new Bio::TreeIO(-format=>"newick", -file=>"./newick_tree");
> my $tree = $treeio->next_tree;
>
> print "tree: $tree\n";
>
> my @nodes = $treeIO->get_nodes;
>
> print "nodes: @nodes\n";
>
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
> ----- End forwarded message -----
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From dlondon at ebi.ac.uk  Wed Mar 10 09:35:02 2004
From: dlondon at ebi.ac.uk (Darin London)
Date: Wed Mar 10 09:40:40 2004
Subject: [Bioperl-l] BOSC 2004 Announcement and Call for Papers (fwd)
Message-ID: <Pine.LNX.4.44.0403101432140.4340-100000@parrot.ebi.ac.uk>

 {Please pass the word!}
 
 MEETING ANNOUNCEMENT & CALL FOR SPEAKERS
 
 The 5th annual Bioinformatics Open Source Conference (BOSC'2004) is 
 organized by the not-for-profit Open Bioinformatics Foundation. The 
 meeting will take place July 29-30, 2004 in Glasgow, Scotland, and is 
 one of several Special Interest Group (SIG) meetings occurring in 
 conjunction with the 12th International Conference on Intelligent 
 Systems for Molecular Biology.

 see http://www.iscb.org/ismb2004/ for more information.

 The focus of the meeting will be on current and emerging Open Source** 
 informatics tools and toolkits. BOSC provides a forum for developers, 
 project groups, users and interested parties to meet personally, exchange ideas and 
 collaborate together.

 In addition, keynote speeches from well known Open Source Bioinformatics 
 leaders are being planned.
 
 BOSC PROGRAM & CONTACT INFO
 
 * Web: http://www.open-bio.org/bosc2004/
 * Email: bosc@open-bio.org
 * Online registration: https://www.cteusa.com/iscb3/
 
 
 FEES
 
 * Corporate :GBP ?165.00 british pounds sterling
 * Academic : GBP ?120.00 british pounds sterling
 * Student : GBP ?90.00 british pounds sterling
 
 A 17.5% Valued Added Tax(VAT) will be added to all fees.

 Note: We have tried to set our fees as low as possible without risking 
 the chance that the foundation will lose money on the event. We budget 
 with the goal of breaking even on costs or realizing a small profit.
 
 REGISTER ONLINE FOR BOSC'2004 & ISMB AT:
 https://www.cteusa.com/iscb3/
 
 SPEAKERS & ABSTRACTS WANTED
 
 The program committee is currently seeking abstracts for talks at BOSC 
 2004. BOSC is a great opportunity for you to tell the community about 
 your use, development, or philosophy of open source software development 
 in bioinformatics. The committee will select several submitted abstracts 
 for 25-minute talks and others for shorter "lightning" talks. Accepted 
 abstracts will be published on the BOSC web site.
 
 If you are interested in speaking at BOSC 2004, 
 please send us:

 * an abstract (no more than a few paragraphs)
 * a URL for the project page, if applicable  
 * information about the open source license used for your software or
   your release plans.
 
 LIGHTNING-TALK SPEAKERS WANTED!
 
 The program committee is currently seeking speakers for the lightning 
 talks at BOSC 2004. Lightning talks are quick - only five minutes 
 long - and a great opportunity for you to give people a quick 
 summary of your open source project, code, idea, or vision of the future.

 If you are interested in giving a lightning talk at BOSC 2004, 
 please send us:

 * a brief title and summary (one or two lines)
 * a URL for the project page, if applicable
 * information about the open source license used for your software or 
   your release plans.

 We will accept entries on-line until BOSC starts, but
 space for demos and lightning talks is limited.<br/>
    
 SOFTWARE DEMONSTRATIONS WANTED!

 If you are involved in the development of Open Source Bioinformatics 
 Software, you are invited to provide a short demonstration to attendees 
 of BOSC 2004.

 If you are interested in giving a software demonstration at BOSC 2004,
 please send us:

 * a brief title and summary (one or two lines)
 * a URL for the project page, if applicable
 * Internet connectivity requirements (e.g. website Application served on 
   the world wide web, or web based client application).

   We will accept entries on-line until the BOSC starts, but
   space for demos and lightning talks is limited.  

 ** Because the mission of the OBF is to promote Open Source software, we 
 will favor submissions for projects that apply a recognized Open Source 
 License, or adhere to the general Open Source Philosophy.

 See the following websites for further details:
 href="http://www.opensource.org/licenses/
 href="http://www.opensource.org/docs/definition.php


From nathanhaigh at ukonline.co.uk  Wed Mar 10 13:17:39 2004
From: nathanhaigh at ukonline.co.uk (Nathan Haigh)
Date: Wed Mar 10 13:23:19 2004
Subject: [Bioperl-l] fasta3 program support
Message-ID: <!~!UENERkVCMDkAAQACAAAAAAAAAAAAAAAAABgAAAAAAAAA+EsXQZcrCEGeBpZF7/IE7sKAAAAQAAAAhwgO1FXZwkG5mTO7LZUs3QEAAAAA@ukonline.co.uk>

Does the standalonefasta module support any other version that fasta3, I
have fasta34 and am running into problems! If it only runs with fasta3, does
anyone know where I can download the correct version to run with bioperl,
and what the differences between fasta3 and fasta34 are?

Regards
Nathan


From Cariaso at yahoo.com  Wed Mar 10 13:59:52 2004
From: Cariaso at yahoo.com (Michael Cariaso)
Date: Wed Mar 10 14:05:30 2004
Subject: [Bioperl-l] seeking advice about setting up a local GenBank mirror
In-Reply-To: <Pine.LNX.4.44.0403101432140.4340-100000@parrot.ebi.ac.uk>
References: <Pine.LNX.4.44.0403101432140.4340-100000@parrot.ebi.ac.uk>
Message-ID: <404F65A8.3040806@yahoo.com>

This is not a strictly bioperl question so I'll take non-bioperl related 
answers offline.

I need to setup a local mirror of most of GenBank (I don't need the ESTs 
or indexes). There seems to be several ways to achieve this, but no 
single definitive solution. I'd like to keep the files in their gzipped 
form. And my copies don't need to be too fresh, up to a week old should 
be acceptable. Can anyone recommend, or warn me away from *ANY* of these 
approaches.

Is there a suitable part of bioperl?

Has anyone used BioMirror.pm?
http://www.bio-mirror.net/biomirror/software/biomirror/biomir-help.txt

What about ftpmirror?
http://noc.intec.co.jp/ftpmirror.html

What about mirror? (It's hard to find a current copy, the word is too 
common. Is anyone still using this?)
http://www.etsimo.uniovi.es/mirror/mirror.html

I'm about 10 miles away from the NIH, and not connected to the I2 
network. Is there any reason to prefer mirroring off of
ftp://bio-mirror.net/biomirror//genbank/
versus
ftp://ftp.ncbi.nih.gov/genbank/


--
Michael Cariaso
http://www.cariaso.com

From sdavis2 at mail.nih.gov  Wed Mar 10 14:20:14 2004
From: sdavis2 at mail.nih.gov (Sean Davis)
Date: Wed Mar 10 14:24:19 2004
Subject: [Bioperl-l] bio::tools::run::blat?
Message-ID: <BC74D49E.5888%sdavis2@mail.nih.gov>

Just a quick question:  Has anyone considered (or achieved) the blat
equivalent of standaloneblast?

Thanks,
Sean

From jason at cgt.duhs.duke.edu  Wed Mar 10 14:39:16 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Mar 10 14:44:54 2004
Subject: [Bioperl-l] bio::tools::run::blat?
In-Reply-To: <BC74D49E.5888%sdavis2@mail.nih.gov>
References: <BC74D49E.5888%sdavis2@mail.nih.gov>
Message-ID: <Pine.LNX.4.50.0403101433560.7337-100000@tenero.duhs.duke.edu>

Bio::Tools::Run::Alignment::Blat
In the bioperl-run CVS repository and in the bioperl-run distro.

Any new "standalone-xxx" should be written in the bioperl runnable way.
StandAloneBlast is legacy code in many ways and pretty hard to manage.

Will Spooner at Ensembl has written some very nice generic pairwise
alignment search interfaces that allow dispatching of Fasta,SSAHA, Blast,
etc.  One day we may be able to import some more of his nice design into
the system.

-jason

On Wed, 10 Mar 2004, Sean Davis wrote:

> Just a quick question:  Has anyone considered (or achieved) the blat
> equivalent of standaloneblast?
>
> Thanks,
> Sean
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From shawnh at stanford.edu  Wed Mar 10 14:46:13 2004
From: shawnh at stanford.edu (Shawn Hoon)
Date: Wed Mar 10 14:51:53 2004
Subject: [Bioperl-l] bio::tools::run::blat?
In-Reply-To: <BC74D49E.5888%sdavis2@mail.nih.gov>
References: <BC74D49E.5888%sdavis2@mail.nih.gov>
Message-ID: <9661DDC0-72CB-11D8-8DA3-000A95783436@stanford.edu>

yes, its implemented.
Bio::Tools::Run::Alignment::Blat

shawn

On Mar 10, 2004, at 11:20 AM, Sean Davis wrote:

> Just a quick question:  Has anyone considered (or achieved) the blat
> equivalent of standaloneblast?
>
> Thanks,
> Sean
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From lstein at cshl.edu  Wed Mar 10 16:27:21 2004
From: lstein at cshl.edu (Lincoln Stein)
Date: Wed Mar 10 16:32:58 2004
Subject: use of seq_id. was: [Bioperl-l] Bio::Tools::GFF  use of seqname
In-Reply-To: <B94A4B7C-6742-11D8-983C-000A959EB4C4@gnf.org>
References: <B94A4B7C-6742-11D8-983C-000A959EB4C4@gnf.org>
Message-ID: <200403101627.21642.lstein@cshl.edu>

Bio::Graphics doesn't depend on a seq_id as far as I know.  Please let me know 
if you have evidence otherwise.

Lincoln


On Tuesday 24 February 2004 10:28 pm, Hilmar Lapp wrote:
> On Tuesday, February 24, 2004, at 01:55  AM, Dave Howorth wrote:
> > Hilmar Lapp wrote:
> >> Actually, $feat->can('seq_id') must be true at all times iff
> >> $feat->isa("Bio::SeqFeatureI"), so it's kind of superfluous to test
> >> for it.
> >
> > Where does this come from, please?  In the SeqFeatureI documentation
> > it says seq_id 'is an attribute such that you *can* store the ID' (my
> > emphasis).  You seem to be saying that if I'm creating a bunch of
> > (sub) features just so I can use Bio::Graphics, I must attach a seq_id
> > to each and every one.
>
> I'm not saying anything about the value of the attribute.
> $feat->can('seq_id') will be true if you can call $feat->seq_id(),
> which you will always be able to since it's defined in
> Bio::SeqFeatureI. Whether that method returns garbage or something
> useful is another story. I'm not sure whether you have to set seq_id()
> to something meaningful in order to remain compatible with
> Bio::Graphics, but I'd guess you do. Lincoln?
>
> > I have an inverse question that I haven't managed to find an answer to
> > yet. If I'm displaying these sub-features as segments, how can I
> > attach some text to the feature that will be displayed alongside each
> > individual segment?
>
> This one is for Lincoln ...
>
> 	-hilmar
>
> > Thanks, Dave
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)
From dag at sonsorol.org  Wed Mar 10 21:30:13 2004
From: dag at sonsorol.org (Chris Dagdigian)
Date: Wed Mar 10 21:35:57 2004
Subject: [Bioperl-l] O|B|F mail update -- making progress on anti-spam
 issues with our mailing lists
Message-ID: <404FCF35.5010705@sonsorol.org>


Hi folks,

Apologies for the cross-posting but I just wanted to give our list 
members and admins an update on some new anti-spam measures we have 
(re)enabled. Good news to report basically...

The most annoying spams recently have been the simple plain text 
messages without any HTML, attachments or mime-encoding that just slip 
right by our filters.  Some lists have been forced to switch over to 
"only members can post" while other lists (like bioperl) have 
consistantly voted to stay as open as possible.

I'll update you on our current efforts as well as a new effort that is 
about 24 hours old but already working really well so far.

Until yesterday we had three main lines of defense against spam:

1. The mailserver itself (rejects mail from nonexistant domains, etc.)

2. The sendmail Mail::Milter extention (MIMEDefang+SpamAssassin are used 
to scan all incoming messages. Anything that scores higher than 8.0 is 
simply discarded automatically. MIMEDefang also strips dangerous 
attachments like .exe and .pif)

3. Our mailing list moderation queue (emails with attachments, odd MIME 
encodings and spamassassin scores from 0.0 - 7.9 are held in a moderator 
queue for a human to make an accept/discard decision)

Here are some stats on how this system worked over the past few days:

  o 138 attempts to relay mail through our server blocked
  o 192 emails blocked due to forged or unresolvable sender domain
  o 577 emails discarded automatically by SpamAssassin+MIMEDefang

This system worked *ok* but put a lot of work onto the shoulders of our 
list admins who constantly had to weed out the spam caught up in the 
mailing list moderator system.

Yesterday I brought online another system that seems to be already 
working really well. It catches spam before we even accept it on our 
server which makes the load easier on both our scanning software and our 
  human list moderators.

The system is the RBL+ blackhole list from http://www.mail-abuse.org and 
the way it works is that we now query (via DNS) the RBL+ database each 
time someone connects to our mail server. If the RBL check against the 
sender IP address comes back as "positive" we reject the incoming email.

RBL+ is a combination of four constantly updated databases:

  1. RBL -- IP addresses of known, documented spammers and spam machines
  2. RSS -- IP addresses of documented/tested unsecured email relays
  3. OPS -- IP addresses of documented open proxy servers w/ spam history
  3. DUL -- IP addresses belonging to ISP dialup and DHCP customers

We have already blocked 137 email attempts in the last 24 hours from 
machines that were listed in one or more of the RBL databases.

It is too soon to tell but if the RBL+ system plus our existing 
anti-spam measures work well enough we may be in a position where our 
"closed" mailing lists could revert back to being 'anyone can post'.

Feedback appreciated. Especially if you get a "reject" message from us 
saying that you are listed in the RBL+ blackhole database!


Regards,
Chris
O|B|F


From MAILER-DAEMON at xmegs.com  Thu Mar 11 05:40:54 2004
From: MAILER-DAEMON at xmegs.com (MAILER-DAEMON@xmegs.com)
Date: Thu Mar 11 05:36:22 2004
Subject: [Bioperl-l] failure notice
Message-ID: <200403111036.i2BAaEtl017063@portal.open-bio.org>

Hi. This is the qmail-send program at xmegs.com.
I'm afraid I wasn't able to deliver your message to the following addresses.
This is a permanent error; I've given up. Sorry it didn't work out.

<.skedo@kspc.com.my>:
vdeliver: Invalid or unknown virtual user '.skedo'

--- Below this line is a copy of the message.

Return-Path: <bioperl-l@bioperl.org>
Received: (qmail 22341 invoked from network); 11 Mar 2004 10:40:51 -0000
Received: from unknown (HELO kspc.com.my) (218.208.34.66)
  by 202.157.185.141 with SMTP; 11 Mar 2004 10:40:51 -0000
From: bioperl-l@bioperl.org
To: .skedo@kspc.com.my
Subject: hello
Date: Thu, 11 Mar 2004 18:30:48 +0800
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="70606400"

--70606400
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit

my hero

--70606400
Content-Type: application/octet-stream; name="information.scr"
Content-Transfer-Encoding: base64
Content-Disposition: attachment; filename="information.scr"

TVqQAAMAAAAEAAAA//8AALgAAAAAAAAAQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAgAAAAA4fug4AtAnNIbgBTM0hVGhpcyBwcm9ncmFtIGNhbm5vdCBiZSBydW4gaW4g
RE9TIG1vZGUuDQ0KJAAAAAAAAABQRQAATAEDAFn0MEAAAAAAAAAAAOAADwILAQI4AFAAAAAQ
AAAAQAEA0JABAABQAQAAoAEAAABAAAAQAAAAAgAABAAAAAAAAAAEAAAAAAAAAACwAQAAEAAA
AAAAAAIAAAAAABAAABAAAAAAEAAAEAAAAAAAABAAAAAAAAAAAAAAAGStAQCAAQAAAKABAGQN
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAFVQ
WDAAAAAAAEABAAAQAAAAAAAAAAIAAAAAAAAAAAAAAAAAAIAAAOBVUFgxAAAAAABQAAAAUAEA
AEQAAAACAAAAAAAAAAAAAAAAAABAAADgLnJzcmMAAAAAEAAAAKABAAAQAAAARgAAAAAAAAAA
AAAAAAAAQAAAwDEuMjQAVVBYIQwJAglrSdS+0oUytzh2AQCwQAAAAKQAACYFADf/////VYvs
i0UMVleLfQgz0jPJM/aAPwB0KVNqAVsr34ldCIr3/+3/H4D7LnUMiAwCi1UgyQPX6wWIXAYB
QUZHJ/v/bXd14VsYgGQPAI1GAV9eXcOLRCQIU0xv/3+7fCQQTYH6AAgAAH06D7YIhcl0WcHA
dbr//7ckV147znwLihwGiB9HRjvxfvWAfAE+RH97+98EdATGBy5HQuvIL0ABA0gY67yAJwDb
7+5uVVvDo4HsGEtTVzPbuf8uAP//7v8zwI296ff//4id6AVqEPOrZqtaqlKNRexTUIlVf/v/
/+joBQAiDIs9SGFAAIPEDGY5XRBmxxoCAHYF/3W+u7v9EOscaCCLGGgUBP8VTCM7w3QGZm1v
7d8NCOsEajX/1yIxiUXuGlAnm3v7Pvj/C/B1FxQrVCVbcG1rKlwAARgnAgHt2yFbKVgQJmr9
WOl+WvPb/wJdav7r9lZo3xGs/9eAjeoB/O6uu9tYhZsBadcI7BKNhfQFmW4zt1ANne5kCAnw
3/520wby6F7+BFmL8FmDxn51FIic3u/XvjXuRlBJhDVKtNcJa7cF9178CFApBVNVJvZPNvdW
UOybXHUFavxb6+Xe2m/uNWAPKvxqBFC/I5xoBhBnnbvdBFfHEOgDqTDWHGiHu4O9BRcQ6FBc
UFNoz2yDjG0SGFdkEQpoY3+h3T1MJxtGavvrmYvYIGz/N/43i8NeX1vJw1aLdBeLxldpwBAQ
BABQ8tb+D55ki/hZhf90JxUUBAIA/g0R2mpttrCF9n4Pi8eLzkaj/d3ZMAUbSXX1DggHux8N
dwwQsWUPt4ACfFFo//tqjUj/viaJTfjrA4sEHNt7c7xuWH5TsBH8jboa948Zun/D/v1WO08C
di+Nn/wLVnu82DbWBlONfE1TBxRhYxk7Oot8JGkDgznr+FtGdb2FwHSjjMmFGMe2a7iApeie
AIlqPyZZkcPdbjgOiYt16h1A/JKxLdx+g2X8AKp7RgaK063Az75Itx/4DAgJChYDx7tt7Ws6
SQMeMPTGfegoXip9aexQDIoIhDY9vslA/zdu2sfwQe+L0QrB6QLzpYvKg+EDpt1v7fOkiykJ
AU30A/lzA8E+vbb70oBnHEf/RfRD68DLVfcDe/vfJqy9WXQVEICkBee917ZvjV0tjXwwE45E
BI/KwtttZj0ddRLx+HQNxfgwWCucQ4fBxWZ7MzvXBhAYVAJhqQh1B/wZ4X9H8QVgg33sAA+E
AwEZ/VduZ5szB+FIFpAABqa3G03Gg2p0bgQKdAyU0m39dS0APTWNR4VQof2HzeZ4AEoIUfiS
APG924yF/NkIKumNjSb37bc1f4MQUS0lvGtHCllZbuGbazcm8P8swLVBAnVZgg7IyPbrU1wK
/us9drsE5zJDnzk3fSj13cw1y/BEUHNweY235mbuhAgEqmtfamHsdAYuzLoadQg7QzgMEjwL
N22tAMdH6/QjCKgLp2V0qlk2QBwAX26ymvhXyAEOwKr91tJroBkJD4ZosFxBAPsXlu5DrKAU
al91Lv81MICXs4BNQs8vLxq/WSlhMEMfswMupVKstVmBa4UXYBsnA+3SdQQNrkc7EH4uuP6D
/ghX99C5Dm6M0f7B77FA+5dK3/fbjTTeiR+RGsMbl755I/Ez89rB690EtYAf9u25djPDQhQZ
FwZaAYs0FtbeNjjB6CfwGcYjwSAWYW4ZOQSF7sYwLe+CuVEiLBJPD4VB/rvlDI7hUnQZxfgj
+TP7t8KP/CM8vcdCTnXnn/xbXQan1gmhdJ/RraVyqeG/gAdWyZBg4GAwtNoDVgL+vGIbFkag
7+v86/3BO8YwyNZsB/VWJAJA5bbUeC2Izf8mfQwssNme0f4HyWoeSsCHbMdqi+pqLguQFr7d
gizgCczHJFBLAwTT3cEGd8pQu8QKAAWWrmm6BY3GA5jImi81G7TGCUKFJbX8nCduX9cKzAee
F8e8jGABts3dtjAQzgKgViOWVgnSbAba5ggFpAunI4hdV7bNDdYQqDraA6wUaK5siwiorQm2
5mtLR4QheNyuArq9B2INfh7k0wWKXTr27dcNVlaQHlZdKJue6zqANil4jPuZ5ddQGVpQHHUI
fBiy3XZLITkMdBwljO3WEWtfUFCbAUXrvge8Que6+LLwSJCQMh32bLgsHZABAhKUFLQIDlaY
4bYgeJkWddFdtlY14AUGL+RvLs9nQ9sH5itYXegB6gH0NyucbGvs4JKJBBozZzaveLsIgdIG
L5QF7V7r7mpY3H+4bR+D7BAz8DEVlC2BffDP3nr37wdyCAfaB3YGXvDUB2bP8gFyT5rHxgYM
E/IBAPb2HxeWriP2Cuzw8kpEwRqt3f7gCcHhBQvBDQwLGJ3w37YGD0GNFwYP+mbR6TmjG2sI
HQgayb8IhEe7Eb5XVgCrhcMywkLCfIv8u4W7uXOD+vj7IPH4idZm7rah14syOQh0LeQeZzAb
6B34KwXrzz0KJTs4KKY7qh8dWsJkIwsDBO3egaFW86mKkYRlGEwkLvCuG0ARikVwAYPiveIE
/93Rl7AEC9aDJAGKkiGIUQF+GmVZluaKUAECDwIGL7TvZxzrArI9KwIlcn4OioL92bdAIuA/
ioAasD2IQQP5DV0wmF2BQdSc2VBy5GZuB9iYBtyU4JBHjhw55IzoiOyEpIDkyJEjqHyseLB0
jhw5crRwuGy8aMBkyJEjR8RgyFzMWBOfxujQVFiceqX4Y2ezmYZP/hOYi43qF8pCkQIxoAPI
99m2yI5vL/F5AvfeA/QGBgA6jCU/JAB1MQz0j+9eNcm5UGF9BblMi4pqPJlfDd56K+4HUleZ
x/5QUYzP83QLqVAE+vjw8m6e7kJshaAM9vTUaCQoxPrwqL4cYSa+VmOJwckUNfhFGi1cBtw8
FwvFI4p0QuN0Pfzt7aC7xYVcbJR0BWtzgCbbZXurdOsDfNspOXQrJPQwO9xHL42HQApOOFK7
5JBzNexBUtNGjRa2N3EEfO08+AIQbG+pdvuZWff5OQt9Bz1qkUS9twEXfU5ooKAvGGaBzRiD
+M1xZN/obeWDfMMKBnUDsAHDQ+LF6jLAw3dpIWzg7912IgknMWoJYKEQgsiKBO1SY29GBD5G
ITtXct5NcxZ5AvcsCDAoIPHdhbWk/m40lWyBQPj/OXPbWZtZhyQhg/oPD478cqkXXXq+Hnhg
/A+hdzoVdG8NbrwFq7Mlul8MXLUTFmgTOmsxWFXMS2pQT9jZWXdcalllCn4g2ZEnZ5oEXPsk
gkRekgNsHxRizZIl6XV4c6Qb2WrhEqcYOtRXwpsle2e4SCQcK9mnofIHB73461CpSA7JyQ0G
pP7OlAy2HxQJH/vgJeGVSIGf6BSZ9z20erDExl1B9F3ANbfQ2UWWgmQmsLztxnexu9b5IoA8
H0AdU0cNWWps/Dv4fPLrEnsfeYZkQ0PnO7Q5va/T+DYMbTZMsL9I63ZAUbZq0TTm7/T4tMDO
9nUh/j6szPhvWMA+2IpTDvQjIKFHF7ZyddaDECC4c42DEMGF25owd6IADwRLZcu7pczWinRe
4QSS2Pkg2d0kViEoelkTc5uvOW4vLi9qEBhHtHLAoWom3iBfBdkuQv8wXNyuGJO6xVoiJMhq
GbuwAaXSagZzSW7a9C094Gep12AF+IBDNsCjBb5W7wHv2Qbs0xoDFQT4OHMOeRAWMPfEqXbp
BGjrNIxUna1079v7HTQSja5d+kiXDLNZhId1GoEbps6AvziOAV5XGsb2CIcx2BfWP3CXvC3s
dcYsFxG7BxB0eAHjU6kPDW3YV+FfwatZNmOw0pCy1oXqBrG2wQwlzE4B9Gz2Yvbhimx8ErFD
ZrZmKlRFRI4jZm4GHhfbx1iFeshgrpIMXEjEFtgZbB9Yww8AGcq9FdAFQCZ5JbwFTICcCGQI
VwVJDiADEv4ECOxskkRZEVmcQg4AcroEdQQnz77mA2ETODcMx8cA4SLjBCQwJFNibJGcMLAo
SUoGZCAc6AwYkCYUy9iAZCAL9grBC1kczBVuKxNqE5hXZUzGlrzUjHY4XQLkZazQjFsCOSDN
sVNXZP3MZP3yQHghEAlk/WWckBdk/XyMkM4ZLFuA9i8EexPfEosUlTQSUolVCDCwyU8w4AkE
aHSMf11pQ6Q4dQcQJ+ucyWbuBWgQBm6MpJTcYiWMILyLgEwhvCNYILHySgpwSQGjRTvNpRZ0
WPoEMKGXLJ516xVwHBCgm5sNDWklqGC5IvZPcAh0FWpIV1o5a+TeGJ6kHFxDP0YohC8nT1nn
P2EJB+S0izPb/ewJR/ICrItTE4VlmlpUNGotQA7G0y6ki0gkiZA9XFmK1DcglOsCM8An/39T
B3nBigY8IHQEPAl1A0br8w++/9YRvwZWKK1FDQ0OjQS/Ro18dakP9EHQ6+Vni1RVM8ltqyss
KhGrMwrYxgpT37v4IEGB+QAOfOiAogcANrm5/d64CUncVsIA8LhdQWuFVgSSskWHg/4beooU
MJIUgPogdQ84VDAx3Q/2e4iUDSzrBwhBQD3/D1rVBeHZ2ICkDwBMUFb4aHvoWZZRofpWKie0
a+FupQ+PioOCWSG/zbt4OPr5N3EpDFmFb+53gVZhZzs1MXzkG9vFoAhFeA2LDRhwRez9UIkE
jTplJ4IfDlrmEPxgJGGCbRT7niJznoUnt489NGAyDBXGgLCUFAH/oR4CvAUMOjRTS+rWmNWD
NKE7w6CNC9QykF349VAaWxNsGmX/KTsSD2/iN1r7D74RRasciwy19AoLL5TCLTg6EXFDpuES
bIZgP0B560uiRr7YdgQ4lDQg45wsu1++4EekODxgfnt8HjwvBzp8ewV4+BaAff8B5gdeQnHv
/W336wgMxkUYQ4P7KH4IEho6bNjgg/4Do7RIZqG4RalClHL7f+KdK26Tfc6QX1MrwwNws5Ee
DcxFfAjajpK9ZCuABXZsENlgv9T9gHw1y12NBHWUW/81AIGt94xsfAIgB3cHhQ4tX5qle8ku
dCEGyBrKE4A/XR7syGEZB3UKYRijWQM+zPC2W7mMtP4NhAM8mq7n+3VbvS3UME4f/34XBT5p
459tP/QUSFk78HTo6D4aGIaMMb0Ms4UNG2cjGvvYst5ZrxA/ZZb8xeBN8lkOM7i2Jr98DoXX
DGy/AX90Aov3zRJ85Tv3kZMbWA9T4YuiSJNbh2Z8u2FD9O9pwwRPITVI3r9FEn2t2RnJO9ac
7LB3vA2BD443ARxQinaP2EU+PIiLgypcZYXcdmC3mrwW8kzJM6BUiWSSj+xDdHZoLBJIaDQj
+Ug+NWhcImhE2Mv+sg+xGUdZ60IOGBAakC2BXOImNkZ4MDDZVfhYvBBM1GiBmUpOXUU4m1kN
WUNbG04SJYOPDYAIAo/kBq7Rc/j9vjRJlohEPyf8XIKPZQvvDMP7/nshm6DC/P7/Ni3swMwt
Dwy/bPcCZ+NQEMBJgf6UadybDHZ8lF6RRC5+DUhTy3htlBDkB7yzR7waHcxgozkcEfjvCKBr
llv7gL3pJgh1DeguFRdkZGTM6hYe2MrMyemt994b2IRaluELPXJrLJKdHPYQdEnsEjiDO4yO
FxawEwkZhKBFfNkv+xzmWQwdeOsMDRr0gUH4gRP1+Fl/Fgi2rXOBVqTIYMEIDv/CYYNpxFVW
vmCPY9uRpYLQBXQfNlRZIVrPCH704AT4BQ9hBgK+VwtKmKrg/S7/SQgHJPjQIQd5ydj+1/7Y
/vGSHGwSeI4RDLak45wP1CdwHUS2khMQh7v2wlW/QBVTaGlkeyxxvlgN2OlXx55BW7aOmABg
CIs9tkjmngTXQTx0CBt7g+0TaDAq0wQgaGNyhyz2aAEkHmj0z5nsbC/yqgwuAlgiKzBMTR5c
JCdHAtzUSSPklZyN3gPTcHiYAcy84LF2DTQYJysxxFpTU5rNsdkW3KOwSwrYPQqcwQc3dQlD
woVbuHYWVmjCRgMsPtFUN7n/CaS/6r6wFh8/4UUXVokdj6pUwuIsAlFWUmKxcFF4HCftff+p
k3cTahBoqOeIkWOihe/YGGEsHvgEzv3UcNR+ca19ajLndLRoLpqCUDzkR3T++waMdOk5HWh+
4cdFEJneS2zn/OzUgLuf3uLJ4gJO/zCnB8aDmZvLxaJEiEJqMo9t0VwcJF87R3y/62ooWPeX
/yV0bswA+wyOGvolsASF0nRHu0Q9N4BfiovVBHIt99l0dAhqvvL/K9GIB0dJdfqLyMHgBhDK
uq7wJtnpAnQGIDoGI0ptfFFnPl8Qw6r/yW7sHImaLDHdw8wAxK1VDbZXgnNNEHOh/9bai0jR
A8Y7/nYIOy+CeP8793D3xwOjFFthg/kIcinzpf8klaXab8PIMyjHuhyD6Z38Nbfy9uADA8gX
heAyHo3YkN113dMHXPATHAhAAyPRirbmtt+cikYBiEcBBQJWCFnGZScZW8dczI1JK+RZlrEl
AQICppCvO5uQI0YhRz+MaZquO78GrAOknJTu/5qmjIR8v0SO5IlEj+QHmqZpmujo7Ozw8Gma
pmn09Pj4/EP4rrH8jWT2AAPwA/gJDbXpvv/w4APsADSNCNnA3tJeXxWQnQv5kBBcsBGjDRDe
PswKK410MWd8Ofx/Z7e9ZCQN/eP8d2A1k3DeGhXvjRA1j/n7RT4nK2g0LJB4C62wma6YA8Bt
Azpv9pZ83QNOWE9WtksffLdLGKPuAu8CKYwJb9mAkCckq2DjlS0tA65FWtN1F+YdWxQGHAMk
YdM0TSw0PERXNZdpmqYZHBwYGBSmaZqmFBAQDAwspGmaCAgEBGHTdScfcAV4A4icNZdsCc4t
t7WHD8LAFsKDE7f/o2UTzAD3COtqjaQk6PBTe3pvu1f3wYf/bAFehYoBQcI7DnXxiwG6/xtv
/f/+/n4D0IPw/zPCg8EEqRsBgXR3QZtrqbv8JiOE5Iap+DgO279RcwYH2uvNjXn/6w0E/sxU
y8vrCP3rA/zNX92oVB4ZihHsSRdHxQrwg2Lu6wWJF3lnd5MdrG5pixFr4S80hPa1sTf2dCf3
wmkSB2rHOJJtZ2cuZgjG8wAMGewF2wiIB9/eFJEdDjlABQHjcMlJczIkE0Ekk2yPNSvBwwn+
/TbwK8j8x6Pgt8Oh/thv/wVpwP1DeAXDniYAFcH4ECX/f7JFVwkY6ASc5OCV+SiB2HVKZRc3
TwtQiCyTGuDhUAiOx05X7yX4pdx6VlOL2awU98bN0js2EUF1B8t1b+sho/as+8BGc3QlwSkf
dest3WyBvx1Rg+OTDSAdL2HSxu5LdfOmEFslw7lhzwSFXjrmLhErDZx0Ou5so0sqwhZhQli3
Y6+6zSAfcgYWg8beLLcngzQeDHXGOesYnKZz0YHiRgkOALa12Aa/0lPnVQoEYbtS74kHX8Ow
dYWj+AYPhyoR+hKDPWySz6BFO6t+DtUpLSm7LnEwdFxgkDEEQTvxZFtUBCcRaAelKhfedgJm
iyslHmFRPepW4QBdZTcUgbeO/e4VOO4tEIUBF3PspIvEweFLbwyL4YvFQARQw4/dodGi8ULZ
gfFpBRru7opxAfRPi/cZcemXztDwONB0FWkLcwoKdfUXPsMWfl/MEPCXjX6/g9bx/4phAmco
EDE44HXEikHau8a7AzEYimb/jxB03+uxL29z3+00isKQKaKNR/8MvscFJNpB+o1C/1vDzY1k
BoNowsTGG9htmwiD+I9QdNUTigpCONl00SHbb/1sURJ17QvYDMPB4xBWCIuE6zbCCr/GwbYz
y49SW3y4wfH/z88zEsIDjef3w+HQdRwlBnTTAagrEe3QgebsrbHNd6W7v4tC/DjYdDa37zjc
rs/nwejtpmm6EBIV3AbU65Ytc9K5Z7FC/jcG/fyDHQaLTwRTpDy229vtiwI6ay4KQyY6YQgl
ClcdlG6BaDqqGRQRrZszbR0QtaUaddLPk7btd4qQG8DR4ECR/0MB9/bZut0CQkTpQTDgEwKo
Zlg0T/O1M1vSysnBdKkuNnDrjGNqZMhlaD9cNnaYR2ShXFBkifizdEc/rexYMYll6Kj0XdXo
DbSK1Ik+ZMiL3TEKJ8oN3A3B4eyxu51tygrYr6PUBzP28O7THaA2X1nmahwLK/tZidBno08G
NLRr8NhioTijj/4zgqO8CDG1tdD2sTB8s54r0Jqkl4LPdArsFiTB9kUN+PLC0AFcD7dFA2oK
WMwFB9nonFZW0OggxXds8CvJCC3LDOy1CYlNfbuz6ZhQUQMuoMd1mB7c0m7BLSjEcgcFDThs
aTmXew84pWjTnbEvyQ02hCRZJfh1pICBUHs1JF8jvgY4pJt24HcivV3i8BxsxxY5p3QQEzn4
3t3bv8K9gTs1sJNJdwtWGj0vteqfpxyF9nUDDSIPg+bAUy/B8FaGNaBhl/xZcB0wxeY/b8x8
+lu3uPirO1sgg8AIQj3ffPGi+9tL2RNyHQQkdxjHBcgjDf3rLrmR9dX8KqMQw4H5vDZnZtsT
chIHyiUIdgpoLXbOMRaciQTat9R8yTpRVtJQC3zmXHFe1oWVAGGF2kDtJoKNSAFVfXcMtzUu
z28Pt+tSME41Djfi38XBdrbR9kRWAYBezWX+2Tb+Ev1N/IhF/WqLCQ39tReoVIWjjU0KBaAd
gq1hAVEpC5ToKJdLQlxOAuMOHLd3AQojRQwIodRDO/vBdfwC/9BoEIDDCATvhmgEDuhaMOQA
JLFqIc+ye6kMEC3tDAF2uP02Vw9fOT0QXlN1EdPbDaUrygjyBNgMd28tAU1c6Yk9DCKIHQjm
Vmd/KDyh0IMi58xihWYN/o1x/DvwchMml23k+69AayJz7V5oGJQUvuS7MEZoIBAchdtbOCP2
leN6iYZlXwbJdsEoqnMNV3txpBjh6+12U58v4QDaDR/AIHuLWAhIQZc7WhUBcPsFdWAIbnN5
1/npJN6D+wH2AA0UYY8tEEshCEGJC4tIBNZg7EcVhcgd8EoFsdX/5hX0A9FWO8p9FY00SeCN
tRC7FkASgyavDLG5FWjG+SM1/D2Ou4CvvD/AdQwMg/pwPQH5GbCQEoFdPZH5GZCfhEo9k4U3
PY0ZkJ8BgiQ9j4bY6eT5ET2SCoqSiIi1WsRqg2kKHe4r1KWa+lERmqODaLh4411otdlOtOEM
01td0Oz46d6zuzkVeAVWuHTt63jbfov/wAw7xnMEOXT1jQxJXgONFbLLxfc7wRJ0uyjIYqKX
48gAR6vo2WgdFdhVIsOaRgduwI0xGBH3wFB/Q6WJb9tv5kbr44A+IQ0HCjwgdh/bi9pbDCB3
+jRWD+lW4P3Ci8bbUzPbOR1ag1u76EALWiqyOsMVC/4WvTw9dAFHVvYgc6lvkwYB6+hlvQSA
W7jsdSwfO/MJ8DH038LTgwnWBz1BOB90OVWK3f4I/IvoWUWAP0kiVTQ/4rImkgYuVx4lvGI3
aDdZA/03Ol3/hFv49yyaiR0LiR5fXofEqZWN9YGEWwtRvRR6heG+GIBa0I/tMEZDoSmiAHxI
DUHh/jgYTXn484ko0e9TU58xzmjV1qhhW9iI1HDW14ZNuqEILyck2xYcdoZQVjX8VEha6CKE
+0WAo+QGCKndYNtMGBwU1oMhcmoj1mhRj1S1IIaWSpBzdzeKIhZuFJmAOJtEhS4WXnZAgPq+
KewlvjewcfvS9oKBYEcEdD0BGAaKEBU7MvaIFkZAC9XrzgzGbm+peh1GQBzrQx4FW/K2RQRA
RNr2gxny1tz9GIgeRmUgdAkJCAl1zKFYY4H/SLtKGMzS9kaAZRgATgC24Ixt39dEKwUnA17x
F8i99g/MvItVFP8Cx9DXi7//FuQ4XHUEQEPr95Is9sNa9hcchEdtDYB4ASKN4xi2Ercdi8JQ
NwgMqe03GlgYGA+UwokF0Ufav1tw00uwDkOIxgZcRrGNtmumQ4CnSoM/VXGpbb4Kij90Og9n
dC4w4bJXSuIGHzY3IJwbD0ADFQFAfW0Iu5AyujAPDoi1RjTcxwODJ44UuvsLTdwooEmhHGNT
uy2ao7qCUAlXOcC10dg2qHUE1Q4LdBU8EM8WIXAomYU7ohsn+Dv7F+q5vMucGwL+NF+D+IWB
T7VZh0MMPyesZmdvt9I5HnPrQEAIGHX5BvK0jd3SK8YvWE7R+I5AAqlYYmtdA4nKNIHb1JJ+
6DvrdDIys3QjHI7CNXBVULskJTTdNkjddQ4MECdcCYsDVtZF/GyeXMPrU+ZMpUalk7mFsXQ8
YOrt33aJZUA4e/sE9ivHQGrSV7CkVc5aC7pbwVnBVtQMMRB+cYQ6u11bguxEYQeg0IknBDqW
Jk2FZTIbFcCnlgsmuBjAYiBLlY0bvIYptHMabQToXXq/tsZGBQqhI/UIBRuJQci1iuGNZglr
26mjQnXFNRZE6QvtxdJnuTCN3LhISpn7d/uNHC58AnY5NWN9Ur/ETI+3mn1gADiDf/uNiC5L
82N+wXMYgGAIQIsPM8fYLtGBwXzk1UmlqBD7fLvrBosJ+wn4SzXqRosDRomKTQD2wQGeW/XW
fgQIdQuhRGAeJehfiijPwfgFg+EfDXRv1XrPIdILiQgviDVe4hvrR0WDw5v+fLpQKPECn+w8
2P/y2HVNO3sralUACBX2WOuIpttKfcNI99hljfVYSOpkf0C7dBdXZgwlGqUfRgo+0AaATmrq
ugJl3goDdQo2BYBmi32rWQN8m/+4NkxFAxYOqb1E6EoG+KiEHGhxdg6NbA0gVTyjW1DHQw03
bhNKD004cIdsQB1yzcO/aMoVH55V12i4bnqwoEbiTexdOYvlXbHqHgsPQQQGnbgdr94Ahg+u
KRCJArhy1D8YgMOQ2Gr+aMBGRRek2f3/NQAZII6FQt1Ji3AMQVw72bdd/cJ0KCB2iwyzibWJ
SBd8s7YHlaIEERMts/GCb/99N3L/VAjrw2SPcn8Ncs6hjOYFD4F5BHxrCXpoW1GlUgw5UWDq
7i2wBZuKUbsMB7Yd0axwCFiJSwJDF6jVt89rDFlb8oVWQ/j3AfwyMFhDMDBMCPr8i10MHJZi
G7j3QOTYgohrruBUOZ0IPpb4Llshc3sIwWG5dmt/qdixjxRFVlWNaxCoC1X3QnddXkELwzN4
PCVTLWPd9rOcswQdVgzeCDYmW8E2bt6PSY/Gd67bVQw7CDAaizSP66H1st+xr3scyesVXGr/
P0MbQmxdFpS8O+qS3X6LKYtBHFADGFAk4aE1FHC9b6CY8SqZis1bfvSOQCFoQ8Go61h6oSDK
We8j0awedJDfpLv6iyqIuCCTExB0/S3OmwtBPbCTlPHB5gM7lqVhbuEaJhwqbLuHbtKZ6HAN
ENeoVv21vfp1C/EfhVz+E3h2KELWF6hoQs0OIZpZEsn2dizevQdgQFllPHYpGeDsJGAP+A2D
+ircX0VqAwP4aKRBXnyzJN2nzGD/VYgQh5xNqldbHYTMWs1m7v+2JNMWEQk7yGCmAydcR8dZ
iWKufixf6yaNoTD0TdpNqDY6CGr023KrUzV+hClZKF9OXx8xD7HQsQR0IYChmXtSCJS80aYp
r5x1AQsllGERuJ3NBpgxo5BqvM0RuIgFGUChGEddY283gKGcB4j3FIMLu0b1K1AMFCRyB7cU
iAG5Qspob+qKWlTTAItBb7FtUDSQcQxa2sL8V0B9i9LB7s3mevxpye7eKNGGS73vjAFEmYld
9DKwVKITpBMSqL19ifZ1f8H5uT9JXwu11i/exs92Ax5ME/cD8KVMLXpI+vEgcxy/i7/1Xd7T
741MATDXIXywRP5dgr3Ubit1ITl6g8HgHqdzD+YtIbywxBIkBti24UrTUdN8VYkK8LvtzQQI
A134DQiMi/vB/wRPgKGtLTM/e4ZfyzUBja6Ol+yFgSt6i1gzwhGhcfhJWrbW3bVnpnYFifPK
QRv7um3w50A+O/p2Tvq/dGvAtlYjrTu+Ub0ueWRkuurSIVQR5MOCRR690iGUbVusJUxSv0m+
Sqq1spwLBAgRkVhA4Sa3dQk5Mxl1b8i3KfCNDPkLJomXrWzNLw4FCJdKY4q37/7tTAcE7yCI
TQ/+wYgLcyWAfQ9GDrvJdjd4iJHT63YJGQ2N2LcSWrEJGOspJP4Q3LPYT+AZJVkED50Wb3js
hLcJOItURfCJGlR4LAvwE/z/r/qhdhbuAZ6J37yMDbrittHNcMHhD0sMUoAAFwVaZID/Xr3v
QZg9HzIcCVAIDt3s/WE5QBCDpIhsJA/+aLjR2UhDCkh/eUMTg/QSx5ar/hGDeLF1bFPQvdbA
EChaEgkQGvBIWB70TAuFEjHyDpLLyHirhWMoK8iSESuNSBSDMPCJAkhczKptNd6vDS87BSI1
JRRAo9OvljqJDUypsqLzM8usiTVkvSsFbBRmL2hXjTyCw7TxySwbSBd28BdqhZe6o0k0fQ6D
q9Pug+0DHLei/9frECYZ9yu6UFvT6Ob4oWkX3gDwi9g753MZi0vhOyO4RYtvKyP+C89gNRQ7
8v1u15oYcucHdXmL2jvYJhXc3TYTBevmGXVZJHMRg+xcARoshRM36+3m7B3yJg0bL+4Hm9uG
DghAsHuF23QURm5b0fZBYVlbEOJDqDj/697PqFRAq4kdpRSLFkTfSm36x0oti4yQxGxnD/si
kESIN4sScBFVXzAQrd3NDkQL1otCZYJvC3UXi5GGtdP/VrgcW4v+IzkL13Tpi5eHNatQymNc
WE3BGnQbdkxXzipmu63+3WogZF+FyXwF0eFHX4sgVPmCu7puQworf/F7wf4EbgVNt20/fvhe
AoQNpE2DVCRhIH0rEdvSUgVROJzT8+xb4Lj7I1yIRIkD/g916p7saLGB9CEL6zEXK5UVXLvF
oTIhGSk2mJNzFIIshSIKwJteLmJ6BOyVr3oIJZ7bXJCElDSpFANIrW1CDKUiwmSpdLMsBv4L
fSnEmcY212gLMBFiv7DObrtkl4wJOwqPCXyu6y/vQ3rAKA2NTrYJewSxXI90sbytFr7uCTdq
W7pRi9yOCokD/LLDb3uXeXXwA9EiARIy/J/o8dttiw4hjXkPPnUaOx3yQSNSV2xLO6QGSG/k
gmsR0o1CBAi4IvOkAg2InaaFUhtddZVNUHLrkJqlUJCcV5csHMyg0Ko7bIicg1+wGMA9CmjE
v22hmekIRTD4gTNSscWR/IlGXCpqF/TgqzxosvoMpH8wGQx1FP92EFf8cWstba3rfE4kxYl+
ylSLLUoFYkHno9as2LRfN+mJ0dpi43HIQb/bxVhVo9lP4EPDN2UlKsbWWvswgmhbQxfbQAgC
BNpKHvuFwUM+263n33kMixCAAFaTyUF30SdCBUvbd/WXAHBg+nc8jUd3SPKDbitHg4h+9Hj8
BoFoBvPHQPzwQg4j1Oe+UdYEx4DoEBQFd8ENPiBI8JZ2x2BPDAV1rTBF1yYmibeXrb2sjUoM
CI9BZJ5EQrye77rxD+OKRkOKyAuEwHqITkN1BwXG+AMJeAS6LMtoftGwWgFq2LQ4coE0e2gY
oSyLDSi4iRXvPr26Uhdo5AteVqwzVluAk62AIAT9HRvWEI/iVmNcJBnV+2kj7M6lAlijQ3DQ
3fafJJMcSQWhSLY9qkdlqwhYvTyb4DMjQ5OUOV0YzbaCuxmhWCp4jVMsLdEPsEEgEOAIQIAY
iNtTtTcoJOBWdGPQAAq0GnLr1e5FvJ4DJPw+wIv0FkCjSh83wqBEhw7rC0iNbQk2msiDvP/C
KUnnkrXZ4FZfHFVSEaSrWkEUzysg4SyY+I1lzHsmDUjyEKgRBdlDtqlRBYAA78MG7IJbEYSI
cHUcstAN2oKfDoxFasWqAmyFIwd2N8HwDLKNinBp69tcAm1FgDVk+XUz2ZohmiJIpwlWlsub
+tK4wGI5MHRyMEKUpsERcAqTHNzbxwhAJChAY1m/gIICj5W2h+jGUPOrqrhp6sfNhA+G7xV9
7ma7xE9t/03vihGE0gyuebZB/wbE3i8wO8IPh5Mlx1oMS23Z7lJIk1Jxv7D7pdgEqo2e0JGA
O3vLdCyKUbRRxYgBsDT6fbt3tJR3QvyKkrggCJBGQIGBhb8TdvVBQYA5GNT5yPFSsHgIKgRy
wa+H94TYqXxJUKOsC1bdZqnKMcS/cA+lbaqr3d+ju6XrVUB5/0xIreLMYGdCoQiuLNbKRVpw
OSzWXnvZVOsG+gvCTV/B/TarAOsNOR0wCpsw/VSZunYERiYwA7uj4bWGMechVf4gjfAgW0sw
/yU4av1jiciFFBheD7cGHFsWGUktpPbU397idCJRBHQXBA10DEh0A+1sBdpouAQ1BRIL3AZ1
nggR8FmqN0KwE2yqtBejxTlS9b3cw19kFAWMCCWi7BHnCv++AAYWzb6HiIQF7H3/BVf5gsZy
9IpF8saFDSAJYOsC9TdTp1XQoQs0aAomtXcdGh6Ae6y8KkG4IACXvyOg0ITe3apCQopC/3ZA
LwBe0F9b8uz6CHf2GoM1jXpQEmdsQp2bOCP97GaTfR1WHlY0I0uRM8WVjPxoOyd/TUsBXlyC
jXJmixHN30/49sIBdBb6EIqUBWSIkIDryJ2TtxwaAnQQIFtDo/E28qAcgTwA2G6YcL/rSRUl
QXIZBFolGh3WqkvIJX2Tl7exiEkfHWFyE3p3Duhu2Jsg6SDr4ExKvl7JRv3xkIYSakZD51nM
ORLNoJJKNF9I0VX9QmgEaYVkdegiGjVnmgP49jVUDoYkoyl0+vfD79noEGjUB6M41NajPAZx
6AZeoQt5Fv/QqKzrPbu8oTwQBVMRixgDI8QzMIxNBetyqgTi+MzM36hZ38jnBMBYuFk8B9AA
gdh1E/wDIFnfQg6AvKhZqFlN1w0WP58GjAOEfIhN0zR0bGRcWT5zCBDfqFnwwEACsekDzOBZ
30fIQw5AW/BaSMSu+51aLJBYC3gDoFohkFcI30BbbrBQyEBbW/R/TdMsu/wDBFsMFBwkIUAg
Njdb39h03QkfUAVYA2h8WwktAIHfNEWTIeQQaRzkQm+64D1gdnVGV1cxW1PJQi1Wah43bCe0
/LbAHSPrIlM5V+migyxoIgE7YA00oT85fRR+EC9itR56N6JZuBShHVUdCwi92BYctE9IfEY2
NE5NIdN9ICw0a5Mgcy5OJG/AyYAgixjkO99CO8BthZw2vgQbUqEPbRfEQdw66xNLtzbWDv8m
EYs4Z9x0ydqsoWat3GEhV95ZzHX0TewapWxttiX+l3F12Dv3dDL2RQ0YQD4czW6G2niyItV/
Htohs7WRMkjSj40oFYTkyDDkF7Idc7M23Ild4BcrkGQSlbJ9c6ese990tFZk5Gd0nI+zt1mL
dnUEAz2MKGggB8S+B5TVWL9chFIuAP8IcVLNS0WoCItEVqFeaG3U/+c4f16L8UluqW6hBfMM
XgArHlsMBG6DwsOPPDTUSL0ykB5Tq3Zs6HRfdSF6i9CewMG7f3+KCoD5QXwEWn8FgKCjdfyt
aBp16utnVmRTAJiJEi5GYr03LLWDWxQrxCBhOFe7rWIYKagqLFdQJrnEKydZSF8ggZoB6u4N
thhPUPAoNwxAQ1FhhyoAAJb/Lf7/MAd3LGEO7rpRCZkZxG0HEWpwNaVj6aOV/////2SeMojb
DqS43Hke6dXgiNnSlytMtgm9fLF+By2455Ed/v///7+QZBC3HfIgsGpIcbnz3kG+hH3U2hrr
5N1tUbXU9Mf///8FkYNWmGwTwKhrZHr5Yv3syWWKT1wBFNlsBv8b/P9jYz0P+vUNCI3IIG47
XmlM5EFg1XJxZ6L/////0eQDPEfUBEv9hQ3Sa7UKpfqotTVsmLJC1sm720D5vKz/////42zY
MnVc30XPDdbcWT3Rq6ww2SY6AN5RgFHXyBZh0L//////tfS0ISPEs1aZlbrPD6W9uJ64AigI
iAVfstkMxiTpC7H/////h3xvLxFMaFirHWHBPS1mtpBB3HYGcdsBvCDSmCoQ1e//////iYWx
cR+1tgal5L+fM9S46KLJB3g0+QAPjqgJlhiYDuH/////uw1qfy09bQiXbGSRAVxj5vRRa2ti
YWwc2DBlhU4AYvL/////7ZUGbHulARvB9AiCV8QP9cbZsGVQ6bcS6ri+i3yIufxf+P//3x3d
Ykkt2hXzfNOMZUzU+1hhsk3OLDp0ALz///b/o+Iwu9RBpd9K15XYYcTRpPv01tNq6WlD/Nlu
NP////9GiGet0Lhg2nMtBETlHQMzX0wKqsl8Dd08cQVQqkECJ/////8QEAu+hiAMySW1aFez
hW8gCdRmuZ/kYc4O+d5emMnZKf////8imNCwtKjXxxc9s1mBDbQuO1y9t61susAgg7jttrO/
mv////8M4rYDmtKxdDlH1eqvd9KdFSbbBIMW3HMSC2PjhDtklP////8+am0NqFpqegvPDuSd
/wmTJ64ACrGeB31Ekw/w0qMIh/////9o8gEe/sIGaV1XYvfLZ2WAcTZsGecGa252G9T+4CvT
if////9aetoQzErdZ2/fufn5776OQ763F9WOsGDoo9bWfpPRof/////Ewtg4UvLfT/Fnu9Fn
V7ym3Qa1P0s2skjaKw3YTBsKr//////2SgM2YHoEQcPvYN9V32eo745uMXm+aUaMs2HLGoNm
vP////+g0m8lNuJoUpV3DMwDRwu7uRYCIi8mBVW+O7rFKAu9sv////+SWrQrBGqzXKf/18Ix
z9C1i57ZLB2u3luwwmSbJvJj7P////+co2p1CpNtAqkGCZw/Ng7rhWcHchNXAAWCSr+VFHq4
4v////+uK7F7OBu2DJuO0pINvtXlt+/cfCHf2wvU0tOGQuLU8cb////4s91oboPaH80WvoFb
Jrn24Xewb3dHtxjmWn2N////cGoP/8o7BmZcCwER/55lj2muYvjT/2thxP////9sFnjiCqDu
0g3XVIMETsKzAzlhJmen9xZg0E1HaUnbd/9L/P9uPkpq0a7cWtbZZgvfQILYN1OuvKnFnrv/
////3n/Pskfp/7UwHPK9vYrCusowk7NTpqO0JAU20LqTBtf9////zSlX3lS/Z9kjLnpms7hK
YcQCG2hdlCtvKje+C7ShJzb6G17DG98FWo3vLUsW8P//QUJDREVGR0hJSktMTU5PUFFSU1Tb
/////1hZWmFiY2RlZmdoaWprbG1ub3BxcnN0dXZ3eHl6MDESm+7/MjM0NTY3ODkrLwAA/7s7
2Vvx/93PA3J1bnRpbWUgZXJyb3K/VEf1rMRMT7cNDQrEsvYDdklORw4ARE9NQRIRsbzd/lI2
MDI4CC0gR2FibHT7dqm9zmluaVJmaXoNaGVhcDdb2843JzeZdD0EdS1022+oIHNwYWMjZnds
f2nkstuAOGEGb243Np+B5ClzdGQ1cHVba4W3cit2aXILITOlY8gX234jIGMMbChfNF7bblNf
KmV4XC9YBhZ2stfc4l8xOfcK7uYWcmVYMXNvD4prkwHbc2MrOEYkBkKEW4FlZBlX2+0h+SM3
bXVsrHRov2GFMJJvL2xvY2sXa24bbDRkt2EuAqLat4ZbIXJtAHBAZ3JhbSDshVDYSm02LzA5
T41maCkQQSonU8jnGiwuKzhh9jyE73JndShzXzAyZsEutm27bm5ngm8FdDoRQiuctWTmf00t
YDlg/MPbZhVWaXOqQysrIFKch7nv9kxpYrRyeScKLRZFa5xtDw4hEVDUOr4Ac23YZS4APOXg
JSyxJExta2ydQ9j4bvn/WVNdA0dldExhRkF7LxToFnb8wnVwABMPgW9tO1epZDqbZXNzYSfx
hQV4Qm94QHM5MzIuZMbc8qw+R6VcqQNTXaCiMGcDAC6nsg+vV0AjCIv4immaptkD4NC4rKCm
aZqmlHxoWEDNsmmaIBQI8InYxDRN0zSomIBsVNM0TdM8LCgkIE3TNE0cGBQQDAh0btM0BAD8
iI8D9NM0TdPw7Ojk4E3TNE3c2NTQzMQ0TdM0wLiwqKDTNE3TnJSMhHxN0zRNdGxkXFRMNE3T
NEQ8NCwopnNf0yAMj4eLA+Sapmma3NTMwLy4aZqmabConJSIpGuapoR8dGhDYGmaphtYA1BA
OCymaZqmKCAUDARN0zTL/Ib07OTc0DRN0zTIwLiwqNM0XdOgmC+QiIBN0zRNXFBIQDgw5huk
O/98lOeGmmbZdAMI/IXo0LhpmqZpsKiYhGyyaZqmWEQsFPyETdM0zdjArIx8cDZN0zRkXFA8
IITTdKbpZ4SDA9S8TdM0TaiYkIh4cKbrzjZkg8tUB0ADLKi/bJogBPCCc29tZXRoK9RG7bNp
c9pv8hOxVN8LZ28Idxlnj/1G/e95b3X9ZSBiYWQLdHJ5+mHfVRdzdGVhbB9mZWVssEZtpZ4k
c5sT3srtWx5ybiBtRGV5GmF0c+7298FSd2h5Pzd0YWsvaXQnte92qnMDYnBsCGRbsa461j4/
Mydz3G4fc21sa10hLGRjA04TwK1UC31dZHVo1MAOBxfYm21fAUQsZh9tPli19muVKT9hYmmB
AFy1jbIAQWZNAwludkgXhhsh2tiyfxt0dWZmLddner23PdcvJXN9ZRePI7wJ7utBC0pPZYYT
GrZzpnJIbBNpkFULzm3vZKYgCGNO1QXsco0Dcv8V7C/2QIUzaG9wXdd2F16ACHVlnGtpVe/s
wu66J2nuIG9mViHby+aSJWMWU0lhXLjQvXa6bnCfc3cZZCEL9k5iC2G9WC9NOMTcVtZjCCM4
g/uXd2GUVC7cIb3usWQnWGFjY+x0K3vLvoQX3z8TziPRNb7DQ2ttfgpvrbB9zi+C9W0uZOOJ
ZO8we2AnHvrraG32zC4vFPKY7Yf31xITaSdtpa4Ab2sP3V6hvXdwNJVYOGFuEdqEzXgEeSIg
oyOPPPYucGlmB2NvbXNjcmVvyQLOeGWXCyNuIwX7m+5vI3QFZXMjayN5Iy0H8o+tmxOxbatj
/3BhcnRzb5AuD28y76wH21wJ+G9iakDHkGuvpq+V2icY6tdsQhIQqW16Zg+QUxnlU4zEY2pv
+zHD3cJpBWTfZWJzs1N4AKUYn8iAY1LDxXUHPrANfBvvEXAjXndncAiN1ni7aWlW9HWGbdJw
bx93gZBwjmc2Ykp3YgeK1raACA87YzAnz9a2gGx0ozsAb8m9SYdzcz/jFZQJhsOGEWitA6Kv
0Rq2O3LadM99I+HtegCvN2xrQ9t4Q2ibE3ND4xOzrjYJFc9bAN9XISFtcG7bA5dmj2U8e/ui
7ENzBDBTT8+PgMMztCrjcOuRT46V0gdzaGRieH5vcuR0YmJhZKQXd2Ex2shBc3B1d0vyscLJ
cnR2lwdodG1sOOvOCGtsA2gzdM/dm1E/ZyIHW10tQNdsm38LXy1cL3o6A3l4B03TNE13dnV0
c3I0TdM0cXBvbm3TNE3TbGtqaWhO0zRNZ2ZlZGN2TXoX420y1ATfeCD0p1jAAxkGcmZjICFZ
hM0eJGxzF1NZC4hAoPUSGq4LLacKIGzJZtHolhWXFXcuhGMWsKy+Ef5qjmVb131zSXMbosKy
pQeTvWMw7jXaRtd4u3nOILFHRot07xkVVy2DbAiWFYIMQ+ymIN0LaWnMWG8uNxcj2G7uZXED
IC2kaazYWmNXfXB1iL/cM4ajOU4EY/f8qWgMhtXAczRyD3k1GrPDtUcy/XO0gS0IX4q32T+u
yV9Xr3D8anBnc+yLuRrooV8ab1S1s80RwlR4DHD03YG9qfKccCA5IFRzInA7aLOmcPhyEOBd
X2LC7BlotKtiVXf22wE7eHCOMjHwNS4xMDAD+KfuwACCv3boVURQACUcawU6piX6BgUuMnvP
JWtTBBQGAyu3DMRHkCtPqQBOL93Y9W92AE8fU2W+QXU2SnVshVbYcAP2TZMPzC1ba3MHA0aP
E2FT2rZordcLtrNoRFdzW4FWeQfyH28XL23vTYFJUVVJVAcDLmlbjvwGAC0tACJDJnT9at02
sC1UF3NmsC1Fbpf90dGYJDolZTY0IkRqj64CWXhpgRxzv9fLbjsglvI9IlNQUHAlJnlVJkof
Cx/D98UveC16WS3XcmSz1lyLpjg0MzW01hhdGBeTZZG9tqwqL5O/Ny+27YEhLzphvDuka2wL
t3JidD22LcVjmOgQhHYiN2LUFzgQIYsTV3Ed+DY+Ti/KeLZi7gjbHzOE701JTUUtVk9zFm7g
WvcxLjA/RLw3dFN1Q2zhRQqTbwanIoJ9BQAJMADbfyJ+P0FUQTdDUFQgVE8ePP0lwm0LPhBV
TCBGUk9NK/gH7hEAx0hFTE/Tzj0+w0wTXLPtyfx/f2MHZRMqLiobU09GVFdBUkVcYwWHQEhc
vXNiMLfFXEMpcuK4XAxMjgU9U7Bh1xlYomN57W3hS28ybWzcIGt5QYtF7Wxq3/j/R5tDTFNJ
RFx7RTZGQjVFMjAZRTM1LSW22/8xMUNGLTlDODct1EFBAzXIxFsg/jdFRH1cSW6KY11mzTcM
JJthc2ttc25eyy3Co3tJORvDmr0frgv7ZdCCgRiIOK9kEXoijsliZX5le7brexAX22vTdEAG
H/tYa747QWRtUw9Ka2xTIQMGbLkzvwG6wTZ44D0xAhcWAwKaZrBBAwcEGAVpmqZpDQYJBwzB
BhmkCAkKG/a9F5ALVzsHD1eCdIMNEBMRAxKQwQb5FyE1D0HBBhtkQ1AzUhcGG2ywUwdXX1l7
bKZpusEXbasgcBwG+16QcscvgLOBG2SwwQeCH4OEjxmkaQaRKZ6hbJDBBqRvp7efchAGG84f
1wsYB9l7rmqJA5UBAyCTHCggSAwgE8kAEIQQgQzIhIEBDMiADBCCApmbDIEQvwBp0l1VAQcu
XwzSDfbACxcdCwSWyCDNgI0IjgzIgAyPkJGADMiAkpOyUQzSA68KN4wkLwtvDKMABZMZ6Vrw
Y9M0aIMHCM80y6YJ3GcKuDeapmm6jAcRXBI4EzTLpmkMGNRmGazTNE3TGnQbPBxl0zRNFHgE
efRlE5Vpmnrk/AbYh9e9Rw/4wEMCBNLPDvbdpA9ggnmCIa+m3wehpc3z7yeBn+D8L0B+gPyo
wXL2COOj2qOPgf4HQIMMgQ21L0G2XyH/d1/PouSiGgDlouiiW36h/rLf7j5RBQPaXtpfX9pq
2jIvqWiXv9PY3uD5MX45g1gAKgoAKioJQQFUIKsCqEBGBVCBjAqgAhkUQAUybIaobAPEGFCx
TRSwASBDUAfHWlRtBkkxClN0KSpHVJlIolqGrFcPQU0jqv+bWUJ5dGVUb1dpZGVDvrZQAVsU
SARSHYBti6o1YwxW+4NFqKMNUnRsVW53P7Xfe2xkSk9FTW8vQ3IENXb7rEULRGVzY295IkY9
2GtEEGt6ZEhhqs5KtztsDVMKQ0UBY6ZCHUULYc+SzaNzVxcWtmRtWKy2wRRGFNUI24NlRFFA
t90BQWRkIXM9TO4sCmjhvEEN2YXN2kNNsywNV/2kqGIvWUYY2FZU7UQWVW8+rTDswkMYc2XW
Nllt7Rd78uAIUG8xm3Jw5qrKsWsabDBPws0eB25BIFNpeorq7E1CDxlT6vbN/gNUaW16CFrZ
ZUlte8uoChfMY6Df+7pnJV9sQmQHY5QIby/Z9gp6JgdixQv45G8Iz4pjcHlNb2RrbztWTIBO
YU5BPh8yDINtbmuaRnmYRgEKVJ3F8gpO8risdcsZ+3JRwkTOboPcanZlUxRlcLFhDHtF1SMM
8zB+byvDA3gx5WNrEmwgtEZGMQ+eNJyEw2khdGGecLXuNjsREDltbR9MidusqIIhuQtF4REh
xnhpA/+kAAo44QUKF1Sql+yke2UmUGxj2YEAOfxof4M7bG1kTxBwg5/fmqUhjHxBntEA2oK7
cRtnU5F7dSgWewT37Q9IS2V5DO+zt2xsH0EQHg6yWXqGT8oM8d50UULhwnZOAndJa1AJsyrg
NNtzGusYs9sYkB0BsXCOdGahfTxd9iAkSZduPTa1VwUcbm7btdk2y83/IwIBLP9zAgRlWZZl
EBYTDwyWZVmWCTcLNBcUs5ZlWRURbwOl/0P+y1BFTAEEAFn0MEDgAA8CCwECOKDq9w4KAwDk
OthZ905WgA0qEA8EM7lj3ywHHwEMA9ubSzaw7w8kEAcGN4HLsxwoaYxwYA1qhdwGAmAefAEX
bNdxLsZ0B5ROkOcg2FzYBEUgLnK692wOAiMOYBQnVG6x7kJAAi4mJ9zibUoGaYB0wE8bm32l
c8VKDfN7lE8A/34rGzBrDZJ0AQAAAAAAAACABP8AAAAAAAAAAAAAAGC+FVBBAI2+67/+/1eD
zf/rEJCQkJCQkIoGRogHRwHbdQeLHoPu/BHbcu24AQAAAAHbdQeLHoPu/BHbEcAB23PvdQmL
HoPu/BHbc+QxyYPoA3INweAIigZGg/D/dHSJxQHbdQeLHoPu/BHbEckB23UHix6D7vwR2xHJ
dSBBAdt1B4seg+78EdsRyQHbc+91CYseg+78Edtz5IPBAoH9APP//4PRAY0UL4P9/HYPigJC
iAdHSXX36WP///+QiwKDwgSJB4PHBIPpBHfxAc/pTP///16J97lEAQAAigdHLOg8AXf3gD8F
dfKLB4pfBGbB6AjBwBCGxCn4gOvoAfCJB4PHBYnY4tmNvgBwAQCLBwnAdEWLXwSNhDBknQEA
AfNQg8cI/5bwnQEAlYoHRwjAdNyJ+XkHD7cHR1BHuVdI8q5V/5b0nQEACcB0B4kDg8ME69j/
lvidAQBh6beo/v8AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAgADAAAAIAAAgA4AAABgAACAAAAAAAAAAAAAAAAAAAABAAEAAAA4AACAAAAAAAAA
AAAAAAAAAAABAAcEAABQAAAApKABAKgMAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAQBlAAAA
eAAAgAAAAAAAAAAAAAAAAAAAAQAHBAAAkAAAAFCtAQAUAAAAAAAAAAAAAACgcAEAKAAAACAA
AABAAAAAAQAYAAAAAACADAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgMDAwMDAwMDAwMDAwMDAwMDAwMDA
wMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////////////////////////////////////
/////////////////////////////////////////////8DAwAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAICAgP//////////////////////////////////////////////////////////
/////////////////////////////8DAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//
////////////////////////////////////////////////////////////////////////
/////////////8DAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////////////
/////////////////////////////////8DAwMDAwMDAwMDAwMDAwMDAwP///////////8DA
wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////////////////////////////
/////////////////////////////////////////////////////8DAwAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAICAgP///////////8DAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDA
wMDAwMDAwMDAwMDAwMDAwMDAwP///////////8DAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AICAgP//////////////////////////////////////////////////////////////////
/////////////////////8DAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////
/8DAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwP//////
/////8DAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////////////////////
/////////////////////////////////////////////////////////////8DAwAAAAP8A
AAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAMDAwMDAwMDA
wMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwP///////////8DAwAAAAAAAAP8AAAAAAP8AAAAA
AP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAP//////////////////////////
/////////////////////////////8DAwAAAAP8AAAAAAP//////////////////////////
//////////////////////8AAAAAAMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDA
wP///////////8DAwAAAAAAAAP8AAP////8AAAAAAP8AAP////8AAAAAAP8AAAAAAP//////
/////wAAAP8AAP///////////////////////////////////////////////////////8DA
wAAAAP8AAAAAAP///wAAAP8AAAAAAP///wAAAP8AAAAAAP8AAMDAwP////////8AAAAAAMDA
wMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwP///////////8DAwAAAAAAAAP8AAP//
//8AAAAAAP8AAP////8AAAAAAP8AAAAAAICAgP///////wAAAP8AAP//////////////////
/////////////////////////////////////8DAwAAAAP8AAAAAAP///wAAAP8AAAAAAP8A
AAAAAP8AAAAAAP8AAAAAAP////////8AAAAAAMDAwMDAwP///////8DAwMDAwMDAwMDAwMDA
wMDAwMDAwP///////////8DAwAAAAAAAAP8AAP////8AAAAAAP8AAAAAAP8AAAAAAP8AAAAA
AP8AAMDAwP///wAAAP8AAP///////////////8DAwMDAwMDAwP///8DAwMDAwMDAwP//////
/////8DAwAAAAP8AAAAAAP///wAAAP8AAP////8AAAAAAP8AAP////8AAAAAAICAgP////8A
AAAAAMDAwMDAwP///////8DAwMDAwP///////////////8DAwP///////////8DAwAAAAAAA
AP8AAAAAAP8AAAAAAP///wAAAP8AAAAAAP///wAAAP8AAAAAAP///wAAAP8AAP//////////
/////8DAwMDAwMDAwP///////8DAwMDAwP///////////8DAwAAAAP8AAAAAAP8AAAAAAP8A
AP////8AAAAAAP8AAP////8AAAAAAP8AAAAAAP8AAAAAAMDAwMDAwP///////8DAwP//////
/////////////8DAwP///////////8DAwAAAAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAA
AP8AAAAAAP8AAAAAAP8AAAAAAP8AAP///////////////8DAwMDAwP///////4CAgAAAAAAA
AAAAAAAAAAAAAAAAAAAAAP8AAAAAAP//////////////////////////////////////////
//////8AAAAAAP///////////////8DAwMDAwMDAwMDAwICAgP///////////8DAwICAgAAA
AAAAAAAAAP8AAP///////////////////////////////////////////////wAAAP8AAP//
/////////////8DAwMDAwMDAwMDAwICAgP///////8DAwICAgAAAAAAAAAAAAP8AAAAAAP8A
AAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP//////////////////
/////////////4CAgP///8DAwICAgAAAAAAAAAAAAAAAAAAAAP8AAAAAAP8AAAAAAP8AAAAA
AP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAP///////////////////////////////4CA
gMDAwICAgAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////
/////////////////////////////////////////////////////4CAgICAgAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////////////////////
/////////////////////////////////////4CAgAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICA
gICAgICAgICAgICAgICAgICAgAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAP/////+AAAA/gAAAP4AAAD+AAAA/gAAAP4A
AAD+AAAA/gAAAP4AAAD+AAAA/gAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAEAAAADAAAABwAAAA/+AAAf/gAAP/4AAH//////SH0BAAAA
AQABACAgAAABABgAqAwAAAEAAAAAAAAAAAAAAAAAKK4BAPCtAQAAAAAAAAAAAAAAAAA1rgEA
AK4BAAAAAAAAAAAAAAAAAEKuAQAIrgEAAAAAAAAAAAAAAAAAT64BABCuAQAAAAAAAAAAAAAA
AABargEAGK4BAAAAAAAAAAAAAAAAAGauAQAgrgEAAAAAAAAAAAAAAAAAAAAAAAAAAABwrgEA
fq4BAI6uAQAAAAAAnK4BAAAAAACqrgEAAAAAALyuAQAAAAAAyK4BAAAAAAADAACAAAAAAEtF
Uk5FTDMyLkRMTABBRFZBUEkzMi5kbGwAaXBobHBhcGkuZGxsAFVTRVIzMi5kbGwAV0lOSU5F
VC5kbGwAV1MyXzMyLmRsbAAATG9hZExpYnJhcnlBAABHZXRQcm9jQWRkcmVzcwAARXhpdFBy
b2Nlc3MAAABSZWdDbG9zZUtleQAAAEdldE5ldHdvcmtQYXJhbXMAAHdzcHJpbnRmQQAAAElu
dGVybmV0R2V0Q29ubmVjdGVkU3RhdGUAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA=

--70606400--

From brian_osborne at cognia.com  Thu Mar 11 08:16:43 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Thu Mar 11 08:22:26 2004
Subject: [Bioperl-l] protein networks
In-Reply-To: <404C3D57.2050600@ed.ac.uk>
Message-ID: <GAEDKMGOKFBLJPKCLKCCAEHDDIAA.brian_osborne@cognia.com>

Richard,

You may want to reconsider the name "ProteinGraphIO::xml.pm" since other
pathway databases (BIND certainly, KEGG perhaps) also export to XML.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Richard Adams
Sent: Monday, March 08, 2004 4:31 AM
To: bioperl-l@portal.open-bio.org
Subject: [Bioperl-l] protein networks

Hi,
I've been writing some modules to apply Nathan Goodman's
"SimpleGraph.pm" module to protein interaction networks. He has
written this as a replacement for the the broken CPAN graph modules. The
aim of these modules is

1. To convert protein interaction data between differentet formats
2. To be able to link protein interaction data to protein sequence data.
3. To be able to read in interaction data from separate sources, and
make a merged graph with duplicate interactions flagged or removed.


Could some core person give some idea for namespace to put these in CVS?

The modules are :
1. Nathan's modules
SimpleGraph.pm                   - a generic graph  module with methods
to build and represent a graph
SimpleGraph::Traversal.pm - generic methods to navigate a graph.

2. My modules

a. An I/O system analagous to SeqIO that lazyloads the correct modules
for a particular format
     ProteinGraphIO.pm      - with next_graph, write_graph methods
       ProteinGraphIO::dip.pm    - parser for DIP format interactions
         - to read in PSI_XML formatted records
(still being worked on).

b.   Modules to represent a protein network:
    PrimaryProteinGraph.pm - subclass of SimpleGraph.pm. This will be
made from a dip record, analagous to PrimarySeq
    FullProteinGraph .pm       - subclas of SimpleGraph.pm,  contains
PrimaryProteinGraph and metadata of graph, will be made from an XML
file, analagous to RichSeq.

c.  Classes to represent nodes/ edges.
   NodeI.pm    - interface describing methods that all nodes should have
        ProteinNode.pm - implements NodeI, currently has methods
primary_id(), seq(), secondary_id()
         InteractionI.pm    - interface describing methods all edges
should have
        ProteinInteraction.pm - implements InteractionI, currently has
methods weight(), interactors(), id().

Cheers Richard


--
Dr Richard Adams
Psychiatric Genetics Group,
Medical Genetics,
Molecular Medicine Centre,
Western General Hospital,
Crewe Rd West,
Edinburgh UK
EH4 2XU

Tel: 44 131 651 1084
richard.adams@ed.ac.uk


_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From baber at mrtcbko.org  Thu Mar 11 09:46:47 2004
From: baber at mrtcbko.org (Ibrahima BABER)
Date: Thu Mar 11 10:01:49 2004
Subject: [Bioperl-l] how to set up bioperl cygwin
Message-ID: <3FE6D6566E4FD511B54A009027EEADE71BE649@MRTCBKO>

Dear

I have installed Cygwin successfully, and 

I would like to know how to set up bioperl
with cygwin?

Thanks

Ibrahima

Mali - west-Africa


From heikki at ebi.ac.uk  Thu Mar 11 10:06:10 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Thu Mar 11 10:11:47 2004
Subject: [Bioperl-l] Re: [Bioperl-guts-l] [Bug 1600] New:
	$gb->species->ncbi_taxid
In-Reply-To: <40507A3C.6090208@ed.ac.uk>
References: <GAEDKMGOKFBLJPKCLKCCCEHEDIAA.brian_osborne@cognia.com>
	<40507A3C.6090208@ed.ac.uk>
Message-ID: <200403111506.10392.heikki@ebi.ac.uk>

James,

I think you are on the right track here.  Getting the taxid and the higher 
taxa is more important than the lack of binomial name. 

I am moving this discussion from guts into bioperl-l list where it belongs.

Thank your your contribution. Putting a patch into bugzilla as an attachment 
is exactly the right thing to do.

	-Heikki

On Thursday 11 Mar 2004 14:39, James Wasmuth wrote:
> Brian and all at bioperl-guts,
>
>
> below is the comment I've added to the bug[1600].  I think it may need
> some discussion, but the patch I've added works to the extent that it
> allows creation of a Bio::Species object but the subsequent genus,
> species, subspecies calls will be 'wrong'.  Personally I'm more
> concerned with the taxid, which I think will be sufficient.
>
> If you want to see the size of this problem go to NCBI taxonomy and
> enter the term identified as a token set!  I think that maintaining the
> taxid is enough, otherwise the artifical split of terms such as
> **unidentified diatom endosymbiont of Peridinium foliaceum*
> <http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Undef&id=4224
>7&lvl=3&lin=f&keep=1&srchmode=3&unlock>* may be a problem, though some of
> them are intuitive.
>
> One last question, I've never tried to fix a bug before, so I've
> commited a patch as an attachment to Bugzilla for the bug.  Do others
> check this and if okay place it in the code...
> apologies for the newbie bit...
>
> -james
>
>
>
> genbank.pm
>
> line 1123: return unless $genus and  $genus !~ /^(Unknown|None)$/oi;
>
> a number of species are described as Unknown blah blah blah.
>
> The NCBI taxid assigned to unknown taxa is 32644 and has a number of
> synonyms, none of which are 'unknown'.
>
> The list includes: other, unknown organism, not specified, not shown,
> unspecified, Unknown, None, unclassified , unidentified organism
>
> I've changed the _read_GenBank_Species subroutine to allow organism
> names such as 'unknown marine gamma proteobacterium NOR5'.  This will
> create a Bio::Species object, but the genus=unknown species=marine
> subspecies=gamma.
>
> There is a whole host of species names that ignore the nice rules in
> _read_GenBank_Species!  However this fix will allow the correct taxid to
> be provided which I think is more than the name!
>
>
>
> sub _read_GenBank_Species {
>     my( $self,$buffer) = @_;
>     my @organell_names = ("chloroplast", "mitochondr");
>      # only those carrying DNA, apart from the nucleus
>
>     #CHANGE
>      my @unkn_names=("other", 'unknown organism', 'not specified', 'not
> shown', 'Unspecified', 'Unknown', 'None', 'unclassified', 'unidentified
> organism');
>
>     $_ = $$buffer;
>
>     my( $sub_species, $species, $genus, $common, $organelle, @class,
> $ns_name );
>     # upon first entering the loop, we must not read a new line -- the
> SOURCE
>     # line is already in the buffer (HL 05/10/2000)
>     while (defined($_) || defined($_ = $self->_readline())) {
>     # de-HTMLify (links that may be encountered here don't contain
>     # escaped '>', so a simple-minded approach suffices)
>         s/<[^>]+>//g;
>     if (/^SOURCE\s+(.*)/o) {
>         # FIXME this is probably mostly wrong (e.g., it yields things like
>         # Homo sapiens adult placenta cDNA to mRNA
>         # which is certainly not what you want)
>         $common = $1;
>         $common =~ s/\.$//; # remove trailing dot
>     } elsif (/^\s{2}ORGANISM/o) {
>         my @spflds = split(' ', $_);
>             ($ns_name) = $_ =~ /\w+\s+(.*)/o;
>         shift(@spflds); # ORGANISM
>
>          if(grep { $_ =~ /^$spflds[0]/i; } @organell_names) {
>         $organelle = shift(@spflds);
>         }
>             $genus = shift(@spflds);
>         if(@spflds) {
>         $species = shift(@spflds);
>         } elsif ( grep { $genus } @unkn_names){
>         $species = '';
>         } else {$species='sp.';}      #there's no species name but it
> isn't unclassified
>         $sub_species = shift(@spflds) if(@spflds);
>         } elsif (/^\s+(.+)/o) {
>         # only split on ';' or '.' so that
>         # classification that is 2 words will
>         # still get matched
>         # use map to remove trailing/leading spaces
>             push(@class, map { s/^\s+//; s/\s+$//; $_; } split /[;\.]+/,
> $1);
>         } else {
>             last;
>         }
>
>         $_ = undef; # Empty $_ to trigger read of next line
>     }
>
>      $$buffer = $_;
>
>      # Don't make a species object if it's empty or "Unknown" or "None"
>     my $unkn = grep { $_ =~ /^$genus$species/i; } @unkn_names;
>
>      return unless $genus and  $unkn==0;
>
>      # Bio::Species array needs array in Species -> Kingdom direction
>     if ($class[0] eq 'Viruses') {
>         push( @class, $ns_name );
>     }
>     elsif ($class[$#class] eq $genus) {
>         push( @class, $species );
>     } else {
>         push( @class, $genus, $species );
>     }
>     @class = reverse @class;
>
>     my $make = Bio::Species->new();
>     $make->classification( \@class, "FORCE" ); # no name validation please
>     $make->common_name( $common      ) if $common;
>     unless ($class[-1] eq 'Viruses') {
>         $make->sub_species( $sub_species ) if $sub_species;
>     }
>     $make->organelle($organelle) if $organelle;
>     return $make;
> }
>
> Brian Osborne wrote:
> >James,
> >
> >Your guess is right, no Species is made because of the name. That's
> > because genbank.pm normally looks at:
> >
> >ORGANISM Bos taurus
> >
> >And makes "Bos" the genus, and so on.
> >
> >If it sees:
> >
> >ORGANISM Unknown
> >
> >It refuses to make a Species object, and it's interpreting your ORGANISM
> >line in the same way because it can't make a valid genus, that's the
> > current rule. Personally I'd say that I agree with its principle - how
> > can we make a Species object without genus and species?
> >
> >You can get the taxid from a SeqFeature object, you already knew that.
> >
> >Brian O.
> >
> >
> >-----Original Message-----
> >From: bioperl-guts-l-bounces@portal.open-bio.org
> >[mailto:bioperl-guts-l-bounces@portal.open-bio.org]On Behalf Of
> >bugzilla-daemon@portal.open-bio.org
> >Sent: Thursday, March 11, 2004 4:21 AM
> >To: bioperl-guts-l@bioperl.org
> >Subject: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid
> >
> >http://bugzilla.bioperl.org/show_bug.cgi?id=1600
> >
> >           Summary: $gb->species->ncbi_taxid
> >           Product: Bioperl
> >           Version: unspecified
> >          Platform: PC
> >        OS/Version: Linux
> >            Status: NEW
> >          Severity: normal
> >          Priority: P2
> >         Component: Bio::SeqIO
> >        AssignedTo: bioperl-guts-l@bioperl.org
> >        ReportedBy: james.wasmuth@ed.ac.uk
> >
> >
> >I've included a genbank file for which I have been unable to extract the
> >ncbi_taxid for using
> >
> >$gb->species->ncbi_taxid
> >
> >the error is:
> >Can't call method "ncbi_taxid" on an undefined value
> >
> >infact I don't get a Bio::Species object.  I'm sure its because of the
> > name, which is correct.
> >
> >I've tried looking into it, but could not find which Seq object creates
> > the Bio::Species object.
> >
> >
> >
> >LOCUS       AY007676                1389 bp    DNA     linear   BCT
> >29-OCT-2001
> >DEFINITION  Unknown marine gamma proteobacterium NOR5 16S ribosomal RNA,
> >            partial sequence.
> >ACCESSION   AY007676
> >VERSION     AY007676.1  GI:12000362
> >KEYWORDS    .
> >SOURCE      unknown marine gamma proteobacterium NOR5
> >  ORGANISM  unknown marine gamma proteobacterium NOR5
> >            Bacteria; Proteobacteria; Gammaproteobacteria.
> >REFERENCE   1  (bases 1 to 1389)
> >  AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Glockner,F.O., Gerdts,G.
> >and
> >            Amann,R.
> >  TITLE     Isolation of novel pelagic bacteria from the German bight and
> >their
> >            seasonal contributions to surface picoplankton
> >  JOURNAL   Appl. Environ. Microbiol. 67 (11), 5134-5142 (2001)
> >  MEDLINE   21536174
> >   PUBMED   11679337
> >REFERENCE   2  (bases 1 to 1389)
> >  AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
> > Gerdts,G., Schuett,C. and Amann,R.
> >  TITLE     Identification and seasonal dominance of culturable marine
> >bacteria
> >  JOURNAL   Unpublished
> >REFERENCE   3  (bases 1 to 1389)
> >  AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
> > Gerdts,G., Schuett,C. and Amann,R.
> >  TITLE     Direct Submission
> >  JOURNAL   Submitted (29-AUG-2000) Molecular Ecology,
> > Max-Planck-Institute, Celsiusstrasse 1, Bremen 28359, Germany
> >FEATURES             Location/Qualifiers
> >     source          1..1389
> >                     /organism="unknown marine gamma proteobacterium NOR5"
> >                     /mol_type="genomic DNA"
> >                     /db_xref="taxon:145658"
> >     rRNA            <1..>1389
> >                     /product="16S ribosomal RNA"
> >BASE COUNT      343 a    319 c    453 g    274 t
> >ORIGIN
> >        1 cgcgaaagta cttcggtatg agtagagcgg cggacgggtg agtaacgcgt
> > aggaatctat 61 ccagtagtgg gggacaactc ggggaaactc gagctaatac cgcatacgtc
> > ctaagggaga 121 aagcggggga tcttcggacc tcgcgctatt ggaggagcct gcgttggatt
> > agctagttgg 181 tggggtaaag gcctaccaag gcgacgatcc atagctggtc tgagaggatg
> > atcagccaca 241 ccgggactga gacacggccc ggactcctac gggaggcagc agtggggaat
> > attgcgcaat 301 gggcgaaagc ctgacgcagc catgccgcgt gtgtgaagaa ggccttcggg
> > ttgtaaagca 361 ctttcaattg ggaagaaagg ttagtagtta ataactgcta gctgtgacat
> > tacctttaga 421 agaagcaccg gctaactccg tgccagcagc cgcggtaata cggaggtgcg
> > agcgttaatc 481 ggaattactg ggcgtaaagc gcgcgtaggc ggtctgttaa gtcggatgtg
> > aaagccccgg 541 gctcaacctg ggaattgcac ccgatactgg ccgactggag tgcgagagag
> > ggaggtagaa 601 ttccacgtgt agcggtgaaa tgcgtagata tgtggaggaa taccggtggc
> > gaaggcggcc 661 tcctggctcg acactgacgc tgaggtgcga aagcgtgggg agcaaacagg
> > attagatacc 721 ctggtagtcc acgccgtaaa cgatgtctac tagccgttgg gagacttgat
> > ttcttggtgg 781 cgaagttaac gcgataagta gaccgcctgg ggagtacggc cgcaaggtta
> > aaactcaaat 841 gaattgacgg gggcccgcac aagcggtgga gcatgtggtt taattcgatg
> > caacgcgaag 901 aaccttacca ggccttgaca tcctaggaat cctgtagaga tacgggagtg
> > ccttcgggaa 961 tctagtgaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt
> > gggttaagtc 1021 ccgtaacgag cgcaaccctt gtccttagtt gccagcgcgt aatggcggga
> > actctaagga 1081 gactgccggt gacaaaccgg aggaaggtgg ggacgacgtc aagtcatcat
> > ggcccttacg 1141 gcctgggcta cacacgtgct acaatggaac gcacagaggg cagcaaaccc
> > gcgaggggga 1201 gcgaatccca caaaacgttt cgtagtccgg atcggagtct gcaactcgac
> > tccgtgaagt 1261 cggaatcgct agtaatcgtg aatcagaatg tcacggtgaa tacgttcccg
> > ggccttgtac 1321 acaccgcccg tcacaccatg ggagtgggtt gctccagaag tggttagcct
> > aaccttcggg 1381 agggcgatc
> >//
> >
> >
> >
> >------- You are receiving this mail because: -------
> >You are the assignee for the bug, or are watching the assignee.
> >_______________________________________________
> >Bioperl-guts-l mailing list
> >Bioperl-guts-l@portal.open-bio.org
> >http://portal.open-bio.org/mailman/listinfo/bioperl-guts-l

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From brian_osborne at cognia.com  Thu Mar 11 10:16:12 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Thu Mar 11 10:22:19 2004
Subject: [Bioperl-l] how to set up bioperl cygwin
In-Reply-To: <3FE6D6566E4FD511B54A009027EEADE71BE649@MRTCBKO>
Message-ID: <GAEDKMGOKFBLJPKCLKCCEEHHDIAA.brian_osborne@cognia.com>

Ibrahima,

If you've installed Cygwin's Perl as well then you should be ready to
install Bioperl. Download, gunzip, and untar the latest 1.4 package and take
a look at the INSTALL document online
(http://bioperl.org/Core/Latest/INSTALL), it's the most up-to-date. Choose
one of the "easy" methods, either should work, and write back if there are
any problems.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Ibrahima BABER
Sent: Thursday, March 11, 2004 9:47 AM
To: 'gillyfish@hotmail.com'
Cc: 'bioperl-l@bioperl.org'
Subject: [Bioperl-l] how to set up bioperl cygwin

Dear

I have installed Cygwin successfully, and

I would like to know how to set up bioperl
with cygwin?

Thanks

Ibrahima

Mali - west-Africa


_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From brian_osborne at cognia.com  Thu Mar 11 10:17:17 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Thu Mar 11 10:23:06 2004
Subject: [Bioperl-l] RE: [Bioperl-guts-l] [Bug 1600] New:
	$gb->species->ncbi_taxid
In-Reply-To: <40507A3C.6090208@ed.ac.uk>
Message-ID: <GAEDKMGOKFBLJPKCLKCCIEHHDIAA.brian_osborne@cognia.com>

James,

Right, some discussion is called for. What James is saying is that Genbank
entries with ORGANISM lines like "Unknown marine bacterium" should have
Species objects, although binomial() will return something like "unknown".
This Species object will be useful to those who want to use classification()
and ncbi_taxid(). Are there votes or comments?

Brian O.


-----Original Message-----
From: James Wasmuth [mailto:james.wasmuth@ed.ac.uk]
Sent: Thursday, March 11, 2004 9:40 AM
To: Brian Osborne
Cc: bioperl-guts-l@bioperl.org
Subject: Re: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid

Brian and all at bioperl-guts,


below is the comment I've added to the bug[1600].  I think it may need
some discussion, but the patch I've added works to the extent that it
allows creation of a Bio::Species object but the subsequent genus,
species, subspecies calls will be 'wrong'.  Personally I'm more
concerned with the taxid, which I think will be sufficient.

If you want to see the size of this problem go to NCBI taxonomy and
enter the term identified as a token set!  I think that maintaining the
taxid is enough, otherwise the artifical split of terms such as
**unidentified diatom endosymbiont of Peridinium foliaceum*
<http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Undef&id=42247
&lvl=3&lin=f&keep=1&srchmode=3&unlock>*
may be a problem, though some of them are intuitive.

One last question, I've never tried to fix a bug before, so I've
commited a patch as an attachment to Bugzilla for the bug.  Do others
check this and if okay place it in the code...
apologies for the newbie bit...

-james


genbank.pm

line 1123: return unless $genus and  $genus !~ /^(Unknown|None)$/oi;

a number of species are described as Unknown blah blah blah.

The NCBI taxid assigned to unknown taxa is 32644 and has a number of
synonyms, none of which are 'unknown'.

The list includes: other, unknown organism, not specified, not shown,
unspecified, Unknown, None, unclassified , unidentified organism

I've changed the _read_GenBank_Species subroutine to allow organism
names such as 'unknown marine gamma proteobacterium NOR5'.  This will
create a Bio::Species object, but the genus=unknown species=marine
subspecies=gamma.

There is a whole host of species names that ignore the nice rules in
_read_GenBank_Species!  However this fix will allow the correct taxid to
be provided which I think is more than the name!


sub _read_GenBank_Species {
    my( $self,$buffer) = @_;
    my @organell_names = ("chloroplast", "mitochondr");
     # only those carrying DNA, apart from the nucleus

    #CHANGE
     my @unkn_names=("other", 'unknown organism', 'not specified', 'not
shown', 'Unspecified', 'Unknown', 'None', 'unclassified', 'unidentified
organism');

    $_ = $$buffer;

    my( $sub_species, $species, $genus, $common, $organelle, @class,
$ns_name );
    # upon first entering the loop, we must not read a new line -- the
SOURCE
    # line is already in the buffer (HL 05/10/2000)
    while (defined($_) || defined($_ = $self->_readline())) {
    # de-HTMLify (links that may be encountered here don't contain
    # escaped '>', so a simple-minded approach suffices)
        s/<[^>]+>//g;
    if (/^SOURCE\s+(.*)/o) {
        # FIXME this is probably mostly wrong (e.g., it yields things like
        # Homo sapiens adult placenta cDNA to mRNA
        # which is certainly not what you want)
        $common = $1;
        $common =~ s/\.$//; # remove trailing dot
    } elsif (/^\s{2}ORGANISM/o) {
        my @spflds = split(' ', $_);
            ($ns_name) = $_ =~ /\w+\s+(.*)/o;
        shift(@spflds); # ORGANISM

         if(grep { $_ =~ /^$spflds[0]/i; } @organell_names) {
        $organelle = shift(@spflds);
        }
            $genus = shift(@spflds);
        if(@spflds) {
        $species = shift(@spflds);
        } elsif ( grep { $genus } @unkn_names){
        $species = '';
        } else {$species='sp.';}      #there's no species name but it
isn't unclassified
        $sub_species = shift(@spflds) if(@spflds);
        } elsif (/^\s+(.+)/o) {
        # only split on ';' or '.' so that
        # classification that is 2 words will
        # still get matched
        # use map to remove trailing/leading spaces
            push(@class, map { s/^\s+//; s/\s+$//; $_; } split /[;\.]+/,
$1);
        } else {
            last;
        }

        $_ = undef; # Empty $_ to trigger read of next line
    }

     $$buffer = $_;

     # Don't make a species object if it's empty or "Unknown" or "None"
    my $unkn = grep { $_ =~ /^$genus$species/i; } @unkn_names;

     return unless $genus and  $unkn==0;

     # Bio::Species array needs array in Species -> Kingdom direction
    if ($class[0] eq 'Viruses') {
        push( @class, $ns_name );
    }
    elsif ($class[$#class] eq $genus) {
        push( @class, $species );
    } else {
        push( @class, $genus, $species );
    }
    @class = reverse @class;

    my $make = Bio::Species->new();
    $make->classification( \@class, "FORCE" ); # no name validation please
    $make->common_name( $common      ) if $common;
    unless ($class[-1] eq 'Viruses') {
        $make->sub_species( $sub_species ) if $sub_species;
    }
    $make->organelle($organelle) if $organelle;
    return $make;
}


Brian Osborne wrote:

>James,
>
>Your guess is right, no Species is made because of the name. That's because
>genbank.pm normally looks at:
>
>ORGANISM Bos taurus
>
>And makes "Bos" the genus, and so on.
>
>If it sees:
>
>ORGANISM Unknown
>
>It refuses to make a Species object, and it's interpreting your ORGANISM
>line in the same way because it can't make a valid genus, that's the
current
>rule. Personally I'd say that I agree with its principle - how can we make
a
>Species object without genus and species?
>
>You can get the taxid from a SeqFeature object, you already knew that.
>
>Brian O.
>
>
>-----Original Message-----
>From: bioperl-guts-l-bounces@portal.open-bio.org
>[mailto:bioperl-guts-l-bounces@portal.open-bio.org]On Behalf Of
>bugzilla-daemon@portal.open-bio.org
>Sent: Thursday, March 11, 2004 4:21 AM
>To: bioperl-guts-l@bioperl.org
>Subject: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid
>
>http://bugzilla.bioperl.org/show_bug.cgi?id=1600
>
>           Summary: $gb->species->ncbi_taxid
>           Product: Bioperl
>           Version: unspecified
>          Platform: PC
>        OS/Version: Linux
>            Status: NEW
>          Severity: normal
>          Priority: P2
>         Component: Bio::SeqIO
>        AssignedTo: bioperl-guts-l@bioperl.org
>        ReportedBy: james.wasmuth@ed.ac.uk
>
>
>I've included a genbank file for which I have been unable to extract the
>ncbi_taxid for using
>
>$gb->species->ncbi_taxid
>
>the error is:
>Can't call method "ncbi_taxid" on an undefined value
>
>infact I don't get a Bio::Species object.  I'm sure its because of the
name,
>which is correct.
>
>I've tried looking into it, but could not find which Seq object creates the
>Bio::Species object.
>
>
>
>LOCUS       AY007676                1389 bp    DNA     linear   BCT
>29-OCT-2001
>DEFINITION  Unknown marine gamma proteobacterium NOR5 16S ribosomal RNA,
>            partial sequence.
>ACCESSION   AY007676
>VERSION     AY007676.1  GI:12000362
>KEYWORDS    .
>SOURCE      unknown marine gamma proteobacterium NOR5
>  ORGANISM  unknown marine gamma proteobacterium NOR5
>            Bacteria; Proteobacteria; Gammaproteobacteria.
>REFERENCE   1  (bases 1 to 1389)
>  AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Glockner,F.O., Gerdts,G.
>and
>            Amann,R.
>  TITLE     Isolation of novel pelagic bacteria from the German bight and
>their
>            seasonal contributions to surface picoplankton
>  JOURNAL   Appl. Environ. Microbiol. 67 (11), 5134-5142 (2001)
>  MEDLINE   21536174
>   PUBMED   11679337
>REFERENCE   2  (bases 1 to 1389)
>  AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
Gerdts,G.,
>            Schuett,C. and Amann,R.
>  TITLE     Identification and seasonal dominance of culturable marine
>bacteria
>  JOURNAL   Unpublished
>REFERENCE   3  (bases 1 to 1389)
>  AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
Gerdts,G.,
>            Schuett,C. and Amann,R.
>  TITLE     Direct Submission
>  JOURNAL   Submitted (29-AUG-2000) Molecular Ecology,
Max-Planck-Institute,
>            Celsiusstrasse 1, Bremen 28359, Germany
>FEATURES             Location/Qualifiers
>     source          1..1389
>                     /organism="unknown marine gamma proteobacterium NOR5"
>                     /mol_type="genomic DNA"
>                     /db_xref="taxon:145658"
>     rRNA            <1..>1389
>                     /product="16S ribosomal RNA"
>BASE COUNT      343 a    319 c    453 g    274 t
>ORIGIN
>        1 cgcgaaagta cttcggtatg agtagagcgg cggacgggtg agtaacgcgt aggaatctat
>       61 ccagtagtgg gggacaactc ggggaaactc gagctaatac cgcatacgtc ctaagggaga
>      121 aagcggggga tcttcggacc tcgcgctatt ggaggagcct gcgttggatt agctagttgg
>      181 tggggtaaag gcctaccaag gcgacgatcc atagctggtc tgagaggatg atcagccaca
>      241 ccgggactga gacacggccc ggactcctac gggaggcagc agtggggaat attgcgcaat
>      301 gggcgaaagc ctgacgcagc catgccgcgt gtgtgaagaa ggccttcggg ttgtaaagca
>      361 ctttcaattg ggaagaaagg ttagtagtta ataactgcta gctgtgacat tacctttaga
>      421 agaagcaccg gctaactccg tgccagcagc cgcggtaata cggaggtgcg agcgttaatc
>      481 ggaattactg ggcgtaaagc gcgcgtaggc ggtctgttaa gtcggatgtg aaagccccgg
>      541 gctcaacctg ggaattgcac ccgatactgg ccgactggag tgcgagagag ggaggtagaa
>      601 ttccacgtgt agcggtgaaa tgcgtagata tgtggaggaa taccggtggc gaaggcggcc
>      661 tcctggctcg acactgacgc tgaggtgcga aagcgtgggg agcaaacagg attagatacc
>      721 ctggtagtcc acgccgtaaa cgatgtctac tagccgttgg gagacttgat ttcttggtgg
>      781 cgaagttaac gcgataagta gaccgcctgg ggagtacggc cgcaaggtta aaactcaaat
>      841 gaattgacgg gggcccgcac aagcggtgga gcatgtggtt taattcgatg caacgcgaag
>      901 aaccttacca ggccttgaca tcctaggaat cctgtagaga tacgggagtg ccttcgggaa
>      961 tctagtgaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt gggttaagtc
>     1021 ccgtaacgag cgcaaccctt gtccttagtt gccagcgcgt aatggcggga actctaagga
>     1081 gactgccggt gacaaaccgg aggaaggtgg ggacgacgtc aagtcatcat ggcccttacg
>     1141 gcctgggcta cacacgtgct acaatggaac gcacagaggg cagcaaaccc gcgaggggga
>     1201 gcgaatccca caaaacgttt cgtagtccgg atcggagtct gcaactcgac tccgtgaagt
>     1261 cggaatcgct agtaatcgtg aatcagaatg tcacggtgaa tacgttcccg ggccttgtac
>     1321 acaccgcccg tcacaccatg ggagtgggtt gctccagaag tggttagcct aaccttcggg
>     1381 agggcgatc
>//
>
>
>
>------- You are receiving this mail because: -------
>You are the assignee for the bug, or are watching the assignee.
>_______________________________________________
>Bioperl-guts-l mailing list
>Bioperl-guts-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-guts-l
>
>
>
>

--
"I have not failed. I've just found 10,000 ways that don't work."
               --- Thomas Edison

Nematode Bioinformatics           ||
Blaxter Nematode Genomics Group   ||
School of Biological Sciences     ||
Ashworth Laboratories             ||
King's Buildings                  ||    tel: +44 131 650 7403
University of Edinburgh           ||    web: www.nematodes.org
Edinburgh                         ||
EH9 3JT                           ||
UK                                ||


From boris.steipe at utoronto.ca  Thu Mar 11 10:42:36 2004
From: boris.steipe at utoronto.ca (Boris Steipe)
Date: Thu Mar 11 10:48:12 2004
Subject: [Bioperl-l] protein networks
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCAEHDDIAA.brian_osborne@cognia.com>
Message-ID: <B8AC5C55-7372-11D8-A854-000A9577512E@utoronto.ca>


Richard,

Thanks for this initiative !

I might suggest including "interaction" in the package name rather than 
"protein" since this is what you are graphing. Also there are 
Protein-DNA and DNA-DNA interactions (and there will be many more) 
which are being stored by some.

In terms of IO, I would like to suggest supporting the HUPO-PSI 
standard (which DIP supports) for consideration, and naming the module 
after the format, not the database.
<http://psidev.sourceforge.net/mi/xml/doc/user/>

In terms of methods: again, I am not sure you *want* to restrict 
yourself to protein only. MolecularInteraction.pm would suggest itself.


Best regards,


Boris

---
Boris Steipe
University of Toronto
Program in Proteomics & Bioinformatics
Departments of Biochemistry & Molecular and Medical Genetics
http://biochemistry.utoronto.ca/steipe/


> Richard,
>
> You may want to reconsider the name "ProteinGraphIO::xml.pm" since 
> other
> pathway databases (BIND certainly, KEGG perhaps) also export to XML.
>
> Brian O.
>
> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org
> [mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Richard 
> Adams
> Sent: Monday, March 08, 2004 4:31 AM
> To: bioperl-l@portal.open-bio.org
> Subject: [Bioperl-l] protein networks
>
> Hi,
> I've been writing some modules to apply Nathan Goodman's
> "SimpleGraph.pm" module to protein interaction networks. He has
> written this as a replacement for the the broken CPAN graph modules. 
> The
> aim of these modules is
>
> 1. To convert protein interaction data between differentet formats
> 2. To be able to link protein interaction data to protein sequence 
> data.
> 3. To be able to read in interaction data from separate sources, and
> make a merged graph with duplicate interactions flagged or removed.
>
>
>
> Could some core person give some idea for namespace to put these in 
> CVS?
>
> The modules are :
> 1. Nathan's modules
> SimpleGraph.pm                   - a generic graph  module with methods
> to build and represent a graph
> SimpleGraph::Traversal.pm - generic methods to navigate a graph.
>
> 2. My modules
>
> a. An I/O system analagous to SeqIO that lazyloads the correct modules
> for a particular format
>      ProteinGraphIO.pm      - with next_graph, write_graph methods
>        ProteinGraphIO::dip.pm    - parser for DIP format interactions
>          - to read in PSI_XML formatted records
> (still being worked on).
>
> b.   Modules to represent a protein network:
>     PrimaryProteinGraph.pm - subclass of SimpleGraph.pm. This will be
> made from a dip record, analagous to PrimarySeq
>     FullProteinGraph .pm       - subclas of SimpleGraph.pm,  contains
> PrimaryProteinGraph and metadata of graph, will be made from an XML
> file, analagous to RichSeq.
>
> c.  Classes to represent nodes/ edges.
>    NodeI.pm    - interface describing methods that all nodes should 
> have
>         ProteinNode.pm - implements NodeI, currently has methods
> primary_id(), seq(), secondary_id()
>          InteractionI.pm    - interface describing methods all edges
> should have
>         ProteinInteraction.pm - implements InteractionI, currently has
> methods weight(), interactors(), id().
>
> Cheers Richard
>
>
> --
> Dr Richard Adams
> Psychiatric Genetics Group,
> Medical Genetics,
> Molecular Medicine Centre,
> Western General Hospital,
> Crewe Rd West,
> Edinburgh UK
> EH4 2XU
>
> Tel: 44 131 651 1084
> richard.adams@ed.ac.uk
>

From james.wasmuth at ed.ac.uk  Thu Mar 11 10:40:37 2004
From: james.wasmuth at ed.ac.uk (James Wasmuth)
Date: Thu Mar 11 10:53:17 2004
Subject: [Bioperl-l] RE: [Bioperl-guts-l] [Bug 1600]
	New:	$gb->species->ncbi_taxid
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCIEHHDIAA.brian_osborne@cognia.com>
References: <GAEDKMGOKFBLJPKCLKCCIEHHDIAA.brian_osborne@cognia.com>
Message-ID: <40508875.5010904@ed.ac.uk>

I feel there are just too many different permutations to deal with it 
clearly and robustly.

If people did care to get the bionomial name then perhaps if the 
ORGANISM is not a synonym of unclassified, it gets all put into species 
and genus is left blank?  I realise this may require relaxing some 
checks on Bio::Species.

-james


Brian Osborne wrote:

>James,
>
>Right, some discussion is called for. What James is saying is that Genbank
>entries with ORGANISM lines like "Unknown marine bacterium" should have
>Species objects, although binomial() will return something like "unknown".
>This Species object will be useful to those who want to use classification()
>and ncbi_taxid(). Are there votes or comments?
>
>Brian O.
>
>
>-----Original Message-----
>From: James Wasmuth [mailto:james.wasmuth@ed.ac.uk]
>Sent: Thursday, March 11, 2004 9:40 AM
>To: Brian Osborne
>Cc: bioperl-guts-l@bioperl.org
>Subject: Re: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid
>
>Brian and all at bioperl-guts,
>
>
>below is the comment I've added to the bug[1600].  I think it may need
>some discussion, but the patch I've added works to the extent that it
>allows creation of a Bio::Species object but the subsequent genus,
>species, subspecies calls will be 'wrong'.  Personally I'm more
>concerned with the taxid, which I think will be sufficient.
>
>If you want to see the size of this problem go to NCBI taxonomy and
>enter the term identified as a token set!  I think that maintaining the
>taxid is enough, otherwise the artifical split of terms such as
>**unidentified diatom endosymbiont of Peridinium foliaceum*
><http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Undef&id=42247
>&lvl=3&lin=f&keep=1&srchmode=3&unlock>*
>may be a problem, though some of them are intuitive.
>
>One last question, I've never tried to fix a bug before, so I've
>commited a patch as an attachment to Bugzilla for the bug.  Do others
>check this and if okay place it in the code...
>apologies for the newbie bit...
>
>-james
>
>
>
>genbank.pm
>
>line 1123: return unless $genus and  $genus !~ /^(Unknown|None)$/oi;
>
>a number of species are described as Unknown blah blah blah.
>
>The NCBI taxid assigned to unknown taxa is 32644 and has a number of
>synonyms, none of which are 'unknown'.
>
>The list includes: other, unknown organism, not specified, not shown,
>unspecified, Unknown, None, unclassified , unidentified organism
>
>I've changed the _read_GenBank_Species subroutine to allow organism
>names such as 'unknown marine gamma proteobacterium NOR5'.  This will
>create a Bio::Species object, but the genus=unknown species=marine
>subspecies=gamma.
>
>There is a whole host of species names that ignore the nice rules in
>_read_GenBank_Species!  However this fix will allow the correct taxid to
>be provided which I think is more than the name!
>
>
>
>sub _read_GenBank_Species {
>    my( $self,$buffer) = @_;
>    my @organell_names = ("chloroplast", "mitochondr");
>     # only those carrying DNA, apart from the nucleus
>
>    #CHANGE
>     my @unkn_names=("other", 'unknown organism', 'not specified', 'not
>shown', 'Unspecified', 'Unknown', 'None', 'unclassified', 'unidentified
>organism');
>
>    $_ = $$buffer;
>
>    my( $sub_species, $species, $genus, $common, $organelle, @class,
>$ns_name );
>    # upon first entering the loop, we must not read a new line -- the
>SOURCE
>    # line is already in the buffer (HL 05/10/2000)
>    while (defined($_) || defined($_ = $self->_readline())) {
>    # de-HTMLify (links that may be encountered here don't contain
>    # escaped '>', so a simple-minded approach suffices)
>        s/<[^>]+>//g;
>    if (/^SOURCE\s+(.*)/o) {
>        # FIXME this is probably mostly wrong (e.g., it yields things like
>        # Homo sapiens adult placenta cDNA to mRNA
>        # which is certainly not what you want)
>        $common = $1;
>        $common =~ s/\.$//; # remove trailing dot
>    } elsif (/^\s{2}ORGANISM/o) {
>        my @spflds = split(' ', $_);
>            ($ns_name) = $_ =~ /\w+\s+(.*)/o;
>        shift(@spflds); # ORGANISM
>
>         if(grep { $_ =~ /^$spflds[0]/i; } @organell_names) {
>        $organelle = shift(@spflds);
>        }
>            $genus = shift(@spflds);
>        if(@spflds) {
>        $species = shift(@spflds);
>        } elsif ( grep { $genus } @unkn_names){
>        $species = '';
>        } else {$species='sp.';}      #there's no species name but it
>isn't unclassified
>        $sub_species = shift(@spflds) if(@spflds);
>        } elsif (/^\s+(.+)/o) {
>        # only split on ';' or '.' so that
>        # classification that is 2 words will
>        # still get matched
>        # use map to remove trailing/leading spaces
>            push(@class, map { s/^\s+//; s/\s+$//; $_; } split /[;\.]+/,
>$1);
>        } else {
>            last;
>        }
>
>        $_ = undef; # Empty $_ to trigger read of next line
>    }
>
>     $$buffer = $_;
>
>     # Don't make a species object if it's empty or "Unknown" or "None"
>    my $unkn = grep { $_ =~ /^$genus$species/i; } @unkn_names;
>
>     return unless $genus and  $unkn==0;
>
>     # Bio::Species array needs array in Species -> Kingdom direction
>    if ($class[0] eq 'Viruses') {
>        push( @class, $ns_name );
>    }
>    elsif ($class[$#class] eq $genus) {
>        push( @class, $species );
>    } else {
>        push( @class, $genus, $species );
>    }
>    @class = reverse @class;
>
>    my $make = Bio::Species->new();
>    $make->classification( \@class, "FORCE" ); # no name validation please
>    $make->common_name( $common      ) if $common;
>    unless ($class[-1] eq 'Viruses') {
>        $make->sub_species( $sub_species ) if $sub_species;
>    }
>    $make->organelle($organelle) if $organelle;
>    return $make;
>}
>
>
>
>
>Brian Osborne wrote:
>
>  
>
>>James,
>>
>>Your guess is right, no Species is made because of the name. That's because
>>genbank.pm normally looks at:
>>
>>ORGANISM Bos taurus
>>
>>And makes "Bos" the genus, and so on.
>>
>>If it sees:
>>
>>ORGANISM Unknown
>>
>>It refuses to make a Species object, and it's interpreting your ORGANISM
>>line in the same way because it can't make a valid genus, that's the
>>    
>>
>current
>  
>
>>rule. Personally I'd say that I agree with its principle - how can we make
>>    
>>
>a
>  
>
>>Species object without genus and species?
>>
>>You can get the taxid from a SeqFeature object, you already knew that.
>>
>>Brian O.
>>
>>
>>-----Original Message-----
>>From: bioperl-guts-l-bounces@portal.open-bio.org
>>[mailto:bioperl-guts-l-bounces@portal.open-bio.org]On Behalf Of
>>bugzilla-daemon@portal.open-bio.org
>>Sent: Thursday, March 11, 2004 4:21 AM
>>To: bioperl-guts-l@bioperl.org
>>Subject: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid
>>
>>http://bugzilla.bioperl.org/show_bug.cgi?id=1600
>>
>>          Summary: $gb->species->ncbi_taxid
>>          Product: Bioperl
>>          Version: unspecified
>>         Platform: PC
>>       OS/Version: Linux
>>           Status: NEW
>>         Severity: normal
>>         Priority: P2
>>        Component: Bio::SeqIO
>>       AssignedTo: bioperl-guts-l@bioperl.org
>>       ReportedBy: james.wasmuth@ed.ac.uk
>>
>>
>>I've included a genbank file for which I have been unable to extract the
>>ncbi_taxid for using
>>
>>$gb->species->ncbi_taxid
>>
>>the error is:
>>Can't call method "ncbi_taxid" on an undefined value
>>
>>infact I don't get a Bio::Species object.  I'm sure its because of the
>>    
>>
>name,
>  
>
>>which is correct.
>>
>>I've tried looking into it, but could not find which Seq object creates the
>>Bio::Species object.
>>
>>
>>
>>LOCUS       AY007676                1389 bp    DNA     linear   BCT
>>29-OCT-2001
>>DEFINITION  Unknown marine gamma proteobacterium NOR5 16S ribosomal RNA,
>>           partial sequence.
>>ACCESSION   AY007676
>>VERSION     AY007676.1  GI:12000362
>>KEYWORDS    .
>>SOURCE      unknown marine gamma proteobacterium NOR5
>> ORGANISM  unknown marine gamma proteobacterium NOR5
>>           Bacteria; Proteobacteria; Gammaproteobacteria.
>>REFERENCE   1  (bases 1 to 1389)
>> AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Glockner,F.O., Gerdts,G.
>>and
>>           Amann,R.
>> TITLE     Isolation of novel pelagic bacteria from the German bight and
>>their
>>           seasonal contributions to surface picoplankton
>> JOURNAL   Appl. Environ. Microbiol. 67 (11), 5134-5142 (2001)
>> MEDLINE   21536174
>>  PUBMED   11679337
>>REFERENCE   2  (bases 1 to 1389)
>> AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
>>    
>>
>Gerdts,G.,
>  
>
>>           Schuett,C. and Amann,R.
>> TITLE     Identification and seasonal dominance of culturable marine
>>bacteria
>> JOURNAL   Unpublished
>>REFERENCE   3  (bases 1 to 1389)
>> AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
>>    
>>
>Gerdts,G.,
>  
>
>>           Schuett,C. and Amann,R.
>> TITLE     Direct Submission
>> JOURNAL   Submitted (29-AUG-2000) Molecular Ecology,
>>    
>>
>Max-Planck-Institute,
>  
>
>>           Celsiusstrasse 1, Bremen 28359, Germany
>>FEATURES             Location/Qualifiers
>>    source          1..1389
>>                    /organism="unknown marine gamma proteobacterium NOR5"
>>                    /mol_type="genomic DNA"
>>                    /db_xref="taxon:145658"
>>    rRNA            <1..>1389
>>                    /product="16S ribosomal RNA"
>>BASE COUNT      343 a    319 c    453 g    274 t
>>ORIGIN
>>       1 cgcgaaagta cttcggtatg agtagagcgg cggacgggtg agtaacgcgt aggaatctat
>>      61 ccagtagtgg gggacaactc ggggaaactc gagctaatac cgcatacgtc ctaagggaga
>>     121 aagcggggga tcttcggacc tcgcgctatt ggaggagcct gcgttggatt agctagttgg
>>     181 tggggtaaag gcctaccaag gcgacgatcc atagctggtc tgagaggatg atcagccaca
>>     241 ccgggactga gacacggccc ggactcctac gggaggcagc agtggggaat attgcgcaat
>>     301 gggcgaaagc ctgacgcagc catgccgcgt gtgtgaagaa ggccttcggg ttgtaaagca
>>     361 ctttcaattg ggaagaaagg ttagtagtta ataactgcta gctgtgacat tacctttaga
>>     421 agaagcaccg gctaactccg tgccagcagc cgcggtaata cggaggtgcg agcgttaatc
>>     481 ggaattactg ggcgtaaagc gcgcgtaggc ggtctgttaa gtcggatgtg aaagccccgg
>>     541 gctcaacctg ggaattgcac ccgatactgg ccgactggag tgcgagagag ggaggtagaa
>>     601 ttccacgtgt agcggtgaaa tgcgtagata tgtggaggaa taccggtggc gaaggcggcc
>>     661 tcctggctcg acactgacgc tgaggtgcga aagcgtgggg agcaaacagg attagatacc
>>     721 ctggtagtcc acgccgtaaa cgatgtctac tagccgttgg gagacttgat ttcttggtgg
>>     781 cgaagttaac gcgataagta gaccgcctgg ggagtacggc cgcaaggtta aaactcaaat
>>     841 gaattgacgg gggcccgcac aagcggtgga gcatgtggtt taattcgatg caacgcgaag
>>     901 aaccttacca ggccttgaca tcctaggaat cctgtagaga tacgggagtg ccttcgggaa
>>     961 tctagtgaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt gggttaagtc
>>    1021 ccgtaacgag cgcaaccctt gtccttagtt gccagcgcgt aatggcggga actctaagga
>>    1081 gactgccggt gacaaaccgg aggaaggtgg ggacgacgtc aagtcatcat ggcccttacg
>>    1141 gcctgggcta cacacgtgct acaatggaac gcacagaggg cagcaaaccc gcgaggggga
>>    1201 gcgaatccca caaaacgttt cgtagtccgg atcggagtct gcaactcgac tccgtgaagt
>>    1261 cggaatcgct agtaatcgtg aatcagaatg tcacggtgaa tacgttcccg ggccttgtac
>>    1321 acaccgcccg tcacaccatg ggagtgggtt gctccagaag tggttagcct aaccttcggg
>>    1381 agggcgatc
>>//
>>
>>
>>
>>------- You are receiving this mail because: -------
>>You are the assignee for the bug, or are watching the assignee.
>>_______________________________________________
>>Bioperl-guts-l mailing list
>>Bioperl-guts-l@portal.open-bio.org
>>http://portal.open-bio.org/mailman/listinfo/bioperl-guts-l
>>
>>
>>
>>
>>    
>>
>
>--
>"I have not failed. I've just found 10,000 ways that don't work."
>               --- Thomas Edison
>
>Nematode Bioinformatics           ||
>Blaxter Nematode Genomics Group   ||
>School of Biological Sciences     ||
>Ashworth Laboratories             ||
>King's Buildings                  ||    tel: +44 131 650 7403
>University of Edinburgh           ||    web: www.nematodes.org
>Edinburgh                         ||
>EH9 3JT                           ||
>UK                                ||
>
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>  
>

-- 
"I have not failed. I've just found 10,000 ways that don't work."
               --- Thomas Edison

Nematode Bioinformatics           ||
Blaxter Nematode Genomics Group   ||
School of Biological Sciences     ||
Ashworth Laboratories             ||	
King's Buildings                  ||    tel: +44 131 650 7403
University of Edinburgh           ||    web: www.nematodes.org
Edinburgh                         ||
EH9 3JT                           ||
UK                                ||	

From rus1 at duke.edu  Thu Mar 11 12:21:48 2004
From: rus1 at duke.edu (Rhazes Spell)
Date: Thu Mar 11 12:10:02 2004
Subject: [Bioperl-l] Bio::DB::GFF::Segment trouble
Message-ID: <93F9BB66-7380-11D8-BE84-000393B65CBA@duke.edu>

I am having trouble getting a list of features from a segment in my 
database.  I can pull out all of the features, but I can not limit the 
features returned to a specific range.  Following is a code excerpt and 
output.  Any assistance that you can provide would be great!

CODE:
my @segment = $db->segment(-name => 'amata',
			   -start => 1,
			   -end => 40000);
	
foreach my $segment (@segment) {
     my($start,$end) = ($segment->start,$segment->end);

#Some debug info cuz I don't get it yet

     print "\n*** Segment= ",$segment->asString();
     print " Seg Source= ",$segment->sourceseq,"\t";
     print "Reference= ",$segment->ref,"\t";
     print "Abs Start= ",$segment->abs_start," :: ";
     print "Abs End= ", $segment->abs_end,"\n";

#Change the coordinate system to absolute if necessary
     $segment->absolute(1) if (!($segment->sourceseq eq $segment->ref));
     my @f = $segment->features(-types => ['similarity:blastn'],
			      -rangetype => 'contained_in');

     foreach my $feat (@f) {
	print "\n\t",$feat->gff_string(),"\n";
	print "-Target == ";
	foreach ($feat->get_tag_values("Target")) {
	    print $_;
	}
	print "\n-description == ";
	foreach ($feat->get_tag_values("description")) {
	    print $_,"\n";
	}
     }
}

OUTPUT:: (## = Added comment)

*** Segment= amata:1,40000 Seg Source= amata    Reference= amata        
Abs Start= 1 :: Abs End= 40000
##This is the main segment and it should not have any blast 
similarities.

*** Segment= amata:1,40000 Seg Source= amatalpha        Reference= 
amata        Abs Start= 14584 :: Abs End= 54583
## These are blast hits that have a hit on my segment.  I only want the 
hits that occur between 1 and 40000 on
##  the segment, amata
##  I have 2 questions:
## 1: Why are there any hits included with 'Target "Sequence:amata" 
start stop' where start or start is > 40000?
## 2: Why can't I access the members of the Target attribute tag as I 
can the description tag.
## Output follows


         amatalpha       blastn  similarity      14584   14713   
0.832061        +       .       Target "Sequence:a
mata" 41056 41186 ; description "upstream of CAP1 - downstream of CAP1"
-Target ==
-description == upstream of CAP1 - downstream of CAP1

         amatalpha       blastn  similarity      18722   18774   
0.907407        +       .       Target "Sequence:a
mata" 33568 33621 ; description "downstream of SPO14 - downstream of 
SPO14"
-Target ==
-description == downstream of SPO14 - downstream of SPO14

         amatalpha       blastn  similarity      19199   19283   
0.905882        +       .       Target "Sequence:a
mata" 34087 34171 ; description "downstream of SPO14 - downstream of 
SPO14"
-Target ==
-description == downstream of SPO14 - downstream of SPO14

         amatalpha       blastn  similarity      24823   24922   0.83    
+       .       Target "Sequence:amata" 39
695 39794 ; description "upstream of RPL22 - downstream of CAP1"
-Target ==
-description == upstream of RPL22 - downstream of CAP1

         amatalpha       blastn  similarity      25099   25150   
0.923077        +       .       Target "Sequence:a
mata" 39971 40022 ; description "upstream of RPL22 - downstream of CAP1"
-Target ==
-description == upstream of RPL22 - downstream of CAP1


Thanks,
Rhazes

**************************************
Duke University
Department of Biomedical Engineering
Duke University Medical Center,
  Center for Genome Technology

919.684.2720 (lab)
***************************************

From Annie.Law at nrc-cnrc.gc.ca  Thu Mar 11 15:32:38 2004
From: Annie.Law at nrc-cnrc.gc.ca (Law, Annie)
Date: Thu Mar 11 15:45:15 2004
Subject: [Bioperl-l] Bioperl-db, bioperl schema,
	and load_seqdatabase.pl along with de l-assocs-sql.pl
Message-ID: <10C94843061E094A98C02EB77CFC328722FE12@nrcmrdex1d.imsb.nrc.ca>

Hi Hilmar,

I would appreciate help with the following.  

1. My problem is I am now unable to find the data in the database that I set
it up for. It seems that I have successfully loaded NCBI taxonomy, GO
information, locuslink (using LL_tmpl file) information and then unigene
information. 
When I loaded the information in each case there seemed to be almost no
problem in inserting the information into the database Ie. It seemed that
all of the inserts worked properly.

I would like to take clone ids (associated with a microarray associated with
ESTs). I have matched the clone ids to GenBank accesion numbers form the
IMAGE consortium. I wanted to check if the SQL statements that I was
creating where giving me the desired result so I followed My statements
through.  If I were to do this by hand I would take the GenBank accession
number and find out the Unigene identifier.  I would then take the Unigene
identifier and find the corresponding locuslink id.  I would then take the
locuslink ids and find the GO ids (from each category of go)

My understanding of the the biosql schema is that as stated the bioentry
table is the main table. From this you can perform some select statements to
find out the other database references a bioentry has. For Example if we
look at all of the fields in one entry of a Unigene database It has all of
the fields that reference databases in the dbxref table. (I haven't figured
out where all of the other data from this Entry goes or if it does get
inserted in the database?)

BX095770 --accession number
Hs.2 --clusterid
N-acetyltransferase 2 (arylamine N-acetyltransferase) --title NAT2
--gene_symbol 8p22 --cytoband liver--express 
S --gnm_terminus 
10 --locuslink 
8--chromosome 
ACC=G59899 
UNISTS=137181--sts 
ORG=Escherischia coli; 
PROTGI=16129422; 
PROTID=ref:NP_415980.1; 
PCT=24; ALN=255--protsim sapiens--species Hs.2--diplay_id
N-acetyltransferase 2 (arylamine N-acetyltransferase)--description 26--size
Hs.2--object_id NCBI--authority UniGene--namespace Hs.2--display_name
26--scount

The select statement I wrote to go from GenBank accession number to
locuslink id was. 

Select dbxref.dbxref_id 
FROM dbxref
WHERE dbxref.accession = 'H08278'

With the dbxref_id that I got I was going to find out the corresponding
bioentry_id (with biodatabase_id corresponding to Unigene) and then go back
to the dbxref table to find the dbxref locuslink entry for the unigene
bioentry (in the bioentry table). The problem is that for each Genbank
accession number that I look up I get 0 rows returned using The select
statement above (I don't think there is anyhing wrong with the SQL statement
since I checked it With an unrelated GenBank accession number).  This is
true for all of the clones that I have followed through.

I found that this GenBank accession number does exist in the Hs.data file I
used as input. That entry is 
//
ID          Hs.478728
TITLE       Homo sapiens transcribed sequences
EXPRESS     whole brain ; total brain 
CHROMOSOME  3
STS         ACC=SHGC-77720 UNISTS=45248
SCOUNT      3
SEQUENCE    ACC=R60728.1; NID=g831423; CLONE=IMAGE:42222; END=5'; LID=263;
SEQTYPE=EST
SEQUENCE    ACC=Z42053.1; NID=g564288; CLONE=c-06h12; LID=186; SEQTYPE=EST
SEQUENCE    ACC=H08278.1; NID=g873100; CLONE=IMAGE:45161; END=5'; LID=263;
SEQTYPE=EST
//

????? Somehow this information is not getting entered in the database or I
am trying to access the information in the wrong 
Manner?  How can I resolve this problem?   Is there some script that needs
to be altered or some option used?

Alternatively,  I searched in the bioentry database for an entry with the
GenBank accession number that I am looking for however I am unable to find
the same bioentry_id in the bioentry_dbxref table.  Sometimes I am not even
able to find the Genbank accesion number I am looking for in the bioentry
table.
 
2. My second question when hopefully I have sorted things out in the first
question is.
I am not exactly sure how to go about accessing the GO information
corresponding to a locuslink id?
Once I had found the bioentry_id of the relevant Locuslink bioentry and join
with the bioentry table with the 
Bioentry relationship table and subsequently the term table.  I would like
to get the GO id and the category or ontology that
The GO id is associated with be it molecular function, biological process or
cellular component.  It seems that most of the
Entries in the term table are of Ontoloy Id = 1 (Gene ontology) and only
around 200 entries molecular function, biological process, and cellular
component put together when there are about 16000 entries in the term table.
Shouldn't the total number of these three
Types equal to the number of entries of Ontology Id = 1?
How do I find out what GO category or rather ontology (molecular function,
biological process, and cellular component) 
That a GO identifier belongs to?
I am not sure that I understand what all 7 distinct Ontology Ids in the
Ontology represent?
(I understand the meaning of 3,4 and 5)
1 = Gene Ontology
2 = Annotation
3 = cellular
4 = molecular
5 = biological
6 = Object Slots
7 = Relationship

3.  In a previous e-mail you were kind enough to mention that I could
contact you once I had gotten to the point
I would need to update the database.  I have gotten to this point and would
like to know how I can modify del-assocs-sql.pl to
be used with MySQL for the --mergeobjs option 

It seems to me that the --lookup option in unlike the mergeobjs option ie.
You would not enter something like
--lookup=***.pl

As an aside, I would like to know since locuslink, unigene takes around 2
days to load.  What happens in between when a user is trying
To access the database are there are deleterious effects?  My guess is that
the user will just get the data that is currently in the
Database and that no harm will happen.

Thanks very much,
Annie.

From sumit.kaur at sbri.org  Thu Mar 11 16:36:43 2004
From: sumit.kaur at sbri.org (Sumit Kaur)
Date: Thu Mar 11 16:42:24 2004
Subject: [Bioperl-l] (no subject)
Message-ID: <9B09CC35724F4B4091625AD346E54D1BFE7631@exchange01.sbri.org>


From sumit.kaur at sbri.org  Thu Mar 11 16:36:45 2004
From: sumit.kaur at sbri.org (Sumit Kaur)
Date: Thu Mar 11 16:42:26 2004
Subject: [Bioperl-l] (no subject)
Message-ID: <9B09CC35724F4B4091625AD346E54D1BFE7632@exchange01.sbri.org>


From lstein at cshl.edu  Thu Mar 11 13:08:10 2004
From: lstein at cshl.edu (Lincoln Stein)
Date: Thu Mar 11 18:35:25 2004
Subject: [Bioperl-l] Bio::DB::GFF::Segment trouble
In-Reply-To: <93F9BB66-7380-11D8-BE84-000393B65CBA@duke.edu>
References: <93F9BB66-7380-11D8-BE84-000393B65CBA@duke.edu>
Message-ID: <200403111308.11186.lstein@cshl.edu>

Hi,

This is a very slight misconception.  You need to use the "contains" 
range type.  The range type is relative to the viewpoint of the 
segment.  You are looking for the features that the segment contains.  
The contained_in relationship works the other way around, and will 
find features that completely contain the segment.

Looking at your output, the contained_in relationship doesn't appear 
to be working properly, however.  I wonder which database adaptor you 
are using?

The target information is a special method called target().  It is not 
treated as an ordinary tag.

Lincoln

On Thursday 11 March 2004 12:21 pm, Rhazes Spell wrote:
> I am having trouble getting a list of features from a segment in my
> database.  I can pull out all of the features, but I can not limit
> the features returned to a specific range.  Following is a code
> excerpt and output.  Any assistance that you can provide would be
> great!
>
> CODE:
> my @segment = $db->segment(-name => 'amata',
> 			   -start => 1,
> 			   -end => 40000);
>
> foreach my $segment (@segment) {
>      my($start,$end) = ($segment->start,$segment->end);
>
> #Some debug info cuz I don't get it yet
>
>      print "\n*** Segment= ",$segment->asString();
>      print " Seg Source= ",$segment->sourceseq,"\t";
>      print "Reference= ",$segment->ref,"\t";
>      print "Abs Start= ",$segment->abs_start," :: ";
>      print "Abs End= ", $segment->abs_end,"\n";
>
> #Change the coordinate system to absolute if necessary
>      $segment->absolute(1) if (!($segment->sourceseq eq
> $segment->ref)); my @f = $segment->features(-types =>
> ['similarity:blastn'], -rangetype => 'contained_in');
>
>      foreach my $feat (@f) {
> 	print "\n\t",$feat->gff_string(),"\n";
> 	print "-Target == ";
> 	foreach ($feat->get_tag_values("Target")) {
> 	    print $_;
> 	}
> 	print "\n-description == ";
> 	foreach ($feat->get_tag_values("description")) {
> 	    print $_,"\n";
> 	}
>      }
> }
>
> OUTPUT:: (## = Added comment)
>
> *** Segment= amata:1,40000 Seg Source= amata    Reference= amata
> Abs Start= 1 :: Abs End= 40000
> ##This is the main segment and it should not have any blast
> similarities.
>
> *** Segment= amata:1,40000 Seg Source= amatalpha        Reference=
> amata        Abs Start= 14584 :: Abs End= 54583
> ## These are blast hits that have a hit on my segment.  I only want
> the hits that occur between 1 and 40000 on
> ##  the segment, amata
> ##  I have 2 questions:
> ## 1: Why are there any hits included with 'Target "Sequence:amata"
> start stop' where start or start is > 40000?
> ## 2: Why can't I access the members of the Target attribute tag as
> I can the description tag.
> ## Output follows
>
>
>          amatalpha       blastn  similarity      14584   14713
> 0.832061        +       .       Target "Sequence:a
> mata" 41056 41186 ; description "upstream of CAP1 - downstream of
> CAP1" -Target ==
> -description == upstream of CAP1 - downstream of CAP1
>
>          amatalpha       blastn  similarity      18722   18774
> 0.907407        +       .       Target "Sequence:a
> mata" 33568 33621 ; description "downstream of SPO14 - downstream
> of SPO14"
> -Target ==
> -description == downstream of SPO14 - downstream of SPO14
>
>          amatalpha       blastn  similarity      19199   19283
> 0.905882        +       .       Target "Sequence:a
> mata" 34087 34171 ; description "downstream of SPO14 - downstream
> of SPO14"
> -Target ==
> -description == downstream of SPO14 - downstream of SPO14
>
>          amatalpha       blastn  similarity      24823   24922  
> 0.83 +       .       Target "Sequence:amata" 39
> 695 39794 ; description "upstream of RPL22 - downstream of CAP1"
> -Target ==
> -description == upstream of RPL22 - downstream of CAP1
>
>          amatalpha       blastn  similarity      25099   25150
> 0.923077        +       .       Target "Sequence:a
> mata" 39971 40022 ; description "upstream of RPL22 - downstream of
> CAP1" -Target ==
> -description == upstream of RPL22 - downstream of CAP1
>
>
>
> Thanks,
> Rhazes
>
> **************************************
> Duke University
> Department of Biomedical Engineering
> Duke University Medical Center,
>   Center for Genome Technology
>
> 919.684.2720 (lab)
> ***************************************
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
Lincoln D. Stein
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: signature
Url : http://portal.open-bio.org/pipermail/bioperl-l/attachments/20040311/b4ca293d/attachment.bin
From bsman at tom.com  Thu Mar 11 15:33:14 2004
From: bsman at tom.com (=?gb2312?B?zfXqySwgU2FuIFdhbmc=?=)
Date: Thu Mar 11 18:36:07 2004
Subject: [Bioperl-l] Anybody use 3'UTR for siRNA design?
Message-ID: <200403112038.i2BKcetk021732@portal.open-bio.org>

Dear bioperl-l:

	Anybody use 3'UTR for siRNA design?

	Best Regards!
 				
			Yours sincerely
��������������������, San Wang
����������������bsman@tom.com
��������������������2004-03-12

http://www.estarray.org

 ,-,-.     ,-,-.     ,-,-.     ,-,-.     ,-
/ / \ \   / / \ \   / / \ \   / / \ \   / /
     \ \ / /   \ \ / /   \ \ / /   \ \ / / 
      `-'-'     `-'-'     `-'-'     `-'-'  


San Wang
Dr. Student
Bioinformatics and Gene Network Research Group
Zhejiang University
P.R.China

Address:
Room 415# ,Bio-technology Institute,
Zhejiang University,
Kaixuan Road 268#  Hangzhou
Zhejiang, P.R.China
310027

Tel:
0086-571-86892051
Fax:
0086-571-86961525
E-mail:
bsman@163.com
Website:
http://www.estarray.org 

��������������������
  Innovation is everything. When you are on the forefront, you can see what the next innovation needs to be? 
			--- from Robert Noyce, Founder of Intel
  The more things change, the more they stay the same!
  The more things seem to be complicated ,the more they are simple in fact!

From confirm-s2-Ci_AjA_jwvYcCcj8Reh9HC1iG1A-bioperl-l=bioperl.org at yahoogroups.com  Thu Mar 11 20:00:37 2004
From: confirm-s2-Ci_AjA_jwvYcCcj8Reh9HC1iG1A-bioperl-l=bioperl.org at yahoogroups.com (Yahoo! Groups)
Date: Thu Mar 11 20:06:11 2004
Subject: [Bioperl-l] Please confirm your request to join kammi_dalnet 
Message-ID: <1079053237.107.93134.m19@yahoogroups.com>


Hello bioperl-l@bioperl.org,

We have received your request to join the kammi_dalnet 
group hosted by Yahoo! Groups, a free, easy-to-use community service.

This request will expire in 7 days.

TO BECOME A MEMBER OF THE GROUP: 

1) Go to the Yahoo! Groups site by clicking on this link:

   http://groups.yahoo.com/i?i=Ci_AjA_jwvYcCcj8Reh9HC1iG1A&e=bioperl-l%40bioperl%2Eorg 

  (If clicking doesn't work, "Cut" and "Paste" the line above into your 
   Web browser's address bar.)

-OR-

2) REPLY to this email by clicking "Reply" and then "Send"
   in your email program

If you did not request, or do not want, a membership in the
kammi_dalnet group, please accept our apologies
and ignore this message.

Regards,

Yahoo! Groups Customer Care

Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/ 

 
From JLJMJPLFIFG at hotmail.com  Thu Mar 11 21:28:16 2004
From: JLJMJPLFIFG at hotmail.com (Heidi Wright)
Date: Thu Mar 11 21:37:49 2004
Subject: [Bioperl-l] RE:[2] removal
Message-ID: <200403035370.q9Q9PBt217715@gateway4.wongfaye.com>

We offer a large selection of both domestic and 
international brand cheap cigarettes online at 
rock bottom prices! We carry Marlboros, Winstons, 
Camels, Salems, Parliaments, Kents, Dunhills, 
davidoffs and many, many more brands and flavors 
of Cheap Cigarettes Online. 

more information here:
http://autobiography.cigaretesstore.biz/il/?reproach

______________________
Un_subscr1ber$ go here:
http://riven.cigaretesstore.biz/pizdec.php?96555

----396921786406309320--
From mgear911 at yahoo.com  Thu Mar 11 21:49:41 2004
From: mgear911 at yahoo.com (=?iso-8859-1?q?Yuhui?=)
Date: Thu Mar 11 21:55:18 2004
Subject: [Bioperl-l] How to display newick tree in graphics?
Message-ID: <20040312024941.17553.qmail@web13303.mail.yahoo.com>

hi,
a newbie to perl here, can someone teach me how to
output a newick file in the form of a image on a
webpage.

Thanks.

__________________________________________________
Do You Yahoo!?
Stand a chance to win a dream date, join the Dream Guy Contest!
http://sg.yahoo.com/dreamguy
From jonathon at mgcheo3.med.uottawa.ca  Fri Mar 12 09:08:43 2004
From: jonathon at mgcheo3.med.uottawa.ca (Jonathon Greenwood)
Date: Fri Mar 12 09:19:15 2004
Subject: [Bioperl-l] Parsing features
Message-ID: <Pine.GSO.4.40.0403120840030.17126-100000@mgcheo3>

Hi all!

I'm wondering if it is possible to extract the features from a Genbank
file and print out the feature information within a text box (I'm dealing
with the text of the feature I don't want to display them as a glyph). I
tried this already using the Dumper method...but it spews out too much
information. I also get the errors when I run the code(which is for this
feature):
Bio::SeqFeature::Generic=HASH(0x654788) a bunch of times... and
(in cleanup) Undefined subroutine Fh:flush at
/usr/local/lib/perl5/site_perl/5.8.0/Bio/Root/IO.pm line 519

Thanks for any help that can be given...

Jonathan Greenwood
Co-op student,
email: jonathon@mgcheo.med.uottawa.ca

CODE:

#! /usr/local/bin/perl -w

use strict;
use CGI qw / :standard /;
use CGI::Pretty;
use Bio::SeqFeature::Generic;
use Bio::SeqIO;
use Bio::Root::IO;
use File::Temp;

my @features = read_file(param('file'));

print header, start_html('Plasmid Feature Editor');

print start_multipart_form (),
table ({-cellpadding => 10},
TR({-class => 'resultsbody'},
	td (textarea(-name => 'editarea',
		     -values => "@features",
		     -rows   => 20,
		     -cols   => 70,
		     -override => (@features) || (param('clear')),
		    ),
		),
	 ),
TR({-class => 'resultstitle'},
td (filefield(-name => 'file',
	      -length => 40),
	),

td(submit(-name => 'submit_button',
	  -value => 'Click to display features'),
	),
    ),
TR({-class => 'resultstitle'},
td(submit(-name => 'save_button',
	  -value => 'Click here to save your work'),
	),
td(reset(),
),
),
),
end_form;

print end_html;

exit 0;

sub read_file {

my $fh = param('file');
my $gb_parser = Bio::SeqIO->new(-fh=>$fh,-format=>'Genbank');
my @features;

while (my $seq = $gb_parser-> next_seq) {
	push @features, $seq->get_SeqFeatures();
}
return @features;
}

From jason at cgt.duhs.duke.edu  Fri Mar 12 09:24:27 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Mar 12 09:30:11 2004
Subject: [Bioperl-l] Parsing features
In-Reply-To: <Pine.GSO.4.40.0403120840030.17126-100000@mgcheo3>
References: <Pine.GSO.4.40.0403120840030.17126-100000@mgcheo3>
Message-ID: <Pine.LNX.4.50.0403120923150.23843-100000@tenero.duhs.duke.edu>

Did you try some of the code in the Feature-Annotation HOWTO?
  http://bioperl.org/HOWTOs/

-jason
On Fri, 12 Mar 2004, Jonathon Greenwood wrote:

> Hi all!
>
> I'm wondering if it is possible to extract the features from a Genbank
> file and print out the feature information within a text box (I'm dealing
> with the text of the feature I don't want to display them as a glyph). I
> tried this already using the Dumper method...but it spews out too much
> information. I also get the errors when I run the code(which is for this
> feature):
> Bio::SeqFeature::Generic=HASH(0x654788) a bunch of times... and
> (in cleanup) Undefined subroutine Fh:flush at
> /usr/local/lib/perl5/site_perl/5.8.0/Bio/Root/IO.pm line 519
>
> Thanks for any help that can be given...
>
> Jonathan Greenwood
> Co-op student,
> email: jonathon@mgcheo.med.uottawa.ca
>
> CODE:
>
> #! /usr/local/bin/perl -w
>
> use strict;
> use CGI qw / :standard /;
> use CGI::Pretty;
> use Bio::SeqFeature::Generic;
> use Bio::SeqIO;
> use Bio::Root::IO;
> use File::Temp;
>
> my @features = read_file(param('file'));
>
> print header, start_html('Plasmid Feature Editor');
>
> print start_multipart_form (),
> table ({-cellpadding => 10},
> TR({-class => 'resultsbody'},
> 	td (textarea(-name => 'editarea',
> 		     -values => "@features",
> 		     -rows   => 20,
> 		     -cols   => 70,
> 		     -override => (@features) || (param('clear')),
> 		    ),
> 		),
> 	 ),
> TR({-class => 'resultstitle'},
> td (filefield(-name => 'file',
> 	      -length => 40),
> 	),
>
> td(submit(-name => 'submit_button',
> 	  -value => 'Click to display features'),
> 	),
>     ),
> TR({-class => 'resultstitle'},
> td(submit(-name => 'save_button',
> 	  -value => 'Click here to save your work'),
> 	),
> td(reset(),
> ),
> ),
> ),
> end_form;
>
> print end_html;
>
> exit 0;
>
> sub read_file {
>
> my $fh = param('file');
> my $gb_parser = Bio::SeqIO->new(-fh=>$fh,-format=>'Genbank');
> my @features;
>
> while (my $seq = $gb_parser-> next_seq) {
> 	push @features, $seq->get_SeqFeatures();
> }
> return @features;
> }
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From Jan.Aerts at wur.nl  Fri Mar 12 09:36:13 2004
From: Jan.Aerts at wur.nl (Aerts, Jan)
Date: Fri Mar 12 09:41:50 2004
Subject: [Bioperl-l] bp_pg_bulk_load_gff.pl: invalid command \N
Message-ID: <7D030487F1A3D143A76F2A1E91F57035EF3D8F@scomp0010>

Hi all,

Has anyone encountered the following errors when trying to load a gff file into a PostgresQL database? The bioperl version is 1.4. The gff file looks like this:

[start gff file]
GGA10_random    Blast   Gene    1906433 1908278 1e-102  +       .       Gene VIM ; Accession NM_003380.1
GGA10_random    Blast   Gene    1539573 1542536 0.0     +       .       Gene GTF2E1 ; Accession NM_005513.1
[stop gff file]

[start error]
...
psql:/usr/tmp/fdata.30284:85393: invalid command \N
psql:/usr/tmp/fdata.30284:85393: invalid command \N
psql:/usr/tmp/fdata.30284:85394: invalid command \N
psql:/usr/tmp/fdata.30284:85394: invalid command \N
psql:/upsql:/usr/tmp/ftype.30284:1: ERROR:  parser: parse error at or near "("
psql:/usr/tmp/ftype.30284:3: invalid command \.
psql:/usr/tmp/ftype.30284:4: ERROR:  parser: parse error at or near "1"
psql:/usr/tmp/fgroup.30284:1: ERROR:  parser: parse error at or near "("
psql:/usr/tmp/fgroup.30284:13694: invalid command \.
psql:/usr/tmp/fgroup.30284:13695: ERROR:  parser: parse error at or near "1"
psql:/usr/tmp/fattribute.30284:1: ERROR:  parser: parse error at or near "("
psql:/usr/tmp/fattribute.30284:3: invalid command \.
psql:/usr/tmp/fattribute.30284:4: ERROR:  parser: parse error at or near "1"
psql:/usr/tmp/fattribute_to_feature.30284:1: ERROR:  parser: parse error at or near "("
psql:/usr/tmp/fattribute_to_feature.30284:139850: invalid command \.
psql:/usr/tmp/fattribute_to_feature.30284:139851: ERROR:  parser: parse error at or near "1"
[stop error]

Any help appreciated.
Thanks,
Jan Aerts

From jonathon at mgcheo3.med.uottawa.ca  Fri Mar 12 09:32:37 2004
From: jonathon at mgcheo3.med.uottawa.ca (Jonathon Greenwood)
Date: Fri Mar 12 09:43:13 2004
Subject: [Bioperl-l] Parsing features
In-Reply-To: <Pine.LNX.4.50.0403120923150.23843-100000@tenero.duhs.duke.edu>
Message-ID: <Pine.GSO.4.40.0403120931440.17653-100000@mgcheo3>

I tried some of the code in the Feature Annotation HOWTO but it wouldn't
display anything within the text box.

Jonathan

On Fri, 12 Mar 2004, Jason Stajich wrote:

> Did you try some of the code in the Feature-Annotation HOWTO?
>   http://bioperl.org/HOWTOs/
>
> -jason
> On Fri, 12 Mar 2004, Jonathon Greenwood wrote:
>
> > Hi all!
> >
> > I'm wondering if it is possible to extract the features from a Genbank
> > file and print out the feature information within a text box (I'm dealing
> > with the text of the feature I don't want to display them as a glyph). I
> > tried this already using the Dumper method...but it spews out too much
> > information. I also get the errors when I run the code(which is for this
> > feature):
> > Bio::SeqFeature::Generic=HASH(0x654788) a bunch of times... and
> > (in cleanup) Undefined subroutine Fh:flush at
> > /usr/local/lib/perl5/site_perl/5.8.0/Bio/Root/IO.pm line 519
> >
> > Thanks for any help that can be given...
> >
> > Jonathan Greenwood
> > Co-op student,
> > email: jonathon@mgcheo.med.uottawa.ca
> >
> > CODE:
> >
> > #! /usr/local/bin/perl -w
> >
> > use strict;
> > use CGI qw / :standard /;
> > use CGI::Pretty;
> > use Bio::SeqFeature::Generic;
> > use Bio::SeqIO;
> > use Bio::Root::IO;
> > use File::Temp;
> >
> > my @features = read_file(param('file'));
> >
> > print header, start_html('Plasmid Feature Editor');
> >
> > print start_multipart_form (),
> > table ({-cellpadding => 10},
> > TR({-class => 'resultsbody'},
> > 	td (textarea(-name => 'editarea',
> > 		     -values => "@features",
> > 		     -rows   => 20,
> > 		     -cols   => 70,
> > 		     -override => (@features) || (param('clear')),
> > 		    ),
> > 		),
> > 	 ),
> > TR({-class => 'resultstitle'},
> > td (filefield(-name => 'file',
> > 	      -length => 40),
> > 	),
> >
> > td(submit(-name => 'submit_button',
> > 	  -value => 'Click to display features'),
> > 	),
> >     ),
> > TR({-class => 'resultstitle'},
> > td(submit(-name => 'save_button',
> > 	  -value => 'Click here to save your work'),
> > 	),
> > td(reset(),
> > ),
> > ),
> > ),
> > end_form;
> >
> > print end_html;
> >
> > exit 0;
> >
> > sub read_file {
> >
> > my $fh = param('file');
> > my $gb_parser = Bio::SeqIO->new(-fh=>$fh,-format=>'Genbank');
> > my @features;
> >
> > while (my $seq = $gb_parser-> next_seq) {
> > 	push @features, $seq->get_SeqFeatures();
> > }
> > return @features;
> > }
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
>
> --
> Jason Stajich
> Duke University
> jason at cgt.mc.duke.edu
>

From Jan.Aerts at wur.nl  Fri Mar 12 10:00:27 2004
From: Jan.Aerts at wur.nl (Aerts, Jan)
Date: Fri Mar 12 10:06:02 2004
Subject: [Bioperl-l] bp_pg_bulk_load_gff.pl: invalid command \N
Message-ID: <7D030487F1A3D143A76F2A1E91F57035EF3D90@scomp0010>

I forgot to say: the command used was this: bp_pg_bulk_load_gff.pl -d my_db file.gff

Jan


-----Original Message-----
From:	bioperl-l-bounces@portal.open-bio.org on behalf of Aerts, Jan
Sent:	Fri 12-Mar-04 15:36
To:	bioperl-l@bioperl.org
Cc:	
Subject:	[Bioperl-l] bp_pg_bulk_load_gff.pl: invalid command \N
Hi all,

Has anyone encountered the following errors when trying to load a gff file into a PostgresQL database? The bioperl version is 1.4. The gff file looks like this:

[start gff file]
GGA10_random    Blast   Gene    1906433 1908278 1e-102  +       .       Gene VIM ; Accession NM_003380.1
GGA10_random    Blast   Gene    1539573 1542536 0.0     +       .       Gene GTF2E1 ; Accession NM_005513.1
[stop gff file]

[start error]
...
psql:/usr/tmp/fdata.30284:85393: invalid command \N
psql:/usr/tmp/fdata.30284:85393: invalid command \N
psql:/usr/tmp/fdata.30284:85394: invalid command \N
psql:/usr/tmp/fdata.30284:85394: invalid command \N
psql:/upsql:/usr/tmp/ftype.30284:1: ERROR:  parser: parse error at or near "("
psql:/usr/tmp/ftype.30284:3: invalid command \.
psql:/usr/tmp/ftype.30284:4: ERROR:  parser: parse error at or near "1"
psql:/usr/tmp/fgroup.30284:1: ERROR:  parser: parse error at or near "("
psql:/usr/tmp/fgroup.30284:13694: invalid command \.
psql:/usr/tmp/fgroup.30284:13695: ERROR:  parser: parse error at or near "1"
psql:/usr/tmp/fattribute.30284:1: ERROR:  parser: parse error at or near "("
psql:/usr/tmp/fattribute.30284:3: invalid command \.
psql:/usr/tmp/fattribute.30284:4: ERROR:  parser: parse error at or near "1"
psql:/usr/tmp/fattribute_to_feature.30284:1: ERROR:  parser: parse error at or near "("
psql:/usr/tmp/fattribute_to_feature.30284:139850: invalid command \.
psql:/usr/tmp/fattribute_to_feature.30284:139851: ERROR:  parser: parse error at or near "1"
[stop error]

Any help appreciated.
Thanks,
Jan Aerts

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From Jan.Aerts at wur.nl  Fri Mar 12 10:44:24 2004
From: Jan.Aerts at wur.nl (Aerts, Jan)
Date: Fri Mar 12 10:50:10 2004
Subject: [Bioperl-l] bp_pg_bulk_load_gff.pl: invalid command \N
Message-ID: <7D030487F1A3D143A76F2A1E91F57035EF3D92@scomp0010>

Thanks Sean,

I did try bp_load_gff.pl, and that one works. But the gff file I want to upload is almost 140,000 lines big, so it would take more than a day to get that into Postgres. I did use bp_pg_bulk_load_gff.pl before (must have been another version), and it worked then.

So I'm afraid I'm still stuck here.
Jan


-----Original Message-----
From:	Sean Davis [mailto:sdavis2@mail.nih.gov]
Sent:	Fri 12-Mar-04 16:39
To:	Aerts, Jan
Cc:	
Subject:	Re: [Bioperl-l] bp_pg_bulk_load_gff.pl: invalid command \N
I have had problems using the bp_bulk_load scripts for mysql on macos.  I
ended up solving the problem by using the bp_load script instead.  If there
is one for postgresql, you might try it, as I think the functionality is the
same.

Sean

On 3/12/04 10:00 AM, "Aerts, Jan" <Jan.Aerts@wur.nl> wrote:

> I forgot to say: the command used was this: bp_pg_bulk_load_gff.pl -d my_db
> file.gff
> 
> Jan
> 
> 
> -----Original Message-----
> From:    bioperl-l-bounces@portal.open-bio.org on behalf of Aerts, Jan
> Sent:    Fri 12-Mar-04 15:36
> To:    bioperl-l@bioperl.org
> Cc:    
> Subject:    [Bioperl-l] bp_pg_bulk_load_gff.pl: invalid command \N
> Hi all,
> 
> Has anyone encountered the following errors when trying to load a gff file
> into a PostgresQL database? The bioperl version is 1.4. The gff file looks
> like this:
> 
> [start gff file]
> GGA10_random    Blast   Gene    1906433 1908278 1e-102  +       .       Gene
> VIM ; Accession NM_003380.1
> GGA10_random    Blast   Gene    1539573 1542536 0.0     +       .       Gene
> GTF2E1 ; Accession NM_005513.1
> [stop gff file]
> 
> [start error]
> ...
> psql:/usr/tmp/fdata.30284:85393: invalid command \N
> psql:/usr/tmp/fdata.30284:85393: invalid command \N
> psql:/usr/tmp/fdata.30284:85394: invalid command \N
> psql:/usr/tmp/fdata.30284:85394: invalid command \N
> psql:/upsql:/usr/tmp/ftype.30284:1: ERROR:  parser: parse error at or near "("
> psql:/usr/tmp/ftype.30284:3: invalid command \.
> psql:/usr/tmp/ftype.30284:4: ERROR:  parser: parse error at or near "1"
> psql:/usr/tmp/fgroup.30284:1: ERROR:  parser: parse error at or near "("
> psql:/usr/tmp/fgroup.30284:13694: invalid command \.
> psql:/usr/tmp/fgroup.30284:13695: ERROR:  parser: parse error at or near "1"
> psql:/usr/tmp/fattribute.30284:1: ERROR:  parser: parse error at or near "("
> psql:/usr/tmp/fattribute.30284:3: invalid command \.
> psql:/usr/tmp/fattribute.30284:4: ERROR:  parser: parse error at or near "1"
> psql:/usr/tmp/fattribute_to_feature.30284:1: ERROR:  parser: parse error at or
> near "("
> psql:/usr/tmp/fattribute_to_feature.30284:139850: invalid command \.
> psql:/usr/tmp/fattribute_to_feature.30284:139851: ERROR:  parser: parse error
> at or near "1"
> [stop error]
> 
> Any help appreciated.
> Thanks,
> Jan Aerts
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
> 
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 


From amackey at pcbi.upenn.edu  Fri Mar 12 10:57:38 2004
From: amackey at pcbi.upenn.edu (Aaron J Mackey)
Date: Fri Mar 12 11:03:11 2004
Subject: [Bioperl-l] bp_pg_bulk_load_gff.pl: invalid command \N
In-Reply-To: <7D030487F1A3D143A76F2A1E91F57035EF3D92@scomp0010>
Message-ID: <Pine.LNX.4.44.0403121054550.9421-100000@snowball.pcbi.upenn.edu>


The bulk loaders work by writing the data to files that are read by
Pg/MySQL directly into the tables in a single operation; the \N's are the
NULL entries for your GFF fields with "." in them.  I would look through 
the documentation of your version of Pg to find out what the NULL 
encoding should be for your version; perhaps there was a change?

-Aaron

On Fri, 12 Mar 2004, Aerts, Jan wrote:

> Thanks Sean,
> 
> I did try bp_load_gff.pl, and that one works. But the gff file I want to upload is almost 140,000 lines big, so it would take more than a day to get that into Postgres. I did use bp_pg_bulk_load_gff.pl before (must have been another version), and it worked then.
> 
> So I'm afraid I'm still stuck here.
> Jan
> 
> 
> -----Original Message-----
> From:	Sean Davis [mailto:sdavis2@mail.nih.gov]
> Sent:	Fri 12-Mar-04 16:39
> To:	Aerts, Jan
> Cc:	
> Subject:	Re: [Bioperl-l] bp_pg_bulk_load_gff.pl: invalid command \N
> I have had problems using the bp_bulk_load scripts for mysql on macos.  I
> ended up solving the problem by using the bp_load script instead.  If there
> is one for postgresql, you might try it, as I think the functionality is the
> same.
> 
> Sean
> 
> On 3/12/04 10:00 AM, "Aerts, Jan" <Jan.Aerts@wur.nl> wrote:
> 
> > I forgot to say: the command used was this: bp_pg_bulk_load_gff.pl -d my_db
> > file.gff
> > 
> > Jan
> > 
> > 
> > -----Original Message-----
> > From:    bioperl-l-bounces@portal.open-bio.org on behalf of Aerts, Jan
> > Sent:    Fri 12-Mar-04 15:36
> > To:    bioperl-l@bioperl.org
> > Cc:    
> > Subject:    [Bioperl-l] bp_pg_bulk_load_gff.pl: invalid command \N
> > Hi all,
> > 
> > Has anyone encountered the following errors when trying to load a gff file
> > into a PostgresQL database? The bioperl version is 1.4. The gff file looks
> > like this:
> > 
> > [start gff file]
> > GGA10_random    Blast   Gene    1906433 1908278 1e-102  +       .       Gene
> > VIM ; Accession NM_003380.1
> > GGA10_random    Blast   Gene    1539573 1542536 0.0     +       .       Gene
> > GTF2E1 ; Accession NM_005513.1
> > [stop gff file]
> > 
> > [start error]
> > ...
> > psql:/usr/tmp/fdata.30284:85393: invalid command \N
> > psql:/usr/tmp/fdata.30284:85393: invalid command \N
> > psql:/usr/tmp/fdata.30284:85394: invalid command \N
> > psql:/usr/tmp/fdata.30284:85394: invalid command \N
> > psql:/upsql:/usr/tmp/ftype.30284:1: ERROR:  parser: parse error at or near "("
> > psql:/usr/tmp/ftype.30284:3: invalid command \.
> > psql:/usr/tmp/ftype.30284:4: ERROR:  parser: parse error at or near "1"
> > psql:/usr/tmp/fgroup.30284:1: ERROR:  parser: parse error at or near "("
> > psql:/usr/tmp/fgroup.30284:13694: invalid command \.
> > psql:/usr/tmp/fgroup.30284:13695: ERROR:  parser: parse error at or near "1"
> > psql:/usr/tmp/fattribute.30284:1: ERROR:  parser: parse error at or near "("
> > psql:/usr/tmp/fattribute.30284:3: invalid command \.
> > psql:/usr/tmp/fattribute.30284:4: ERROR:  parser: parse error at or near "1"
> > psql:/usr/tmp/fattribute_to_feature.30284:1: ERROR:  parser: parse error at or
> > near "("
> > psql:/usr/tmp/fattribute_to_feature.30284:139850: invalid command \.
> > psql:/usr/tmp/fattribute_to_feature.30284:139851: ERROR:  parser: parse error
> > at or near "1"
> > [stop error]
> > 
> > Any help appreciated.
> > Thanks,
> > Jan Aerts
> > 
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> > 
> > 
> > 
> > 
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> > 
> 
> 
> 
> 
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

From cain at cshl.org  Fri Mar 12 11:19:48 2004
From: cain at cshl.org (Scott Cain)
Date: Fri Mar 12 11:25:21 2004
Subject: [Bioperl-l] bp_pg_bulk_load_gff.pl: invalid command \N
In-Reply-To: <200403121442.i2CEgVtl028921@portal.open-bio.org>
References: <200403121442.i2CEgVtl028921@portal.open-bio.org>
Message-ID: <1079108387.1491.54.camel@localhost.localdomain>

Hi Jan,

What version of Postgres are you using?  At some point, the bulk loading
format for Postgres changed, and bp_pg_bulk_load_gff.pl only supports
the newer format.  I believe that this change occurred between
7.2->7.3.  If you are using an older Postgres, you should be able to
still use bp_load_gff.pl, though it will be slower.

Scott


--------------Original Message---------------
Hi all,

Has anyone encountered the following errors when trying to load a gff
file into a PostgresQL database? The bioperl version is 1.4. The gff
file looks like this:

[start gff file]
GGA10_random    Blast   Gene    1906433 1908278 1e-102  +       .      
Gene VIM ; Accession NM_003380.1
GGA10_random    Blast   Gene    1539573 1542536 0.0     +       .      
Gene GTF2E1 ; Accession NM_005513.1
[stop gff file]

[start error]
...
psql:/usr/tmp/fdata.30284:85393: invalid command \N
psql:/usr/tmp/fdata.30284:85393: invalid command \N
psql:/usr/tmp/fdata.30284:85394: invalid command \N
psql:/usr/tmp/fdata.30284:85394: invalid command \N
psql:/upsql:/usr/tmp/ftype.30284:1: ERROR:  parser: parse error at or
near "("
psql:/usr/tmp/ftype.30284:3: invalid command \.
psql:/usr/tmp/ftype.30284:4: ERROR:  parser: parse error at or near "1"
psql:/usr/tmp/fgroup.30284:1: ERROR:  parser: parse error at or near "("
psql:/usr/tmp/fgroup.30284:13694: invalid command \.
psql:/usr/tmp/fgroup.30284:13695: ERROR:  parser: parse error at or near
"1"
psql:/usr/tmp/fattribute.30284:1: ERROR:  parser: parse error at or near
"("
psql:/usr/tmp/fattribute.30284:3: invalid command \.
psql:/usr/tmp/fattribute.30284:4: ERROR:  parser: parse error at or near
"1"
psql:/usr/tmp/fattribute_to_feature.30284:1: ERROR:  parser: parse error
at or near "("
psql:/usr/tmp/fattribute_to_feature.30284:139850: invalid command \.
psql:/usr/tmp/fattribute_to_feature.30284:139851: ERROR:  parser: parse
error at or near "1"
[stop error]

Any help appreciated.
Thanks,
Jan Aerts

-- 
------------------------------------------------------------------------
Scott Cain, Ph. D.                                         cain@cshl.org
GMOD Coordinator (http://www.gmod.org/)                     216-392-3087
Cold Spring Harbor Laboratory

From Jan.Aerts at wur.nl  Fri Mar 12 11:23:35 2004
From: Jan.Aerts at wur.nl (Aerts, Jan)
Date: Fri Mar 12 11:29:11 2004
Subject: [Bioperl-l] bp_pg_bulk_load_gff.pl: invalid command \N
Message-ID: <7D030487F1A3D143A76F2A1E91F57035EF3D93@scomp0010>

I just found out that it _is_ working with 7.3.4, but not with 7.2.2. Seems like I will have to update that postgres server...

Thanks to all,

jan.


-----Original Message-----
From:	Scott Cain [mailto:cain@cshl.org]
Sent:	Fri 12-Mar-04 17:19
To:	Bioperl list; Aerts, Jan
Cc:	
Subject:	Re: [Bioperl-l] bp_pg_bulk_load_gff.pl: invalid command \N
Hi Jan,

What version of Postgres are you using?  At some point, the bulk loading
format for Postgres changed, and bp_pg_bulk_load_gff.pl only supports
the newer format.  I believe that this change occurred between
7.2->7.3.  If you are using an older Postgres, you should be able to
still use bp_load_gff.pl, though it will be slower.

Scott


--------------Original Message---------------
Hi all,

Has anyone encountered the following errors when trying to load a gff
file into a PostgresQL database? The bioperl version is 1.4. The gff
file looks like this:

[start gff file]
GGA10_random    Blast   Gene    1906433 1908278 1e-102  +       .      
Gene VIM ; Accession NM_003380.1
GGA10_random    Blast   Gene    1539573 1542536 0.0     +       .      
Gene GTF2E1 ; Accession NM_005513.1
[stop gff file]

[start error]
...
psql:/usr/tmp/fdata.30284:85393: invalid command \N
psql:/usr/tmp/fdata.30284:85393: invalid command \N
psql:/usr/tmp/fdata.30284:85394: invalid command \N
psql:/usr/tmp/fdata.30284:85394: invalid command \N
psql:/upsql:/usr/tmp/ftype.30284:1: ERROR:  parser: parse error at or
near "("
psql:/usr/tmp/ftype.30284:3: invalid command \.
psql:/usr/tmp/ftype.30284:4: ERROR:  parser: parse error at or near "1"
psql:/usr/tmp/fgroup.30284:1: ERROR:  parser: parse error at or near "("
psql:/usr/tmp/fgroup.30284:13694: invalid command \.
psql:/usr/tmp/fgroup.30284:13695: ERROR:  parser: parse error at or near
"1"
psql:/usr/tmp/fattribute.30284:1: ERROR:  parser: parse error at or near
"("
psql:/usr/tmp/fattribute.30284:3: invalid command \.
psql:/usr/tmp/fattribute.30284:4: ERROR:  parser: parse error at or near
"1"
psql:/usr/tmp/fattribute_to_feature.30284:1: ERROR:  parser: parse error
at or near "("
psql:/usr/tmp/fattribute_to_feature.30284:139850: invalid command \.
psql:/usr/tmp/fattribute_to_feature.30284:139851: ERROR:  parser: parse
error at or near "1"
[stop error]

Any help appreciated.
Thanks,
Jan Aerts

-- 
------------------------------------------------------------------------
Scott Cain, Ph. D.                                         cain@cshl.org
GMOD Coordinator (http://www.gmod.org/)                     216-392-3087
Cold Spring Harbor Laboratory


From laurichj at bioinfo.ucr.edu  Fri Mar 12 12:41:50 2004
From: laurichj at bioinfo.ucr.edu (Josh Lauricha)
Date: Fri Mar 12 12:47:24 2004
Subject: [Bioperl-l] How to display newick tree in graphics?
In-Reply-To: <20040312024941.17553.qmail@web13303.mail.yahoo.com>
References: <20040312024941.17553.qmail@web13303.mail.yahoo.com>
Message-ID: <20040312174150.GA5204@batch107a>

On Fri 03/12/04 10:49, Yuhui wrote:
> hi,
> a newbie to perl here, can someone teach me how to
> output a newick file in the form of a image on a
> webpage.

The sort version is, right now you can't, but I've got some code that
will let you, its not to well documented now and is probably buggy, but
seems to be working for me.

The problem is, Bio::Graphics is geared towards sequence objects, so
drawing a tree won't work to well. Since there was some interest in
generalizing Bio::Graphics, I've added Bio::Graphics::Image (basically
just a wraper around GD::Image (contains one, not inherits)) and
Bio::Graphics::Widget{.pm,::Tree.pm} which aims to be a generic "here's
a chunk of space, draw something" object. But neither of these have been
added to bioperl, but I've been planning on making them availible for
critique for a while, just been a little to busy to get around to it.

-- 

------------------------------------------------------
| Josh Lauricha            | Ford, your turning into |
| laurichj@bioinfo.ucr.edu | a penguin. Stop it.     |
| Bioinformatics, UCR      |                         |
|----------------------------------------------------|
From brian_osborne at cognia.com  Fri Mar 12 14:49:17 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Fri Mar 12 14:55:29 2004
Subject: [Bioperl-l] RE: [Bioperl-guts-l] [Bug 1600] New:
	$gb->species->ncbi_taxid
In-Reply-To: <40507A3C.6090208@ed.ac.uk>
Message-ID: <GAEDKMGOKFBLJPKCLKCCEEJODIAA.brian_osborne@cognia.com>

James,

Yes, I could read your patch but I'm lazy. You said:

>> create a Bio::Species object, but the genus=unknown species=marine
subspecies=gamma.

Shouldn't the values be the same for all these "species" for which the genus
is not known? Like:

Genus=unknown, species=unknown, subspecies=unknown

That way you can check, since one can no longer use "unless defined
$species_object" to see if real species information is lacking or not. Have
I missed something here?

Brian O.


-----Original Message-----
From: James Wasmuth [mailto:james.wasmuth@ed.ac.uk]
Sent: Thursday, March 11, 2004 9:40 AM
To: Brian Osborne
Cc: bioperl-guts-l@bioperl.org
Subject: Re: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid

Brian and all at bioperl-guts,


below is the comment I've added to the bug[1600].  I think it may need
some discussion, but the patch I've added works to the extent that it
allows creation of a Bio::Species object but the subsequent genus,
species, subspecies calls will be 'wrong'.  Personally I'm more
concerned with the taxid, which I think will be sufficient.

If you want to see the size of this problem go to NCBI taxonomy and
enter the term identified as a token set!  I think that maintaining the
taxid is enough, otherwise the artifical split of terms such as
**unidentified diatom endosymbiont of Peridinium foliaceum*
<http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Undef&id=42247
&lvl=3&lin=f&keep=1&srchmode=3&unlock>*
may be a problem, though some of them are intuitive.

One last question, I've never tried to fix a bug before, so I've
commited a patch as an attachment to Bugzilla for the bug.  Do others
check this and if okay place it in the code...
apologies for the newbie bit...

-james


genbank.pm

line 1123: return unless $genus and  $genus !~ /^(Unknown|None)$/oi;

a number of species are described as Unknown blah blah blah.

The NCBI taxid assigned to unknown taxa is 32644 and has a number of
synonyms, none of which are 'unknown'.

The list includes: other, unknown organism, not specified, not shown,
unspecified, Unknown, None, unclassified , unidentified organism

I've changed the _read_GenBank_Species subroutine to allow organism
names such as 'unknown marine gamma proteobacterium NOR5'.  This will
create a Bio::Species object, but the genus=unknown species=marine
subspecies=gamma.

There is a whole host of species names that ignore the nice rules in
_read_GenBank_Species!  However this fix will allow the correct taxid to
be provided which I think is more than the name!


sub _read_GenBank_Species {
    my( $self,$buffer) = @_;
    my @organell_names = ("chloroplast", "mitochondr");
     # only those carrying DNA, apart from the nucleus

    #CHANGE
     my @unkn_names=("other", 'unknown organism', 'not specified', 'not
shown', 'Unspecified', 'Unknown', 'None', 'unclassified', 'unidentified
organism');

    $_ = $$buffer;

    my( $sub_species, $species, $genus, $common, $organelle, @class,
$ns_name );
    # upon first entering the loop, we must not read a new line -- the
SOURCE
    # line is already in the buffer (HL 05/10/2000)
    while (defined($_) || defined($_ = $self->_readline())) {
    # de-HTMLify (links that may be encountered here don't contain
    # escaped '>', so a simple-minded approach suffices)
        s/<[^>]+>//g;
    if (/^SOURCE\s+(.*)/o) {
        # FIXME this is probably mostly wrong (e.g., it yields things like
        # Homo sapiens adult placenta cDNA to mRNA
        # which is certainly not what you want)
        $common = $1;
        $common =~ s/\.$//; # remove trailing dot
    } elsif (/^\s{2}ORGANISM/o) {
        my @spflds = split(' ', $_);
            ($ns_name) = $_ =~ /\w+\s+(.*)/o;
        shift(@spflds); # ORGANISM

         if(grep { $_ =~ /^$spflds[0]/i; } @organell_names) {
        $organelle = shift(@spflds);
        }
            $genus = shift(@spflds);
        if(@spflds) {
        $species = shift(@spflds);
        } elsif ( grep { $genus } @unkn_names){
        $species = '';
        } else {$species='sp.';}      #there's no species name but it
isn't unclassified
        $sub_species = shift(@spflds) if(@spflds);
        } elsif (/^\s+(.+)/o) {
        # only split on ';' or '.' so that
        # classification that is 2 words will
        # still get matched
        # use map to remove trailing/leading spaces
            push(@class, map { s/^\s+//; s/\s+$//; $_; } split /[;\.]+/,
$1);
        } else {
            last;
        }

        $_ = undef; # Empty $_ to trigger read of next line
    }

     $$buffer = $_;

     # Don't make a species object if it's empty or "Unknown" or "None"
    my $unkn = grep { $_ =~ /^$genus$species/i; } @unkn_names;

     return unless $genus and  $unkn==0;

     # Bio::Species array needs array in Species -> Kingdom direction
    if ($class[0] eq 'Viruses') {
        push( @class, $ns_name );
    }
    elsif ($class[$#class] eq $genus) {
        push( @class, $species );
    } else {
        push( @class, $genus, $species );
    }
    @class = reverse @class;

    my $make = Bio::Species->new();
    $make->classification( \@class, "FORCE" ); # no name validation please
    $make->common_name( $common      ) if $common;
    unless ($class[-1] eq 'Viruses') {
        $make->sub_species( $sub_species ) if $sub_species;
    }
    $make->organelle($organelle) if $organelle;
    return $make;
}


Brian Osborne wrote:

>James,
>
>Your guess is right, no Species is made because of the name. That's because
>genbank.pm normally looks at:
>
>ORGANISM Bos taurus
>
>And makes "Bos" the genus, and so on.
>
>If it sees:
>
>ORGANISM Unknown
>
>It refuses to make a Species object, and it's interpreting your ORGANISM
>line in the same way because it can't make a valid genus, that's the
current
>rule. Personally I'd say that I agree with its principle - how can we make
a
>Species object without genus and species?
>
>You can get the taxid from a SeqFeature object, you already knew that.
>
>Brian O.
>
>
>-----Original Message-----
>From: bioperl-guts-l-bounces@portal.open-bio.org
>[mailto:bioperl-guts-l-bounces@portal.open-bio.org]On Behalf Of
>bugzilla-daemon@portal.open-bio.org
>Sent: Thursday, March 11, 2004 4:21 AM
>To: bioperl-guts-l@bioperl.org
>Subject: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid
>
>http://bugzilla.bioperl.org/show_bug.cgi?id=1600
>
>           Summary: $gb->species->ncbi_taxid
>           Product: Bioperl
>           Version: unspecified
>          Platform: PC
>        OS/Version: Linux
>            Status: NEW
>          Severity: normal
>          Priority: P2
>         Component: Bio::SeqIO
>        AssignedTo: bioperl-guts-l@bioperl.org
>        ReportedBy: james.wasmuth@ed.ac.uk
>
>
>I've included a genbank file for which I have been unable to extract the
>ncbi_taxid for using
>
>$gb->species->ncbi_taxid
>
>the error is:
>Can't call method "ncbi_taxid" on an undefined value
>
>infact I don't get a Bio::Species object.  I'm sure its because of the
name,
>which is correct.
>
>I've tried looking into it, but could not find which Seq object creates the
>Bio::Species object.
>
>
>
>LOCUS       AY007676                1389 bp    DNA     linear   BCT
>29-OCT-2001
>DEFINITION  Unknown marine gamma proteobacterium NOR5 16S ribosomal RNA,
>            partial sequence.
>ACCESSION   AY007676
>VERSION     AY007676.1  GI:12000362
>KEYWORDS    .
>SOURCE      unknown marine gamma proteobacterium NOR5
>  ORGANISM  unknown marine gamma proteobacterium NOR5
>            Bacteria; Proteobacteria; Gammaproteobacteria.
>REFERENCE   1  (bases 1 to 1389)
>  AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Glockner,F.O., Gerdts,G.
>and
>            Amann,R.
>  TITLE     Isolation of novel pelagic bacteria from the German bight and
>their
>            seasonal contributions to surface picoplankton
>  JOURNAL   Appl. Environ. Microbiol. 67 (11), 5134-5142 (2001)
>  MEDLINE   21536174
>   PUBMED   11679337
>REFERENCE   2  (bases 1 to 1389)
>  AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
Gerdts,G.,
>            Schuett,C. and Amann,R.
>  TITLE     Identification and seasonal dominance of culturable marine
>bacteria
>  JOURNAL   Unpublished
>REFERENCE   3  (bases 1 to 1389)
>  AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
Gerdts,G.,
>            Schuett,C. and Amann,R.
>  TITLE     Direct Submission
>  JOURNAL   Submitted (29-AUG-2000) Molecular Ecology,
Max-Planck-Institute,
>            Celsiusstrasse 1, Bremen 28359, Germany
>FEATURES             Location/Qualifiers
>     source          1..1389
>                     /organism="unknown marine gamma proteobacterium NOR5"
>                     /mol_type="genomic DNA"
>                     /db_xref="taxon:145658"
>     rRNA            <1..>1389
>                     /product="16S ribosomal RNA"
>BASE COUNT      343 a    319 c    453 g    274 t
>ORIGIN
>        1 cgcgaaagta cttcggtatg agtagagcgg cggacgggtg agtaacgcgt aggaatctat
>       61 ccagtagtgg gggacaactc ggggaaactc gagctaatac cgcatacgtc ctaagggaga
>      121 aagcggggga tcttcggacc tcgcgctatt ggaggagcct gcgttggatt agctagttgg
>      181 tggggtaaag gcctaccaag gcgacgatcc atagctggtc tgagaggatg atcagccaca
>      241 ccgggactga gacacggccc ggactcctac gggaggcagc agtggggaat attgcgcaat
>      301 gggcgaaagc ctgacgcagc catgccgcgt gtgtgaagaa ggccttcggg ttgtaaagca
>      361 ctttcaattg ggaagaaagg ttagtagtta ataactgcta gctgtgacat tacctttaga
>      421 agaagcaccg gctaactccg tgccagcagc cgcggtaata cggaggtgcg agcgttaatc
>      481 ggaattactg ggcgtaaagc gcgcgtaggc ggtctgttaa gtcggatgtg aaagccccgg
>      541 gctcaacctg ggaattgcac ccgatactgg ccgactggag tgcgagagag ggaggtagaa
>      601 ttccacgtgt agcggtgaaa tgcgtagata tgtggaggaa taccggtggc gaaggcggcc
>      661 tcctggctcg acactgacgc tgaggtgcga aagcgtgggg agcaaacagg attagatacc
>      721 ctggtagtcc acgccgtaaa cgatgtctac tagccgttgg gagacttgat ttcttggtgg
>      781 cgaagttaac gcgataagta gaccgcctgg ggagtacggc cgcaaggtta aaactcaaat
>      841 gaattgacgg gggcccgcac aagcggtgga gcatgtggtt taattcgatg caacgcgaag
>      901 aaccttacca ggccttgaca tcctaggaat cctgtagaga tacgggagtg ccttcgggaa
>      961 tctagtgaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt gggttaagtc
>     1021 ccgtaacgag cgcaaccctt gtccttagtt gccagcgcgt aatggcggga actctaagga
>     1081 gactgccggt gacaaaccgg aggaaggtgg ggacgacgtc aagtcatcat ggcccttacg
>     1141 gcctgggcta cacacgtgct acaatggaac gcacagaggg cagcaaaccc gcgaggggga
>     1201 gcgaatccca caaaacgttt cgtagtccgg atcggagtct gcaactcgac tccgtgaagt
>     1261 cggaatcgct agtaatcgtg aatcagaatg tcacggtgaa tacgttcccg ggccttgtac
>     1321 acaccgcccg tcacaccatg ggagtgggtt gctccagaag tggttagcct aaccttcggg
>     1381 agggcgatc
>//
>
>
>
>------- You are receiving this mail because: -------
>You are the assignee for the bug, or are watching the assignee.
>_______________________________________________
>Bioperl-guts-l mailing list
>Bioperl-guts-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-guts-l
>
>
>
>

--
"I have not failed. I've just found 10,000 ways that don't work."
               --- Thomas Edison

Nematode Bioinformatics           ||
Blaxter Nematode Genomics Group   ||
School of Biological Sciences     ||
Ashworth Laboratories             ||
King's Buildings                  ||    tel: +44 131 650 7403
University of Edinburgh           ||    web: www.nematodes.org
Edinburgh                         ||
EH9 3JT                           ||
UK                                ||


From barry.moore at genetics.utah.edu  Fri Mar 12 14:54:22 2004
From: barry.moore at genetics.utah.edu (Barry Moore)
Date: Fri Mar 12 14:59:57 2004
Subject: [Bioperl-l] load_seqdatabase.pl
Message-ID: <4052156E.2040402@genetics.utah.edu>

I'm having some trouble with load_seqdatabase.pl.  I using BioPerl 1.4 
(with some CVS updates including bioperl-db and bioperl-live Bio::DB 
yesterday), ActiveState Perl 5.8, MySQL version 12.22  distribution 
4.0.16, Windows XP.  I've successfully created a database mysql with the 
biosqldb-mysql.sql schema downloaded yesterday.  I have a file 
RefSeq_Human_mRNA_short.gb that has RefSeq mRNA in GenBank format (I 
pasted the first record from that file below).  I run 
load_seqdatabase.pl like this:

 >perl -d load_seqdatabase.pl --dbpass ******* RefSeq_Human_mRNA_short.gb

All of the script defaults should be O.K., but I've tried setting them 
all on the command line as well.

The script opens the file, and gets sequence out of it O.K., but then 
somewhere in the eval statement that begins at line 500 it heads off to 
Bio::Root::Root, and loses the plot with this error:

Undefined subroutine &Bio::Root::Root::debug called at 
C:/Perl/site/lib/Bio/DB/BioSQL/BasePersistenceAdaptor.pm line 1526, 
<GEN1> line 413.
Bio::DB::BioSQL::BasePersistenceAdaptor::_get_driver_class('Bio::DB::Bio
SQL::SeqAdaptor=HASH(0x1d22200)','Bio::DB::BioSQL::mysql::','Driver','Bio::DB::B 

ioSQL::SeqAdaptor') called at 
C:/Perl/site/lib/Bio/DB/BioSQL/BasePersistenceAdap
tor.pm line 1507
Bio::DB::BioSQL::BasePersistenceAdaptor::dbd('Bio::DB::BioSQL::SeqAdapto
r=HASH(0x1d22200)') called at 
C:/Perl/site/lib/Bio/DB/BioSQL/BasePersistenceAdap
tor.pm line 1386
Bio::DB::BioSQL::BasePersistenceAdaptor::rollback('Bio::DB::BioSQL::SeqA
daptor=HASH(0x1d22200)') called at load_seqdatabase.pl line 524

I'm happy to try to figure this out myself, but right now I haven't a 
clue what's going on since the subroutine &Bio::Root::Root::debug seems 
to me to be where it's supposed to be.  Is this possible a OS issue 
since I'm on Windows?  Can anyone nudge me in the right direction?

Barry
-- 
Barry Moore
Dept. of Human Genetics
University of Utah
Salt Lake City, UT


LOCUS NM_000014 4577 bp mRNA linear PRI 20-DEC-2003
DEFINITION Homo sapiens alpha-2-macroglobulin (A2M), mRNA.
ACCESSION NM_000014
VERSION NM_000014.3 GI:6226959
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4577)
AUTHORS Mathew,S., Arandjelovic,S., Beyer,W.F., Gonias,S.L. and Pizzo,S.V.
TITLE Characterization of the interaction between alpha2-macroglobulin
and fibroblast growth factor-2: the role of hydrophobic
interactions
JOURNAL Biochem. J. 374 (Pt 1), 123-129 (2003)
PUBMED 12755687
REMARK GeneRIF: FGF-2 and this protein interact at specific binding sites,
involving different FGF-2 sequences.
REFERENCE 2 (bases 1 to 4577)
AUTHORS Athauda,S.B., Nishigai,M., Arakawa,H., Ikai,A., Ukai,M. and
Takahashi,K.
TITLE Inhibition of human pepsin and gastricsin by alpha2-macroglobulin
JOURNAL J Enzyme Inhib Med Chem 18 (3), 219-224 (2003)
PUBMED 14506912
REMARK GeneRIF: alpha2-macroglobulin inhibits human pepsin adn gastricsin
REFERENCE 3 (bases 1 to 4577)
AUTHORS Arandjelovic,S., Freed,T.A. and Gonias,S.L.
TITLE Growth factor-binding sequence in human alpha2-macroglobulin
targets the receptor-binding site in transforming growth
factor-beta
JOURNAL Biochemistry 42 (20), 6121-6127 (2003)
PUBMED 12755614
REMARK GeneRIF: alpha(2)M-derived peptides target the receptor-binding
sequence in TGF-beta
REFERENCE 4 (bases 1 to 4577)
AUTHORS Shibata,M., Sakai,H., Sakai,E., Okamoto,K., Nishishita,K.,
Yasuda,Y., Kato,Y. and Yamamoto,K.
TITLE Disruption of structural and functional integrity of alpha
2-macroglobulin by cathepsin E
JOURNAL Eur. J. Biochem. 270 (6), 1189-1198 (2003)
PUBMED 12631277
REMARK GeneRIF: These results suggest the possible involvement of
cathepsin E in disruption of the structural and functional
integrity of alpha 2-macroglobulin in the endolysosome system.
REFERENCE 5 (bases 1 to 4577)
AUTHORS Zappia,M., Cittadella,R., Manna,I., Nicoletti,G., Andreoli,V.,
Bonavita,S., Gambardella,A. and Quattrone,A.
TITLE Genetic association of alpha2-macroglobulin polymorphisms with AD
in southern Italy
JOURNAL Neurology 59 (5), 756-758 (2002)
PUBMED 12221172
REMARK GeneRIF: Genetic association of alpha2-macroglobulin polymorphisms
with Alzheimer's disease
REFERENCE 6 (bases 1 to 4577)
AUTHORS Ghebremedhin,E., Schultz,C., Thal,D.R., Del Tredici,K., Rueb,U. and
Braak,H.
TITLE Genetic association of argyrophilic grain disease with
polymorphisms in alpha-2 macroglobulin and low-density lipoprotein
receptor-related protein genes
JOURNAL Neuropathol Appl Neurobiol 28 (4), 308-313 (2002)
PUBMED 12175343
REMARK GeneRIF: Genetic association of argyrophilic grain disease with
polymorphisms in alpha-2 macroglobulin.
REFERENCE 7 (bases 1 to 4577)
AUTHORS Kolodziej,S.J., Wagenknecht,T., Strickland,D.K. and Stoops,J.K.
TITLE The three-dimensional structure of the human alpha 2-macroglobulin
dimer reveals its structural organization in the tetrameric native
and chymotrypsin alpha 2-macroglobulin complexes
JOURNAL J. Biol. Chem. 277 (31), 28031-28037 (2002)
PUBMED 12015318
REMARK GeneRIF: The three-dimensional structure of the dimer reveals its
structural organization in the tetrameric native and chymotrypsin
alpha 2-macroglobulin complexes.
REFERENCE 8 (bases 1 to 4577)
AUTHORS McElhinney,B., Ardill,J., Caldwell,C., Lloyd,F. and McClure,N.
TITLE Ovarian hyperstimulation syndrome and assisted reproductive
technologies: why some and not others?
JOURNAL Hum. Reprod. 17 (6), 1548-1553 (2002)
PUBMED 12042276
REMARK GeneRIF: relationship between serum VEGF levels, alpha(2)M levels
and the development of OHSS in hyperstimulated subjects undergoing
IVF
REFERENCE 9 (bases 1 to 4577)
AUTHORS Mettenburg,J.M., Webb,D.J. and Gonias,S.L.
TITLE Distinct binding sites in the structure of alpha 2-macroglobulin
mediate the interaction with beta-amyloid peptide and growth
factors
JOURNAL J. Biol. Chem. 277 (15), 13338-13345 (2002)
PUBMED 11823454
REMARK GeneRIF: distinct binding sites mediate interaction with
beta-amyloid peptide and growth factors
REFERENCE 10 (bases 1 to 4577)
AUTHORS Cvirn,G., Gallistl,S., Koestenberger,M., Kutschera,J., Leschnik,B.
and Muntean,W.
TITLE Alpha 2-macroglobulin enhances prothrombin activation and thrombin
potential by inhibiting the anticoagulant protein C/protein S
system in cord and adult plasma
JOURNAL Thromb. Res. 105 (5), 433-439 (2002)
PUBMED 12062545
REMARK GeneRIF: Alpha 2-macroglobulin enhances prothrombin activation and
thrombin potential by inhibiting the anticoagulant protein
C/protein S system in cord and adult plasma.
REFERENCE 11 (bases 1 to 4577)
AUTHORS Janka,Z., Juhasz,A., Rimanoczy,A., Boda,K., Marki-Zay,J.,
Palotas,M., Kuk,I., Zollei,M., Jakab,K. and Kalman,J.
TITLE Alpha2-macroglobulin exon 24 (Val-1000-Ile) polymorphism is not
associated with late-onset sporadic Alzheimer's dementia in the
Hungarian population
JOURNAL Psychiatr. Genet. 12 (1), 49-54 (2002)
PUBMED 11901360
REMARK GeneRIF: has an important role in the AD-specific neurodegenerative
process but its exon 24 Val-1000-Ile polymorphism is not likely to
be associated with late-onset sporadic AD in the Hungarian
population
REFERENCE 12 (bases 1 to 4577)
AUTHORS Chiabrando,G.A., Vides,M.A. and Sanchez,M.C.
TITLE Differential binding properties of human pregnancy zone protein-
and alpha2-macroglobulin-proteinase complexes to low-density
lipoprotein receptor-related protein
JOURNAL Arch. Biochem. Biophys. 398 (1), 73-78 (2002)
PUBMED 11811950
REMARK GeneRIF: Differential binding to ldl receptor related protein
REFERENCE 13 (bases 1 to 4577)
AUTHORS Toombs,C.F.
TITLE Alfimeprase: pharmacology of a novel fibrinolytic metalloproteinase
for thrombolysis
JOURNAL Haemostasis 31 (3-6), 141-147 (2001)
PUBMED 11910179
REMARK GeneRIF: REVIEW: binds and neutralizes alfimeprase, which has
direct proteolytic activity against the fibrinogen Aalpha chain
REFERENCE 14 (bases 1 to 4577)
AUTHORS Borth,W.
TITLE Alpha 2-macroglobulin, a multifunctional binding protein with
targeting characteristics
JOURNAL FASEB J. 6 (15), 3345-3353 (1992)
PUBMED 1281457
REFERENCE 15 (bases 1 to 4577)
AUTHORS Poller,W., Faber,J.P., Klobeck,G. and Olek,K.
TITLE Cloning of the human alpha 2-macroglobulin gene and detection of
mutations in two functional domains: the bait region and the
thiolester site
JOURNAL Hum. Genet. 88 (3), 313-319 (1992)
PUBMED 1370808
REFERENCE 16 (bases 1 to 4577)
AUTHORS Poller,W., Faber,J.P. and Olek,K.
TITLE Sequence polymorphism in the human alpha-2-macroglobulin (A2M) gene
JOURNAL Nucleic Acids Res. 19 (1), 198 (1991)
PUBMED 1707161
REFERENCE 17 (bases 1 to 4577)
AUTHORS Bell,G.I., Rall,L.B., Sanchez-Pescador,R., Merryweather,J.P.,
Scott,J., Eddy,R.L. and Shows,T.B.
TITLE Human alpha 2-macroglobulin gene is located on chromosome 12
JOURNAL Somat. Cell Mol. Genet. 11 (3), 285-289 (1985)
PUBMED 2408344
REFERENCE 18 (bases 1 to 4577)
AUTHORS Kan,C.C., Solomon,E., Belt,K.T., Chain,A.C., Hiorns,L.R. and Fey,G.
TITLE Nucleotide sequence of cDNA encoding human alpha 2-macroglobulin
and assignment of the chromosomal locus
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (8), 2282-2286 (1985)
PUBMED 2581245
COMMENT REVIEWED REFSEQ: This record has been curated by NCBI staff. The
reference sequence was derived from M11313.1.
On Nov 4, 1999 this sequence version replaced gi:6226762.

Summary: Alpha-2-macroglobulin is a protease inhibitor and cytokine
transporter. It inhibits many proteases, including trypsin,
thrombin and collagenase. A2M is implicated in Alzheimer disease
(AD) due to its ability to mediate the clearance and degradation of
A-beta, the major component of beta-amyloid deposits.
FEATURES Location/Qualifiers
source 1..4577
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/chromosome="12"
/map="12p13.3-p12.3"
gene 1..4577
/gene="A2M"
/db_xref="GeneID:2"
/db_xref="LocusID:2"
/db_xref="MIM:103950"
CDS 44..4468
/gene="A2M"
/note="go_function: protein carrier activity [goid
0008320] [evidence NR];
go_function: endopeptidase inhibitor activity [goid
0004866] [evidence NR];
go_function: wide-spectrum protease inhibitor activity
[goid 0017114] [evidence IEA];
go_function: serine protease inhibitor activity [goid
0004867] [evidence IEA];
go_function: alpha-2 macroglobulin [goid 0016975]
[evidence NAS];
go_process: intracellular protein transport [goid 0006886]
[evidence NR]"
/codon_start=1
/product="alpha 2 macroglobulin precursor"
/protein_id="NP_000005.1"
/db_xref="GI:4557225"
/db_xref="GeneID:2"
/db_xref="LocusID:2"
/db_xref="MIM:103950"
/translation="MGKNKLLHPSLVLLLLVLLPTDASVSGKPQYMVLVPSLLHTETT
EKGCVLLSYLNETVTVSASLESVRGNRSLFTDLEAENDVLHCVAFAVPKSSSNEEVMF
LTVQVKGPTQEFKKRTTVMVKNEDSLVFVQTDKSIYKPGQTVKFRVVSMDENFHPLNE
LIPLVYIQDPKGNRIAQWQSFQLEGGLKQFSFPLSSEPFQGSYKVVVQKKSGGRTEHP
FTVEEFVLPKFEVQVTVPKIITILEEEMNVSVCGLYTYGKPVPGHVTVSICRKYSDAS
DCHGEDSQAFCEKFSGQLNSHGCFYQQVKTKVFQLKRKEYEMKLHTEAQIQEEGTVVE
LTGRQSSEITRTITKLSFVKVDSHFRQGIPFFGQVRLVDGKGVPIPNKVIFIRGNEAN
YYSNATTDEHGLVQFSINTTNVMGTSLTVRVNYKDRSPCYGYQWVSEEHEEAHHTAYL
VFSPSKSFVHLEPMSHELPCGHTQTVQAHYILNGGTLLGLKKLSFYYLIMAKGGIVRT
GTHGLLVKQEDMKGHFSISIPVKSDIAPVARLLIYAVLPTGDVIGDSAKYDVENCLAN
KVDLSFSPSQSLPASHAHLRVTAAPQSVCALRAVDQSVLLMKPDAELSASSVYNLLPE
KDLTGFPGPLNDQDDEDCINRHNVYINGITYTPVSSTNEKDMYSFLEDMGLKAFTNSK
IRKPKMCPQLQQYEMHGPEGLRVGFYESDVMGRGHARLVHVEEPHTETVRKYFPETWI
WDLVVVNSAGVAEVGVTVPDTITEWKAGAFCLSEDAGLGISSTASLRAFQPFFVELTM
PYSVIRGEAFTLKATVLNYLPKCIRVSVQLEASPAFLAVPVEKEQAPHCICANGRQTV
SWAVTPKSLGNVNFTVSAEALESQELCGTEVPSVPEHGRKDTVIKPLLVEPEGLEKET
TFNSLLCPSGGEVSEELSLKLPPNVVEESARASVSVLGDILGSAMQNTQNLLQMPYGC
GEQNMVLFAPNIYVLDYLNETQQLTPEVKSKAIGYLNTGYQRQLNYKHYDGSYSTFGE
RYGRNQGNTWLTAFVLKTFAQARAYIFIDEAHITQALIWLSQRQKDNGCFRSSGSLLN
NAIKGGVEDEVTLSAYITIALLEIPLTVTHPVVRNALFCLESAWKTAQEGDHGSHVYT
KALLAYAFALAGNQDKRKEVLKSLNEEAVKKDNSVHWERPQKPKAPVGHFYEPQAPSA
EVEMTSYVLLAYLTAQPAPTSEDLTSATNIVKWITKQQNAQGGFSSTQDTVVALHALS
KYGAATFTRTGKAAQVTIQSSGTFSSKFQVDNNNRLLLQQVSLPELPGEYSMKVTGEG
CVYLQTSLKYNILPEKEEFPFALGVQTLPQTCDEPKAHTSFQISLSVSYTGSRSASNM
AIVDVKMVSGFIPLKPTVKMLERSNHVSRTEVSSNHVLIYLDKVSNQTLSLFFTVLQD
VPVRDLKPAIVKVYDYYETDEFAIAEYNAPCSKDLGNA"
sig_peptide 44..112
/gene="A2M"
/note="alpha-2-macroglobulin signal peptide"
misc_feature 104..4465
/gene="A2M"
/note="KOG1366; Region: Alpha-macroglobulin
[Posttranslational modification, protein turnover,
chaperones]"
/db_xref="CDD:19155"
misc_feature 110..1927
/gene="A2M"
/note="A2M_N; Region: Alpha-2-macroglobulin family
N-terminal region"
/db_xref="CDD:17056"
mat_peptide 113..4465
/gene="A2M"
/product="alpha-2-macroglobulin"
misc_feature 2216..4432
/gene="A2M"
/note="Region: Alpha-2-macroglobulin family"
/db_xref="CDD:5952"
variation 1122
/gene="A2M"
/replace="G"
/replace="A"
/db_xref="dbSNP:2229298"
variation 1282
/gene="A2M"
/replace="T"
/replace="C"
/db_xref="dbSNP:1049134"
variation 1339
/gene="A2M"
/replace="T"
/replace="C"
/db_xref="dbSNP:2228222"
variation 1354
/gene="A2M"
/replace="G"
/replace="A"
/db_xref="dbSNP:226396"
variation 1958
/gene="A2M"
/replace="G"
/replace="A"
/db_xref="dbSNP:226405"
variation 2154
/gene="A2M"
/replace="G"
/replace="A"
/db_xref="dbSNP:1800434"
variation 2431
/gene="A2M"
/replace="T"
/replace="C"
/db_xref="dbSNP:1049143"
variation 2487
/gene="A2M"
/replace="T"
/replace="A"
/db_xref="dbSNP:3180392"
variation 2487
/gene="A2M"
/replace="T"
/replace="A"
/db_xref="dbSNP:3210107"
variation 2958
/gene="A2M"
/replace="G"
/replace="A"
/db_xref="dbSNP:1800433"
variation 3041
/gene="A2M"
/replace="G"
/replace="A"
/db_xref="dbSNP:669"
variation 3988
/gene="A2M"
/replace="G"
/replace="A"
/db_xref="dbSNP:1802964"
variation 4334
/gene="A2M"
/replace="T"
/replace="C"
/db_xref="dbSNP:1802965"
variation 4474
/gene="A2M"
/replace="C"
/replace="A"
/db_xref="dbSNP:1130840"
variation 4508
/gene="A2M"
/replace="T"
/replace="C"
/db_xref="dbSNP:1049985"
variation 4511
/gene="A2M"
/replace="T"
/replace="A"
/db_xref="dbSNP:1802966"
variation 4519
/gene="A2M"
/replace="C"
/replace="A"
/db_xref="dbSNP:3190224"
ORIGIN
1 gctacaatcc atctggtctc ctccagctcc ttctttctgc aacatgggga agaacaaact
61 ccttcatcca agtctggttc ttctcctctt ggtcctcctg cccacagacg cctcagtctc
121 tggaaaaccg cagtatatgg ttctggtccc ctccctgctc cacactgaga ccactgagaa
181 gggctgtgtc cttctgagct acctgaatga gacagtgact gtaagtgctt ccttggagtc
241 tgtcagggga aacaggagcc tcttcactga cctggaggcg gagaatgacg tactccactg
301 tgtcgccttc gctgtcccaa agtcttcatc caatgaggag gtaatgttcc tcactgtcca
361 agtgaaagga ccaacccaag aatttaagaa gcggaccaca gtgatggtta agaacgagga
421 cagtctggtc tttgtccaga cagacaaatc aatctacaaa ccagggcaga cagtgaaatt
481 tcgtgttgtc tccatggatg aaaactttca ccccctgaat gagttgattc cactagtata
541 cattcaggat cccaaaggaa atcgcatcgc acaatggcag agtttccagt tagagggtgg
601 cctcaagcaa ttttcttttc ccctctcatc agagcccttc cagggctcct acaaggtggt
661 ggtacagaag aaatcaggtg gaaggacaga gcaccctttc accgtggagg aatttgttct
721 tcccaagttt gaagtacaag taacagtgcc aaagataatc accatcttgg aagaagagat
781 gaatgtatca gtgtgtggcc tatacacata tgggaagcct gtccctggac atgtgactgt
841 gagcatttgc agaaagtata gtgacgcttc cgactgccac ggtgaagatt cacaggcttt
901 ctgtgagaaa ttcagtggac agctaaacag ccatggctgc ttctatcagc aagtaaaaac
961 caaggtcttc cagctgaaga ggaaggagta tgaaatgaaa cttcacactg aggcccagat
1021 ccaagaagaa ggaacagtgg tggaattgac tggaaggcag tccagtgaaa tcacaagaac
1081 cataaccaaa ctctcatttg tgaaagtgga ctcacacttt cgacagggaa ttcccttctt
1141 tgggcaggtg cgcctagtag atgggaaagg cgtccctata ccaaataaag tcatattcat
1201 cagaggaaat gaagcaaact attactccaa tgctaccacg gatgagcatg gccttgtaca
1261 gttctctatc aacaccacca acgttatggg tacctctctt actgttaggg tcaattacaa
1321 ggatcgtagt ccctgttacg gctaccagtg ggtgtcagaa gaacacgaag aggcacatca
1381 cactgcttat cttgtgttct ccccaagcaa gagctttgtc caccttgagc ccatgtctca
1441 tgaactaccc tgtggccata ctcagacagt ccaggcacat tatattctga atggaggcac
1501 cctgctgggg ctgaagaagc tctcctttta ttatctgata atggcaaagg gaggcattgt
1561 ccgaactggg actcatggac tgcttgtgaa gcaggaagac atgaagggcc atttttccat
1621 ctcaatccct gtgaagtcag acattgctcc tgtcgctcgg ttgctcatct atgctgtttt
1681 acctaccggg gacgtgattg gggattctgc aaaatatgat gttgaaaatt gtctggccaa
1741 caaggtggat ttgagcttca gcccatcaca aagtctccca gcctcacacg cccacctgcg
1801 agtcacagcg gctcctcagt ccgtctgcgc cctccgtgct gtggaccaaa gcgtgctgct
1861 catgaagcct gatgctgagc tctcggcgtc ctcggtttac aacctgctac cagaaaagga
1921 cctcactggc ttccctgggc ctttgaatga ccaggacgat gaagactgca tcaatcgtca
1981 taatgtctat attaatggaa tcacatatac tccagtatca agtacaaatg aaaaggatat
2041 gtacagcttc ctagaggaca tgggcttaaa ggcattcacc aactcaaaga ttcgtaaacc
2101 caaaatgtgt ccacagcttc aacagtatga aatgcatgga cctgaaggtc tacgtgtagg
2161 tttttatgag tcagatgtaa tgggaagagg ccatgcacgc ctggtgcatg ttgaagagcc
2221 tcacacggag accgtacgaa agtacttccc tgagacatgg atctgggatt tggtggtggt
2281 aaactcagca ggggtggctg aggtaggagt aacagtccct gacaccatca ccgagtggaa
2341 ggcaggggcc ttctgcctgt ctgaagatgc tggacttggt atctcttcca ctgcctctct
2401 ccgagccttc cagcccttct ttgtggagct tacaatgcct tactctgtga ttcgtggaga
2461 ggccttcaca ctcaaggcca cggtcctaaa ctaccttccc aaatgcatcc gggtcagtgt
2521 gcagctggaa gcctctcccg ccttccttgc tgtcccagtg gagaaggaac aagcgcctca
2581 ctgcatctgt gcaaacgggc ggcaaactgt gtcctgggca gtaaccccaa agtcattagg
2641 aaatgtgaat ttcactgtga gcgcagaggc actagagtct caagagctgt gtgggactga
2701 ggtgccttca gttcctgaac acggaaggaa agacacagtc atcaagcctc tgttggttga
2761 acctgaagga ctagagaagg aaacaacatt caactcccta ctttgtccat caggtggtga
2821 ggtttctgaa gaattatccc tgaaactgcc accaaatgtg gtagaagaat ctgcccgagc
2881 ttctgtctca gttttgggag acatattagg ctctgccatg caaaacacac aaaatcttct
2941 ccagatgccc tatggctgtg gagagcagaa tatggtcctc tttgctccta acatctatgt
3001 actggattat ctaaatgaaa cacagcagct tactccagag gtcaagtcca aggccattgg
3061 ctatctcaac actggttacc agagacagtt gaactacaaa cactatgatg gctcctacag
3121 cacctttggg gagcgatatg gcaggaacca gggcaacacc tggctcacag cctttgttct
3181 gaagactttt gcccaagctc gagcctacat cttcatcgat gaagcacaca ttacccaagc
3241 cctcatatgg ctctcccaga ggcagaagga caatggctgt ttcaggagct ctgggtcact
3301 gctcaacaat gccataaagg gaggagtaga agatgaagtg accctctccg cctatatcac
3361 catcgccctt ctggagattc ctctcacagt cactcaccct gttgtccgca atgccctgtt
3421 ttgcctggag tcagcctgga agacagcaca agaaggggac catggcagcc atgtatatac
3481 caaagcactg ctggcctatg cttttgccct ggcaggtaac caggacaaga ggaaggaagt
3541 actcaagtca cttaatgagg aagctgtgaa gaaagacaac tctgtccatt gggagcgccc
3601 tcagaaaccc aaggcaccag tggggcattt ttacgaaccc caggctccct ctgctgaggt
3661 ggagatgaca tcctatgtgc tcctcgctta tctcacggcc cagccagccc caacctcgga
3721 ggacctgacc tctgcaacca acatcgtgaa gtggatcacg aagcagcaga atgcccaggg
3781 cggtttctcc tccacccagg acacagtggt ggctctccat gctctgtcca aatatggagc
3841 cgccacattt accaggactg ggaaggctgc acaggtgact atccagtctt cagggacatt
3901 ttccagcaaa ttccaagtgg acaacaacaa tcgcctgtta ctgcagcagg tctcattgcc
3961 agagctgcct ggggaataca gcatgaaagt gacaggagaa ggatgtgtct acctccagac
4021 ctccttgaaa tacaatattc tcccagaaaa ggaagagttc ccctttgctt taggagtgca
4081 gactctgcct caaacttgtg atgaacccaa agcccacacc agcttccaaa tctccctaag
4141 tgtcagttac acagggagcc gctctgcctc caacatggcg atcgttgatg tgaagatggt
4201 ctctggcttc attcccctga agccaacagt gaaaatgctt gaaagatcta accatgtgag
4261 ccggacagaa gtcagcagca accatgtctt gatttacctt gataaggtgt caaatcagac
4321 actgagcttg ttcttcacgg ttctgcaaga tgtcccagta agagatctca aaccagccat
4381 agtgaaagtc tatgattact acgagacgga tgagtttgca atcgctgagt acaatgctcc
4441 ttgcagcaaa gatcttggaa atgcttgaag accacaaggc tgaaaagtgc tttgctggag
4501 tcctgttctc tgagctccac agaagacacg tgtttttgta tctttaaaga cttgatgaat
4561 aaacactttt tctggtc
//


From radams at staffmail.ed.ac.uk  Fri Mar 12 16:22:44 2004
From: radams at staffmail.ed.ac.uk (Richard Adams)
Date: Fri Mar 12 16:28:19 2004
Subject: [Bioperl-l] protein networks
Message-ID: <1079126564.40522a2475877@staffmail.ed.ac.uk>

Hi Boris,
Thanks for the tips.
I've put some modules in CVS in Bio::Graph namespace.
The module representing a protein interaction network is called 
ProteinGraph.pm, but you're right, including the word "interaction" 
would clarify it. 

The base class, Nat Goodman's SimpleGraph is totally generic  for any
graph, not just biological graphs. ProteinGraph.pm uses Bio::Seq objects
to represent the nodes and only subclasses a few methods from SimpleGraph. 
I would imagine that a protein::DNA and DNA/DNA interactions could also be 
represented
using this class, using the alphabet() method to distinguish proteins from DNA.
If you think that additional data fields /methods would be needed I'd love to 
know - methods in common between the classes could then be abstracted into 
a common Bio::Graph::Base class.

I've include a basic XML parser for parsing PSI XML records. 
e.g., 
my $io = Bio::Graph::IO->new(-file => 'myfile', -format =>'psixml');
my $gr = $io->next_network();
At present it
reads the minimun infor to construct a graph but does not make an effort to
get all the data. If you know of the existence of an industrial strength
PSI-XML parser, I'd be really pleased, there must be one out there somewhere.

Richard
From allenday at ucla.edu  Fri Mar 12 17:31:17 2004
From: allenday at ucla.edu (Allen Day)
Date: Fri Mar 12 17:36:51 2004
Subject: [Bioperl-l] How to display newick tree in graphics?
In-Reply-To: <20040312174150.GA5204@batch107a>
References: <20040312024941.17553.qmail@web13303.mail.yahoo.com>
	<20040312174150.GA5204@batch107a>
Message-ID: <Pine.LNX.4.58.0403121430120.15330@sumo.genetics.ucla.edu>

this can be done using Bio::TreIO::svggraph.  this module will write SVG
images of a Bio::Tree.  there is a dependency of this module on SVG::Graph
and SVG.

-allen


On Fri, 12 Mar 2004, Josh Lauricha wrote:

> On Fri 03/12/04 10:49, Yuhui wrote:
> > hi,
> > a newbie to perl here, can someone teach me how to
> > output a newick file in the form of a image on a
> > webpage.
> 
> The sort version is, right now you can't, but I've got some code that
> will let you, its not to well documented now and is probably buggy, but
> seems to be working for me.
> 
> The problem is, Bio::Graphics is geared towards sequence objects, so
> drawing a tree won't work to well. Since there was some interest in
> generalizing Bio::Graphics, I've added Bio::Graphics::Image (basically
> just a wraper around GD::Image (contains one, not inherits)) and
> Bio::Graphics::Widget{.pm,::Tree.pm} which aims to be a generic "here's
> a chunk of space, draw something" object. But neither of these have been
> added to bioperl, but I've been planning on making them availible for
> critique for a while, just been a little to busy to get around to it.
> 
> 
From barry.moore at genetics.utah.edu  Fri Mar 12 12:24:26 2004
From: barry.moore at genetics.utah.edu (Barry Moore)
Date: Fri Mar 12 18:15:21 2004
Subject: [Bioperl-l] Parsing features
In-Reply-To: <Pine.GSO.4.40.0403120931440.17653-100000@mgcheo3>
References: <Pine.GSO.4.40.0403120931440.17653-100000@mgcheo3>
Message-ID: <4051F24A.10107@genetics.utah.edu>

Jonathon,

I looks like what you are trying to do is print your @features array as 
if it were a tidy list of features.  In fact it is a complex data 
structure that holds the values of the Generic SeqFeature object.  You 
will need to step through it and collect the goodies yourself.  (At 
least I don't think there is a prepackaged way to just "get it all" - 
although the list can correct me if I'm wrong on that.)  If you are 
after certain values of the GenBank features table, the following 
document may be of some help in getting at it.  (If the pasted document 
doesn't come to you well formatted let me know, and I can mail it 
directly to you as an attachement.)

Barry

---------------------------------------------------------------------------


                 How to get values in a GenBank file with BioPerl
 
This is a guide to accessing the value of each item in a GenBank file 
with BioPerl.  Underneath each value in the sample GenBank file there is 
a number and a 3-letter code.  The 3-letter code represents the object 
where the associated value is stored.  Table 1 at the end of this file 
gives the object for each code.  By looking up the numbers in Table 2 at 
the end of this file you will find the object where this information can 
be called from, and the method to get (and sometimes set) the value.  
Some values may be stored, and accessible by methods in more than one 
object.  There is perl code at the end that you can use as a starting 
point to get at all of the objects and their methods.
----------------------------------------------------------------------------------- 

LOCUS       NM_079543               1821 bp    mRNA    linear   INV 
10-DEC-2003
            1-BSR                   2-BPS      3-BSR   4-BSR  5-BSR 6-BSR
DEFINITION  Drosophila melanogaster alpha-Esterase-3 CG1257-PA (alpha-Est3)
            mRNA, complete cds.
            7-BPS
ACCESSION   NM_079543 XX_123456
            8-BPS     9-BSR
VERSION     NM_079543.2  GI:24644853
                      |     |
                    10-BPS  11-BSR
KEYWORDS    Esterases
            12-BSR
SOURCE      Drosophila melanogaster (fruit fly)
            13-BSC
  ORGANISM  Drosophila melanogaster
            14-BSC     15-BSC
            Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;
            Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
            Ephydroidea; Drosophilidae; Drosophila.
            16-BSC
REFERENCE   1  (bases 1 to 1821)
                    17-BAR 18-BAR
  AUTHORS   Adams,M.D. et al.(additional authors deleted for brevity)
            19-BAR
  TITLE     The genome sequence of Drosophila melanogaster
            20-BAR
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
            21-BAR
  MEDLINE   20196006
            22-BAR
   PUBMED   10731132
            23-BAR
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to 
final
            NCBI review. This record is derived from an annotated genomic
            sequence (NT_033777). The reference sequence was derived from
            CG1257-RA.
            On Nov 6, 2002 this sequence version replaced gi:17737826.
            24-BAC
FEATURES             Location/Qualifiers
     source          1..1821
                25-BLS  26-BLS
                     /organism="Drosophila melanogaster"
                     27-BSG
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="3R"
                     /note="genotype: y[1]; cn[1] bw[1] sp[1]; Rh6[1]"
     gene            1..1821
                     /gene="alpha-Est3"
                     /locus_tag="CG1257"
                     /note="alpha-Esterase-3; synonyms: B, aE3, CG1257,
                     alphaE3, alpha-Ests, fragment B;
                     go_function: carboxylesterase activity [goid 0004091]
                     [evidence NAS]"
                     /map="84D9-84D9"
                     /db_xref="FLYBASE:FBgn0015571"
                     /db_xref="GeneID:40907"
                     /db_xref="LocusID:40907"
     CDS             119..1750
                     /gene="alpha-Est3"
                     /locus_tag="CG1257"
                     /EC_number="3.1.1.1"
                     /note="alpha-Est3 gene product"
                     /codon_start=1
                     /product="alpha-Esterase-3 CG1257-PA"
                     /protein_id="NP_524267.2"
                     /db_xref="GI:24644854"
                     /db_xref="FLYBASE:FBgn0015571"
                     /db_xref="GeneID:40907"
                     /db_xref="LocusID:40907"
                     
/translation="MESLQVNTTSGPVLGKQCTGVYGDEYVSFERIPYAQPPVGHLRF
                     
MAPLPVEPWSQPLDCTKPGQKPLQFNHYSKQLEGVEDCLYLNVYAKELDSPRPLPLIV
                   
  FFFGGGFEKGDPTKELHSPDYFMMRDVVVVTVSYRVGPLGFLSLNDPAVGVPGNAGLK
                     
DQLLAMEWIKENAERFNGDPKNVTAFGESAGAASVHYLMLNPKAEGLFHKAILQSGNV
                     
LCSWALCTIKNLPHRLAVNLGMESAEHVTDAMVLDFLQKLPGEKLVRPYLLSAEEHLD
                     
DCVFQFGPMVEPYKTEHCALPNHPQELLDKAWGNRIPVLMSGTSFEGLLMYARVQMAP
                     
YLLTSLKKEPEHMLPLDVKRNLPQALARHLGQRLQETHFGGNDPSAMSPESLKAYCEY
                     
ASYKVFWHPILKTLRSRVKSSSASTYLYRFDFDSPTFNHQRLKYCGDKLRGVAHVDDH
                     
SYLWYGDFSWKLDKHTPEFLTIERMIDMLTSFARTSNPNCKLIQDQLPRAKEWKPLNS
                     KSALECLNISENIKMMELPELQKLRVWESVCQSTG"
ORIGIN     
        1 gtactatggc aaatggtact atgggcagtc gcgataataa taataaatga ggctttaaat
       61 gtttttccac tacaagataa aaagttacga gtgcgcaccg agcatttagt agacgaacat
          28-BPS
... (additional sequence deleted for brevity)
//
----------------------------------------------------------------------------------- 

Table 1. Object Key
BPS - Bio::PrimarySeq
BSR - Bio::Seq::RichSeq
BSC - Bio::Species
BAC - Bio::Annotation::Comment
BAR - Bio::Annotation::Reference
BLS ? Bio::Location::Simple
BSG - Bio::SeqFeature::Generic
----------------------------------------------------------------------------------- 

Table 2. Usage key
1.   Bio::Seq::RichSeq           $scalar = $self->display_name() or 
->display_id()
2.   Bio::PrimarySeq             $scalar = $self->length()
3.   Bio::Seq::RichSeq           $scalar = $self->molecule()
4.   Bio::PrimarySeq             $scalar = $self->is_circular
5.   Bio::Seq::RichSeq           $scalar = $self->division()
6.   Bio::seq::Richseq           @array  = $self->get_dates
7.   Bio::PrimarySeq             $scalar = $self->description()
8.   Bio::PrimarySeq             $scalar = $self->accession_number()
     Bio::Seq::RichSeq           $scalar = $self->accession()
9.   Bio::Seq::RichSeq           @array  = $self->get_secondary_accessions
10.  Bio::PrimarySeq             $scalar = $self->version()
     Bio::Seq::RichSeq           $scalar = $self->seq_version()
11.  Bio::PrimarySeq             $scalar = $self->primary_id()
12.  Bio::Seq::RichSeq           @array  = $self->get_keywords
13.  Bio::Species                $scalar = $self->common_name() 
14.  Bio::Species                $scalar = $self->genus()
15.  Bio::Species                $scalar = $self->species()
16.  Bio::Species                @array  = $self->classification()
17.  Bio::Annotation::Reference  $scalar = $self->start()
18.  Bio::Annotation::Reference  $scalar = $self->end()
19.  Bio::Annotation::Reference  $scalar = $self->authors()
20.  Bio::Annotation::Reference  $scalar = $self->title()
21.  Bio::Annotation::Reference  $scalar = $self->location()
22.  Bio::Annotation::Reference  $scalar = $self->medline()
23.  Bio::Annotation::Reference  $scalar = $self->pubmed()
24.  Bio::Annotation::Comment    $scalar = $self->text() or ->as_text
25.  Bio::SeqFeature::Generic    $scalar = $self->start()
26.  Bio::SeqFeature::Generic    $scalar = $self->end()
27.  Bio::SeqFeature::Generic
 
Get all features from the features table using a variation on the code 
below.  You will need to vary the primary_tag (e.g. source, gene, CDS) 
and the get_tag_values (e.g. organism, db_xref, chromosome, note, gene, 
locus_tag, go_function, map, EC_number, codon_start, protein_id, etc.).  
If there is more than one primary tag associated with a give value (e.g. 
more than one gene) then you will have to step through the @source_feats 
array to find what you want rather than simply shifting the first scalar 
off the top.
 
my @source_feats = grep { $_->primary_tag eq 'source' } 
$seq->get_SeqFeatures();
my $source_feat = shift @source_feats;
my @mol_type = $source_feat->get_tag_values('mol_type');
 
28.   Bio::PrimarySeq   $scalar =   $self->seq()
----------------------------------------------------------------------------------- 

#!/usr/bin/perl
 
use strict;
use warnings;
use Bio::SeqIO;
use Bio::DB::GenBank; #use Bio::DB::GenPept or Bio::DB::RefSeq if needed
 
#Get some sequence IDs either like below, or read in from a file.  Note 
that
#this sample script works with the accession numbers below (at least at 
the time
#it was written).  If you add different accession numbers, and you get 
errors,
#you may be calling for something that the sequence doesn't have.  
You'll have
#to add your own error trapping code to handle that.
my @ids = ('U59228', 'AB039327', 'BC035972');
 
#Create the GenBank database object to read from the database.
my $gb = new Bio::DB::GenBank();
 
#Create a sequence stream to pass the sequences from the database to the 
program.
my $seqio = $gb->get_Stream_by_id(\@ids);
 
#Loop over all of the sequences that you requested.
while (my $seq = $seqio->next_seq) {
 
  #Here is how you get methods directly from the RichSeq object.  Replace
  #'display_name' with any other method in Table 2. that can be called on
  #either the RichSeq object directly, or the PrimarySeq object which it 
has
  #inherited.
  print $seq->display_name,"\n";
 
  #Here is how to access the classification data from the species object.
  my $species = $seq->species;
  print $species->common_name,"\n";
  my @class = $species->classification;
  print "@class\n";
 
  #Here is a general way to call things that are stored as a 
Bio::SeqFeature::
  #Generic object.  Replace 'source' with any other of the "major" 
headings in
  #the feature table (e.g gene, CDS, etc.) and replace 'organism' with 
any of
  #the tag values found under that heading (mol_type, locus_tag, gene, 
etc.)
  my @source_feats = grep { $_->primary_tag eq 'source' } 
$seq->get_SeqFeatures();
  my $source_feat = shift @source_feats;
  my @mol_type = $source_feat->get_tag_values('mol_type');
  print "@mol_type\n";
 
  #Here is a general way to call things that are stored as some type of a
  #Bio::Annotation oject.  This includes reference information, and 
comments.
  #Replace reference with 'comment' to get the comment, and replace
  #$ref->authors with $ref->title (or location, medline, etc.) to get other
  #reference categories
  my $ann = $seq->annotation();
  my @references = ($ann->get_Annotations('reference'));
  my $ref = shift @references;
  my ($title, $authors, $location, $pubmed, $reference);
  if (defined $ref) {
    $authors = $ref->authors;
    print "$authors\n";
  }
  print "\n";
}
---------------------------------------------------------------------
This file is without copyright.  It may be reproduced, edited, and 
redistributed at will without notice to the author, however I would 
appreciate it if any corrections or improvements were sent to 
barry.moore@genetics.utah.edu

From cjm at fruitfly.org  Fri Mar 12 20:22:18 2004
From: cjm at fruitfly.org (Chris Mungall)
Date: Fri Mar 12 20:27:51 2004
Subject: [Bioperl-l] new GFF3 support methods added
In-Reply-To: <A3999380-719E-11D8-910D-000A959EB4C4@gmx.net>
Message-ID: <Pine.LNX.4.33.0403121720030.20795-100000@heartbroken.lbl.gov>


On Mon, 8 Mar 2004, Hilmar Lapp wrote:

>
> On Monday, March 8, 2004, at 01:07  PM, Chris Mungall wrote:
>
> > Perhaps it would be better to have a Bio::SeqFeature::Tools::IDHandler
> > class? This would contain methods
> >
> >   generate_unique_persistent_id($feat) # uses $feat->seq_id
> >   generate_unique_persistent_id($feat, $seq_id)
> >
> >   create_hierarchy_from_ParentIDs($featholder);
> >   set_ParentIDs_from_hierarchy($featholder);
> >
> > Any preference? I'm leaving them as is for now if that's ok, but I
> > have no
> > objections to moving everything to a seperate class if that's prefered.
> >
>
> I like your suggestion. Keeps FeatureHolderI lighter, and is easier to
> change behaviour. But maybe it's just me who likes this better.

Well, no one else seems to care much. I've created

  Bio::SeqFeature::Tools::IDHandler

which takes care of these methods

keeping the core interfaces light

Cheers
Chris

> 	-hilmar

From hlapp at gnf.org  Fri Mar 12 21:01:01 2004
From: hlapp at gnf.org (Hilmar Lapp)
Date: Fri Mar 12 21:06:41 2004
Subject: [Bioperl-l] load_seqdatabase.pl
In-Reply-To: <4052156E.2040402@genetics.utah.edu>
Message-ID: <471DCDBA-7492-11D8-86B3-000A959EB4C4@gnf.org>

I have seen this before I believe from someone on a Windows machine.  
The problem could be some trouble with the dynamic loading of modules.

The -d option to perl invokes the debugger. Do you see the same problem  
without invoking the debugger?

Did the bioperl-db tests pass?

Root::debug() is invoked all over the place many many times. By  
default, you won't see any of that since $obj->verbose() governs  
whether the messages are printed or suppressed. You can try to pass  
--debug to load_seqdatabase.pl and see when it crashes then. (All  
messages that you see in addition after enabling --debug will have been  
routed through Root::debug.)

	-hilmar

On Friday, March 12, 2004, at 11:54  AM, Barry Moore wrote:

> I'm having some trouble with load_seqdatabase.pl.  I using BioPerl 1.4  
> (with some CVS updates including bioperl-db and bioperl-live Bio::DB  
> yesterday), ActiveState Perl 5.8, MySQL version 12.22  distribution  
> 4.0.16, Windows XP.  I've successfully created a database mysql with  
> the biosqldb-mysql.sql schema downloaded yesterday.  I have a file  
> RefSeq_Human_mRNA_short.gb that has RefSeq mRNA in GenBank format (I  
> pasted the first record from that file below).  I run  
> load_seqdatabase.pl like this:
>
> >perl -d load_seqdatabase.pl --dbpass *******  
> RefSeq_Human_mRNA_short.gb
>
> All of the script defaults should be O.K., but I've tried setting them  
> all on the command line as well.
>
> The script opens the file, and gets sequence out of it O.K., but then  
> somewhere in the eval statement that begins at line 500 it heads off  
> to Bio::Root::Root, and loses the plot with this error:
>
> Undefined subroutine &Bio::Root::Root::debug called at  
> C:/Perl/site/lib/Bio/DB/BioSQL/BasePersistenceAdaptor.pm line 1526,  
> <GEN1> line 413.
> Bio::DB::BioSQL::BasePersistenceAdaptor::_get_driver_class('Bio::DB::Bi 
> o
> SQL::SeqAdaptor=HASH(0x1d22200)','Bio::DB::BioSQL::mysql::','Driver','B 
> io::DB::B
> ioSQL::SeqAdaptor') called at  
> C:/Perl/site/lib/Bio/DB/BioSQL/BasePersistenceAdap
> tor.pm line 1507
> Bio::DB::BioSQL::BasePersistenceAdaptor::dbd('Bio::DB::BioSQL::SeqAdapt 
> o
> r=HASH(0x1d22200)') called at  
> C:/Perl/site/lib/Bio/DB/BioSQL/BasePersistenceAdap
> tor.pm line 1386
> Bio::DB::BioSQL::BasePersistenceAdaptor::rollback('Bio::DB::BioSQL::Seq 
> A
> daptor=HASH(0x1d22200)') called at load_seqdatabase.pl line 524
>
> I'm happy to try to figure this out myself, but right now I haven't a  
> clue what's going on since the subroutine &Bio::Root::Root::debug  
> seems to me to be where it's supposed to be.  Is this possible a OS  
> issue since I'm on Windows?  Can anyone nudge me in the right  
> direction?
>
> Barry
> -- 
> Barry Moore
> Dept. of Human Genetics
> University of Utah
> Salt Lake City, UT
>
>
> LOCUS NM_000014 4577 bp mRNA linear PRI 20-DEC-2003
> DEFINITION Homo sapiens alpha-2-macroglobulin (A2M), mRNA.
> ACCESSION NM_000014
> VERSION NM_000014.3 GI:6226959
> KEYWORDS .
> SOURCE Homo sapiens (human)
> ORGANISM Homo sapiens
> Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
> Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
> REFERENCE 1 (bases 1 to 4577)
> AUTHORS Mathew,S., Arandjelovic,S., Beyer,W.F., Gonias,S.L. and  
> Pizzo,S.V.
> TITLE Characterization of the interaction between alpha2-macroglobulin
> and fibroblast growth factor-2: the role of hydrophobic
> interactions
> JOURNAL Biochem. J. 374 (Pt 1), 123-129 (2003)
> PUBMED 12755687
> REMARK GeneRIF: FGF-2 and this protein interact at specific binding  
> sites,
> involving different FGF-2 sequences.
> REFERENCE 2 (bases 1 to 4577)
> AUTHORS Athauda,S.B., Nishigai,M., Arakawa,H., Ikai,A., Ukai,M. and
> Takahashi,K.
> TITLE Inhibition of human pepsin and gastricsin by alpha2-macroglobulin
> JOURNAL J Enzyme Inhib Med Chem 18 (3), 219-224 (2003)
> PUBMED 14506912
> REMARK GeneRIF: alpha2-macroglobulin inhibits human pepsin adn  
> gastricsin
> REFERENCE 3 (bases 1 to 4577)
> AUTHORS Arandjelovic,S., Freed,T.A. and Gonias,S.L.
> TITLE Growth factor-binding sequence in human alpha2-macroglobulin
> targets the receptor-binding site in transforming growth
> factor-beta
> JOURNAL Biochemistry 42 (20), 6121-6127 (2003)
> PUBMED 12755614
> REMARK GeneRIF: alpha(2)M-derived peptides target the receptor-binding
> sequence in TGF-beta
> REFERENCE 4 (bases 1 to 4577)
> AUTHORS Shibata,M., Sakai,H., Sakai,E., Okamoto,K., Nishishita,K.,
> Yasuda,Y., Kato,Y. and Yamamoto,K.
> TITLE Disruption of structural and functional integrity of alpha
> 2-macroglobulin by cathepsin E
> JOURNAL Eur. J. Biochem. 270 (6), 1189-1198 (2003)
> PUBMED 12631277
> REMARK GeneRIF: These results suggest the possible involvement of
> cathepsin E in disruption of the structural and functional
> integrity of alpha 2-macroglobulin in the endolysosome system.
> REFERENCE 5 (bases 1 to 4577)
> AUTHORS Zappia,M., Cittadella,R., Manna,I., Nicoletti,G., Andreoli,V.,
> Bonavita,S., Gambardella,A. and Quattrone,A.
> TITLE Genetic association of alpha2-macroglobulin polymorphisms with AD
> in southern Italy
> JOURNAL Neurology 59 (5), 756-758 (2002)
> PUBMED 12221172
> REMARK GeneRIF: Genetic association of alpha2-macroglobulin  
> polymorphisms
> with Alzheimer's disease
> REFERENCE 6 (bases 1 to 4577)
> AUTHORS Ghebremedhin,E., Schultz,C., Thal,D.R., Del Tredici,K.,  
> Rueb,U. and
> Braak,H.
> TITLE Genetic association of argyrophilic grain disease with
> polymorphisms in alpha-2 macroglobulin and low-density lipoprotein
> receptor-related protein genes
> JOURNAL Neuropathol Appl Neurobiol 28 (4), 308-313 (2002)
> PUBMED 12175343
> REMARK GeneRIF: Genetic association of argyrophilic grain disease with
> polymorphisms in alpha-2 macroglobulin.
> REFERENCE 7 (bases 1 to 4577)
> AUTHORS Kolodziej,S.J., Wagenknecht,T., Strickland,D.K. and Stoops,J.K.
> TITLE The three-dimensional structure of the human alpha  
> 2-macroglobulin
> dimer reveals its structural organization in the tetrameric native
> and chymotrypsin alpha 2-macroglobulin complexes
> JOURNAL J. Biol. Chem. 277 (31), 28031-28037 (2002)
> PUBMED 12015318
> REMARK GeneRIF: The three-dimensional structure of the dimer reveals  
> its
> structural organization in the tetrameric native and chymotrypsin
> alpha 2-macroglobulin complexes.
> REFERENCE 8 (bases 1 to 4577)
> AUTHORS McElhinney,B., Ardill,J., Caldwell,C., Lloyd,F. and McClure,N.
> TITLE Ovarian hyperstimulation syndrome and assisted reproductive
> technologies: why some and not others?
> JOURNAL Hum. Reprod. 17 (6), 1548-1553 (2002)
> PUBMED 12042276
> REMARK GeneRIF: relationship between serum VEGF levels, alpha(2)M  
> levels
> and the development of OHSS in hyperstimulated subjects undergoing
> IVF
> REFERENCE 9 (bases 1 to 4577)
> AUTHORS Mettenburg,J.M., Webb,D.J. and Gonias,S.L.
> TITLE Distinct binding sites in the structure of alpha 2-macroglobulin
> mediate the interaction with beta-amyloid peptide and growth
> factors
> JOURNAL J. Biol. Chem. 277 (15), 13338-13345 (2002)
> PUBMED 11823454
> REMARK GeneRIF: distinct binding sites mediate interaction with
> beta-amyloid peptide and growth factors
> REFERENCE 10 (bases 1 to 4577)
> AUTHORS Cvirn,G., Gallistl,S., Koestenberger,M., Kutschera,J.,  
> Leschnik,B.
> and Muntean,W.
> TITLE Alpha 2-macroglobulin enhances prothrombin activation and  
> thrombin
> potential by inhibiting the anticoagulant protein C/protein S
> system in cord and adult plasma
> JOURNAL Thromb. Res. 105 (5), 433-439 (2002)
> PUBMED 12062545
> REMARK GeneRIF: Alpha 2-macroglobulin enhances prothrombin activation  
> and
> thrombin potential by inhibiting the anticoagulant protein
> C/protein S system in cord and adult plasma.
> REFERENCE 11 (bases 1 to 4577)
> AUTHORS Janka,Z., Juhasz,A., Rimanoczy,A., Boda,K., Marki-Zay,J.,
> Palotas,M., Kuk,I., Zollei,M., Jakab,K. and Kalman,J.
> TITLE Alpha2-macroglobulin exon 24 (Val-1000-Ile) polymorphism is not
> associated with late-onset sporadic Alzheimer's dementia in the
> Hungarian population
> JOURNAL Psychiatr. Genet. 12 (1), 49-54 (2002)
> PUBMED 11901360
> REMARK GeneRIF: has an important role in the AD-specific  
> neurodegenerative
> process but its exon 24 Val-1000-Ile polymorphism is not likely to
> be associated with late-onset sporadic AD in the Hungarian
> population
> REFERENCE 12 (bases 1 to 4577)
> AUTHORS Chiabrando,G.A., Vides,M.A. and Sanchez,M.C.
> TITLE Differential binding properties of human pregnancy zone protein-
> and alpha2-macroglobulin-proteinase complexes to low-density
> lipoprotein receptor-related protein
> JOURNAL Arch. Biochem. Biophys. 398 (1), 73-78 (2002)
> PUBMED 11811950
> REMARK GeneRIF: Differential binding to ldl receptor related protein
> REFERENCE 13 (bases 1 to 4577)
> AUTHORS Toombs,C.F.
> TITLE Alfimeprase: pharmacology of a novel fibrinolytic  
> metalloproteinase
> for thrombolysis
> JOURNAL Haemostasis 31 (3-6), 141-147 (2001)
> PUBMED 11910179
> REMARK GeneRIF: REVIEW: binds and neutralizes alfimeprase, which has
> direct proteolytic activity against the fibrinogen Aalpha chain
> REFERENCE 14 (bases 1 to 4577)
> AUTHORS Borth,W.
> TITLE Alpha 2-macroglobulin, a multifunctional binding protein with
> targeting characteristics
> JOURNAL FASEB J. 6 (15), 3345-3353 (1992)
> PUBMED 1281457
> REFERENCE 15 (bases 1 to 4577)
> AUTHORS Poller,W., Faber,J.P., Klobeck,G. and Olek,K.
> TITLE Cloning of the human alpha 2-macroglobulin gene and detection of
> mutations in two functional domains: the bait region and the
> thiolester site
> JOURNAL Hum. Genet. 88 (3), 313-319 (1992)
> PUBMED 1370808
> REFERENCE 16 (bases 1 to 4577)
> AUTHORS Poller,W., Faber,J.P. and Olek,K.
> TITLE Sequence polymorphism in the human alpha-2-macroglobulin (A2M)  
> gene
> JOURNAL Nucleic Acids Res. 19 (1), 198 (1991)
> PUBMED 1707161
> REFERENCE 17 (bases 1 to 4577)
> AUTHORS Bell,G.I., Rall,L.B., Sanchez-Pescador,R., Merryweather,J.P.,
> Scott,J., Eddy,R.L. and Shows,T.B.
> TITLE Human alpha 2-macroglobulin gene is located on chromosome 12
> JOURNAL Somat. Cell Mol. Genet. 11 (3), 285-289 (1985)
> PUBMED 2408344
> REFERENCE 18 (bases 1 to 4577)
> AUTHORS Kan,C.C., Solomon,E., Belt,K.T., Chain,A.C., Hiorns,L.R. and  
> Fey,G.
> TITLE Nucleotide sequence of cDNA encoding human alpha 2-macroglobulin
> and assignment of the chromosomal locus
> JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (8), 2282-2286 (1985)
> PUBMED 2581245
> COMMENT REVIEWED REFSEQ: This record has been curated by NCBI staff.  
> The
> reference sequence was derived from M11313.1.
> On Nov 4, 1999 this sequence version replaced gi:6226762.
>
> Summary: Alpha-2-macroglobulin is a protease inhibitor and cytokine
> transporter. It inhibits many proteases, including trypsin,
> thrombin and collagenase. A2M is implicated in Alzheimer disease
> (AD) due to its ability to mediate the clearance and degradation of
> A-beta, the major component of beta-amyloid deposits.
> FEATURES Location/Qualifiers
> source 1..4577
> /organism="Homo sapiens"
> /mol_type="mRNA"
> /db_xref="taxon:9606"
> /chromosome="12"
> /map="12p13.3-p12.3"
> gene 1..4577
> /gene="A2M"
> /db_xref="GeneID:2"
> /db_xref="LocusID:2"
> /db_xref="MIM:103950"
> CDS 44..4468
> /gene="A2M"
> /note="go_function: protein carrier activity [goid
> 0008320] [evidence NR];
> go_function: endopeptidase inhibitor activity [goid
> 0004866] [evidence NR];
> go_function: wide-spectrum protease inhibitor activity
> [goid 0017114] [evidence IEA];
> go_function: serine protease inhibitor activity [goid
> 0004867] [evidence IEA];
> go_function: alpha-2 macroglobulin [goid 0016975]
> [evidence NAS];
> go_process: intracellular protein transport [goid 0006886]
> [evidence NR]"
> /codon_start=1
> /product="alpha 2 macroglobulin precursor"
> /protein_id="NP_000005.1"
> /db_xref="GI:4557225"
> /db_xref="GeneID:2"
> /db_xref="LocusID:2"
> /db_xref="MIM:103950"
> /translation="MGKNKLLHPSLVLLLLVLLPTDASVSGKPQYMVLVPSLLHTETT
> EKGCVLLSYLNETVTVSASLESVRGNRSLFTDLEAENDVLHCVAFAVPKSSSNEEVMF
> LTVQVKGPTQEFKKRTTVMVKNEDSLVFVQTDKSIYKPGQTVKFRVVSMDENFHPLNE
> LIPLVYIQDPKGNRIAQWQSFQLEGGLKQFSFPLSSEPFQGSYKVVVQKKSGGRTEHP
> FTVEEFVLPKFEVQVTVPKIITILEEEMNVSVCGLYTYGKPVPGHVTVSICRKYSDAS
> DCHGEDSQAFCEKFSGQLNSHGCFYQQVKTKVFQLKRKEYEMKLHTEAQIQEEGTVVE
> LTGRQSSEITRTITKLSFVKVDSHFRQGIPFFGQVRLVDGKGVPIPNKVIFIRGNEAN
> YYSNATTDEHGLVQFSINTTNVMGTSLTVRVNYKDRSPCYGYQWVSEEHEEAHHTAYL
> VFSPSKSFVHLEPMSHELPCGHTQTVQAHYILNGGTLLGLKKLSFYYLIMAKGGIVRT
> GTHGLLVKQEDMKGHFSISIPVKSDIAPVARLLIYAVLPTGDVIGDSAKYDVENCLAN
> KVDLSFSPSQSLPASHAHLRVTAAPQSVCALRAVDQSVLLMKPDAELSASSVYNLLPE
> KDLTGFPGPLNDQDDEDCINRHNVYINGITYTPVSSTNEKDMYSFLEDMGLKAFTNSK
> IRKPKMCPQLQQYEMHGPEGLRVGFYESDVMGRGHARLVHVEEPHTETVRKYFPETWI
> WDLVVVNSAGVAEVGVTVPDTITEWKAGAFCLSEDAGLGISSTASLRAFQPFFVELTM
> PYSVIRGEAFTLKATVLNYLPKCIRVSVQLEASPAFLAVPVEKEQAPHCICANGRQTV
> SWAVTPKSLGNVNFTVSAEALESQELCGTEVPSVPEHGRKDTVIKPLLVEPEGLEKET
> TFNSLLCPSGGEVSEELSLKLPPNVVEESARASVSVLGDILGSAMQNTQNLLQMPYGC
> GEQNMVLFAPNIYVLDYLNETQQLTPEVKSKAIGYLNTGYQRQLNYKHYDGSYSTFGE
> RYGRNQGNTWLTAFVLKTFAQARAYIFIDEAHITQALIWLSQRQKDNGCFRSSGSLLN
> NAIKGGVEDEVTLSAYITIALLEIPLTVTHPVVRNALFCLESAWKTAQEGDHGSHVYT
> KALLAYAFALAGNQDKRKEVLKSLNEEAVKKDNSVHWERPQKPKAPVGHFYEPQAPSA
> EVEMTSYVLLAYLTAQPAPTSEDLTSATNIVKWITKQQNAQGGFSSTQDTVVALHALS
> KYGAATFTRTGKAAQVTIQSSGTFSSKFQVDNNNRLLLQQVSLPELPGEYSMKVTGEG
> CVYLQTSLKYNILPEKEEFPFALGVQTLPQTCDEPKAHTSFQISLSVSYTGSRSASNM
> AIVDVKMVSGFIPLKPTVKMLERSNHVSRTEVSSNHVLIYLDKVSNQTLSLFFTVLQD
> VPVRDLKPAIVKVYDYYETDEFAIAEYNAPCSKDLGNA"
> sig_peptide 44..112
> /gene="A2M"
> /note="alpha-2-macroglobulin signal peptide"
> misc_feature 104..4465
> /gene="A2M"
> /note="KOG1366; Region: Alpha-macroglobulin
> [Posttranslational modification, protein turnover,
> chaperones]"
> /db_xref="CDD:19155"
> misc_feature 110..1927
> /gene="A2M"
> /note="A2M_N; Region: Alpha-2-macroglobulin family
> N-terminal region"
> /db_xref="CDD:17056"
> mat_peptide 113..4465
> /gene="A2M"
> /product="alpha-2-macroglobulin"
> misc_feature 2216..4432
> /gene="A2M"
> /note="Region: Alpha-2-macroglobulin family"
> /db_xref="CDD:5952"
> variation 1122
> /gene="A2M"
> /replace="G"
> /replace="A"
> /db_xref="dbSNP:2229298"
> variation 1282
> /gene="A2M"
> /replace="T"
> /replace="C"
> /db_xref="dbSNP:1049134"
> variation 1339
> /gene="A2M"
> /replace="T"
> /replace="C"
> /db_xref="dbSNP:2228222"
> variation 1354
> /gene="A2M"
> /replace="G"
> /replace="A"
> /db_xref="dbSNP:226396"
> variation 1958
> /gene="A2M"
> /replace="G"
> /replace="A"
> /db_xref="dbSNP:226405"
> variation 2154
> /gene="A2M"
> /replace="G"
> /replace="A"
> /db_xref="dbSNP:1800434"
> variation 2431
> /gene="A2M"
> /replace="T"
> /replace="C"
> /db_xref="dbSNP:1049143"
> variation 2487
> /gene="A2M"
> /replace="T"
> /replace="A"
> /db_xref="dbSNP:3180392"
> variation 2487
> /gene="A2M"
> /replace="T"
> /replace="A"
> /db_xref="dbSNP:3210107"
> variation 2958
> /gene="A2M"
> /replace="G"
> /replace="A"
> /db_xref="dbSNP:1800433"
> variation 3041
> /gene="A2M"
> /replace="G"
> /replace="A"
> /db_xref="dbSNP:669"
> variation 3988
> /gene="A2M"
> /replace="G"
> /replace="A"
> /db_xref="dbSNP:1802964"
> variation 4334
> /gene="A2M"
> /replace="T"
> /replace="C"
> /db_xref="dbSNP:1802965"
> variation 4474
> /gene="A2M"
> /replace="C"
> /replace="A"
> /db_xref="dbSNP:1130840"
> variation 4508
> /gene="A2M"
> /replace="T"
> /replace="C"
> /db_xref="dbSNP:1049985"
> variation 4511
> /gene="A2M"
> /replace="T"
> /replace="A"
> /db_xref="dbSNP:1802966"
> variation 4519
> /gene="A2M"
> /replace="C"
> /replace="A"
> /db_xref="dbSNP:3190224"
> ORIGIN
> 1 gctacaatcc atctggtctc ctccagctcc ttctttctgc aacatgggga agaacaaact
> 61 ccttcatcca agtctggttc ttctcctctt ggtcctcctg cccacagacg cctcagtctc
> 121 tggaaaaccg cagtatatgg ttctggtccc ctccctgctc cacactgaga ccactgagaa
> 181 gggctgtgtc cttctgagct acctgaatga gacagtgact gtaagtgctt ccttggagtc
> 241 tgtcagggga aacaggagcc tcttcactga cctggaggcg gagaatgacg tactccactg
> 301 tgtcgccttc gctgtcccaa agtcttcatc caatgaggag gtaatgttcc tcactgtcca
> 361 agtgaaagga ccaacccaag aatttaagaa gcggaccaca gtgatggtta agaacgagga
> 421 cagtctggtc tttgtccaga cagacaaatc aatctacaaa ccagggcaga cagtgaaatt
> 481 tcgtgttgtc tccatggatg aaaactttca ccccctgaat gagttgattc cactagtata
> 541 cattcaggat cccaaaggaa atcgcatcgc acaatggcag agtttccagt tagagggtgg
> 601 cctcaagcaa ttttcttttc ccctctcatc agagcccttc cagggctcct acaaggtggt
> 661 ggtacagaag aaatcaggtg gaaggacaga gcaccctttc accgtggagg aatttgttct
> 721 tcccaagttt gaagtacaag taacagtgcc aaagataatc accatcttgg aagaagagat
> 781 gaatgtatca gtgtgtggcc tatacacata tgggaagcct gtccctggac atgtgactgt
> 841 gagcatttgc agaaagtata gtgacgcttc cgactgccac ggtgaagatt cacaggcttt
> 901 ctgtgagaaa ttcagtggac agctaaacag ccatggctgc ttctatcagc aagtaaaaac
> 961 caaggtcttc cagctgaaga ggaaggagta tgaaatgaaa cttcacactg aggcccagat
> 1021 ccaagaagaa ggaacagtgg tggaattgac tggaaggcag tccagtgaaa tcacaagaac
> 1081 cataaccaaa ctctcatttg tgaaagtgga ctcacacttt cgacagggaa ttcccttctt
> 1141 tgggcaggtg cgcctagtag atgggaaagg cgtccctata ccaaataaag tcatattcat
> 1201 cagaggaaat gaagcaaact attactccaa tgctaccacg gatgagcatg gccttgtaca
> 1261 gttctctatc aacaccacca acgttatggg tacctctctt actgttaggg tcaattacaa
> 1321 ggatcgtagt ccctgttacg gctaccagtg ggtgtcagaa gaacacgaag aggcacatca
> 1381 cactgcttat cttgtgttct ccccaagcaa gagctttgtc caccttgagc ccatgtctca
> 1441 tgaactaccc tgtggccata ctcagacagt ccaggcacat tatattctga atggaggcac
> 1501 cctgctgggg ctgaagaagc tctcctttta ttatctgata atggcaaagg gaggcattgt
> 1561 ccgaactggg actcatggac tgcttgtgaa gcaggaagac atgaagggcc atttttccat
> 1621 ctcaatccct gtgaagtcag acattgctcc tgtcgctcgg ttgctcatct atgctgtttt
> 1681 acctaccggg gacgtgattg gggattctgc aaaatatgat gttgaaaatt gtctggccaa
> 1741 caaggtggat ttgagcttca gcccatcaca aagtctccca gcctcacacg cccacctgcg
> 1801 agtcacagcg gctcctcagt ccgtctgcgc cctccgtgct gtggaccaaa gcgtgctgct
> 1861 catgaagcct gatgctgagc tctcggcgtc ctcggtttac aacctgctac cagaaaagga
> 1921 cctcactggc ttccctgggc ctttgaatga ccaggacgat gaagactgca tcaatcgtca
> 1981 taatgtctat attaatggaa tcacatatac tccagtatca agtacaaatg aaaaggatat
> 2041 gtacagcttc ctagaggaca tgggcttaaa ggcattcacc aactcaaaga ttcgtaaacc
> 2101 caaaatgtgt ccacagcttc aacagtatga aatgcatgga cctgaaggtc tacgtgtagg
> 2161 tttttatgag tcagatgtaa tgggaagagg ccatgcacgc ctggtgcatg ttgaagagcc
> 2221 tcacacggag accgtacgaa agtacttccc tgagacatgg atctgggatt tggtggtggt
> 2281 aaactcagca ggggtggctg aggtaggagt aacagtccct gacaccatca ccgagtggaa
> 2341 ggcaggggcc ttctgcctgt ctgaagatgc tggacttggt atctcttcca ctgcctctct
> 2401 ccgagccttc cagcccttct ttgtggagct tacaatgcct tactctgtga ttcgtggaga
> 2461 ggccttcaca ctcaaggcca cggtcctaaa ctaccttccc aaatgcatcc gggtcagtgt
> 2521 gcagctggaa gcctctcccg ccttccttgc tgtcccagtg gagaaggaac aagcgcctca
> 2581 ctgcatctgt gcaaacgggc ggcaaactgt gtcctgggca gtaaccccaa agtcattagg
> 2641 aaatgtgaat ttcactgtga gcgcagaggc actagagtct caagagctgt gtgggactga
> 2701 ggtgccttca gttcctgaac acggaaggaa agacacagtc atcaagcctc tgttggttga
> 2761 acctgaagga ctagagaagg aaacaacatt caactcccta ctttgtccat caggtggtga
> 2821 ggtttctgaa gaattatccc tgaaactgcc accaaatgtg gtagaagaat ctgcccgagc
> 2881 ttctgtctca gttttgggag acatattagg ctctgccatg caaaacacac aaaatcttct
> 2941 ccagatgccc tatggctgtg gagagcagaa tatggtcctc tttgctccta acatctatgt
> 3001 actggattat ctaaatgaaa cacagcagct tactccagag gtcaagtcca aggccattgg
> 3061 ctatctcaac actggttacc agagacagtt gaactacaaa cactatgatg gctcctacag
> 3121 cacctttggg gagcgatatg gcaggaacca gggcaacacc tggctcacag cctttgttct
> 3181 gaagactttt gcccaagctc gagcctacat cttcatcgat gaagcacaca ttacccaagc
> 3241 cctcatatgg ctctcccaga ggcagaagga caatggctgt ttcaggagct ctgggtcact
> 3301 gctcaacaat gccataaagg gaggagtaga agatgaagtg accctctccg cctatatcac
> 3361 catcgccctt ctggagattc ctctcacagt cactcaccct gttgtccgca atgccctgtt
> 3421 ttgcctggag tcagcctgga agacagcaca agaaggggac catggcagcc atgtatatac
> 3481 caaagcactg ctggcctatg cttttgccct ggcaggtaac caggacaaga ggaaggaagt
> 3541 actcaagtca cttaatgagg aagctgtgaa gaaagacaac tctgtccatt gggagcgccc
> 3601 tcagaaaccc aaggcaccag tggggcattt ttacgaaccc caggctccct ctgctgaggt
> 3661 ggagatgaca tcctatgtgc tcctcgctta tctcacggcc cagccagccc caacctcgga
> 3721 ggacctgacc tctgcaacca acatcgtgaa gtggatcacg aagcagcaga atgcccaggg
> 3781 cggtttctcc tccacccagg acacagtggt ggctctccat gctctgtcca aatatggagc
> 3841 cgccacattt accaggactg ggaaggctgc acaggtgact atccagtctt cagggacatt
> 3901 ttccagcaaa ttccaagtgg acaacaacaa tcgcctgtta ctgcagcagg tctcattgcc
> 3961 agagctgcct ggggaataca gcatgaaagt gacaggagaa ggatgtgtct acctccagac
> 4021 ctccttgaaa tacaatattc tcccagaaaa ggaagagttc ccctttgctt taggagtgca
> 4081 gactctgcct caaacttgtg atgaacccaa agcccacacc agcttccaaa tctccctaag
> 4141 tgtcagttac acagggagcc gctctgcctc caacatggcg atcgttgatg tgaagatggt
> 4201 ctctggcttc attcccctga agccaacagt gaaaatgctt gaaagatcta accatgtgag
> 4261 ccggacagaa gtcagcagca accatgtctt gatttacctt gataaggtgt caaatcagac
> 4321 actgagcttg ttcttcacgg ttctgcaaga tgtcccagta agagatctca aaccagccat
> 4381 agtgaaagtc tatgattact acgagacgga tgagtttgca atcgctgagt acaatgctcc
> 4441 ttgcagcaaa gatcttggaa atgcttgaag accacaaggc tgaaaagtgc tttgctggag
> 4501 tcctgttctc tgagctccac agaagacacg tgtttttgta tctttaaaga cttgatgaat
> 4561 aaacactttt tctggtc
> //
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From vesko_baev at abv.bg  Sat Mar 13 07:05:03 2004
From: vesko_baev at abv.bg (Vesko Baev)
Date: Sat Mar 13 09:57:36 2004
Subject: [Bioperl-l] RNA analysis
Message-ID: <1226365086.1079179503958.JavaMail.nobody@app2.ni.bg>

Is there a bioperl module(s) for RNA analysis: folding, free energy, RNA/RNA align or modules that connect the script with ext. RNA progs?

Thenks!
Vesko

-----------------------------------------------------------------
http://zavedenia.gbg.bg - ???? ?? ?????? ???? ??????
From jason at cgt.duhs.duke.edu  Sat Mar 13 10:18:36 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Sat Mar 13 10:24:17 2004
Subject: [Bioperl-l] RNA analysis
In-Reply-To: <1226365086.1079179503958.JavaMail.nobody@app2.ni.bg>
References: <1226365086.1079179503958.JavaMail.nobody@app2.ni.bg>
Message-ID: <Pine.LNX.4.50.0403131016520.32004-100000@tenero.duhs.duke.edu>

See ViennaRNA package from Ivo Hofacker's group and their RNAFold.pl
script - there is a nice XS perl interface to the C library.

http://www.tbi.univie.ac.at/~ivo/RNA/

-jason

On Sat, 13 Mar 2004, Vesko Baev wrote:

> Is there a bioperl module(s) for RNA analysis: folding, free energy,
> RNA/RNA align or modules that connect the script with ext. RNA progs?
>
> Thenks!
> Vesko
>
> -----------------------------------------------------------------
> http://zavedenia.gbg.bg - ���� �� ������ ���� �����?

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu

From james.wasmuth at ed.ac.uk  Sat Mar 13 10:34:01 2004
From: james.wasmuth at ed.ac.uk (James Wasmuth)
Date: Sat Mar 13 10:39:45 2004
Subject: [Bioperl-l] Re: [Bioperl-guts-l] [Bug 1600] New:
	$gb->species->ncbi_taxid
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCEEJODIAA.brian_osborne@cognia.com>
References: <GAEDKMGOKFBLJPKCLKCCEEJODIAA.brian_osborne@cognia.com>
Message-ID: <405329E9.4000805@ed.ac.uk>

Brian,

>Yes, I could read your patch but I'm lazy. You said:
>  
>
>>>create a Bio::Species object, but the genus=unknown species=marine
>>>      
>>>
>subspecies=gamma.
>
I was highlighting the problem my patch would create.  I hadn't thought 
too hard about its consequences, but realise that it may have some 
knock-on effects.

>
>Shouldn't the values be the same for all these "species" for which the genus
>is not known? Like:
>
>Genus=unknown, species=unknown, subspecies=unknown
>
>That way you can check, since one can no longer use "unless defined
>$species_object" to see if real species information is lacking or not. Have
>I missed something here?
>

NCBI taxonomy considers the term 'unknown marine gamma proteobacterium 
NOR5' to represent a species, though for this example there remaining 
taxonomy classification is awarded no rank until 'class'. 

So one possible fix, would be:

if (ORGANISM ne "synonym for taxid 32644")    {
    then add rest of name into $species.
}

therefore $genus = unknown  and $species = marine gamma proteobacterium NOR5

other problems are organisms such as "leaf litter basidiomycete sp. 
Collb2-39".   Currently $genus = leaf, $species = litter and $subsp = 
basidiomycete.  Perhaps $subsp should contain everything left over?  
Thoughts?  Also does anyone know off hand if it copes with 'varietas' 
and 'var.' for plants?  I expect not.  I  will have a look at the  
genbank.pm on Monday and suggest a patch.  I expect these issues will 
also be pertinent in embl.pm and other database format modules...


hmmmm, anyone with thoughts?

-james


>
>Brian O.
>
>
>-----Original Message-----
>From: James Wasmuth [mailto:james.wasmuth@ed.ac.uk]
>Sent: Thursday, March 11, 2004 9:40 AM
>To: Brian Osborne
>Cc: bioperl-guts-l@bioperl.org
>Subject: Re: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid
>
>Brian and all at bioperl-guts,
>
>
>below is the comment I've added to the bug[1600].  I think it may need
>some discussion, but the patch I've added works to the extent that it
>allows creation of a Bio::Species object but the subsequent genus,
>species, subspecies calls will be 'wrong'.  Personally I'm more
>concerned with the taxid, which I think will be sufficient.
>
>If you want to see the size of this problem go to NCBI taxonomy and
>enter the term identified as a token set!  I think that maintaining the
>taxid is enough, otherwise the artifical split of terms such as
>**unidentified diatom endosymbiont of Peridinium foliaceum*
><http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Undef&id=42247
>&lvl=3&lin=f&keep=1&srchmode=3&unlock>*
>may be a problem, though some of them are intuitive.
>
>One last question, I've never tried to fix a bug before, so I've
>commited a patch as an attachment to Bugzilla for the bug.  Do others
>check this and if okay place it in the code...
>apologies for the newbie bit...
>
>-james
>
>
>
>genbank.pm
>
>line 1123: return unless $genus and  $genus !~ /^(Unknown|None)$/oi;
>
>a number of species are described as Unknown blah blah blah.
>
>The NCBI taxid assigned to unknown taxa is 32644 and has a number of
>synonyms, none of which are 'unknown'.
>
>The list includes: other, unknown organism, not specified, not shown,
>unspecified, Unknown, None, unclassified , unidentified organism
>
>I've changed the _read_GenBank_Species subroutine to allow organism
>names such as 'unknown marine gamma proteobacterium NOR5'.  This will
>create a Bio::Species object, but the genus=unknown species=marine
>subspecies=gamma.
>
>There is a whole host of species names that ignore the nice rules in
>_read_GenBank_Species!  However this fix will allow the correct taxid to
>be provided which I think is more than the name!
>
>
>
>sub _read_GenBank_Species {
>    my( $self,$buffer) = @_;
>    my @organell_names = ("chloroplast", "mitochondr");
>     # only those carrying DNA, apart from the nucleus
>
>    #CHANGE
>     my @unkn_names=("other", 'unknown organism', 'not specified', 'not
>shown', 'Unspecified', 'Unknown', 'None', 'unclassified', 'unidentified
>organism');
>
>    $_ = $$buffer;
>
>    my( $sub_species, $species, $genus, $common, $organelle, @class,
>$ns_name );
>    # upon first entering the loop, we must not read a new line -- the
>SOURCE
>    # line is already in the buffer (HL 05/10/2000)
>    while (defined($_) || defined($_ = $self->_readline())) {
>    # de-HTMLify (links that may be encountered here don't contain
>    # escaped '>', so a simple-minded approach suffices)
>        s/<[^>]+>//g;
>    if (/^SOURCE\s+(.*)/o) {
>        # FIXME this is probably mostly wrong (e.g., it yields things like
>        # Homo sapiens adult placenta cDNA to mRNA
>        # which is certainly not what you want)
>        $common = $1;
>        $common =~ s/\.$//; # remove trailing dot
>    } elsif (/^\s{2}ORGANISM/o) {
>        my @spflds = split(' ', $_);
>            ($ns_name) = $_ =~ /\w+\s+(.*)/o;
>        shift(@spflds); # ORGANISM
>
>         if(grep { $_ =~ /^$spflds[0]/i; } @organell_names) {
>        $organelle = shift(@spflds);
>        }
>            $genus = shift(@spflds);
>        if(@spflds) {
>        $species = shift(@spflds);
>        } elsif ( grep { $genus } @unkn_names){
>        $species = '';
>        } else {$species='sp.';}      #there's no species name but it
>isn't unclassified
>        $sub_species = shift(@spflds) if(@spflds);
>        } elsif (/^\s+(.+)/o) {
>        # only split on ';' or '.' so that
>        # classification that is 2 words will
>        # still get matched
>        # use map to remove trailing/leading spaces
>            push(@class, map { s/^\s+//; s/\s+$//; $_; } split /[;\.]+/,
>$1);
>        } else {
>            last;
>        }
>
>        $_ = undef; # Empty $_ to trigger read of next line
>    }
>
>     $$buffer = $_;
>
>     # Don't make a species object if it's empty or "Unknown" or "None"
>    my $unkn = grep { $_ =~ /^$genus$species/i; } @unkn_names;
>
>     return unless $genus and  $unkn==0;
>
>     # Bio::Species array needs array in Species -> Kingdom direction
>    if ($class[0] eq 'Viruses') {
>        push( @class, $ns_name );
>    }
>    elsif ($class[$#class] eq $genus) {
>        push( @class, $species );
>    } else {
>        push( @class, $genus, $species );
>    }
>    @class = reverse @class;
>
>    my $make = Bio::Species->new();
>    $make->classification( \@class, "FORCE" ); # no name validation please
>    $make->common_name( $common      ) if $common;
>    unless ($class[-1] eq 'Viruses') {
>        $make->sub_species( $sub_species ) if $sub_species;
>    }
>    $make->organelle($organelle) if $organelle;
>    return $make;
>}
>
>
>
>
>Brian Osborne wrote:
>
>  
>
>>James,
>>
>>Your guess is right, no Species is made because of the name. That's because
>>genbank.pm normally looks at:
>>
>>ORGANISM Bos taurus
>>
>>And makes "Bos" the genus, and so on.
>>
>>If it sees:
>>
>>ORGANISM Unknown
>>
>>It refuses to make a Species object, and it's interpreting your ORGANISM
>>line in the same way because it can't make a valid genus, that's the
>>    
>>
>current
>  
>
>>rule. Personally I'd say that I agree with its principle - how can we make
>>    
>>
>a
>  
>
>>Species object without genus and species?
>>
>>You can get the taxid from a SeqFeature object, you already knew that.
>>
>>Brian O.
>>
>>
>>-----Original Message-----
>>From: bioperl-guts-l-bounces@portal.open-bio.org
>>[mailto:bioperl-guts-l-bounces@portal.open-bio.org]On Behalf Of
>>bugzilla-daemon@portal.open-bio.org
>>Sent: Thursday, March 11, 2004 4:21 AM
>>To: bioperl-guts-l@bioperl.org
>>Subject: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid
>>
>>http://bugzilla.bioperl.org/show_bug.cgi?id=1600
>>
>>          Summary: $gb->species->ncbi_taxid
>>          Product: Bioperl
>>          Version: unspecified
>>         Platform: PC
>>       OS/Version: Linux
>>           Status: NEW
>>         Severity: normal
>>         Priority: P2
>>        Component: Bio::SeqIO
>>       AssignedTo: bioperl-guts-l@bioperl.org
>>       ReportedBy: james.wasmuth@ed.ac.uk
>>
>>
>>I've included a genbank file for which I have been unable to extract the
>>ncbi_taxid for using
>>
>>$gb->species->ncbi_taxid
>>
>>the error is:
>>Can't call method "ncbi_taxid" on an undefined value
>>
>>infact I don't get a Bio::Species object.  I'm sure its because of the
>>    
>>
>name,
>  
>
>>which is correct.
>>
>>I've tried looking into it, but could not find which Seq object creates the
>>Bio::Species object.
>>
>>
>>
>>LOCUS       AY007676                1389 bp    DNA     linear   BCT
>>29-OCT-2001
>>DEFINITION  Unknown marine gamma proteobacterium NOR5 16S ribosomal RNA,
>>           partial sequence.
>>ACCESSION   AY007676
>>VERSION     AY007676.1  GI:12000362
>>KEYWORDS    .
>>SOURCE      unknown marine gamma proteobacterium NOR5
>> ORGANISM  unknown marine gamma proteobacterium NOR5
>>           Bacteria; Proteobacteria; Gammaproteobacteria.
>>REFERENCE   1  (bases 1 to 1389)
>> AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Glockner,F.O., Gerdts,G.
>>and
>>           Amann,R.
>> TITLE     Isolation of novel pelagic bacteria from the German bight and
>>their
>>           seasonal contributions to surface picoplankton
>> JOURNAL   Appl. Environ. Microbiol. 67 (11), 5134-5142 (2001)
>> MEDLINE   21536174
>>  PUBMED   11679337
>>REFERENCE   2  (bases 1 to 1389)
>> AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
>>    
>>
>Gerdts,G.,
>  
>
>>           Schuett,C. and Amann,R.
>> TITLE     Identification and seasonal dominance of culturable marine
>>bacteria
>> JOURNAL   Unpublished
>>REFERENCE   3  (bases 1 to 1389)
>> AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
>>    
>>
>Gerdts,G.,
>  
>
>>           Schuett,C. and Amann,R.
>> TITLE     Direct Submission
>> JOURNAL   Submitted (29-AUG-2000) Molecular Ecology,
>>    
>>
>Max-Planck-Institute,
>  
>
>>           Celsiusstrasse 1, Bremen 28359, Germany
>>FEATURES             Location/Qualifiers
>>    source          1..1389
>>                    /organism="unknown marine gamma proteobacterium NOR5"
>>                    /mol_type="genomic DNA"
>>                    /db_xref="taxon:145658"
>>    rRNA            <1..>1389
>>                    /product="16S ribosomal RNA"
>>BASE COUNT      343 a    319 c    453 g    274 t
>>ORIGIN
>>       1 cgcgaaagta cttcggtatg agtagagcgg cggacgggtg agtaacgcgt aggaatctat
>>      61 ccagtagtgg gggacaactc ggggaaactc gagctaatac cgcatacgtc ctaagggaga
>>     121 aagcggggga tcttcggacc tcgcgctatt ggaggagcct gcgttggatt agctagttgg
>>     181 tggggtaaag gcctaccaag gcgacgatcc atagctggtc tgagaggatg atcagccaca
>>     241 ccgggactga gacacggccc ggactcctac gggaggcagc agtggggaat attgcgcaat
>>     301 gggcgaaagc ctgacgcagc catgccgcgt gtgtgaagaa ggccttcggg ttgtaaagca
>>     361 ctttcaattg ggaagaaagg ttagtagtta ataactgcta gctgtgacat tacctttaga
>>     421 agaagcaccg gctaactccg tgccagcagc cgcggtaata cggaggtgcg agcgttaatc
>>     481 ggaattactg ggcgtaaagc gcgcgtaggc ggtctgttaa gtcggatgtg aaagccccgg
>>     541 gctcaacctg ggaattgcac ccgatactgg ccgactggag tgcgagagag ggaggtagaa
>>     601 ttccacgtgt agcggtgaaa tgcgtagata tgtggaggaa taccggtggc gaaggcggcc
>>     661 tcctggctcg acactgacgc tgaggtgcga aagcgtgggg agcaaacagg attagatacc
>>     721 ctggtagtcc acgccgtaaa cgatgtctac tagccgttgg gagacttgat ttcttggtgg
>>     781 cgaagttaac gcgataagta gaccgcctgg ggagtacggc cgcaaggtta aaactcaaat
>>     841 gaattgacgg gggcccgcac aagcggtgga gcatgtggtt taattcgatg caacgcgaag
>>     901 aaccttacca ggccttgaca tcctaggaat cctgtagaga tacgggagtg ccttcgggaa
>>     961 tctagtgaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt gggttaagtc
>>    1021 ccgtaacgag cgcaaccctt gtccttagtt gccagcgcgt aatggcggga actctaagga
>>    1081 gactgccggt gacaaaccgg aggaaggtgg ggacgacgtc aagtcatcat ggcccttacg
>>    1141 gcctgggcta cacacgtgct acaatggaac gcacagaggg cagcaaaccc gcgaggggga
>>    1201 gcgaatccca caaaacgttt cgtagtccgg atcggagtct gcaactcgac tccgtgaagt
>>    1261 cggaatcgct agtaatcgtg aatcagaatg tcacggtgaa tacgttcccg ggccttgtac
>>    1321 acaccgcccg tcacaccatg ggagtgggtt gctccagaag tggttagcct aaccttcggg
>>    1381 agggcgatc
>>//
>>
>>
>>
>>------- You are receiving this mail because: -------
>>You are the assignee for the bug, or are watching the assignee.
>>_______________________________________________
>>Bioperl-guts-l mailing list
>>Bioperl-guts-l@portal.open-bio.org
>>http://portal.open-bio.org/mailman/listinfo/bioperl-guts-l
>>
>>
>>
>>
>>    
>>
>
>--
>"I have not failed. I've just found 10,000 ways that don't work."
>               --- Thomas Edison
>
>Nematode Bioinformatics           ||
>Blaxter Nematode Genomics Group   ||
>School of Biological Sciences     ||
>Ashworth Laboratories             ||
>King's Buildings                  ||    tel: +44 131 650 7403
>University of Edinburgh           ||    web: www.nematodes.org
>Edinburgh                         ||
>EH9 3JT                           ||
>UK                                ||
>
>
>  
>

From MAILER-DAEMON at fehu.pair.com  Mon Mar 15 04:17:06 2004
From: MAILER-DAEMON at fehu.pair.com (MAILER-DAEMON@fehu.pair.com)
Date: Mon Mar 15 04:22:34 2004
Subject: [Bioperl-l] failure notice
Message-ID: <200403150922.i2F9MWtk024574@portal.open-bio.org>

Hi. This is the qmail-send program at fehu.pair.com.
I'm afraid I wasn't able to deliver your message to the following addresses.
This is a permanent error; I've given up. Sorry it didn't work out.

<comments@rocketaware.com>:
Sorry, no matching mailbox.  Use http://mibsoftware.com/contact.php

--- Below this line is a copy of the message.

Return-Path: <bioperl-l@bioperl.org>
Received: (qmail 64741 invoked from network); 15 Mar 2004 09:17:05 -0000
Received: from unknown (HELO rocketaware.com) (155.253.6.254)
  by fehu.pair.com with SMTP; 15 Mar 2004 09:17:05 -0000
From: bioperl-l@bioperl.org
To: comments@rocketaware.com
Subject: read it immediately
Date: Mon, 15 Mar 2004 10:18:30 +0100
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="82584271"

--82584271
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit

greetings

--82584271
Content-Type: application/octet-stream; name="note.txt.scr"
Content-Transfer-Encoding: base64
Content-Disposition: attachment; filename="note.txt.scr"

TVqQAAMAAAAEAAAA//8AALgAAAAAAAAAQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAgAAAAA4fug4AtAnNIbgBTM0hVGhpcyBwcm9ncmFtIGNhbm5vdCBiZSBydW4gaW4g
RE9TIG1vZGUuDQ0KJAAAAAAAAABQRQAATAEDAFn0MEAAAAAAAAAAAOAADwILAQI4AFAAAAAQ
AAAAQAEA0JABAABQAQAAoAEAAABAAAAQAAAAAgAABAAAAAAAAAAEAAAAAAAAAACwAQAAEAAA
AAAAAAIAAAAAABAAABAAAAAAEAAAEAAAAAAAABAAAAAAAAAAAAAAAGStAQCAAQAAAKABAGQN
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAFVQ
WDAAAAAAAEABAAAQAAAAAAAAAAIAAAAAAAAAAAAAAAAAAIAAAOBVUFgxAAAAAABQAAAAUAEA
AEQAAAACAAAAAAAAAAAAAAAAAABAAADgLnJzcmMAAAAAEAAAAKABAAAQAAAARgAAAAAAAAAA
AAAAAAAAQAAAwDEuMjQAVVBYIQwJAglrSdS+0oUytzh2AQCwQAAAAKQAACYFADf/////VYvs
i0UMVleLfQgz0jPJM/aAPwB0KVNqAVsr34ldCIr3/+3/H4D7LnUMiAwCi1UgyQPX6wWIXAYB
QUZHJ/v/bXd14VsYgGQPAI1GAV9eXcOLRCQIU0xv/3+7fCQQTYH6AAgAAH06D7YIhcl0WcHA
dbr//7ckV147znwLihwGiB9HRjvxfvWAfAE+RH97+98EdATGBy5HQuvIL0ABA0gY67yAJwDb
7+5uVVvDo4HsGEtTVzPbuf8uAP//7v8zwI296ff//4id6AVqEPOrZqtaqlKNRexTUIlVf/v/
/+joBQAiDIs9SGFAAIPEDGY5XRBmxxoCAHYF/3W+u7v9EOscaCCLGGgUBP8VTCM7w3QGZm1v
7d8NCOsEajX/1yIxiUXuGlAnm3v7Pvj/C/B1FxQrVCVbcG1rKlwAARgnAgHt2yFbKVgQJmr9
WOl+WvPb/wJdav7r9lZo3xGs/9eAjeoB/O6uu9tYhZsBadcI7BKNhfQFmW4zt1ANne5kCAnw
3/520wby6F7+BFmL8FmDxn51FIic3u/XvjXuRlBJhDVKtNcJa7cF9178CFApBVNVJvZPNvdW
UOybXHUFavxb6+Xe2m/uNWAPKvxqBFC/I5xoBhBnnbvdBFfHEOgDqTDWHGiHu4O9BRcQ6FBc
UFNoz2yDjG0SGFdkEQpoY3+h3T1MJxtGavvrmYvYIGz/N/43i8NeX1vJw1aLdBeLxldpwBAQ
BABQ8tb+D55ki/hZhf90JxUUBAIA/g0R2mpttrCF9n4Pi8eLzkaj/d3ZMAUbSXX1DggHux8N
dwwQsWUPt4ACfFFo//tqjUj/viaJTfjrA4sEHNt7c7xuWH5TsBH8jboa948Zun/D/v1WO08C
di+Nn/wLVnu82DbWBlONfE1TBxRhYxk7Oot8JGkDgznr+FtGdb2FwHSjjMmFGMe2a7iApeie
AIlqPyZZkcPdbjgOiYt16h1A/JKxLdx+g2X8AKp7RgaK063Az75Itx/4DAgJChYDx7tt7Ws6
SQMeMPTGfegoXip9aexQDIoIhDY9vslA/zdu2sfwQe+L0QrB6QLzpYvKg+EDpt1v7fOkiykJ
AU30A/lzA8E+vbb70oBnHEf/RfRD68DLVfcDe/vfJqy9WXQVEICkBee917ZvjV0tjXwwE45E
BI/KwtttZj0ddRLx+HQNxfgwWCucQ4fBxWZ7MzvXBhAYVAJhqQh1B/wZ4X9H8QVgg33sAA+E
AwEZ/VduZ5szB+FIFpAABqa3G03Gg2p0bgQKdAyU0m39dS0APTWNR4VQof2HzeZ4AEoIUfiS
APG924yF/NkIKumNjSb37bc1f4MQUS0lvGtHCllZbuGbazcm8P8swLVBAnVZgg7IyPbrU1wK
/us9drsE5zJDnzk3fSj13cw1y/BEUHNweY235mbuhAgEqmtfamHsdAYuzLoadQg7QzgMEjwL
N22tAMdH6/QjCKgLp2V0qlk2QBwAX26ymvhXyAEOwKr91tJroBkJD4ZosFxBAPsXlu5DrKAU
al91Lv81MICXs4BNQs8vLxq/WSlhMEMfswMupVKstVmBa4UXYBsnA+3SdQQNrkc7EH4uuP6D
/ghX99C5Dm6M0f7B77FA+5dK3/fbjTTeiR+RGsMbl755I/Ez89rB690EtYAf9u25djPDQhQZ
FwZaAYs0FtbeNjjB6CfwGcYjwSAWYW4ZOQSF7sYwLe+CuVEiLBJPD4VB/rvlDI7hUnQZxfgj
+TP7t8KP/CM8vcdCTnXnn/xbXQan1gmhdJ/RraVyqeG/gAdWyZBg4GAwtNoDVgL+vGIbFkag
7+v86/3BO8YwyNZsB/VWJAJA5bbUeC2Izf8mfQwssNme0f4HyWoeSsCHbMdqi+pqLguQFr7d
gizgCczHJFBLAwTT3cEGd8pQu8QKAAWWrmm6BY3GA5jImi81G7TGCUKFJbX8nCduX9cKzAee
F8e8jGABts3dtjAQzgKgViOWVgnSbAba5ggFpAunI4hdV7bNDdYQqDraA6wUaK5siwiorQm2
5mtLR4QheNyuArq9B2INfh7k0wWKXTr27dcNVlaQHlZdKJue6zqANil4jPuZ5ddQGVpQHHUI
fBiy3XZLITkMdBwljO3WEWtfUFCbAUXrvge8Que6+LLwSJCQMh32bLgsHZABAhKUFLQIDlaY
4bYgeJkWddFdtlY14AUGL+RvLs9nQ9sH5itYXegB6gH0NyucbGvs4JKJBBozZzaveLsIgdIG
L5QF7V7r7mpY3H+4bR+D7BAz8DEVlC2BffDP3nr37wdyCAfaB3YGXvDUB2bP8gFyT5rHxgYM
E/IBAPb2HxeWriP2Cuzw8kpEwRqt3f7gCcHhBQvBDQwLGJ3w37YGD0GNFwYP+mbR6TmjG2sI
HQgayb8IhEe7Eb5XVgCrhcMywkLCfIv8u4W7uXOD+vj7IPH4idZm7rah14syOQh0LeQeZzAb
6B34KwXrzz0KJTs4KKY7qh8dWsJkIwsDBO3egaFW86mKkYRlGEwkLvCuG0ARikVwAYPiveIE
/93Rl7AEC9aDJAGKkiGIUQF+GmVZluaKUAECDwIGL7TvZxzrArI9KwIlcn4OioL92bdAIuA/
ioAasD2IQQP5DV0wmF2BQdSc2VBy5GZuB9iYBtyU4JBHjhw55IzoiOyEpIDkyJEjqHyseLB0
jhw5crRwuGy8aMBkyJEjR8RgyFzMWBOfxujQVFiceqX4Y2ezmYZP/hOYi43qF8pCkQIxoAPI
99m2yI5vL/F5AvfeA/QGBgA6jCU/JAB1MQz0j+9eNcm5UGF9BblMi4pqPJlfDd56K+4HUleZ
x/5QUYzP83QLqVAE+vjw8m6e7kJshaAM9vTUaCQoxPrwqL4cYSa+VmOJwckUNfhFGi1cBtw8
FwvFI4p0QuN0Pfzt7aC7xYVcbJR0BWtzgCbbZXurdOsDfNspOXQrJPQwO9xHL42HQApOOFK7
5JBzNexBUtNGjRa2N3EEfO08+AIQbG+pdvuZWff5OQt9Bz1qkUS9twEXfU5ooKAvGGaBzRiD
+M1xZN/obeWDfMMKBnUDsAHDQ+LF6jLAw3dpIWzg7912IgknMWoJYKEQgsiKBO1SY29GBD5G
ITtXct5NcxZ5AvcsCDAoIPHdhbWk/m40lWyBQPj/OXPbWZtZhyQhg/oPD478cqkXXXq+Hnhg
/A+hdzoVdG8NbrwFq7Mlul8MXLUTFmgTOmsxWFXMS2pQT9jZWXdcalllCn4g2ZEnZ5oEXPsk
gkRekgNsHxRizZIl6XV4c6Qb2WrhEqcYOtRXwpsle2e4SCQcK9mnofIHB73461CpSA7JyQ0G
pP7OlAy2HxQJH/vgJeGVSIGf6BSZ9z20erDExl1B9F3ANbfQ2UWWgmQmsLztxnexu9b5IoA8
H0AdU0cNWWps/Dv4fPLrEnsfeYZkQ0PnO7Q5va/T+DYMbTZMsL9I63ZAUbZq0TTm7/T4tMDO
9nUh/j6szPhvWMA+2IpTDvQjIKFHF7ZyddaDECC4c42DEMGF25owd6IADwRLZcu7pczWinRe
4QSS2Pkg2d0kViEoelkTc5uvOW4vLi9qEBhHtHLAoWom3iBfBdkuQv8wXNyuGJO6xVoiJMhq
GbuwAaXSagZzSW7a9C094Gep12AF+IBDNsCjBb5W7wHv2Qbs0xoDFQT4OHMOeRAWMPfEqXbp
BGjrNIxUna1079v7HTQSja5d+kiXDLNZhId1GoEbps6AvziOAV5XGsb2CIcx2BfWP3CXvC3s
dcYsFxG7BxB0eAHjU6kPDW3YV+FfwatZNmOw0pCy1oXqBrG2wQwlzE4B9Gz2Yvbhimx8ErFD
ZrZmKlRFRI4jZm4GHhfbx1iFeshgrpIMXEjEFtgZbB9Yww8AGcq9FdAFQCZ5JbwFTICcCGQI
VwVJDiADEv4ECOxskkRZEVmcQg4AcroEdQQnz77mA2ETODcMx8cA4SLjBCQwJFNibJGcMLAo
SUoGZCAc6AwYkCYUy9iAZCAL9grBC1kczBVuKxNqE5hXZUzGlrzUjHY4XQLkZazQjFsCOSDN
sVNXZP3MZP3yQHghEAlk/WWckBdk/XyMkM4ZLFuA9i8EexPfEosUlTQSUolVCDCwyU8w4AkE
aHSMf11pQ6Q4dQcQJ+ucyWbuBWgQBm6MpJTcYiWMILyLgEwhvCNYILHySgpwSQGjRTvNpRZ0
WPoEMKGXLJ516xVwHBCgm5sNDWklqGC5IvZPcAh0FWpIV1o5a+TeGJ6kHFxDP0YohC8nT1nn
P2EJB+S0izPb/ewJR/ICrItTE4VlmlpUNGotQA7G0y6ki0gkiZA9XFmK1DcglOsCM8An/39T
B3nBigY8IHQEPAl1A0br8w++/9YRvwZWKK1FDQ0OjQS/Ro18dakP9EHQ6+Vni1RVM8ltqyss
KhGrMwrYxgpT37v4IEGB+QAOfOiAogcANrm5/d64CUncVsIA8LhdQWuFVgSSskWHg/4beooU
MJIUgPogdQ84VDAx3Q/2e4iUDSzrBwhBQD3/D1rVBeHZ2ICkDwBMUFb4aHvoWZZRofpWKie0
a+FupQ+PioOCWSG/zbt4OPr5N3EpDFmFb+53gVZhZzs1MXzkG9vFoAhFeA2LDRhwRez9UIkE
jTplJ4IfDlrmEPxgJGGCbRT7niJznoUnt489NGAyDBXGgLCUFAH/oR4CvAUMOjRTS+rWmNWD
NKE7w6CNC9QykF349VAaWxNsGmX/KTsSD2/iN1r7D74RRasciwy19AoLL5TCLTg6EXFDpuES
bIZgP0B560uiRr7YdgQ4lDQg45wsu1++4EekODxgfnt8HjwvBzp8ewV4+BaAff8B5gdeQnHv
/W336wgMxkUYQ4P7KH4IEho6bNjgg/4Do7RIZqG4RalClHL7f+KdK26Tfc6QX1MrwwNws5Ee
DcxFfAjajpK9ZCuABXZsENlgv9T9gHw1y12NBHWUW/81AIGt94xsfAIgB3cHhQ4tX5qle8ku
dCEGyBrKE4A/XR7syGEZB3UKYRijWQM+zPC2W7mMtP4NhAM8mq7n+3VbvS3UME4f/34XBT5p
459tP/QUSFk78HTo6D4aGIaMMb0Ms4UNG2cjGvvYst5ZrxA/ZZb8xeBN8lkOM7i2Jr98DoXX
DGy/AX90Aov3zRJ85Tv3kZMbWA9T4YuiSJNbh2Z8u2FD9O9pwwRPITVI3r9FEn2t2RnJO9ac
7LB3vA2BD443ARxQinaP2EU+PIiLgypcZYXcdmC3mrwW8kzJM6BUiWSSj+xDdHZoLBJIaDQj
+Ug+NWhcImhE2Mv+sg+xGUdZ60IOGBAakC2BXOImNkZ4MDDZVfhYvBBM1GiBmUpOXUU4m1kN
WUNbG04SJYOPDYAIAo/kBq7Rc/j9vjRJlohEPyf8XIKPZQvvDMP7/nshm6DC/P7/Ni3swMwt
Dwy/bPcCZ+NQEMBJgf6UadybDHZ8lF6RRC5+DUhTy3htlBDkB7yzR7waHcxgozkcEfjvCKBr
llv7gL3pJgh1DeguFRdkZGTM6hYe2MrMyemt994b2IRaluELPXJrLJKdHPYQdEnsEjiDO4yO
FxawEwkZhKBFfNkv+xzmWQwdeOsMDRr0gUH4gRP1+Fl/Fgi2rXOBVqTIYMEIDv/CYYNpxFVW
vmCPY9uRpYLQBXQfNlRZIVrPCH704AT4BQ9hBgK+VwtKmKrg/S7/SQgHJPjQIQd5ydj+1/7Y
/vGSHGwSeI4RDLak45wP1CdwHUS2khMQh7v2wlW/QBVTaGlkeyxxvlgN2OlXx55BW7aOmABg
CIs9tkjmngTXQTx0CBt7g+0TaDAq0wQgaGNyhyz2aAEkHmj0z5nsbC/yqgwuAlgiKzBMTR5c
JCdHAtzUSSPklZyN3gPTcHiYAcy84LF2DTQYJysxxFpTU5rNsdkW3KOwSwrYPQqcwQc3dQlD
woVbuHYWVmjCRgMsPtFUN7n/CaS/6r6wFh8/4UUXVokdj6pUwuIsAlFWUmKxcFF4HCftff+p
k3cTahBoqOeIkWOihe/YGGEsHvgEzv3UcNR+ca19ajLndLRoLpqCUDzkR3T++waMdOk5HWh+
4cdFEJneS2zn/OzUgLuf3uLJ4gJO/zCnB8aDmZvLxaJEiEJqMo9t0VwcJF87R3y/62ooWPeX
/yV0bswA+wyOGvolsASF0nRHu0Q9N4BfiovVBHIt99l0dAhqvvL/K9GIB0dJdfqLyMHgBhDK
uq7wJtnpAnQGIDoGI0ptfFFnPl8Qw6r/yW7sHImaLDHdw8wAxK1VDbZXgnNNEHOh/9bai0jR
A8Y7/nYIOy+CeP8793D3xwOjFFthg/kIcinzpf8klaXab8PIMyjHuhyD6Z38Nbfy9uADA8gX
heAyHo3YkN113dMHXPATHAhAAyPRirbmtt+cikYBiEcBBQJWCFnGZScZW8dczI1JK+RZlrEl
AQICppCvO5uQI0YhRz+MaZquO78GrAOknJTu/5qmjIR8v0SO5IlEj+QHmqZpmujo7Ozw8Gma
pmn09Pj4/EP4rrH8jWT2AAPwA/gJDbXpvv/w4APsADSNCNnA3tJeXxWQnQv5kBBcsBGjDRDe
PswKK410MWd8Ofx/Z7e9ZCQN/eP8d2A1k3DeGhXvjRA1j/n7RT4nK2g0LJB4C62wma6YA8Bt
Azpv9pZ83QNOWE9WtksffLdLGKPuAu8CKYwJb9mAkCckq2DjlS0tA65FWtN1F+YdWxQGHAMk
YdM0TSw0PERXNZdpmqYZHBwYGBSmaZqmFBAQDAwspGmaCAgEBGHTdScfcAV4A4icNZdsCc4t
t7WHD8LAFsKDE7f/o2UTzAD3COtqjaQk6PBTe3pvu1f3wYf/bAFehYoBQcI7DnXxiwG6/xtv
/f/+/n4D0IPw/zPCg8EEqRsBgXR3QZtrqbv8JiOE5Iap+DgO279RcwYH2uvNjXn/6w0E/sxU
y8vrCP3rA/zNX92oVB4ZihHsSRdHxQrwg2Lu6wWJF3lnd5MdrG5pixFr4S80hPa1sTf2dCf3
wmkSB2rHOJJtZ2cuZgjG8wAMGewF2wiIB9/eFJEdDjlABQHjcMlJczIkE0Ekk2yPNSvBwwn+
/TbwK8j8x6Pgt8Oh/thv/wVpwP1DeAXDniYAFcH4ECX/f7JFVwkY6ASc5OCV+SiB2HVKZRc3
TwtQiCyTGuDhUAiOx05X7yX4pdx6VlOL2awU98bN0js2EUF1B8t1b+sho/as+8BGc3QlwSkf
dest3WyBvx1Rg+OTDSAdL2HSxu5LdfOmEFslw7lhzwSFXjrmLhErDZx0Ou5so0sqwhZhQli3
Y6+6zSAfcgYWg8beLLcngzQeDHXGOesYnKZz0YHiRgkOALa12Aa/0lPnVQoEYbtS74kHX8Ow
dYWj+AYPhyoR+hKDPWySz6BFO6t+DtUpLSm7LnEwdFxgkDEEQTvxZFtUBCcRaAelKhfedgJm
iyslHmFRPepW4QBdZTcUgbeO/e4VOO4tEIUBF3PspIvEweFLbwyL4YvFQARQw4/dodGi8ULZ
gfFpBRru7opxAfRPi/cZcemXztDwONB0FWkLcwoKdfUXPsMWfl/MEPCXjX6/g9bx/4phAmco
EDE44HXEikHau8a7AzEYimb/jxB03+uxL29z3+00isKQKaKNR/8MvscFJNpB+o1C/1vDzY1k
BoNowsTGG9htmwiD+I9QdNUTigpCONl00SHbb/1sURJ17QvYDMPB4xBWCIuE6zbCCr/GwbYz
y49SW3y4wfH/z88zEsIDjef3w+HQdRwlBnTTAagrEe3QgebsrbHNd6W7v4tC/DjYdDa37zjc
rs/nwejtpmm6EBIV3AbU65Ytc9K5Z7FC/jcG/fyDHQaLTwRTpDy229vtiwI6ay4KQyY6YQgl
ClcdlG6BaDqqGRQRrZszbR0QtaUaddLPk7btd4qQG8DR4ECR/0MB9/bZut0CQkTpQTDgEwKo
Zlg0T/O1M1vSysnBdKkuNnDrjGNqZMhlaD9cNnaYR2ShXFBkifizdEc/rexYMYll6Kj0XdXo
DbSK1Ik+ZMiL3TEKJ8oN3A3B4eyxu51tygrYr6PUBzP28O7THaA2X1nmahwLK/tZidBno08G
NLRr8NhioTijj/4zgqO8CDG1tdD2sTB8s54r0Jqkl4LPdArsFiTB9kUN+PLC0AFcD7dFA2oK
WMwFB9nonFZW0OggxXds8CvJCC3LDOy1CYlNfbuz6ZhQUQMuoMd1mB7c0m7BLSjEcgcFDThs
aTmXew84pWjTnbEvyQ02hCRZJfh1pICBUHs1JF8jvgY4pJt24HcivV3i8BxsxxY5p3QQEzn4
3t3bv8K9gTs1sJNJdwtWGj0vteqfpxyF9nUDDSIPg+bAUy/B8FaGNaBhl/xZcB0wxeY/b8x8
+lu3uPirO1sgg8AIQj3ffPGi+9tL2RNyHQQkdxjHBcgjDf3rLrmR9dX8KqMQw4H5vDZnZtsT
chIHyiUIdgpoLXbOMRaciQTat9R8yTpRVtJQC3zmXHFe1oWVAGGF2kDtJoKNSAFVfXcMtzUu
z28Pt+tSME41Djfi38XBdrbR9kRWAYBezWX+2Tb+Ev1N/IhF/WqLCQ39tReoVIWjjU0KBaAd
gq1hAVEpC5ToKJdLQlxOAuMOHLd3AQojRQwIodRDO/vBdfwC/9BoEIDDCATvhmgEDuhaMOQA
JLFqIc+ye6kMEC3tDAF2uP02Vw9fOT0QXlN1EdPbDaUrygjyBNgMd28tAU1c6Yk9DCKIHQjm
Vmd/KDyh0IMi58xihWYN/o1x/DvwchMml23k+69AayJz7V5oGJQUvuS7MEZoIBAchdtbOCP2
leN6iYZlXwbJdsEoqnMNV3txpBjh6+12U58v4QDaDR/AIHuLWAhIQZc7WhUBcPsFdWAIbnN5
1/npJN6D+wH2AA0UYY8tEEshCEGJC4tIBNZg7EcVhcgd8EoFsdX/5hX0A9FWO8p9FY00SeCN
tRC7FkASgyavDLG5FWjG+SM1/D2Ou4CvvD/AdQwMg/pwPQH5GbCQEoFdPZH5GZCfhEo9k4U3
PY0ZkJ8BgiQ9j4bY6eT5ET2SCoqSiIi1WsRqg2kKHe4r1KWa+lERmqODaLh4411otdlOtOEM
01td0Oz46d6zuzkVeAVWuHTt63jbfov/wAw7xnMEOXT1jQxJXgONFbLLxfc7wRJ0uyjIYqKX
48gAR6vo2WgdFdhVIsOaRgduwI0xGBH3wFB/Q6WJb9tv5kbr44A+IQ0HCjwgdh/bi9pbDCB3
+jRWD+lW4P3Ci8bbUzPbOR1ag1u76EALWiqyOsMVC/4WvTw9dAFHVvYgc6lvkwYB6+hlvQSA
W7jsdSwfO/MJ8DH038LTgwnWBz1BOB90OVWK3f4I/IvoWUWAP0kiVTQ/4rImkgYuVx4lvGI3
aDdZA/03Ol3/hFv49yyaiR0LiR5fXofEqZWN9YGEWwtRvRR6heG+GIBa0I/tMEZDoSmiAHxI
DUHh/jgYTXn484ko0e9TU58xzmjV1qhhW9iI1HDW14ZNuqEILyck2xYcdoZQVjX8VEha6CKE
+0WAo+QGCKndYNtMGBwU1oMhcmoj1mhRj1S1IIaWSpBzdzeKIhZuFJmAOJtEhS4WXnZAgPq+
KewlvjewcfvS9oKBYEcEdD0BGAaKEBU7MvaIFkZAC9XrzgzGbm+peh1GQBzrQx4FW/K2RQRA
RNr2gxny1tz9GIgeRmUgdAkJCAl1zKFYY4H/SLtKGMzS9kaAZRgATgC24Ixt39dEKwUnA17x
F8i99g/MvItVFP8Cx9DXi7//FuQ4XHUEQEPr95Is9sNa9hcchEdtDYB4ASKN4xi2Ercdi8JQ
NwgMqe03GlgYGA+UwokF0Ufav1tw00uwDkOIxgZcRrGNtmumQ4CnSoM/VXGpbb4Kij90Og9n
dC4w4bJXSuIGHzY3IJwbD0ADFQFAfW0Iu5AyujAPDoi1RjTcxwODJ44UuvsLTdwooEmhHGNT
uy2ao7qCUAlXOcC10dg2qHUE1Q4LdBU8EM8WIXAomYU7ohsn+Dv7F+q5vMucGwL+NF+D+IWB
T7VZh0MMPyesZmdvt9I5HnPrQEAIGHX5BvK0jd3SK8YvWE7R+I5AAqlYYmtdA4nKNIHb1JJ+
6DvrdDIys3QjHI7CNXBVULskJTTdNkjddQ4MECdcCYsDVtZF/GyeXMPrU+ZMpUalk7mFsXQ8
YOrt33aJZUA4e/sE9ivHQGrSV7CkVc5aC7pbwVnBVtQMMRB+cYQ6u11bguxEYQeg0IknBDqW
Jk2FZTIbFcCnlgsmuBjAYiBLlY0bvIYptHMabQToXXq/tsZGBQqhI/UIBRuJQci1iuGNZglr
26mjQnXFNRZE6QvtxdJnuTCN3LhISpn7d/uNHC58AnY5NWN9Ur/ETI+3mn1gADiDf/uNiC5L
82N+wXMYgGAIQIsPM8fYLtGBwXzk1UmlqBD7fLvrBosJ+wn4SzXqRosDRomKTQD2wQGeW/XW
fgQIdQuhRGAeJehfiijPwfgFg+EfDXRv1XrPIdILiQgviDVe4hvrR0WDw5v+fLpQKPECn+w8
2P/y2HVNO3sralUACBX2WOuIpttKfcNI99hljfVYSOpkf0C7dBdXZgwlGqUfRgo+0AaATmrq
ugJl3goDdQo2BYBmi32rWQN8m/+4NkxFAxYOqb1E6EoG+KiEHGhxdg6NbA0gVTyjW1DHQw03
bhNKD004cIdsQB1yzcO/aMoVH55V12i4bnqwoEbiTexdOYvlXbHqHgsPQQQGnbgdr94Ahg+u
KRCJArhy1D8YgMOQ2Gr+aMBGRRek2f3/NQAZII6FQt1Ji3AMQVw72bdd/cJ0KCB2iwyzibWJ
SBd8s7YHlaIEERMts/GCb/99N3L/VAjrw2SPcn8Ncs6hjOYFD4F5BHxrCXpoW1GlUgw5UWDq
7i2wBZuKUbsMB7Yd0axwCFiJSwJDF6jVt89rDFlb8oVWQ/j3AfwyMFhDMDBMCPr8i10MHJZi
G7j3QOTYgohrruBUOZ0IPpb4Llshc3sIwWG5dmt/qdixjxRFVlWNaxCoC1X3QnddXkELwzN4
PCVTLWPd9rOcswQdVgzeCDYmW8E2bt6PSY/Gd67bVQw7CDAaizSP66H1st+xr3scyesVXGr/
P0MbQmxdFpS8O+qS3X6LKYtBHFADGFAk4aE1FHC9b6CY8SqZis1bfvSOQCFoQ8Go61h6oSDK
We8j0awedJDfpLv6iyqIuCCTExB0/S3OmwtBPbCTlPHB5gM7lqVhbuEaJhwqbLuHbtKZ6HAN
ENeoVv21vfp1C/EfhVz+E3h2KELWF6hoQs0OIZpZEsn2dizevQdgQFllPHYpGeDsJGAP+A2D
+ircX0VqAwP4aKRBXnyzJN2nzGD/VYgQh5xNqldbHYTMWs1m7v+2JNMWEQk7yGCmAydcR8dZ
iWKufixf6yaNoTD0TdpNqDY6CGr023KrUzV+hClZKF9OXx8xD7HQsQR0IYChmXtSCJS80aYp
r5x1AQsllGERuJ3NBpgxo5BqvM0RuIgFGUChGEddY283gKGcB4j3FIMLu0b1K1AMFCRyB7cU
iAG5Qspob+qKWlTTAItBb7FtUDSQcQxa2sL8V0B9i9LB7s3mevxpye7eKNGGS73vjAFEmYld
9DKwVKITpBMSqL19ifZ1f8H5uT9JXwu11i/exs92Ax5ME/cD8KVMLXpI+vEgcxy/i7/1Xd7T
741MATDXIXywRP5dgr3Ubit1ITl6g8HgHqdzD+YtIbywxBIkBti24UrTUdN8VYkK8LvtzQQI
A134DQiMi/vB/wRPgKGtLTM/e4ZfyzUBja6Ol+yFgSt6i1gzwhGhcfhJWrbW3bVnpnYFifPK
QRv7um3w50A+O/p2Tvq/dGvAtlYjrTu+Ub0ueWRkuurSIVQR5MOCRR690iGUbVusJUxSv0m+
Sqq1spwLBAgRkVhA4Sa3dQk5Mxl1b8i3KfCNDPkLJomXrWzNLw4FCJdKY4q37/7tTAcE7yCI
TQ/+wYgLcyWAfQ9GDrvJdjd4iJHT63YJGQ2N2LcSWrEJGOspJP4Q3LPYT+AZJVkED50Wb3js
hLcJOItURfCJGlR4LAvwE/z/r/qhdhbuAZ6J37yMDbrittHNcMHhD0sMUoAAFwVaZID/Xr3v
QZg9HzIcCVAIDt3s/WE5QBCDpIhsJA/+aLjR2UhDCkh/eUMTg/QSx5ar/hGDeLF1bFPQvdbA
EChaEgkQGvBIWB70TAuFEjHyDpLLyHirhWMoK8iSESuNSBSDMPCJAkhczKptNd6vDS87BSI1
JRRAo9OvljqJDUypsqLzM8usiTVkvSsFbBRmL2hXjTyCw7TxySwbSBd28BdqhZe6o0k0fQ6D
q9Pug+0DHLei/9frECYZ9yu6UFvT6Ob4oWkX3gDwi9g753MZi0vhOyO4RYtvKyP+C89gNRQ7
8v1u15oYcucHdXmL2jvYJhXc3TYTBevmGXVZJHMRg+xcARoshRM36+3m7B3yJg0bL+4Hm9uG
DghAsHuF23QURm5b0fZBYVlbEOJDqDj/697PqFRAq4kdpRSLFkTfSm36x0oti4yQxGxnD/si
kESIN4sScBFVXzAQrd3NDkQL1otCZYJvC3UXi5GGtdP/VrgcW4v+IzkL13Tpi5eHNatQymNc
WE3BGnQbdkxXzipmu63+3WogZF+FyXwF0eFHX4sgVPmCu7puQworf/F7wf4EbgVNt20/fvhe
AoQNpE2DVCRhIH0rEdvSUgVROJzT8+xb4Lj7I1yIRIkD/g916p7saLGB9CEL6zEXK5UVXLvF
oTIhGSk2mJNzFIIshSIKwJteLmJ6BOyVr3oIJZ7bXJCElDSpFANIrW1CDKUiwmSpdLMsBv4L
fSnEmcY212gLMBFiv7DObrtkl4wJOwqPCXyu6y/vQ3rAKA2NTrYJewSxXI90sbytFr7uCTdq
W7pRi9yOCokD/LLDb3uXeXXwA9EiARIy/J/o8dttiw4hjXkPPnUaOx3yQSNSV2xLO6QGSG/k
gmsR0o1CBAi4IvOkAg2InaaFUhtddZVNUHLrkJqlUJCcV5csHMyg0Ko7bIicg1+wGMA9CmjE
v22hmekIRTD4gTNSscWR/IlGXCpqF/TgqzxosvoMpH8wGQx1FP92EFf8cWstba3rfE4kxYl+
ylSLLUoFYkHno9as2LRfN+mJ0dpi43HIQb/bxVhVo9lP4EPDN2UlKsbWWvswgmhbQxfbQAgC
BNpKHvuFwUM+263n33kMixCAAFaTyUF30SdCBUvbd/WXAHBg+nc8jUd3SPKDbitHg4h+9Hj8
BoFoBvPHQPzwQg4j1Oe+UdYEx4DoEBQFd8ENPiBI8JZ2x2BPDAV1rTBF1yYmibeXrb2sjUoM
CI9BZJ5EQrye77rxD+OKRkOKyAuEwHqITkN1BwXG+AMJeAS6LMtoftGwWgFq2LQ4coE0e2gY
oSyLDSi4iRXvPr26Uhdo5AteVqwzVluAk62AIAT9HRvWEI/iVmNcJBnV+2kj7M6lAlijQ3DQ
3fafJJMcSQWhSLY9qkdlqwhYvTyb4DMjQ5OUOV0YzbaCuxmhWCp4jVMsLdEPsEEgEOAIQIAY
iNtTtTcoJOBWdGPQAAq0GnLr1e5FvJ4DJPw+wIv0FkCjSh83wqBEhw7rC0iNbQk2msiDvP/C
KUnnkrXZ4FZfHFVSEaSrWkEUzysg4SyY+I1lzHsmDUjyEKgRBdlDtqlRBYAA78MG7IJbEYSI
cHUcstAN2oKfDoxFasWqAmyFIwd2N8HwDLKNinBp69tcAm1FgDVk+XUz2ZohmiJIpwlWlsub
+tK4wGI5MHRyMEKUpsERcAqTHNzbxwhAJChAY1m/gIICj5W2h+jGUPOrqrhp6sfNhA+G7xV9
7ma7xE9t/03vihGE0gyuebZB/wbE3i8wO8IPh5Mlx1oMS23Z7lJIk1Jxv7D7pdgEqo2e0JGA
O3vLdCyKUbRRxYgBsDT6fbt3tJR3QvyKkrggCJBGQIGBhb8TdvVBQYA5GNT5yPFSsHgIKgRy
wa+H94TYqXxJUKOsC1bdZqnKMcS/cA+lbaqr3d+ju6XrVUB5/0xIreLMYGdCoQiuLNbKRVpw
OSzWXnvZVOsG+gvCTV/B/TarAOsNOR0wCpsw/VSZunYERiYwA7uj4bWGMechVf4gjfAgW0sw
/yU4av1jiciFFBheD7cGHFsWGUktpPbU397idCJRBHQXBA10DEh0A+1sBdpouAQ1BRIL3AZ1
nggR8FmqN0KwE2yqtBejxTlS9b3cw19kFAWMCCWi7BHnCv++AAYWzb6HiIQF7H3/BVf5gsZy
9IpF8saFDSAJYOsC9TdTp1XQoQs0aAomtXcdGh6Ae6y8KkG4IACXvyOg0ITe3apCQopC/3ZA
LwBe0F9b8uz6CHf2GoM1jXpQEmdsQp2bOCP97GaTfR1WHlY0I0uRM8WVjPxoOyd/TUsBXlyC
jXJmixHN30/49sIBdBb6EIqUBWSIkIDryJ2TtxwaAnQQIFtDo/E28qAcgTwA2G6YcL/rSRUl
QXIZBFolGh3WqkvIJX2Tl7exiEkfHWFyE3p3Duhu2Jsg6SDr4ExKvl7JRv3xkIYSakZD51nM
ORLNoJJKNF9I0VX9QmgEaYVkdegiGjVnmgP49jVUDoYkoyl0+vfD79noEGjUB6M41NajPAZx
6AZeoQt5Fv/QqKzrPbu8oTwQBVMRixgDI8QzMIxNBetyqgTi+MzM36hZ38jnBMBYuFk8B9AA
gdh1E/wDIFnfQg6AvKhZqFlN1w0WP58GjAOEfIhN0zR0bGRcWT5zCBDfqFnwwEACsekDzOBZ
30fIQw5AW/BaSMSu+51aLJBYC3gDoFohkFcI30BbbrBQyEBbW/R/TdMsu/wDBFsMFBwkIUAg
Njdb39h03QkfUAVYA2h8WwktAIHfNEWTIeQQaRzkQm+64D1gdnVGV1cxW1PJQi1Wah43bCe0
/LbAHSPrIlM5V+migyxoIgE7YA00oT85fRR+EC9itR56N6JZuBShHVUdCwi92BYctE9IfEY2
NE5NIdN9ICw0a5Mgcy5OJG/AyYAgixjkO99CO8BthZw2vgQbUqEPbRfEQdw66xNLtzbWDv8m
EYs4Z9x0ydqsoWat3GEhV95ZzHX0TewapWxttiX+l3F12Dv3dDL2RQ0YQD4czW6G2niyItV/
Htohs7WRMkjSj40oFYTkyDDkF7Idc7M23Ild4BcrkGQSlbJ9c6ese990tFZk5Gd0nI+zt1mL
dnUEAz2MKGggB8S+B5TVWL9chFIuAP8IcVLNS0WoCItEVqFeaG3U/+c4f16L8UluqW6hBfMM
XgArHlsMBG6DwsOPPDTUSL0ykB5Tq3Zs6HRfdSF6i9CewMG7f3+KCoD5QXwEWn8FgKCjdfyt
aBp16utnVmRTAJiJEi5GYr03LLWDWxQrxCBhOFe7rWIYKagqLFdQJrnEKydZSF8ggZoB6u4N
thhPUPAoNwxAQ1FhhyoAAJb/Lf7/MAd3LGEO7rpRCZkZxG0HEWpwNaVj6aOV/////2SeMojb
DqS43Hke6dXgiNnSlytMtgm9fLF+By2455Ed/v///7+QZBC3HfIgsGpIcbnz3kG+hH3U2hrr
5N1tUbXU9Mf///8FkYNWmGwTwKhrZHr5Yv3syWWKT1wBFNlsBv8b/P9jYz0P+vUNCI3IIG47
XmlM5EFg1XJxZ6L/////0eQDPEfUBEv9hQ3Sa7UKpfqotTVsmLJC1sm720D5vKz/////42zY
MnVc30XPDdbcWT3Rq6ww2SY6AN5RgFHXyBZh0L//////tfS0ISPEs1aZlbrPD6W9uJ64AigI
iAVfstkMxiTpC7H/////h3xvLxFMaFirHWHBPS1mtpBB3HYGcdsBvCDSmCoQ1e//////iYWx
cR+1tgal5L+fM9S46KLJB3g0+QAPjqgJlhiYDuH/////uw1qfy09bQiXbGSRAVxj5vRRa2ti
YWwc2DBlhU4AYvL/////7ZUGbHulARvB9AiCV8QP9cbZsGVQ6bcS6ri+i3yIufxf+P//3x3d
Ykkt2hXzfNOMZUzU+1hhsk3OLDp0ALz///b/o+Iwu9RBpd9K15XYYcTRpPv01tNq6WlD/Nlu
NP////9GiGet0Lhg2nMtBETlHQMzX0wKqsl8Dd08cQVQqkECJ/////8QEAu+hiAMySW1aFez
hW8gCdRmuZ/kYc4O+d5emMnZKf////8imNCwtKjXxxc9s1mBDbQuO1y9t61susAgg7jttrO/
mv////8M4rYDmtKxdDlH1eqvd9KdFSbbBIMW3HMSC2PjhDtklP////8+am0NqFpqegvPDuSd
/wmTJ64ACrGeB31Ekw/w0qMIh/////9o8gEe/sIGaV1XYvfLZ2WAcTZsGecGa252G9T+4CvT
if////9aetoQzErdZ2/fufn5776OQ763F9WOsGDoo9bWfpPRof/////Ewtg4UvLfT/Fnu9Fn
V7ym3Qa1P0s2skjaKw3YTBsKr//////2SgM2YHoEQcPvYN9V32eo745uMXm+aUaMs2HLGoNm
vP////+g0m8lNuJoUpV3DMwDRwu7uRYCIi8mBVW+O7rFKAu9sv////+SWrQrBGqzXKf/18Ix
z9C1i57ZLB2u3luwwmSbJvJj7P////+co2p1CpNtAqkGCZw/Ng7rhWcHchNXAAWCSr+VFHq4
4v////+uK7F7OBu2DJuO0pINvtXlt+/cfCHf2wvU0tOGQuLU8cb////4s91oboPaH80WvoFb
Jrn24Xewb3dHtxjmWn2N////cGoP/8o7BmZcCwER/55lj2muYvjT/2thxP////9sFnjiCqDu
0g3XVIMETsKzAzlhJmen9xZg0E1HaUnbd/9L/P9uPkpq0a7cWtbZZgvfQILYN1OuvKnFnrv/
////3n/Pskfp/7UwHPK9vYrCusowk7NTpqO0JAU20LqTBtf9////zSlX3lS/Z9kjLnpms7hK
YcQCG2hdlCtvKje+C7ShJzb6G17DG98FWo3vLUsW8P//QUJDREVGR0hJSktMTU5PUFFSU1Tb
/////1hZWmFiY2RlZmdoaWprbG1ub3BxcnN0dXZ3eHl6MDESm+7/MjM0NTY3ODkrLwAA/7s7
2Vvx/93PA3J1bnRpbWUgZXJyb3K/VEf1rMRMT7cNDQrEsvYDdklORw4ARE9NQRIRsbzd/lI2
MDI4CC0gR2FibHT7dqm9zmluaVJmaXoNaGVhcDdb2843JzeZdD0EdS1022+oIHNwYWMjZnds
f2nkstuAOGEGb243Np+B5ClzdGQ1cHVba4W3cit2aXILITOlY8gX234jIGMMbChfNF7bblNf
KmV4XC9YBhZ2stfc4l8xOfcK7uYWcmVYMXNvD4prkwHbc2MrOEYkBkKEW4FlZBlX2+0h+SM3
bXVsrHRov2GFMJJvL2xvY2sXa24bbDRkt2EuAqLat4ZbIXJtAHBAZ3JhbSDshVDYSm02LzA5
T41maCkQQSonU8jnGiwuKzhh9jyE73JndShzXzAyZsEutm27bm5ngm8FdDoRQiuctWTmf00t
YDlg/MPbZhVWaXOqQysrIFKch7nv9kxpYrRyeScKLRZFa5xtDw4hEVDUOr4Ac23YZS4APOXg
JSyxJExta2ydQ9j4bvn/WVNdA0dldExhRkF7LxToFnb8wnVwABMPgW9tO1epZDqbZXNzYSfx
hQV4Qm94QHM5MzIuZMbc8qw+R6VcqQNTXaCiMGcDAC6nsg+vV0AjCIv4immaptkD4NC4rKCm
aZqmlHxoWEDNsmmaIBQI8InYxDRN0zSomIBsVNM0TdM8LCgkIE3TNE0cGBQQDAh0btM0BAD8
iI8D9NM0TdPw7Ojk4E3TNE3c2NTQzMQ0TdM0wLiwqKDTNE3TnJSMhHxN0zRNdGxkXFRMNE3T
NEQ8NCwopnNf0yAMj4eLA+Sapmma3NTMwLy4aZqmabConJSIpGuapoR8dGhDYGmaphtYA1BA
OCymaZqmKCAUDARN0zTL/Ib07OTc0DRN0zTIwLiwqNM0XdOgmC+QiIBN0zRNXFBIQDgw5huk
O/98lOeGmmbZdAMI/IXo0LhpmqZpsKiYhGyyaZqmWEQsFPyETdM0zdjArIx8cDZN0zRkXFA8
IITTdKbpZ4SDA9S8TdM0TaiYkIh4cKbrzjZkg8tUB0ADLKi/bJogBPCCc29tZXRoK9RG7bNp
c9pv8hOxVN8LZ28Idxlnj/1G/e95b3X9ZSBiYWQLdHJ5+mHfVRdzdGVhbB9mZWVssEZtpZ4k
c5sT3srtWx5ybiBtRGV5GmF0c+7298FSd2h5Pzd0YWsvaXQnte92qnMDYnBsCGRbsa461j4/
Mydz3G4fc21sa10hLGRjA04TwK1UC31dZHVo1MAOBxfYm21fAUQsZh9tPli19muVKT9hYmmB
AFy1jbIAQWZNAwludkgXhhsh2tiyfxt0dWZmLddner23PdcvJXN9ZRePI7wJ7utBC0pPZYYT
GrZzpnJIbBNpkFULzm3vZKYgCGNO1QXsco0Dcv8V7C/2QIUzaG9wXdd2F16ACHVlnGtpVe/s
wu66J2nuIG9mViHby+aSJWMWU0lhXLjQvXa6bnCfc3cZZCEL9k5iC2G9WC9NOMTcVtZjCCM4
g/uXd2GUVC7cIb3usWQnWGFjY+x0K3vLvoQX3z8TziPRNb7DQ2ttfgpvrbB9zi+C9W0uZOOJ
ZO8we2AnHvrraG32zC4vFPKY7Yf31xITaSdtpa4Ab2sP3V6hvXdwNJVYOGFuEdqEzXgEeSIg
oyOPPPYucGlmB2NvbXNjcmVvyQLOeGWXCyNuIwX7m+5vI3QFZXMjayN5Iy0H8o+tmxOxbatj
/3BhcnRzb5AuD28y76wH21wJ+G9iakDHkGuvpq+V2icY6tdsQhIQqW16Zg+QUxnlU4zEY2pv
+zHD3cJpBWTfZWJzs1N4AKUYn8iAY1LDxXUHPrANfBvvEXAjXndncAiN1ni7aWlW9HWGbdJw
bx93gZBwjmc2Ykp3YgeK1raACA87YzAnz9a2gGx0ozsAb8m9SYdzcz/jFZQJhsOGEWitA6Kv
0Rq2O3LadM99I+HtegCvN2xrQ9t4Q2ibE3ND4xOzrjYJFc9bAN9XISFtcG7bA5dmj2U8e/ui
7ENzBDBTT8+PgMMztCrjcOuRT46V0gdzaGRieH5vcuR0YmJhZKQXd2Ex2shBc3B1d0vyscLJ
cnR2lwdodG1sOOvOCGtsA2gzdM/dm1E/ZyIHW10tQNdsm38LXy1cL3o6A3l4B03TNE13dnV0
c3I0TdM0cXBvbm3TNE3TbGtqaWhO0zRNZ2ZlZGN2TXoX420y1ATfeCD0p1jAAxkGcmZjICFZ
hM0eJGxzF1NZC4hAoPUSGq4LLacKIGzJZtHolhWXFXcuhGMWsKy+Ef5qjmVb131zSXMbosKy
pQeTvWMw7jXaRtd4u3nOILFHRot07xkVVy2DbAiWFYIMQ+ymIN0LaWnMWG8uNxcj2G7uZXED
IC2kaazYWmNXfXB1iL/cM4ajOU4EY/f8qWgMhtXAczRyD3k1GrPDtUcy/XO0gS0IX4q32T+u
yV9Xr3D8anBnc+yLuRrooV8ab1S1s80RwlR4DHD03YG9qfKccCA5IFRzInA7aLOmcPhyEOBd
X2LC7BlotKtiVXf22wE7eHCOMjHwNS4xMDAD+KfuwACCv3boVURQACUcawU6piX6BgUuMnvP
JWtTBBQGAyu3DMRHkCtPqQBOL93Y9W92AE8fU2W+QXU2SnVshVbYcAP2TZMPzC1ba3MHA0aP
E2FT2rZordcLtrNoRFdzW4FWeQfyH28XL23vTYFJUVVJVAcDLmlbjvwGAC0tACJDJnT9at02
sC1UF3NmsC1Fbpf90dGYJDolZTY0IkRqj64CWXhpgRxzv9fLbjsglvI9IlNQUHAlJnlVJkof
Cx/D98UveC16WS3XcmSz1lyLpjg0MzW01hhdGBeTZZG9tqwqL5O/Ny+27YEhLzphvDuka2wL
t3JidD22LcVjmOgQhHYiN2LUFzgQIYsTV3Ed+DY+Ti/KeLZi7gjbHzOE701JTUUtVk9zFm7g
WvcxLjA/RLw3dFN1Q2zhRQqTbwanIoJ9BQAJMADbfyJ+P0FUQTdDUFQgVE8ePP0lwm0LPhBV
TCBGUk9NK/gH7hEAx0hFTE/Tzj0+w0wTXLPtyfx/f2MHZRMqLiobU09GVFdBUkVcYwWHQEhc
vXNiMLfFXEMpcuK4XAxMjgU9U7Bh1xlYomN57W3hS28ybWzcIGt5QYtF7Wxq3/j/R5tDTFNJ
RFx7RTZGQjVFMjAZRTM1LSW22/8xMUNGLTlDODct1EFBAzXIxFsg/jdFRH1cSW6KY11mzTcM
JJthc2ttc25eyy3Co3tJORvDmr0frgv7ZdCCgRiIOK9kEXoijsliZX5le7brexAX22vTdEAG
H/tYa747QWRtUw9Ka2xTIQMGbLkzvwG6wTZ44D0xAhcWAwKaZrBBAwcEGAVpmqZpDQYJBwzB
BhmkCAkKG/a9F5ALVzsHD1eCdIMNEBMRAxKQwQb5FyE1D0HBBhtkQ1AzUhcGG2ywUwdXX1l7
bKZpusEXbasgcBwG+16QcscvgLOBG2SwwQeCH4OEjxmkaQaRKZ6hbJDBBqRvp7efchAGG84f
1wsYB9l7rmqJA5UBAyCTHCggSAwgE8kAEIQQgQzIhIEBDMiADBCCApmbDIEQvwBp0l1VAQcu
XwzSDfbACxcdCwSWyCDNgI0IjgzIgAyPkJGADMiAkpOyUQzSA68KN4wkLwtvDKMABZMZ6Vrw
Y9M0aIMHCM80y6YJ3GcKuDeapmm6jAcRXBI4EzTLpmkMGNRmGazTNE3TGnQbPBxl0zRNFHgE
efRlE5Vpmnrk/AbYh9e9Rw/4wEMCBNLPDvbdpA9ggnmCIa+m3wehpc3z7yeBn+D8L0B+gPyo
wXL2COOj2qOPgf4HQIMMgQ21L0G2XyH/d1/PouSiGgDlouiiW36h/rLf7j5RBQPaXtpfX9pq
2jIvqWiXv9PY3uD5MX45g1gAKgoAKioJQQFUIKsCqEBGBVCBjAqgAhkUQAUybIaobAPEGFCx
TRSwASBDUAfHWlRtBkkxClN0KSpHVJlIolqGrFcPQU0jqv+bWUJ5dGVUb1dpZGVDvrZQAVsU
SARSHYBti6o1YwxW+4NFqKMNUnRsVW53P7Xfe2xkSk9FTW8vQ3IENXb7rEULRGVzY295IkY9
2GtEEGt6ZEhhqs5KtztsDVMKQ0UBY6ZCHUULYc+SzaNzVxcWtmRtWKy2wRRGFNUI24NlRFFA
t90BQWRkIXM9TO4sCmjhvEEN2YXN2kNNsywNV/2kqGIvWUYY2FZU7UQWVW8+rTDswkMYc2XW
Nllt7Rd78uAIUG8xm3Jw5qrKsWsabDBPws0eB25BIFNpeorq7E1CDxlT6vbN/gNUaW16CFrZ
ZUlte8uoChfMY6Df+7pnJV9sQmQHY5QIby/Z9gp6JgdixQv45G8Iz4pjcHlNb2RrbztWTIBO
YU5BPh8yDINtbmuaRnmYRgEKVJ3F8gpO8risdcsZ+3JRwkTOboPcanZlUxRlcLFhDHtF1SMM
8zB+byvDA3gx5WNrEmwgtEZGMQ+eNJyEw2khdGGecLXuNjsREDltbR9MidusqIIhuQtF4REh
xnhpA/+kAAo44QUKF1Sql+yke2UmUGxj2YEAOfxof4M7bG1kTxBwg5/fmqUhjHxBntEA2oK7
cRtnU5F7dSgWewT37Q9IS2V5DO+zt2xsH0EQHg6yWXqGT8oM8d50UULhwnZOAndJa1AJsyrg
NNtzGusYs9sYkB0BsXCOdGahfTxd9iAkSZduPTa1VwUcbm7btdk2y83/IwIBLP9zAgRlWZZl
EBYTDwyWZVmWCTcLNBcUs5ZlWRURbwOl/0P+y1BFTAEEAFn0MEDgAA8CCwECOKDq9w4KAwDk
OthZ905WgA0qEA8EM7lj3ywHHwEMA9ubSzaw7w8kEAcGN4HLsxwoaYxwYA1qhdwGAmAefAEX
bNdxLsZ0B5ROkOcg2FzYBEUgLnK692wOAiMOYBQnVG6x7kJAAi4mJ9zibUoGaYB0wE8bm32l
c8VKDfN7lE8A/34rGzBrDZJ0AQAAAAAAAACABP8AAAAAAAAAAAAAAGC+FVBBAI2+67/+/1eD
zf/rEJCQkJCQkIoGRogHRwHbdQeLHoPu/BHbcu24AQAAAAHbdQeLHoPu/BHbEcAB23PvdQmL
HoPu/BHbc+QxyYPoA3INweAIigZGg/D/dHSJxQHbdQeLHoPu/BHbEckB23UHix6D7vwR2xHJ
dSBBAdt1B4seg+78EdsRyQHbc+91CYseg+78Edtz5IPBAoH9APP//4PRAY0UL4P9/HYPigJC
iAdHSXX36WP///+QiwKDwgSJB4PHBIPpBHfxAc/pTP///16J97lEAQAAigdHLOg8AXf3gD8F
dfKLB4pfBGbB6AjBwBCGxCn4gOvoAfCJB4PHBYnY4tmNvgBwAQCLBwnAdEWLXwSNhDBknQEA
AfNQg8cI/5bwnQEAlYoHRwjAdNyJ+XkHD7cHR1BHuVdI8q5V/5b0nQEACcB0B4kDg8ME69j/
lvidAQBh6beo/v8AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAgADAAAAIAAAgA4AAABgAACAAAAAAAAAAAAAAAAAAAABAAEAAAA4AACAAAAAAAAA
AAAAAAAAAAABAAcEAABQAAAApKABAKgMAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAQBlAAAA
eAAAgAAAAAAAAAAAAAAAAAAAAQAHBAAAkAAAAFCtAQAUAAAAAAAAAAAAAACgcAEAKAAAACAA
AABAAAAAAQAYAAAAAACADAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgMDAwMDAwMDAwMDAwMDAwMDAwMDA
wMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////////////////////////////////////
/////////////////////////////////////////////8DAwAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAICAgP//////////////////////////////////////////////////////////
/////////////////////////////8DAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//
////////////////////////////////////////////////////////////////////////
/////////////8DAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////////////
/////////////////////////////////8DAwMDAwMDAwMDAwMDAwMDAwP///////////8DA
wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////////////////////////////
/////////////////////////////////////////////////////8DAwAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAICAgP///////////8DAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDA
wMDAwMDAwMDAwMDAwMDAwMDAwP///////////8DAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AICAgP//////////////////////////////////////////////////////////////////
/////////////////////8DAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////
/8DAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwP//////
/////8DAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////////////////////
/////////////////////////////////////////////////////////////8DAwAAAAP8A
AAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAMDAwMDAwMDA
wMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwP///////////8DAwAAAAAAAAP8AAAAAAP8AAAAA
AP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAP//////////////////////////
/////////////////////////////8DAwAAAAP8AAAAAAP//////////////////////////
//////////////////////8AAAAAAMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDA
wP///////////8DAwAAAAAAAAP8AAP////8AAAAAAP8AAP////8AAAAAAP8AAAAAAP//////
/////wAAAP8AAP///////////////////////////////////////////////////////8DA
wAAAAP8AAAAAAP///wAAAP8AAAAAAP///wAAAP8AAAAAAP8AAMDAwP////////8AAAAAAMDA
wMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwP///////////8DAwAAAAAAAAP8AAP//
//8AAAAAAP8AAP////8AAAAAAP8AAAAAAICAgP///////wAAAP8AAP//////////////////
/////////////////////////////////////8DAwAAAAP8AAAAAAP///wAAAP8AAAAAAP8A
AAAAAP8AAAAAAP8AAAAAAP////////8AAAAAAMDAwMDAwP///////8DAwMDAwMDAwMDAwMDA
wMDAwMDAwP///////////8DAwAAAAAAAAP8AAP////8AAAAAAP8AAAAAAP8AAAAAAP8AAAAA
AP8AAMDAwP///wAAAP8AAP///////////////8DAwMDAwMDAwP///8DAwMDAwMDAwP//////
/////8DAwAAAAP8AAAAAAP///wAAAP8AAP////8AAAAAAP8AAP////8AAAAAAICAgP////8A
AAAAAMDAwMDAwP///////8DAwMDAwP///////////////8DAwP///////////8DAwAAAAAAA
AP8AAAAAAP8AAAAAAP///wAAAP8AAAAAAP///wAAAP8AAAAAAP///wAAAP8AAP//////////
/////8DAwMDAwMDAwP///////8DAwMDAwP///////////8DAwAAAAP8AAAAAAP8AAAAAAP8A
AP////8AAAAAAP8AAP////8AAAAAAP8AAAAAAP8AAAAAAMDAwMDAwP///////8DAwP//////
/////////////8DAwP///////////8DAwAAAAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAA
AP8AAAAAAP8AAAAAAP8AAAAAAP8AAP///////////////8DAwMDAwP///////4CAgAAAAAAA
AAAAAAAAAAAAAAAAAAAAAP8AAAAAAP//////////////////////////////////////////
//////8AAAAAAP///////////////8DAwMDAwMDAwMDAwICAgP///////////8DAwICAgAAA
AAAAAAAAAP8AAP///////////////////////////////////////////////wAAAP8AAP//
/////////////8DAwMDAwMDAwMDAwICAgP///////8DAwICAgAAAAAAAAAAAAP8AAAAAAP8A
AAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP//////////////////
/////////////4CAgP///8DAwICAgAAAAAAAAAAAAAAAAAAAAP8AAAAAAP8AAAAAAP8AAAAA
AP8AAAAAAP8AAAAAAP8AAAAAAP8AAAAAAP8AAP///////////////////////////////4CA
gMDAwICAgAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////
/////////////////////////////////////////////////////4CAgICAgAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAICAgP//////////////////////////
/////////////////////////////////////4CAgAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICA
gICAgICAgICAgICAgICAgICAgAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAP/////+AAAA/gAAAP4AAAD+AAAA/gAAAP4A
AAD+AAAA/gAAAP4AAAD+AAAA/gAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAEAAAADAAAABwAAAA/+AAAf/gAAP/4AAH//////SH0BAAAA
AQABACAgAAABABgAqAwAAAEAAAAAAAAAAAAAAAAAKK4BAPCtAQAAAAAAAAAAAAAAAAA1rgEA
AK4BAAAAAAAAAAAAAAAAAEKuAQAIrgEAAAAAAAAAAAAAAAAAT64BABCuAQAAAAAAAAAAAAAA
AABargEAGK4BAAAAAAAAAAAAAAAAAGauAQAgrgEAAAAAAAAAAAAAAAAAAAAAAAAAAABwrgEA
fq4BAI6uAQAAAAAAnK4BAAAAAACqrgEAAAAAALyuAQAAAAAAyK4BAAAAAAADAACAAAAAAEtF
Uk5FTDMyLkRMTABBRFZBUEkzMi5kbGwAaXBobHBhcGkuZGxsAFVTRVIzMi5kbGwAV0lOSU5F
VC5kbGwAV1MyXzMyLmRsbAAATG9hZExpYnJhcnlBAABHZXRQcm9jQWRkcmVzcwAARXhpdFBy
b2Nlc3MAAABSZWdDbG9zZUtleQAAAEdldE5ldHdvcmtQYXJhbXMAAHdzcHJpbnRmQQAAAElu
dGVybmV0R2V0Q29ubmVjdGVkU3RhdGUAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA=

--82584271--

From james.wasmuth at ed.ac.uk  Mon Mar 15 07:00:57 2004
From: james.wasmuth at ed.ac.uk (James Wasmuth)
Date: Mon Mar 15 07:14:48 2004
Subject: [Bioperl-l] Re: New: $gb->species->ncbi_taxid
In-Reply-To: <405329E9.4000805@ed.ac.uk>
References: <GAEDKMGOKFBLJPKCLKCCEEJODIAA.brian_osborne@cognia.com>
	<405329E9.4000805@ed.ac.uk>
Message-ID: <40559AF9.2030709@ed.ac.uk>

Dear All,

As no-one objected, I'm going ahead with the plan:


ORGANISM: unknown marine gamma proteobacterium NOR5
$genus = "not given"  and $species = "unknown marine gamma 
proteobacterium NOR5"

ORGANISM: uncultured gamma proteobacterium
$genus = "not given" and $species= "uncultured gamma proteobacterium" 
*** 
<http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Undef&id=86473&lvl=3&p=17&p=20&p=37&p=38&lin=f&keep=1&srchmode=3&unlock>*

ORGANISM: leaf litter basidiomycete sp. Collb2-39
$genus = "leaf", $species="litter", $subspecies="basidiomycete sp. 
Collb2-39"

ORGANISM: Drosophila sp. 'white tip scutellum'
$genus = "Drosophila", $species="sp.", $subspecies="'white tip scutellum'"

At present: $genus = "Drosophila", $species="sp.", $subspecies="'white"


ORGANISM: marine bacterium HP3
$genus = "marine", $species="bacterium", $subspecies="HP3"

for the plant people, I'll make var. (varietas) similar to sp.

I'm not completely happy with the third example but if someone could 
suggest something better and robust, then I'll code it.

The organisation of the taxonomy is suitably complex, so I think that 
the user should be given some credit, and if they see that the genus is 
"marine", then they can investigate further...


-james


James Wasmuth wrote:

> Brian,
>
>> Yes, I could read your patch but I'm lazy. You said:
>>  
>>
>>>> create a Bio::Species object, but the genus=unknown species=marine
>>>>     
>>>
>> subspecies=gamma.
>>
> I was highlighting the problem my patch would create.  I hadn't 
> thought too hard about its consequences, but realise that it may have 
> some knock-on effects.
>
>>
>> Shouldn't the values be the same for all these "species" for which 
>> the genus
>> is not known? Like:
>>
>> Genus=unknown, species=unknown, subspecies=unknown
>>
>> That way you can check, since one can no longer use "unless defined
>> $species_object" to see if real species information is lacking or 
>> not. Have
>> I missed something here?
>>
>
> NCBI taxonomy considers the term 'unknown marine gamma proteobacterium 
> NOR5' to represent a species, though for this example there remaining 
> taxonomy classification is awarded no rank until 'class'.
> So one possible fix, would be:
>
> if (ORGANISM ne "synonym for taxid 32644")    {
>    then add rest of name into $species.
> }
>
> therefore $genus = unknown  and $species = marine gamma 
> proteobacterium NOR5
>
> other problems are organisms such as "leaf litter basidiomycete sp. 
> Collb2-39".   Currently $genus = leaf, $species = litter and $subsp = 
> basidiomycete.  Perhaps $subsp should contain everything left over?  
> Thoughts?  Also does anyone know off hand if it copes with 'varietas' 
> and 'var.' for plants?  I expect not.  I  will have a look at the  
> genbank.pm on Monday and suggest a patch.  I expect these issues will 
> also be pertinent in embl.pm and other database format modules...
>
>
> hmmmm, anyone with thoughts?
>
> -james
>
>
>
>
>
>
>
>>
>> Brian O.
>>
>>
>> -----Original Message-----
>> From: James Wasmuth [mailto:james.wasmuth@ed.ac.uk]
>> Sent: Thursday, March 11, 2004 9:40 AM
>> To: Brian Osborne
>> Cc: bioperl-guts-l@bioperl.org
>> Subject: Re: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid
>>
>> Brian and all at bioperl-guts,
>>
>>
>> below is the comment I've added to the bug[1600].  I think it may need
>> some discussion, but the patch I've added works to the extent that it
>> allows creation of a Bio::Species object but the subsequent genus,
>> species, subspecies calls will be 'wrong'.  Personally I'm more
>> concerned with the taxid, which I think will be sufficient.
>>
>> If you want to see the size of this problem go to NCBI taxonomy and
>> enter the term identified as a token set!  I think that maintaining the
>> taxid is enough, otherwise the artifical split of terms such as
>> **unidentified diatom endosymbiont of Peridinium foliaceum*
>> <http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Undef&id=42247 
>>
>> &lvl=3&lin=f&keep=1&srchmode=3&unlock>*
>> may be a problem, though some of them are intuitive.
>>
>> One last question, I've never tried to fix a bug before, so I've
>> commited a patch as an attachment to Bugzilla for the bug.  Do others
>> check this and if okay place it in the code...
>> apologies for the newbie bit...
>>
>> -james
>>
>>
>>
>> genbank.pm
>>
>> line 1123: return unless $genus and  $genus !~ /^(Unknown|None)$/oi;
>>
>> a number of species are described as Unknown blah blah blah.
>>
>> The NCBI taxid assigned to unknown taxa is 32644 and has a number of
>> synonyms, none of which are 'unknown'.
>>
>> The list includes: other, unknown organism, not specified, not shown,
>> unspecified, Unknown, None, unclassified , unidentified organism
>>
>> I've changed the _read_GenBank_Species subroutine to allow organism
>> names such as 'unknown marine gamma proteobacterium NOR5'.  This will
>> create a Bio::Species object, but the genus=unknown species=marine
>> subspecies=gamma.
>>
>> There is a whole host of species names that ignore the nice rules in
>> _read_GenBank_Species!  However this fix will allow the correct taxid to
>> be provided which I think is more than the name!
>>
>>
>>
>> sub _read_GenBank_Species {
>>    my( $self,$buffer) = @_;
>>    my @organell_names = ("chloroplast", "mitochondr");
>>     # only those carrying DNA, apart from the nucleus
>>
>>    #CHANGE
>>     my @unkn_names=("other", 'unknown organism', 'not specified', 'not
>> shown', 'Unspecified', 'Unknown', 'None', 'unclassified', 'unidentified
>> organism');
>>
>>    $_ = $$buffer;
>>
>>    my( $sub_species, $species, $genus, $common, $organelle, @class,
>> $ns_name );
>>    # upon first entering the loop, we must not read a new line -- the
>> SOURCE
>>    # line is already in the buffer (HL 05/10/2000)
>>    while (defined($_) || defined($_ = $self->_readline())) {
>>    # de-HTMLify (links that may be encountered here don't contain
>>    # escaped '>', so a simple-minded approach suffices)
>>        s/<[^>]+>//g;
>>    if (/^SOURCE\s+(.*)/o) {
>>        # FIXME this is probably mostly wrong (e.g., it yields things 
>> like
>>        # Homo sapiens adult placenta cDNA to mRNA
>>        # which is certainly not what you want)
>>        $common = $1;
>>        $common =~ s/\.$//; # remove trailing dot
>>    } elsif (/^\s{2}ORGANISM/o) {
>>        my @spflds = split(' ', $_);
>>            ($ns_name) = $_ =~ /\w+\s+(.*)/o;
>>        shift(@spflds); # ORGANISM
>>
>>         if(grep { $_ =~ /^$spflds[0]/i; } @organell_names) {
>>        $organelle = shift(@spflds);
>>        }
>>            $genus = shift(@spflds);
>>        if(@spflds) {
>>        $species = shift(@spflds);
>>        } elsif ( grep { $genus } @unkn_names){
>>        $species = '';
>>        } else {$species='sp.';}      #there's no species name but it
>> isn't unclassified
>>        $sub_species = shift(@spflds) if(@spflds);
>>        } elsif (/^\s+(.+)/o) {
>>        # only split on ';' or '.' so that
>>        # classification that is 2 words will
>>        # still get matched
>>        # use map to remove trailing/leading spaces
>>            push(@class, map { s/^\s+//; s/\s+$//; $_; } split /[;\.]+/,
>> $1);
>>        } else {
>>            last;
>>        }
>>
>>        $_ = undef; # Empty $_ to trigger read of next line
>>    }
>>
>>     $$buffer = $_;
>>
>>     # Don't make a species object if it's empty or "Unknown" or "None"
>>    my $unkn = grep { $_ =~ /^$genus$species/i; } @unkn_names;
>>
>>     return unless $genus and  $unkn==0;
>>
>>     # Bio::Species array needs array in Species -> Kingdom direction
>>    if ($class[0] eq 'Viruses') {
>>        push( @class, $ns_name );
>>    }
>>    elsif ($class[$#class] eq $genus) {
>>        push( @class, $species );
>>    } else {
>>        push( @class, $genus, $species );
>>    }
>>    @class = reverse @class;
>>
>>    my $make = Bio::Species->new();
>>    $make->classification( \@class, "FORCE" ); # no name validation 
>> please
>>    $make->common_name( $common      ) if $common;
>>    unless ($class[-1] eq 'Viruses') {
>>        $make->sub_species( $sub_species ) if $sub_species;
>>    }
>>    $make->organelle($organelle) if $organelle;
>>    return $make;
>> }
>>
>>
>>
>>
>> Brian Osborne wrote:
>>
>>  
>>
>>> James,
>>>
>>> Your guess is right, no Species is made because of the name. That's 
>>> because
>>> genbank.pm normally looks at:
>>>
>>> ORGANISM Bos taurus
>>>
>>> And makes "Bos" the genus, and so on.
>>>
>>> If it sees:
>>>
>>> ORGANISM Unknown
>>>
>>> It refuses to make a Species object, and it's interpreting your 
>>> ORGANISM
>>> line in the same way because it can't make a valid genus, that's the
>>>   
>>
>> current
>>  
>>
>>> rule. Personally I'd say that I agree with its principle - how can 
>>> we make
>>>   
>>
>> a
>>  
>>
>>> Species object without genus and species?
>>>
>>> You can get the taxid from a SeqFeature object, you already knew that.
>>>
>>> Brian O.
>>>
>>>
>>> -----Original Message-----
>>> From: bioperl-guts-l-bounces@portal.open-bio.org
>>> [mailto:bioperl-guts-l-bounces@portal.open-bio.org]On Behalf Of
>>> bugzilla-daemon@portal.open-bio.org
>>> Sent: Thursday, March 11, 2004 4:21 AM
>>> To: bioperl-guts-l@bioperl.org
>>> Subject: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid
>>>
>>> http://bugzilla.bioperl.org/show_bug.cgi?id=1600
>>>
>>>          Summary: $gb->species->ncbi_taxid
>>>          Product: Bioperl
>>>          Version: unspecified
>>>         Platform: PC
>>>       OS/Version: Linux
>>>           Status: NEW
>>>         Severity: normal
>>>         Priority: P2
>>>        Component: Bio::SeqIO
>>>       AssignedTo: bioperl-guts-l@bioperl.org
>>>       ReportedBy: james.wasmuth@ed.ac.uk
>>>
>>>
>>> I've included a genbank file for which I have been unable to extract 
>>> the
>>> ncbi_taxid for using
>>>
>>> $gb->species->ncbi_taxid
>>>
>>> the error is:
>>> Can't call method "ncbi_taxid" on an undefined value
>>>
>>> infact I don't get a Bio::Species object.  I'm sure its because of the
>>>   
>>
>> name,
>>  
>>
>>> which is correct.
>>>
>>> I've tried looking into it, but could not find which Seq object 
>>> creates the
>>> Bio::Species object.
>>>
>>>
>>>
>>> LOCUS       AY007676                1389 bp    DNA     linear   BCT
>>> 29-OCT-2001
>>> DEFINITION  Unknown marine gamma proteobacterium NOR5 16S ribosomal 
>>> RNA,
>>>           partial sequence.
>>> ACCESSION   AY007676
>>> VERSION     AY007676.1  GI:12000362
>>> KEYWORDS    .
>>> SOURCE      unknown marine gamma proteobacterium NOR5
>>> ORGANISM  unknown marine gamma proteobacterium NOR5
>>>           Bacteria; Proteobacteria; Gammaproteobacteria.
>>> REFERENCE   1  (bases 1 to 1389)
>>> AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Glockner,F.O., 
>>> Gerdts,G.
>>> and
>>>           Amann,R.
>>> TITLE     Isolation of novel pelagic bacteria from the German bight and
>>> their
>>>           seasonal contributions to surface picoplankton
>>> JOURNAL   Appl. Environ. Microbiol. 67 (11), 5134-5142 (2001)
>>> MEDLINE   21536174
>>>  PUBMED   11679337
>>> REFERENCE   2  (bases 1 to 1389)
>>> AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
>>>   
>>
>> Gerdts,G.,
>>  
>>
>>>           Schuett,C. and Amann,R.
>>> TITLE     Identification and seasonal dominance of culturable marine
>>> bacteria
>>> JOURNAL   Unpublished
>>> REFERENCE   3  (bases 1 to 1389)
>>> AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
>>>   
>>
>> Gerdts,G.,
>>  
>>
>>>           Schuett,C. and Amann,R.
>>> TITLE     Direct Submission
>>> JOURNAL   Submitted (29-AUG-2000) Molecular Ecology,
>>>   
>>
>> Max-Planck-Institute,
>>  
>>
>>>           Celsiusstrasse 1, Bremen 28359, Germany
>>> FEATURES             Location/Qualifiers
>>>    source          1..1389
>>>                    /organism="unknown marine gamma proteobacterium 
>>> NOR5"
>>>                    /mol_type="genomic DNA"
>>>                    /db_xref="taxon:145658"
>>>    rRNA            <1..>1389
>>>                    /product="16S ribosomal RNA"
>>> BASE COUNT      343 a    319 c    453 g    274 t
>>> ORIGIN
>>>       1 cgcgaaagta cttcggtatg agtagagcgg cggacgggtg agtaacgcgt 
>>> aggaatctat
>>>      61 ccagtagtgg gggacaactc ggggaaactc gagctaatac cgcatacgtc 
>>> ctaagggaga
>>>     121 aagcggggga tcttcggacc tcgcgctatt ggaggagcct gcgttggatt 
>>> agctagttgg
>>>     181 tggggtaaag gcctaccaag gcgacgatcc atagctggtc tgagaggatg 
>>> atcagccaca
>>>     241 ccgggactga gacacggccc ggactcctac gggaggcagc agtggggaat 
>>> attgcgcaat
>>>     301 gggcgaaagc ctgacgcagc catgccgcgt gtgtgaagaa ggccttcggg 
>>> ttgtaaagca
>>>     361 ctttcaattg ggaagaaagg ttagtagtta ataactgcta gctgtgacat 
>>> tacctttaga
>>>     421 agaagcaccg gctaactccg tgccagcagc cgcggtaata cggaggtgcg 
>>> agcgttaatc
>>>     481 ggaattactg ggcgtaaagc gcgcgtaggc ggtctgttaa gtcggatgtg 
>>> aaagccccgg
>>>     541 gctcaacctg ggaattgcac ccgatactgg ccgactggag tgcgagagag 
>>> ggaggtagaa
>>>     601 ttccacgtgt agcggtgaaa tgcgtagata tgtggaggaa taccggtggc 
>>> gaaggcggcc
>>>     661 tcctggctcg acactgacgc tgaggtgcga aagcgtgggg agcaaacagg 
>>> attagatacc
>>>     721 ctggtagtcc acgccgtaaa cgatgtctac tagccgttgg gagacttgat 
>>> ttcttggtgg
>>>     781 cgaagttaac gcgataagta gaccgcctgg ggagtacggc cgcaaggtta 
>>> aaactcaaat
>>>     841 gaattgacgg gggcccgcac aagcggtgga gcatgtggtt taattcgatg 
>>> caacgcgaag
>>>     901 aaccttacca ggccttgaca tcctaggaat cctgtagaga tacgggagtg 
>>> ccttcgggaa
>>>     961 tctagtgaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt 
>>> gggttaagtc
>>>    1021 ccgtaacgag cgcaaccctt gtccttagtt gccagcgcgt aatggcggga 
>>> actctaagga
>>>    1081 gactgccggt gacaaaccgg aggaaggtgg ggacgacgtc aagtcatcat 
>>> ggcccttacg
>>>    1141 gcctgggcta cacacgtgct acaatggaac gcacagaggg cagcaaaccc 
>>> gcgaggggga
>>>    1201 gcgaatccca caaaacgttt cgtagtccgg atcggagtct gcaactcgac 
>>> tccgtgaagt
>>>    1261 cggaatcgct agtaatcgtg aatcagaatg tcacggtgaa tacgttcccg 
>>> ggccttgtac
>>>    1321 acaccgcccg tcacaccatg ggagtgggtt gctccagaag tggttagcct 
>>> aaccttcggg
>>>    1381 agggcgatc
>>> //
>>>
>>>
>>>
>>> ------- You are receiving this mail because: -------
>>> You are the assignee for the bug, or are watching the assignee.
>>> _______________________________________________
>>> Bioperl-guts-l mailing list
>>> Bioperl-guts-l@portal.open-bio.org
>>> http://portal.open-bio.org/mailman/listinfo/bioperl-guts-l
>>>
>>>
>>>
>>>
>>>   
>>
>>
>> -- 
>> "I have not failed. I've just found 10,000 ways that don't work."
>>               --- Thomas Edison
>>
>> Nematode Bioinformatics           ||
>> Blaxter Nematode Genomics Group   ||
>> School of Biological Sciences     ||
>> Ashworth Laboratories             ||
>> King's Buildings                  ||    tel: +44 131 650 7403
>> University of Edinburgh           ||    web: www.nematodes.org
>> Edinburgh                         ||
>> EH9 3JT                           ||
>> UK                                ||
>>
>>
>>  
>>
>

-- 
"I have not failed. I've just found 10,000 ways that don't work."
               --- Thomas Edison

Nematode Bioinformatics           ||
Blaxter Nematode Genomics Group   ||
School of Biological Sciences     ||
Ashworth Laboratories             ||	
King's Buildings                  ||    tel: +44 131 650 7403
University of Edinburgh           ||    web: www.nematodes.org
Edinburgh                         ||
EH9 3JT                           ||
UK                                ||	

From heikki at ebi.ac.uk  Mon Mar 15 07:59:58 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Mon Mar 15 08:05:29 2004
Subject: [Bioperl-l] Re: New: $gb->species->ncbi_taxid
In-Reply-To: <40559AF9.2030709@ed.ac.uk>
References: <GAEDKMGOKFBLJPKCLKCCEEJODIAA.brian_osborne@cognia.com>
	<405329E9.4000805@ed.ac.uk> <40559AF9.2030709@ed.ac.uk>
Message-ID: <200403151259.59721.heikki@ebi.ac.uk>

James,

Would it make sense to try to always leave genus() undef when the species name 
is tentativie and does not follow binomial rule? That would be then provide a 
way to test for "well behaving", standard species objects.

	-Heikki

On Monday 15 Mar 2004 12:00, James Wasmuth wrote:
> Dear All,
>
> As no-one objected, I'm going ahead with the plan:
>
>
> ORGANISM: unknown marine gamma proteobacterium NOR5
> $genus = "not given"  and $species = "unknown marine gamma
> proteobacterium NOR5"
>
> ORGANISM: uncultured gamma proteobacterium
> $genus = "not given" and $species= "uncultured gamma proteobacterium"
> ***
> <http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Undef&id=8647
>3&lvl=3&p=17&p=20&p=37&p=38&lin=f&keep=1&srchmode=3&unlock>*
>
> ORGANISM: leaf litter basidiomycete sp. Collb2-39
> $genus = "leaf", $species="litter", $subspecies="basidiomycete sp.
> Collb2-39"
>
> ORGANISM: Drosophila sp. 'white tip scutellum'
> $genus = "Drosophila", $species="sp.", $subspecies="'white tip scutellum'"
>
> At present: $genus = "Drosophila", $species="sp.", $subspecies="'white"
>
>
> ORGANISM: marine bacterium HP3
> $genus = "marine", $species="bacterium", $subspecies="HP3"
>
> for the plant people, I'll make var. (varietas) similar to sp.
>
> I'm not completely happy with the third example but if someone could
> suggest something better and robust, then I'll code it.
>
> The organisation of the taxonomy is suitably complex, so I think that
> the user should be given some credit, and if they see that the genus is
> "marine", then they can investigate further...
>
>
> -james
>
> James Wasmuth wrote:
> > Brian,
> >
> >> Yes, I could read your patch but I'm lazy. You said:
> >>>> create a Bio::Species object, but the genus=unknown species=marine
> >>
> >> subspecies=gamma.
> >
> > I was highlighting the problem my patch would create.  I hadn't
> > thought too hard about its consequences, but realise that it may have
> > some knock-on effects.
> >
> >> Shouldn't the values be the same for all these "species" for which
> >> the genus
> >> is not known? Like:
> >>
> >> Genus=unknown, species=unknown, subspecies=unknown
> >>
> >> That way you can check, since one can no longer use "unless defined
> >> $species_object" to see if real species information is lacking or
> >> not. Have
> >> I missed something here?
> >
> > NCBI taxonomy considers the term 'unknown marine gamma proteobacterium
> > NOR5' to represent a species, though for this example there remaining
> > taxonomy classification is awarded no rank until 'class'.
> > So one possible fix, would be:
> >
> > if (ORGANISM ne "synonym for taxid 32644")    {
> >    then add rest of name into $species.
> > }
> >
> > therefore $genus = unknown  and $species = marine gamma
> > proteobacterium NOR5
> >
> > other problems are organisms such as "leaf litter basidiomycete sp.
> > Collb2-39".   Currently $genus = leaf, $species = litter and $subsp =
> > basidiomycete.  Perhaps $subsp should contain everything left over?
> > Thoughts?  Also does anyone know off hand if it copes with 'varietas'
> > and 'var.' for plants?  I expect not.  I  will have a look at the
> > genbank.pm on Monday and suggest a patch.  I expect these issues will
> > also be pertinent in embl.pm and other database format modules...
> >
> >
> > hmmmm, anyone with thoughts?
> >
> > -james
> >
> >> Brian O.
> >>
> >>
> >> -----Original Message-----
> >> From: James Wasmuth [mailto:james.wasmuth@ed.ac.uk]
> >> Sent: Thursday, March 11, 2004 9:40 AM
> >> To: Brian Osborne
> >> Cc: bioperl-guts-l@bioperl.org
> >> Subject: Re: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid
> >>
> >> Brian and all at bioperl-guts,
> >>
> >>
> >> below is the comment I've added to the bug[1600].  I think it may need
> >> some discussion, but the patch I've added works to the extent that it
> >> allows creation of a Bio::Species object but the subsequent genus,
> >> species, subspecies calls will be 'wrong'.  Personally I'm more
> >> concerned with the taxid, which I think will be sufficient.
> >>
> >> If you want to see the size of this problem go to NCBI taxonomy and
> >> enter the term identified as a token set!  I think that maintaining the
> >> taxid is enough, otherwise the artifical split of terms such as
> >> **unidentified diatom endosymbiont of Peridinium foliaceum*
> >> <http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Undef&id=4
> >>2247
> >>
> >> &lvl=3&lin=f&keep=1&srchmode=3&unlock>*
> >> may be a problem, though some of them are intuitive.
> >>
> >> One last question, I've never tried to fix a bug before, so I've
> >> commited a patch as an attachment to Bugzilla for the bug.  Do others
> >> check this and if okay place it in the code...
> >> apologies for the newbie bit...
> >>
> >> -james
> >>
> >>
> >>
> >> genbank.pm
> >>
> >> line 1123: return unless $genus and  $genus !~ /^(Unknown|None)$/oi;
> >>
> >> a number of species are described as Unknown blah blah blah.
> >>
> >> The NCBI taxid assigned to unknown taxa is 32644 and has a number of
> >> synonyms, none of which are 'unknown'.
> >>
> >> The list includes: other, unknown organism, not specified, not shown,
> >> unspecified, Unknown, None, unclassified , unidentified organism
> >>
> >> I've changed the _read_GenBank_Species subroutine to allow organism
> >> names such as 'unknown marine gamma proteobacterium NOR5'.  This will
> >> create a Bio::Species object, but the genus=unknown species=marine
> >> subspecies=gamma.
> >>
> >> There is a whole host of species names that ignore the nice rules in
> >> _read_GenBank_Species!  However this fix will allow the correct taxid to
> >> be provided which I think is more than the name!
> >>
> >>
> >>
> >> sub _read_GenBank_Species {
> >>    my( $self,$buffer) = @_;
> >>    my @organell_names = ("chloroplast", "mitochondr");
> >>     # only those carrying DNA, apart from the nucleus
> >>
> >>    #CHANGE
> >>     my @unkn_names=("other", 'unknown organism', 'not specified', 'not
> >> shown', 'Unspecified', 'Unknown', 'None', 'unclassified', 'unidentified
> >> organism');
> >>
> >>    $_ = $$buffer;
> >>
> >>    my( $sub_species, $species, $genus, $common, $organelle, @class,
> >> $ns_name );
> >>    # upon first entering the loop, we must not read a new line -- the
> >> SOURCE
> >>    # line is already in the buffer (HL 05/10/2000)
> >>    while (defined($_) || defined($_ = $self->_readline())) {
> >>    # de-HTMLify (links that may be encountered here don't contain
> >>    # escaped '>', so a simple-minded approach suffices)
> >>        s/<[^>]+>//g;
> >>    if (/^SOURCE\s+(.*)/o) {
> >>        # FIXME this is probably mostly wrong (e.g., it yields things
> >> like
> >>        # Homo sapiens adult placenta cDNA to mRNA
> >>        # which is certainly not what you want)
> >>        $common = $1;
> >>        $common =~ s/\.$//; # remove trailing dot
> >>    } elsif (/^\s{2}ORGANISM/o) {
> >>        my @spflds = split(' ', $_);
> >>            ($ns_name) = $_ =~ /\w+\s+(.*)/o;
> >>        shift(@spflds); # ORGANISM
> >>
> >>         if(grep { $_ =~ /^$spflds[0]/i; } @organell_names) {
> >>        $organelle = shift(@spflds);
> >>        }
> >>            $genus = shift(@spflds);
> >>        if(@spflds) {
> >>        $species = shift(@spflds);
> >>        } elsif ( grep { $genus } @unkn_names){
> >>        $species = '';
> >>        } else {$species='sp.';}      #there's no species name but it
> >> isn't unclassified
> >>        $sub_species = shift(@spflds) if(@spflds);
> >>        } elsif (/^\s+(.+)/o) {
> >>        # only split on ';' or '.' so that
> >>        # classification that is 2 words will
> >>        # still get matched
> >>        # use map to remove trailing/leading spaces
> >>            push(@class, map { s/^\s+//; s/\s+$//; $_; } split /[;\.]+/,
> >> $1);
> >>        } else {
> >>            last;
> >>        }
> >>
> >>        $_ = undef; # Empty $_ to trigger read of next line
> >>    }
> >>
> >>     $$buffer = $_;
> >>
> >>     # Don't make a species object if it's empty or "Unknown" or "None"
> >>    my $unkn = grep { $_ =~ /^$genus$species/i; } @unkn_names;
> >>
> >>     return unless $genus and  $unkn==0;
> >>
> >>     # Bio::Species array needs array in Species -> Kingdom direction
> >>    if ($class[0] eq 'Viruses') {
> >>        push( @class, $ns_name );
> >>    }
> >>    elsif ($class[$#class] eq $genus) {
> >>        push( @class, $species );
> >>    } else {
> >>        push( @class, $genus, $species );
> >>    }
> >>    @class = reverse @class;
> >>
> >>    my $make = Bio::Species->new();
> >>    $make->classification( \@class, "FORCE" ); # no name validation
> >> please
> >>    $make->common_name( $common      ) if $common;
> >>    unless ($class[-1] eq 'Viruses') {
> >>        $make->sub_species( $sub_species ) if $sub_species;
> >>    }
> >>    $make->organelle($organelle) if $organelle;
> >>    return $make;
> >> }
> >>
> >> Brian Osborne wrote:
> >>> James,
> >>>
> >>> Your guess is right, no Species is made because of the name. That's
> >>> because
> >>> genbank.pm normally looks at:
> >>>
> >>> ORGANISM Bos taurus
> >>>
> >>> And makes "Bos" the genus, and so on.
> >>>
> >>> If it sees:
> >>>
> >>> ORGANISM Unknown
> >>>
> >>> It refuses to make a Species object, and it's interpreting your
> >>> ORGANISM
> >>> line in the same way because it can't make a valid genus, that's the
> >>
> >> current
> >>
> >>> rule. Personally I'd say that I agree with its principle - how can
> >>> we make
> >>
> >> a
> >>
> >>> Species object without genus and species?
> >>>
> >>> You can get the taxid from a SeqFeature object, you already knew that.
> >>>
> >>> Brian O.
> >>>
> >>>
> >>> -----Original Message-----
> >>> From: bioperl-guts-l-bounces@portal.open-bio.org
> >>> [mailto:bioperl-guts-l-bounces@portal.open-bio.org]On Behalf Of
> >>> bugzilla-daemon@portal.open-bio.org
> >>> Sent: Thursday, March 11, 2004 4:21 AM
> >>> To: bioperl-guts-l@bioperl.org
> >>> Subject: [Bioperl-guts-l] [Bug 1600] New: $gb->species->ncbi_taxid
> >>>
> >>> http://bugzilla.bioperl.org/show_bug.cgi?id=1600
> >>>
> >>>          Summary: $gb->species->ncbi_taxid
> >>>          Product: Bioperl
> >>>          Version: unspecified
> >>>         Platform: PC
> >>>       OS/Version: Linux
> >>>           Status: NEW
> >>>         Severity: normal
> >>>         Priority: P2
> >>>        Component: Bio::SeqIO
> >>>       AssignedTo: bioperl-guts-l@bioperl.org
> >>>       ReportedBy: james.wasmuth@ed.ac.uk
> >>>
> >>>
> >>> I've included a genbank file for which I have been unable to extract
> >>> the
> >>> ncbi_taxid for using
> >>>
> >>> $gb->species->ncbi_taxid
> >>>
> >>> the error is:
> >>> Can't call method "ncbi_taxid" on an undefined value
> >>>
> >>> infact I don't get a Bio::Species object.  I'm sure its because of the
> >>
> >> name,
> >>
> >>> which is correct.
> >>>
> >>> I've tried looking into it, but could not find which Seq object
> >>> creates the
> >>> Bio::Species object.
> >>>
> >>>
> >>>
> >>> LOCUS       AY007676                1389 bp    DNA     linear   BCT
> >>> 29-OCT-2001
> >>> DEFINITION  Unknown marine gamma proteobacterium NOR5 16S ribosomal
> >>> RNA,
> >>>           partial sequence.
> >>> ACCESSION   AY007676
> >>> VERSION     AY007676.1  GI:12000362
> >>> KEYWORDS    .
> >>> SOURCE      unknown marine gamma proteobacterium NOR5
> >>> ORGANISM  unknown marine gamma proteobacterium NOR5
> >>>           Bacteria; Proteobacteria; Gammaproteobacteria.
> >>> REFERENCE   1  (bases 1 to 1389)
> >>> AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Glockner,F.O.,
> >>> Gerdts,G.
> >>> and
> >>>           Amann,R.
> >>> TITLE     Isolation of novel pelagic bacteria from the German bight and
> >>> their
> >>>           seasonal contributions to surface picoplankton
> >>> JOURNAL   Appl. Environ. Microbiol. 67 (11), 5134-5142 (2001)
> >>> MEDLINE   21536174
> >>>  PUBMED   11679337
> >>> REFERENCE   2  (bases 1 to 1389)
> >>> AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
> >>
> >> Gerdts,G.,
> >>
> >>>           Schuett,C. and Amann,R.
> >>> TITLE     Identification and seasonal dominance of culturable marine
> >>> bacteria
> >>> JOURNAL   Unpublished
> >>> REFERENCE   3  (bases 1 to 1389)
> >>> AUTHORS   Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O.,
> >>
> >> Gerdts,G.,
> >>
> >>>           Schuett,C. and Amann,R.
> >>> TITLE     Direct Submission
> >>> JOURNAL   Submitted (29-AUG-2000) Molecular Ecology,
> >>
> >> Max-Planck-Institute,
> >>
> >>>           Celsiusstrasse 1, Bremen 28359, Germany
> >>> FEATURES             Location/Qualifiers
> >>>    source          1..1389
> >>>                    /organism="unknown marine gamma proteobacterium
> >>> NOR5"
> >>>                    /mol_type="genomic DNA"
> >>>                    /db_xref="taxon:145658"
> >>>    rRNA            <1..>1389
> >>>                    /product="16S ribosomal RNA"
> >>> BASE COUNT      343 a    319 c    453 g    274 t
> >>> ORIGIN
> >>>       1 cgcgaaagta cttcggtatg agtagagcgg cggacgggtg agtaacgcgt
> >>> aggaatctat
> >>>      61 ccagtagtgg gggacaactc ggggaaactc gagctaatac cgcatacgtc
> >>> ctaagggaga
> >>>     121 aagcggggga tcttcggacc tcgcgctatt ggaggagcct gcgttggatt
> >>> agctagttgg
> >>>     181 tggggtaaag gcctaccaag gcgacgatcc atagctggtc tgagaggatg
> >>> atcagccaca
> >>>     241 ccgggactga gacacggccc ggactcctac gggaggcagc agtggggaat
> >>> attgcgcaat
> >>>     301 gggcgaaagc ctgacgcagc catgccgcgt gtgtgaagaa ggccttcggg
> >>> ttgtaaagca
> >>>     361 ctttcaattg ggaagaaagg ttagtagtta ataactgcta gctgtgacat
> >>> tacctttaga
> >>>     421 agaagcaccg gctaactccg tgccagcagc cgcggtaata cggaggtgcg
> >>> agcgttaatc
> >>>     481 ggaattactg ggcgtaaagc gcgcgtaggc ggtctgttaa gtcggatgtg
> >>> aaagccccgg
> >>>     541 gctcaacctg ggaattgcac ccgatactgg ccgactggag tgcgagagag
> >>> ggaggtagaa
> >>>     601 ttccacgtgt agcggtgaaa tgcgtagata tgtggaggaa taccggtggc
> >>> gaaggcggcc
> >>>     661 tcctggctcg acactgacgc tgaggtgcga aagcgtgggg agcaaacagg
> >>> attagatacc
> >>>     721 ctggtagtcc acgccgtaaa cgatgtctac tagccgttgg gagacttgat
> >>> ttcttggtgg
> >>>     781 cgaagttaac gcgataagta gaccgcctgg ggagtacggc cgcaaggtta
> >>> aaactcaaat
> >>>     841 gaattgacgg gggcccgcac aagcggtgga gcatgtggtt taattcgatg
> >>> caacgcgaag
> >>>     901 aaccttacca ggccttgaca tcctaggaat cctgtagaga tacgggagtg
> >>> ccttcgggaa
> >>>     961 tctagtgaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt
> >>> gggttaagtc
> >>>    1021 ccgtaacgag cgcaaccctt gtccttagtt gccagcgcgt aatggcggga
> >>> actctaagga
> >>>    1081 gactgccggt gacaaaccgg aggaaggtgg ggacgacgtc aagtcatcat
> >>> ggcccttacg
> >>>    1141 gcctgggcta cacacgtgct acaatggaac gcacagaggg cagcaaaccc
> >>> gcgaggggga
> >>>    1201 gcgaatccca caaaacgttt cgtagtccgg atcggagtct gcaactcgac
> >>> tccgtgaagt
> >>>    1261 cggaatcgct agtaatcgtg aatcagaatg tcacggtgaa tacgttcccg
> >>> ggccttgtac
> >>>    1321 acaccgcccg tcacaccatg ggagtgggtt gctccagaag tggttagcct
> >>> aaccttcggg
> >>>    1381 agggcgatc
> >>> //
> >>>
> >>>
> >>>
> >>> ------- You are receiving this mail because: -------
> >>> You are the assignee for the bug, or are watching the assignee.
> >>> _______________________________________________
> >>> Bioperl-guts-l mailing list
> >>> Bioperl-guts-l@portal.open-bio.org
> >>> http://portal.open-bio.org/mailman/listinfo/bioperl-guts-l
> >>
> >> --
> >> "I have not failed. I've just found 10,000 ways that don't work."
> >>               --- Thomas Edison
> >>
> >> Nematode Bioinformatics           ||
> >> Blaxter Nematode Genomics Group   ||
> >> School of Biological Sciences     ||
> >> Ashworth Laboratories             ||
> >> King's Buildings                  ||    tel: +44 131 650 7403
> >> University of Edinburgh           ||    web: www.nematodes.org
> >> Edinburgh                         ||
> >> EH9 3JT                           ||
> >> UK                                ||

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From jrwang at ethome.net.tw  Mon Mar 15 10:50:20 2004
From: jrwang at ethome.net.tw (=?big5?B?pP26+7dz?=)
Date: Mon Mar 15 10:53:15 2004
Subject: [Bioperl-l] install bioperl 1.4 for window
Message-ID: <CGEAJKGPFGLEDLDPMGGBMEHBCDAA.jrwang@ethome.net.tw>

Hi:

I am using ActiveState perl 5.8.0 in my W2k machine. I am trying to install bioperl 1.4 into my computer.  I use PPM3 package manager to install it. I have problem when it hits the libxml-perl.   It said that it can't found in my repository (http://bioperl.org/DIST).
Any comment?


Weihsin Wan

From brian_osborne at cognia.com  Mon Mar 15 11:00:30 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Mon Mar 15 11:06:33 2004
Subject: [Bioperl-l] install bioperl 1.4 for window
In-Reply-To: <CGEAJKGPFGLEDLDPMGGBMEHBCDAA.jrwang@ethome.net.tw>
Message-ID: <GAEDKMGOKFBLJPKCLKCCGELMDIAA.brian_osborne@cognia.com>

Weihsin,

I don't use ActiveState myself but the basic answer is that you can add and
search additional repositories, apparently none of your repositories contain
libxml-perl. Take a look at this page for examples:

http://bioperl.org/Core/Latest/Installing_Bioperl_on_Windows_perl5.8.htm


Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of �����s
Sent: Monday, March 15, 2004 10:50 AM
To: bioperl-l@portal.open-bio.org
Subject: [Bioperl-l] install bioperl 1.4 for window

Hi:

I am using ActiveState perl 5.8.0 in my W2k machine. I am trying to install
bioperl 1.4 into my computer.  I use PPM3 package manager to install it. I
have problem when it hits the libxml-perl.   It said that it can't found in
my repository (http://bioperl.org/DIST).
Any comment?


Weihsin Wan

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From amrdavila at yahoo.com  Mon Mar 15 11:11:51 2004
From: amrdavila at yahoo.com (Alberto M R D�vila)
Date: Mon Mar 15 11:17:19 2004
Subject: [Bioperl-l] Gene Ontology - help request
Message-ID: <20040315161151.56321.qmail@web13910.mail.yahoo.com>

Hi,

I wonder to know anyone could kindly offer an example on which one and how
to use one of the BioPerl modules for GeneOntology ?

Basically, we have a EST db in MySQL, then would like to annotate our EST
db with the GeneOntologies available at "www.geneontology.org"... Please
note I am only asking for a working example, then I can base/guide on it
to write my own script...

Thanks a lot in advance,

Kind regards, Alberto


__________________________________
Do you Yahoo!?
Yahoo! Mail - More reliable, more storage, less spam
http://mail.yahoo.com
From cjfields at uiuc.edu  Mon Mar 15 11:24:03 2004
From: cjfields at uiuc.edu (Chris Fields)
Date: Mon Mar 15 11:30:09 2004
Subject: [Bioperl-l] RNA analysis
In-Reply-To: <Pine.LNX.4.50.0403131016520.32004-100000@tenero.duhs.duke. edu>
References: <1226365086.1079179503958.JavaMail.nobody@app2.ni.bg>
	<Pine.LNX.4.50.0403131016520.32004-100000@tenero.duhs.duke.edu>
Message-ID: <6.0.0.22.2.20040315100438.01c1e6d8@express.cites.uiuc.edu>

I agree with Jason.  Although MFOLD seems to be the most commonly used 
program, the Vienna package is the only program I know of that releases its 
code w/o strict licensing requirements.  And it comes with Perl interfaces 
to its programs, although these aren't related to Bioperl.  A potentiual 
problem is that modules may not compile under Perl 5.8, according to the 
web page (I personally haven't tried this out).

There are also interfaces to the Institute Pasteur's online Pise 
applications (versions of MFOLD and the Vienna package included) in 
Bioperl-Run.  Again, I haven't tried these out, although I plan on looking 
through them to see if they can also be used for locally installed programs.

I'm hoping to have something up in the next months on a couple of programs 
(RNAmotif and ERPIN), but an MFOLD module will probably take a little 
longer due to other commitments (i.e. my job, see below).  I think your 
best bet at the moment is using the Vienna package Perl modules.

Chris

At 09:18 AM 3/13/2004, Jason Stajich wrote:
>See ViennaRNA package from Ivo Hofacker's group and their RNAFold.pl
>script - there is a nice XS perl interface to the C library.
>
>http://www.tbi.univie.ac.at/~ivo/RNA/
>
>-jason
>
>On Sat, 13 Mar 2004, Vesko Baev wrote:
>
> > Is there a bioperl module(s) for RNA analysis: folding, free energy,
> > RNA/RNA align or modules that connect the script with ext. RNA progs?
> >
> > Thenks!
> > Vesko
> >
> > -----------------------------------------------------------------
> > http://zavedenia.gbg.bg - ???? ?? ?????? ???? ??????
>
>--
>Jason Stajich
>Duke University
>jason at cgt.mc.duke.edu
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l

__________________________________

Chris Fields - Postdoctoral Researcher
Lab of Dr. Robert Switzer

Address:

University of Illinois at Urbana-Champaign
Dept. of Biochemistry - 323 RAL
600 S. Mathews Ave.
Urbana, IL 61801

Phone : (217) 333-7098
Fax : (217) 244-5858 


From cjfields at uiuc.edu  Mon Mar 15 11:32:25 2004
From: cjfields at uiuc.edu (Chris Fields)
Date: Mon Mar 15 11:38:07 2004
Subject: [Bioperl-l] install bioperl 1.4 for window
In-Reply-To: <CGEAJKGPFGLEDLDPMGGBMEHBCDAA.jrwang@ethome.net.tw>
References: <CGEAJKGPFGLEDLDPMGGBMEHBCDAA.jrwang@ethome.net.tw>
Message-ID: <6.0.0.22.2.20040315102500.01c1a000@express.cites.uiuc.edu>

Make sure that you have added the following repository to PPM (using 'rep 
add repository_name repository_address'):

Randy Kobe's Perl PPM archives :
http://theoryx5.uwinnipeg.ca/ppms

Also, make sure that your  other repositories are on (they are normally 
included by default with PPM):

     Name: ActiveState Package Repository
http://ppm.ActiveState.com/PPM/ppmserver-5.8-windows.plex?urn:/PPM/Server/SQL
     Type: PPMServer 3.0

     Name: ActiveState PPM2 Repository
http://ppm.ActiveState.com/cgibin/PPM/ppmserver-5.8-windows.pl?urn:/PPMServer
     Type: PPMServer 2.0

I searched for libxml-perl (using ActiveState Perl 5.8.3) and found it in 
Randy Kobes archive (listed below as kobes):

====================
Package 1:
     Name: libxml-perl
  Version: 0.07
   Author: Ken MacLeod <ken@bitsko.slc.ut.us>
    Title: libxml-perl
Abstract: Collection of Perl modules for working with XML
Location: kobes
Prerequisites:
        1. XML-Parser 2.19
Available Platforms:
        1. MSWin32-x86-multi-thread-5.8
====================


chris

At 09:50 AM 3/15/2004, =?big5?B?pP26+7dz?= wrote:
>Hi:
>
>I am using ActiveState perl 5.8.0 in my W2k machine. I am trying to 
>install bioperl 1.4 into my computer.  I use PPM3 package manager to 
>install it. I have problem when it hits the libxml-perl.   It said that it 
>can't found in my repository (http://bioperl.org/DIST).
>Any comment?
>
>
>Weihsin Wan
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l

__________________________________

Chris Fields - Postdoctoral Researcher
Lab of Dr. Robert Switzer

Address:

University of Illinois at Urbana-Champaign
Dept. of Biochemistry - 323 RAL
600 S. Mathews Ave.
Urbana, IL 61801

Phone : (217) 333-7098
Fax : (217) 244-5858 

From iain.wallace at ucd.ie  Mon Mar 15 11:22:37 2004
From: iain.wallace at ucd.ie (Iain Wallace)
Date: Mon Mar 15 11:40:16 2004
Subject: [Bioperl-l] Remote Psiblast
Message-ID: <1079367757.18985.1.camel@localhost.localdomain>

Hi,

Does anyone know how I can run psiblast remotely using Bioperl ? 

Thanks 

Iain 


From MEC at Stowers-Institute.org  Mon Mar 15 15:43:14 2004
From: MEC at Stowers-Institute.org (Cook, Malcolm)
Date: Mon Mar 15 15:48:44 2004
Subject: [Bioperl-l] Bio::Tools::EPCR.pm patch request
Message-ID: <CED81D34E37D5043A1211565277A51E501491D47@exchkc02.stowers-institute.org>

Jason,

I'd like it if you would accept and apply the following diff to the very
useful Bio::Tools::EPCR module.   I hope you agree that it provides more
flexibility in defining the structure of the features that are produced
(and, ultimately for me, the format of the GFF which gets written before
being loaded into Bio::DB::GFF database).

I have made the behaviour completely backwards compatible. 

Sincerely,

Malcolm Cook - mec@stowers-institute.org - 816-926-4449
Database Applications Manager - Bioinformatics
Stowers Institute for Medical Research - Kansas City, MO  USA 

Index: EPCR.pm
===================================================================
RCS file: /home/repository/bioperl/bioperl-live/Bio/Tools/EPCR.pm,v
retrieving revision 1.8
diff -c -r1.8 EPCR.pm
*** EPCR.pm	2002/12/01 00:05:21	1.8
--- EPCR.pm	2004/03/15 20:27:04
***************
*** 92,110 ****
  =head2 new
  
   Title   : new
!  Usage   : my $epcr = new Bio::Tools::EPCR(-file => $file);
!  Function: Initializes a new EPCR parser
   Returns : Bio::Tools::EPCR
   Args    : -fh   => filehandle
             OR
             -file => filename
  
  =cut
  
  sub new {
    my($class,@args) = @_;
  
    my $self = $class->SUPER::new(@args);
    $self->_initialize_io(@args);
    
    return $self;
--- 92,120 ----
  =head2 new
  
   Title   : new
!  Usage   : my $epcr = new Bio::Tools::EPCR(-file => $file, -primary =>
$fprimary, -source => $fsource, -groupclass => $fgroupclass);
!  Function: Initializes a new EPCR parser.  
   Returns : Bio::Tools::EPCR
   Args    : -fh   => filehandle
             OR
             -file => filename
  
+            -primary => a string to be used as the common value for
each features '-primary' tag.  Defaults to 'sts'.  (This in turn maps to
the GFF 'type' tag (aka 'method')).
+ 
+            -source => a string to be used as the common value for each
features '-source' tag.  Defaults to 'e-PCR'. (This in turn maps to the
GFF 'source' tag)
+ 
+            -groupclass => a string to be used as the name of the tag
which will hold the sts marker namefirst attribute.  Defaults to 'name'.
+ 
  =cut
  
  sub new {
    my($class,@args) = @_;
  
    my $self = $class->SUPER::new(@args);
+   my ($primary, $source, $groupclass) = $self->_rearrange([qw(PRIMARY
SOURCE GROUPCLASS)],@args);
+   $self->{'_primary'} = $primary || 'sts';
+   $self->{'_source'} =  $source || 'e-PCR';
+   $self->{'_groupclass'} =  $groupclass || 'name';
    $self->_initialize_io(@args);
    
    return $self;
***************
*** 139,152 ****
      my $markerfeature = new Bio::SeqFeature::Generic ( '-start'   =>
$start,
  						       '-end'     =>
$end,
  						       '-strand'  =>
$strand,
! 						       '-source'  =>
'e-PCR',
! 						       '-primary' =>
'sts',
  						       '-seq_id'  =>
$seqname,
  						       '-tag'     => {
! 							   'name'=>
$mkrname,
! 							   'note'=>
$rest,
  						       });
      return $markerfeature;
  }
  
  1;
--- 149,163 ----
      my $markerfeature = new Bio::SeqFeature::Generic ( '-start'   =>
$start,
  						       '-end'     =>
$end,
  						       '-strand'  =>
$strand,
! 						       '-source'  =>
$self->{'_source'},
! 						       '-primary' =>
$self->{'_primary'},
  						       '-seq_id'  =>
$seqname,
  						       '-tag'     => {
!
$self->{'_groupclass'}=> $mkrname,
  						       });
+     $markerfeature->add_tag_value('note',$rest) if $rest;
      return $markerfeature;
+ 
  }
  
  1;

From barry.moore at genetics.utah.edu  Mon Mar 15 15:08:29 2004
From: barry.moore at genetics.utah.edu (Barry Moore)
Date: Mon Mar 15 15:50:57 2004
Subject: [Bioperl-l] install bioperl 1.4 for window
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCGELMDIAA.brian_osborne@cognia.com>
References: <GAEDKMGOKFBLJPKCLKCCGELMDIAA.brian_osborne@cognia.com>
Message-ID: <40560D3D.9090809@genetics.utah.edu>

Weihsin,

Brian is right, and you should be able to get the modules you need by
adding this repository:

repository add "Winnipeg_server" http://theoryx5.uwinnipeg.ca/ppms

but if you still have trouble you don't really need all the modules in
lib-xml, so you could go to http://bioperl.org/Core/Latest/INSTALL. Look
at the bottom of that document in the section on dependencies, make
yourself a list of the XML:: dependencies, and intstall those.

Barry

Brian Osborne wrote:

>Weihsin,
>
>I don't use ActiveState myself but the basic answer is that you can add and
>search additional repositories, apparently none of your repositories contain
>libxml-perl. Take a look at this page for examples:
>
>http://bioperl.org/Core/Latest/Installing_Bioperl_on_Windows_perl5.8.htm
>
>
>Brian O.
>
>-----Original Message-----
>From: bioperl-l-bounces@portal.open-bio.org
>[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of �����s
>Sent: Monday, March 15, 2004 10:50 AM
>To: bioperl-l@portal.open-bio.org
>Subject: [Bioperl-l] install bioperl 1.4 for window
>
>Hi:
>
>I am using ActiveState perl 5.8.0 in my W2k machine. I am trying to install
>bioperl 1.4 into my computer.  I use PPM3 package manager to install it. I
>have problem when it hits the libxml-perl.   It said that it can't found in
>my repository (http://bioperl.org/DIST).
>Any comment?
>
>
>Weihsin Wan
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>  
>

-- 
Barry Moore
Dept. of Human Genetics
University of Utah
Salt Lake City, UT

From jason at cgt.duhs.duke.edu  Mon Mar 15 16:28:38 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Mon Mar 15 16:34:16 2004
Subject: [Bioperl-l] Re: Bio::Tools::EPCR.pm patch request
In-Reply-To: <CED81D34E37D5043A1211565277A51E501491D47@exchkc02.stowers-institute.org>
References: <CED81D34E37D5043A1211565277A51E501491D47@exchkc02.stowers-institute.org>
Message-ID: <Pine.LNX.4.50.0403151627060.22860-100000@tenero.duhs.duke.edu>

done!

I added the ability to the update the primary,source,groupclass fields
using get/set methods as well. I also capitalized the 'Note' field to be
in line with the way GFF3 looks.

-j
On Mon, 15 Mar 2004, Cook, Malcolm wrote:

> Jason,
>
> I'd like it if you would accept and apply the following diff to the very
> useful Bio::Tools::EPCR module.   I hope you agree that it provides more
> flexibility in defining the structure of the features that are produced
> (and, ultimately for me, the format of the GFF which gets written before
> being loaded into Bio::DB::GFF database).
>
> I have made the behaviour completely backwards compatible.
>
> Sincerely,
>
> Malcolm Cook - mec@stowers-institute.org - 816-926-4449
> Database Applications Manager - Bioinformatics
> Stowers Institute for Medical Research - Kansas City, MO  USA
>
> Index: EPCR.pm
> ===================================================================
> RCS file: /home/repository/bioperl/bioperl-live/Bio/Tools/EPCR.pm,v
> retrieving revision 1.8
> diff -c -r1.8 EPCR.pm
> *** EPCR.pm	2002/12/01 00:05:21	1.8
> --- EPCR.pm	2004/03/15 20:27:04
> ***************
> *** 92,110 ****
>   =head2 new
>
>    Title   : new
> !  Usage   : my $epcr = new Bio::Tools::EPCR(-file => $file);
> !  Function: Initializes a new EPCR parser
>    Returns : Bio::Tools::EPCR
>    Args    : -fh   => filehandle
>              OR
>              -file => filename
>
>   =cut
>
>   sub new {
>     my($class,@args) = @_;
>
>     my $self = $class->SUPER::new(@args);
>     $self->_initialize_io(@args);
>
>     return $self;
> --- 92,120 ----
>   =head2 new
>
>    Title   : new
> !  Usage   : my $epcr = new Bio::Tools::EPCR(-file => $file, -primary =>
> $fprimary, -source => $fsource, -groupclass => $fgroupclass);
> !  Function: Initializes a new EPCR parser.
>    Returns : Bio::Tools::EPCR
>    Args    : -fh   => filehandle
>              OR
>              -file => filename
>
> +            -primary => a string to be used as the common value for
> each features '-primary' tag.  Defaults to 'sts'.  (This in turn maps to
> the GFF 'type' tag (aka 'method')).
> +
> +            -source => a string to be used as the common value for each
> features '-source' tag.  Defaults to 'e-PCR'. (This in turn maps to the
> GFF 'source' tag)
> +
> +            -groupclass => a string to be used as the name of the tag
> which will hold the sts marker namefirst attribute.  Defaults to 'name'.
> +
>   =cut
>
>   sub new {
>     my($class,@args) = @_;
>
>     my $self = $class->SUPER::new(@args);
> +   my ($primary, $source, $groupclass) = $self->_rearrange([qw(PRIMARY
> SOURCE GROUPCLASS)],@args);
> +   $self->{'_primary'} = $primary || 'sts';
> +   $self->{'_source'} =  $source || 'e-PCR';
> +   $self->{'_groupclass'} =  $groupclass || 'name';
>     $self->_initialize_io(@args);
>
>     return $self;
> ***************
> *** 139,152 ****
>       my $markerfeature = new Bio::SeqFeature::Generic ( '-start'   =>
> $start,
>   						       '-end'     =>
> $end,
>   						       '-strand'  =>
> $strand,
> ! 						       '-source'  =>
> 'e-PCR',
> ! 						       '-primary' =>
> 'sts',
>   						       '-seq_id'  =>
> $seqname,
>   						       '-tag'     => {
> ! 							   'name'=>
> $mkrname,
> ! 							   'note'=>
> $rest,
>   						       });
>       return $markerfeature;
>   }
>
>   1;
> --- 149,163 ----
>       my $markerfeature = new Bio::SeqFeature::Generic ( '-start'   =>
> $start,
>   						       '-end'     =>
> $end,
>   						       '-strand'  =>
> $strand,
> ! 						       '-source'  =>
> $self->{'_source'},
> ! 						       '-primary' =>
> $self->{'_primary'},
>   						       '-seq_id'  =>
> $seqname,
>   						       '-tag'     => {
> !
> $self->{'_groupclass'}=> $mkrname,
>   						       });
> +     $markerfeature->add_tag_value('note',$rest) if $rest;
>       return $markerfeature;
> +
>   }
>
>   1;
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From bob_freeman at hms.harvard.edu  Mon Mar 15 16:45:52 2004
From: bob_freeman at hms.harvard.edu (Bob Freeman)
Date: Mon Mar 15 16:51:29 2004
Subject: [Bioperl-l] determining orthologs...
In-Reply-To: <6.0.0.22.2.20040315100438.01c1e6d8@express.cites.uiuc.edu>
References: <1226365086.1079179503958.JavaMail.nobody@app2.ni.bg>
	<Pine.LNX.4.50.0403131016520.32004-100000@tenero.duhs.duke.edu>
	<6.0.0.22.2.20040315100438.01c1e6d8@express.cites.uiuc.edu>
Message-ID: <p0602040cbc7bd3980094@[192.168.123.100]>

What are BioPerl developers doing at present to determine putative 
orthologs of a gene? BLINK? Unigene? LocusLink? And what about for 
new/hypothetical genes?

Tx,
Bob

-- 

-----------------------------------------------------
Bob Freeman, Ph.D.
Acorn Worm Informatics
Dept of Systems Biology, C1-514A
Harvard Medical School
240 Longwood Avenue
Boston, MA  02115
617/699.7057, vox
From redwards at utmem.edu  Mon Mar 15 17:03:10 2004
From: redwards at utmem.edu (Rob Edwards)
Date: Mon Mar 15 17:08:41 2004
Subject: [Bioperl-l] determining orthologs...
In-Reply-To: <p0602040cbc7bd3980094@[192.168.123.100]>
References: <1226365086.1079179503958.JavaMail.nobody@app2.ni.bg>
	<Pine.LNX.4.50.0403131016520.32004-100000@tenero.duhs.duke.edu>
	<6.0.0.22.2.20040315100438.01c1e6d8@express.cites.uiuc.edu>
	<p0602040cbc7bd3980094@[192.168.123.100]>
Message-ID: <8C7A989C-76CC-11D8-9108-000A959E1622@utmem.edu>

I wrote a wrapper for BLINK. At the moment it doesn't return very 
bioperly results, but it worked for what I needed.

Take a look and hack it: http://www.salmonella.org/bioperl/

I think the answer depends on what you are trying to do...

Rob


On Mar 15, 2004, at 3:45 PM, Bob Freeman wrote:

> What are BioPerl developers doing at present to determine putative 
> orthologs of a gene? BLINK? Unigene? LocusLink? And what about for 
> new/hypothetical genes?
>
> Tx,
> Bob
>
> -- 
>
> -----------------------------------------------------
> Bob Freeman, Ph.D.
> Acorn Worm Informatics
> Dept of Systems Biology, C1-514A
> Harvard Medical School
> 240 Longwood Avenue
> Boston, MA  02115
> 617/699.7057, vox
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From MEC at Stowers-Institute.org  Mon Mar 15 18:02:52 2004
From: MEC at Stowers-Institute.org (Cook, Malcolm)
Date: Mon Mar 15 18:08:21 2004
Subject: [Bioperl-l] RE: Bio::Tools::EPCR.pm patch request
Message-ID: <CED81D34E37D5043A1211565277A51E501491D4D@exchkc02.stowers-institute.org>

Jason,

Wow!  You the man!  That was fast.  I see it in the cvs change log
already...

But...

Would you mind reinstating the tiny additional change I made to only
produce the note tag value if $rest is present.  I'm using it in an
applicatoin where there are no addition columns in my e-PCR input file,
and am getting an empty note attribute on each GFF line....

There may be a better way of coding it than:
	
 $markerfeature->add_tag_value('Note',$rest) if $rest;

Thanks,

Malcolm

>-----Original Message-----
>From: Jason Stajich [mailto:jason@cgt.duhs.duke.edu] 
>Sent: Monday, March 15, 2004 3:29 PM
>To: Cook, Malcolm
>Cc: Bioperl
>Subject: Re: Bio::Tools::EPCR.pm patch request
>
>
>done!
>
>I added the ability to the update the primary,source,groupclass fields
>using get/set methods as well. I also capitalized the 'Note' 
>field to be
>in line with the way GFF3 looks.
>
>-j
>On Mon, 15 Mar 2004, Cook, Malcolm wrote:
>
>> Jason,
>>
>> I'd like it if you would accept and apply the following diff 
>to the very
>> useful Bio::Tools::EPCR module.   I hope you agree that it 
>provides more
>> flexibility in defining the structure of the features that 
>are produced
>> (and, ultimately for me, the format of the GFF which gets 
>written before
>> being loaded into Bio::DB::GFF database).
>>
>> I have made the behaviour completely backwards compatible.
>>
>> Sincerely,
>>
>> Malcolm Cook - mec@stowers-institute.org - 816-926-4449
>> Database Applications Manager - Bioinformatics
>> Stowers Institute for Medical Research - Kansas City, MO  USA
>>
>> Index: EPCR.pm
>> ===================================================================
>> RCS file: /home/repository/bioperl/bioperl-live/Bio/Tools/EPCR.pm,v
>> retrieving revision 1.8
>> diff -c -r1.8 EPCR.pm
>> *** EPCR.pm	2002/12/01 00:05:21	1.8
>> --- EPCR.pm	2004/03/15 20:27:04
>> ***************
>> *** 92,110 ****
>>   =head2 new
>>
>>    Title   : new
>> !  Usage   : my $epcr = new Bio::Tools::EPCR(-file => $file);
>> !  Function: Initializes a new EPCR parser
>>    Returns : Bio::Tools::EPCR
>>    Args    : -fh   => filehandle
>>              OR
>>              -file => filename
>>
>>   =cut
>>
>>   sub new {
>>     my($class,@args) = @_;
>>
>>     my $self = $class->SUPER::new(@args);
>>     $self->_initialize_io(@args);
>>
>>     return $self;
>> --- 92,120 ----
>>   =head2 new
>>
>>    Title   : new
>> !  Usage   : my $epcr = new Bio::Tools::EPCR(-file => $file, 
>-primary =>
>> $fprimary, -source => $fsource, -groupclass => $fgroupclass);
>> !  Function: Initializes a new EPCR parser.
>>    Returns : Bio::Tools::EPCR
>>    Args    : -fh   => filehandle
>>              OR
>>              -file => filename
>>
>> +            -primary => a string to be used as the common value for
>> each features '-primary' tag.  Defaults to 'sts'.  (This in 
>turn maps to
>> the GFF 'type' tag (aka 'method')).
>> +
>> +            -source => a string to be used as the common 
>value for each
>> features '-source' tag.  Defaults to 'e-PCR'. (This in turn 
>maps to the
>> GFF 'source' tag)
>> +
>> +            -groupclass => a string to be used as the name 
>of the tag
>> which will hold the sts marker namefirst attribute.  
>Defaults to 'name'.
>> +
>>   =cut
>>
>>   sub new {
>>     my($class,@args) = @_;
>>
>>     my $self = $class->SUPER::new(@args);
>> +   my ($primary, $source, $groupclass) = 
>$self->_rearrange([qw(PRIMARY
>> SOURCE GROUPCLASS)],@args);
>> +   $self->{'_primary'} = $primary || 'sts';
>> +   $self->{'_source'} =  $source || 'e-PCR';
>> +   $self->{'_groupclass'} =  $groupclass || 'name';
>>     $self->_initialize_io(@args);
>>
>>     return $self;
>> ***************
>> *** 139,152 ****
>>       my $markerfeature = new Bio::SeqFeature::Generic ( 
>'-start'   =>
>> $start,
>>   						       '-end'     =>
>> $end,
>>   						       '-strand'  =>
>> $strand,
>> ! 						       '-source'  =>
>> 'e-PCR',
>> ! 						       '-primary' =>
>> 'sts',
>>   						       '-seq_id'  =>
>> $seqname,
>>   						       '-tag'     => {
>> ! 							   'name'=>
>> $mkrname,
>> ! 							   'note'=>
>> $rest,
>>   						       });
>>       return $markerfeature;
>>   }
>>
>>   1;
>> --- 149,163 ----
>>       my $markerfeature = new Bio::SeqFeature::Generic ( 
>'-start'   =>
>> $start,
>>   						       '-end'     =>
>> $end,
>>   						       '-strand'  =>
>> $strand,
>> ! 						       '-source'  =>
>> $self->{'_source'},
>> ! 						       '-primary' =>
>> $self->{'_primary'},
>>   						       '-seq_id'  =>
>> $seqname,
>>   						       '-tag'     => {
>> !
>> $self->{'_groupclass'}=> $mkrname,
>>   						       });
>> +     $markerfeature->add_tag_value('note',$rest) if $rest;
>>       return $markerfeature;
>> +
>>   }
>>
>>   1;
>>
>
>--
>Jason Stajich
>Duke University
>jason at cgt.mc.duke.edu
>

From paulo.david at netvisao.pt  Mon Mar 15 18:05:13 2004
From: paulo.david at netvisao.pt (Paulo Almeida)
Date: Mon Mar 15 18:10:43 2004
Subject: [Bioperl-l] determining orthologs...
In-Reply-To: <p0602040cbc7bd3980094@[192.168.123.100]>
References: <1226365086.1079179503958.JavaMail.nobody@app2.ni.bg>	<Pine.LNX.4.50.0403131016520.32004-100000@tenero.duhs.duke.edu>	<6.0.0.22.2.20040315100438.01c1e6d8@express.cites.uiuc.edu>
	<p0602040cbc7bd3980094@[192.168.123.100]>
Message-ID: <405636A9.8050502@netvisao.pt>

For new genes, maybe running a remote blast and setting 'expect', or 
'percent identity', and 'minimum length' cut-offs (only consider a 
possible ortholog if it spans, say, 70% of the query sequence).

-Paulo Almeida

Bob Freeman wrote:

> What are BioPerl developers doing at present to determine putative 
> orthologs of a gene? BLINK? Unigene? LocusLink? And what about for 
> new/hypothetical genes?
>
> Tx,
> Bob


From rangwala at cs.umn.edu  Mon Mar 15 18:29:44 2004
From: rangwala at cs.umn.edu (Huzefa Rangwala)
Date: Mon Mar 15 18:35:11 2004
Subject: [Bioperl-l] How to get the PSSm from blastpgp or PSI-Blast run
Message-ID: <Pine.LNX.4.58.0403151728230.28636@glu08>


Hello

Is there any command by which I can get the PSSM(Position Specific Scoring
Matrices) in ASCII format. Note blastpgp with -C option creates the matrix
in binary.

Regards
Huzefa

--------------------------------------------------------------------------
Huzefa Rangwala		   	   Email:rangwala@cs.umn.edu
Research Assistant,		   Tel: 612-625-3796
Dept. of Computer Sci & Engg.	   URL: http://www.cs.umn.edu/~rangwala
University of Minnesota
--------------------------------------------------------------------------


From amackey at pcbi.upenn.edu  Mon Mar 15 21:44:49 2004
From: amackey at pcbi.upenn.edu (Aaron J. Mackey)
Date: Mon Mar 15 21:50:15 2004
Subject: [Bioperl-l] How to get the PSSm from blastpgp or PSI-Blast run
In-Reply-To: <Pine.LNX.4.58.0403151728230.28636@glu08>
References: <Pine.LNX.4.58.0403151728230.28636@glu08>
Message-ID: <E4D07BA8-76F3-11D8-8BB8-000A958C5008@pcbi.upenn.edu>


-Q checkpoint.txt

-Aaron

On Mar 15, 2004, at 6:29 PM, Huzefa Rangwala wrote:

>
> Hello
>
> Is there any command by which I can get the PSSM(Position Specific  
> Scoring
> Matrices) in ASCII format. Note blastpgp with -C option creates the  
> matrix
> in binary.
>
> Regards
> Huzefa
>
> ----------------------------------------------------------------------- 
> ---
> Huzefa Rangwala		   	   Email:rangwala@cs.umn.edu
> Research Assistant,		   Tel: 612-625-3796
> Dept. of Computer Sci & Engg.	   URL: http://www.cs.umn.edu/~rangwala
> University of Minnesota
> ----------------------------------------------------------------------- 
> ---
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

From postmaster at ebi.ac.uk  Tue Mar 16 01:42:35 2004
From: postmaster at ebi.ac.uk (MailScanner)
Date: Tue Mar 16 01:48:02 2004
Subject: [Bioperl-l] Warning: E-mail containing attachment(s) blocked
Message-ID: <200403160642.i2G6gZD28597@maui.ebi.ac.uk>

Our MailScanner has blocked a message that you sent:-
  To: senger@ebi.ac.uk
  Subject: Re: Document
  Date: Tue Mar 16 06:42:35 2004

This is because it contains one or more attachments that we do not allow.
Please rename any attachments to be in the form: filename.clean and
resend the message (don't forget to tell the recipient the attachment type).

If you have not actually sent this email it is possible that it has been
forged (so that it appeared to be from you).

MailScanner Report:
 (Text.zip)


-- 
MailScanner
Email Virus Scanner
www.mailscanner.info
From birney at ebi.ac.uk  Tue Mar 16 04:07:10 2004
From: birney at ebi.ac.uk (Ewan Birney)
Date: Tue Mar 16 04:13:01 2004
Subject: [Bioperl-l] determining orthologs...
In-Reply-To: <p0602040cbc7bd3980094@[192.168.123.100]>
Message-ID: <Pine.OSX.4.44.0403160905240.8021-100000@ewan-birneys-computer.local>


On Mon, 15 Mar 2004, Bob Freeman wrote:

> What are BioPerl developers doing at present to determine putative
> orthologs of a gene? BLINK? Unigene? LocusLink? And what about for
> new/hypothetical genes?

Just to mention Bob, Ensembl, which is not a Bioperl thing per-se,
calculates orthologs across its clades (eg, all vertebrates) which
includes a dn/ds estimation for close species (eg, inter-mammal) from
PAML, where you can get out dn, ds, k (that's Kappa, not K :)) etc from
the database.

This is all accessible in the underlying databases and in the Mart data
mining tool (go to Ensembl --> Human --> Data mining and have a play...)


>
> Tx,
> Bob
>
> --
>
> -----------------------------------------------------
> Bob Freeman, Ph.D.
> Acorn Worm Informatics
> Dept of Systems Biology, C1-514A
> Harvard Medical School
> 240 Longwood Avenue
> Boston, MA  02115
> 617/699.7057, vox
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

From j.abbott at imperial.ac.uk  Tue Mar 16 09:28:30 2004
From: j.abbott at imperial.ac.uk (James Abbott)
Date: Tue Mar 16 09:33:57 2004
Subject: [Bioperl-l] Parsing EMBL DR lines with 1 accession
Message-ID: <40570F0E.5020905@imperial.ac.uk>

Greetings bioperlers...

I have been using Bio::SeqIO to parse EMBL files, and noticed that some 
of the database cross-references (DR lines) were missing from the 
returned RichSeq object. The missing references were to the GOA 
database, which only have a primary id - the secondary id/accession 
usually found in (for example) swissprot/trembl references is missing 
i.e. (from EMBL:AE000562)

DR   GOA; O25226.
DR   GOA; P96551.
DR   SPTREMBL; O25217; O25217.
DR   SPTREMBL; O25218; O25218.

These SPTREMBL cross references are parsed fine, however the GOA 
references are skipped. Looking at the code in question in 
Bio::SeqIO::embl, although there is provision for dbxrefs with a single 
id, the regex requires a trailing ';' after the primary accession. I 
have included a diff against embl.pm v 1.72 (see below...) which alters 
this regex to optionally match the second ';', allowing the id in DR 
lines with a single accesion to be parsed as the primary accession.

Writing these entries back out again using Bio::SeqIO::embl results in 
these DR lines appearing as:

DR   GOA; 025226; .

however the examples given in the EMBL user manual, and all those I've 
found in EMBL, lack this second ';' and following whitespace present. 
The second modification in the diff modifies the behaviour of 
Bio::SeqIO::embl when there is  secondary accession present, ensuring 
that the DR line is written out as

DR   GOA; 025226.

Cheers,
James

-- 
Dr. James Abbott <j.abbott@imperial.ac.uk>
Bioinformatics Software Developer, Bioinformatics Support Service
Imperial College, London


*** embl.pm     Tue Mar 16 13:52:35 2004
--- embl1.72.pm Tue Mar 16 13:00:00 2004
***************
*** 594,600 ****
                     my $prim    = $dr->primary_id;
                     my $opt     = $dr->optional_id || '';

!                   my $line = $opt ? "$db_name; $prim; $opt." : 
"$db_name; $prim.";
                     $self->_write_line_EMBL_regex("DR   ", "DR   ", 
$line, '\s+|$', 80); #'
                 }
                 $self->_print("XX\n");
--- 594,600 ----
                     my $prim    = $dr->primary_id;
                     my $opt     = $dr->optional_id || '';

!                   my $line = "$db_name; $prim; $opt.";
                     $self->_write_line_EMBL_regex("DR   ", "DR   ", 
$line, '\s+|$', 80); #'
                 }
                 $self->_print("XX\n");
***************
*** 919,925 ****
       while (defined( $_ ||= $self->_readline )) {

           if (my($databse, $prim_id, $sec_id)
!                 = /^DR   ([^\s;]+);\s*([^\s;]+);?\s*([^\s;]+)?\.$/) {
               my $link = Bio::Annotation::DBLink->new();
               $link->database   ( $databse );
               $link->primary_id ( $prim_id );
--- 919,925 ----
       while (defined( $_ ||= $self->_readline )) {

           if (my($databse, $prim_id, $sec_id)
!                 = /^DR   ([^\s;]+);\s*([^\s;]+);\s*([^\s;]+)?\.$/) {
               my $link = Bio::Annotation::DBLink->new();
               $link->database   ( $databse );
               $link->primary_id ( $prim_id );


From jason at cgt.duhs.duke.edu  Tue Mar 16 11:39:39 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar 16 11:45:20 2004
Subject: [Bioperl-l] RE: Bio::Tools::EPCR.pm patch request
In-Reply-To: <CED81D34E37D5043A1211565277A51E501491D4D@exchkc02.stowers-institute.org>
References: <CED81D34E37D5043A1211565277A51E501491D4D@exchkc02.stowers-institute.org>
Message-ID: <Pine.LNX.4.50.0403161139040.31661-100000@tenero.duhs.duke.edu>

oops - my fault - I missed that accidently - had trouble applying the
patch so was doing it by hand.  will do.

-jason


On Mon, 15 Mar 2004, Cook, Malcolm wrote:

> Jason,
>
> Wow!  You the man!  That was fast.  I see it in the cvs change log
> already...
>
> But...
>
> Would you mind reinstating the tiny additional change I made to only
> produce the note tag value if $rest is present.  I'm using it in an
> applicatoin where there are no addition columns in my e-PCR input file,
> and am getting an empty note attribute on each GFF line....
>
> There may be a better way of coding it than:
>
>  $markerfeature->add_tag_value('Note',$rest) if $rest;
>
> Thanks,
>
> Malcolm
>
> >-----Original Message-----
> >From: Jason Stajich [mailto:jason@cgt.duhs.duke.edu]
> >Sent: Monday, March 15, 2004 3:29 PM
> >To: Cook, Malcolm
> >Cc: Bioperl
> >Subject: Re: Bio::Tools::EPCR.pm patch request
> >
> >
> >done!
> >
> >I added the ability to the update the primary,source,groupclass fields
> >using get/set methods as well. I also capitalized the 'Note'
> >field to be
> >in line with the way GFF3 looks.
> >
> >-j
> >On Mon, 15 Mar 2004, Cook, Malcolm wrote:
> >
> >> Jason,
> >>
> >> I'd like it if you would accept and apply the following diff
> >to the very
> >> useful Bio::Tools::EPCR module.   I hope you agree that it
> >provides more
> >> flexibility in defining the structure of the features that
> >are produced
> >> (and, ultimately for me, the format of the GFF which gets
> >written before
> >> being loaded into Bio::DB::GFF database).
> >>
> >> I have made the behaviour completely backwards compatible.
> >>
> >> Sincerely,
> >>
> >> Malcolm Cook - mec@stowers-institute.org - 816-926-4449
> >> Database Applications Manager - Bioinformatics
> >> Stowers Institute for Medical Research - Kansas City, MO  USA
> >>
> >> Index: EPCR.pm
> >> ===================================================================
> >> RCS file: /home/repository/bioperl/bioperl-live/Bio/Tools/EPCR.pm,v
> >> retrieving revision 1.8
> >> diff -c -r1.8 EPCR.pm
> >> *** EPCR.pm	2002/12/01 00:05:21	1.8
> >> --- EPCR.pm	2004/03/15 20:27:04
> >> ***************
> >> *** 92,110 ****
> >>   =head2 new
> >>
> >>    Title   : new
> >> !  Usage   : my $epcr = new Bio::Tools::EPCR(-file => $file);
> >> !  Function: Initializes a new EPCR parser
> >>    Returns : Bio::Tools::EPCR
> >>    Args    : -fh   => filehandle
> >>              OR
> >>              -file => filename
> >>
> >>   =cut
> >>
> >>   sub new {
> >>     my($class,@args) = @_;
> >>
> >>     my $self = $class->SUPER::new(@args);
> >>     $self->_initialize_io(@args);
> >>
> >>     return $self;
> >> --- 92,120 ----
> >>   =head2 new
> >>
> >>    Title   : new
> >> !  Usage   : my $epcr = new Bio::Tools::EPCR(-file => $file,
> >-primary =>
> >> $fprimary, -source => $fsource, -groupclass => $fgroupclass);
> >> !  Function: Initializes a new EPCR parser.
> >>    Returns : Bio::Tools::EPCR
> >>    Args    : -fh   => filehandle
> >>              OR
> >>              -file => filename
> >>
> >> +            -primary => a string to be used as the common value for
> >> each features '-primary' tag.  Defaults to 'sts'.  (This in
> >turn maps to
> >> the GFF 'type' tag (aka 'method')).
> >> +
> >> +            -source => a string to be used as the common
> >value for each
> >> features '-source' tag.  Defaults to 'e-PCR'. (This in turn
> >maps to the
> >> GFF 'source' tag)
> >> +
> >> +            -groupclass => a string to be used as the name
> >of the tag
> >> which will hold the sts marker namefirst attribute.
> >Defaults to 'name'.
> >> +
> >>   =cut
> >>
> >>   sub new {
> >>     my($class,@args) = @_;
> >>
> >>     my $self = $class->SUPER::new(@args);
> >> +   my ($primary, $source, $groupclass) =
> >$self->_rearrange([qw(PRIMARY
> >> SOURCE GROUPCLASS)],@args);
> >> +   $self->{'_primary'} = $primary || 'sts';
> >> +   $self->{'_source'} =  $source || 'e-PCR';
> >> +   $self->{'_groupclass'} =  $groupclass || 'name';
> >>     $self->_initialize_io(@args);
> >>
> >>     return $self;
> >> ***************
> >> *** 139,152 ****
> >>       my $markerfeature = new Bio::SeqFeature::Generic (
> >'-start'   =>
> >> $start,
> >>   						       '-end'     =>
> >> $end,
> >>   						       '-strand'  =>
> >> $strand,
> >> ! 						       '-source'  =>
> >> 'e-PCR',
> >> ! 						       '-primary' =>
> >> 'sts',
> >>   						       '-seq_id'  =>
> >> $seqname,
> >>   						       '-tag'     => {
> >> ! 							   'name'=>
> >> $mkrname,
> >> ! 							   'note'=>
> >> $rest,
> >>   						       });
> >>       return $markerfeature;
> >>   }
> >>
> >>   1;
> >> --- 149,163 ----
> >>       my $markerfeature = new Bio::SeqFeature::Generic (
> >'-start'   =>
> >> $start,
> >>   						       '-end'     =>
> >> $end,
> >>   						       '-strand'  =>
> >> $strand,
> >> ! 						       '-source'  =>
> >> $self->{'_source'},
> >> ! 						       '-primary' =>
> >> $self->{'_primary'},
> >>   						       '-seq_id'  =>
> >> $seqname,
> >>   						       '-tag'     => {
> >> !
> >> $self->{'_groupclass'}=> $mkrname,
> >>   						       });
> >> +     $markerfeature->add_tag_value('note',$rest) if $rest;
> >>       return $markerfeature;
> >> +
> >>   }
> >>
> >>   1;
> >>
> >
> >--
> >Jason Stajich
> >Duke University
> >jason at cgt.mc.duke.edu
> >
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From jason at cgt.duhs.duke.edu  Tue Mar 16 11:41:22 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar 16 11:46:52 2004
Subject: [Bioperl-l] Parsing EMBL DR lines with 1 accession
In-Reply-To: <40570F0E.5020905@imperial.ac.uk>
References: <40570F0E.5020905@imperial.ac.uk>
Message-ID: <Pine.LNX.4.50.0403161141000.31661-100000@tenero.duhs.duke.edu>

james - please submit this as bug to http://bugzilla.open-bio.org so we
can keep track of when it gets fixed.

-jason
On Tue, 16 Mar 2004, James Abbott wrote:

> Greetings bioperlers...
>
> I have been using Bio::SeqIO to parse EMBL files, and noticed that some
> of the database cross-references (DR lines) were missing from the
> returned RichSeq object. The missing references were to the GOA
> database, which only have a primary id - the secondary id/accession
> usually found in (for example) swissprot/trembl references is missing
> i.e. (from EMBL:AE000562)
>
> DR   GOA; O25226.
> DR   GOA; P96551.
> DR   SPTREMBL; O25217; O25217.
> DR   SPTREMBL; O25218; O25218.
>
> These SPTREMBL cross references are parsed fine, however the GOA
> references are skipped. Looking at the code in question in
> Bio::SeqIO::embl, although there is provision for dbxrefs with a single
> id, the regex requires a trailing ';' after the primary accession. I
> have included a diff against embl.pm v 1.72 (see below...) which alters
> this regex to optionally match the second ';', allowing the id in DR
> lines with a single accesion to be parsed as the primary accession.
>
> Writing these entries back out again using Bio::SeqIO::embl results in
> these DR lines appearing as:
>
> DR   GOA; 025226; .
>
> however the examples given in the EMBL user manual, and all those I've
> found in EMBL, lack this second ';' and following whitespace present.
> The second modification in the diff modifies the behaviour of
> Bio::SeqIO::embl when there is  secondary accession present, ensuring
> that the DR line is written out as
>
> DR   GOA; 025226.
>
> Cheers,
> James
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From jason at cgt.duhs.duke.edu  Tue Mar 16 12:30:24 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar 16 12:35:54 2004
Subject: [Bioperl-l] Re: a bug in the package Bio::Tools::BPlite::Sbjct.
In-Reply-To: <BEE28BF86078B6429D6C780635718E21419FB9@morelia.be.devgen.com>
References: <BEE28BF86078B6429D6C780635718E21419FB9@morelia.be.devgen.com>
Message-ID: <Pine.LNX.4.50.0403161229440.31661-100000@tenero.duhs.duke.edu>

I applied the fix.

On Tue, 16 Mar 2004, Frederic Pecqueur wrote:

> Hi everybody,
>
> I Found a bug in this package.
> Sometimes, when I use $hsp->nextHSP, the $hsp->query->end value is null.
> I always have this error for the last hit.
>
> I solved the problem.
> You need to change the line 235 by :
>
>       elsif ($_ =~ /^>|^Histogram|^Searching|^Parameters|^\s+Database:|^CPU\stime|^\s*Lambda|^\s*Subset/)
>
> For example, if you look in the array @hspline for the last hit of one entry, you can see :
>
> Query: 41  PATLLMLLLKVNKDGPAKKDIWRAPGNQAQVRKLSQVMQHGRLVNIENFTVYTAASVIKK 100
>            P+ L   L  + K G   + I R   +  +V +  Q  + G+     +   +     IK
> Sbjct: 176 PSFLEKALQFIEKYGTKIEGILRQSADVEEVERRVQEYEQGKTEFTFDEDPHVVGDCIKH 235
> Query: 101 FLSKLPNGIFGRDNEETLFNSASTGMDIEKQRQVFYRIFGSLPVASQHLLVLLFGTFRVV 160
>             L +LP+          L  +        +   +   I  + P  ++ LL  +      +
> Sbjct: 236 VLRELPSSPVSASCCTALLEAYRIESKEARISSLRSAIAETFPEPNRRLLQRILKMMHTI 295
> Query: 161 ADSSDGHSNAMNPNAIAISVAPSL 184
>              SS  H N MNPNA+A  +AP L
> Sbjct: 296 --SSHSHENRMNPNAVAACMAPLL 317
>   Subset of the database(s) listed below
>      Number of letters searched: 46,987,626
>      Number of sequences searched:  126,777
>
> If you add "|^\s*Subset" in the test, you should obtain :
>
> Query: 41  PATLLMLLLKVNKDGPAKKDIWRAPGNQAQVRKLSQVMQHGRLVNIENFTVYTAASVIKK 100
>            P+ L   L  + K G   + I R   +  +V +  Q  + G+     +   +     IK
> Sbjct: 176 PSFLEKALQFIEKYGTKIEGILRQSADVEEVERRVQEYEQGKTEFTFDEDPHVVGDCIKH 235
> Query: 101 FLSKLPNGIFGRDNEETLFNSASTGMDIEKQRQVFYRIFGSLPVASQHLLVLLFGTFRVV 160
>             L +LP+          L  +        +   +   I  + P  ++ LL  +      +
> Sbjct: 236 VLRELPSSPVSASCCTALLEAYRIESKEARISSLRSAIAETFPEPNRRLLQRILKMMHTI 295
> Query: 161 ADSSDGHSNAMNPNAIAISVAPSL 184
>              SS  H N MNPNA+A  +AP L
> Sbjct: 296 --SSHSHENRMNPNAVAACMAPLL 317
>
> I tested, with a large blast report..ans it works.
>
> Best regards.
> Fr?d?ric.
>
> Fr?d?ric PECQUEUR
> BioInformatic team
> Devgen nv
> Technologiepark 30
> B - 9052 Zwijnaarde
> Belgium
> Tel: +32 9 324 24 36
> Fax: +32 9 324 24 25
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu

From mgear911 at yahoo.com  Tue Mar 16 20:01:43 2004
From: mgear911 at yahoo.com (=?iso-8859-1?q?Yuhui?=)
Date: Tue Mar 16 20:07:09 2004
Subject: [Bioperl-l] Change SVG image to PNG or other image format. 
Message-ID: <20040317010143.84880.qmail@web13302.mail.yahoo.com>

Hi,
How do I change SVG image format to PNG format? btw
the svg file, when i click it, it displays a image
that is too big, and i cant find a way to show a
smaller SVG image except to zoom out manually. Is
there any way to make the svg image smaller when i
view it, and rotate it b4 i change it to png format?

Thanks.

__________________________________________________
Do You Yahoo!?
Log on to Messenger with your mobile phone!
http://sg.messenger.yahoo.com
From laurichj at bioinfo.ucr.edu  Tue Mar 16 21:39:12 2004
From: laurichj at bioinfo.ucr.edu (Josh Lauricha)
Date: Tue Mar 16 21:44:37 2004
Subject: [Bioperl-l] Bio::Index::AbstractSeq.pm
Message-ID: <20040317023912.GA12740@bioinfo.ucr.edu>

There's a small "feature" fo AbstractSeq.pm which will stomp on any
primary_id with the accession_number, if there are no objections (there
must have been some reason for this) I'll commit a small change to only
do this if there is no other primary id, that way the GI will be easily
gotten from the GenBank index.

-- 

------------------------------------------------------
| Josh Lauricha            | Ford, your turning into |
| laurichj@bioinfo.ucr.edu | a penguin. Stop it.     |
| Bioinformatics, UCR      |                         |
|----------------------------------------------------|
| OpenPG:                                            |
|  4E7D 0FC0 DB6C E91D 4D7B C7F3 9BE9 8740 E4DC 6184 |
|----------------------------------------------------|
From babenko at ncbi.nlm.nih.gov  Tue Mar 16 17:55:27 2004
From: babenko at ncbi.nlm.nih.gov (Babenko, Vladimir (NIH/NLM/NCBI))
Date: Tue Mar 16 23:15:28 2004
Subject: [Bioperl-l] phylip wrappers
Message-ID: <69BA0F938FAC6A4CBEF49461720696F2029F60AA@nihexchange16.nih.gov>

   Hi Shawn and bioperl community,

 
I cannot find the phylip run wrappers any more at
Bio::Tools::Run::Phylo::Phylip::SeqBoot.pm , Consens, Neighbor etc. at
current bioperl version.

   Do they migrate somewhere else while re-structuring?

      Thank you and sorry if that?s already discussed.

          Vladimir

-----------------

Vladimir Babenko

Bldg 38A 8S816L

National Center for Biotechnology Information

National Library Of Medicine

National Institute of Health

Bethesda, 20894

(v) 301-594-8079

(f) 301-480-4637

 
From shawnh at stanford.edu  Tue Mar 16 17:59:07 2004
From: shawnh at stanford.edu (Shawn Hoon)
Date: Tue Mar 16 23:15:55 2004
Subject: [Bioperl-l] Re: phylip wrappers
In-Reply-To: <69BA0F938FAC6A4CBEF49461720696F2029F60AA@nihexchange16.nih.gov>
References: <69BA0F938FAC6A4CBEF49461720696F2029F60AA@nihexchange16.nih.gov>
Message-ID: <87BCE853-779D-11D8-B0C2-000A95783436@stanford.edu>

This is under the bioperl-run package available at:
http://bioperl.org/DIST/current_run_stable.tar.gz

cheers,
shawn


On Mar 16, 2004, at 2:55 PM, Babenko, Vladimir (NIH/NLM/NCBI) wrote:

> ?? Hi Shawn and bioperl community,
>
> ?
>
> I cannot find the phylip run wrappers any more at 
> Bio::Tools::Run::Phylo::Phylip::SeqBoot.pm , Consens, Neighbor etc. at 
> current bioperl version.
>
> ?? Do they migrate somewhere else while re-structuring?
>
> ????? Thank you and sorry if that?s already discussed.
>
> ????????? Vladimir
>
> -----------------
>
> Vladimir Babenko
>
> Bldg 38A 8S816L
>
> National Center for Biotechnology Information
>
> National Library Of Medicine
>
> National Institute of Health
>
> Bethesda, 20894
>
> (v) 301-594-8079
>
> (f) 301-480-4637
>
> ?
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: text/enriched
Size: 1964 bytes
Desc: not available
Url : http://portal.open-bio.org/pipermail/bioperl-l/attachments/20040316/7165e02c/attachment.bin
From postmaster at mep-instruments.com.au  Wed Mar 17 02:57:43 2004
From: postmaster at mep-instruments.com.au (postmaster@mep-instruments.com.au)
Date: Wed Mar 17 03:19:42 2004
Subject: [Bioperl-l] MDaemon Warning - Virus Found
Message-ID: <MDAEMON1757200403171857.AA5743828@mep-instruments.com.au>

The following message had attachment(s) which contained viruses:

>From      : bioperl-l@bioperl.org
To        : bc@mep-instruments.com.au
Subject   : 
Date      : Wed, 17 Mar 2004 09:13:14 +0100
Message-ID: 

Attachment                    Virus name               Action taken
------------------------------------------------------------------------------
part2.pif                     I-Worm.Moodown.b         Removed


From sdavis2 at mail.nih.gov  Wed Mar 17 07:10:36 2004
From: sdavis2 at mail.nih.gov (Sean Davis)
Date: Wed Mar 17 07:16:14 2004
Subject: [Bioperl-l] phylip wrappers
References: <69BA0F938FAC6A4CBEF49461720696F2029F60AA@nihexchange16.nih.gov>
Message-ID: <004501c40c18$de4356c0$2f643744@WATSON>

Vladimir

It looks like phylip is in bioperl-run.

http://doc.bioperl.org/bioperl-run/

Sean
----- Original Message -----
From: "Babenko, Vladimir (NIH/NLM/NCBI)" <babenko@ncbi.nlm.nih.gov>
To: <shawnh@fugu-sg.org>
Cc: <bioperl-l@bioperl.org>
Sent: Tuesday, March 16, 2004 5:55 PM
Subject: [Bioperl-l] phylip wrappers


   Hi Shawn and bioperl community,


I cannot find the phylip run wrappers any more at
Bio::Tools::Run::Phylo::Phylip::SeqBoot.pm , Consens, Neighbor etc. at
current bioperl version.

   Do they migrate somewhere else while re-structuring?

      Thank you and sorry if that's already discussed.

          Vladimir

-----------------

Vladimir Babenko

Bldg 38A 8S816L

National Center for Biotechnology Information

National Library Of Medicine

National Institute of Health

Bethesda, 20894

(v) 301-594-8079

(f) 301-480-4637


----------------------------------------------------------------------------
----


> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l


From jreumers at vub.ac.be  Wed Mar 17 07:33:25 2004
From: jreumers at vub.ac.be (Joke Reumers)
Date: Wed Mar 17 07:38:36 2004
Subject: [Bioperl-l] dbSNP connection
Message-ID: <4921D179-780F-11D8-B07C-0030657E217C@vub.ac.be>

Hi,

I'm a new at using BioPerl and I wondered if there are any 
bioperl-modules that allow to connect to the SNP database at NCBI 
(dbSNP)? That would save me a lot of time..

Thanks,
Joke Reumers

+++++++++++++++++++++++++++++++++++++++
Joke Reumers
Switch Laboratory
Vrije Universiteit Brussel
Flemish Interuniversity Institute for Biotechnology
+++++++++++++++++++++++++++++++++++++++

From harris at cshl.edu  Wed Mar 17 07:49:14 2004
From: harris at cshl.edu (Todd Harris)
Date: Wed Mar 17 07:54:42 2004
Subject: [Bioperl-l] Change SVG image to PNG or other image format. 
In-Reply-To: <20040317010143.84880.qmail@web13302.mail.yahoo.com>
Message-ID: <BC7DA56A.CE0D%harris@cshl.org>

Yuhui - 

Currently there are no options for setting the size of the SVG image upon
generation.  This would be a nice enhancement.  Still, it's primary purpose
is to generate a high quality image, not for navigation on the web.

Why do you want to convert the SVG to PNG?  The default behavior of the
browser is to export PNG images.  There are several projects under
development to create rasterized images from vector data - see autotrace at
sourceforge for one.

Todd

> On 3/16/04 7:01 PM, Yuhui wrote:

> Hi,
> How do I change SVG image format to PNG format? btw
> the svg file, when i click it, it displays a image
> that is too big, and i cant find a way to show a
> smaller SVG image except to zoom out manually. Is
> there any way to make the svg image smaller when i
> view it, and rotate it b4 i change it to png format?
> 
> Thanks.
> 
> __________________________________________________
> Do You Yahoo!?
> Log on to Messenger with your mobile phone!
> http://sg.messenger.yahoo.com
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

From Annie.Law at nrc-cnrc.gc.ca  Wed Mar 17 11:50:07 2004
From: Annie.Law at nrc-cnrc.gc.ca (Law, Annie)
Date: Wed Mar 17 11:55:46 2004
Subject: [Bioperl-l] GO categories and load_ontology.pl
Message-ID: <10C94843061E094A98C02EB77CFC328722FE14@nrcmrdex1d.imsb.nrc.ca>

Hi,

I have used the load_ontology.pl script and bioperl-db to load GO
information into a database.
The input for this script includes the GO.defs file and each of the files
representing each of the ontologies ie. 
Molecular function, biological process, and cellular component.
All of this information is loaded into the table called 'term' in the
database.
It seems that most of the Entries in the term table are of Ontoloy Id = 1
(Gene ontology) and only around 200 entries molecular function, biological
process, and cellular component put together when there are about 16000
entries in the term table.  
This is only true if I load locuslink into the database.  If I don't load
locuslink into the database then all of the entries
In the database are of Ontology Id 1.

When the data is being entered into the database it is already known which
GO category the GO identifer fits into.
I need to know which category the GO id belongs to.   I would appreciate
some ideas on how I can get this information.
Is there an option in the script that I can use maybe theres another table
somewhere that has this information??

Thanks,
Annie.
From hlapp at gnf.org  Wed Mar 17 13:17:30 2004
From: hlapp at gnf.org (Hilmar Lapp)
Date: Wed Mar 17 13:22:59 2004
Subject: [Bioperl-l] GO categories and load_ontology.pl
In-Reply-To: <10C94843061E094A98C02EB77CFC328722FE14@nrcmrdex1d.imsb.nrc.ca>
Message-ID: <5AC3B114-783F-11D8-9599-000A959EB4C4@gnf.org>

Annie, I still owe you an answer for your earlier email. I haven't 
managed to get to that yet. See below for my response to this one.

On Wednesday, March 17, 2004, at 08:50  AM, Law, Annie wrote:

> It seems that most of the Entries in the term table are of Ontoloy Id 
> = 1
> (Gene ontology) and only around 200 entries molecular function, 
> biological
> process, and cellular component put together when there are about 16000
> entries in the term table.
> This is only true if I load locuslink into the database.

This is because LocusLink lags behind the latest version of GO in terms 
of the release that they use for annotating sequences. I.e., LocusLink 
uses some terms which have meanwhile been retired or obsoleted from GO. 
Depending on whether they are still in GO's .defs file, they won't be 
in your database if you chose to ignore obsoleted entries (which is not 
a bad choice at all per se), or they aren't part of GO anymore at all.

LocusLink doesn't give the ontology of GO terms (which would be 'Gene 
Ontology'); rather it gives the category. Because a term must have an 
ontology associated, the SeqIO LL parser interprets as the ontology 
what NCBI really meant to be the category.

You'd have the following choices to proceed.

	- Ignore the 200 entries which aren't in Gene Ontology. You're not 
going to miss a significant amount of your annotation, and it's 
annotation with obsoleted terms anyway.

	- Load GO including obsoleted terms, and see with how many non-Gene 
Ontology terms that would leave you. If it's a lot less than 200, you 
may just want to ignore the rest.

	- Build a SeqProcessor module (see Bio::Factory::SeqProcessorI and 
Bio::Seq::BaseSeqProcessor) which takes the seq objects as the LL 
parser returns them, goes in and retrieves all GO term annotations, and 
replaces the ontology for those with 'Gene Ontology.' Then you pass 
your SeqProcessor to load_seqdatabase.pl using the --pipeline 
command-line option (see the script's POD).

The last option may sound like but is really not a lot of work if you 
can program perl. Note, however, that then you still wouldn't have any 
relationships for those terms - they simply have been retired.

Depending on what your project is, just ignoring those 200 may be the 
most reasonable way to go.

	-hilmar
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From allenday at ucla.edu  Wed Mar 17 14:08:25 2004
From: allenday at ucla.edu (Allen Day)
Date: Wed Mar 17 14:13:50 2004
Subject: [Bioperl-l] dbSNP connection
In-Reply-To: <4921D179-780F-11D8-B07C-0030657E217C@vub.ac.be>
References: <4921D179-780F-11D8-B07C-0030657E217C@vub.ac.be>
Message-ID: <Pine.LNX.4.58.0403171108090.19083@sumo.genetics.ucla.edu>

See Bio::ClusterIO::dbsnp and Bio::Variation::SNP.

-Allen


On Wed, 17 Mar 2004, Joke Reumers wrote:

> Hi,
> 
> I'm a new at using BioPerl and I wondered if there are any 
> bioperl-modules that allow to connect to the SNP database at NCBI 
> (dbSNP)? That would save me a lot of time..
> 
> Thanks,
> Joke Reumers
> 
> +++++++++++++++++++++++++++++++++++++++
> Joke Reumers
> Switch Laboratory
> Vrije Universiteit Brussel
> Flemish Interuniversity Institute for Biotechnology
> +++++++++++++++++++++++++++++++++++++++
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
From amrdavila at yahoo.com  Wed Mar 17 14:51:38 2004
From: amrdavila at yahoo.com (Alberto M R Davila)
Date: Wed Mar 17 14:57:04 2004
Subject: [Bioperl-l] Clustal/T-Coffee to Fasta
In-Reply-To: <5AC3B114-783F-11D8-9599-000A959EB4C4@gnf.org>
Message-ID: <20040317195138.39516.qmail@web13907.mail.yahoo.com>

I was looking in to the SeqIO module trying to know if it could convert
multiple alignments done with ClustalW/X (*.aln files) or T-Coffee to
Fasta sequences into a single file... no luck... there would be any other
Bioperl module to do that format conversion ?

Thanks, Alberto


__________________________________
Do you Yahoo!?
Yahoo! Mail - More reliable, more storage, less spam
http://mail.yahoo.com
From FGTB at mac.com  Wed Mar 17 14:53:15 2004
From: FGTB at mac.com (From Genomes To Biomes)
Date: Wed Mar 17 14:58:44 2004
Subject: [Bioperl-l] (no subject)
Message-ID: <p06020402bc7e5d0120f7@[62.97.171.121]>

Any one succesful in using fink and cpan/bundle bioperl together ?
From ralf at bch.ed.ac.uk  Wed Mar 17 15:06:58 2004
From: ralf at bch.ed.ac.uk (Ralf Schmid)
Date: Wed Mar 17 15:12:23 2004
Subject: [Bioperl-l] Bio::SearchIO blast_parsing dot_at_the_end_of
	query_accession
Message-ID: <Pine.SGI.3.91.1040317200045.6916270H-100000@nessie>


Hi,

I have recently updated bioperl from 1.21 to 1.4 and this has broken one of my
blast parsing scripts. Using the following snippet of code on a blast output (3
input sequences, -b 2 option for retrieving only two alignments, otherwise
standard) gives different results: 

=> code

#!/usr/bin/perl -w
use strict;
use Bio::SearchIO;
my $in = new Bio::SearchIO( -format => 'blast',
                            -file   => "test.out");
my $prot = ''; 
while( my $result = $in->next_result )  {     
  while (my $hit = $result->next_hit) {     
    $prot=$result->query_accession;                   
    print"$prot\n";	
  }
}     


=> output bioperl 1.21:

ZPP00163
ZPP00163
ZPP00036_1
ZPP00036_1
ZPP00157
ZPP00157

- query accession is retrieved for each hit where there is an alignment

=> output bioperl 1.4:

ZPP00163.
ZPP00163.
ZPP00163.
ZPP00163.
ZPP00163.
ZPP00163.
ZPP00163.
...

- query_accession is retrieved for each hit regardless whether there is an
alignment or not
- each query_accession ends with a "."


So far I have taken advantage of the blast -b option to set the number of hits
to be parsed by bioperl, but I can see the ratio in changing bioperl from
parsing every hit that has an alignment to parsing every hit. 

Looking at the diff between blast.pm 1.42.2.8 and blast.pm 1.76 and finding the
helpful comment in line 769 makes me believe that there is the change in parsing
coded, but I couldn't spot any reason for the "." at the end of each
query_accession. Not sure whether the two are related anyway.   

<SNIP>
# This is for the case when we specify -b 0 (or B=0 for WU-BLAST)              
# and still want to construct minimal Hit objects               
while(my $v = shift @hit_signifs) {
next unless defined $v;                   
$self->start_element({ 'Name' => 'Hit'});
...
<SNIP>

So far I'm fixing the "dot" issue by an s/\.$// , but ...


Cheers,

Ralf


------------------------------------------------------------------------------ 
Dr. Ralf Schmid
Nematode Bioinformatics
Blaxter Nematode Genomics Group
Institute of Cell, Animal and Population Biology
Ashworth Labs						
University of Edinburgh				
King's Buildings					
Edinburgh			
EH9 3JT	 			
UK					

(+44)(0)131 650 7403


From james.wasmuth at ed.ac.uk  Wed Mar 17 15:22:47 2004
From: james.wasmuth at ed.ac.uk (James Wasmuth)
Date: Wed Mar 17 15:28:21 2004
Subject: [Bioperl-l] Clustal/T-Coffee to Fasta
In-Reply-To: <20040317195138.39516.qmail@web13907.mail.yahoo.com>
References: <20040317195138.39516.qmail@web13907.mail.yahoo.com>
Message-ID: <4058B397.4000109@ed.ac.uk>

Hi Alberto,  taken from docs for Bio::AlignIO 
(http://doc.bioperl.org/releases/bioperl-1.4/Bio/AlignIO.html)

$inputfilename = "testaln.fasta";
    $in  = Bio::AlignIO->new(-file => $inputfilename ,
                             '-format' => 'clustal');
    $out = Bio::AlignIO->new(-file => ">out.aln.pfam" ,
                             '-format' => 'fasta');

    while ( my $aln = $in->next_aln() ) {
        $out->write_aln($aln);
    }


if you want the gaps removed then...


while ( my $aln = $in->next_aln() ) {
	$aln->map_chars('\.','')    # first arguement is whatever your gap 
character is...
	$out->write_aln($aln);
    }


-james


Alberto M R Davila wrote:

>I was looking in to the SeqIO module trying to know if it could convert
>multiple alignments done with ClustalW/X (*.aln files) or T-Coffee to
>Fasta sequences into a single file... no luck... there would be any other
>Bioperl module to do that format conversion ?
>
>Thanks, Alberto
>
>
>
>__________________________________
>Do you Yahoo!?
>Yahoo! Mail - More reliable, more storage, less spam
>http://mail.yahoo.com
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>  
>


From jason at cgt.duhs.duke.edu  Wed Mar 17 15:25:56 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Mar 17 15:31:21 2004
Subject: [Bioperl-l] Clustal/T-Coffee to Fasta
In-Reply-To: <20040317195138.39516.qmail@web13907.mail.yahoo.com>
References: <20040317195138.39516.qmail@web13907.mail.yahoo.com>
Message-ID: <Pine.LNX.4.50.0403171525150.6738-100000@tenero.duhs.duke.edu>


Alignments are handled by Bio::AlignIO

You want
 Bio::AlignIO::clustalw
 Bio::AlignIO::fasta

-j
On Wed, 17 Mar 2004, Alberto M R Davila wrote:

> I was looking in to the SeqIO module trying to know if it could convert
> multiple alignments done with ClustalW/X (*.aln files) or T-Coffee to
> Fasta sequences into a single file... no luck... there would be any other
> Bioperl module to do that format conversion ?
>
> Thanks, Alberto
>
>
>
> __________________________________
> Do you Yahoo!?
> Yahoo! Mail - More reliable, more storage, less spam
> http://mail.yahoo.com
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From jason at cgt.duhs.duke.edu  Wed Mar 17 15:36:13 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Mar 17 15:41:41 2004
Subject: [Bioperl-l] Bio::SearchIO blast_parsing dot_at_the_end_of
	query_accession
In-Reply-To: <Pine.SGI.3.91.1040317200045.6916270H-100000@nessie>
References: <Pine.SGI.3.91.1040317200045.6916270H-100000@nessie>
Message-ID: <Pine.LNX.4.50.0403171530260.6738-100000@tenero.duhs.duke.edu>

ralf - can you send an example report - if I use your code and the
t/data/ecolitst.bls report in the bioperl disto I get this
jason@jason $ perl ralf_bug.pl
AAC73113.1
AAC73113.1
AAC73113.1
AAC73113.1

also see below.

-j
On Wed, 17 Mar 2004, Ralf Schmid wrote:

>
> Hi,
>
> I have recently updated bioperl from 1.21 to 1.4 and this has broken one of my
> blast parsing scripts. Using the following snippet of code on a blast output (3
> input sequences, -b 2 option for retrieving only two alignments, otherwise
> standard) gives different results:
>
> => code
>
> #!/usr/bin/perl -w
> use strict;
> use Bio::SearchIO;
> my $in = new Bio::SearchIO( -format => 'blast',
>                             -file   => "test.out");
> my $prot = '';
> while( my $result = $in->next_result )  {
>   while (my $hit = $result->next_hit) {
>     $prot=$result->query_accession;
>     print"$prot\n";
>   }
> }
>
>
> => output bioperl 1.21:
>
> ZPP00163
> ZPP00163
> ZPP00036_1
> ZPP00036_1
> ZPP00157
> ZPP00157
>
> - query accession is retrieved for each hit where there is an alignment
>
> => output bioperl 1.4:
>
> ZPP00163.
> ZPP00163.
> ZPP00163.
> ZPP00163.
> ZPP00163.
> ZPP00163.
> ZPP00163.
> ...
>
> - query_accession is retrieved for each hit regardless whether there is an
> alignment or not
> - each query_accession ends with a "."
>
>
> So far I have taken advantage of the blast -b option to set the number of hits
> to be parsed by bioperl, but I can see the ratio in changing bioperl from
> parsing every hit that has an alignment to parsing every hit.
>
This was a requested feature.  You can add a little code which exists the
hit loop if the hit doesn't have any hsps
last if $hit->num_hsps == 0;

> Looking at the diff between blast.pm 1.42.2.8 and blast.pm 1.76 and finding the
> helpful comment in line 769 makes me believe that there is the change in parsing
> coded, but I couldn't spot any reason for the "." at the end of each
> query_accession. Not sure whether the two are related anyway.
>
> <SNIP>
> # This is for the case when we specify -b 0 (or B=0 for WU-BLAST)
> # and still want to construct minimal Hit objects
> while(my $v = shift @hit_signifs) {
> next unless defined $v;
> $self->start_element({ 'Name' => 'Hit'});
> ...
> <SNIP>
>
> So far I'm fixing the "dot" issue by an s/\.$// , but ...
>
>
> Cheers,
>
> Ralf
>
>
>
>
>
>
> ------------------------------------------------------------------------------
> Dr. Ralf Schmid
> Nematode Bioinformatics
> Blaxter Nematode Genomics Group
> Institute of Cell, Animal and Population Biology
> Ashworth Labs
> University of Edinburgh
> King's Buildings
> Edinburgh
> EH9 3JT
> UK
>
> (+44)(0)131 650 7403
>
>
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From brian_osborne at cognia.com  Wed Mar 17 15:51:04 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Wed Mar 17 15:56:32 2004
Subject: [Bioperl-l] Clustal/T-Coffee to Fasta
In-Reply-To: <20040317195138.39516.qmail@web13907.mail.yahoo.com>
Message-ID: <GAEDKMGOKFBLJPKCLKCCEEPFDIAA.brian_osborne@cognia.com>

Alberto,

You want to use AlignIO, and 'fasta' is one of its output formats. Unlike
SeqIO, AlignIO can't produce output in all of its input formats. Take a look
at the bptutorial, section III.2.2
(http://bioperl.org/Core/Latest/bptutorial.html).


Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Alberto M R
Davila
Sent: Wednesday, March 17, 2004 2:52 PM
To: bioperl-l@bioperl.org
Cc: amrdavila@yahoo.com
Subject: [Bioperl-l] Clustal/T-Coffee to Fasta

I was looking in to the SeqIO module trying to know if it could convert
multiple alignments done with ClustalW/X (*.aln files) or T-Coffee to
Fasta sequences into a single file... no luck... there would be any other
Bioperl module to do that format conversion ?

Thanks, Alberto


__________________________________
Do you Yahoo!?
Yahoo! Mail - More reliable, more storage, less spam
http://mail.yahoo.com
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From james.wasmuth at ed.ac.uk  Wed Mar 17 16:10:53 2004
From: james.wasmuth at ed.ac.uk (James Wasmuth)
Date: Wed Mar 17 16:16:47 2004
Subject: [Bioperl-l] Bio::SearchIO blast_parsing
	dot_at_the_end_of	query_accession
In-Reply-To: <4058BE46.9060205@ed.ac.uk>
References: <Pine.SGI.3.91.1040317200045.6916270H-100000@nessie>
	<Pine.LNX.4.50.0403171530260.6738-100000@tenero.duhs.duke.edu>
	<4058BE46.9060205@ed.ac.uk>
Message-ID: <4058BEDD.5080708@ed.ac.uk>

Hi Jason, I work with Ralf, and sensibly he's gone home, but here's the 
report file:

BLASTP 2.2.6 [Apr-09-2003]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= ZPP00163	
         (199 letters)

Database: swissall.fsa 
           1,218,016 sequences; 390,760,073 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

NUKM_CAEEL (Q94360) Probable NADH-ubiquinone oxidoreductase 20 k...   177   6e-44
Q9VAK5 (Q9VAK5) CG2014 protein                                        166   1e-40
NUKM_BOVIN (P42026) NADH-ubiquinone oxidoreductase 20 kDa subuni...   166   2e-40
Q9VXK7 (Q9VXK7) CG9172 protein (LD31474P)                             165   4e-40
Q9BV17 (Q9BV17) Similar to CG9172 gene product (NADH:ubiquinone ...   163   1e-39
NUKM_HUMAN (O75251) NADH-ubiquinone oxidoreductase 20 kDa subuni...   163   1e-39
Q7PRW3 (Q7PRW3) ENSANGP00000019428 (Fragment)                         162   2e-39
Q9DC70 (Q9DC70) 1010001M04Rik protein (RIKEN cDNA 1010001M04 gene)    159   2e-38
Q86EG3 (Q86EG3) Clone ZZD432 mRNA sequence                            155   4e-37
Q7PQT1 (Q7PQT1) ENSANGP00000014787 (Fragment)                         150   7e-36
Q7PQU6 (Q7PQU6) ENSANGP00000017108 (Fragment)                         150   7e-36
NUKM_RECAM (O21272) NADH-ubiquinone oxidoreductase 20 kDa subuni...   146   2e-34
Q9G8U4 (Q9G8U4) NADH dehydrogenase subunit 10 (EC 1.6.5.3)            144   5e-34
Q8NAS7 (Q8NAS7) Hypothetical protein FLJ34850                         144   9e-34
Q7RZB4 (Q7RZB4) Hypothetical protein                                  143   2e-33
NUKM_NEUCR (O47950) NADH-ubiquinone oxidoreductase 19.3 kDa subu...   143   2e-33
Q8W0E8 (Q8W0E8) Putative NADH dehydrogenase (Ubiquinone) chain PSST   141   6e-33
Q9TCA4 (Q9TCA4) NADH dehydrogenase subunit 10 (EC 1.6.5.3)            140   1e-32
Q9SP38 (Q9SP38) NADH ubiquinone oxidoreductase PSST subunit           139   2e-32
Q9LKG9 (Q9LKG9) NADH-ubiquinone oxidoreductase subunit PSST (Fra...   139   2e-32
NUKM_SOLTU (Q43844) NADH-ubiquinone oxidoreductase 20 kDa subuni...   139   2e-32
NUKM_BRAOL (P42027) NADH-ubiquinone oxidoreductase 20 kDa subuni...   139   2e-32
NUKM_ARATH (Q42577) NADH-ubiquinone oxidoreductase 20 kDa subuni...   138   5e-32
Q9LKH4 (Q9LKH4) NADH-ubiquinone oxidoreductase subunit PSST           137   1e-31
Q9UUT7 (Q9UUT7) Subunit NUKM of protein NADH:ubiquinone oxidored...   134   5e-31
Q7PBK3 (Q7PBK3) NADH dehydrogenase I chain B                          134   9e-31
NUOB_RICCN (Q92ID6) NADH-quinone oxidoreductase chain B (EC 1.6....   134   9e-31


>>NUKM_CAEEL (Q94360) Probable NADH-ubiquinone oxidoreductase 20 kDa
>  
>
           subunit, mitochondrial precursor (EC 1.6.5.3) (EC
           1.6.99.3) (Complex I-20KD) (CI-20KD)
          Length = 199

 Score =  177 bits (450), Expect = 6e-44
 Identities = 88/161 (54%), Positives = 106/161 (65%)

Query: 39  GKGVFGSPFVQSESKGEWALASLDDVINLCGKTSLWPLTFGLXXXXXXXXHFAAPRYDMD 98
           G    G+PF+   SK E+ALA LDDV+NL  + S+WPLTFGL        HFAAPRYDMD
Sbjct: 31  GIATTGTPFLNPSSKAEYALARLDDVLNLAQRGSIWPLTFGLACCAVEMMHFAAPRYDMD 90

Query: 99  HYGVVFHATPXQVNLILFTGTITNKMAPALHHIYNQMPKPKYVISMESYTNGSGYYHYTY 158
            YGVVF A+P Q +LI   GT+TNKMAPAL  IY+QMP+ K+VISM S  NG GYYHY Y
Sbjct: 91  RYGVVFRASPRQADLIFVAGTVTNKMAPALRRIYDQMPEAKWVISMGSCANGGGYYHYAY 150

Query: 159 SIVHNYNHMXXXXXXXXXXXXTAKXLLYNILQLQKKIKHSR 199
           S++   + +            TA+ LLY +LQLQKKIK  R
Sbjct: 151 SVLRGCDRVIPVDIYVPGCPPTAEALLYGVLQLQKKIKRKR 191


>>Q9VAK5 (Q9VAK5) CG2014 protein
>  
>
          Length = 212

 Score =  166 bits (421), Expect = 1e-40
 Identities = 82/162 (50%), Positives = 103/162 (63%), Gaps = 3/162 (1%)

Query: 38  WGKGVFGSPFVQSESKGEWALASLDDVINLCGKTSLWPLTFGLXXXXXXXXHFAAPRYDM 97
           WG   FG      ++ GEW  A LDD++N   K SLWPLTFGL        H AAPRYDM
Sbjct: 46  WGYSAFGR---NQKTWGEWTCARLDDLLNWGRKGSLWPLTFGLACCAVEMMHIAAPRYDM 102

Query: 98  DHYGVVFHATPXQVNLILFTGTITNKMAPALHHIYNQMPKPKYVISMESYTNGSGYYHYT 157
           D YGVVF A+P Q ++++  GT+TNKMAPA   IY+QMP+P++VISM S  NG GYYHY+
Sbjct: 103 DRYGVVFRASPRQADVLIVAGTLTNKMAPAFRKIYDQMPEPRWVISMGSCANGGGYYHYS 162

Query: 158 YSIVHNYNHMXXXXXXXXXXXXTAKXLLYNILQLQKKIKHSR 199
           YS+V   + +            TA+ L+Y ILQLQKK+K  R
Sbjct: 163 YSVVRGCDRIVPVDIYVPGCPPTAEALMYGILQLQKKVKRMR 204


  Database: swissall.fsa
    Posted date:  Dec 5, 2003  3:22 PM
  Number of letters in database: 390,760,073
  Number of sequences in database:  1,218,016
  
Lambda     K      H
   0.318    0.134    0.417 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 150,792,462
Number of Sequences: 1218016
Number of extensions: 5616912
Number of successful extensions: 15131
Number of sequences better than 1.0e-30: 27
Number of HSP's better than  0.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 15095
Number of HSP's gapped (non-prelim): 27
length of query: 199
length of database: 390,760,073
effective HSP length: 116
effective length of query: 83
effective length of database: 249,470,217
effective search space: 20706028011
effective search space used: 20706028011
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 336 (134.0 bits)
BLASTP 2.2.6 [Apr-09-2003]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= ZPP00036_1	
         (229 letters)

Database: swissall.fsa 
           1,218,016 sequences; 390,760,073 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Q95XJ0 (Q95XJ0) Hypothetical protein                                  256   2e-67
Q95XI9 (Q95XI9) Hypothetical protein                                  256   2e-67
ATPG_DROME (O01666) ATP synthase gamma chain, mitochondrial prec...   202   2e-51
Q7Q3N8 (Q7Q3N8) AgCP11416 (Fragment)                                  197   7e-50
ATPG_BOVIN (P05631) ATP synthase gamma chain, mitochondrial prec...   184   1e-45
Q9ERA8 (Q9ERA8) ATP synthase gamma-subunit precursor                  178   4e-44
Q8TAS0 (Q8TAS0) Hypothetical protein (Fragment)                       178   6e-44
ATPG_MOUSE (Q91VR2) ATP synthase gamma chain, mitochondrial prec...   178   6e-44
ATPG_HUMAN (P36542) ATP synthase gamma chain, mitochondrial prec...   178   6e-44
Q910C3 (Q910C3) Mitochondrial ATP synthase gamma-subunit              177   1e-43
Q9D9D7 (Q9D9D7) 1700094F02Rik protein                                 175   4e-43
Q86DM7 (Q86DM7) Hypothetical protein Y69A2AR.18                       174   1e-42
Q8C2Q8 (Q8C2Q8) ATP synthase                                          172   2e-42
Q7ZXN3 (Q7ZXN3) Similar to ATP synthase, H+ transporting, mitoch...   172   4e-42
ATPG_RAT (P35435) ATP synthase gamma chain, mitochondrial (EC 3....   169   2e-41


>>Q95XJ0 (Q95XJ0) Hypothetical protein
>  
>
          Length = 299

 Score =  256 bits (653), Expect = 2e-67
 Identities = 131/225 (58%), Positives = 164/225 (72%), Gaps = 4/225 (1%)

Query: 5   YGNVEQQRNFATLKDISIRLKSVKNIQKMTXXXXXXXXXXXXXXDRELKGARVYGEGAQA 64
           + N EQ R FATLKDISIRLKSVKNIQK+T              +RELKGAR YG GA+ 
Sbjct: 16  FANAEQARGFATLKDISIRLKSVKNIQKITKSMKMVAAAKYAKAERELKGARAYGVGAKT 75

Query: 65  FYNNLAGDDAAVPKADASATTKKLLILMTSDSGLCGAVHTSIIKEAKNLIKNKPDNMEYK 124
           F++N+   D  V   +   + K++L+L+TSD GLCG VH+SI+KEAKN++ N  D  E +
Sbjct: 76  FFDNI---DPVVEGVEKQESKKQVLVLITSDRGLCGGVHSSIVKEAKNILNNAGDK-EIR 131

Query: 125 LVCIGDKSKAGMSRIYGNHILYTANEIGRLPPTFEDASIAALEILNSGYEFDEAEILYNR 184
           +V IGDKS+AG+ R+Y N IL + NEIGR PP+F DASIAA  IL+SGY+F+   IL+NR
Sbjct: 132 VVAIGDKSRAGLQRLYANSILLSGNEIGRAPPSFADASIAAKAILDSGYDFETGTILFNR 191

Query: 185 FKTVVSYQTSKLQVPPLATIKSNTKLYTYDSVDDDLLQSYAEYSL 229
           FKTVVSY+TSKLQ+ PL  IK+   L TYDSVDDD+LQSY+EYSL
Sbjct: 192 FKTVVSYETSKLQILPLEAIKAKEALSTYDSVDDDVLQSYSEYSL 236


>>Q95XI9 (Q95XI9) Hypothetical protein
>  
>
          Length = 313

 Score =  256 bits (653), Expect = 2e-67
 Identities = 131/225 (58%), Positives = 164/225 (72%), Gaps = 4/225 (1%)

Query: 5   YGNVEQQRNFATLKDISIRLKSVKNIQKMTXXXXXXXXXXXXXXDRELKGARVYGEGAQA 64
           + N EQ R FATLKDISIRLKSVKNIQK+T              +RELKGAR YG GA+ 
Sbjct: 16  FANAEQARGFATLKDISIRLKSVKNIQKITKSMKMVAAAKYAKAERELKGARAYGVGAKT 75

Query: 65  FYNNLAGDDAAVPKADASATTKKLLILMTSDSGLCGAVHTSIIKEAKNLIKNKPDNMEYK 124
           F++N+   D  V   +   + K++L+L+TSD GLCG VH+SI+KEAKN++ N  D  E +
Sbjct: 76  FFDNI---DPVVEGVEKQESKKQVLVLITSDRGLCGGVHSSIVKEAKNILNNAGDK-EIR 131

Query: 125 LVCIGDKSKAGMSRIYGNHILYTANEIGRLPPTFEDASIAALEILNSGYEFDEAEILYNR 184
           +V IGDKS+AG+ R+Y N IL + NEIGR PP+F DASIAA  IL+SGY+F+   IL+NR
Sbjct: 132 VVAIGDKSRAGLQRLYANSILLSGNEIGRAPPSFADASIAAKAILDSGYDFETGTILFNR 191

Query: 185 FKTVVSYQTSKLQVPPLATIKSNTKLYTYDSVDDDLLQSYAEYSL 229
           FKTVVSY+TSKLQ+ PL  IK+   L TYDSVDDD+LQSY+EYSL
Sbjct: 192 FKTVVSYETSKLQILPLEAIKAKEALSTYDSVDDDVLQSYSEYSL 236


  Database: swissall.fsa
    Posted date:  Dec 5, 2003  3:22 PM
  Number of letters in database: 390,760,073
  Number of sequences in database:  1,218,016
  
Lambda     K      H
   0.314    0.132    0.358 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 148,369,851
Number of Sequences: 1218016
Number of extensions: 5173956
Number of successful extensions: 9119
Number of sequences better than 1.0e-30: 15
Number of HSP's better than  0.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 9087
Number of HSP's gapped (non-prelim): 15
length of query: 229
length of database: 390,760,073
effective HSP length: 118
effective length of query: 111
effective length of database: 247,034,185
effective search space: 27420794535
effective search space used: 27420794535
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 337 (134.4 bits)
BLASTP 2.2.6 [Apr-09-2003]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= ZPP00157	
         (124 letters)

Database: swissall.fsa 
           1,218,016 sequences; 390,760,073 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

RL23_CAEEL (P48158) 60S ribosomal protein L23                         207   3e-53
Q7PMD8 (Q7PMD8) ENSANGP00000014430 (Fragment)                         199   6e-51
RL23_AEDAE (Q9GNE2) 60S ribosomal protein L23 (L17A)                  199   6e-51
Q9W1Y7 (Q9W1Y7) RPL17A protein                                        199   7e-51
RL23_BRUMA (Q93140) 60S ribosomal protein L23                         196   6e-50
Q962Y9 (Q962Y9) Ribosomal protein L17/23                              193   4e-49
Q7ZWJ5 (Q7ZWJ5) Hypothetical protein (Fragment)                       191   2e-48
Q90YU5 (Q90YU5) Ribosomal protein L23                                 189   4e-48
Q8IT99 (Q8IT99) Ribosomal protein L17A (Fragment)                     189   4e-48
Q9BTQ7 (Q9BTQ7) Similar to ribosomal protein L23 (Fragment)           189   4e-48
RL23_HUMAN (P23131) 60S ribosomal protein L23 (L17)                   189   4e-48
Q9DFL3 (Q9DFL3) Ribosomal protein L23                                 189   6e-48
Q86D71 (Q86D71) Ribosomal protein L23                                 189   6e-48
RL23_DROME (P48159) 60S ribosomal protein L23 (L17A)                  189   8e-48
Q9W6M3 (Q9W6M3) Ribosomal protein L17                                 188   1e-47
Q9CZE6 (Q9CZE6) 2810009A01Rik protein                                 188   1e-47
Q9DCQ4 (Q9DCQ4) 2810009A01Rik protein                                 186   4e-47
Q9XSU3 (Q9XSU3) Ribosomal protein                                     186   6e-47
Q9YHX0 (Q9YHX0) Ribosomal protein L17 homolog (Fragment)              182   5e-46
O22686 (O22686) F19P19.5 protein                                      179   6e-45
Q9ATF6 (Q9ATF6) Ribosomal protein L17                                 179   6e-45
RL23_TOBAC (Q07760) 60S ribosomal protein L23                         179   6e-45
RL23_ARATH (P49690) 60S ribosomal protein L23                         179   6e-45
Q7X9K1 (Q7X9K1) Ribosomal Pr 117 (Fragment)                           178   1e-44
Q8L8P0 (Q8L8P0) Putative 60S ribosomal protein L17                    178   1e-44
Q7XDL5 (Q7XDL5) 60S ribosomal protein L17                             177   2e-44
Q9AV77 (Q9AV77) 60S ribosomal protein L17                             177   2e-44
O65068 (O65068) 60S ribosomal protein L17 (Fragment)                  176   5e-44
Q9SMI7 (Q9SMI7) 60S ribosomal protein L17 (Fragment)                  174   2e-43
O96636 (O96636) Ribosomal protein L17                                 172   6e-43
RL23_SCHPO (O42867) 60S ribosomal protein L23                         171   2e-42
RL23_YEAST (P04451) 60S ribosomal protein L23 (L17)                   171   2e-42
RL23_TORRU (Q9XEK8) 60S ribosomal protein L23 (L17)                   171   2e-42
Q7R814 (Q7R814) 60S ribosomal protein L23                             164   3e-40
Q7SHJ9 (Q7SHJ9) Hypothetical protein                                  163   3e-40
Q873R3 (Q873R3) Alkaline serine protease (Fragment)                   163   3e-40
Q8IE09 (Q8IE09) 60S ribosomal protein L23, putative                   162   6e-40
Q98RY7 (Q98RY7) 60S ribosomal protein L23                             154   2e-37
Q7QTA5 (Q7QTA5) GLP_15_22119_21691                                    149   9e-36
RL23_TRYCR (Q94776) 60S ribosomal protein L23 (L17) (TCEST082)        149   9e-36
Q9HIR9 (Q9HIR9) Probable 50S ribosomal protein L14                    134   2e-31
Q97BW6 (Q97BW6) Ribosomal protein large subunit L23                   133   4e-31
Q8SRA7 (Q8SRA7) Ribosomal protein L23                                 132   8e-31


>>RL23_CAEEL (P48158) 60S ribosomal protein L23
>  
>
          Length = 140

 Score =  207 bits (526), Expect = 3e-53
 Identities = 102/113 (90%), Positives = 105/113 (92%)

Query: 1   LSLPVGFXINXADNTGANNLFVIAVYGMRGRLNRLPSAGVGDMFVASVKKGKPELRKKVL 60
           L LPVG  +N ADNTGA NLFVI+VYG+RGRLNRLPSAGVGDMFV SVKKGKPELRKKVL
Sbjct: 18  LGLPVGAVMNCADNTGAKNLFVISVYGIRGRLNRLPSAGVGDMFVCSVKKGKPELRKKVL 77

Query: 61  QAVVIRQRKMFRRKDGTFIYFEDNAGVIVNNKGEMKGSAITGPVAKECADLWP 113
           Q VVIRQRK FRRKDGTFIYFEDNAGVIVNNKGEMKGSAITGPVAKECADLWP
Sbjct: 78  QGVVIRQRKQFRRKDGTFIYFEDNAGVIVNNKGEMKGSAITGPVAKECADLWP 130


>>Q7PMD8 (Q7PMD8) ENSANGP00000014430 (Fragment)
>  
>
          Length = 135

 Score =  199 bits (506), Expect = 6e-51
 Identities = 97/113 (85%), Positives = 105/113 (92%)

Query: 1   LSLPVGFXINXADNTGANNLFVIAVYGMRGRLNRLPSAGVGDMFVASVKKGKPELRKKVL 60
           L LPVG  IN ADNTGA NL+VIAV+G+RGRLNRLP+AGVGDMFVA+VKKGKPELRKKV+
Sbjct: 13  LGLPVGAVINCADNTGAKNLYVIAVHGIRGRLNRLPAAGVGDMFVATVKKGKPELRKKVM 72

Query: 61  QAVVIRQRKMFRRKDGTFIYFEDNAGVIVNNKGEMKGSAITGPVAKECADLWP 113
            AVVIRQRK FRR+DG F+YFEDNAGVIVNNKGEMKGSAITGPVAKECADLWP
Sbjct: 73  PAVVIRQRKPFRRRDGVFLYFEDNAGVIVNNKGEMKGSAITGPVAKECADLWP 125


  Database: swissall.fsa
    Posted date:  Dec 5, 2003  3:22 PM
  Number of letters in database: 390,760,073
  Number of sequences in database:  1,218,016
  
Lambda     K      H
   0.321    0.140    0.409 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 83,601,214
Number of Sequences: 1218016
Number of extensions: 3001107
Number of successful extensions: 5578
Number of sequences better than 1.0e-30: 43
Number of HSP's better than  0.0 without gapping: 42
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 5533
Number of HSP's gapped (non-prelim): 43
length of query: 124
length of database: 390,760,073
effective HSP length: 100
effective length of query: 24
effective length of database: 268,958,473
effective search space: 6455003352
effective search space used: 6455003352
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 332 (132.5 bits)


>
>
> Jason Stajich wrote:
>
>>ralf - can you send an example report - if I use your code and the
>>t/data/ecolitst.bls report in the bioperl disto I get this
>>jason@jason $ perl ralf_bug.pl
>>AAC73113.1
>>AAC73113.1
>>AAC73113.1
>>AAC73113.1
>>
>>also see below.
>>
>>-j
>>On Wed, 17 Mar 2004, Ralf Schmid wrote:
>>
>>  
>>
>>>Hi,
>>>
>>>I have recently updated bioperl from 1.21 to 1.4 and this has broken one of my
>>>blast parsing scripts. Using the following snippet of code on a blast output (3
>>>input sequences, -b 2 option for retrieving only two alignments, otherwise
>>>standard) gives different results:
>>>
>>>=> code
>>>
>>>#!/usr/bin/perl -w
>>>use strict;
>>>use Bio::SearchIO;
>>>my $in = new Bio::SearchIO( -format => 'blast',
>>>                            -file   => "test.out");
>>>my $prot = '';
>>>while( my $result = $in->next_result )  {
>>>  while (my $hit = $result->next_hit) {
>>>    $prot=$result->query_accession;
>>>    print"$prot\n";
>>>  }
>>>}
>>>
>>>
>>>=> output bioperl 1.21:
>>>
>>>ZPP00163
>>>ZPP00163
>>>ZPP00036_1
>>>ZPP00036_1
>>>ZPP00157
>>>ZPP00157
>>>
>>>- query accession is retrieved for each hit where there is an alignment
>>>
>>>=> output bioperl 1.4:
>>>
>>>ZPP00163.
>>>ZPP00163.
>>>ZPP00163.
>>>ZPP00163.
>>>ZPP00163.
>>>ZPP00163.
>>>ZPP00163.
>>>...
>>>
>>>- query_accession is retrieved for each hit regardless whether there is an
>>>alignment or not
>>>- each query_accession ends with a "."
>>>
>>>
>>>So far I have taken advantage of the blast -b option to set the number of hits
>>>to be parsed by bioperl, but I can see the ratio in changing bioperl from
>>>parsing every hit that has an alignment to parsing every hit.
>>>
>>>    
>>>
>>This was a requested feature.  You can add a little code which exists the
>>hit loop if the hit doesn't have any hsps
>>last if $hit->num_hsps == 0;
>>
>>  
>>
>>>Looking at the diff between blast.pm 1.42.2.8 and blast.pm 1.76 and finding the
>>>helpful comment in line 769 makes me believe that there is the change in parsing
>>>coded, but I couldn't spot any reason for the "." at the end of each
>>>query_accession. Not sure whether the two are related anyway.
>>>
>>><SNIP>
>>># This is for the case when we specify -b 0 (or B=0 for WU-BLAST)
>>># and still want to construct minimal Hit objects
>>>while(my $v = shift @hit_signifs) {
>>>next unless defined $v;
>>>$self->start_element({ 'Name' => 'Hit'});
>>>...
>>><SNIP>
>>>
>>>So far I'm fixing the "dot" issue by an s/\.$// , but ...
>>>
>>>
>>>Cheers,
>>>
>>>Ralf
>>>
>>>
>>>
>>>
>>>
>>>
>>>------------------------------------------------------------------------------
>>>Dr. Ralf Schmid
>>>Nematode Bioinformatics
>>>Blaxter Nematode Genomics Group
>>>Institute of Cell, Animal and Population Biology
>>>Ashworth Labs
>>>University of Edinburgh
>>>King's Buildings
>>>Edinburgh
>>>EH9 3JT
>>>UK
>>>
>>>(+44)(0)131 650 7403
>>>
>>>
>>>
>>>
>>>
>>>_______________________________________________
>>>Bioperl-l mailing list
>>>Bioperl-l@portal.open-bio.org
>>>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>>
>>>    
>>>
>>
>>--
>>Jason Stajich
>>Duke University
>>jason at cgt.mc.duke.edu
>>_______________________________________________
>>Bioperl-l mailing list
>>Bioperl-l@portal.open-bio.org
>>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>  
>>

From james.wasmuth at ed.ac.uk  Wed Mar 17 16:11:26 2004
From: james.wasmuth at ed.ac.uk (James Wasmuth)
Date: Wed Mar 17 16:17:15 2004
Subject: [Bioperl-l] Bio::SearchIO blast_parsing
	dot_at_the_end_of	query_accession
In-Reply-To: <Pine.LNX.4.50.0403171530260.6738-100000@tenero.duhs.duke.edu>
References: <Pine.SGI.3.91.1040317200045.6916270H-100000@nessie>
	<Pine.LNX.4.50.0403171530260.6738-100000@tenero.duhs.duke.edu>
Message-ID: <4058BEFE.6030701@ed.ac.uk>

Just noticed that what was required was $result->query_name rather than 
query_accession.  Still, to me, its curious that the latter method 
should append the '.' .


-james


Jason Stajich wrote:

>ralf - can you send an example report - if I use your code and the
>t/data/ecolitst.bls report in the bioperl disto I get this
>jason@jason $ perl ralf_bug.pl
>AAC73113.1
>AAC73113.1
>AAC73113.1
>AAC73113.1
>
>also see below.
>
>-j
>On Wed, 17 Mar 2004, Ralf Schmid wrote:
>
>  
>
>>Hi,
>>
>>I have recently updated bioperl from 1.21 to 1.4 and this has broken one of my
>>blast parsing scripts. Using the following snippet of code on a blast output (3
>>input sequences, -b 2 option for retrieving only two alignments, otherwise
>>standard) gives different results:
>>
>>=> code
>>
>>#!/usr/bin/perl -w
>>use strict;
>>use Bio::SearchIO;
>>my $in = new Bio::SearchIO( -format => 'blast',
>>                            -file   => "test.out");
>>my $prot = '';
>>while( my $result = $in->next_result )  {
>>  while (my $hit = $result->next_hit) {
>>    $prot=$result->query_accession;
>>    print"$prot\n";
>>  }
>>}
>>
>>
>>=> output bioperl 1.21:
>>
>>ZPP00163
>>ZPP00163
>>ZPP00036_1
>>ZPP00036_1
>>ZPP00157
>>ZPP00157
>>
>>- query accession is retrieved for each hit where there is an alignment
>>
>>=> output bioperl 1.4:
>>
>>ZPP00163.
>>ZPP00163.
>>ZPP00163.
>>ZPP00163.
>>ZPP00163.
>>ZPP00163.
>>ZPP00163.
>>...
>>
>>- query_accession is retrieved for each hit regardless whether there is an
>>alignment or not
>>- each query_accession ends with a "."
>>
>>
>>So far I have taken advantage of the blast -b option to set the number of hits
>>to be parsed by bioperl, but I can see the ratio in changing bioperl from
>>parsing every hit that has an alignment to parsing every hit.
>>
>>    
>>
>This was a requested feature.  You can add a little code which exists the
>hit loop if the hit doesn't have any hsps
>last if $hit->num_hsps == 0;
>
>  
>
>>Looking at the diff between blast.pm 1.42.2.8 and blast.pm 1.76 and finding the
>>helpful comment in line 769 makes me believe that there is the change in parsing
>>coded, but I couldn't spot any reason for the "." at the end of each
>>query_accession. Not sure whether the two are related anyway.
>>
>><SNIP>
>># This is for the case when we specify -b 0 (or B=0 for WU-BLAST)
>># and still want to construct minimal Hit objects
>>while(my $v = shift @hit_signifs) {
>>next unless defined $v;
>>$self->start_element({ 'Name' => 'Hit'});
>>...
>><SNIP>
>>
>>So far I'm fixing the "dot" issue by an s/\.$// , but ...
>>
>>
>>Cheers,
>>
>>Ralf
>>
>>
>>
>>
>>
>>
>>------------------------------------------------------------------------------
>>Dr. Ralf Schmid
>>Nematode Bioinformatics
>>Blaxter Nematode Genomics Group
>>Institute of Cell, Animal and Population Biology
>>Ashworth Labs
>>University of Edinburgh
>>King's Buildings
>>Edinburgh
>>EH9 3JT
>>UK
>>
>>(+44)(0)131 650 7403
>>
>>
>>
>>
>>
>>_______________________________________________
>>Bioperl-l mailing list
>>Bioperl-l@portal.open-bio.org
>>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>
>>    
>>
>
>--
>Jason Stajich
>Duke University
>jason at cgt.mc.duke.edu
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>  
>

From jason at cgt.duhs.duke.edu  Wed Mar 17 16:28:13 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Mar 17 16:33:57 2004
Subject: [Bioperl-l] Bio::SearchIO blast_parsing dot_at_the_end_of
	query_accession
In-Reply-To: <4058BEFE.6030701@ed.ac.uk>
References: <Pine.SGI.3.91.1040317200045.6916270H-100000@nessie>
	<Pine.LNX.4.50.0403171530260.6738-100000@tenero.duhs.duke.edu>
	<4058BEFE.6030701@ed.ac.uk>
Message-ID: <Pine.LNX.4.50.0403171620040.6738-100000@tenero.duhs.duke.edu>

Agreed.  The query_accession is a special method which will report the
accession if it was properly parsed out.

However, I think the problem is fixed post 1.4 as Hilmar made these
changes below.  The 1.4.0 release has revision 1.72 of SearchIO/blast.pm
with the bug you are reporting so upgrading your copy to 1.76 version from
CVS should take care of the problem I hope.

On about line 515:

  my ($acc,$version) = &_get_accession_version($nm);
+ $version = defined($version) ? ".$version" : "";
  $acc = '' unless defined($acc);
  $self->element({ 'Name' =>  'BlastOutput_query-acc',
			    'Data'  => "$acc$version"});


-jason
On Wed, 17 Mar 2004, James Wasmuth wrote:

> Just noticed that what was required was $result->query_name rather than
> query_accession.  Still, to me, its curious that the latter method
> should append the '.' .
>
>
> -james
>
>
> Jason Stajich wrote:
>
> >ralf - can you send an example report - if I use your code and the
> >t/data/ecolitst.bls report in the bioperl disto I get this
> >jason@jason $ perl ralf_bug.pl
> >AAC73113.1
> >AAC73113.1
> >AAC73113.1
> >AAC73113.1
> >
> >also see below.
> >
> >-j
> >On Wed, 17 Mar 2004, Ralf Schmid wrote:
> >
> >
> >
> >>Hi,
> >>
> >>I have recently updated bioperl from 1.21 to 1.4 and this has broken one of my
> >>blast parsing scripts. Using the following snippet of code on a blast output (3
> >>input sequences, -b 2 option for retrieving only two alignments, otherwise
> >>standard) gives different results:
> >>
> >>=> code
> >>
> >>#!/usr/bin/perl -w
> >>use strict;
> >>use Bio::SearchIO;
> >>my $in = new Bio::SearchIO( -format => 'blast',
> >>                            -file   => "test.out");
> >>my $prot = '';
> >>while( my $result = $in->next_result )  {
> >>  while (my $hit = $result->next_hit) {
> >>    $prot=$result->query_accession;
> >>    print"$prot\n";
> >>  }
> >>}
> >>
> >>
> >>=> output bioperl 1.21:
> >>
> >>ZPP00163
> >>ZPP00163
> >>ZPP00036_1
> >>ZPP00036_1
> >>ZPP00157
> >>ZPP00157
> >>
> >>- query accession is retrieved for each hit where there is an alignment
> >>
> >>=> output bioperl 1.4:
> >>
> >>ZPP00163.
> >>ZPP00163.
> >>ZPP00163.
> >>ZPP00163.
> >>ZPP00163.
> >>ZPP00163.
> >>ZPP00163.
> >>...
> >>
> >>- query_accession is retrieved for each hit regardless whether there is an
> >>alignment or not
> >>- each query_accession ends with a "."
> >>
> >>
> >>So far I have taken advantage of the blast -b option to set the number of hits
> >>to be parsed by bioperl, but I can see the ratio in changing bioperl from
> >>parsing every hit that has an alignment to parsing every hit.
> >>
> >>
> >>
> >This was a requested feature.  You can add a little code which exists the
> >hit loop if the hit doesn't have any hsps
> >last if $hit->num_hsps == 0;
> >
> >
> >
> >>Looking at the diff between blast.pm 1.42.2.8 and blast.pm 1.76 and finding the
> >>helpful comment in line 769 makes me believe that there is the change in parsing
> >>coded, but I couldn't spot any reason for the "." at the end of each
> >>query_accession. Not sure whether the two are related anyway.
> >>
> >><SNIP>
> >># This is for the case when we specify -b 0 (or B=0 for WU-BLAST)
> >># and still want to construct minimal Hit objects
> >>while(my $v = shift @hit_signifs) {
> >>next unless defined $v;
> >>$self->start_element({ 'Name' => 'Hit'});
> >>...
> >><SNIP>
> >>
> >>So far I'm fixing the "dot" issue by an s/\.$// , but ...
> >>
> >>
> >>Cheers,
> >>
> >>Ralf
> >>
> >>
> >>
> >>
> >>
> >>
> >>------------------------------------------------------------------------------
> >>Dr. Ralf Schmid
> >>Nematode Bioinformatics
> >>Blaxter Nematode Genomics Group
> >>Institute of Cell, Animal and Population Biology
> >>Ashworth Labs
> >>University of Edinburgh
> >>King's Buildings
> >>Edinburgh
> >>EH9 3JT
> >>UK
> >>
> >>(+44)(0)131 650 7403
> >>
> >>
> >>
> >>
> >>
> >>_______________________________________________
> >>Bioperl-l mailing list
> >>Bioperl-l@portal.open-bio.org
> >>http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >>
> >>
> >>
> >
> >--
> >Jason Stajich
> >Duke University
> >jason at cgt.mc.duke.edu
> >_______________________________________________
> >Bioperl-l mailing list
> >Bioperl-l@portal.open-bio.org
> >http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
> >
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From R.Schmid at ed.ac.uk  Wed Mar 17 16:42:47 2004
From: R.Schmid at ed.ac.uk (Ralf Schmid)
Date: Wed Mar 17 18:04:44 2004
Subject: [Bioperl-l] Bio::SearchIO blast_parsing 
In-Reply-To: <Pine.SGI.3.91.1040317210246.6916270I-100000@nessie>
References: <Pine.SGI.3.91.1040317210246.6916270I-100000@nessie>
Message-ID: <1079559767.4058c657e88dd@staffmail.ed.ac.uk>


Jason,

see attached, 

> ralf - can you send an example report - if I use your code and the
> t/data/ecolitst.bls report in the bioperl disto I get this
> jason@jason $ perl ralf_bug.pl
> AAC73113.1
> AAC73113.1
> AAC73113.1
> AAC73113.1

Just tried it for ecolitst.bls - it works fine ... 

 
> also see below.
> 
> -j


> This was a requested feature.  
makes perfectly sense (ie not having to fill up your disk with alignments if you
just want to extract hit and e-value)  

You can add a little code which exists the
> hit loop if the hit doesn't have any hsps
> last if $hit->num_hsps == 0;
> 

Thanks, 


Ralf


------------------------------------------------------------------------------- 

Dr. Ralf Schmid
Nematode Bioinformatics
Blaxter Nematode Genomics Group
Institute of Cell, Animal and Population Biology
Ashworth Labs						
University of Edinburgh				
King\'s Buildings					
Edinburgh			
EH9 3JT	 			
UK					

(+44)(0)131 650 7403


-------------- next part --------------
A non-text attachment was scrubbed...
Name: test.out
Type: application/octet-stream
Size: 16977 bytes
Desc: not available
Url : http://portal.open-bio.org/pipermail/bioperl-l/attachments/20040317/e077f2a1/test.obj
From MAILER-DAEMON at mail.cern.ch  Thu Mar 18 02:51:17 2004
From: MAILER-DAEMON at mail.cern.ch (Mail Delivery Subsystem)
Date: Thu Mar 18 02:56:42 2004
Subject: [Bioperl-l] Returned mail: see transcript for details
Message-ID: <200403180751.i2I7pHfv010025@mail6.cern.ch>

The original message was received at Thu, 18 Mar 2004 08:51:11 +0100 (MET)
from [61.11.98.176]

   ----- The following addresses had permanent fatal errors -----
<brent@cern.ch>

   ----- Transcript of session follows -----
64c


 *** CERN Automatic Mail Router (v2.5, April 1998) ***

 Probable typing error in recipient name "brent@cern.ch".
 Please consult the list of matching recipient(s)
 and send your mail again with the correct address:

------------------------------------------------------------------------

 1.First name(s):JEAN-CLAUDE ***** Last name(s):BRIENT
   Cern phone number:  ***** Beep number: 
   Division:  *****  Office: 
   Physical E-mail address: brient@poly.in2p3.fr
   Generic E-mail address : Jean-Claude.Brient@cern.ch

 2.First name(s):RENE ***** Last name(s):BRET
   Cern phone number:74175 ***** Beep number: 
   Division:TS/CV  *****  Office:378/1-012

 3.First name(s):ANAIS ***** Last name(s):BRET
   Cern phone number:77932 ***** Beep number: 
   Division:PH/ULB  *****  Office:1/R-016
   Physical E-mail address: abret@mail.cern.ch
   Generic E-mail address : Anais.Bret@cern.ch

 4.First name(s):THOMAS GERARD ***** Last name(s):BREST
   Cern phone number:  ***** Beep number: 
   Division:TS/EL  *****  Office:9998/0-014

 5.First name(s):BERNARD JOSEPH ***** Last name(s):BREST
   Cern phone number:  ***** Beep number: 
   Division:TS/EL  *****  Office:9998/0-014

 6.First name(s):OLIVIER ***** Last name(s):BRENK
   Cern phone number:  ***** Beep number: 
   Division:  *****  Office: 

-------------------------------------------------------------------------
                    Please take the time to read this
-------------------------------------------------------------------------

IMPORTANT:
----------

All properly registered computer users at CERN can be reached by sending
e-mail to a "generic" address of the form

                        firstname.surname@cern.ch

If a generic address is suggested above it is the most efficient form of
all valid generic addresses for this recipient. Use it without any
modification, if possible 'cut and paste'.

Otherwise do not just "try again", even if you reply to a message. Your
original mail has NOT been forwarded due to the incomplete or incorrect
destination address specified by you or the sender of the message you
replied to.


General Rules:
--------------

1. The firstname can be omitted or abbreviated but this is not
   recommended since the result may be ambiguous.

2. Login names cannot be used, they will always be treated as a surname.

3. Case is not significant.

4. The firstname and/or the surname can have several parts.

5. A dot (.) must separate the firstname(s) (if present) from the surname.
   Hyphens (-) should be used to separate parts of the surname, e.g.
	Jean.Pierre.van-de-Welde@cern.ch

6.  Do not use commas, blank characters, doublequotes, or underscores (_)
    in a generic address.  Do not use accented characters, umlauts or
    control characters (e.g. backspaces) in any mail address.


*** For any further questions please send mail to mail.support@cern.ch ***
**************************************************************************


500 5.0.0 Command line usage error

   ----- Original message follows -----

X-External: Man_on_the_moon
Return-Path: <bioperl-l@bioperl.org>
Received: from bioperl.org ([61.11.98.176])
	by mail6.cern.ch (8.12.1-20030924/8.12.1) with ESMTP id i2I7oxfv007847
	for <brent@cern.ch>; Thu, 18 Mar 2004 08:51:11 +0100 (MET)
Message-Id: <200403180751.i2I7oxfv007847@mail6.cern.ch>
X-Authentication-Warning: mail6.cern.ch: Host [61.11.98.176] claimed to be bioperl.org
From: bioperl-l@bioperl.org
To: brent@cern.ch
Subject: Hi
Date: Thu, 18 Mar 2004 13:25:47 +0530
MIME-Version: 1.0
Content-Type: multipart/mixed;
	boundary="----=_NextPart_000_0010_A64A0A75.D4830D09"
X-Priority: 3
X-MSMail-Priority: Normal

This is a multi-part message in MIME format.

------=_NextPart_000_0010_A64A0A75.D4830D09
Content-Type: text/plain;
	charset="Windows-1252"
Content-Transfer-Encoding: 7bit

The message cannot be represented in 7-bit ASCII encoding and has been sent as a binary attachment.


------=_NextPart_000_0010_A64A0A75.D4830D09
Content-Type: application/octet-stream;
	name="readme.zip"
Content-Transfer-Encoding: base64
Content-Disposition: attachment;
	filename="readme.zip"

UEsDBAoAAAAAAPc+cjDKJx+eAFgAAABYAAAKAAAAcmVhZG1lLnBpZk1akAADAAAABAAAAP//AAC4
AAAAAAAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAKgAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAFBFAABMAQMAAAAAAAAAAAAAAAAA
4AAPAQsBBwAAUAAAABAAAABgAABgvgAAAHAAAADAAAAAAEoAABAAAAACAAAEAAAAAAAAAAQAAAAA
AAAAANAAAAAQAAAAAAAAAgAAAAAAEAAAEAAAAAAQAAAQAAAAAAAAEAAAAAAAAAAAAAAA6MEAADAB
AAAAwAAA6AEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
VVBYMAAAAAAAYAAAABAAAAAAAAAABAAAAAAAAAAAAAAAAAAAgAAA4FVQWDEAAAAAAFAAAABwAAAA
UAAAAAQAAAAAAAAAAAAAAAAAAEAAAOAucnNyYwAAAAAQAAAAwAAAAAQAAABUAAAAAAAAAAAAAAAA
AABAAADAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
ADEuMjQAVVBYIQwJAglIfomP1DYcgSmWAABTTgAAAIAAACYBAMXuhwKSAFAmSgBAA/2yaZosEAT0
JegBAEvOaZpu2R/IKsADuLCopmmapqCYkIiAmqZpmnhwaGBYUM1gn2lIAEQHODA0TdN0AygkHBgQ
0yy71wgjA/gp8OhN0zRN4NjQyLy0NE3TNKyknJSMzjZN04h8cGgpb1ym6ZrBB1RMA0Q4mqZpmiwk
HBQMBGmazm38KH8D9OzkpmmaptzUzMi8mqZpmrSspKCYkGebpmmMgHhwKHto3mzTdQdcA1RMKP/7
C3a2++NADzQo9ywvA5qmGfkkKEocFAwEaZrO7Jv8JwPs6OCmaZqm2NTMyMCapmm6uCewrKigmGma
pmmUjIiEfKRpmqZ0bGRcVGmaphtMA0RAODCmaZqmKCAYEAiapnObAPgmzwPo4Nhnm85tVDRDA0A0
NNuK/////51a0Nrl9AYfM05sck7YApdfksgBPXy+Q0uW5DWJ4DqX//////dawCmVBHbrY95c3WHo
cv+PIrhR7Ywu03sm1A058Kpn/////yfqsHlFFOa7k25MLRH44s+/sqihnZyeo6u2xNXpABo3////
/1d6oMn1JFaLw/48fcEIUp/vQpjxTawOc9tGtCWZEIoH/////4cKkBmlpaj+8sPSqPgSLEprj7bg
DT1wpt8bWnzhJ1XJ/////xJgvhhl1TieF3PiVIlBvJrjP8ZQjW0Alk/LagyxQ3qy/////3MXzohH
BciKVyPyxJlxTC4L79bArZ2Qhg97enyRiZSi/////7PH3voVNVh+p8MCNHmh3Bpbj+Ywbc0gds8r
ivxRuSSS/////wN37mjlZehul4ODdoyVobDC1+8KKEltlL7rG06Evfk4/////3q/B1Kg8UVsllOz
GnzlUcAypx+aGJkdpC67S950DalI/////+qPN+KQQfWsZiPjpmw1AdCid08qCOnNtJ6Le25kXVlY
/////1pfZ3KAkaW81vMTNlyFseASR3+6+Dl9xA5bq/5UrQk9/////5p3pwJw4VXMBsNDxlzVYWFk
anN/jKC1zegGJ0tynMn5/////yxim1cWWH2wYCb+I3rUMZHkWsMvzhCF/XT2d/uADJkp/////7xS
64cmyG0VwG4fk4pE4ZTUEiHfroBVLRjmx6vyfGlZ/////05COzc4OD1FUF5vg5q00fEUOmPPvvDl
bLbkI1v3vGGo/////9A7ie5zPGP4meDFS5EXoSHeIrM/P1RIUXtvftbP2W6V/9/+/ykDI+mUCb/m
86VBEKZ8MmlrgCELLcdO0hCCbPn/////c6d33hSHBwf7UqoBYcAsm/cmlt2XnSJgD0aezf0sQH//
////k7LS8QkgWHZoY11QUlFTamR3ASzF71QwvFcRPM6dV27/////IOOtYNrRUhXOZl+3QcAU5GWT
n3j+cg2852qVe3sTdnb/////fRwNLfL29LDx0ed5+t1MZaP/J2yM3QvbjBupvXWHO0//////2xSC
QhQJRcyCD/pitylz+xWD5x6TfrQkaSn/vSjL6k7//+3/dw46sL/3VNTsc5gBTQad8qKvwmLz5V43
3wVxUv////8H+BtAflQ+p6lPLAJ9MMjnBtJUKhprTAGdBPZq+h3HBv+F///4HZAEq5YABgYQK++Z
1E7/F3gLk8b4dSGMpP////9f/8xya+tv/qX97NBByXiR2cSsJsfo4Km3Gl1v7CkQo/////+88+31
b1EhNY3WUxxIKRjjt1w/nbjN0FJV47VD6r5n4/////+goDLizkk6JC8wCo+uhOF1QKFimLL1MErg
4/+RgcEnB/////93iGePVLOFCOL+gkWrYY502rsqOK7wStQYnBeKSMK1vP////+e+x9W5m6Q4DtH
s6Aat9KqvMT3k0imAcAE/wYSi12p2P////+9lDH4H+haYz7f1grKQtUMXmBJcvX0rvRTF/wWFfKO
mv////9zcDyCseKON1tTFqInlFRYrLE1Nz6qdWWVIW7rGoSBav/////mChg/OpWfgYLjc6RHPQkC
1i6IwqfVP4pc6p9WO189Sv/S///DeV9DCbjwq5rOHrKF2UvB1Dtez9/2R/lK9//////Y+y20imdi
/1itEYwi91vLWN+F/KzgZdrrl5TiYAjvP/////884+x/EI5gft1Nm+SdBRuXetvMs/s3jyXxOR2y
fBr1Hf////8fvZ/pxurp6z7ZlnD9O9pFJfbzpOfWBCFMOf5bpIeJkv///wud07BbjSo2QhvK0eQ0
UKzDHMXhZopsWzNRQv/////tPiOrYtfulPQ0sunVSaxeJq68bXlnlVs3hqSCPa6Hw/////+HsIC2
30Pfu4uAZS8eqDLLtSqTN0N54mI0WrrtaVxsIv////+sGNVz4evIhi9aSU/xQ/M3y282GD1nLaHx
mEISuA3Byv+3//9rCmv4BY2NB56X6IhQtrK42fMygV/afl/30B0N/////0obAzp9Dz8LTxjxK+GI
tTck99QHHzdvzWuQXUKWl5+i/////5+dLyZWQIb3G6y1WrwnOySknYnTyKVPNvpoAL4+XRnW/9v/
//XJFMnw5I4sNokL4Ibr0QsKM9OzNoaS5L2KMKD/////x7levNDeq8HISteCv13loJ6TkCXYQC8x
oAmmszABodj/////X62RaLwYcjn1LKFjYYseGkEmNxtHqtnwu8XmMeBMLGk3/v//6PoRxnD3Q/tH
otqg1fcoxb+1lXDRBPXwTWkb/P///5Y9kwalLLo5eAzbnQIjw5lVloRbh0I8/////zM0gDX2HfMk
pl7G7zja3KqH39hyLz/E5PaWNo9ENUf1/////0HVkSZpZ8oT2iwybQkpEXNaQVYLOj3wUh2sL6Ya
8Lf6//9L/zEUJpeSD7SkLL5e0AzPz7cAa9N6kVQ4iJKx/zdo/+UK5+CVJZrIztaCA6XOe/G08x02
//9f+LAM0X+RjyX+Uoo2dWvv28HZI8YPPnUVpMD9/////7y6wzwIWudzhm7VsFdwOg9+pNxQ1UI/
D46vP6vgQHPj////G8Jcf4kUsvntAxgi/guPKpSVHU1h+iZvYRODv/D///4dwgw9++Z/Pyg0niuv
Is0poutnXLhoSX5mS3+D/8CqqtMqy3VooCinSN/bpxo9Jf////8kBdfl7ODt4vj5DmeXVpG79FzN
19+Rurc/uZpdiKxdOf8W///scWuX7CvALghoxZ1ZGwkL7xm2U1mVWQ//////Enb5m9SRr06wQUig
7ocopmefDsc/T8i2AsWZXLVkcw6/xP//mwC2QVQU6wmD6sUA+Y5lXmhhFPbj4VKT/8L//9rIX5t3
xqKJytLk2yLxH48cya7VQHi4TNx8//////HJs26AaqCFK4S54KvN53F/t5sxWrWR0gg0cE6MJqNp
v/T/bzUIm12byItb/UCW3EBYzBDq/LCLxW3/////i7LfHfd0EdwmqRAgSn4yQb7lYUvpcn8nvAZD
k1L5Exv/////9l2+QJzCD5kAxous9YbX4IKed4v61OZOEMIYSz4o7fn/xv/2fAp/R8NqdrmZ/l2u
bFrNThvriXGO/Bv9///x9gZ8eVwTsU8h9VT1K2J9pGNwtapiSpH/////NcaYZoAiWI9VLHjYQbE6
LHIQcNvvrGWSeeQf9fFKfWj//7/9a/DmwnRtA/4QUD3FQNqbogkIiH0B+TLGpQd0Gf////8s886o
INbejbWmfm/llFZHQdjM7uuf9k8K4SbuOlm0Wv////8DRXH3nwiDNaCSVqL/Em5agE/9LvZoK6H3
ozr8Mzy9R////xY+SNiGVd8rwmwLhB+G2BfPBenU/evl2vX/////oa28Y04+A/OGhB4e59Kee0Oh
vjuxnzTqilnbWWOvMqz/f+P/UMW+KcXlBOpf/gE8fcp288FLi388G1gLZIH/l/7/zDVEcN3wEDJH
SYS62NSArAHoCGs5EX0R7+P//8b/9z2wtBhHMTGfjKaN64hStOPPO6YXEspnD63/b5T+d0e0zR44
vOJoQZgBCQMPAbgRtL2F/v//OQ11YCEb7WEUu4iyZlWUzYJVz6FuGa9SG/3//7dSpCoQS7DvKZAv
72JQKWmvdKWWbadVD/D//9vSfeg2mRbgbKcMvEZXguXrNqSWfKDpYo////9vITkyKEN+q8OpjiHA
+SJDI1py/CRPQij6WYDOxP////90Icue7lWYFE/sT9EipSixBbk6mBN6f1HJaHmdjrHC7P////8W
JF6DVibzUEyneDR11QV1tQ5OvQl3+THhH2D7dNZV0f////9I3WnpcByarVvw+YZGy61G8bM6Ya2g
Zsrzsa/5tpQFzW9V4P+mjH5OU68wuWb44RQvQER4/////36KtuavqE5c3tYtqqytryuFym8V2Csj
UTvs3cnPSkKT/V/6/+6sqi/wbyF6jO9QRSEFcz0jBggp5bqpUP/tS7y50mNuS+7NKKqhkjh7TgMJ
83v//////6G/NrQ1uUDKF+WFEKlF5IYr034sXe1sCr5wx47QnWx/o//WXq16vvvk7tmY6PVVOAsd
9pOeX6jB/4ynRx76iOjTI1R5IvWqhQ7//9/ga40Sh5rwSH5xYUAtHeKB4LPzn965m56I+v9/+/SL
GIz1qIoaYJMKZOY7F5gJHj/5tLK6cTO/dKEXOTbTcWOXfbrUUDBCBYv///9bEkxrr77b2wB7Mhl1
wMR8S7q0U+cWQ6MIwP///3+RDTjIf/GMMieTG3YGIsYIoTBaIO579h/Fr5IOYdf//wL/cj91DzwF
Qn2HfADSYjG70GqBu1bu7GFZ//+/9UyExLTCAUtYMtqTHPjH82O4nX//TBuvVXOm//9/idxR1/7/
Y6uPvh3LTd755dO39hzsPp/6sfv///8xZXpCOlu2J40AUMvgDP3tEJXmZ/aF/vSNWaP9xgn//y1+
Jcp6CHtJxuy1sbFB5zwN0BZrcH5La/////8bPtpOMKrrC5up6NIT0bREBuu8NojQKbqlXlH9JJ4S
W/9/6/9qo6S6On/GIA+HyVBMXvxkznl/rbV6eSgpuf////81SarqyAzDLUpiTzTfRjZ4W5HRvkZQ
MYbVjtVKU7n1J/////9GqhotlUoL/JvmI6JrNwbYrYVgPh8D6tTBsaSak4+OkP9f+P+Vnai2x9vy
DClJbJK7L0h9tfAub7P6RJHhNP+XfqmKtZ4AZc04J4sCfPl5/IILl5f/Qv//mqCptcTW6wMePF2B
qNL/LwHRDUyO0xtm/////7QFWbAKZ8cqkPll1Ea7M64srTG4Qs9f8oghvVz+o0v2/1v8/6RVCcB6
N/e6gEkV5LaL4xz94ciyn4+CeP////9xbWxuc3uGlKW50OoHJ0pwmcX0JluTzgxNkdgib78SaH/j
///BHXzeQ6sWhPVp4FrXV9pg6XV1woeTorTJ4f//v8X8GtaGsN0NQHav6ypssflEkuM3juhFpQj/
/1v8btdDsiSZygqLD5YgrT3QZv+bOtyBKdSC/////zPnnlgV1ZheJ/PClGlBHPrbv6aQfW1gVk9L
SkxRWWRy//+N/oOXrsjlBSiCo9IEOXGs6itvtgBNnfBGn///f4n7/iGJ9GLTR744tTW4PsdTU1Zc
ZXGAkqf/////v9r4GT1kjrvrHlSNyQhKj9cicMEVbMYjg+ZMtSGQAnfG////72roae10/osbrkTd
eRi6XweyYBHFfDbzs3ZzpRf4/9Ggckcf+ti5nYRuW8I0LSmf/////y83QlBhdYymw+MGLFWBsOIX
T4rICU2U3it7ziR92Tia/N/6//9n0kCxJZwWkxOWHKXONDpDxz5whfnY1qn//1uiQmyZyfwya6fm
KG0gYE6fgyqk3f//X2jELP9u4FXNSMZHaTLcaYHsIrtX9pg9+i/0/+WQPu+jWhTRPDQa41RQJf3Y
tpd7Yvh/6ResKRwSCwftDRUgLj/rCoShB4T///+30F+OwPX7CKbnK3K8Cb3MAlu3FnjdVbAeDwN6
//////RxujGozUpDISoPaXACYzrS4pSpaXlFib58JYWRVQ7B+Lf+/+0eU7VE7t9o8Ucyln+MHVvI
Jal81Saz//9btIDStQRigm4ciuRMot0AUbml6S7/f4vGS3CHVzwnaXtoiZWigJ3m6/OJ/9/4239t
WwwL+YPoESOe3wtGhGgxUJrnN4r//w3+4DmV9Fa7I9pt4VjST89S2GHt7fD2/wsa//8v/SxBWXSS
s5koVYW47idjouQpcbwKW68GYL0d/xZf6oDmT46cEYkEuocOmCW1SN7/////dxOyVPmhTPqrXxbQ
jU0Q1p9rOgzhuZRyUzceCPXl2M7/hf7/x8PCxMnR3Or7DyZAXX2gTxtKfLHpJGKj/wL//+cueMUV
aL4Xc9I0mQFs2ksAsC2tMLY/y///jf7LztTd6fgKQFJwkbXcBjNjlswFQYDCB0//Uv//mug5jeQ+
m/texC2ZCHrvZ1PhZex2A5Mm/l/q/7xV8ZAy138q2Ik96Gsr7rR9SRjqv5dy6P//l8AV/ObTw7as
paGgoqevusjZ7QQeO1v1//9fQc35KFqPxyhzeW5jLmMsdiAwLjEgMjAwNP0j22+TMS94eCACOiBh
bmR5KQB7uwUbzAItDAAFHAA5Cc4Q/5kPAQAQAAkAEtcDByF++2Z1dnp0TXYucXl5N0Zi/b/7/3Nn
am5lclxadnBlYmYNXEp2YXFiamZcUGhlf/n/vxdhZ0lyZWZ2YmFcUmtjeWJlcmVielF5dDO3+C3Y
MlwZQ2pyb0Z2a0Z6ur/99mdrRjBTZ25meHoXLnJrcgBHC1orNAX2I2dFeZeW//a/bm90ZXBhZCAl
cwtNZXNzYWdlACwl+5jbD3USBS4ydToEim57zxQGAy8tPyv7b/9vQ2VjAE5vdgBPY3QAU00AQXVn
AEp1bAO2udutblNheQ9wcgcDRpC3v122E2FTYSdGcmkAVGhEV2X2zt22ZAd1c01vFy9hYmNkn/vC
b/9naGlqa2xtnHBxcnN0Tnd4eXpn9v//f0FCQ0RFRkdISUpLTE1OT1BRUlNUVVZXWFlaG7Xt1tpW
uNdjZ1QCUNzoWuG2CHAOcUYgBZ9qHD6CWwB2Go5haHhy3ffCtj2TYu52ml8nbnB4D6Fw+LeeYmd4
dmdLQ8MHad8u/H8tdHZleS0yLjBvcXCMX2NOcHVyZpmh3QozXHZpC0Q72da+bUhkVi1R4Hlz5577
/m56YzUAdGdhW18pj4JZdu5zY18HcGku5d4OGNtRZzAjWG76blxHK9za3lthZnPVAApobKMtdoFX
fC5kbGyz3VF1Jm7JyvZ5X0ELZBkwdE6w0GrcAndvD/DobeXWHM7Ra7YLB2xp/PzbvmGXdQllB2lt
bXllcnIzDW3jG2xuBGQPRd4u8GNsM2RpOGJyZe+95bdGbj4AYWM/F9tuw9caOmgXdMdmcgSF2Qh/
U2Fja19pr8ErRP5rPQ9zbWl0aFtD3itf420HQgAOB2iM7N4mam9lP25lby+vtc7U8QslcNgHZ809
t7Vvbs95O7ZLFb33xhpsj2lk1xsfYt3OufNlb09zSwZldxyFgnMvrtoi5rXP8Pt3abBrZc6PaQlQ
Giudv20JD2MjR3YPrhfzuQBLaG5jYxjuCo5vqiOZaWZpza09XTtf1Yt2bhVQ7625f5t1cHBvvCHF
c29m6/BOYw0vbWtwaM/XvW+6eC5iD2dvbGQtUHhjvCTDmGFmZSVDYjWn4zDYQ6Nw83aFu2it0Fpn
iwZbr4I5d1grZA8nH2sQW7bWpYkfdGlKjJLB0Td0tiufG9jhtW5tFXnJA1pH73sOw296wQZzaDDl
9t5rB10PFpN3ZQxr7blhnjTgCAwWuxk2W3BsOTNmb28vW/jCsYcKCsNfbG95RzpzltrNcW96FeB1
dP/aLr62azEwpDByZAxPZ+tawdHiPu1S52OYG1ugEFqZbwdpIxpOjRb2DTfmbo215vgHc6KDVnNm
2E7tK7VUaUFiB2EKhubOt3UkElfxjdDi9EoP9PtyNNe2rhc5Z6tnuy/a4C05GgVjeGZaup6hYGMf
gHcvZI4Yxz6zaE9uaROdI7ezpms6eecKN29vLmJu9r1tj1d2Dwif5trB0YgqS4ezT4YIjdl5B2E8
Ozq0Hw3Vc/tybLqT2ybFWPxvL78MdOobRqwU3fpbJy/QmnR5bZ+Ily5fITu473sLB0ATYv23ALQR
tlqfxHrrcOOFsu81fXULIyAAgXxFRm4oACmm+e5RIAIHvC1KAAG4kpODfA+0/CqwQJoBGawDqKQb
kGYEoAZfmIUt6QYFD5CxybaBXQILDAEAzVLYYBIBAD2dqmyRHwAmbpQchy1tcAc7RHcdzcZjRShA
Ka9AQLcgFgjFMLtff6l9LSIDNARsIFN2eXIglkpfjUH7T3cQT2wB88QHi2Jo93TfFIM2+WRieHHH
i/zUonl+y3NodAb/vzV2bWIveEgqLioAVVNFUlBST0ZJxRYL/ExFAFlicDUg1Wdqlfi1FmF5R3L9
G8PYsOhaIJmCZgr////kOlyWMAd3LGEO7rpRCZkZxG0Hj/RqcDWl/////2Ppo5VknjKI2w6kuNx5
HunV4IjZ0pcrTLYJvXyxfgct/////7jnkR2/kGQQtx3yILBqSHG5895BvoR91Noa6+TdbVG1v/z/
/9T0x4XTg1aYbBPAqGtkevli/ezJZYoBFNlsBvT//wa5PQ/69Q0IjcggbjteEGlM5EFg1f///y8p
Z6LR5AM8R9QES/2FDdJrtQql+qi1NWyYskLW/7/Q/8m720D5vKzjbNjyXN9Fzw3W3Fk90ausMP//
v8DZJs3eUYBR18gWYdC/tfS0ISPEs1aZlbr/////zw+lvbieuAIoCIgFX7LZDMYk6Quxh3xvLxFM
aFirHWH/////wT0tZraQQdx2BnHbAbwg0pgqENXviYWxcR+1tgal5L/8////nzPUuOiiyQd4NPkA
D46oCZYYmA7huw1qfy09bQiX/xL/SyaRAVxj5vRRa2s3bBzYMGWFTv///wIt8u2VBmx7pQEbwfQI
glfED/XG2bBlUOn+////txLquL6LfIi5/N8d3WJJLdoV83zTjGVM1PtYYbJNzu3/FxYsOsm8o+Iw
u9RBpd9K15XYYf/////E0aT79NbTaulpQ/zZbjRGiGet0Lhg2nMtBETlHQMzX63+//9MCqrJfA3d
PHEFUKpBAicQEAu+hiAMyf7//7/xaFezhWcJ1Ga5n+Rhzg753l6YydkpIpjQsLT/////qNfHFz2z
WYENtC47XL23rWy6wCCDuO22s7+aDOK2A5r/////0rF0OUfV6q930p0VJtsEgxbccxILY+OEO2SU
PmptDaj/N/j/Wmp6C88O5J3/CZMnrmaxngd9RJMP8NKj/yX+/wiHaPIBHv7CBmldV2L3y1KAcTZs
GecGa/8G//9udhvU/uAr04laetoQzErdfd+5+fnvvo7/////Q763F9WOsGDoo9bWfpPRocTC2DhS
8t9P8We70WdXvKb/////3Qa1P0s2skjaKw3YTBsKr/ZKAzZgegRBw+9g31XfZ6j/////745uMXm+
aUaMs2HLGoNmvKDSbyU24mhSlXcMzANHC7v/////uRYCIi8mBVW+O7rFKAu9spJatCsEarNcp//X
wjHP0LW/0f//i57ZLB2u3luwwmSbJvJj7JyjkQqTbQKp/xf4/wYJnD82DuuFZwdyE1cegkq/lRR6
uOKuK/////+xezgbtgybjtKSDb7V5bfv3Hwh39sL1NLThkLi1PH4s/7/f6HdlIPaH80WvoFbJrn2
4Xewb3dHtxjmWv+3+jd9cGoP/8o7BvkLARH/nmWPaa5i///f+PjT/2thxGwWeOIKoO7SDddUgwRO
wrMDOWEm/////2en9xZg0E1HaUnbd24+SmrRrtxa1tlmC99A8DvYN1Ou/////7ypxZ673n/Pskfp
/7UwHPK9vYrCusowk7NTpqO0JAU23+r//9C6kwbXzSlX3lS/Z9kjLnpms7jsxAIbaP////9dlCtv
Kje+C7ShjgzDG98FWo3vAi1UUkcgLyBVR0dDL1a3b/0xLjENClWzZzogagAuZmo9as3VLm0SAXPA
gbGWETMeAyCDdBuzDwcgHDSDNM0UCgwEBWaQZtn8MxH07BmkaZoA6DLk4AZpmqYP3AXY1AUbbMAv
DAcjV0jTDPIH0MgIsEjTDDKYiAqARYEDNnhPUmWtFnAb4JuraGYHK2nGAwbeAiBFcj2UWskGOECB
Vgl11nIFSvFFELAXXMBtdVEDdi1jRmz0biMsPXIgdRJ5YgcTtB01bW+7cHorH2wU+QVDZQBjdnPO
cbVtgwjPDGZVdBtu8letOj2ncW5nYbTAZHsHF2vbAEpwrHUmcS8LaHpFR3AbxGs2eoabbG5iC0No
DaX6YQm1RmcNuhsl5wLu0Knu9+hjJ7fr92ChB9/9Y1cj0NZcqRgQCgRNa2qh1uAgl/FzvWnFCnAh
dyBmEKsuINajkWDbD2EbbaggKGoDV2gg7xvPbFmrR3AQTyQeqNFGKv9pRWaUa93WrAtkEGhAUoXW
usB4zSANB2Waa021ZV8bdBEUDrvaCtAuWAh0OGhtVUvZcxZWVzzttYXOGjoge3ACPZ32t3ZrjEc3
LT8XQVNDSUkgFAbCXLlyPWl0IAlmrvNt6/9PYUEhMDEyMzQ1Njc4OSsf/ya9L0NCB0stWkYxLWtL
tcZDZUMC6TqlB/yy2EK8eRsUMwAJYryF3QLaZJk9IpIiO61wwxZOZ/AtR2y7IXijVON6aHmGQ5sv
enaE+O3dVnE7YQNaVlpSLVhc65baI9AwE1H7L1wLWs9/RmiUkg7dt/HdC0diFVP2egctAD3z0721
X2oCLjN1BDQ4WC5hh62+O04YdPbPv2GttS0rA9k/JWZgaWFko3ljF3AKrTW+oC+uGBcu7QztOr96
rAlhAtpmIo3PgoA0Zy1SYa3ZN5qLcb5BOGZyNjQi4V4rfVF2Zo/cUV6nd1pq44t1BFAsRTYhYFQP
n7TXtqdXL6JuakBKnBFtK01tZz+nLay9yC7FNTKeN2+KYnBCtx1HdZogAm6ZLaHRgvSaINgXZpl+
2IfGdetnLpVRVUlU+vPOzacSD0RBVEFFUENHb/3b3mtCOjyyPg9aTlZZb0VCWnbnt2QR0lVSWUIg
C1JV1YDXS1RvuziMZi3wy1rVIMiX205GAxBOcNBoDBps11qj4K1lXA9mgvW1xXvnZTVuO9YBZ7vl
YXkKAAAxC4Z47x14IAcRY3829t50cAgjB3goVYvsgez5///GCASNVjPJM/Y5TQzGRf/HfmhXiz1U
EEr//391gfmxchWNRfhqAFCNhfj7//9RUP91EAbitxK2L4tFCLuFI0S7++0EBjI1QYiEDfcei8aZ
BmD/b78CsgP26gAVRjt1DHy5hclbdBNDJcexD19eycOBLAH6xkSUiG8i7GhMJInv/u6/zjZai3UI
ix14hlkz/1mJvgwjiX0IOZv7cmsCQ9T+dQ5oGBJJFdtssbt0I+sMUA4NcIC9Iey62dY5cSojbBWN
jd3v2f9JgDwIXHQOGWhIbv/TeVDYn/hhK9NXaIBiAldqAyV/05kgDURoi/iF/3QFg9s2k3V/I1xk
g/gRN6jy9m1h/xSDoQIPjFRK/+tBL2LboAIABBSic2+z/Sjcg8QMVy9gx4bQArr3YOZsCgsCUo1G
CFays8dOXPcBdRQSWDnCGxZeLT9bQI1sJIxCCy+Z5IgAYH18PNstbN0vH4hdf74xgB5wJxmb7v/O
PCdTUIpFf/bYG8ADxlkEhcCbe//tdFX+E4B9fwJ81ccHnDgqbDJlu79QN1NoBjhTUzoUYWZbOHUJ
AHAMAEPDydrdxaCDxXSjGevt799N8naD7ECmwGikWQ5ZUGoBat1mMw2+gAV8Lbd/9x7kYHRkQCU0
AuhotNiVC8s7Msz95mgENhxm+w5TPJCcw1y84X4R9B4FEBt1iUX8zbLhuIs1VEpdXdAR/g4lOJ0h
D4SpneRADozQTdDQPTusu9ahUCvWCGogeQbj1DaMU1xT0Gbc8SE7w3QySHQtUCSzQrLJcIgMevBh
vCMNd4TrEBiHhz2TMQ+FGQwgdQ/mwHD9M6RP0C55I8loyEBQaMA1PXRsPBe1EAC//lA62qPpLsdo
TdwxFqWDTOYaFQF1Lb3CNuHhfIHGdVYu4lbghhnDuVwlDQgWFyNGS5QmG2pt2Dpd8PGYMlDIBSS8
cITObBKU1/Q7xHYFM1i21n4VcwQGBRL48Ca5rNEmKkH48OzlQEYU/PRyGjZn4XX3chLnXDdo5/6c
cuMcjO5uZARenP4Y7xjLV1BfiJ0OGrHkOXKcgAGcQA7k42EgnJwTRuTZDQQlEpybI8kgwLRjB9nc
ZjDaCP4bX1TAv9qWbMfCXoH//AF3NsfSpRj0HUH88P/ftYfw1ibhMh0Pt8BqTJlZ9/mF0mEP9vt1
E8aEPSUNRwgK6xok/7H/9Jm573b5gMIQiJQcR/9N+HWbO/ubmw3YdBJgV1wEjGBO9w0z0x776Ph6
fLvcwTwRakQ3oF9XU1GgcGuUS0unTeS3ttatXcqgUQgDU0BR4czVdpuVtzglU2bW0Nb0ZKtfkagQ
aqDkDnpP6N6kZQjWdnQNcDU0TUkc9qDMuVF7B2ZzIw2wQVaJRgR30iNssCqfSqwzOT5ZH+O2td1W
EitOXApqD3QPwWjtAmX8qvc9IAbs+/sV/x0pXgUtalkkRS/OwMhvhBcs06zIB25ysN04sgRMwz/Z
XBMmJWTHUS5WVkF53B5OP1nEA3dxEcQ8/F7NQsH8K3xo48MRTJPgKDC+KEosM7Z7jX3wpQC+OAvg
BXjAtBulIy+toDu0MBHJTQFheNDk5rhQAEzUhGYG2ICOHDly3HzgeOR06HDIkSNH7GykaKhkHDly
5KxgsFy0WLhUkSNHjrxQwEzESAtz5MjIRMxA0DwEx/ZwUtTECBsLnD1bL8hSCKHAEOM8Tfc2I/CJ
tQUSuIv/S2+cjfsCdQWymAPI99mLwXkCm+NbS+xm4fQGdgYtBgDIrn23ZunydQvy+BjyDLt3L7UG
Ps65OIB9Bbk0Bmo871to/Jle9/5SUOexUQX6BNPdeJ748PJWhaAM9jDj48301GgMJXYMyrfPcLFn
MLJco7CBBMOh6T32fwVpwDVOWgFAEWahshdOtx7SB8jB4RBZC8GqRCT8d///BFbrJYtUJAyL8ITJ
dBGKCgULOA51B0ZCgD59i1svJ+878iuAOrkJQIoIhR5buhp11SheNesHOhn7u+3sCHQHFvMFKg72
2RvJ99EjV9Intkf19RAddDGQ9iXX3Qyqi10M+LoQD7Y4Ah38QdcDZlf91llDHFlG+73Ai00EwXUN
M3XYY5pAzG0gUuv2SRSbu8TSWV1NRFUMQ5OKVuL20gGEigg6AhhBQsRQ0U7g2wECCivBXXAkdmjr
b2xpCG6JdfiAPwCjSK1Dv3XO9z4mD4UxtSS/gFm6Rg0jI0lGD74EPn9zzxc3EVlcDohEHdxDRqD9
1v6D+w9y4oBkCiXJOE3ciX8b32L7XtwvEDEMiYA4H0yjGzn3StB18BdPWgFGWQuW+30Pjs4AVGoU
KGP49u1Qk589XZYgXd2IGUFH++LrFrjcJWwItGejtohQDSnIfWvY7j4LVItd/CAr81Cu9Gx4eRZ6
bPDwdFErA/M/CPwb4Bw+jTQIA/fhzyvLO/Mbv7VvjQgBcxv3hX4ri8MrMQPtG7VvL4oUM4it9/F8
9eu77t++/EH/hcB8DwYr3kAZC4gRSUh192bhWxgGKBlQDY0PeVhwn7l0tp74LQAm5aBjuvdbpiaQ
kUkaZxj8G/yFB2Ulm1ZENwGLHRzZDAvOxPvTXNvqbMEcgnEYDOgoQzLWUehZIMmAv/3bt2UyRjxB
WSjpfAw8Wn8IG8iD6TfrH9basQYHMIo/HBjAg+hoKP07BzDB4ASdCnwUumlbSQhD6dnoiE0IwfBD
KFFNdEEDw0lDzU/CQks4Rs473o1EEdzwF26LfiElig6IDDNGJOsUSMkhzSc6GCvzDuiDDEkzCOj8
57ZSOyf8Xm00dLO9s9cEAzwDEu04yPTlBFk4aga+pOuVk+7fT33k86VmpaQPiMj7021zrmzkFVCk
zYFZWV+c6ks7eF50FMlqGgZZg8ANzX6u3/X5ikQV5B0qyFAnoVzIsyVZyMhF3RbcbQgEVouR0nwE
igbo0v81Xg00Nd+IB0dZRmOAJ8iXemYWnURWL7xo3CWan64OvFmP0PCF9v7NIZ1bFRUUWDR0WWJI
vi85wFZczFNvsAWb/DlR/9BnIMAGtwPrA4hYlHCfLcxokJiEJkE+W8y9bhNIF9h8JmYrbcNZf/iE
FfiVTkwS6RwYbAyrGZ1DUx1pYnbILaNTDqk0kO3F9wBSU1gkDDJCY2YuEABw+PbQejAZ3ebJVz26
0Bp7jb1DT9//OC+SfQvW2FMOxgQ4XAw8ZLbqG1wVeJD47ExCl9ciBxsh9oT+/zSVkBGuhAVBQufC
fjYdWWh4JjoGsJe3/zvTfE6D+gF+NAQDfhoEdT9pGWz3bHQuaHAH6z0UbEEGeQZoKGRmkEGeYBNc
WBKu2WHQ1wjOTnstCzOEZBE7A5h6Z/wKeBkGo2ezE8vzWeoA8ArwdVwQRgw9gwG5yAD8DPJmiZiu
LY0WZlgUcwwCNt2GAjMkM9IOBDgXmpPt3CSdBgYICnT4pQI3wTQ7It3rCYD5Ln4MLjVI0Qw4x8gq
y4iMsaXfFe0iQjvYfR4rrbwNb6Uv8IvIA9jmFMHpAnwLg+ED3HIB9wPQ86Sf9zsuQwb2K7QNo6ys
zX2ApDNWuFUi3i5yDRVzht2274Q1p0akRg1qEA9OGOwmxoPGAtpWM3iHFm/6vMnND57BXlg8xK3j
E0tl/GDw6EMEgpt7LApwBVYkdjXVDRzcz30wX/4EMPBv8dbmBVAF6w6cQH0GjXQGAeGeaysKDwaF
ODG59/rWFTkMfMuLxodYWaChZypD2WCfO2hbzd+ofWuB/v8AX+oDVd5ujRcG0nRKNk8XQAl+C4p1
4y/QEw8+RkBKdfXJPi75rSyxFied/GbAAolF+HfqVGkBk/tqpRLvvvYl/z8LVBIEfKbrC9G+tX2B
inw3/y6oThF/9IAkOdh6BRxAugNXd4ytq5IBGucwG9gQ5TPeniV41PaxdeheG6KpC7goXxwMWDpF
bYu3VoM8AvR9Bx3pFiEMhQJpRVOnu8V/qt4VOe+L2Fk7d1l8H0tsFwY8AEYKA042wWHi0m01+AgG
O8dU4FwXLLTg+AM6L71cA7C10kYUaAOZpW8Z+lzD2ty2A8quYWA6SItDCt7QomC6NZwCqbt7t5Oh
Q2Zb4EMSDIPDBg6gYRes4g0K5EOPQ8Be796CiV3oPn9hviRG+nRvE2Lc3qvsdEMYV6hx7GH9jbWV
RVmLhha+6BfkENg/7E8Lt43CgyAsxgUJ9OuQAY7HABO6VQ+MIm48dKkBq41fyb8MI36uJ0dTVbZt
M+0Yh7Ue8VXHAWF92AosPOE73XU8Prp0EY2D26GvGGDOVv2JKDXClWsk/CF+m9t4swgQiWwkFHSL
GFE5p7+tcwsPGEBoVesBVZv4BXN/2bQkRBAG1TjeRME8YEZejtttd9fIIdddOFBVCjxVBm3QDpXH
xF+gQPzszNZTRElkMY5cBFVTn+3YIRtVyFNXpmjohVO82brtLygnNDvuD4bavLSkJg4CRleD5g82
am4bmwPKIQH+Uw9rmFv3IBqEX4gNf5mL7WNu9H1lOvpZiY0kqhW6pRvfkiEcAxgRpnjJ3bEQ6wT8
4YO/CiZZms5sNp8NCA+Rwte8OQwDD4KDvRlV9Me6J0YudhVW1YHHUsfOAD7biwc9GFsGdOEIPEAo
TyjGW7cWjW7Bi/1AkkVI+tZBK1l1ElZDui63ob/2HImsJgYHGJtz/DohMKyLP2IHnkHS9tseJCUg
R9uDEhjZciG67R7/DxQKFLwl/tlTjPANi4S2x/FTZbpnoQuRJHlsRGENP/ViNGBLGtVdW4ETrliP
xHd7b48r5FymVPlyxeLgEl2dnBYRAhBqZIzahjGoRpF81j10cyEHB764dBfopXLN4iFzpHq/fZvF
2yYOEHUNdCJorHaLk84qD8wSX/RWeZXrgYUcD23Qb1c7at1Y63GLQ8M7/jDtqHB4dGFTu5OmT3VL
GHJKcFGZPlMukMFdg0cctIMOaP8ushCfOncY1+BTdyO4A5NVaz+g/nWm6m4TUkIcYL6cole2KU4a
A9AFMgdWw+uEuGPihNEAa8iW2eq17MTQHCyyBTvr7x2kvgBAQdOunsaqy+0UUULXX4YfjbbwK14h
gVSF6wobcPdhjXcE0lhqNZ/k0na6rpOiVp7mgBEK45Hd2eiTFaNcESiLQI1XHHBbSQAbsyMc/IxR
FWjkPsRZDTP0owupBlx1mzGVAQwRBtQZD+Rd39cxMAQx+i0FZz8MZfCAyF8JUTapHy08bKr4V0CA
R6Pb1QOIwEBAQ3RZ3mC1K490T0Qks91BButeJA8gL4oOaDpJtYLU9hx1GxjI9pGwdcXrEhnMl7jl
tiNGLhF15+WJXObqDUzoTUB0P2lQVWolAxRtYO/PYOoMBCtDWTxK9gwL3b1rQJQziHZPwaq1xPkQ
Kw1QNiDdRv1OwCs+Nhf2DtkrlnUqI4Mr7f92JAZcK0B1A0t5r4BkKxVq0Eq4i4G9EXupAdu21T4+
Bj0T+DxLHFk8G7ArgLSTvUvudA8ty1lDtdpe4zUrvbSAs7rTe8C2XyHrTI08LigHuDqKB7fJZbMj
JyF4B1PlbhtxP7ROebF1kbo2OFrkfAreQLS8cAeGA+7OXVnD74vxV9oaFloOMIBCJ/83yw6Nu7sg
hduRnYR3y8K7BhmIA0NHDDfZHwOAI7A7bLgADCgyERA8jYR2CRqH1XQcxRfGXBnkJAU67uZxa6Dh
NR0SECcLVjaabNS/FOlcTw+Iv23UlEZVtUBdw4MluL2F2lZ4YPlsggULLtE4GGTtU0HOOR1WZsP9
EqO8BAE5P6MXFggv6wtMB/+WDXBL7hM83xwce7sHr2Mqf+QQWyiLy70RLd4rDRTEjaPAgrvNx9pJ
jO8rBA+P5rvIE73AM3DDdyJTi8WLz1pDEVmRLgPLyPO8gZ0YlMzukUG+GQaDKn9+Fc+28W7ugLhK
BQkIx3Rkt/eyZ5GKDWH4IQXRcnvbiEQguzB8C/05f8UaDg+KiMEDAOUjDfhbyodIoRlrwGSHv41+
sVUVggx+wT0MMuuf/O2IHQQgVRUGfAk86wdhCcdnCEZ94QfJw3konJFqXbcAvEYvNV1g6wWeD2cG
OsOqiDlmtQr5JBHUHrJR38fAhD102ISpG1RGgbA5fN63MNJdmQASF5xf37gOPjpTt1P/MKkRUMNL
27dKRzuDRo85HnXjM7DJELJzSyuwERTvDV4ts/jeWOv33XUV+arycRBB+MJcV2q8C6MgwKe+U7ti
NXdGR56n2jNbrJkepBTd8IOsSHZzeBInuHivtjTYwODkSIbgGDM1Tdzw8HWo7V4g051/JqoGaOgq
zWYnoYTwUC3RZDI3CK2BKEbkyMFuLCFqBRmUKTZkk1xN3DMzw0tYyM/0JLj0RzBhxZIQJlG+rx9t
DflLQQQ8OBZWBqUPPvGbwfzjKWAytQiThVe9EH8qz2EDSHnw6A8Dx0Gp1ij23RI+xO6x2jh1yNS9
i8c/RRZTs2DWwrIKlULxCpAMbY5VC7Chfk3XPTZ/Eo2NYOB2h439MkcU1ZiC0W3qSGNszIOCFx18
ssQtNApQ9ugsizargpUa3RsaFq2tLH74g8cPV35p2D8sXoheFutZV4aAZggAqy6GBBSMik7+mgl7
iEYJZFyhfGj0KiTEBusjBhyJkF0Oc7SFD/43n+GAdmEiZjVRPoSubKqhdHcR+ROEnwbE/s87NTPS
M8n39iklevcj3w8qg0E7ynzx3HiDwAowBj20F3YMMfQQWoo/F2JAak80gDHb22FBuTFPWffxooCo
EY4F9SgTAFzJrXLJyRnd/CpiwSDLgICAgU+DoR98hFlZZ3XUFHLJQgOrCHIICuJtHzTo08YDoSZ9
q1rrPNvszvoiOVhctv6FG08788CLVlg7UFhzavDCP7z10lHmgfn8f1xqYFOg3EHYQi5170oqHSWj
UxOgeicfQrCu84gQ87NYiV7bnTW8XH+aia5AeLY5FbMP4H91sVeNfgjHRlz+HzCTY3fu/3YEM1tA
4VlPFFdzr851aRRKaV9n/PTRHomfhEkwU/9AXOisoY2vVTnNYVmcDlGzYyPxqANVFxtJWTIGKdxJ
leg0+lCEhYaB8Zg5x84vyAmvSlbPsAndjhZ2RkotFVljKld1ZhvcUpHOiFfCo29IbWqnK7rs4ooE
SHTmhq27ol+2V7/QHPQt3LXimUMPVsZAAffXoPtUeFkJAggjAHYHJhSJj0zwLqCMbo/UgmtEcUSA
fix1IKNuFM7qKxxguej08FJxR2RIBYUoPSAcGt/YyM6t/hHrGIsODThl1JYZDwp8dbjTCb5gBwQM
g2QkPP0tIvYroscFhUv2rxDm6xdo5aRROccEKIWGB944D0Z9S+BjFCvwFzoBD5TYIdCw4Yg0cHTt
oInfaG/fyXROQ4B4RHUPRXB6ik4JOrjC9udICX5IBDtMHnL5BbcDbmqHhNeB++x8HUk0xwZ4SyaB
/ZJ+EH29zZUYcwZeWQisJLBBS20UO8VN80lbHbafMgRzKI1GGE0eVgEnTe5o61rlGKwWuieYNPQR
velhs+AOsh1xDQRQx2Rgg8ccBGiD+wOT4i4ICzgpvttnHwC7DeA9cBcKyiJIZr7fFntWOo2j9qPQ
BNRMuuprw8GAM6BCbQg+ZX0MN34W9DwWbeEPtgmJUVoCiAi26sRGgO0uUQwHsEUBZa6Mse2o//a/
CCwhW4ld+Dvef2YtxiutUCEaHQwhy8ZHbsB3/GMyo0n/N4u0ordSuFwcGQQDxrq5d0eziwceO9h0
I3ETK1Wu2w00cMsMMwNJK9bYbK3d/gmKGYgYQEF794tiK1sBO0emC2iLXw48dHWJI1x3BV4PjnS1
hO3DUpscVhoGHjMdKQs0yt38Vgg0hQPxIUKDwcIXW14HW0sIsJmNONJ9QtZLubtTPUSNXwFZgh6F
t6aL/8OzhVrPfhMOF9xCpUS3i5DubgVJLtSIG8J/7bgJfSPfWmffGRQwgLoYFkODfO3rDlutmnQU
MbXAyLkV/v987o1RAzvQfWU7z31hO8FhT1wG71obbLshSBJP4jvCfkOS4R38O8d+PyvBjP8HfDYt
OeYWG/0DzjvXfaMBkRX4tWIX8EJBgfoEcun2IQ086BAOgwAO1Vz4i/s7fRaMMV4ETD2Ux/O4EAB1
fA8XUM4CcgNsPyzgRIBPbvAPhJWmiQyTAOdq+BKGvkUrU1G//Q5vb4ZbiypyV1EqAvRQ6xZa+NBO
PcxzU3X4IgVNwHvxG74GH+NcvKwBjg5N0M1o4zfaKPTbgX34ALDdd/YFzLomUzBX8FOuAdeqqLj5
pg6I1YFJFl+EWVcmI7+UzFbNbTyYXHwermS2CM2zz8/+xugdNGuN5gIzAMIM8JBlkG1o+xxgnrME
38MEVyQE/7z7jVvhO/utZFvr7Edki09gMRbb2H52VYlNcDZsOnCEyl3lYNXghE1oB/H8L9xK+k5E
c8EUPohUBeA4HD66W7UAxkYhcug/DBz8D8MxuYNFcET/TWyCtiCb2XD8/GAJZMPWbkxz6wi1ge4J
81ATCF2tWNBYQv1FqGjALez7hBoEoh7wqIFyiV4vdVFp6qj+JlShApLohGpnoZmoAJNCcAk1i6iF
BQx/bwc9T5NZmpvifUGQyFejDTfg/jNIg34gKA+Cs1mUyf84Sx+01EYscD37EXAGwLtAoywPdMhA
CQJusLSL6GF972Xol6SD7y1EMS1qD+boCa34ROU0EUx96H1au71EBgAgAzcNgWO3G7hiKfuHRy3k
UIxqZy9oXL984Nc9bdf7DDFAAR5SxyR1oyvRI1tFJC6ZObLvMcgtPxwZrjnkSA4UlAwMydgLdH4V
BGg+20CO/C2eCcASC0kd2/5JHvQttxT8Nnjn8MzDU+PsLXAGzJwCSkST+JuiJh85RiB3NesLMozQ
4BTsnK11WHGhBPQbdQoYhsld607EwQ8CdQnYT3YEp190WFwCDFdsLtjFfgyaO/43QBI5YKZwjmRb
OTXMGN3BN4sdXETkOk31mt/TCbLk1sJUsyaapBk2o5NqlBV6EeUYJzkwLmhAtKT9s81BklaTkvwV
ijwR71B1IzURJMYTZruQdQMj1OsRyO7XCTAgqKw1vdA879xsG4QbCNEAdK4RmxlGlgnSnA9axdk3
yiZQvlRQK0z4sS8T9qUQdCBqSyjLrmEduEgiCFMI6YnYIHQGpye11PTQWGzpQ832Gbw4yEPxPeRb
ECkfCEkiNreFfP9QLtJHRR7yvGhALj14g6eDr2G+hEy7sFZF/eEZIAlTlBRntA7zwR4sPDRJvOaz
VGUo+P1hJWyQl1AX+P0KGQA2nONTpk1gF82WHeaiLdccskwM4ZEZagUOByqzgYOk01asKlDC4s/p
imABm1a+EQHY3hPUip0NE/11pHvJ6i7gJWkPZ6sQG8YOZ938KFZ0szIeKzD02Yw3GpgGImigH+VA
+yvETln+DxoFWny3qzzZ6N0ZUKFq/9tQABHyyw2iI1SkVZVoAIDQwpBL1gr6A/AiUn+QlBY+cAsL
CLkn99YBtf2XugHnx1PBTovY99uNPN+JL/SXuh+KGkgz3iPZwe8ENJ1wZBlrd90z90IUEu482yCy
5/7fJRJIrjrDQkRfssNbhMCP/P4WigIzxiPBIQSF8EJPdeoOhOILHvfQXl3+TN9v4QBuIPDPB3II
B9rEzQ3EB3be8NQHAXIHJ11hCeVFE/b2YynTkR/2ClXBTcTZ2kZwwMSXCyQFBa2jEn32ZokBDar8
DzhH35cG+mbR6RjBuxp26ZwEDQhqV1YAHXoaoRhIpD0D7PrUFlq7kOsdSnQxdfGAXtjQtfiGiXZ2
i1ZsYHh4A5d7vBneQnp1y2gJG8pRJ8ocoU+9fHNgv4BxHWisAVnooFbTydqaamv4rv1bxgf1LINs
rsAkAkAMnuX2qDomffTR/mxNVQrgsh6TuDlkOwgvai4LiBZLxBZk2AnE2VCuNGziSwMEbcJQRrwF
NU23mY7BvgOQwJIWuVbYL1dpRiX3u6H2dd2UCsQHlhfsvF3NbcvCCTDGApjxt6htrqHTZsoIBZwL
bYtBJfy/Dc4QbULXlaA60gOkN4PmiwVtrVCCeNRr7rm2pgKyFh48MAUoxAwVZA1UEMHRW+YeZrtb
MM/Cs58fO4eEhKw1EWuqUDEHASZp03CA2Blhpfid42QhG/jAPrLovILBVDEtMjz2bLgsHYgBAhKM
FKwIscJM0a7KmaK7bK1XRTXYBQYv3GdD293LAS4H3itYXeABK5xsz+IB7Gvk2JKo6BChNwTyP5YR
eU77xl46AP+UAxMFV0NqBlOy0SNmL7n26k7gwBzhZoRm6lCB+zhkc+7p+M/0aH5mBIBW5hFMBZ9o
N9vrGA1QPUcnLzwaaiS27qwyomrcCCvXVFWUcv902OtrPTMjcFeUhaIbtv1CbwPHvgbsDUYBlImd
DADTUGwg9N2d1gFfMFFFP/46N7OGhwjBaIIpQVL24GQQdBixsJzogBYTCWIRDH8nzCUUEAqRaHAy
CAlMUhJZhwSnKhhhKP1i16TCCGaCagjgZj8bSlqbWXTtScncIvZm5OSbk0QRsAkOwOUgi+Y3q3fr
u4ahh2z/2GJBkpjHjbuTBVsd/NVTsPR4cqtmK/9cEeFqeGAYHBTaBQItOICFvAygj1CmY1VXFPRG
aj9ECxsL0fJeoI13UA5Qe7LgUuG0a2hOdeVHF2qEn0VbsClThwiDhxUU6sMEVmLGZOgmxDeD+mJ9
RyqUPIpLwKyEtX4wrdXbyIEfHDvK0yNEZSuaQfV9De/JPjWIXIlYV1oDM/9c/5vs9ovyA/HWfhkX
GhWAwmGIFDv9zdWtR7B85zjxNAfGRgRANi4FjyOD4ANn/zQPE45yQRbIVsGJ5Ms+sti4CH1CcQUz
9r2yG3z6g8cDgH4dcpQzb//+DwJGO/d844CkHgsAX+tgNrAeRsW7CMO5qK/bwQgD8MTSsE0AdfI/
Q/7637ZvQ8BGsR4fyc078n0MigzFsDLS22KEcOv8xTsWt7sVgHa2xawLjYNbJUs3jIVfMvi55IFc
MgAz+Is0nwH8s6RWawTdvTWQgcO3B2hcNAhhrOIfwBg2BkAOZAUPBHK7ZEAEDNYoM4AcyFQMMJDn
Ibw7NiwzBNrbRxa0MnwWBFV9Fuhk99T9JWoB5Sx8EhV8DY6AM90TMPYtDAOZ2dxHV4ietBwFtVaP
/TYeQH17hh4BOCV1IY1ssyLXhrdQYTS2qUiEy7hQgG1subRg87X0/L8gVzwHI3qftoidEyv0/Ozd
rDT5TD9QiBhTOJEtwPBoiKPIRCsaO9s4GCnPHFfUJs8QNq0otezFLvQGcqQAZItBOzfgwfwSWGAg
Zs/Oc3MBhCdogH9oSogzIwxQ/MMgn4yN+A+EIhlgESEMt0O+vFVUTjwYPEcHrj+B/1sUwpmNtPIL
7PYriAAo4WJNgnzRsBo+cT0cCcXMEmIFA/W3j3QVfgz3An8HaHw0r1aufQLe6wUuDUNnhyVICUYH
SbiEdUSRLcrtXPi3szMDGytiIUp0D2h0NKzVN6GzZhw3Dn2H4hloDZ8OZIwfs4F2CBO8OCd4woxw
dAk9iLZbJxo6I4gwuBSH2GIHwF648GooA9DmhWghxdSoBQAAMnLb0IQ1IE3gCeQg6DTOZfPsyDR1
8PSMKUmKfmEMO9Z9acjBU8kEim7GgfZHml49yUU8IHI4PD3cAP9L/DwrdDA8eSw8f3QoPIB0JMOK
Wi8BIIgE+DCfutuTRgrGFQ1GBArxu4CgbgHbJB7/RgHOR8RWKlD37OdjCLF8SUsH9ef/M8lB+ib+
W7rKfQmLdMXYQGXxg3zF0AQJuE3cEdRTxgfozSAQRBC+kDVyv1A06LzzpYH9pIpMDbyN4kLxX4gK
inFwAQf/LdXqweEEP9DOF4hKAYpIlmVZugEYAg8CBl7Q7bfPGQKKQBXgP4pEBQxCA3Wmnif1GARX
WAIFyBY8ItPfKWi8Ohg16E9k1gSIrfVF8ewwBPA3ulCU8s5yIjvsV5zRgDTo6Dg5gCa3RTlkMcJG
+n8v4bMuioQFJ4hENfN1v41VJWobuhn0JGNiWAxdiFpvqTX4iJCR8IOocy+8XkxyDWEDDUNpBwoD
uvaFDf4EctmmMlfV2IWvDTeZCYV0Kk34bL8LaHMExkX7PQgC+j3XxK0BFHUfPAPepQyaVCo4orWk
mFq4QSYHFFFTFNimTcWFU7NA8bvAw7KRcBCX31AFe+EzxgkPUmoumDZKBNB0r2Z4Vy0LcFYa+shY
WS0kjUMEGdWVznYAqiBoGK5xIBLzxRscJxCyBpUWrVm12ci+UxtQMgx+2UJ22Q4wr2g8IBEYg71U
C6IYaAiaNZQd2bfAlBRo+DUz3BFSTcTI1NU5WV0htKBzANEnABJysNS4N3DIhVje/nNYN4PKHXb2
TlAXUIQcMsuNumA/dQPermJRTOTZjHhILES4NtkINDd2R8ZQT9gNsI2dCFKFi8N2TXMJimPGBRNm
aKT0QGrA/wwdSAQ60Y1Z7tc78x35BjGhpvcHD4y/b8gPqEgGuPsMjfi9U8MFEVzaROST7WYUDV2b
Cl7SjbWh7qgRZRJzi4Wi/fTxhsnB4AJGuTQFnyPQFrZYihMK10DYWYmHdGBAdB4YTYnvNztk2Qpy
ZfngJ0xPMhZ1bv0Bbzld+K0iywNq+OzDESVIYCZ1+K46hz8UDEZXOXUQuDXqBRF+cosRRCl9Qkdt
qckUjPlNJJhVD+rSiYPC1YC3WwHsDGnSDXD1c4s6Urzs/olV9Ahl6mHZfib5WH3Xl8wRWnQUigcW
RzwKdAruasHfhwPHO0UQfJelL4gcCLJU+xGfg8j/6/Y3/li/gYYowwk7F4A/MHQZbuSwiFcQBzAf
CpYIA1ClXsst/EKRwDvwV9ljDrNHlpFtCAhaDFEQD9+g+82OSIoGPA10DI4IEnQEPAkwW4H4dQNG
6+t0JiqIrUAko8glRu6a7hfhPjw6dDkuNTEqAgQXFH9biuwPOHUJOIQN/0DbddAuEAMESc6IENF3
xF3uQYH5tnK+6wFORWJsrCUSAF3MmCzPhcgPuAD/0yCLtV3MDw4kOCscL8PeDJDpODp1YR4wmeFE
/lsP6KBn7ki2QEbSygFG6VwHu87ST/UWwblhgr+BoV1t4gpCO9d86nXdx1YQZQIqQh0L4zfuKWrw
PgqojioJc+03iAiCDXUO6wsgCxzQ0hAbBwY1DYSCBA7IS52PbWsEF4ZOiucdBQQbbCttMAOGSQCO
kjUzwnLDYw11hPOrDJtgkgAYjRvHhRgwnXoFTQa2aDGiYGXjEQ5n4wbTUFFQZPyblhD9griLwcdo
K2GivtosFDcrGmn7ABDqD4hewoDDD/uIH3AHxVa+2jOK5bvfXhdqihGA+iDK+gl1E0H+pVJvBzl/
ErfcBIBBjURC0M0a8f8eMH3pgDktdRx5Tc+t4BBWs2fVf25JUaqztVZi3hAMctxVgGhEOEpIN7KL
rWioPRv79qAXckAhilo9NASGaj0QB35INIIuuG32QFNodZKPVPxqBhuZqT2EGdiDYOotAhcvOPVX
1I8P3Dzl+h7yvpg6+MYfMJhddWpU6IhWUymci34Qpr5ElYWYfepyjMQ9kHiNudzosSQ/CjQ4ib8Q
J8s2a87q/ldFQBh8QjLY7gc9KzZ+PDgo+TzfyjN0TyuPRCPkwC4UO/0DueSSEwgEpySPkPvXAMTn
mczBaPy+IQy1enyZkY+q3T1dzZLpN8D4igGL2Uo8FQcOUlPpQ4oDP2sDFwNDFeAbXzvLdC5QLnUR
as1qL4BIobREQKxxWwzDEivB/A/y7q3QXE7CE8vrrCgFaPQ3mTO8CKC3C5K1pUZ4fCOdfb/sJqhQ
LbkfiBPzEnRzR1PrBgkGRlNLQ8ModcamtTQD8iw04CLcWFwOAUm6/xBMIjA2AdhC/2wvV8EgEgJv
lw+pLNVvRREQDNz8LVApOiG1V1kjcvAgJVNLS0QNCSBvcLoThzuCsRn93lZMArnsSFAW1AmYHbej
UL0NKkhPjL0cAX1TPFRze+B0K2oZG2EKsoncCEPec4twVJQDa0PG2svVB2+T3ksATgx7jOn0dRi6
dXBBpuqd00rTAq4NAyTwJxg4JJaCfF9yAwFbDa+IDT5m7HMA6cH5A1Hq7PwYAQvk7PwAghWfhkhc
QFduViB20YTV6zXB480lI0/wdCTsDO4/iJcs7HQim8chph5dANA8A76n4gb6+AkPh63fJIVEcot8
sw2ccTtpcP4Uh+0OsnC2aNjH624N0Ic8hzxgyFLAhzyHPES4NqyHPIc8KKAamA4zhzwMkInWYybe
GzvrB4ClDTsGdEoGhNhVjQgNO8gCs7DGEGiyD1NwFHy+oPYaYmznPhl9EUcVbfk+0TTddkAUFIBk
KQM3RdM0TdNTYW99i5uR702Z/yVUEQUIEMzMXyAMxFE9cDkIchSB7Y/9vukLLQSFARdz7CvIi8QM
vS5V6ovhi1OcUMOSChlEkQCqVKkqDlmqikKDAzbNQVGoHAFDpaKXiJt0ZUZwt7ZR9E1hcHDAQRMN
bmQL9gxFiBUOA16oGnZycw93RW52UXUU3RBvbsdWt3eHdX1iGFcrb3dzRB1lY4L9dvZ0b3J5FUQi
dmVUeXAkdu9n/0dTaXplWkNsb3MKFFRpNfdu31FUb1N5amVtCy0cG9tuQfZBbAZjOlQY2pPvb3Ap
TmFtTFNQb0cl7JmokiE92tbtvg5DdXJypVRo52QRV4nGfrvN7QpMbxBMaWJyYaVsXjv23jVyY3AJ
j0hhmCRw29rBrUF0HSp1OnNBsluwgTI3CG5BnUAI2G1QG2hBiQpbnrXYZB8eTGFFnHu6w1oZUU1f
eG+HNlk7WF1EZQZqU4tAaP9WR01vZHUVFBjChNh3S1W7XXZIGkFzGFMIZXAG2JZLeEV4aSVhRphT
7TD35g4cT2JqwKRQsN+wJbRjeQYy/WmCzQrbY2u7dWxMKbVQ1c0aaVpNSWaA2kX5bWHlFwPj/Y5w
Vmlld09miwBiCSu0TDjzuREKUG/MDWFkZUPYv9lb2yZN9khCeXQibkFkbsIS3mRychbHrW5Za7RI
pTgcKyfDmDF7ExlgBLysMIRuqs0JaUF3j7NhjUZJcTVrZWQTdmoLpWMSCxVJ0plhkm5SIuRVMzbB
sLD11EKTJksdhRSceaK12rHH+DZnjEtleQxPcE3dOvfoC0UkDjpWjXVlYQcAhg8kEQkzdymmdW0w
DK+t2WyzP2TCCAFto+60NcxzZaJqd0MQ89jfDAMHaXNkaWdpGXVwcHPNzbYReBIJZlsIOM1W+HNw
YUtPzSxYwP57m1UvQnVmZkEPC2fajjxMb3d3djlytiNRmG3YdwpH2CzLsj3UEwIKBG+XsizLsgs0
FxIQ1bIsywMPCRRzH8g/FkJQRQAATAEC4AAPdctJ/gELAQcAAHxRQBADkGGzbvYNSgsbBB4H62ZL
tjOgBigQB/ISeAMGq9iDgUAuz3iQ8AHXNZB1ZIRPLjV0K3bZssl76wAg1Qu2UeDgLsHHAJv7u3dh
3yN+J0ACG9SFAKBQfQ3T5QAAAAAAAACQ/wAAAAAAAAAAAAAAAABgvgBwSgCNvgCg//9Xg83/6xCQ
kJCQkJCKBkaIB0cB23UHix6D7vwR23LtuAEAAAAB23UHix6D7vwR2xHAAdtz73UJix6D7vwR23Pk
McmD6ANyDcHgCIoGRoPw/3R0icUB23UHix6D7vwR2xHJAdt1B4seg+78EdsRyXUgQQHbdQeLHoPu
/BHbEckB23PvdQmLHoPu/BHbc+SDwQKB/QDz//+D0QGNFC+D/fx2D4oCQogHR0l19+lj////kIsC
g8IEiQeDxwSD6QR38QHP6Uz///9eife5DQEAAIoHRyzoPAF394A/AXXyiweKXwRmwegIwcAQhsQp
+IDr6AHwiQeDxwWJ2OLZjb4AkAAAiwcJwHRFi18EjYQw6LEAAAHzUIPHCP+WYLIAAJWKB0cIwHTc
ifl5Bw+3B0dQR7lXSPKuVf+WZLIAAAnAdAeJA4PDBOvY/5ZosgAAYemUgP//AAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAgADAAAAIAAAgA4AAABgAACAAAAAAAAAAAAAAAAAAAABAAEAAAA4AACAAAAA
AAAAAAAAAAAAAAABAAkEAABQAAAAqMAAACgBAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAEAAACgAACA
eAAAgAAAAAAAAAAAAAAAAAAAAQAJBAAAkAAAANTBAAAUAAAAAAAAAAAAAAABADAAsJAAACgAAAAQ
AAAAIAAAAAEABAAAAAAAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAAACAAAAAgIAAgAAAAIAA
gACAgAAAgICAAMDAwAAAAP8AAP8AAAD//wD/AAAA/wD/AP//AAD///8AAACIiIgAAAAACId3d3iA
AAB4//+Ih3AAAHj3j///eAAAeP////94AAB493d4/3gAAHj/////eAAAePd3eP94AAB4/////3gA
AHj3d4//eAAAeP////94AAB4/////3gAAHh/f39/eAAAh3OHh4eAAAAHszt7d4AAAAAAAACAAADw
PwAA4AcAAMAHAADAAwAAwAMAAMADAADAAwAAwAMAAMADAADAAwAAwAMAAMADAADAAwAAwAcAAOAH
AAD/3wAA2JEAAAAAAQABABAQEAABAAQAKAEAAAEAAAAAAAAAAAAAAAAAkMIAAGDCAAAAAAAAAAAA
AAAAAACdwgAAcMIAAAAAAAAAAAAAAAAAAKrCAAB4wgAAAAAAAAAAAAAAAAAAtcIAAIDCAAAAAAAA
AAAAAAAAAADAwgAAiMIAAAAAAAAAAAAAAAAAAAAAAAAAAAAAysIAANjCAADowgAAAAAAAPbCAAAA
AAAABMMAAAAAAAAMwwAAAAAAAHMAAIAAAAAAS0VSTkVMMzIuRExMAEFEVkFQSTMyLmRsbABNU1ZD
UlQuZGxsAFVTRVIzMi5kbGwAV1MyXzMyLmRsbAAATG9hZExpYnJhcnlBAABHZXRQcm9jQWRkcmVz
cwAARXhpdFByb2Nlc3MAAABSZWdDbG9zZUtleQAAAG1lbXNldAAAd3NwcmludGZBAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAABQSwEC
FAAKAAAAAAD3PnIwyicfngBYAAAAWAAACgAAAAAAAAAAACAAAAAAAAAAcmVhZG1lLnBpZlBLBQYA
AAAAAQABADgAAAAoWAAAAAA=

------=_NextPart_000_0010_A64A0A75.D4830D09--


From sdavis2 at mail.nih.gov  Thu Mar 18 07:30:38 2004
From: sdavis2 at mail.nih.gov (Sean Davis)
Date: Thu Mar 18 07:28:18 2004
Subject: [Bioperl-l] GFF synopsis code question
Message-ID: <BC7F009E.5CB4%sdavis2@mail.nih.gov>

I have been learning to use Bio::DB::GFF and had a question about the
synopsis code from Bio::DB::GFF.  The section of code is:

-------From Synopsis-------
  # pull out all transcript features
  my @transcripts = $segment->features('transcript');

  # for each transcript, total the length of the introns
  my %totals;
  for my $t (@transcripts) {
    my @introns = $t->Intron;
    $totals{$t->name} += $_->length foreach @introns;
  }

  # Sort the exons of the first transcript by position
  my @exons = sort {$a->start <=> $b->start} $transcripts[0]->Exon;

  # Get a region 1000 bp upstream of first exon
  my $upstream = $exons[0]->segment(-1000,0);

  # get its DNA
  my $dna = $upstream->dna;
-------------------------------

It seems here that $exons[0] is a feature, so does not have a segment
method?  Instead, one can get the segment of interest by:

my $upstream = $db->segment($exons[0],-1000=>0)

Am I mistaken?  I only bring it up as a documentation issue, as the code I
give works for me.

Sean 

From lstein at cshl.edu  Thu Mar 18 09:05:25 2004
From: lstein at cshl.edu (Lincoln Stein)
Date: Thu Mar 18 10:56:56 2004
Subject: [Bioperl-l] GFF synopsis code question
In-Reply-To: <BC7F009E.5CB4%sdavis2@mail.nih.gov>
References: <BC7F009E.5CB4%sdavis2@mail.nih.gov>
Message-ID: <200403180905.25264.lstein@cshl.edu>

Thanks for catching that.  This is a typo in the synopsis.  It should 
read:

	my $upstream = $exons[0]->subseq(-1000,0);

Lincoln

On Thursday 18 March 2004 07:30 am, Sean Davis wrote:
> I have been learning to use Bio::DB::GFF and had a question about
> the synopsis code from Bio::DB::GFF.  The section of code is:
>
> -------From Synopsis-------
>   # pull out all transcript features
>   my @transcripts = $segment->features('transcript');
>
>   # for each transcript, total the length of the introns
>   my %totals;
>   for my $t (@transcripts) {
>     my @introns = $t->Intron;
>     $totals{$t->name} += $_->length foreach @introns;
>   }
>
>   # Sort the exons of the first transcript by position
>   my @exons = sort {$a->start <=> $b->start} $transcripts[0]->Exon;
>
>   # Get a region 1000 bp upstream of first exon
>   my $upstream = $exons[0]->segment(-1000,0);
>
>   # get its DNA
>   my $dna = $upstream->dna;
> -------------------------------
>
> It seems here that $exons[0] is a feature, so does not have a
> segment method?  Instead, one can get the segment of interest by:
>
> my $upstream = $db->segment($exons[0],-1000=>0)
>
> Am I mistaken?  I only bring it up as a documentation issue, as the
> code I give works for me.
>
> Sean
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
Lincoln D. Stein
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: signature
Url : http://portal.open-bio.org/pipermail/bioperl-l/attachments/20040318/6f86592c/attachment.bin
From MRBATESALANN at netscape.net  Thu Mar 18 11:46:39 2004
From: MRBATESALANN at netscape.net (MRBATESALANN@netscape.net)
Date: Thu Mar 18 11:44:36 2004
Subject: [Bioperl-l] REPLY SOON
Message-ID: <SERVERA1TO54rj0hfl600007adc@servera1.danis.com.cn>

Dear Friend,

As you read this, I don't want you to feel sorry for me, because, I believe everyone will die someday. 
My name is BATES ALAN a merchant in Dubai, in the U.A.E.I have been diagnosed with Esophageal cancer.
It has defiled all forms of medical treatment, and right now I have only about a few months to live, according to medical experts. 
I have not particularly lived my life so well, as I never really cared for anyone(not even myself)but my 
business. Though I am very rich, I was never generous, I was always hostile to people and only 
focused on my business as that was the only thing I cared for. But now I regret all this as I now know 
that there is more to life than just wanting to have or make all the money in the world. 
I believe when God gives me a second chance to come to this world I would live my life a different way 
from how I have lived it. Now that God has called me, I have willed and given most of my property 
and assets to my immediate and extended family members as well as a few close friends. 
I want God to be merciful to me and accept my soul so, I have decided to give alms to charity 
organizations, as I want this to be one of the last good deeds I do on earth. So far, I have distributed 
money to some charity organizations in the U.A.E, Algeria and Malaysia. Now that my health has 
deteriorated so badly, I cannot do this myself anymore. I once asked members of my family to close one 
of my accounts and distribute the money which I have there to charity organization in Bulgaria and 
Pakistan, they refused and kept the money to themselves. Hence, I do not trust them anymore, as 
they seem not to be contended with what I have left 
for them. The last of my money which no one knows of is the huge cash deposit of eighteen million dollars 
$18,000,000,00 that I have with a finance/Security Company abroad. I will want you to help me collect this deposit and dispatched it to charity organizations.
I have set aside 10% for you and for your time.

God be with you. 

BATES ALAN

From dobbo at thevillas.eclipse.co.uk  Thu Mar 18 12:43:33 2004
From: dobbo at thevillas.eclipse.co.uk (rich)
Date: Thu Mar 18 12:48:55 2004
Subject: [Bioperl-l] Bio::PopGen
Message-ID: <1079631813.4059dfc57a3ff@webmail.eclipse.net.uk>


Hi,

i've been using a number of modules under Bio::PopGen.

I have extended and added to suit my needs. I was
wondering whether the modules/changes might be worth
adding to the distro?

Basically I have,

1) created two new Bio::PopGen::IO modules (based on
csv.pm):

Bio::PopGen::IO::ped.pm - takes ped format data (incl
family, member, parentage, gender, affeted status).

This module also has the option to look at family
structure and create missing parents.This is required
for using TRANSMIT

Bio::PopGen::IO::phase.pm - deals with PHASE format data

(I am also creating a Bio::PopGen::IO::hapmap.pm mod
that deals with hapmap data)

2) I have changed Bio::PopGen::IO::csv.pm (and included
within Bio::PopGen::IO::ped.pm) the ability to accept
data where the field delim is the same as the allele delim
(doesn't affect Bio::PopGen::IO::phase.pm)

3)In order to deal with the ped format I extended
Bio::PopGen::Individual.pm so that an individual object
 will take ped data (fam id, member id,mother, father,
sex, affected status)


cheers
Rich
From vamsi at warta.bio.psu.edu  Thu Mar 18 12:54:35 2004
From: vamsi at warta.bio.psu.edu (Vamsi)
Date: Thu Mar 18 12:59:56 2004
Subject: [Bioperl-l] Bio::TreeIO::svggraph problems
Message-ID: <200403181754.i2IHsZV05138@warta.bio.psu.edu>

Hi,

I am trying to use the Bio::TreeIO::svggraph module to
display a newick format tree. Here is the tree:

(a,b);

and the code:

-------------------------------------------
use Bio::TreeIO;

my $in = new Bio::TreeIO(-file => $ARGV[0], -format => 'newick');
my $out = new Bio::TreeIO(-file => ">$ARGV[1]", -format => 'svggraph');

while( my $tree = $in->next_tree ) {
    my $svg_xml = $out->write_tree($tree);
}
-------------------------------------------

(which is from the documentation for Bio::TreeIO::svggraph)

I have all the required SVG modules installed. But, I get the error

Can't locate object method "new" via package "SVG::Graph::Glyph::tree" at 
/usr/perl5/site_perl/5.005/SVG/Graph/Frame.pm line 117, <GEN0> chunk 1.

Any suggestions?

Thanks,
Vamsi


From vamsi at warta.bio.psu.edu  Thu Mar 18 13:05:54 2004
From: vamsi at warta.bio.psu.edu (Vamsi)
Date: Thu Mar 18 13:11:16 2004
Subject: [Bioperl-l] Bio::TreeIO::svggraph problems
Message-ID: <200403181805.i2II5sV05214@warta.bio.psu.edu>

Hi,

I am trying to use the Bio::TreeIO::svggraph module to
display a newick format tree. Here is the tree:

(a,b);

and the code:

-------------------------------------------
use Bio::TreeIO;

my $in = new Bio::TreeIO(-file => $ARGV[0], -format => 'newick');
my $out = new Bio::TreeIO(-file => ">$ARGV[1]", -format => 'svggraph');

while( my $tree = $in->next_tree ) {
    my $svg_xml = $out->write_tree($tree);
}
-------------------------------------------

(which is from the documentation for Bio::TreeIO::svggraph)

I have all the required SVG modules installed. But, I get the error

Can't locate object method "new" via package "SVG::Graph::Glyph::tree" at 
/usr/perl5/site_perl/5.005/SVG/Graph/Frame.pm line 117, <GEN0> chunk 1.

Any suggestions?

Thanks,
Vamsi

From paulo.david at netvisao.pt  Thu Mar 18 13:40:37 2004
From: paulo.david at netvisao.pt (Paulo Almeida)
Date: Thu Mar 18 13:43:07 2004
Subject: [Bioperl-l] Bio::TreeIO::svggraph problems
In-Reply-To: <200403181754.i2IHsZV05138@warta.bio.psu.edu>
References: <200403181754.i2IHsZV05138@warta.bio.psu.edu>
Message-ID: <4059ED25.6030503@netvisao.pt>

Your code works for me, cut and pasted frome the e-mail, with the same 
tree. This is a long shot, but try enclosing '-file' and '-format' with 
single quotes, when you create $in and $out. I have perl 5.6.1, but I 
think earlier versions may have a problem with that.

-Paulo

Vamsi wrote:

>I am trying to use the Bio::TreeIO::svggraph module to
>display a newick format tree. Here is the tree:
>
>(a,b);
>
>and the code:
>
>-------------------------------------------
>use Bio::TreeIO;
>
>my $in = new Bio::TreeIO(-file => $ARGV[0], -format => 'newick');
>my $out = new Bio::TreeIO(-file => ">$ARGV[1]", -format => 'svggraph');
>
>while( my $tree = $in->next_tree ) {
>    my $svg_xml = $out->write_tree($tree);
>}
>-------------------------------------------
>
>(which is from the documentation for Bio::TreeIO::svggraph)
>
>I have all the required SVG modules installed. But, I get the error
>
>Can't locate object method "new" via package "SVG::Graph::Glyph::tree" at 
>/usr/perl5/site_perl/5.005/SVG/Graph/Frame.pm line 117, <GEN0> chunk 1.
>

From vamsi at warta.bio.psu.edu  Thu Mar 18 14:03:24 2004
From: vamsi at warta.bio.psu.edu (Vamsi)
Date: Thu Mar 18 14:09:07 2004
Subject: [Bioperl-l] Bio::TreeIO::svggraph problems
In-Reply-To: <4059ED25.6030503@netvisao.pt> from "Paulo Almeida" at Mar 18,
	2004 06:40:37 PM
Message-ID: <200403181903.i2IJ3PO05521@warta.bio.psu.edu>

No. That does not seem to be the problem since I can write out in
other formats like 'tabtree' etc. The problem is with the SVG module
but I have no idea where ...

Thanks,
Vamsi

> 
> Your code works for me, cut and pasted frome the e-mail, with the same 
> tree. This is a long shot, but try enclosing '-file' and '-format' with 
> single quotes, when you create $in and $out. I have perl 5.6.1, but I 
> think earlier versions may have a problem with that.
> 
> -Paulo
> 
> Vamsi wrote:
> 
> >I am trying to use the Bio::TreeIO::svggraph module to
> >display a newick format tree. Here is the tree:
> >
> >(a,b);
> >
> >and the code:
> >
> >-------------------------------------------
> >use Bio::TreeIO;
> >
> >my $in = new Bio::TreeIO(-file => $ARGV[0], -format => 'newick');
> >my $out = new Bio::TreeIO(-file => ">$ARGV[1]", -format => 'svggraph');
> >
> >while( my $tree = $in->next_tree ) {
> >    my $svg_xml = $out->write_tree($tree);
> >}
> >-------------------------------------------
> >
> >(which is from the documentation for Bio::TreeIO::svggraph)
> >
> >I have all the required SVG modules installed. But, I get the error
> >
> >Can't locate object method "new" via package "SVG::Graph::Glyph::tree" at 
> >/usr/perl5/site_perl/5.005/SVG/Graph/Frame.pm line 117, <GEN0> chunk 1.
> >
> 
> 
> 

From jason at cgt.duhs.duke.edu  Thu Mar 18 14:13:12 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Thu Mar 18 14:18:39 2004
Subject: [Bioperl-l] Bio::PopGen
In-Reply-To: <1079631813.4059dfc57a3ff@webmail.eclipse.net.uk>
References: <1079631813.4059dfc57a3ff@webmail.eclipse.net.uk>
Message-ID: <Pine.LNX.4.50.0403181411550.16357-100000@tenero.duhs.duke.edu>

Rich -
Looks great.

The ped info is incorporated into the bioperl-pedigree cvs module where I
have father/mother/sibling information as well as a ped parser.  Since the
types of analyses are different for related individuals than for
unrelated populations.

If you send me your changes I can see about incorporating them - but you
may want to look at the bioperl-pedigree dist first as well.

-jason


On Thu, 18 Mar 2004, rich wrote:

>
> Hi,
>
> i've been using a number of modules under Bio::PopGen.
>
> I have extended and added to suit my needs. I was
> wondering whether the modules/changes might be worth
> adding to the distro?
>
> Basically I have,
>
> 1) created two new Bio::PopGen::IO modules (based on
> csv.pm):
>
> Bio::PopGen::IO::ped.pm - takes ped format data (incl
> family, member, parentage, gender, affeted status).
>
> This module also has the option to look at family
> structure and create missing parents.This is required
> for using TRANSMIT
>
> Bio::PopGen::IO::phase.pm - deals with PHASE format data
>
> (I am also creating a Bio::PopGen::IO::hapmap.pm mod
> that deals with hapmap data)
>
> 2) I have changed Bio::PopGen::IO::csv.pm (and included
> within Bio::PopGen::IO::ped.pm) the ability to accept
> data where the field delim is the same as the allele delim
> (doesn't affect Bio::PopGen::IO::phase.pm)
>
> 3)In order to deal with the ped format I extended
> Bio::PopGen::Individual.pm so that an individual object
>  will take ped data (fam id, member id,mother, father,
> sex, affected status)
>
>
> cheers
> Rich
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From steve_chervitz at affymetrix.com  Thu Mar 18 15:58:40 2004
From: steve_chervitz at affymetrix.com (Steve Chervitz)
Date: Thu Mar 18 16:03:34 2004
Subject: [Bioperl-l] SearchIO::blast frac_aligned/identical/conserved bug fix
Message-ID: <08C68436-791F-11D8-B56C-000A95765236@affymetrix.com>

Just wanted to let folks know that I recently fixed a bug regarding the 
reporting of fraction aligned/identical/conserved for BLASTX, TBLASTX, 
and TBLASTN reports. For details, see 
http://bugzilla.bioperl.org/show_bug.cgi?id=1597 .

This bug occurs when parsing these reports using SearchIO with 
-format=>'blast'. It is present in the 1.4 release (as well as 1.2.x 
and 1.3.x and possibly earlier). I fixed this in CVS on both branch-1-4 
and bioperl-live.

Steve
--
Steve Chervitz
AFFYMETRIX, INC. | 6550 Vallejo St. Ste 100 | Emeryville, CA 94608
Tel: 510-428-8530 | Fax: 510-428-8585 | Steve_Chervitz at affymetrix.com

From bdirks at uclink.berkeley.edu  Thu Mar 18 19:25:43 2004
From: bdirks at uclink.berkeley.edu (Bill Dirks)
Date: Thu Mar 18 19:31:04 2004
Subject: [Bioperl-l] Bio::Tools::Genewise
Message-ID: <web-1402491@calmail-ma.berkeley.edu>

Hi, 
I believe there is an error in the synopsis section of Bioperl::Tools::Genewise The line that 
reads: 
my $gw = Bio::Tools::Genewise(-file=>"genewise.out"); 
It should be, I believe: my $gw = Bio::Tools::Genewise->new(-file=>"genewise.out"); 
I, however, still can't the synopsis to work. I put the output of the genewise example: genewise 
road.pep human.genomic > genewise.out and tried the synopsis sample script. However, 
$gw->next_prediction does not return anything. Does anyone have any ideas why? 
Thanks,
Bill
From shawnh at stanford.edu  Thu Mar 18 19:41:31 2004
From: shawnh at stanford.edu (Shawn Hoon)
Date: Thu Mar 18 19:46:52 2004
Subject: [Bioperl-l] Bio::Tools::Genewise
In-Reply-To: <web-1402491@calmail-ma.berkeley.edu>
References: <web-1402491@calmail-ma.berkeley.edu>
Message-ID: <2AA8428F-793E-11D8-B0C2-000A95783436@stanford.edu>

You would need to add the -genesf option when running genewise.

Also, you can check out the wrapper for genewise in 
Bio::Tools::Run::Genewise in the bioperl-run package.

cheers,

shawn


On Mar 18, 2004, at 4:25 PM, Bill Dirks wrote:

> Hi, I believe there is an error in the synopsis section of 
> Bioperl::Tools::Genewise The line that reads: my $gw = 
> Bio::Tools::Genewise(-file=>"genewise.out"); It should be, I believe: 
> my $gw = Bio::Tools::Genewise->new(-file=>"genewise.out"); I, however, 
> still can't the synopsis to work. I put the output of the genewise 
> example: genewise road.pep human.genomic > genewise.out and tried the 
> synopsis sample script. However, $gw->next_prediction does not return 
> anything. Does anyone have any ideas why? Thanks,
> Bill
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From birney at ebi.ac.uk  Fri Mar 19 04:25:44 2004
From: birney at ebi.ac.uk (Ewan Birney)
Date: Fri Mar 19 04:31:23 2004
Subject: [Bioperl-l] Bio::Tools::Genewise
In-Reply-To: <2AA8428F-793E-11D8-B0C2-000A95783436@stanford.edu>
Message-ID: <Pine.OSX.4.44.0403190922551.8773-100000@ewan-birneys-computer.local>


On Thu, 18 Mar 2004, Shawn Hoon wrote:

> You would need to add the -genesf option when running genewise.
>
> Also, you can check out the wrapper for genewise in
> Bio::Tools::Run::Genewise in the bioperl-run package.

I think -genesf is only available in the (now rather venerable) release
candidate Genewise at

http://www.sanger.ac.uk/Users/birney/wise2.2.3-rc7.tar.gz


The reason for not releasing this is that actually I need to back out the
changes made for the case of BLOSUM62.bla (it is in fact better to keep as
backwardly compatible as possible) which all comes down to the case
insanity that is Apple's HFS+ file system not playing nicely with cvs.

(yup. That was too much detail right?)


I will try to push the Wise2 package into some sort of sanity over this
month wrt to these things and then make a proper release. For the moment,
use the release candidate above...


From gabriele.bucci at ieo-research.it  Fri Mar 19 10:07:48 2004
From: gabriele.bucci at ieo-research.it (gabriele bucci)
Date: Fri Mar 19 09:44:11 2004
Subject: [Bioperl-l] retrieving genomic sequence
Message-ID: <200403191607.49531.gabriele.bucci@ieo-research.it>

Hi everibody.
Does anyone know if I could use BioPerl for retrieving DNA string from  
multiple specified genomic region?
I mean, eg: from chromosome 1 starting 1598675 to 1689000, from chr 2 100.... 
to 101... ecc.
I'm pretty sure it's possible but as a beginners as I'm I can't find the way 
trough the various perldocs....
Thanks to everybody in advance.

Gabriele

From kviel at emory.edu  Fri Mar 19 09:48:43 2004
From: kviel at emory.edu (Kevin Roland Viel)
Date: Fri Mar 19 09:54:02 2004
Subject: [Bioperl-l] ABI.pm and .ab1 files
In-Reply-To: <200403191607.49531.gabriele.bucci@ieo-research.it>
References: <200403191607.49531.gabriele.bucci@ieo-research.it>
Message-ID: <Pine.GSO.4.58.0403190943160.17444@leukothea>

Is it possible to use the ABI.pm module to abstract the trace values from
*.ab1 files?  My inclination is that is not possible.  Admittedly, I am
quite weak in perl.  Examining the module, I see only that it uses the
binmode() function.  I do not have an example of an *.abi file, but I
cannot see where ABI.pm treats the file as anything different from a text
file.  I understand that the .ab1 file stores its data in Big Endian
format.  It could be that IO::File incorporates this, however.

Thanks in advance,

Kevin


Kevin Viel
Department of Epidemiology
Rollins School of Public Health
Emory University
Atlanta, GA 30322

From skirov at utk.edu  Fri Mar 19 10:40:15 2004
From: skirov at utk.edu (Stefan Kirov)
Date: Fri Mar 19 10:45:55 2004
Subject: [Bioperl-l] retrieving genomic sequence
In-Reply-To: <200403191607.49531.gabriele.bucci@ieo-research.it>
References: <200403191607.49531.gabriele.bucci@ieo-research.it>
Message-ID: <405B145F.1050505@utk.edu>

You may want to try ensembl API:
let's say you have a file like this:
mychr.dat:
1,1598675,1689000
2,100,101
etc.

open (CHR, "<mychr.dat") ||die "File err\n";
while (my $line=<CHR>) {
    chomp $line;
    my ($chr, $start,$end)=split(/,/,$line);
    my $slice = $slice_adaptor->fetch_by_chr_start_end($chr,$start,$end);
    ................do something with it....
}

To see how to work with ensembl go to http://www.ensembl.org/Docs/ and 
get the tutorial. It is based on Bioperl to some extent, and what you 
want is documented pretty well I believe.
Also see http://www.ensembl.org/Docs/Pdoc/ensembl/.
Hope this helps... amd have fun.
Stefan

gabriele bucci wrote:

>Hi everibody.
>Does anyone know if I could use BioPerl for retrieving DNA string from  
>multiple specified genomic region?
>I mean, eg: from chromosome 1 starting 1598675 to 1689000, from chr 2 100.... 
>to 101... ecc.
>I'm pretty sure it's possible but as a beginners as I'm I can't find the way 
>trough the various perldocs....
>Thanks to everybody in advance.
>
>Gabriele
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>  
>

-- 
Stefan Kirov, Ph.D.
University of Tennessee/Oak Ridge National Laboratory
1060 Commerce Park, Oak Ridge
TN 37830-8026
USA
tel +865 576 5120
fax +865 241 1965
e-mail: skirov@utk.edu
sao@ornl.gov

From Anthony.Underwood at hpa.org.uk  Fri Mar 19 10:37:27 2004
From: Anthony.Underwood at hpa.org.uk (SRMD, Col - Underwood, Anthony)
Date: Fri Mar 19 10:49:41 2004
Subject: [Bioperl-l] ABI.pm and .ab1 files
Message-ID: <427DA33A6101484BA7A72934AF90DF73051AA853@exchange1.phls.org.uk>

Hi Kevin

As far as I am aware the ABI module doesn't allow this. However if you run
an abi file through phred (available free to
academic/non-profit-http://www.phrap.org/) with the -c option (e.g phred -c
trace.abi) this will write out an scf file which can have the trace values
extracted using the BioPerl SCF IO module. I hope that this helps.


Anthony

Dr Anthony Underwood
Bioinformatics Group
Genomics, Proteomic and Bioinformatics Unit
Central Public Health Laboratory
Health Protection Agency
61 Colindale Avenue
London
NW9 5HT
t:    0208 2004400 ext. 3618
f:    0208 3583138
e: anthony.underwood@hpa.org.uk


> -----Original Message-----
> From: Kevin Roland Viel [mailto:kviel@emory.edu]
> Sent: 19 March 2004 14:49
> To: bioperl-l@bioperl.org
> Subject: [Bioperl-l] ABI.pm and .ab1 files
> 
> 
> Is it possible to use the ABI.pm module to abstract the trace 
> values from
> *.ab1 files?  My inclination is that is not possible.  
> Admittedly, I am
> quite weak in perl.  Examining the module, I see only that it uses the
> binmode() function.  I do not have an example of an *.abi file, but I
> cannot see where ABI.pm treats the file as anything different 
> from a text
> file.  I understand that the .ab1 file stores its data in Big Endian
> format.  It could be that IO::File incorporates this, however.
> 
> Thanks in advance,
> 
> Kevin
> 
> 
> Kevin Viel
> Department of Epidemiology
> Rollins School of Public Health
> Emory University
> Atlanta, GA 30322
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
From jason at cgt.duhs.duke.edu  Fri Mar 19 12:14:02 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Mar 19 12:19:31 2004
Subject: [Bioperl-l] Bio::Tools::Genewise
In-Reply-To: <Pine.OSX.4.44.0403190922551.8773-100000@ewan-birneys-computer.local>
References: <Pine.OSX.4.44.0403190922551.8773-100000@ewan-birneys-computer.local>
Message-ID: <Pine.LNX.4.50.0403191210560.23510-100000@tenero.duhs.duke.edu>

I think it is 2.2.0.  Note the ! lines.

For those of you on linux biolinux.org provides a nice rpm install of wise
2.2.0

-j
[jason@lugano jason]$ genewise -help
genewise ($Name: wise2-2-0 $)
genewise <protein-file> <dna-file> in fasta format
Dna sequence options
  -u               start position in dna
  -v               end position in dna
  -trev            Compare on the reverse strand
  -tfor [default]  Compare on the forward strand
  -both            Both strands
  -tabs            Report positions as absolute to truncated/reverse
sequence
  -fembl            File is an EMBL file native format
Protein comparison options
  -s               start position in protein
  -t               end   position in protein
  -[g]ap    [ 12]  gap penalty
  -[e]xt    [  2]  extension penalty
  -[m]atrix [blosum62.bla]  Comparison matrix
HMM options
  -hmmer           Protein file is HMMer file (version 2 compatible)
  -hname           Use this as the name of the HMM, not filename
Gene Model options
  -init   [default]  [default/global/local/wing/endbias] startend policy
for the HMM/protein
  -codon  [codon.table]  Codon file
  -gene   [human.gf]  Gene parameter file
  -subs   [1e-05] Substitution error rate
  -indel  [1e-05] Insertion/deletion error rate
  -cfreq  [model/flat] Using codon bias or not?     [default flat]
  -splice [model/flat] Using splice model or GT/AG? [default model]
  -intron [model/tied] Use tied model for introns   [default tied]
  -null   [syn/flat]   Random Model as synchronous or flat [default syn]
  -pg     <file>       Potential Gene file (heurestic for speeding
alignments)
  -alln   [1.0]   Probability of matching a NNN codon
  -insert [model/flat] Use protein insert model     [default flat]
Algorithm options
  -alg    [623/623L/2193/2193L]  Algorithm used [default 623/623L]
Output options [default -pretty -para]
  -pretty          show pretty ascii output
  -pseudo          Mark genes with frameshifts as pseudogenes
  -genes           show gene structure
!
!  -genesf          show gene structure with supporting evidence
!
   -embl            show EMBL feature format with CDS key
  -diana           show EMBL feature format with misc_feature key (for
diana)
  -para            show parameters
  -sum             show summary output
  -cdna            show cDNA
  -trans           show protein translation, breaking at frameshifts
  -pep             show protein translation, splicing frameshifts
  -ace             ace file gene structure
  -gff             Gene Feature Format file
  -gener           raw gene structure
  -alb             show logical AlnBlock alignment
  -pal             show raw matrix alignment
  -block  [50]     Length of main block in pretty output
  -divide [//]     divide string for multiple outputs
New gene model statistics
  -splice_max_collar      [5.0]  maximum Bits value for a splice site
  -splice_min_collar      [-5.0] minimum Bits value for a splice site
  -splice_score_offset    [4.5]  score offset for splice sites
  -genestats              [gene.stat]
Dynamic programming matrix implementation
  -dymem      memory style [default/linear/explicit]
  -kbyte      memory amount to use [4000]
  -dydebug    drop into dynamite dp matrix debugger
Standard options
  -help      help
  -version   show version and compile info
  -silent    No messages on stderr
  -quiet     No report on stderr
  -erroroffstd No warning messages to stderr
  -errorlog [file] Log warning messages to file

See WWW help at http://www.sanger.ac.uk/Software/Wise2/


On Fri, 19 Mar 2004, Ewan Birney wrote:

>
>
> On Thu, 18 Mar 2004, Shawn Hoon wrote:
>
> > You would need to add the -genesf option when running genewise.
> >
> > Also, you can check out the wrapper for genewise in
> > Bio::Tools::Run::Genewise in the bioperl-run package.
>
> I think -genesf is only available in the (now rather venerable) release
> candidate Genewise at
>
> http://www.sanger.ac.uk/Users/birney/wise2.2.3-rc7.tar.gz
>
>
> The reason for not releasing this is that actually I need to back out the
> changes made for the case of BLOSUM62.bla (it is in fact better to keep as
> backwardly compatible as possible) which all comes down to the case
> insanity that is Apple's HFS+ file system not playing nicely with cvs.
>
> (yup. That was too much detail right?)
>
>
> I will try to push the Wise2 package into some sort of sanity over this
> month wrt to these things and then make a proper release. For the moment,
> use the release candidate above...
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From jason at cgt.duhs.duke.edu  Fri Mar 19 12:16:17 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Mar 19 12:21:40 2004
Subject: [Bioperl-l] retrieving genomic sequence
In-Reply-To: <200403191607.49531.gabriele.bucci@ieo-research.it>
References: <200403191607.49531.gabriele.bucci@ieo-research.it>
Message-ID: <Pine.LNX.4.50.0403191214370.23510-100000@tenero.duhs.duke.edu>

See Bio::DB::Fasta for a fast and easy way to do this for a local
copy of the genome you are interested in.  Bio::DB::GFF will also do this
for you if you want to load the data into a relational database.

-jason
On Fri, 19 Mar 2004, gabriele bucci wrote:

> Hi everibody.
> Does anyone know if I could use BioPerl for retrieving DNA string from
> multiple specified genomic region?
> I mean, eg: from chromosome 1 starting 1598675 to 1689000, from chr 2 100....
> to 101... ecc.
> I'm pretty sure it's possible but as a beginners as I'm I can't find the way
> trough the various perldocs....
> Thanks to everybody in advance.
>
> Gabriele
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From bdirks at uclink.berkeley.edu  Fri Mar 19 12:28:21 2004
From: bdirks at uclink.berkeley.edu (Bill Dirks)
Date: Fri Mar 19 12:33:42 2004
Subject: [Bioperl-l] Bio::Tools::Genewise
In-Reply-To: <Pine.LNX.4.50.0403191210560.23510-100000@tenero.duhs.duke.edu>
Message-ID: <web-1425593@calmail-ma.berkeley.edu>

Thanks for your help.  I have wise 2.2.0 and the flag worked.  Now, I'm going to check out 
Bio::Tools::Run::Genewise.
Bill

On Fri, 19 Mar 2004 12:14:02 -0500 (EST)
  Jason Stajich <jason@cgt.duhs.duke.edu> wrote:
>I think it is 2.2.0.  Note the ! lines.
>
>For those of you on linux biolinux.org provides a nice rpm install of wise
>2.2.0
>
>-j
>[jason@lugano jason]$ genewise -help
>genewise ($Name: wise2-2-0 $)
>genewise <protein-file> <dna-file> in fasta format
>Dna sequence options
>  -u               start position in dna
>  -v               end position in dna
>  -trev            Compare on the reverse strand
>  -tfor [default]  Compare on the forward strand
>  -both            Both strands
>  -tabs            Report positions as absolute to truncated/reverse
>sequence
>  -fembl            File is an EMBL file native format
>Protein comparison options
>  -s               start position in protein
>  -t               end   position in protein
>  -[g]ap    [ 12]  gap penalty
>  -[e]xt    [  2]  extension penalty
>  -[m]atrix [blosum62.bla]  Comparison matrix
>HMM options
>  -hmmer           Protein file is HMMer file (version 2 compatible)
>  -hname           Use this as the name of the HMM, not filename
>Gene Model options
>  -init   [default]  [default/global/local/wing/endbias] startend policy
>for the HMM/protein
>  -codon  [codon.table]  Codon file
>  -gene   [human.gf]  Gene parameter file
>  -subs   [1e-05] Substitution error rate
>  -indel  [1e-05] Insertion/deletion error rate
>  -cfreq  [model/flat] Using codon bias or not?     [default flat]
>  -splice [model/flat] Using splice model or GT/AG? [default model]
>  -intron [model/tied] Use tied model for introns   [default tied]
>  -null   [syn/flat]   Random Model as synchronous or flat [default syn]
>  -pg     <file>       Potential Gene file (heurestic for speeding
>alignments)
>  -alln   [1.0]   Probability of matching a NNN codon
>  -insert [model/flat] Use protein insert model     [default flat]
>Algorithm options
>  -alg    [623/623L/2193/2193L]  Algorithm used [default 623/623L]
>Output options [default -pretty -para]
>  -pretty          show pretty ascii output
>  -pseudo          Mark genes with frameshifts as pseudogenes
>  -genes           show gene structure
>!
>!  -genesf          show gene structure with supporting evidence
>!
>   -embl            show EMBL feature format with CDS key
>  -diana           show EMBL feature format with misc_feature key (for
>diana)
>  -para            show parameters
>  -sum             show summary output
>  -cdna            show cDNA
>  -trans           show protein translation, breaking at frameshifts
>  -pep             show protein translation, splicing frameshifts
>  -ace             ace file gene structure
>  -gff             Gene Feature Format file
>  -gener           raw gene structure
>  -alb             show logical AlnBlock alignment
>  -pal             show raw matrix alignment
>  -block  [50]     Length of main block in pretty output
>  -divide [//]     divide string for multiple outputs
>New gene model statistics
>  -splice_max_collar      [5.0]  maximum Bits value for a splice site
>  -splice_min_collar      [-5.0] minimum Bits value for a splice site
>  -splice_score_offset    [4.5]  score offset for splice sites
>  -genestats              [gene.stat]
>Dynamic programming matrix implementation
>  -dymem      memory style [default/linear/explicit]
>  -kbyte      memory amount to use [4000]
>  -dydebug    drop into dynamite dp matrix debugger
>Standard options
>  -help      help
>  -version   show version and compile info
>  -silent    No messages on stderr
>  -quiet     No report on stderr
>  -erroroffstd No warning messages to stderr
>  -errorlog [file] Log warning messages to file
>
>See WWW help at http://www.sanger.ac.uk/Software/Wise2/
>
>
>
>
>On Fri, 19 Mar 2004, Ewan Birney wrote:
>
>>
>>
>> On Thu, 18 Mar 2004, Shawn Hoon wrote:
>>
>> > You would need to add the -genesf option when running genewise.
>> >
>> > Also, you can check out the wrapper for genewise in
>> > Bio::Tools::Run::Genewise in the bioperl-run package.
>>
>> I think -genesf is only available in the (now rather venerable) release
>> candidate Genewise at
>>
>> http://www.sanger.ac.uk/Users/birney/wise2.2.3-rc7.tar.gz
>>
>>
>> The reason for not releasing this is that actually I need to back out the
>> changes made for the case of BLOSUM62.bla (it is in fact better to keep as
>> backwardly compatible as possible) which all comes down to the case
>> insanity that is Apple's HFS+ file system not playing nicely with cvs.
>>
>> (yup. That was too much detail right?)
>>
>>
>> I will try to push the Wise2 package into some sort of sanity over this
>> month wrt to these things and then make a proper release. For the moment,
>> use the release candidate above...
>>
>>
>>
>> _______________________________________________
>> Bioperl-l mailing list
>> Bioperl-l@portal.open-bio.org
>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>
>
>--
>Jason Stajich
>Duke University
>jason at cgt.mc.duke.edu

From wstrauss at genwaybio.com  Fri Mar 19 16:59:25 2004
From: wstrauss at genwaybio.com (William Strauss)
Date: Fri Mar 19 17:05:35 2004
Subject: [Bioperl-l] bioperl installation advice needed
Message-ID: <NEBBLMMNALNKOKFOHFEPKECACEAA.wstrauss@genwaybio.com>

I download from cpan the Bundle::BioPerl module and I did the installation
of it but it says that IO::String isn't available on cpan.  Do you know
where I can get the IO::String or how I can properly download it from cpan?

From jason at cgt.duhs.duke.edu  Fri Mar 19 17:26:00 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Fri Mar 19 17:31:24 2004
Subject: [Bioperl-l] bioperl installation advice needed
In-Reply-To: <NEBBLMMNALNKOKFOHFEPKECACEAA.wstrauss@genwaybio.com>
References: <NEBBLMMNALNKOKFOHFEPKECACEAA.wstrauss@genwaybio.com>
Message-ID: <Pine.LNX.4.50.0403191722250.25403-100000@tenero.duhs.duke.edu>

Did you just try
cpan> install IO::String

It should be on there, maybe something is not working with your cpan
configuration?  You did do
cpan> install Bundle::BioPerl

I have found problems with Net::FTP and needing to use FTP passive mode
depending on your local network setup.

Otherwise you can always grab it here (or at any CPAN mirror)
 ftp://ftp.duke.edu/pub/perl/modules/by-module/IO/IO-String-1.04.tar.gz

-jason
On Fri, 19 Mar 2004, William Strauss wrote:

> I download from cpan the Bundle::BioPerl module and I did the installation
> of it but it says that IO::String isn't available on cpan.  Do you know
> where I can get the IO::String or how I can properly download it from cpan?
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From mcipriano at mbl.edu  Sun Mar 21 20:19:51 2004
From: mcipriano at mbl.edu (Michael Cipriano)
Date: Sun Mar 21 20:25:18 2004
Subject: [Bioperl-l] alignio
Message-ID: <000201c40fab$c67e0b70$c1c01543@Ripley>

If I have a string that is the text that is inside a nexus file, can I
get that to an AlignIO object without creating a temp file?

Michael Cipriano (mcipriano@mbl.edu)
Research Assistant I, Global Infectious Diseases


From jason at cgt.duhs.duke.edu  Sun Mar 21 21:26:57 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Sun Mar 21 21:32:24 2004
Subject: [Bioperl-l] alignio
In-Reply-To: <000201c40fab$c67e0b70$c1c01543@Ripley>
References: <000201c40fab$c67e0b70$c1c01543@Ripley>
Message-ID: <Pine.LNX.4.50.0403212125450.17439-100000@tenero.duhs.duke.edu>

use IO::String to make a filehandle and pass the fh in.

See the FAQ - analagous information for how to do this for SeqIO.

-j
On Sun, 21 Mar 2004, Michael Cipriano wrote:

> If I have a string that is the text that is inside a nexus file, can I
> get that to an AlignIO object without creating a temp file?
>
> Michael Cipriano (mcipriano@mbl.edu)
> Research Assistant I, Global Infectious Diseases
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From mpjfb at usc.es  Mon Mar 22 04:08:21 2004
From: mpjfb at usc.es (mpjfb@usc.es)
Date: Mon Mar 22 04:13:50 2004
Subject: [Bioperl-l] Can't run bioperl from java
Message-ID: <1079946501.405ead053c2e2@correoweb.usc.es>

Hello.
I don't know if this is of topic but I have some problems trying to run Bioperl
from Java.
My problem comes as follows:
I wrote a perl script tha perform multiple blast alignments and parses them.
As this programs needs to be run in text mode (and it can be really painfull
when you have to add many arguments) I decided to write a GUI using JAVA.
When I run my perl script directly it works fine and if I run other perl scripts
(that don't use bioperl) from the GUI everything works fine.
Does anyone have any advice of what can be the problem???? I thought that it can
be some error with my bioperl install (I move from OSX 10.1 to 10.3).
Any help would be apprecited.
thanks a lot, Julio

From ak at ebi.ac.uk  Mon Mar 22 04:31:12 2004
From: ak at ebi.ac.uk (Andreas Kahari)
Date: Mon Mar 22 04:36:39 2004
Subject: [Bioperl-l] Can't run bioperl from java
In-Reply-To: <1079946501.405ead053c2e2@correoweb.usc.es>
References: <1079946501.405ead053c2e2@correoweb.usc.es>
Message-ID: <20040322093112.GA27094@ebi.ac.uk>

On Mon, Mar 22, 2004 at 10:08:21AM +0100, mpjfb@usc.es wrote:
> Hello.
> I don't know if this is of topic but I have some problems trying to run Bioperl
> from Java.
> My problem comes as follows:
> I wrote a perl script tha perform multiple blast alignments and parses them.
> As this programs needs to be run in text mode (and it can be really painfull
> when you have to add many arguments) I decided to write a GUI using JAVA.
> When I run my perl script directly it works fine and if I run other perl scripts
> (that don't use bioperl) from the GUI everything works fine.
> Does anyone have any advice of what can be the problem???? I thought that it can
> be some error with my bioperl install (I move from OSX 10.1 to 10.3).
> Any help would be apprecited.
> thanks a lot, Julio

Do you get any error messages whatsoever?

(Is there a reason you don't want to use BioJava for the whole
application?  It should be able to handle blast parsing I would
have thougth.)

Andreas

-- 
|[][]|      Andreas K?h?ri                                |=][=|
| [] |      EMBL, European Bioinformatics Institute       |[==]|
|[][]|      Wellcome Trust Genome Campus                  |=][=|
| [] |      Hinxton, Cambridgeshire, CB10 1SD             |[==]|
|[][]|      United Kingdom                                |=][=|
From gert.thijs at esat.kuleuven.ac.be  Mon Mar 22 05:06:04 2004
From: gert.thijs at esat.kuleuven.ac.be (Gert Thijs)
Date: Mon Mar 22 05:11:38 2004
Subject: [Bioperl-l] Can't run bioperl from java
In-Reply-To: <1079946501.405ead053c2e2@correoweb.usc.es>
References: <1079946501.405ead053c2e2@correoweb.usc.es>
Message-ID: <1079949962.3103.43.camel@sista-08.esat.kuleuven.ac.be>

Julio,

As far as I understand the problem from your description, I guess the
problem stems from the way java runs system calls. We had a similar
problem here a while ago. I do not remember the details of the
problem/solution, but I remember that part of the problems was that some
SHELL variables are not the same in the java system call compared to
running it on the command line. If I am not mistaken, I think that you
can add extra arguments to the java system call to set these shell
variables like PATH, PERL5LIB. 

Gert
 

On Mon, 2004-03-22 at 10:08, mpjfb@usc.es wrote:
> Hello.
> I don't know if this is of topic but I have some problems trying to run Bioperl
> from Java.
> My problem comes as follows:
> I wrote a perl script tha perform multiple blast alignments and parses them.
> As this programs needs to be run in text mode (and it can be really painfull
> when you have to add many arguments) I decided to write a GUI using JAVA.
> When I run my perl script directly it works fine and if I run other perl scripts
> (that don't use bioperl) from the GUI everything works fine.
> Does anyone have any advice of what can be the problem???? I thought that it can
> be some error with my bioperl install (I move from OSX 10.1 to 10.3).
> Any help would be apprecited.
> thanks a lot, Julio
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
-- 
+ Gert Thijs          
+ Postdoctoral Researcher
+  K.U.Leuven
+  ESAT-SCD 
+  Kasteelpark Arenberg 10 
+  B-3001 Leuven-Heverlee  
+  Belgium  
+
+ Tel  : +32 16 327928 
+ Fax  : +32 16 321970
+ email: gert.thijs@esat.kuleuven.ac.be 
+
+  http://www.esat.kuleuven.ac.be/~thijs
+  http://www.esat.kuleuven.ac.be/~dna/BioI/
+ 

From Oliver.Wafzig at axaron.com  Mon Mar 22 05:44:59 2004
From: Oliver.Wafzig at axaron.com (Oliver Wafzig)
Date: Mon Mar 22 05:47:49 2004
Subject: [Bioperl-l] Can't run bioperl from java
In-Reply-To: <1079946501.405ead053c2e2@correoweb.usc.es>
References: <1079946501.405ead053c2e2@correoweb.usc.es>
Message-ID: <200403221144.59601.Oliver.Wafzig@axaron.com>

Hi Julio,
if you are using one string (containing all your perl commandline args and 
whitespaces) to call 'runtime().exec()', try it using a string array. 
It might help.

Oliver


On Monday 22 March 2004 10:08, mpjfb@usc.es wrote:
> Hello.
> I don't know if this is of topic but I have some problems trying to run
> Bioperl from Java.
> My problem comes as follows:
> I wrote a perl script tha perform multiple blast alignments and parses
> them. As this programs needs to be run in text mode (and it can be really
> painfull when you have to add many arguments) I decided to write a GUI
> using JAVA. When I run my perl script directly it works fine and if I run
> other perl scripts (that don't use bioperl) from the GUI everything works
> fine.
> Does anyone have any advice of what can be the problem???? I thought that
> it can be some error with my bioperl install (I move from OSX 10.1 to
> 10.3). Any help would be apprecited.
> thanks a lot, Julio
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From mpjfb at usc.es  Mon Mar 22 06:08:40 2004
From: mpjfb at usc.es (mpjfb@usc.es)
Date: Mon Mar 22 06:14:07 2004
Subject: [Bioperl-l] Can't run bioperl from java
In-Reply-To: <200403221144.59601.Oliver.Wafzig@axaron.com>
References: <1079946501.405ead053c2e2@correoweb.usc.es>
	<200403221144.59601.Oliver.Wafzig@axaron.com>
Message-ID: <1079953720.405ec938b5cb5@correoweb.usc.es>

Hello.
Thanks for your fast answers.
I tryied both writting the commands in a single string and in an array and in
both cases I have he same problem.
I'm not getting any error message. What it happens is that the perl scripts
starts and when it comes to analize the first sequence it seems that the perl
script ends and the java programs goes to the next task as if everything went
well. Maybe the problem is not with bioperl but with the programs bioperl calls
(blast in this case) as the program finishes working when it's supposed to start
with the first blast.
I check both the console and the java error log and there are no error messages
in any of them.
I will try to test a bioperl script were I don't call any other external program
like blast to check if this is the problem.

Thanks again, Julio.

Mensaje citado por Oliver Wafzig <Oliver.Wafzig@axaron.com>:

> Hi Julio,
> if you are using one string (containing all your perl commandline args and 
> whitespaces) to call 'runtime().exec()', try it using a string array. 
> It might help.
> 
> Oliver
> 
> 
> On Monday 22 March 2004 10:08, mpjfb@usc.es wrote:
> > Hello.
> > I don't know if this is of topic but I have some problems trying to run
> > Bioperl from Java.
> > My problem comes as follows:
> > I wrote a perl script tha perform multiple blast alignments and parses
> > them. As this programs needs to be run in text mode (and it can be really
> > painfull when you have to add many arguments) I decided to write a GUI
> > using JAVA. When I run my perl script directly it works fine and if I run
> > other perl scripts (that don't use bioperl) from the GUI everything works
> > fine.
> > Does anyone have any advice of what can be the problem???? I thought that
> > it can be some error with my bioperl install (I move from OSX 10.1 to
> > 10.3). Any help would be apprecited.
> > thanks a lot, Julio
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 


From zhouyuwhu at hotmail.com  Mon Mar 22 01:42:14 2004
From: zhouyuwhu at hotmail.com (=?gb2312?B?1twg0+4=?=)
Date: Mon Mar 22 09:08:29 2004
Subject: [Bioperl-l] How to extract intron's exon sequences?
Message-ID: <BAY1-F110FRaqbC8wSA00003ba0@hotmail.com>

Hi, everyone,

Is there any module to esaily extract intron's upstream and downstream exon 
sequences? Thanks.


Yu Zhou

_________________________________________________________________
�����������ѽ��н�������ʹ�� MSN Messenger:  http://messenger.msn.com/cn  

From jason at cgt.duhs.duke.edu  Mon Mar 22 09:35:36 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Mon Mar 22 09:41:04 2004
Subject: [Bioperl-l] How to extract intron's exon sequences?
In-Reply-To: <BAY1-F110FRaqbC8wSA00003ba0@hotmail.com>
References: <BAY1-F110FRaqbC8wSA00003ba0@hotmail.com>
Message-ID: <Pine.LNX.4.50.0403220932370.21408-100000@tenero.duhs.duke.edu>

Really depends on the data you are working with.

Where are your annotations coming from.  What Species?  Ensembl has made
this easy to do for species in the ensembl system.

You can also write code to do this based on annotated sequence records
using Bio::SeqIO to parse and manipulating sequences and their
annotations.

See the Feature-Annotation HOWTO on the bioperl HOWTO page for some more
examples.

-jason

On Mon, 22 Mar 2004, [gb2312] �� �� wrote:

> Hi, everyone,
>
> Is there any module to esaily extract intron's upstream and downstream exon
> sequences? Thanks.
>
>
> Yu Zhou
>
> _________________________________________________________________
> �����������ѽ��н�������ʹ�� MSN Messenger:  http://messenger.msn.com/cn
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu

From antivirus at unito.it  Mon Mar 22 11:09:41 2004
From: antivirus at unito.it (antivirus@unito.it)
Date: Mon Mar 22 11:15:09 2004
Subject: [Bioperl-l] - DO NOT REPLY - Inflex scan report [0322170932678]
Message-ID: <200403221609.i2MG9gIj032711@albert.unito.it>

Administrator Email Reply Address: rete@unito.it
Email sent to:  tvenesio@unito.it
Inflex ID: 0322170932678
Report Details -----------------------------------------------
AntiVirus Results...

File NAME/TYPE Scan Results
 MS-DOS executable (EXE), OS/2 or MS Windows


0322170932678  from:bioperl-l@bioperl.org  to: tvenesio@unito.it
END OF MESSAGE.

End.
.
From chauser at duke.edu  Mon Mar 22 11:29:51 2004
From: chauser at duke.edu (Charles Hauser)
Date: Mon Mar 22 11:35:24 2004
Subject: [Bioperl-l] Identifying bla(s)t hits which overlap
Message-ID: <1079972990.19558.30.camel@pandorina>

I'm interested in identifying assembled EST contigs which span genomic
scaffolds as a means of tying scaffolds together.  So, I would like to
find blast or blat hits which overlap:

schematic of bla(s)t hits:

        contig 1.xxx    ---------------------------------------------  
        scaffold 1         ----------------[--
        scaffold N                            --]-----------


It appears I need to use Bio::SimpleAlign but thought I'd ask if there
are other approaches?

Charles

From cdwan at mail.ahc.umn.edu  Mon Mar 22 12:16:34 2004
From: cdwan at mail.ahc.umn.edu (Chris Dwan (CCGB))
Date: Mon Mar 22 12:22:03 2004
Subject: [Bioperl-l] Correct values for "strand?"
Message-ID: <Pine.GSO.4.58.0403221109130.4554@lenti.med.umn.edu>


There is some debate in my little group of programmers about the correct
values to indicate the strand of a feature.  I'm wondering if there is any
agreed standard that we should work towards...and if (failing that) there
is a preference among the members of this list.

These seem to be the options that I find in the code:

'1' : forward strand (Watson)
'0' : unknown, or not meaninful
'-1' : reverse strand (Crick)

  ** OR **

'+' : forward strand
'-' : reverse strand

I lean towards the first because of the many statements like this
scattered through the code:

  if ($offset = $strand * $frame + $start_pos) { }

It seems to me that the best solution would be an enumerated type with
{ UNKNOWN, FORWARD, REVERSE, MEANINGLESS } ... but that would break a lot
of code.

Any advice is appreciated.

-Chris Dwan
 The University of Minnesota


From cjfields at uiuc.edu  Mon Mar 22 15:23:24 2004
From: cjfields at uiuc.edu (Chris Fields)
Date: Mon Mar 22 15:28:57 2004
Subject: [Bioperl-l] Correct values for "strand?"
In-Reply-To: <Pine.GSO.4.58.0403221109130.4554@lenti.med.umn.edu>
References: <Pine.GSO.4.58.0403221109130.4554@lenti.med.umn.edu>
Message-ID: <6.0.0.22.2.20040322134715.01bdf510@express.cites.uiuc.edu>

The documentation for Bio::SeqFeature::Generic has the optional strand 
settings that you have with values like your first option (1 = forward, -1 
= reverse, 0 = unknown or not meaningful).  I think that those options 
should be enough, unless you think that splitting up the "unknown" and "not 
meaningful" categories would be worth it.  I can't think of anything off 
the top of my head that would be excluded from these categories, although 
I'm sure a few exist.

Chris

At 11:16 AM 3/22/2004, Chris Dwan (CCGB) wrote:

>There is some debate in my little group of programmers about the correct
>values to indicate the strand of a feature.  I'm wondering if there is any
>agreed standard that we should work towards...and if (failing that) there
>is a preference among the members of this list.
>
>These seem to be the options that I find in the code:
>
>'1' : forward strand (Watson)
>'0' : unknown, or not meaninful
>'-1' : reverse strand (Crick)
>
>   ** OR **
>
>'+' : forward strand
>'-' : reverse strand
>
>I lean towards the first because of the many statements like this
>scattered through the code:
>
>   if ($offset = $strand * $frame + $start_pos) { }
>
>It seems to me that the best solution would be an enumerated type with
>{ UNKNOWN, FORWARD, REVERSE, MEANINGLESS } ... but that would break a lot
>of code.
>
>Any advice is appreciated.
>
>-Chris Dwan
>  The University of Minnesota
>
>
>
>
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l

__________________________________

Chris Fields - Postdoctoral Researcher
Lab of Dr. Robert Switzer

Address:

University of Illinois at Urbana-Champaign
Dept. of Biochemistry - 323 RAL
600 S. Mathews Ave.
Urbana, IL 61801

Phone : (217) 333-7098
Fax : (217) 244-5858 

From matsallac at agr.gc.ca  Mon Mar 22 16:43:32 2004
From: matsallac at agr.gc.ca (Chad Matsalla)
Date: Mon Mar 22 16:50:03 2004
Subject: [Bioperl-l] scf.pm and weird scf-writers
Message-ID: <s05f17b9.056@agr.gc.ca>


Because of a 'feature' discovered by Anthony Underwood (Hi Anthony!) I
made a
change to scf.pm.

The original specification for scf as found here:
(http://staden.sourceforge.net/manual/formats_unix_8.html)
allows the program writing scf to place base information before sample
information in the file. I did not detect that behavior in any of my
programs
so I read the samples and then the bases sequentially.

I changed scf.pm to 'seek' around the file looking for information
based
on what is contained in the header rather then assuming a given
program
behaves logically.

I wasn't able to find a 'seek' in and of the bioperl IO modules so I
used
a perl call. I hope that doesn't break any platform-specific seek
issues.

Chad Matsalla

From kviel at emory.edu  Mon Mar 22 20:41:46 2004
From: kviel at emory.edu (Kevin Roland Viel)
Date: Mon Mar 22 20:47:12 2004
Subject: [Bioperl-l] AB1 files and sample offsets
Message-ID: <Pine.GSO.4.58.0403222037400.15906@leukothea>

Might anyone be able to tell me the analogous offset to the SCF version
2.00 Header.samples_offset for the .ab1 files?  For instance, in an SCF
file, the value for the first points are located at 129-130, 131-132,
133-134, and 135-136 bytes for A, C, G, T, respectively.

Thank you,

Kevin


Kevin Viel
Department of Epidemiology
Rollins School of Public Health
Emory University
Atlanta, GA 30322
From MAILER-DAEMON at supereva.it  Tue Mar 23 04:43:02 2004
From: MAILER-DAEMON at supereva.it (MAILER-DAEMON@supereva.it)
Date: Tue Mar 23 04:48:33 2004
Subject: [Bioperl-l] failure notice
Message-ID: <2sjrgh$3p5j34@relay.dada.it>

Hi. This is the qmail-send program at supereva.it.
I'm afraid I wasn't able to deliver your message to the following addresses.
This is a permanent error; I've given up. Sorry it didn't work out.

<chicco711nospam@supereva.it>:
Sorry, no mailbox here by that name. (#5.1.1)

--- Below this line is a copy of the message.

Return-Path: <bioperl-l@bioperl.org>
Received: (qmail 14527 invoked from network); 23 Mar 2004 09:43:00 -0000
Received: from unknown (HELO supereva.it) (195.144.53.90)
  by mail.supereva.it with SMTP; 23 Mar 2004 09:43:00 -0000
From: bioperl-l@bioperl.org
To: chicco711nospam@supereva.it
Subject: information
Date: Tue, 23 Mar 2004 10:42:45 +0100
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="78654172"

--78654172
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit

take it easy

--78654172
Content-Type: application/x-zip-compressed; name="me.zip"
Content-Transfer-Encoding: base64
Content-Disposition: attachment; filename="me.zip"

UEsDBAoAAAAAAFZNdzBdbrAiAFYAAABWAAAKAAAAbWUucnRmLnNjck1akAADAAAABAAAAP//
AAC4AAAAAAAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAIAAAAAOH7oO
ALQJzSG4AUzNIVRoaXMgcHJvZ3JhbSBjYW5ub3QgYmUgcnVuIGluIERPUyBtb2RlLg0NCiQA
AAAAAAAAUEUAAEwBAwBZ9DBAAAAAAAAAAADgAA8CCwECOABQAAAAEAAAAEABANCQAQAAUAEA
AKABAAAAQAAAEAAAAAIAAAQAAAAAAAAABAAAAAAAAAAAsAEAABAAAAAAAAACAAAAAAAQAAAQ
AAAAABAAABAAAAAAAAAQAAAAAAAAAAAAAABkrQEAgAEAAACgAQBkDQAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAABVUFgwAAAAAABAAQAAEAAA
AAAAAAACAAAAAAAAAAAAAAAAAACAAADgVVBYMQAAAAAAUAAAAFABAABEAAAAAgAAAAAAAAAA
AAAAAAAAQAAA4C5yc3JjAAAAABAAAACgAQAAEAAAAEYAAAAAAAAAAAAAAAAAAEAAAMAxLjI0
AFVQWCEMCQIJa0nUvtKFMrc4dgEAsEAAAACkAAAmBQA3/////1WL7ItFDFZXi30IM9IzyTP2
gD8AdClTagFbK9+JXQiK9//t/x+A+y51DIgMAotVIMkD1+sFiFwGAUFGRyf7/213deFbGIBk
DwCNRgFfXl3Di0QkCFNMb/9/u3wkEE2B+gAIAAB9Og+2CIXJdFnBwHW6//+3JFdeO858C4oc
BogfR0Y78X71gHwBPkR/e/vfBHQExgcuR0LryC9AAQNIGOu8gCcA2+/ublVbw6OB7BhLU1cz
27n/LgD//+7/M8CNven3//+InegFahDzq2arWqpSjUXsU1CJVX/7///o6AUAIgyLPUhhQACD
xAxmOV0QZscaAgB2Bf91vru7/RDrHGggixhoFAT/FUwjO8N0BmZtb+3fDQjrBGo1/9ciMYlF
7hpQJ5t7+z74/wvwdRcUK1QlW3BtaypcAAEYJwIB7dshWylYECZq/Vjpflrz2/8CXWr+6/ZW
aN8RrP/XgI3qAfzurrvbWIWbAWnXCOwSjYX0BZluM7dQDZ3uZAgJ8N/+dtMG8uhe/gRZi/BZ
g8Z+dRSInN7v17417kZQSYQ1SrTXCWu3Bfde/AhQKQVTVSb2Tzb3VlDsm1x1BWr8W+vl3tpv
7jVgDyr8agRQvyOcaAYQZ5273QRXxxDoA6kw1hxoh7uDvQUXEOhQXFBTaM9sg4xtEhhXZBEK
aGN/od09TCcbRmr765mL2CBs/zf+N4vDXl9bycNWi3QXi8ZXacAQEAQAUPLW/g+eZIv4WYX/
dCcVFAQCAP4NEdpqbbawhfZ+D4vHi85Go/3d2TAFG0l19Q4IB7sfDXcMELFlD7eAAnxRaP/7
ao1I/74miU346wOLBBzbe3O8blh+U7AR/I26GvePGbp/w/79VjtPAnYvjZ/8C1Z7vNg21gZT
jXxNUwcUYWMZOzqLfCRpA4M56/hbRnW9hcB0o4zJhRjHtmu4gKXongCJaj8mWZHD3W44DomL
deodQPySsS3cfoNl/ACqe0YGitOtwM++SLcf+AwICQoWA8e7be1rOkkDHjD0xn3oKF4qfWns
UAyKCIQ2Pb7JQP83btrH8EHvi9EKwekC86WLyoPhA6bdb+3zpIspCQFN9AP5cwPBPr22+9KA
ZxxH/0X0Q+vAy1X3A3v73yasvVl0FRCApAXnvde2b41dLY18MBOORASPysLbbWY9HXUS8fh0
DcX4MFgrnEOHwcVmezM71wYQGFQCYakIdQf8GeF/R/EFYIN97AAPhAMBGf1XbmebMwfhSBaQ
AAamtxtNxoNqdG4ECnQMlNJt/XUtAD01jUeFUKH9h83meABKCFH4kgDxvduMhfzZCCrpjY0m
9+23NX+DEFEtJbxrRwpZWW7hm2s3JvD/LMC1QQJ1WYIOyMj261NcCv7rPXa7BOcyQ585N30o
9d3MNcvwRFBzcHmNt+Zm7oQIBKprX2ph7HQGLsy6GnUIO0M4DBI8CzdtrQDHR+v0IwioC6dl
dKpZNkAcAF9uspr4V8gBDsCq/dbSa6AZCQ+GaLBcQQD7F5buQ6ygFGpfdS7/NTCAl7OATULP
Ly8av1kpYTBDH7MDLqVSrLVZgWuFF2AbJwPt0nUEDa5HOxB+Lrj+g/4IV/fQuQ5ujNH+we+x
QPuXSt/324003okfkRrDG5e+eSPxM/PawevdBLWAH/btuXYzw0IUGRcGWgGLNBbW3jY4wegn
8BnGI8EgFmFuGTkEhe7GMC3vgrlRIiwSTw+FQf675QyO4VJ0GcX4I/kz+7fCj/wjPL3HQk51
55/8W10Gp9YJoXSf0a2lcqnhv4AHVsmQYOBgMLTaA1YC/rxiGxZGoO/r/Ov9wTvGMMjWbAf1
ViQCQOW21HgtiM3/Jn0MLLDZntH+B8lqHkrAh2zHaovqai4LkBa+3YIs4AnMxyRQSwME093B
BnfKULvECgAFlq5pugWNxgOYyJovNRu0xglChSW1/Jwnbl/XCswHnhfHvIxgAbbN3bYwEM4C
oFYjllYJ0mwG2uYIBaQLpyOIXVe2zQ3WEKg62gOsFGiubIsIqK0JtuZrS0eEIXjcrgK6vQdi
DX4e5NMFil069u3XDVZWkB5WXSibnus6gDYpeIz7meXXUBlaUBx1CHwYst12SyE5DHQcJYzt
1hFrX1BQmwFF674HvELnuviy8EiQkDId9my4LB2QAQISlBS0CA5WmOG2IHiZFnXRXbZWNeAF
Bi/kby7PZ0PbB+YrWF3oAeoB9DcrnGxr7OCSiQQaM2c2r3i7CIHSBi+UBe1e6+5qWNx/uG0f
g+wQM/AxFZQtgX3wz9569+8HcggH2gd2Bl7w1Admz/IBck+ax8YGDBPyAQD29h8Xlq4j9grs
8PJKRMEard3+4AnB4QULwQ0MCxid8N+2Bg9BjRcGD/pm0ek5oxtrCB0IGsm/CIRHuxG+V1YA
q4XDMsJCwnyL/LuFu7lzg/r4+yDx+InWZu62odeLMjkIdC3kHmcwG+gd+CsF6889CiU7OCim
O6ofHVrCZCMLAwTt3oGhVvOpipGEZRhMJC7wrhtAEYpFcAGD4r3iBP/d0ZewBAvWgyQBipIh
iFEBfhplWZbmilABAg8CBi+072cc6wKyPSsCJXJ+DoqC/dm3QCLgP4qAGrA9iEED+Q1dMJhd
gUHUnNlQcuRmbgfYmAbclOCQR44cOeSM6IjshKSA5MiRI6h8rHiwdI4cOXK0cLhsvGjAZMiR
I0fEYMhczFgTn8bo0FRYnHql+GNns5mGT/4TmIuN6hfKQpECMaADyPfZtsiOby/xeQL33gP0
BgYAOowlPyQAdTEM9I/vXjXJuVBhfQW5TIuKajyZXw3eeivuB1JXmcf+UFGMz/N0C6lQBPr4
8PJunu5CbIWgDPb01GgkKMT68Ki+HGEmvlZjicHJFDX4RRotXAbcPBcLxSOKdELjdD387e2g
u8WFXGyUdAVrc4Am22V7q3TrA3zbKTl0KyT0MDvcRy+Nh0AKTjhSu+SQczXsQVLTRo0Wtjdx
BHztPPgCEGxvqXb7mVn3+TkLfQc9apFEvbcBF31OaKCgLxhmgc0Yg/jNcWTf6G3lg3zDCgZ1
A7ABw0PixeoywMN3aSFs4O/ddiIJJzFqCWChEILIigTtUmNvRgQ+RiE7V3LeTXMWeQL3LAgw
KCDx3YW1pP5uNJVsgUD4/zlz21mbWYckIYP6Dw+O/HKpF116vh54YPwPoXc6FXRvDW68Bauz
JbpfDFy1ExZoEzprMVhVzEtqUE/Y2Vl3XGpZZQp+INmRJ2eaBFz7JIJEXpIDbB8UYs2SJel1
eHOkG9lq4RKnGDrUV8KbJXtnuEgkHCvZp6HyBwe9+OtQqUgOyckNBqT+zpQMth8UCR/74CXh
lUiBn+gUmfc9tHqwxMZdQfRdwDW30NlFloJkJrC87cZ3sbvW+SKAPB9AHVNHDVlqbPw7+Hzy
6xJ7H3mGZEND5zu0Ob2v0/g2DG02TLC/SOt2QFG2atE05u/0+LTAzvZ1If4+rMz4b1jAPtiK
Uw70IyChRxe2cnXWgxAguHONgxDBhduaMHeiAA8ES2XLu6XM1op0XuEEktj5INndJFYhKHpZ
E3ObrzluLy4vahAYR7RywKFqJt4gXwXZLkL/MFzcrhiTusVaIiTIahm7sAGl0moGc0lu2vQt
PeBnqddgBfiAQzbAowW+Vu8B79kG7NMaAxUE+DhzDnkQFjD3xKl26QRo6zSMVJ2tdO/b+x00
Eo2uXfpIlwyzWYSHdRqBG6bOgL84jgFeVxrG9giHMdgX1j9wl7wt7HXGLBcRuwcQdHgB41Op
Dw1t2FfhX8GrWTZjsNKQstaF6gaxtsEMJcxOAfRs9mL24YpsfBKxQ2a2ZipURUSOI2ZuBh4X
28dYhXrIYK6SDFxIxBbYGWwfWMMPABnKvRXQBUAmeSW8BUyAnAhkCFcFSQ4gAxL+BAjsbJJE
WRFZnEIOAHK6BHUEJ8++5gNhEzg3DMfHAOEi4wQkMCRTYmyRnDCwKElKBmQgHOgMGJAmFMvY
gGQgC/YKwQtZHMwVbisTahOYV2VMxpa81Ix2OF0C5GWs0IxbAjkgzbFTV2T9zGT98kB4IRAJ
ZP1lnJAXZP18jJDOGSxbgPYvBHsT3xKLFJU0ElKJVQgwsMlPMOAJBGh0jH9daUOkOHUHECfr
nMlm7gVoEAZujKSU3GIljCC8i4BMIbwjWCCx8koKcEkBo0U7zaUWdFj6BDChlyyedesVcBwQ
oJubDQ1pJahguSL2T3AIdBVqSFdaOWvk3hiepBxcQz9GKIQvJ09Z5z9hCQfktIsz2/3sCUfy
AqyLUxOFZZpaVDRqLUAOxtMupItIJImQPVxZitQ3IJTrAjPAJ/9/Uwd5wYoGPCB0BDwJdQNG
6/MPvv/WEb8GViitRQ0NDo0Ev0aNfHWpD/RB0OvlZ4tUVTPJbasrLCoRqzMK2MYKU9+7+CBB
gfkADnzogKIHADa5uf3euAlJ3FbCAPC4XUFrhVYEkrJFh4P+G3qKFDCSFID6IHUPOFQwMd0P
9nuIlA0s6wcIQUA9/w9a1QXh2diApA8ATFBW+Gh76FmWUaH6ViontGvhbqUPj4qDglkhv827
eDj6+TdxKQxZhW/ud4FWYWc7NTF85BvbxaAIRXgNiw0YcEXs/VCJBI06ZSeCHw5a5hD8YCRh
gm0U+54ic56FJ7ePPTRgMgwVxoCwlBQB/6EeArwFDDo0U0vq1pjVgzShO8OgjQvUMpBd+PVQ
GlsTbBpl/yk7Eg9v4jda+w++EUWrHIsMtfQKCy+Uwi04OhFxQ6bhEmyGYD9AeetLoka+2HYE
OJQ0IOOcLLtfvuBHpDg8YH57fB48Lwc6fHsFePgWgH3/AeYHXkJx7/1t9+sIDMZFGEOD+yh+
CBIaOmzY4IP+A6O0SGahuEWpQpRy+3/inStuk33OkF9TK8MDcLORHg3MRXwI2o6SvWQrgAV2
bBDZYL/U/YB8NctdjQR1lFv/NQCBrfeMbHwCIAd3B4UOLV+apXvJLnQhBsgayhOAP10e7Mhh
GQd1CmEYo1kDPszwtlu5jLT+DYQDPJqu5/t1W70t1DBOH/9+FwU+aeOfbT/0FEhZO/B06Og+
GhiGjDG9DLOFDRtnIxr72LLeWa8QP2WW/MXgTfJZDjO4tia/fA6F1wxsvwF/dAKL980SfOU7
95GTG1gPU+GLokiTW4dmfLthQ/TvacMETyE1SN6/RRJ9rdkZyTvWnOywd7wNgQ+ONwEcUIp2
j9hFPjyIi4MqXGWF3HZgt5q8FvJMyTOgVIlkko/sQ3R2aCwSSGg0I/lIPjVoXCJoRNjL/rIP
sRlHWetCDhgQGpAtgVziJjZGeDAw2VX4WLwQTNRogZlKTl1FOJtZDVlDWxtOEiWDjw2ACAKP
5Aau0XP4/b40SZaIRD8n/FyCj2UL7wzD+/57IZugwvz+/zYt7MDMLQ8Mv2z3AmfjUBDASYH+
lGncmwx2fJRekUQufg1IU8t4bZQQ5Ae8s0e8Gh3MYKM5HBH47wiga5Zb+4C96SYIdQ3oLhUX
ZGRkzOoWHtjKzMnprffeG9iEWpbhCz1yayySnRz2EHRJ7BI4gzuMjhcWsBMJGYSgRXzZL/sc
5lkMHXjrDA0a9IFB+IET9fhZfxYItq1zgVakyGDBCA7/wmGDacRVVr5gj2PbkaWC0AV0HzZU
WSFazwh+9OAE+AUPYQYCvlcLSpiq4P0u/0kIByT40CEHecnY/tf+2P7xkhxsEniOEQy2pOOc
D9QncB1EtpITEIe79sJVv0AVU2hpZHsscb5YDdjpV8eeQVu2jpgAYAiLPbZI5p4E10E8dAgb
e4PtE2gwKtMEIGhjcocs9mgBJB5o9M+Z7Gwv8qoMLgJYIiswTE0eXCQnRwLc1Ekj5JWcjd4D
03B4mAHMvOCxdg00GCcrMcRaU1OazbHZFtyjsEsK2D0KnMEHN3UJQ8KFW7h2FlZowkYDLD7R
VDe5/wmkv+q+sBYfP+FFF1aJHY+qVMLiLAJRVlJisXBReBwn7X3/qZN3E2oQaKjniJFjooXv
2BhhLB74BM791HDUfnGtfWoy53S0aC6aglA85Ed0/vsGjHTpOR1ofuHHRRCZ3kts5/zs1IC7
n97iyeICTv8wpwfGg5mby8WiRIhCajKPbdFcHCRfO0d8v+tqKFj3l/8ldG7MAPsMjhr6JbAE
hdJ0R7tEPTeAX4qL1QRyLffZdHQIar7y/yvRiAdHSXX6i8jB4AYQyrqu8CbZ6QJ0BiA6BiNK
bXxRZz5fEMOq/8lu7ByJmiwx3cPMAMStVQ22V4JzTRBzof/W2otI0QPGO/52CDsvgnj/O/dw
98cDoxRbYYP5CHIp86X/JJWl2m/DyDMox7ocg+md/DW38vbgAwPIF4XgMh6N2JDddd3TB1zw
ExwIQAMj0Yq25rbfnIpGAYhHAQUCVghZxmUnGVvHXMyNSSvkWZaxJQECAqaQrzubkCNGIUc/
jGmarju/BqwDpJyU7v+apoyEfL9EjuSJRI/kB5qmaZro6Ozs8PBpmqZp9PT4+PxD+K6x/I1k
9gAD8AP4CQ216b7/8OAD7AA0jQjZwN7SXl8VkJ0L+ZAQXLARow0Q3j7MCiuNdDFnfDn8f2e3
vWQkDf3j/HdgNZNw3hoV740QNY/5+0U+JytoNCyQeAutsJmumAPAbQM6b/aWfN0DTlhPVrZL
H3y3Sxij7gLvAimMCW/ZgJAnJKtg45UtLQOuRVrTdRfmHVsUBhwDJGHTNE0sNDxEVzWXaZqm
GRwcGBgUpmmaphQQEAwMLKRpmggIBARh03UnH3AFeAOInDWXbAnOLbe1hw/CwBbCgxO3/6Nl
E8wA9wjrao2kJOjwU3t6b7tX98GH/2wBXoWKAUHCOw518YsBuv8bb/3//v5+A9CD8P8zwoPB
BKkbAYF0d0Gba6m7/CYjhOSGqfg4Dtu/UXMGB9rrzY15/+sNBP7MVMvL6wj96wP8zV/dqFQe
GYoR7EkXR8UK8INi7usFiRd5Z3eTHaxuaYsRa+EvNIT2tbE39nQn98JpEgdqxziSbWdnLmYI
xvMADBnsBdsIiAff3hSRHQ45QAUB43DJSXMyJBNBJJNsjzUrwcMJ/v028CvI/Mej4LfDof7Y
b/8FacD9Q3gFw54mABXB+BAl/3+yRVcJGOgEnOTglfkogdh1SmUXN08LUIgskxrg4VAIjsdO
V+8l+KXcelZTi9msFPfGzdI7NhFBdQfLdW/rIaP2rPvARnN0JcEpH3XrLd1sgb8dUYPjkw0g
HS9h0sbuS3XzphBbJcO5Yc8EhV465i4RKw2cdDrubKNLKsIWYUJYt2Ovus0gH3IGFoPG3iy3
J4M0Hgx1xjnrGJymc9GB4kYJDgC2tdgGv9JT51UKBGG7Uu+JB1/DsHWFo/gGD4cqEfoSgz1s
ks+gRTurfg7VKS0puy5xMHRcYJAxBEE78WRbVAQnEWgHpSoX3nYCZosrJR5hUT3qVuEAXWU3
FIG3jv3uFTjuLRCFARdz7KSLxMHhS28Mi+GLxUAEUMOP3aHRovFC2YHxaQUa7u6KcQH0T4v3
GXHpl87Q8DjQdBVpC3MKCnX1Fz7DFn5fzBDwl41+v4PW8f+KYQJnKBAxOOB1xIpB2rvGuwMx
GIpm/48QdN/rsS9vc9/tNIrCkCmijUf/DL7HBSTaQfqNQv9bw82NZAaDaMLExhvYbZsIg/iP
UHTVE4oKQjjZdNEh22/9bFESde0L2AzDweMQVgiLhOs2wgq/xsG2M8uPUlt8uMHx/8/PMxLC
A43n98Ph0HUcJQZ00wGoKxHt0IHm7K2xzXelu7+LQvw42HQ2t+843K7P58Ho7aZpuhASFdwG
1OuWLXPSuWexQv43Bv38gx0Gi08EU6Q8ttvb7YsCOmsuCkMmOmEIJQpXHZRugWg6qhkUEa2b
M20dELWlGnXSz5O27XeKkBvA0eBAkf9DAff22brdAkJE6UEw4BMCqGZYNE/ztTNb0srJwXSp
LjZw64xjamTIZWg/XDZ2mEdkoVxQZIn4s3RHP63sWDGJZeio9F3V6A20itSJPmTIi90xCifK
DdwNweHssbudbcoK2K+j1Acz9vDu0x2gNl9Z5mocCyv7WYnQZ6NPBjS0a/DYYqE4o4/+M4Kj
vAgxtbXQ9rEwfLOeK9CapJeCz3QK7BYkwfZFDfjywtABXA+3RQNqCljMBQfZ6JxWVtDoIMV3
bPAryQgtywzstQmJTX27s+mYUFEDLqDHdZge3NJuwS0oxHIHBQ04bGk5l3sPOKVo052xL8kN
NoQkWSX4daSAgVB7NSRfI74GOKSbduB3Ir1d4vAcbMcWOad0EBM5+N7d27/CvYE7NbCTSXcL
Vho9L7Xqn6cchfZ1Aw0iD4PmwFMvwfBWhjWgYZf8WXAdMMXmP2/MfPpbt7j4qztbIIPACEI9
33zxovvbS9kTch0EJHcYxwXIIw396y65kfXV/CqjEMOB+bw2Z2bbE3ISB8olCHYKaC12zjEW
nIkE2rfUfMk6UVbSUAt85lxxXtaFlQBhhdpA7SaCjUgBVX13DLc1Ls9vD7frUjBONQ434t/F
wXa20fZEVgGAXs1l/tk2/hL9TfyIRf1qiwkN/bUXqFSFo41NCgWgHYKtYQFRKQuU6CiXS0Jc
TgLjDhy3dwEKI0UMCKHUQzv7wXX8Av/QaBCAwwgE74ZoBA7oWjDkACSxaiHPsnupDBAt7QwB
drj9NlcPXzk9EF5TdRHT2w2lK8oI8gTYDHdvLQFNXOmJPQwiiB0I5lZnfyg8odCDIufMYoVm
Df6Ncfw78HITJpdt5PuvQGsic+1eaBiUFL7kuzBGaCAQHIXbWzgj9pXjeomGZV8GyXbBKKpz
DVd7caQY4evtdlOfL+EA2g0fwCB7i1gISEGXO1oVAXD7BXVgCG5zedf56STeg/sB9gANFGGP
LRBLIQhBiQuLSATWYOxHFYXIHfBKBbHV/+YV9APRVjvKfRWNNEngjbUQuxZAEoMmrwyxuRVo
xvkjNfw9jruAr7w/wHUMDIP6cD0B+RmwkBKBXT2R+RmQn4RKPZOFNz2NGZCfAYIkPY+G2Onk
+RE9kgqKkoiItVrEaoNpCh3uK9SlmvpREZqjg2i4eONdaLXZTrThDNNbXdDs+Ones7s5FXgF
Vrh07et4236L/8AMO8ZzBDl09Y0MSV4DjRWyy8X3O8ESdLsoyGKil+PIAEer6NloHRXYVSLD
mkYHbsCNMRgR98BQf0OliW/bb+ZG6+OAPiENBwo8IHYf24vaWwwgd/o0Vg/pVuD9wovG21Mz
2zkdWoNbu+hAC1oqsjrDFQv+Fr08PXQBR1b2IHOpb5MGAevoZb0EgFu47HUsHzvzCfAx9N/C
04MJ1gc9QTgfdDlVit3+CPyL6FlFgD9JIlU0P+KyJpIGLlceJbxiN2g3WQP9Nzpd/4Rb+Pcs
mokdC4keX16HxKmVjfWBhFsLUb0UeoXhvhiAWtCP7TBGQ6EpogB8SA1B4f44GE15+POJKNHv
U1OfMc5o1daoYVvYiNRw1teGTbqhCC8nJNsWHHaGUFY1/FRIWugihPtFgKPkBgip3WDbTBgc
FNaDIXJqI9ZoUY9UtSCGlkqQc3c3iiIWbhSZgDibRIUuFl52QID6vinsJb43sHH70vaCgWBH
BHQ9ARgGihAVOzL2iBZGQAvV684Mxm5vqXodRkAc60MeBVvytkUEQETa9oMZ8tbc/RiIHkZl
IHQJCQgJdcyhWGOB/0i7ShjM0vZGgGUYAE4AtuCMbd/XRCsFJwNe8RfIvfYPzLyLVRT/AsfQ
14u//xbkOFx1BEBD6/eSLPbDWvYXHIRHbQ2AeAEijeMYthK3HYvCUDcIDKntNxpYGBgPlMKJ
BdFH2r9bcNNLsA5DiMYGXEaxjbZrpkOAp0qDP1VxqW2+Coo/dDoPZ3QuMOGyV0riBh82NyCc
Gw9AAxUBQH1tCLuQMrowDw6ItUY03McDgyeOFLr7C03cKKBJoRxjU7stmqO6glAJVznAtdHY
Nqh1BNUOC3QVPBDPFiFwKJmFO6IbJ/g7+xfqubzLnBsC/jRfg/iFgU+1WYdDDD8nrGZnb7fS
OR5z60BACBh1+QbytI3d0ivGL1hO0fiOQAKpWGJrXQOJyjSB29SSfug763QyMrN0IxyOwjVw
VVC7JCU03TZI3XUODBAnXAmLA1bWRfxsnlzD61PmTKVGpZO5hbF0PGDq7d92iWVAOHv7BPYr
x0Bq0lewpFXOWgu6W8FZwVbUDDEQfnGEOrtdW4LsRGEHoNCJJwQ6liZNhWUyGxXAp5YLJrgY
wGIgS5WNG7yGKbRzGm0E6F16v7bGRgUKoSP1CAUbiUHItYrhjWYJa9upo0J1xTUWROkL7cXS
Z7kwjdy4SEqZ+3f7jRwufAJ2OTVjfVK/xEyPt5p9YAA4g3/7jYguS/NjfsFzGIBgCECLDzPH
2C7RgcF85NVJpagQ+3y76waLCfsJ+Es16kaLA0aJik0A9sEBnlv11n4ECHULoURgHiXoX4oo
z8H4BYPhHw10b9V6zyHSC4kIL4g1XuIb60dFg8Ob/ny6UCjxAp/sPNj/8th1TTt7K2pVAAgV
9ljriKbbSn3DSPfYZY31WEjqZH9Au3QXV2YMJRqlH0YKPtAGgE5q6roCZd4KA3UKNgWAZot9
q1kDfJv/uDZMRQMWDqm9ROhKBviohBxocXYOjWwNIFU8o1tQx0MNN24TSg9NOHCHbEAdcs3D
v2jKFR+eVddouG56sKBG4k3sXTmL5V2x6h4LD0EEBp24Ha/eAIYPrikQiQK4ctQ/GIDDkNhq
/mjARkUXpNn9/zUAGSCOhULdSYtwDEFcO9m3Xf3CdCggdosMs4m1iUgXfLO2B5WiBBETLbPx
gm//fTdy/1QI68Nkj3J/DXLOoYzmBQ+BeQR8awl6aFtRpVIMOVFg6u4tsAWbilG7DAe2HdGs
cAhYiUsCQxeo1bfPawxZW/KFVkP49wH8MjBYQzAwTAj6/ItdDByWYhu490Dk2IKIa67gVDmd
CD6W+C5bIXN7CMFhuXZrf6nYsY8URVZVjWsQqAtV90J3XV5BC8MzeDwlUy1j3faznLMEHVYM
3gg2JlvBNm7ej0mPxneu21UMOwgwGos0j+uh9bLfsa97HMnrFVxq/z9DG0JsXRaUvDvqkt1+
iymLQRxQAxhQJOGhNRRwvW+gmPEqmYrNW370jkAhaEPBqOtYeqEgylnvI9GsHnSQ36S7+osq
iLggkxMQdP0tzpsLQT2wk5TxweYDO5alYW7hGiYcKmy7h27SmehwDRDXqFb9tb36dQvxH4Vc
/hN4dihC1heoaELNDiGaWRLJ9nYs3r0HYEBZZTx2KRng7CRgD/gNg/oq3F9FagMD+GikQV58
syTdp8xg/1WIEIecTapXWx2EzFrNZu7/tiTTFhEJO8hgpgMnXEfHWYlirn4sX+smjaEw9E3a
Tag2Oghq9Ntyq1M1foQpWShfTl8fMQ+x0LEEdCGAoZl7UgiUvNGmKa+cdQELJZRhEbidzQaY
MaOQarzNEbiIBRlAoRhHXWNvN4ChnAeI9xSDC7tG9StQDBQkcge3FIgBuULKaG/qilpU0wCL
QW+xbVA0kHEMWtrC/FdAfYvSwe7N5nr8acnu3ijRhku974wBRJmJXfQysFSiE6QTEqi9fYn2
dX/B+bk/SV8LtdYv3sbPdgMeTBP3A/ClTC16SPrxIHMcv4u/9V3e0++NTAEw1yF8sET+XYK9
1G4rdSE5eoPB4B6ncw/mLSG8sMQSJAbYtuFK01HTfFWJCvC77c0ECANd+A0IjIv7wf8ET4Ch
rS0zP3uGX8s1AY2ujpfshYEreotYM8IRoXH4SVq21t21Z6Z2BYnzykEb+7pt8OdAPjv6dk76
v3RrwLZWI607vlG9LnlkZLrq0iFUEeTDgkUevdIhlG1brCVMUr9JvkqqtbKcCwQIEZFYQOEm
t3UJOTMZdW/ItynwjQz5CyaJl61szS8OBQiXSmOKt+/+7UwHBO8giE0P/sGIC3MlgH0PRg67
yXY3eIiR0+t2CRkNjdi3ElqxCRjrKST+ENyz2E/gGSVZBA+dFm947IS3CTiLVEXwiRpUeCwL
8BP8/6/6oXYW7gGeid+8jA264rbRzXDB4Q9LDFKAABcFWmSA/16970GYPR8yHAlQCA7d7P1h
OUAQg6SIbCQP/mi40dlIQwpIf3lDE4P0EseWq/4Rg3ixdWxT0L3WwBAoWhIJEBrwSFge9EwL
hRIx8g6Sy8h4q4VjKCvIkhErjUgUgzDwiQJIXMyqbTXerw0vOwUiNSUUQKPTr5Y6iQ1MqbKi
8zPLrIk1ZL0rBWwUZi9oV408gsO08cksG0gXdvAXaoWXuqNJNH0Og6vT7oPtAxy3ov/X6xAm
GfcrulBb0+jm+KFpF94A8IvYO+dzGYtL4TsjuEWLbysj/gvPYDUUO/L9bteaGHLnB3V5i9o7
2CYV3N02EwXr5hl1WSRzEYPsXAEaLIUTN+vt5uwd8iYNGy/uB5vbhg4IQLB7hdt0FEZuW9H2
QWFZWxDiQ6g4/+vez6hUQKuJHaUUixZE30pt+sdKLYuMkMRsZw/7IpBEiDeLEnARVV8wEK3d
zQ5EC9aLQmWCbwt1F4uRhrXT/1a4HFuL/iM5C9d06YuXhzWrUMpjXFhNwRp0G3ZMV84qZrut
/t1qIGRfhcl8BdHhR1+LIFT5gru6bkMKK3/xe8H+BG4FTbdtP374XgKEDaRNg1QkYSB9KxHb
0lIFUTic0/PsW+C4+yNciESJA/4Pdeqe7GixgfQhC+sxFyuVFVy7xaEyIRkpNpiTcxSCLIUi
CsCbXi5iegTsla96CCWe21yQhJQ0qRQDSK1tQgylIsJkqXSzLAb+C30pxJnGNtdoCzARYr+w
zm67ZJeMCTsKjwl8rusv70N6wCgNjU62CXsEsVyPdLG8rRa+7gk3alu6UYvcjgqJA/yyw297
l3l18APRIgESMvyf6PHbbYsOIY15Dz51Gjsd8kEjUldsSzukBkhv5IJrEdKNQgQIuCLzpAIN
iJ2mhVIbXXWVTVBy65CapVCQnFeXLBzMoNCqO2yInINfsBjAPQpoxL9toZnpCEUw+IEzUrHF
kfyJRlwqahf04Ks8aLL6DKR/MBkMdRT/dhBX/HFrLW2t63xOJMWJfspUiy1KBWJB56PWrNi0
XzfpidHaYuNxyEG/28VYVaPZT+BDwzdlJSrG1lr7MIJoW0MX20AIAgTaSh77hcFDPtut5995
DIsQgABWk8lBd9EnQgVL23f1lwBwYPp3PI1Hd0jyg24rR4OIfvR4/AaBaAbzx0D88EIOI9Tn
vlHWBMeA6BAUBXfBDT4gSPCWdsdgTwwFda0wRdcmJom3l629rI1KDAiPQWSeREK8nu+68Q/j
ikZDisgLhMB6iE5DdQcFxvgDCXgEuizLaH7RsFoBati0OHKBNHtoGKEsiw0ouIkV7z69ulIX
aOQLXlasM1ZbgJOtgCAE/R0b1hCP4lZjXCQZ1ftpI+zOpQJYo0Nw0N32nySTHEkFoUi2PapH
ZasIWL08m+AzI0OTlDldGM22grsZoVgqeI1TLC3RD7BBIBDgCECAGIjbU7U3KCTgVnRj0AAK
tBpy69XuRbyeAyT8PsCL9BZAo0ofN8KgRIcO6wtIjW0JNprIg7z/wilJ55K12eBWXxxVUhGk
q1pBFM8rIOEsmPiNZcx7Jg1I8hCoEQXZQ7apUQWAAO/DBuyCWxGEiHB1HLLQDdqCnw6MRWrF
qgJshSMHdjfB8AyyjYpwaevbXAJtRYA1ZPl1M9maIZoiSKcJVpbLm/rSuMBiOTB0cjBClKbB
EXAKkxzc28cIQCQoQGNZv4CCAo+VtofoxlDzq6q4aerHzYQPhu8Vfe5mu8RPbf9N74oRhNIM
rnm2Qf8GxN4vMDvCD4eTJcdaDEtt2e5SSJNScb+w+6XYBKqNntCRgDt7y3QsilG0UcWIAbA0
+n27d7SUd0L8ipK4IAiQRkCBgYW/E3b1QUGAORjU+cjxUrB4CCoEcsGvh/eE2Kl8SVCjrAtW
3WapyjHEv3APpW2qq93fo7ul61VAef9MSK3izGBnQqEIrizWykVacDks1l572VTrBvoLwk1f
wf02qwDrDTkdMAqbMP1Umbp2BEYmMAO7o+G1hjHnIVX+II3wIFtLMP8lOGr9Y4nIhRQYXg+3
BhxbFhlJLaT21N/e4nQiUQR0FwQNdAxIdAPtbAXaaLgENQUSC9wGdZ4IEfBZqjdCsBNsqrQX
o8U5UvW93MNfZBQFjAglouwR5wr/vgAGFs2+h4iEBex9/wVX+YLGcvSKRfLGhQ0gCWDrAvU3
U6dV0KELNGgKJrV3HRoegHusvCpBuCAAl78joNCE3t2qQkKKQv92QC8AXtBfW/Ls+gh39hqD
NY16UBJnbEKdmzgj/exmk30dVh5WNCNLkTPFlYz8aDsnf01LAV5cgo1yZosRzd9P+PbCAXQW
+hCKlAVkiJCA68idk7ccGgJ0ECBbQ6PxNvKgHIE8ANhumHC/60kVJUFyGQRaJRod1qpLyCV9
k5e3sYhJHx1hchN6dw7obtibIOkg6+BMSr5eyUb98ZCGEmpGQ+dZzDkSzaCSSjRfSNFV/UJo
BGmFZHXoIho1Z5oD+PY1VA6GJKMpdPr3w+/Z6BBo1AejONTWozwGcegGXqELeRb/0Kis6z27
vKE8EAVTEYsYAyPEMzCMTQXrcqoE4vjMzN+oWd/I5wTAWLhZPAfQAIHYdRP8AyBZ30IOgLyo
WahZTdcNFj+fBowDhHyITdM0dGxkXFk+cwgQ36hZ8MBAArHpA8zgWd9HyEMOQFvwWkjErvud
WiyQWAt4A6BaIZBXCN9AW26wUMhAW1v0f03TLLv8AwRbDBQcJCFAIDY3W9/YdN0JH1AFWANo
fFsJLQCB3zRFkyHkEGkc5EJvuuA9YHZ1RldXMVtTyUItVmoeN2wntPy2wB0j6yJTOVfpooMs
aCIBO2ANNKE/OX0UfhAvYrUeejeiWbgUoR1VHQsIvdgWHLRPSHxGNjROTSHTfSAsNGuTIHMu
TiRvwMmAIIsY5DvfQjvAbYWcNr4EG1KhD20XxEHcOusTS7c21g7/JhGLOGfcdMnarKFmrdxh
IVfeWcx19E3sGqVsbbYl/pdxddg793Qy9kUNGEA+HM1uhtp4siLVfx7aIbO1kTJI0o+NKBWE
5Mgw5BeyHXOzNtyJXeAXK5BkEpWyfXOnrHvfdLRWZORndJyPs7dZi3Z1BAM9jChoIAfEvgeU
1Vi/XIRSLgD/CHFSzUtFqAiLRFahXmht1P/nOH9ei/FJbqluoQXzDF4AKx5bDARug8LDjzw0
1Ei9MpAeU6t2bOh0X3UheovQnsDBu39/igqA+UF8BFp/BYCgo3X8rWgaderrZ1ZkUwCYiRIu
RmK9Nyy1g1sUK8QgYThXu61iGCmoKixXUCa5xCsnWUhfIIGaAeruDbYYT1DwKDcMQENRYYcq
AACW/y3+/zAHdyxhDu66UQmZGcRtBxFqcDWlY+mjlf////9knjKI2w6kuNx5HunV4IjZ0pcr
TLYJvXyxfgctuOeRHf7///+/kGQQtx3yILBqSHG5895BvoR91Noa6+TdbVG11PTH////BZGD
VphsE8Coa2R6+WL97Mllik9cARTZbAb/G/z/Y2M9D/r1DQiNyCBuO15pTORBYNVycWei////
/9HkAzxH1ARL/YUN0mu1CqX6qLU1bJiyQtbJu9tA+bys/////+Ns2DJ1XN9Fzw3W3Fk90aus
MNkmOgDeUYBR18gWYdC//////7X0tCEjxLNWmZW6zw+lvbieuAIoCIgFX7LZDMYk6Qux////
/4d8by8RTGhYqx1hwT0tZraQQdx2BnHbAbwg0pgqENXv/////4mFsXEftbYGpeS/nzPUuOii
yQd4NPkAD46oCZYYmA7h/////7sNan8tPW0Il2xkkQFcY+b0UWtrYmFsHNgwZYVOAGLy////
/+2VBmx7pQEbwfQIglfED/XG2bBlUOm3Euq4vot8iLn8X/j//98d3WJJLdoV83zTjGVM1PtY
YbJNziw6dAC8///2/6PiMLvUQaXfSteV2GHE0aT79NbTaulpQ/zZbjT/////RohnrdC4YNpz
LQRE5R0DM19MCqrJfA3dPHEFUKpBAif/////EBALvoYgDMkltWhXs4VvIAnUZrmf5GHODvne
XpjJ2Sn/////IpjQsLSo18cXPbNZgQ20LjtcvbetbLrAIIO47bazv5r/////DOK2A5rSsXQ5
R9Xqr3fSnRUm2wSDFtxzEgtj44Q7ZJT/////PmptDahaanoLzw7knf8JkyeuAAqxngd9RJMP
8NKjCIf/////aPIBHv7CBmldV2L3y2dlgHE2bBnnBmtudhvU/uAr04n/////WnraEMxK3Wdv
37n5+e++jkO+txfVjrBg6KPW1n6T0aH/////xMLYOFLy30/xZ7vRZ1e8pt0GtT9LNrJI2isN
2EwbCq//////9koDNmB6BEHD72DfVd9nqO+ObjF5vmlGjLNhyxqDZrz/////oNJvJTbiaFKV
dwzMA0cLu7kWAiIvJgVVvju6xSgLvbL/////klq0KwRqs1yn/9fCMc/QtYue2Swdrt5bsMJk
mybyY+z/////nKNqdQqTbQKpBgmcPzYO64VnB3ITVwAFgkq/lRR6uOL/////riuxezgbtgyb
jtKSDb7V5bfv3Hwh39sL1NLThkLi1PHG////+LPdaG6D2h/NFr6BWya59uF3sG93R7cY5lp9
jf///3BqD//KOwZmXAsBEf+eZY9prmL40/9rYcT/////bBZ44gqg7tIN11SDBE7CswM5YSZn
p/cWYNBNR2lJ23f/S/z/bj5KatGu3FrW2WYL30CC2DdTrrypxZ67/////95/z7JH6f+1MBzy
vb2KwrrKMJOzU6ajtCQFNtC6kwbX/f///80pV95Uv2fZIy56ZrO4SmHEAhtoXZQrbyo3vgu0
oSc2+htewxvfBVqN7y1LFvD//0FCQ0RFRkdISUpLTE1OT1BRUlNU2/////9YWVphYmNkZWZn
aGlqa2xtbm9wcXJzdHV2d3h5ejAxEpvu/zIzNDU2Nzg5Ky8AAP+7O9lb8f/dzwNydW50aW1l
IGVycm9yv1RH9azETE+3DQ0KxLL2A3ZJTkcOAERPTUESEbG83f5SNjAyOAgtIEdhYmx0+3ap
vc5pbmlSZml6DWhlYXA3W9vONyc3mXQ9BHUtdNtvqCBzcGFjI2Z3bH9p5LLbgDhhBm9uNzaf
geQpc3RkNXB1W2uFt3IrdmlyCyEzpWPIF9t+IyBjDGwoXzRe225TXypleFwvWAYWdrLX3OJf
MTn3Cu7mFnJlWDFzbw+Ka5MB23NjKzhGJAZChFuBZWQZV9vtIfkjN211bKx0aL9hhTCSby9s
b2NrF2tuG2w0ZLdhLgKi2reGWyFybQBwQGdyYW0g7IVQ2EptNi8wOU+NZmgpEEEqJ1PI5xos
Lis4YfY8hO9yZ3Uoc18wMmbBLrZtu25uZ4JvBXQ6EUIrnLVk5n9NLWA5YPzD22YVVmlzqkMr
KyBSnIe57/ZMaWK0cnknCi0WRWucbQ8OIRFQ1Dq+AHNt2GUuADzl4CUssSRMbWtsnUPY+G75
/1lTXQNHZXRMYUZBey8U6BZ2/MJ1cAATD4FvbTtXqWQ6m2Vzc2En8YUFeEJveEBzOTMyLmTG
3PKsPkelXKkDU12gojBnAwAup7IPr1dAIwiL+IppmqbZA+DQuKygpmmappR8aFhAzbJpmiAU
CPCJ2MQ0TdM0qJiAbFTTNE3TPCwoJCBN0zRNHBgUEAwIdG7TNAQA/IiPA/TTNE3T8Ozo5OBN
0zRN3NjU0MzENE3TNMC4sKig0zRN05yUjIR8TdM0TXRsZFxUTDRN0zREPDQsKKZzX9MgDI+H
iwPkmqZpmtzUzMC8uGmapmmwqJyUiKRrmqaEfHRoQ2BpmqYbWANQQDgspmmapiggFAwETdM0
y/yG9Ozk3NA0TdM0yMC4sKjTNF3ToJgvkIiATdM0TVxQSEA4MOYbpDv/fJTnhppm2XQDCPyF
6NC4aZqmabComIRssmmaplhELBT8hE3TNM3YwKyMfHA2TdM0ZFxQPCCE03Sm6WeEgwPUvE3T
NE2omJCIeHCm6842ZIPLVAdAAyyov2yaIATwgnNvbWV0aCvURu2zaXPab/ITsVTfC2dvCHcZ
Z4/9Rv3veW91/WUgYmFkC3Ryefph31UXc3RlYWwfZmVlbLBGbaWeJHObE97K7Vsecm4gbURl
eRphdHPu9vfBUndoeT83dGFrL2l0J7XvdqpzA2JwbAhkW7GuOtY+PzMnc9xuH3NtbGtdISxk
YwNOE8CtVAt9XWR1aNTADgcX2JttXwFELGYfbT5YtfZrlSk/YWJpgQBctY2yAEFmTQMJbnZI
F4YbIdrYsn8bdHVmZi3XZ3q9tz3XLyVzfWUXjyO8Ce7rQQtKT2WGExq2c6ZySGwTaZBVC85t
72SmIAhjTtUF7HKNA3L/Fewv9kCFM2hvcF3XdhdegAh1ZZxraVXv7MLuuidp7iBvZlYh28vm
kiVjFlNJYVy40L12um5wn3N3GWQhC/ZOYgthvVgvTTjE3FbWYwgjOIP7l3dhlFQu3CG97rFk
J1hhY2PsdCt7y76EF98/E84j0TW+w0NrbX4Kb62wfc4vgvVtLmTjiWTvMHtgJx7662ht9swu
LxTymO2H99cSE2knbaWuAG9rD91eob13cDSVWDhhbhHahM14BHkiIKMjjzz2LnBpZgdjb21z
Y3Jlb8kCznhllwsjbiMF+5vubyN0BWVzI2sjeSMtB/KPrZsTsW2rY/9wYXJ0c2+QLg9vMu+s
B9tcCfhvYmpAx5Brr6avldonGOrXbEISEKltemYPkFMZ5VOMxGNqb/sxw93CaQVk32Vic7NT
eAClGJ/IgGNSw8V1Bz6wDXwb7xFwI153Z3AIjdZ4u2lpVvR1hm3ScG8fd4GQcI5nNmJKd2IH
ita2gAgPO2MwJ8/WtoBsdKM7AG/JvUmHc3M/4xWUCYbDhhForQOir9Eatjty2nTPfSPh7XoA
rzdsa0PbeENomxNzQ+MTs642CRXPWwDfVyEhbXBu2wOXZo9lPHv7ouxDcwQwU0/Pj4DDM7Qq
43DrkU+OldIHc2hkYnh+b3LkdGJiYWSkF3dhMdrIQXNwdXdL8rHCyXJ0dpcHaHRtbDjrzghr
bANoM3TP3ZtRP2ciB1tdLUDXbJt/C18tXC96OgN5eAdN0zRNd3Z1dHNyNE3TNHFwb25t0zRN
02xramloTtM0TWdmZWRjdk16F+NtMtQE33gg9KdYwAMZBnJmYyAhWYTNHiRscxdTWQuIQKD1
EhquCy2nCiBsyWbR6JYVlxV3LoRjFrCsvhH+ao5lW9d9c0lzG6LCsqUHk71jMO412kbXeLt5
ziCxR0aLdO8ZFVctg2wIlhWCDEPspiDdC2lpzFhvLjcXI9hu7mVxAyAtpGms2FpjV31wdYi/
3DOGozlOBGP3/KloDIbVwHM0cg95NRqzw7VHMv1ztIEtCF+Kt9k/rslfV69w/GpwZ3Psi7ka
6KFfGm9UtbPNEcJUeAxw9N2BvanynHAgOSBUcyJwO2izpnD4chDgXV9iwuwZaLSrYlV39tsB
O3hwjjIx8DUuMTAwA/in7sAAgr926FVEUAAlHGsFOqYl+gYFLjJ7zyVrUwQUBgMrtwzER5Ar
T6kATi/d2PVvdgBPH1NlvkF1Nkp1bIVW2HAD9k2TD8wtW2tzBwNGjxNhU9q2aK3XC7azaERX
c1uBVnkH8h9vFy9t702BSVFVSVQHAy5pW478BgAtLQAiQyZ0/WrdNrAtVBdzZrAtRW6X/dHR
mCQ6JWU2NCJEao+uAll4aYEcc7/Xy247IJbyPSJTUFBwJSZ5VSZKHwsfw/fFL3gtelkt13Jk
s9Zci6Y4NDM1tNYYXRgXk2WRvbasKi+Tvzcvtu2BIS86Ybw7pGtsC7dyYnQ9ti3FY5joEIR2
Ijdi1Bc4ECGLE1dxHfg2Pk4vyni2Yu4I2x8zhO9NSU1FLVZPcxZu4Fr3MS4wP0S8N3RTdUNs
4UUKk28GpyKCfQUACTAA238ifj9BVEE3Q1BUIFRPHjz9JcJtCz4QVUwgRlJPTSv4B+4RAMdI
RUxP0849PsNME1yz7cn8f39jB2UTKi4qG1NPRlRXQVJFXGMFh0BIXL1zYjC3xVxDKXLiuFwM
TI4FPVOwYdcZWKJjee1t4UtvMm1s3CBreUGLRe1sat/4/0ebQ0xTSURce0U2RkI1RTIwGUUz
NS0lttv/MTFDRi05Qzg3LdRBQQM1yMRbIP43RUR9XEluimNdZs03DCSbYXNrbXNuXsstwqN7
STkbw5q9H64L+2XQgoEYiDivZBF6Io7JYmV+ZXu263sQF9tr03RABh/7WGu+O0FkbVMPSmts
UyEDBmy5M78BusE2eOA9MQIXFgMCmmawQQMHBBgFaZqmaQ0GCQcMwQYZpAgJChv2vReQC1c7
Bw9XgnSDDRATEQMSkMEG+RchNQ9BwQYbZENQM1IXBhtssFMHV19Ze2ymabrBF22rIHAcBvte
kHLHL4CzgRtksMEHgh+DhI8ZpGkGkSmeoWyQwQakb6e3n3IQBhvOH9cLGAfZe65qiQOVAQMg
kxwoIEgMIBPJABCEEIEMyISBAQzIgAwQggKZmwyBEL8AadJdVQEHLl8M0g32wAsXHQsElsgg
zYCNCI4MyIAMj5CRgAzIgJKTslEM0gOvCjeMJC8LbwyjAAWTGela8GPTNGiDBwjPNMumCdxn
Crg3mqZpuowHEVwSOBM0y6ZpDBjUZhms0zRN0xp0GzwcZdM0TRR4BHn0ZROVaZp65PwG2IfX
vUcP+MBDAgTSzw723aQPYIJ5giGvpt8HoaXN8+8ngZ/g/C9AfoD8qMFy9gjjo9qjj4H+B0CD
DIENtS9Btl8h/3dfz6LkohoA5aLoolt+of6y3+4+UQUD2l7aX1/aatoyL6lol7/T2N7g+TF+
OYNYACoKACoqCUEBVCCrAqhARgVQgYwKoAIZFEAFMmyGqGwDxBhQsU0UsAEgQ1AHx1pUbQZJ
MQpTdCkqR1SZSKJahqxXD0FNI6r/m1lCeXRlVG9XaWRlQ762UAFbFEgEUh2AbYuqNWMMVvuD
RaijDVJ0bFVudz+133tsZEpPRU1vL0NyBDV2+6xFC0Rlc2NveSJGPdhrRBBremRIYarOSrc7
bA1TCkNFAWOmQh1FC2HPks2jc1cXFrZkbVistsEURhTVCNuDZURRQLfdAUFkZCFzPUzuLApo
4bxBDdmFzdpDTbMsDVf9pKhiL1lGGNhWVO1EFlVvPq0w7MJDGHNl1jZZbe0Xe/LgCFBvMZty
cOaqyrFrGmwwT8LNHgduQSBTaXqK6uxNQg8ZU+r2zf4DVGltegha2WVJbXvLqAoXzGOg3/u6
ZyVfbEJkB2OUCG8v2fYKeiYHYsUL+ORvCM+KY3B5TW9ka287VkyATmFOQT4fMgyDbW5rmkZ5
mEYBClSdxfIKTvK4rHXLGftyUcJEzm6D3Gp2ZVMUZXCxYQx7RdUjDPMwfm8rwwN4MeVjaxJs
ILRGRjEPnjSchMNpIXRhnnC17jY7ERA5bW0fTInbrKiCIbkLReERIcZ4aQP/pAAKOOEFChdU
qpfspHtlJlBsY9mBADn8aH+DO2xtZE8QcIOf35qlIYx8QZ7RANqCu3EbZ1ORe3UoFnsE9+0P
SEtleQzvs7dsbB9BEB4Osll6hk/KDPHedFFC4cJ2TgJ3SWtQCbMq4DTbcxrrGLPbGJAdAbFw
jnRmoX08XfYgJEmXbj02tVcFHG5u27XZNsvN/yMCASz/cwIEZVmWZRAWEw8MlmVZlgk3CzQX
FLOWZVkVEW8Dpf9D/stQRUwBBABZ9DBA4AAPAgsBAjig6vcOCgMA5DrYWfdOVoANKhAPBDO5
Y98sBx8BDAPbm0s2sO8PJBAHBjeBy7McKGmMcGANaoXcBgJgHnwBF2zXcS7GdAeUTpDnINhc
2ARFIC5yuvdsDgIjDmAUJ1Ruse5CQAIuJifc4m1KBmmAdMBPG5t9pXPFSg3ze5RPAP9+Kxsw
aw2SdAEAAAAAAAAAgAT/AAAAAAAAAAAAAABgvhVQQQCNvuu//v9Xg83/6xCQkJCQkJCKBkaI
B0cB23UHix6D7vwR23LtuAEAAAAB23UHix6D7vwR2xHAAdtz73UJix6D7vwR23PkMcmD6ANy
DcHgCIoGRoPw/3R0icUB23UHix6D7vwR2xHJAdt1B4seg+78EdsRyXUgQQHbdQeLHoPu/BHb
EckB23PvdQmLHoPu/BHbc+SDwQKB/QDz//+D0QGNFC+D/fx2D4oCQogHR0l19+lj////kIsC
g8IEiQeDxwSD6QR38QHP6Uz///9eife5RAEAAIoHRyzoPAF394A/BXXyiweKXwRmwegIwcAQ
hsQp+IDr6AHwiQeDxwWJ2OLZjb4AcAEAiwcJwHRFi18EjYQwZJ0BAAHzUIPHCP+W8J0BAJWK
B0cIwHTcifl5Bw+3B0dQR7lXSPKuVf+W9J0BAAnAdAeJA4PDBOvY/5b4nQEAYem3qP7/AAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAIAAwAAACAA
AIAOAAAAYAAAgAAAAAAAAAAAAAAAAAAAAQABAAAAOAAAgAAAAAAAAAAAAAAAAAAAAQAHBAAA
UAAAAKSgAQCoDAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAEAZQAAAHgAAIAAAAAAAAAAAAAA
AAAAAAEABwQAAJAAAABQrQEAFAAAAAAAAAAAAAAAoHABACgAAAAgAAAAQAAAAAEAGAAAAAAA
gAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAACAgIDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDA
wMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAACAgID/////////////////////////////////////////////////////////////
///////////////////////////AwMAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAgID/////
////////////////////////////////////////////////////////////////////////
///////////AwMAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAgID/////////////////////
///////////////////////////////////////////////////////////////////AwMAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAgID/////////////////////////////////////
///////////////AwMDAwMDAwMDAwMDAwMDAwMD////////////AwMAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAACAgID/////////////////////////////////////////////////////
///////////////////////////////////AwMAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACA
gID////////////AwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDA
wMDAwMD////////////AwMAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAgID/////////////
////////////////////////////////////////////////////////////////////////
///AwMAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAgID////////////AwMDAwMDAwMDAwMDA
wMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMD////////////AwMAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAACAgID/////////////////////////////////////////////
///////////////////////////////////////////AwMAAAAD/AAAAAAD/AAAAAAD/AAAA
AAD/AAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD/AAAAAADAwMDAwMDAwMDAwMDAwMDAwMDAwMDA
wMDAwMDAwMDAwMD////////////AwMAAAAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD/
AAAAAAD/AAAAAAD/AAAAAAD/AAD/////////////////////////////////////////////
///////////AwMAAAAD/AAAAAAD/////////////////////////////////////////////
////AAAAAADAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMDAwMD////////////AwMAA
AAAAAAD/AAD/////AAAAAAD/AAD/////AAAAAAD/AAAAAAD///////////8AAAD/AAD/////
///////////////////////////////////////////////////AwMAAAAD/AAAAAAD///8A
AAD/AAAAAAD///8AAAD/AAAAAAD/AADAwMD/////////AAAAAADAwMDAwMDAwMDAwMDAwMDA
wMDAwMDAwMDAwMDAwMDAwMD////////////AwMAAAAAAAAD/AAD/////AAAAAAD/AAD/////
AAAAAAD/AAAAAACAgID///////8AAAD/AAD/////////////////////////////////////
///////////////////AwMAAAAD/AAAAAAD///8AAAD/AAAAAAD/AAAAAAD/AAAAAAD/AAAA
AAD/////////AAAAAADAwMDAwMD////////AwMDAwMDAwMDAwMDAwMDAwMDAwMD/////////
///AwMAAAAAAAAD/AAD/////AAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD/AADAwMD///8AAAD/
AAD////////////////AwMDAwMDAwMD////AwMDAwMDAwMD////////////AwMAAAAD/AAAA
AAD///8AAAD/AAD/////AAAAAAD/AAD/////AAAAAACAgID/////AAAAAADAwMDAwMD/////
///AwMDAwMD////////////////AwMD////////////AwMAAAAAAAAD/AAAAAAD/AAAAAAD/
//8AAAD/AAAAAAD///8AAAD/AAAAAAD///8AAAD/AAD////////////////AwMDAwMDAwMD/
///////AwMDAwMD////////////AwMAAAAD/AAAAAAD/AAAAAAD/AAD/////AAAAAAD/AAD/
////AAAAAAD/AAAAAAD/AAAAAADAwMDAwMD////////AwMD////////////////////AwMD/
///////////AwMAAAAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD/
AAAAAAD/AAD////////////////AwMDAwMD///////+AgIAAAAAAAAAAAAAAAAAAAAAAAAAA
AAD/AAAAAAD/////////////////////////////////////////////////AAAAAAD/////
///////////AwMDAwMDAwMDAwMCAgID////////////AwMCAgIAAAAAAAAAAAAD/AAD/////
//////////////////////////////////////////8AAAD/AAD////////////////AwMDA
wMDAwMDAwMCAgID////////AwMCAgIAAAAAAAAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD/AAAA
AAD/AAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD///////////////////////////////+AgID/
///AwMCAgIAAAAAAAAAAAAAAAAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD/AAAAAAD/
AAAAAAD/AAAAAAD/AAD///////////////////////////////+AgIDAwMCAgIAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAgID/////////////////////////////
//////////////////////////////////+AgICAgIAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAACAgID/////////////////////////////////////////////
//////////////////+AgIAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAACAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICA
gICAgIAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAD//////gAAAP4AAAD+AAAA/gAAAP4AAAD+AAAA/gAAAP4AAAD+AAAA
/gAAAP4AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAABAAAAAwAAAAcAAAAP/gAAH/4AAD/+AAB//////0h9AQAAAAEAAQAgIAAAAQAYAKgM
AAABAAAAAAAAAAAAAAAAACiuAQDwrQEAAAAAAAAAAAAAAAAANa4BAACuAQAAAAAAAAAAAAAA
AABCrgEACK4BAAAAAAAAAAAAAAAAAE+uAQAQrgEAAAAAAAAAAAAAAAAAWq4BABiuAQAAAAAA
AAAAAAAAAABmrgEAIK4BAAAAAAAAAAAAAAAAAAAAAAAAAAAAcK4BAH6uAQCOrgEAAAAAAJyu
AQAAAAAAqq4BAAAAAAC8rgEAAAAAAMiuAQAAAAAAAwAAgAAAAABLRVJORUwzMi5ETEwAQURW
QVBJMzIuZGxsAGlwaGxwYXBpLmRsbABVU0VSMzIuZGxsAFdJTklORVQuZGxsAFdTMl8zMi5k
bGwAAExvYWRMaWJyYXJ5QQAAR2V0UHJvY0FkZHJlc3MAAEV4aXRQcm9jZXNzAAAAUmVnQ2xv
c2VLZXkAAABHZXROZXR3b3JrUGFyYW1zAAB3c3ByaW50ZkEAAABJbnRlcm5ldEdldENvbm5l
Y3RlZFN0YXRlAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAUEsBAhQACgAAAAAAVk13MF1usCIAVgAAAFYAAAoA
AAAAAAAAAAAgAAAAAAAAAG1lLnJ0Zi5zY3JQSwUGAAAAAAEAAQA4AAAAKFYAAAAA

--78654172--

From andejowosimi at hotmail.com  Tue Mar 23 08:10:31 2004
From: andejowosimi at hotmail.com (jay)
Date: Tue Mar 23 08:15:39 2004
Subject: [Bioperl-l] New Drug 8x more effective than Vi-ag-ra!
Message-ID: <1080047431-32318@excite.com>


          New product available:

- Does a better job than V i a g r a 
- Presents minimal side effects
- Costs 70% then average similar product on market

  One pill of C i a l i s will keep you going for an entire weekend.
  Women from all over the world wrote us feedback thanking us
  for making such a great product available. Some are so amazed 
  that they bought more for their boyfriend/husband.

http://She'll need a doctor after this reaming!.base4ds.com/s95/index.php?id=s95


Get off this list by writing to getmeoff731@excitemail.com
From dmalvkire at hotmail.com  Tue Mar 23 09:38:22 2004
From: dmalvkire at hotmail.com (kristopher)
Date: Tue Mar 23 09:43:42 2004
Subject: [Bioperl-l] PR0-ZAC: The world's most popular anti-depressant!
Message-ID: <1080052702-25253@excite.com>


PROZAC is one of the world's most widely prescribed antidepressants has been prescribed for more than 40 million people worldwide 100% satisfaction guaranteed or your money back!
 
http://base4ds.com/s95/index.php?id=s95


 chody haack  wisdom  dinosaur smudge webster pogue fireman  kasteves  wkliao plichvar  kusuma  eugene holveck  kristen ruskowsk 
Get off this list by writing to getmeoff731@excitemail.com
From Matthew.Betts at ii.uib.no  Tue Mar 23 11:13:55 2004
From: Matthew.Betts at ii.uib.no (Matthew Betts)
Date: Tue Mar 23 11:19:24 2004
Subject: [Bioperl-l] Bio::AlignIO::po
In-Reply-To: <Pine.LNX.4.50.0311131041250.22660-100000@tenero.duhs.duke.edu>
Message-ID: <Pine.LNX.4.44.0403231638450.15298-100000@silkefuru.ii.uib.no>


Hi,

I've finally written the Bio::AlignIO:po module I mentioned way back in 
November. I'm unsure of the procedure for submitting it (I've looked at 
the FAQ). Should I just post it as an attachment to bioperl-l?

Also, I'm trying to think of some sensible tests, and have been looking at 
AlignIO.t. The ones in there mainly seem to test that a sequence read in 
from an example alignment has the correct name. Should I test more than 
that? All suggestions very welcome.

Thanks in advance for your help,

Matthew


On Thu, 13 Nov 2003, Jason Stajich wrote:

> One does not exist - would be happy for you to write one.
> AlignIO::clustalw or fasta would be good templates.
> An AlignIO pir parser wouldn't be too bad either if you get into it.
> 
> -jason
> On Thu, 13 Nov 2003, Matthew Betts wrote:
> 
> 
> >
> > Hei,
> >
> > Does anyone have a PO format reader for AlignIO? This is the native output
> > format for POA. It can also output in pir or clustalw, so it's not a big
> > problem. But I'd quite like to try to try my hand at writing the parser if
> > one doesn't exist already...
> >
> > Thanks,
> >
> > Matthew
> >
> >
> >
> 
> --
> Jason Stajich
> Duke University
> jason at cgt.mc.duke.edu
> 

-- 
Matthew Betts, mailto:matthew.betts@ii.uib.no
Phone: (+47) 55 58 40 22, Fax: (+47) 55 58 42 95
CBU, BCCS, HiB, UNIFOB / Universitetet i Bergen
Thorm?hlensgt. 55, N-5008 Bergen, Norway


From jason at cgt.duhs.duke.edu  Tue Mar 23 11:19:25 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar 23 11:24:49 2004
Subject: [Bioperl-l] server suff: doc.bioperl.org site
Message-ID: <Pine.LNX.4.50.0403231115180.29864-100000@tenero.duhs.duke.edu>

The doc.bioperl.org site is down for maintence reasons - I had intended to
migrate the whole service over to the bioperl.org server now that we have
enough disk space on that machine but just don't have a enough time to do
everything...

For the meantime, there is a mirror of the stuff until I am ready to
migrate the doc.bioperl.org name over to this new dir -- this will not be
a nightly update of the bioperl-live docs as on the other server (I
haven't had time to migrate the software for this) but rather is a Feb 8
freeze of things.  All the Bioperl release html docs are there as of the
release versions so it should be plenty useful for folks.

http://bioperl.org/Pdoc-mirror/

-jason

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From jason at cgt.duhs.duke.edu  Tue Mar 23 11:29:13 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar 23 11:34:43 2004
Subject: [Bioperl-l] Bio::AlignIO::po
In-Reply-To: <Pine.LNX.4.44.0403231638450.15298-100000@silkefuru.ii.uib.no>
References: <Pine.LNX.4.44.0403231638450.15298-100000@silkefuru.ii.uib.no>
Message-ID: <Pine.LNX.4.50.0403231120470.29864-100000@tenero.duhs.duke.edu>

On Tue, 23 Mar 2004, Matthew Betts wrote:

>
> Hi,
>
> I've finally written the Bio::AlignIO:po module I mentioned way back in
> November. I'm unsure of the procedure for submitting it (I've looked at
> the FAQ). Should I just post it as an attachment to bioperl-l?
>
(hmm - guess we need someone to add some more stuff to the FAQ...)

posting to bioperl-l is fine, but the preferred way is of you to submit
it at bugzilla.open-bio.org as a feature request and attach the module as
an attachment/patch where bugzilla allows that (I think only after you
have created the bug can you do that).  This lets the core developers see
if anyone else has already dealt with committing the module.  Also - we
probably will just give you CVS access once you've shown you know your way
around the toolkit.

> Also, I'm trying to think of some sensible tests, and have been looking at
> AlignIO.t. The ones in there mainly seem to test that a sequence read in
> from an example alignment has the correct name. Should I test more than
> that? All suggestions very welcome.

more, better tests are better.  people in the past have been lazy about
making tests which really test things to the max.  We probably should
have lots more tests in AlignIO.t - just need someone willing to write
some more in.

As many tests as you think can reasonably show that things are getting
parsed correctly, your data structures are coming out properly etc.  This
is important so that if someone goes in and screws up something in your
code or something that it depends on - we'll know that your parser/writer
isn't working correctly.


-jason

>
> Thanks in advance for your help,
>
> Matthew
>
>
> On Thu, 13 Nov 2003, Jason Stajich wrote:
>
> > One does not exist - would be happy for you to write one.
> > AlignIO::clustalw or fasta would be good templates.
> > An AlignIO pir parser wouldn't be too bad either if you get into it.
> >
> > -jason
> > On Thu, 13 Nov 2003, Matthew Betts wrote:
> >
> >
> > >
> > > Hei,
> > >
> > > Does anyone have a PO format reader for AlignIO? This is the native output
> > > format for POA. It can also output in pir or clustalw, so it's not a big
> > > problem. But I'd quite like to try to try my hand at writing the parser if
> > > one doesn't exist already...
> > >
> > > Thanks,
> > >
> > > Matthew
> > >
> > >
> > >
> >
> > --
> > Jason Stajich
> > Duke University
> > jason at cgt.mc.duke.edu
> >
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From heikki at ebi.ac.uk  Tue Mar 23 11:42:46 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Tue Mar 23 11:48:18 2004
Subject: [Bioperl-l] Bio::AlignIO::po
In-Reply-To: <Pine.LNX.4.44.0403231638450.15298-100000@silkefuru.ii.uib.no>
References: <Pine.LNX.4.44.0403231638450.15298-100000@silkefuru.ii.uib.no>
Message-ID: <200403231642.46049.heikki@ebi.ac.uk>

Matthew,

The list does not accept attachments. 

The recommended way of submitting new code is to open a bioperl bugzilla entry 
and label it "enhancement" (in "Severity"): http://bugzilla.bioperl.org/. Put 
the code into an attachment and a mail to bioperl-l list to discuss it.


Thanks,

	-Heikki

On Tuesday 23 Mar 2004 16:13, Matthew Betts wrote:
> Hi,
>
> I've finally written the Bio::AlignIO:po module I mentioned way back in
> November. I'm unsure of the procedure for submitting it (I've looked at
> the FAQ). Should I just post it as an attachment to bioperl-l?
>
> Also, I'm trying to think of some sensible tests, and have been looking at
> AlignIO.t. The ones in there mainly seem to test that a sequence read in
> from an example alignment has the correct name. Should I test more than
> that? All suggestions very welcome.
>
> Thanks in advance for your help,
>
> Matthew
>
> On Thu, 13 Nov 2003, Jason Stajich wrote:
> > One does not exist - would be happy for you to write one.
> > AlignIO::clustalw or fasta would be good templates.
> > An AlignIO pir parser wouldn't be too bad either if you get into it.
> >
> > -jason
> >
> > On Thu, 13 Nov 2003, Matthew Betts wrote:
> > > Hei,
> > >
> > > Does anyone have a PO format reader for AlignIO? This is the native
> > > output format for POA. It can also output in pir or clustalw, so it's
> > > not a big problem. But I'd quite like to try to try my hand at writing
> > > the parser if one doesn't exist already...
> > >
> > > Thanks,
> > >
> > > Matthew
> >
> > --
> > Jason Stajich
> > Duke University
> > jason at cgt.mc.duke.edu

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From Matthew.Betts at ii.uib.no  Tue Mar 23 12:59:55 2004
From: Matthew.Betts at ii.uib.no (Matthew Betts)
Date: Tue Mar 23 13:05:21 2004
Subject: [Bioperl-l] Bio::AlignIO::po
In-Reply-To: <200403231642.46049.heikki@ebi.ac.uk>
Message-ID: <Pine.LNX.4.44.0403231857390.15298-100000@silkefuru.ii.uib.no>


Hi Jason / Heikki,

Thanks for the advice. I've submitted the code and tests via bugzilla (bug 
1606)

Hope this is useful,

Matthew


On Tue, 23 Mar 2004, Heikki Lehvaslaiho wrote:

> Matthew,
> 
> The list does not accept attachments. 
> 
> The recommended way of submitting new code is to open a bioperl bugzilla entry 
> and label it "enhancement" (in "Severity"): http://bugzilla.bioperl.org/. Put 
> the code into an attachment and a mail to bioperl-l list to discuss it.
> 
> 
> Thanks,
> 
> 	-Heikki
> 
> On Tuesday 23 Mar 2004 16:13, Matthew Betts wrote:
> > Hi,
> >
> > I've finally written the Bio::AlignIO:po module I mentioned way back in
> > November. I'm unsure of the procedure for submitting it (I've looked at
> > the FAQ). Should I just post it as an attachment to bioperl-l?
> >
> > Also, I'm trying to think of some sensible tests, and have been looking at
> > AlignIO.t. The ones in there mainly seem to test that a sequence read in
> > from an example alignment has the correct name. Should I test more than
> > that? All suggestions very welcome.
> >
> > Thanks in advance for your help,
> >
> > Matthew
> >
> > On Thu, 13 Nov 2003, Jason Stajich wrote:
> > > One does not exist - would be happy for you to write one.
> > > AlignIO::clustalw or fasta would be good templates.
> > > An AlignIO pir parser wouldn't be too bad either if you get into it.
> > >
> > > -jason
> > >
> > > On Thu, 13 Nov 2003, Matthew Betts wrote:
> > > > Hei,
> > > >
> > > > Does anyone have a PO format reader for AlignIO? This is the native
> > > > output format for POA. It can also output in pir or clustalw, so it's
> > > > not a big problem. But I'd quite like to try to try my hand at writing
> > > > the parser if one doesn't exist already...
> > > >
> > > > Thanks,
> > > >
> > > > Matthew
> > >
> > > --
> > > Jason Stajich
> > > Duke University
> > > jason at cgt.mc.duke.edu
> 
> 

-- 
Matthew Betts, mailto:matthew.betts@ii.uib.no
Phone: (+47) 55 58 40 22, Fax: (+47) 55 58 42 95
CBU, BCCS, HiB, UNIFOB / Universitetet i Bergen
Thorm?hlensgt. 55, N-5008 Bergen, Norway


From elliott at nhgri.nih.gov  Tue Mar 23 13:36:32 2004
From: elliott at nhgri.nih.gov (Elliott H Margulies)
Date: Tue Mar 23 13:41:54 2004
Subject: [Bioperl-l] coordinate issue with Bio::SearchIO::psl
Message-ID: <20040323183632.GC2485@nhgri.nih.gov>

Dear BioPerl Developers:

I have been evaluating the use of BioPerl for parsing BLASTZ lav
output. To do this, I convert the lav output to 'psl' format with Jim
Kent's lavToPsl program.

When I report some stats about the alignments from within BioPerl, I
notice that lengths and stop positions are 1 *greater* than they should
be.

I'm pretty sure this is a reflection of the fact that psl output is in
the infamous half-open zero-based coordinate system and not simply
zero-based closed intervals.

Please let me know how I can help facilitate this fix, if it is in fact
confirmed to be true.

Regards,
Elliott

-- 
Elliott H. Margulies, Ph.D.
Genome Technology Branch
National Human Genome Research Institute
National Institutes of Health
50 South Dr., MSC 8004
Bldg. 50, Rm. 5527
Bethesda, MD  20892-8004

phone: (301) 594-9210
fax: (301) 496-0474
From jueschm at uni-muenster.de  Tue Mar 23 16:19:26 2004
From: jueschm at uni-muenster.de (=?iso-8859-1?Q?J=FCrgen?= Schmitz)
Date: Tue Mar 23 16:25:28 2004
Subject: [Bioperl-l] Bio::DB::Query::GenBank retrieves fewer sequences than
 Webbrowser query
Message-ID: <a06010203bc8655cc1a06@[128.176.61.47]>


-- 

Hi All,

The following query  retrieves less than a hundred sequences, much 
less than I get via Entrez:

my $query = Bio::DB::Query::GenBank->new
         (-query   =>'Mus[Organism] AND exon NOT mRNA NOT cDNA',
          -db      => 'nucleotide');
$seqio = $gb->get_Stream_by_query($query);
while( my $seq =  $seqio->next_seq ) {...}

but entering "Mus[Organism] AND exon NOT mRNA NOT cDNA" at
http://www.ncbi.nlm.nih.gov:80/entrez/query.fcgi?CMD=search&DB=nucleotide
retrieves more than 5000 sequences.

What could be the problem ?
There are no error messages printed to STDERR.
I'm using Bioperl 1.4, Perl v5.8.1-RC3 on a MAC (OSX)
.


_________________________________

Dr. Juergen Schmitz
Institute of Experimental Pathology (ZMBE)
University of Muenster
Von-Esmarch-Str. 56
D-48149 Muenster, Germany

Tel         +49  251  8352133
  Fax       +49  251  8358512

_________________________________
From paulo.david at netvisao.pt  Tue Mar 23 17:06:19 2004
From: paulo.david at netvisao.pt (Paulo Almeida)
Date: Tue Mar 23 17:11:44 2004
Subject: [Bioperl-l] Bio::DB::Query::GenBank retrieves fewer sequences
	than Webbrowser query
In-Reply-To: <a06010203bc8655cc1a06@[128.176.61.47]>
References: <a06010203bc8655cc1a06@[128.176.61.47]>
Message-ID: <4060B4DB.60902@netvisao.pt>

I think you need to escape the brackets in Organism. Try replacing your 
query line with this:

(-query   =>'Mus\[Organism\] AND exon NOT mRNA NOT cDNA',   

Hope it works,
-Paulo Almeida

J?rgen Schmitz wrote:

The following query  retrieves less than a hundred sequences, much less 
than I get via Entrez:

my $query = Bio::DB::Query::GenBank->new
        (-query   =>'Mus[Organism] AND exon NOT mRNA NOT cDNA',
         -db      => 'nucleotide');
$seqio = $gb->get_Stream_by_query($query);
while( my $seq =  $seqio->next_seq ) {...}

but entering "Mus[Organism] AND exon NOT mRNA NOT cDNA" at
http://www.ncbi.nlm.nih.gov:80/entrez/query.fcgi?CMD=search&DB=nucleotide
retrieves more than 5000 sequences.

What could be the problem ?
There are no error messages printed to STDERR.
I'm using Bioperl 1.4, Perl v5.8.1-RC3 on a MAC (OSX)

From evan5mndy13 at hotmail.com  Tue Mar 23 18:03:07 2004
From: evan5mndy13 at hotmail.com (daryl)
Date: Tue Mar 23 18:08:12 2004
Subject: [Bioperl-l] Buy C_I_A_L_I_S cheaply!   mailer campbell
Message-ID: <1080082987-26737@excite.com>

The best drug for ercetile dsyfunction!

Known as the "Week-end Pill"!

http://abcdef.goandseeit.net/sv/index.php?pid=eph9106


graphic garnet republic law advil scuba1 biology tarzan mishka suzuki kramer magnum roy cannon medical memory pedro marvin arizona raptor norman rhonda garnet homebrew gofish vickycharlott express button abcd bullet booster gretchen light wolverin septembe xxxx raptor republic 
no more: http://tequila.herbalfactory.net/sv/chair.php
From davila at ioc.fiocruz.br  Tue Mar 23 13:07:03 2004
From: davila at ioc.fiocruz.br (Alberto Davila)
Date: Tue Mar 23 20:25:19 2004
Subject: [Bioperl-l] Fetching Fasta seqs from GenBank - Help request
Message-ID: <1080065187.2916.63.camel@tryps>

Hi,

May I ask for some help ?

I am trying to use the BioFetch module in order to download several seqs
(from specific Organisms) from GenBank in fasta format, but looks like I
am missing "IO/String.pm" and other things.. should I install additional
bioperl modules (I have the Bioperl Core 1.4 installed) ? or use a
different module for my purpose ?

My script and error msg are listed below.

Thanks and besr regards,

Alberto 

****

#!/usr/local/bin/perl -w

use lib "/usr/local/bioperl14";
package Bio::DB::BioFetch;
use strict;
use Bio::DB::WebDBSeqI;
use HTTP::Request::Common 'POST';

my $format_type='fasta';
my $stream;


my $bf = new Bio::DB::BioFetch(-format        =>$format_type',
                               -retrievaltype =>'tempfile',
                               -db            =>'EMBL');
                                  
 $stream = $bf->get_Stream_by_id(['BUM','J00231']);
 while (my $s = $stream->next_seq) {
    print $s->seq,"\n";
        }                         
                                  
          
          exit;


[davila@tryps script]$ perl gb-fetch-1.pl
Can't locate IO/String.pm in @INC (@INC contains:
/usr/local/bioperl14/i386-linux-thread-multi /usr/local/bioperl14
/usr/lib/perl5/5.8.3/i386-linux-thread-multi /usr/lib/perl5/5.8.3
/usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2
/usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0
/usr/lib/perl5/site_perl
/usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2
/usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0
/usr/lib/perl5/vendor_perl .) at
/usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
BEGIN failed--compilation aborted at
/usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
Compilation failed in require at gb-fetch-1.pl line 6.
BEGIN failed--compilation aborted at gb-fetch-1.pl line 6.


From jason at cgt.duhs.duke.edu  Tue Mar 23 20:44:48 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar 23 20:50:13 2004
Subject: [Bioperl-l] Fetching Fasta seqs from GenBank - Help request
In-Reply-To: <1080065187.2916.63.camel@tryps>
References: <1080065187.2916.63.camel@tryps>
Message-ID: <Pine.LNX.4.50.0403232043580.32766-100000@tenero.duhs.duke.edu>

You need an additional perl module.


install IO::String from CPAN

There is a section on how to install additional perl modules in the
INSTALL document.

-j

On Tue, 23 Mar 2004, Alberto Davila wrote:

> Hi,
>
> May I ask for some help ?
>
> I am trying to use the BioFetch module in order to download several seqs
> (from specific Organisms) from GenBank in fasta format, but looks like I
> am missing "IO/String.pm" and other things.. should I install additional
> bioperl modules (I have the Bioperl Core 1.4 installed) ? or use a
> different module for my purpose ?
>
> My script and error msg are listed below.
>
> Thanks and besr regards,
>
> Alberto
>
> ****
>
> #!/usr/local/bin/perl -w
>
> use lib "/usr/local/bioperl14";
> package Bio::DB::BioFetch;
> use strict;
> use Bio::DB::WebDBSeqI;
> use HTTP::Request::Common 'POST';
>
> my $format_type='fasta';
> my $stream;
>
>
> my $bf = new Bio::DB::BioFetch(-format        =>$format_type',
>                                -retrievaltype =>'tempfile',
>                                -db            =>'EMBL');
>
>  $stream = $bf->get_Stream_by_id(['BUM','J00231']);
>  while (my $s = $stream->next_seq) {
>     print $s->seq,"\n";
>         }
>
>
>           exit;
>
>
> [davila@tryps script]$ perl gb-fetch-1.pl
> Can't locate IO/String.pm in @INC (@INC contains:
> /usr/local/bioperl14/i386-linux-thread-multi /usr/local/bioperl14
> /usr/lib/perl5/5.8.3/i386-linux-thread-multi /usr/lib/perl5/5.8.3
> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi
> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi
> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2
> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0
> /usr/lib/perl5/site_perl
> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi
> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi
> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2
> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0
> /usr/lib/perl5/vendor_perl .) at
> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
> BEGIN failed--compilation aborted at
> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
> Compilation failed in require at gb-fetch-1.pl line 6.
> BEGIN failed--compilation aborted at gb-fetch-1.pl line 6.
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From jason at cgt.duhs.duke.edu  Tue Mar 23 21:41:30 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar 23 21:46:54 2004
Subject: [Bioperl-l] coordinate issue with Bio::SearchIO::psl
In-Reply-To: <20040323183632.GC2485@nhgri.nih.gov>
References: <20040323183632.GC2485@nhgri.nih.gov>
Message-ID: <Pine.LNX.4.50.0403231814270.32260-100000@tenero.duhs.duke.edu>

good catch Elliott -  I thought I had finally squashed it but apparently
not.  Will take a look when I get a chance - may be helpful if you can
submit a bug and an example psl file and script which give the wrong
answer in case someone else wants to take a crack at it.


On Tue, 23 Mar 2004, Elliott H Margulies wrote:

> Dear BioPerl Developers:
>
> I have been evaluating the use of BioPerl for parsing BLASTZ lav
> output. To do this, I convert the lav output to 'psl' format with Jim
> Kent's lavToPsl program.
>
> When I report some stats about the alignments from within BioPerl, I
> notice that lengths and stop positions are 1 *greater* than they should
> be.
>
> I'm pretty sure this is a reflection of the fact that psl output is in
> the infamous half-open zero-based coordinate system and not simply
> zero-based closed intervals.
>
> Please let me know how I can help facilitate this fix, if it is in fact
> confirmed to be true.
>
> Regards,
> Elliott
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From chauser at duke.edu  Wed Mar 24 10:41:25 2004
From: chauser at duke.edu (Charles Hauser)
Date: Wed Mar 24 10:46:46 2004
Subject: [Bioperl-l] extract_feature_seq
Message-ID: <1080142885.22259.13.camel@pandorina>

Hi,

I have cases where an mRNA (genbank) is defined by a join involving
accessions other than the current record (see below).  

The script extract_feature_seq.PLS 


    while( my $seq = $seqio->next_seq ) {    
	foreach my $f ( grep { $_->primary_tag =~ /$featuretype/i } 
			$seq->get_SeqFeatures ) {
	    my $s = $f->spliced_seq;
	    ..snip
	}
   }


I'm not clear how to call Bio::DB::GenBank to retrieve these accessions
to complete the join.

    my $gb = new Bio::DB::GenBank();
    my $seqio = $gb->get_Stream_by_acc(\@accn); 

Charles


     mRNA            join(<908..1024,1091..1150,1244..1447,1800..2066,
                     2288..2888,3143..3279,3432..3601,3728..3860,4004..4243,
                     4417..4573,4748..4923,5393..5538,5824..5989,6202..6375,
                     AJ242524.1:1..219,AJ242525.1:136..235,AJ242525.1:357..421,


From chauser at duke.edu  Wed Mar 24 10:49:56 2004
From: chauser at duke.edu (Charles Hauser)
Date: Wed Mar 24 10:55:17 2004
Subject: [Bioperl-l] extract_feature_seq
In-Reply-To: <1080142885.22259.13.camel@pandorina>
References: <1080142885.22259.13.camel@pandorina>
Message-ID: <1080143396.22259.15.camel@pandorina>

Got it:

my $gb = new Bio::DB::GenBank;

my $s = $f->spliced_seq($gb);


this works,

Charles

On Wed, 2004-03-24 at 10:41, Charles Hauser wrote:
> Hi,
> 
> I have cases where an mRNA (genbank) is defined by a join involving
> accessions other than the current record (see below).  
> 
> The script extract_feature_seq.PLS 
> 
> 
>     while( my $seq = $seqio->next_seq ) {    
> 	foreach my $f ( grep { $_->primary_tag =~ /$featuretype/i } 
> 			$seq->get_SeqFeatures ) {
> 	    my $s = $f->spliced_seq;
> 	    ..snip
> 	}
>    }
> 
> 
> I'm not clear how to call Bio::DB::GenBank to retrieve these accessions
> to complete the join.
> 
>     my $gb = new Bio::DB::GenBank();
>     my $seqio = $gb->get_Stream_by_acc(\@accn); 
> 
> Charles
> 
> 
> 
>      mRNA            join(<908..1024,1091..1150,1244..1447,1800..2066,
>                      2288..2888,3143..3279,3432..3601,3728..3860,4004..4243,
>                      4417..4573,4748..4923,5393..5538,5824..5989,6202..6375,
>                      AJ242524.1:1..219,AJ242525.1:136..235,AJ242525.1:357..421,
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From abrusan at mpil-ploen.mpg.de  Wed Mar 24 12:56:10 2004
From: abrusan at mpil-ploen.mpg.de (Gyoergy Abrusan)
Date: Wed Mar 24 13:00:15 2004
Subject: [Bioperl-l] Printing strings without return
Message-ID: <4061CBBA.7000208@mpil-ploen.mpg.de>

Dear all

(Beginners question nr. x...)
I would like to ask how do I make one string from a Fasta file.
In other words, from:

 >name
actgtg
tgctat
accggg
...

the following:

 >name
actgtgtgctataccggg...


I wrote a simple script:

open FILE, "input.txt";
open (OUT, ">$RESULTS.txt")or die "$0:ERROR: can't write report file\n";

while ($seq=<FILE>)
{
if ($seq =~ />/)
  	{	
	print OUT "\n$seq";
	}		
else
	{
	print OUT "$seq";   	
	}
}


close OUT;

The problem is that the lines in FASTA contain a RETURN at the end, so 
the script doesn't change anything. Is there an option in perl to print 
till the RETURN?

Thanking you in advance.
Kind regards,

Gyorgy Abrusan

From jason at cgt.duhs.duke.edu  Wed Mar 24 13:01:35 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Mar 24 13:07:05 2004
Subject: [Bioperl-l] Printing strings without return
In-Reply-To: <4061CBBA.7000208@mpil-ploen.mpg.de>
References: <4061CBBA.7000208@mpil-ploen.mpg.de>
Message-ID: <Pine.LNX.4.50.0403241301120.4748-100000@tenero.duhs.duke.edu>

On Wed, 24 Mar 2004, Gyoergy Abrusan wrote:

> Dear all
>
> (Beginners question nr. x...)
> I would like to ask how do I make one string from a Fasta file.
> In other words, from:
>
>  >name
> actgtg
> tgctat
> accggg
> ...
>
> the following:
>
>  >name
> actgtgtgctataccggg...
>
>
> I wrote a simple script:
>
> open FILE, "input.txt";
> open (OUT, ">$RESULTS.txt")or die "$0:ERROR: can't write report file\n";
>
> while ($seq=<FILE>)
> {
> if ($seq =~ />/)
>   	{
> 	print OUT "\n$seq";
> 	}
> else
> 	{
# add this line
	$seq =~ s/\s+//g;
> 	print OUT "$seq";
> 	}
> }
>
>
> close OUT;
>
> The problem is that the lines in FASTA contain a RETURN at the end, so
> the script doesn't change anything. Is there an option in perl to print
> till the RETURN?
>
> Thanking you in advance.
> Kind regards,
>
> Gyorgy Abrusan
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From paulo.david at netvisao.pt  Wed Mar 24 13:29:05 2004
From: paulo.david at netvisao.pt (Paulo Almeida)
Date: Wed Mar 24 13:31:03 2004
Subject: [Bioperl-l] Printing strings without return
In-Reply-To: <4061CBBA.7000208@mpil-ploen.mpg.de>
References: <4061CBBA.7000208@mpil-ploen.mpg.de>
Message-ID: <4061D371.2060106@netvisao.pt>

Hi Gyoergy,

You could use chomp(), or chop():

while ($seq=<FILE>)
{
chomp($seq);

...rest of your script...

chomp() gets rid of newlines at the end, while chop() gets rid of the 
final character, whatever it may be. You could also use a substitution:

while(<FILE>){
s/\s// unless />/;
print OUT;
}

I think that should work...


-Paulo

Gyoergy Abrusan wrote:

> Dear all
>
> (Beginners question nr. x...)
> I would like to ask how do I make one string from a Fasta file.
> In other words, from:
>
> >name
> actgtg
> tgctat
> accggg
> ...
>
> the following:
>
> >name
> actgtgtgctataccggg...
>
>
> I wrote a simple script:
>
> open FILE, "input.txt";
> open (OUT, ">$RESULTS.txt")or die "$0:ERROR: can't write report file\n";
>
> while ($seq=<FILE>)
> {
> if ($seq =~ />/)
> {
> print OUT "\n$seq";
> }
> else
> {
> print OUT "$seq";
> }
> }
>
>
> close OUT;
>
> The problem is that the lines in FASTA contain a RETURN at the end, so 
> the script doesn't change anything. Is there an option in perl to 
> print till the RETURN?
>
> Thanking you in advance.
> Kind regards,
>
> Gyorgy Abrusan 


From ak at ebi.ac.uk  Wed Mar 24 13:32:10 2004
From: ak at ebi.ac.uk (Andreas Kahari)
Date: Wed Mar 24 13:37:32 2004
Subject: [Bioperl-l] Printing strings without return
In-Reply-To: <4061CBBA.7000208@mpil-ploen.mpg.de>
References: <4061CBBA.7000208@mpil-ploen.mpg.de>
Message-ID: <20040324183210.GA4299@ebi.ac.uk>

On Wed, Mar 24, 2004 at 06:56:10PM +0100, Gyoergy Abrusan wrote:
[cut]
> I wrote a simple script:
> 
> open FILE, "input.txt";
> open (OUT, ">$RESULTS.txt")or die "$0:ERROR: can't write report file\n";
> 
> while ($seq=<FILE>)
> {
> if ($seq =~ />/)
>  	{	
> 	print OUT "\n$seq";
> 	}		
> else

chomp($seq);

> 	{
> 	print OUT "$seq";   	
> 	}
> }
> 
> 
> close OUT;
> 
> The problem is that the lines in FASTA contain a RETURN at the end, so 
> the script doesn't change anything. Is there an option in perl to print 
> till the RETURN?
> 


-- 
|[==]| Andreas K?h?ri      EMBL, European Bioinformatics Institute
|=][=|                     Wellcome Trust Genome Campus
|[==]| DAS Project Leader  Hinxton, Cambridgeshire, CB10 1SD
|=][=| Ensembl Developer   United Kingdom
From antivirus at unito.it  Wed Mar 24 14:09:39 2004
From: antivirus at unito.it (antivirus@unito.it)
Date: Wed Mar 24 14:15:58 2004
Subject: [Bioperl-l] - DO NOT REPLY - Inflex scan report [0324200924879]
Message-ID: <200403241909.i2OJ9fvJ024946@albert.unito.it>

Administrator Email Reply Address: rete@unito.it
Email sent to:  afollenzi@ircc.it
Inflex ID: 0324200924879
Report Details -----------------------------------------------
AntiVirus Results...
Scanning /space/inflex/tmp/inf_0324200924879/unpacked/*
Scanning file /space/inflex/tmp/inf_0324200924879/unpacked/_headers_
Scanning file /space/inflex/tmp/inf_0324200924879/unpacked/textfile0
Scanning file /space/inflex/tmp/inf_0324200924879/unpacked/textfile1
Scanning file /space/inflex/tmp/inf_0324200924879/unpacked/letter.zip
/space/inflex/tmp/inf_0324200924879/unpacked/letter.zip
        Found the W32/Netsky.p@MM!zip virus !!!
Scanning file /space/inflex/tmp/inf_0324200924879/unpacked/textfile2

Summary report on /space/inflex/tmp/inf_0324200924879/unpacked/*
File(s)
        Total files: ...........       5
        Clean: .................       4
        Possibly Infected: .....       1

File NAME/TYPE Scan Results


0324200924879  from:bioperl-l@bioperl.org  to: afollenzi@ircc.itText scanning off.

END OF MESSAGE.

End.
.
From cdwan at mail.ahc.umn.edu  Wed Mar 24 14:25:06 2004
From: cdwan at mail.ahc.umn.edu (Chris Dwan (CCGB))
Date: Wed Mar 24 14:30:31 2004
Subject: [Bioperl-l] Printing strings without return
In-Reply-To: <4061CBBA.7000208@mpil-ploen.mpg.de>
References: <4061CBBA.7000208@mpil-ploen.mpg.de>
Message-ID: <Pine.GSO.4.58.0403241312290.21705@lenti.med.umn.edu>


> I would like to ask how do I make one string from a Fasta file.
> In other words, from:
>
>  >name
> actgtg
> tgctat
> accggg

I think the most "BioPerl" way to do what you describe is:

---------------
  use Bio::SeqIO;

  my $filename = shift;
  my $in = Bio::SeqIO->new('-file'   => "<$filename",
                           '-format' => 'fasta');
  while (my $seq = $in->next_seq()) {
    print join(" ", ( ">", $seq->id(), $seq->desc(), $seq->seq(), "\n"));
  }
---------------

If you want the result to be a fasta file, but with a different width for
the sequence lines, you can get up to 32,766 characters per line as
fasta.pm is currently written:

---------------
  use Bio::SeqIO;

  my $filename = shift;
  my $in = Bio::SeqIO->new('-file'   => "<$filename",
                           '-format' => 'fasta');
  my $out = Bio::SeqIO->new('-fh'    => \*STDOUT,
                            '-format' => 'fasta',
                            '-width'  => 32766);
  while (my $seq = $in->next_seq()) {
    $out->write_seq($seq);
  }
---------------

A perl purist might use the following (from the command line) to create an
output free of "\n" characters.

  perl -e "while (<STDIN>) { s/\n//g; print; }" < test.fsa

Note that in the case of your example, this gives:

  > nameactgtgtgctataccggg

-Chris Dwan

From barry.moore at genetics.utah.edu  Wed Mar 24 15:01:23 2004
From: barry.moore at genetics.utah.edu (Barry Moore)
Date: Wed Mar 24 15:06:49 2004
Subject: [Bioperl-l] Working with Alignments
Message-ID: <4061E913.4010401@genetics.utah.edu>

I'm sure that I was recently reading documentation about a method that 
would return an array of all the positions where a pairwise sequence 
alignment differed, but I can't find it now.  Does that ring a bell to 
anyone?

Barry

-- 
Barry Moore
Dept. of Human Genetics
University of Utah
Salt Lake City, UT


From kgb11 at psu.edu  Wed Mar 24 16:02:33 2004
From: kgb11 at psu.edu (Kevin Beckmann)
Date: Wed Mar 24 16:07:54 2004
Subject: [Bioperl-l] BLAST problem: limiting # of HSPs
Message-ID: <200403242102.QAA22231@webmail4.cac.psu.edu>

We've been using the Blast module of Bioperl to run tBLASTx locally against our
own data set.  We're looking for ways to reduce the output size produced by
BLAST and have set the alignment view to tabular(-m 8).  The problem that we've
come across is that a query will have multiple hits to the same sequence but
for different HSPs.  We need BLAST to retain only one result instead of filling
the BLAST report with multiple E values for HSPs from the same gene.  

In the default blast output, there are summary statistics for the overall hit,
is there an option for the tab-deliminated BLAST output that would give us this
overall hit statistic instead of one for each HSP?

If not, what is the option to limit the number of HSPs?

******************
 ________________
/ Kevin Beckmann \
\ kgb11@psu.edu  /
 \_____  _______/
       |/
 |\/|
(.(.)
 /,\\
    \\/****\
     \ |__\|\/
     ||| |||
    ///\///\
******************
From jason at cgt.duhs.duke.edu  Wed Mar 24 16:33:13 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Wed Mar 24 16:38:36 2004
Subject: [Bioperl-l] Working with Alignments
In-Reply-To: <4061E913.4010401@genetics.utah.edu>
References: <4061E913.4010401@genetics.utah.edu>
Message-ID: <Pine.LNX.4.50.0403241633060.4748-100000@tenero.duhs.duke.edu>

$hsp->seq_inds()

-j

On Wed, 24 Mar 2004, Barry Moore wrote:

> I'm sure that I was recently reading documentation about a method that
> would return an array of all the positions where a pairwise sequence
> alignment differed, but I can't find it now.  Does that ring a bell to
> anyone?
>
> Barry
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From antivirus at unito.it  Wed Mar 24 07:28:47 2004
From: antivirus at unito.it (antivirus@unito.it)
Date: Wed Mar 24 16:54:28 2004
Subject: [Bioperl-l] - DO NOT REPLY - Inflex scan report [0324132817468]
Message-ID: <200403241228.i2OCSobl018256@albert.unito.it>

Administrator Email Reply Address: rete@unito.it
Email sent to:  cscielzo@unito.it
Inflex ID: 0324132817468
Report Details -----------------------------------------------
AntiVirus Results...

File NAME/TYPE Scan Results
 MS-DOS executable (EXE), OS/2 or MS Windows


0324132817468  from:bioperl-l@bioperl.org  to: cscielzo@unito.it
END OF MESSAGE.

End.
.
From tdhoufek at unity.ncsu.edu  Wed Mar 24 16:54:19 2004
From: tdhoufek at unity.ncsu.edu (T.D. Houfek)
Date: Wed Mar 24 17:01:52 2004
Subject: : [Bioperl-l] Bio::DB::Query::GenBank retrieves fewer sequences
	than Webbrowser query
Message-ID: <1080165259.6915.34.camel@aether>

Hmm...

Just for kicks I tried to duplicate the problem (I get the same number
of sequences from NCBI's web sequin tool as Jrgen, but using the
Bio::DB::Query:Genbank method I get 644 sequences (not less than 100,
but not the 5000+ we are expecting).  Placing an escape backslash before
the brackets does not seem to help me:

--- my test script below ---

#!/usr/bin/perl
use strict;
use Bio::DB::GenBank;
use Bio::DB::Query::GenBank;
                                                                                
my $gb=new
Bio::DB::GenBank;                                                                                
my $query = Bio::DB::Query::GenBank->new
        (-query => 'Mus\[Organism] AND exon NOT mRNA NOT cDNA',
         -db => 'Nucleotide');
my $seqio = $gb->get_Stream_by_query($query);
my $numresults=0;
while( my $seq = $seqio->next_seq ) { $numresults++; }
print "Num results: $numresults\n";


-- 

:.-----.----------.----------.-----.:
 T.D. Houfek
 tdhoufek-AT-unity-DOT-ncsu-DOT-edu
 Tobacco Genome Initiative
 NCSU, Raleigh, NC 27606
:.-----.----------.----------.-----.:


From cdwan at mail.ahc.umn.edu  Wed Mar 24 17:37:09 2004
From: cdwan at mail.ahc.umn.edu (Chris Dwan (CCGB))
Date: Wed Mar 24 17:42:37 2004
Subject: : [Bioperl-l] Bio::DB::Query::GenBank retrieves fewer sequences
	than Webbrowser query
In-Reply-To: <1080165259.6915.34.camel@aether>
References: <1080165259.6915.34.camel@aether>
Message-ID: <Pine.GSO.4.58.0403241632440.17176@lenti.med.umn.edu>


I'll admit that I was totally unable to get the results I wanted with the
-query => 'whatever' option.  I attribute that to my lack of clue WRT
ENTREZ.  Since we're fortunate enough to have an in-house relational
genbank, I use SQL to get the list of accessions I want, and then:

my $gb        = new Bio::DB::GenBank;
my $seqstream = $gb->get_Stream_by_acc(\@accessions);
my $seqIO     = Bio::SeqIO->new(-fh => $out, -format => 'Fasta');
while (my $seq = $seqstream->next_seq) {
  $seq->display_id($seq->display_id() . "." . $seq->version);
  $seqIO->write_seq($seq);
}

Which has given remarkably consistent results.

Not that this addresses the observed problem, but I suspect that it's on
the query end, not the sequence retrieval end.

-Chris Dwan

> Just for kicks I tried to duplicate the problem (I get the same number
> of sequences from NCBI's web sequin tool as Jrgen, but using the
> Bio::DB::Query:Genbank method I get 644 sequences (not less than 100,
> but not the 5000+ we are expecting).  Placing an escape backslash before
> the brackets does not seem to help me:
>
> --- my test script below ---
>
> #!/usr/bin/perl
> use strict;
> use Bio::DB::GenBank;
> use Bio::DB::Query::GenBank;
>
> my $gb=new
> Bio::DB::GenBank;
> my $query = Bio::DB::Query::GenBank->new
>         (-query => 'Mus\[Organism] AND exon NOT mRNA NOT cDNA',
>          -db => 'Nucleotide');
> my $seqio = $gb->get_Stream_by_query($query);
> my $numresults=0;
> while( my $seq = $seqio->next_seq ) { $numresults++; }
> print "Num results: $numresults\n";
>
>
>
>
>
>
> --
>
> :.-----.----------.----------.-----.:
>  T.D. Houfek
>  tdhoufek-AT-unity-DOT-ncsu-DOT-edu
>  Tobacco Genome Initiative
>  NCSU, Raleigh, NC 27606
> :.-----.----------.----------.-----.:
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
From birney at ebi.ac.uk  Wed Mar 24 18:21:46 2004
From: birney at ebi.ac.uk (Ewan Birney)
Date: Wed Mar 24 18:27:36 2004
Subject: [Bioperl-l] Pedro's haplotype block/tag system in
Message-ID: <Pine.OSX.4.44.0403242317590.3111-100000@ewan-birneys-computer.local>


I've just committed Pedro's haplotype block tagger and tests. I still need
to review the code (eg, it has Data::Dumper and Storeable, which I will
remove - Pedro - do you use Data::Dumper or Storeable actually in the
methods?)


Anyway, they are in a tests pass (at least over here!)


From Mohammad.Shahid at Genetik.Uni-Bielefeld.DE  Thu Mar 25 03:29:34 2004
From: Mohammad.Shahid at Genetik.Uni-Bielefeld.DE (M.Shahid)
Date: Wed Mar 24 18:32:43 2004
Subject: [Bioperl-l] Perl Regular Expressions sought that match typical
Message-ID: <000901c41243$4eaee4d0$66125ec2@meemsheen>

Dear All,

For a small software project, I'm looking for Perl Regular 
Expressions that match file formats and related input used
for bioinformatics applications, e.g. FASTA, Clustal, 
PROSITE patterns, PAM similarity matrices, etc. Note that
I do not want to parse such data, just check whether the format
is correct. Is there any compilation out there ? If not, 
what is the best starting point to collect such Regular Expressions ?
Best Wishes,
M. Shahid

From paulo.david at netvisao.pt  Thu Mar 25 06:21:45 2004
From: paulo.david at netvisao.pt (Paulo Almeida)
Date: Thu Mar 25 06:23:39 2004
Subject: : [Bioperl-l] Bio::DB::Query::GenBank retrieves fewer sequences
	than Webbrowser query
Message-ID: <4062C0C9.1030204@netvisao.pt>


Yes, escaping characters doesn't seem to have anything to do with this
problem. I suggested it because it worked for me in a different
situation. However, if you use the count method, instead of cycling
through all the results, to get the number of returned sequences, you
get 5066, as it should be:

#!/usr/bin/perl
use strict;
use Bio::DB::GenBank;
use Bio::DB::Query::GenBank;


my $gb=new
Bio::DB::GenBank;


my $query = Bio::DB::Query::GenBank->new
(-query => 'Mus[Organism] exon NOT mRNA NOT cDNA',
-db => 'Nucleotide');

my $seqio = $gb->get_Stream_by_query($query);
print "Num results:" , $query->count , "\n";

I'm looking into it further, but I don't know what the problem could be.
-Paulo Almeida

T.D. Houfek wrote:

 >Hmm...
 >
 >Just for kicks I tried to duplicate the problem (I get the same number
 >of sequences from NCBI's web sequin tool as Jrgen, but using the
 >Bio::DB::Query:Genbank method I get 644 sequences (not less than 100,
 >but not the 5000+ we are expecting). Placing an escape backslash before
 >the brackets does not seem to help me:
 >
 >--- my test script below ---
 >
 >#!/usr/bin/perl
 >use strict;
 >use Bio::DB::GenBank;
 >use Bio::DB::Query::GenBank;
 >
 >my $gb=new
 >Bio::DB::GenBank;
 >my $query = Bio::DB::Query::GenBank->new
 > (-query => 'Mus\[Organism] AND exon NOT mRNA NOT cDNA',
 > -db => 'Nucleotide');
 >my $seqio = $gb->get_Stream_by_query($query);
 >my $numresults=0;
 >while( my $seq = $seqio->next_seq ) { $numresults++; }
 >print "Num results: $numresults\n";
 >


From karesncatalia at hotmail.com  Thu Mar 25 07:15:54 2004
From: karesncatalia at hotmail.com (elijah)
Date: Thu Mar 25 07:21:12 2004
Subject: [Bioperl-l] As Seen on CNN, the Winningest CAS1N0! groovy stingray
Message-ID: <1080216954-8998@excite.com>

Come and play at the world's PREMIERE ONLINE CASINOn!
We are happy to offer you, in an elegant atmosphere, a 50% BONUSz for your first deposit as a New Player.

Sign=up now! Don't wait!

http://www.virtualcasinoes.net/_e4faa55afa1972493c43ac8a3f66f869/


We'll automatically credeit your account with a 50% BONrUS of extra chcips, to wagcr in our caseno.
Don't hesitate and try your luck!

CLICK HERE!

http://www.virtualcasinoes.net/_e4faa55afa1972493c43ac8a3f66f869/


cardinal chevystrawber dude chiquita shawn trek yvonne 
shawn skidoorambo1 petunia safety
shit cosmos fiona 
REoMOVE un_subvan@hotmail.com
From jason at cgt.duhs.duke.edu  Thu Mar 25 08:03:33 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Thu Mar 25 08:09:01 2004
Subject: [Bioperl-l] Re: asking for a help
In-Reply-To: <1080180541.40623f3d3e041@webmail.uoguelph.ca>
References: <1080180541.40623f3d3e041@webmail.uoguelph.ca>
Message-ID: <Pine.LNX.4.50.0403250800180.12198-100000@tenero.duhs.duke.edu>

Joshua -

You can translate sequence within bioperl directly (if they are spliced
and you know their frame) with the method $seq->translate().  If they are
annotations in genbank/embl format you can use the $feature->splice_seq to
get the spliced exons out of the sequence, but you will probably have to
read more about the Sequences and Features before this will make sense.

You should read the bioperl Feature and Sequence HOWTO first linked off
the HOWTO page on the bioperl website.

Bio::Factory and Bio::Tools::Run modules are in bioperl-run package which
is available from the bioperl distribution directory
(http://bioperl.org/DIST)

-jason
On Wed, 24 Mar 2004 szhan@uoguelph.ca wrote:

> Hello, Jason:
> I have got a list of genes (nucleotides sequences), I want to translate them
> using a program (transeq) in EMBOSS and compare their function similarity
> using blastp of bl2seq in BLAST. I read FAQ in bioperl web site and it said
> that I can find a module: Bio::Factory::EMBOSS to do that, but I could not
> find it. Could you please give me a guide to do this job?
> Your help will be highly appreciated!
> Joshua
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From antivirus at unito.it  Thu Mar 25 08:22:56 2004
From: antivirus at unito.it (antivirus@unito.it)
Date: Thu Mar 25 08:28:17 2004
Subject: [Bioperl-l] - DO NOT REPLY - Inflex scan report [0325142225336]
Message-ID: <200403251322.i2PDMu6Q025417@albert.unito.it>

Administrator Email Reply Address: rete@unito.it
Email sent to:  mando@unito.it
Inflex ID: 0325142225336
Report Details -----------------------------------------------
AntiVirus Results...

File NAME/TYPE Scan Results
 MS-DOS executable (EXE), OS/2 or MS Windows


0325142225336  from:bioperl-l@bioperl.org  to: mando@unito.it
END OF MESSAGE.

End.
.
From Pedro.X.Gomez-Fabre at gsk.com  Thu Mar 25 04:10:42 2004
From: Pedro.X.Gomez-Fabre at gsk.com (Pedro.X.Gomez-Fabre@gsk.com)
Date: Thu Mar 25 08:30:54 2004
Subject: [Bioperl-l] Re: Pedro's haplotype block/tag system in
In-Reply-To: <Pine.OSX.4.44.0403242317590.3111-100000@ewan-birneys-computer.local>
Message-ID: <OF76E0913E.447598A2-ON80256E62.00318C34-80256E62.00326DA2@sb.com>

"Ewan Birney" <birney@ebi.ac.uk> wrote on 24/03/2004 23:21:46:

> 
> 
> I've just committed Pedro's haplotype block tagger and tests. I still 
need
> to review the code (eg, it has Data::Dumper and Storeable, which I will
> remove - Pedro - do you use Data::Dumper or Storeable actually in the
> methods?)
> 

I have removed Data::Dumper from the modules. I used it for testing.

I do use Storable to copy (clone) some structures (i.e. list of lists) in 
HtSNP.pm on:


use Storable qw(dclone);
    $self -> {'w_pop_freq'}     = dclone ( $self ->pattern_freq() );
    $self->{snp_type}->{useful_snp} = dclone ( $self ->snp_ids() );
    $self -> {'split_hap'} = dclone ($out);
    @$arr= @{dclone(\@outValues)};
    @$hap = @{dclone $out};

If there any other module I should be using instead I'll be glad to know.

Thanks Ewan and Jason for review.
Pedro

> 
> Anyway, they are in a tests pass (at least over here!)
> 
> 
> 
> 
From davila at ioc.fiocruz.br  Thu Mar 25 07:40:07 2004
From: davila at ioc.fiocruz.br (Alberto Davila)
Date: Thu Mar 25 08:30:57 2004
Subject: [Bioperl-l] Fetching Fasta seqs from GenBank - Help request
In-Reply-To: <Pine.LNX.4.50.0403232043580.32766-100000@tenero.duhs.duke.edu>
References: <1080065187.2916.63.camel@tryps>
	<Pine.LNX.4.50.0403232043580.32766-100000@tenero.duhs.duke.edu>
Message-ID: <1080218407.3118.8.camel@tryps>

Thanks Jason,

I installed the IO::String, then it is working fine now. However I have
a doubt, the "get_Stream_by_id" is returning me more than the "sequence
per se", what is it ? My script and results are listed below. Finally I
would like to save (in my local disk) the retrieved sequences as fasta
files... is there any argument for that ?

Thanks again, Alberto


#!/usr/local/bin/perl -w

use lib "/usr/local/bioperl14";
use Bio::DB::BioFetch;
use strict;
use Bio::DB::WebDBSeqI;
use HTTP::Request::Common 'POST';

my $format_type='fasta';
my $stream;


my $bf = new Bio::DB::BioFetch(-format        =>$format_type,
                               -retrievaltype =>'tempfile',
			       -db            =>'EMBL');
				  
 $stream = $bf->get_Stream_by_id(['BUM','J00231']);
 while (my $s = $stream->next_seq) {
    print $s->seq,"\n\n\n";
	}			  
				  
 	  
	  exit;


[davila@tryps script]$ perl gb-fetch-1.pl
agtagtgtactaccaagtatagataacgtttaaatattaaagttttggatcaaagccaaagatgattcgcat
gctggtgctgattgtagttacagctgcaagcccagtgtatcagagatgtttccaagatggggctatagtgaa
gcaaaacccatccaaagaggcagtcacagaagtgtccctaaaagatgatgttagca
                                                                                
                                                                                
cctggacctcctgtgcaagaacatgaaacanctgtggttcttccttctcctggtggcagctcccagatgggt
cctgtcccaggtgcacctgcaggagtcgggcccaggactggggaagcctccagagctcaaaaccccacttgg
tgacacaactcacacatgcccacggtgcccagagcccaaatcttgtgacacacctcccccgtgcccacggtg
cccagagcccaaatcttgtgacacacctcccccatgcccacggtgcccagagcccaaatcttgtgacacacc
tcccccgtgcccnnngtgcccagcacctgaactcttgggaggaccgtcagtcttcctcttccccccaaaacc
caaggatacccttatgatttcccggacccctgaggtcacgtgcgtggtggtggacgtgagccacgaagaccc
nnnngtccagttcaagtggtacgtggacggcgtggaggtgcataatgccaagacaaagctgcgggaggagca
gtacaacagcacgttccgtgtggtcagcgtcctcaccgtcctgcaccaggactggctgaacggcaaggagta
caagtgcaaggtctccaacaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggacagcc
cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnngaggagatgaccaagaaccaagtcagcctgacctg
cctggtcaaaggcttctaccccagcgacatcgccgtggagtgggagagcaatgggcagccggagaacaacta
caacaccacgcctcccatgctggactccgacggctccttcttcctctacagcaagctcaccgtggacaagag
caggtggcagcaggggaacatcttctcatgctccgtgatgcatgaggctctgcacaaccgctacacgcagaa
gagcctctccctgtctccgggtaaatgagtgccatggccggcaagcccccgctccccgggctctcggggtcg
cgcgaggatgcttggcacgtaccccgtgtacatacttcccaggcacccagcatggaaataaagcacccagcg
ctgccctgg


On Tue, 2004-03-23 at 22:44, Jason Stajich wrote:
> You need an additional perl module.
> 
> 
> install IO::String from CPAN
> 
> There is a section on how to install additional perl modules in the
> INSTALL document.
> 
> -j
> 
> On Tue, 23 Mar 2004, Alberto Davila wrote:
> 
> > Hi,
> >
> > May I ask for some help ?
> >
> > I am trying to use the BioFetch module in order to download several seqs
> > (from specific Organisms) from GenBank in fasta format, but looks like I
> > am missing "IO/String.pm" and other things.. should I install additional
> > bioperl modules (I have the Bioperl Core 1.4 installed) ? or use a
> > different module for my purpose ?
> >
> > My script and error msg are listed below.
> >
> > Thanks and besr regards,
> >
> > Alberto
> >
> > ****
> >
> > #!/usr/local/bin/perl -w
> >
> > use lib "/usr/local/bioperl14";
> > package Bio::DB::BioFetch;
> > use strict;
> > use Bio::DB::WebDBSeqI;
> > use HTTP::Request::Common 'POST';
> >
> > my $format_type='fasta';
> > my $stream;
> >
> >
> > my $bf = new Bio::DB::BioFetch(-format        =>$format_type',
> >                                -retrievaltype =>'tempfile',
> >                                -db            =>'EMBL');
> >
> >  $stream = $bf->get_Stream_by_id(['BUM','J00231']);
> >  while (my $s = $stream->next_seq) {
> >     print $s->seq,"\n";
> >         }
> >
> >
> >           exit;
> >
> >
> > [davila@tryps script]$ perl gb-fetch-1.pl
> > Can't locate IO/String.pm in @INC (@INC contains:
> > /usr/local/bioperl14/i386-linux-thread-multi /usr/local/bioperl14
> > /usr/lib/perl5/5.8.3/i386-linux-thread-multi /usr/lib/perl5/5.8.3
> > /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
> > /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi
> > /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi
> > /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
> > /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2
> > /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0
> > /usr/lib/perl5/site_perl
> > /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
> > /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi
> > /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi
> > /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
> > /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2
> > /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0
> > /usr/lib/perl5/vendor_perl .) at
> > /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
> > BEGIN failed--compilation aborted at
> > /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
> > Compilation failed in require at gb-fetch-1.pl line 6.
> > BEGIN failed--compilation aborted at gb-fetch-1.pl line 6.


From sanges at biogem.it  Thu Mar 25 08:46:24 2004
From: sanges at biogem.it (Remo Sanges)
Date: Thu Mar 25 08:51:54 2004
Subject: [Bioperl-l] Fetching Fasta seqs from GenBank - Help request
In-Reply-To: <1080218407.3118.8.camel@tryps>
References: <1080065187.2916.63.camel@tryps>
	<Pine.LNX.4.50.0403232043580.32766-100000@tenero.duhs.duke.edu>
	<1080218407.3118.8.camel@tryps>
Message-ID: <CE925655-7E62-11D8-A186-000A95B95F88@biogem.it>

Hi Alberto,

you are getting 2 sequences because
you are querying for 2 id: BUM and J00231.
With the get stream by id method you can ask for
a large number of id, you need only write id separated by ','
in the array ref ['id1','id2','id3','id4',...].

Remo


On Mar 25, 2004, at 1:40 PM, Alberto Davila wrote:

> Thanks Jason,
>
> I installed the IO::String, then it is working fine now. However I have
> a doubt, the "get_Stream_by_id" is returning me more than the "sequence
> per se", what is it ? My script and results are listed below. Finally I
> would like to save (in my local disk) the retrieved sequences as fasta
> files... is there any argument for that ?
>
> Thanks again, Alberto
>
>
> #!/usr/local/bin/perl -w
>
> use lib "/usr/local/bioperl14";
> use Bio::DB::BioFetch;
> use strict;
> use Bio::DB::WebDBSeqI;
> use HTTP::Request::Common 'POST';
>
> my $format_type='fasta';
> my $stream;
>
>
> my $bf = new Bio::DB::BioFetch(-format        =>$format_type,
>                                -retrievaltype =>'tempfile',
> 			       -db            =>'EMBL');
> 				
>  $stream = $bf->get_Stream_by_id(['BUM','J00231']);
>  while (my $s = $stream->next_seq) {
>     print $s->seq,"\n\n\n";
> 	}			
> 				
>  	
> 	  exit;
>
>
>
>
> [davila@tryps script]$ perl gb-fetch-1.pl
> agtagtgtactaccaagtatagataacgtttaaatattaaagttttggatcaaagccaaagatgattcgca 
> t
> gctggtgctgattgtagttacagctgcaagcccagtgtatcagagatgtttccaagatggggctatagtga 
> a
> gcaaaacccatccaaagaggcagtcacagaagtgtccctaaaagatgatgttagca
>
>
> cctggacctcctgtgcaagaacatgaaacanctgtggttcttccttctcctggtggcagctcccagatggg 
> t
> cctgtcccaggtgcacctgcaggagtcgggcccaggactggggaagcctccagagctcaaaaccccacttg 
> g
> tgacacaactcacacatgcccacggtgcccagagcccaaatcttgtgacacacctcccccgtgcccacggt 
> g
> cccagagcccaaatcttgtgacacacctcccccatgcccacggtgcccagagcccaaatcttgtgacacac 
> c
> tcccccgtgcccnnngtgcccagcacctgaactcttgggaggaccgtcagtcttcctcttccccccaaaac 
> c
> caaggatacccttatgatttcccggacccctgaggtcacgtgcgtggtggtggacgtgagccacgaagacc 
> c
> nnnngtccagttcaagtggtacgtggacggcgtggaggtgcataatgccaagacaaagctgcgggaggagc 
> a
> gtacaacagcacgttccgtgtggtcagcgtcctcaccgtcctgcaccaggactggctgaacggcaaggagt 
> a
> caagtgcaaggtctccaacaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggacagc 
> c
> cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnngaggagatgaccaagaaccaagtcagcctgacct 
> g
> cctggtcaaaggcttctaccccagcgacatcgccgtggagtgggagagcaatgggcagccggagaacaact 
> a
> caacaccacgcctcccatgctggactccgacggctccttcttcctctacagcaagctcaccgtggacaaga 
> g
> caggtggcagcaggggaacatcttctcatgctccgtgatgcatgaggctctgcacaaccgctacacgcaga 
> a
> gagcctctccctgtctccgggtaaatgagtgccatggccggcaagcccccgctccccgggctctcggggtc 
> g
> cgcgaggatgcttggcacgtaccccgtgtacatacttcccaggcacccagcatggaaataaagcacccagc 
> g
> ctgccctgg
>
>
>
>
> On Tue, 2004-03-23 at 22:44, Jason Stajich wrote:
>> You need an additional perl module.
>>
>>
>> install IO::String from CPAN
>>
>> There is a section on how to install additional perl modules in the
>> INSTALL document.
>>
>> -j
>>
>> On Tue, 23 Mar 2004, Alberto Davila wrote:
>>
>>> Hi,
>>>
>>> May I ask for some help ?
>>>
>>> I am trying to use the BioFetch module in order to download several  
>>> seqs
>>> (from specific Organisms) from GenBank in fasta format, but looks  
>>> like I
>>> am missing "IO/String.pm" and other things.. should I install  
>>> additional
>>> bioperl modules (I have the Bioperl Core 1.4 installed) ? or use a
>>> different module for my purpose ?
>>>
>>> My script and error msg are listed below.
>>>
>>> Thanks and besr regards,
>>>
>>> Alberto
>>>
>>> ****
>>>
>>> #!/usr/local/bin/perl -w
>>>
>>> use lib "/usr/local/bioperl14";
>>> package Bio::DB::BioFetch;
>>> use strict;
>>> use Bio::DB::WebDBSeqI;
>>> use HTTP::Request::Common 'POST';
>>>
>>> my $format_type='fasta';
>>> my $stream;
>>>
>>>
>>> my $bf = new Bio::DB::BioFetch(-format        =>$format_type',
>>>                                -retrievaltype =>'tempfile',
>>>                                -db            =>'EMBL');
>>>
>>>  $stream = $bf->get_Stream_by_id(['BUM','J00231']);
>>>  while (my $s = $stream->next_seq) {
>>>     print $s->seq,"\n";
>>>         }
>>>
>>>
>>>           exit;
>>>
>>>
>>> [davila@tryps script]$ perl gb-fetch-1.pl
>>> Can't locate IO/String.pm in @INC (@INC contains:
>>> /usr/local/bioperl14/i386-linux-thread-multi /usr/local/bioperl14
>>> /usr/lib/perl5/5.8.3/i386-linux-thread-multi /usr/lib/perl5/5.8.3
>>> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
>>> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi
>>> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi
>>> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
>>> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2
>>> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0
>>> /usr/lib/perl5/site_perl
>>> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
>>> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi
>>> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi
>>> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
>>> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2
>>> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0
>>> /usr/lib/perl5/vendor_perl .) at
>>> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
>>> BEGIN failed--compilation aborted at
>>> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
>>> Compilation failed in require at gb-fetch-1.pl line 6.
>>> BEGIN failed--compilation aborted at gb-fetch-1.pl line 6.
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

From sdavis2 at mail.nih.gov  Thu Mar 25 09:12:23 2004
From: sdavis2 at mail.nih.gov (Sean Davis)
Date: Thu Mar 25 09:14:05 2004
Subject: [Bioperl-l] Fetching Fasta seqs from GenBank - Help request
In-Reply-To: <1080218407.3118.8.camel@tryps>
Message-ID: <BC8852F7.61C6%sdavis2@mail.nih.gov>

Alberto,

This code will output your sequences in fasta to a file called outfile.fa.

Sean

#!/usr/local/bin/perl -w
use lib "/usr/local/bioperl14";
use Bio::DB::BioFetch;
use strict;
use Bio::DB::WebDBSeqI;
use HTTP::Request::Common 'POST';
my $format_type='fasta';my $stream;
my $bf = new Bio::DB::BioFetch(-format        =>$format_type,
                   -retrievaltype =>'tempfile',
                   -db            =>'EMBL');
#open a seqio handle for writing to outfile.fa
my $seqio = new Bio::SeqIO(-format=>'fasta',
               -file=>'>outfile.fa');

$stream = $bf->get_Stream_by_id(['BUM','J00231']);
while (my $s = $stream->next_seq) {
  print $s->seq,"\n\n\n";
#write the fasta
  $seqio->write_seq($s);
}


On 3/25/04 7:40 AM, "Alberto Davila" <davila@ioc.fiocruz.br> wrote:

> Thanks Jason,
> 
> I installed the IO::String, then it is working fine now. However I have
> a doubt, the "get_Stream_by_id" is returning me more than the "sequence
> per se", what is it ? My script and results are listed below. Finally I
> would like to save (in my local disk) the retrieved sequences as fasta
> files... is there any argument for that ?
> 
> Thanks again, Alberto
> 
> 
> #!/usr/local/bin/perl -w
> 
> use lib "/usr/local/bioperl14";
> use Bio::DB::BioFetch;
> use strict;
> use Bio::DB::WebDBSeqI;
> use HTTP::Request::Common 'POST';
> 
> my $format_type='fasta';
> my $stream;
> 
> 
> my $bf = new Bio::DB::BioFetch(-format        =>$format_type,
>                              -retrievaltype =>'tempfile',
>       -db            =>'EMBL');
>  
> $stream = $bf->get_Stream_by_id(['BUM','J00231']);
> while (my $s = $stream->next_seq) {
>   print $s->seq,"\n\n\n";
> }              
>  
>  
>  exit;
> 
> 
> 
> 
> [davila@tryps script]$ perl gb-fetch-1.pl
> agtagtgtactaccaagtatagataacgtttaaatattaaagttttggatcaaagccaaagatgattcgcat
> gctggtgctgattgtagttacagctgcaagcccagtgtatcagagatgtttccaagatggggctatagtgaa
> gcaaaacccatccaaagaggcagtcacagaagtgtccctaaaagatgatgttagca
>                  
>                  
> cctggacctcctgtgcaagaacatgaaacanctgtggttcttccttctcctggtggcagctcccagatgggt
> cctgtcccaggtgcacctgcaggagtcgggcccaggactggggaagcctccagagctcaaaaccccacttgg
> tgacacaactcacacatgcccacggtgcccagagcccaaatcttgtgacacacctcccccgtgcccacggtg
> cccagagcccaaatcttgtgacacacctcccccatgcccacggtgcccagagcccaaatcttgtgacacacc
> tcccccgtgcccnnngtgcccagcacctgaactcttgggaggaccgtcagtcttcctcttccccccaaaacc
> caaggatacccttatgatttcccggacccctgaggtcacgtgcgtggtggtggacgtgagccacgaagaccc
> nnnngtccagttcaagtggtacgtggacggcgtggaggtgcataatgccaagacaaagctgcgggaggagca
> gtacaacagcacgttccgtgtggtcagcgtcctcaccgtcctgcaccaggactggctgaacggcaaggagta
> caagtgcaaggtctccaacaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggacagcc
> cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnngaggagatgaccaagaaccaagtcagcctgacctg
> cctggtcaaaggcttctaccccagcgacatcgccgtggagtgggagagcaatgggcagccggagaacaacta
> caacaccacgcctcccatgctggactccgacggctccttcttcctctacagcaagctcaccgtggacaagag
> caggtggcagcaggggaacatcttctcatgctccgtgatgcatgaggctctgcacaaccgctacacgcagaa
> gagcctctccctgtctccgggtaaatgagtgccatggccggcaagcccccgctccccgggctctcggggtcg
> cgcgaggatgcttggcacgtaccccgtgtacatacttcccaggcacccagcatggaaataaagcacccagcg
> ctgccctgg
> 
> 
> 
> 
> On Tue, 2004-03-23 at 22:44, Jason Stajich wrote:
>> You need an additional perl module.
>> 
>> 
>> install IO::String from CPAN
>> 
>> There is a section on how to install additional perl modules in the
>> INSTALL document.
>> 
>> -j
>> 
>> On Tue, 23 Mar 2004, Alberto Davila wrote:
>> 
>>> Hi,
>>> 
>>> May I ask for some help ?
>>> 
>>> I am trying to use the BioFetch module in order to download several seqs
>>> (from specific Organisms) from GenBank in fasta format, but looks like I
>>> am missing "IO/String.pm" and other things.. should I install additional
>>> bioperl modules (I have the Bioperl Core 1.4 installed) ? or use a
>>> different module for my purpose ?
>>> 
>>> My script and error msg are listed below.
>>> 
>>> Thanks and besr regards,
>>> 
>>> Alberto
>>> 
>>> ****
>>> 
>>> #!/usr/local/bin/perl -w
>>> 
>>> use lib "/usr/local/bioperl14";
>>> package Bio::DB::BioFetch;
>>> use strict;
>>> use Bio::DB::WebDBSeqI;
>>> use HTTP::Request::Common 'POST';
>>> 
>>> my $format_type='fasta';
>>> my $stream;
>>> 
>>> 
>>> my $bf = new Bio::DB::BioFetch(-format        =>$format_type',
>>>                                -retrievaltype =>'tempfile',
>>>                                -db            =>'EMBL');
>>> 
>>>  $stream = $bf->get_Stream_by_id(['BUM','J00231']);
>>>  while (my $s = $stream->next_seq) {
>>>     print $s->seq,"\n";
>>>         }
>>> 
>>> 
>>>           exit;
>>> 
>>> 
>>> [davila@tryps script]$ perl gb-fetch-1.pl
>>> Can't locate IO/String.pm in @INC (@INC contains:
>>> /usr/local/bioperl14/i386-linux-thread-multi /usr/local/bioperl14
>>> /usr/lib/perl5/5.8.3/i386-linux-thread-multi /usr/lib/perl5/5.8.3
>>> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
>>> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi
>>> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi
>>> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
>>> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2
>>> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0
>>> /usr/lib/perl5/site_perl
>>> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
>>> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi
>>> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi
>>> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
>>> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2
>>> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0
>>> /usr/lib/perl5/vendor_perl .) at
>>> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
>>> BEGIN failed--compilation aborted at
>>> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
>>> Compilation failed in require at gb-fetch-1.pl line 6.
>>> BEGIN failed--compilation aborted at gb-fetch-1.pl line 6.
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

From gocougs21sally at hotmail.com  Thu Mar 25 22:33:33 2004
From: gocougs21sally at hotmail.com (donnie)
Date: Thu Mar 25 09:37:54 2004
Subject: [Bioperl-l] GOD in the sack?  No? Here's how!      lulu new rhonda
Message-ID: <1080272013-13686@excite.com>

Greatest offers on VIAGRzxA and CIzALcIS  (rufus sugar)
As low as three dollars a dose!
Super deals, 80 pcneert off!
Ship-ped straight to your doorstep!    (barry amanda1)

Clik Here:
http://test2.mypersonalmeds.net/sv/index.php?pid=eph9106


reynolds angels johnson goblue yvonne strawber medical mikael 
tintin nimrod impala dodgers mailman marvin river cosmos 
isaac aspen speedo zeppelin rambo1 tattoo smiths chance josh
From pvh at egenetics.com  Thu Mar 25 10:05:18 2004
From: pvh at egenetics.com (Peter van Heusden)
Date: Thu Mar 25 10:10:46 2004
Subject: [Bioperl-l] hash key for SeqFeatureI
Message-ID: <4062F52E.5000705@egenetics.com>

Hi Bioperlers

I'm adding hash_key methods to various classes (part of my work in 
making Bioperl object comparable), and for SeqFeatureI, I was wondering 
whether the following is adaquate: primary_tag(), display_id(), 
location->start(), location->end()? The point of a hash_key() method is 
to return a string which uniquely identifies the object - i.e. if two 
hash keys are identical, then the objects are equivalent.

And related to this, where is the strandedness of Bioperl SeqFeatures 
stored?

Peter
From cdwan at mail.ahc.umn.edu  Thu Mar 25 10:25:52 2004
From: cdwan at mail.ahc.umn.edu (Chris Dwan (CCGB))
Date: Thu Mar 25 10:31:17 2004
Subject: [Bioperl-l] hash key for SeqFeatureI
In-Reply-To: <4062F52E.5000705@egenetics.com>
References: <4062F52E.5000705@egenetics.com>
Message-ID: <Pine.GSO.4.58.0403250918590.16535@lenti.med.umn.edu>


> And related to this, where is the strandedness of Bioperl SeqFeatures
> stored?

Bio::SeqFeature::Generic has a member "location" which is a
Bio::Location::Simple.  Bio::Location::Simple inherits the strand method
from Bio::Location::Atomic.

Really, under the hood, it's:
  seqfeature->location()->{'_strand')}

Which may be accessed as
  seqfeature->strand()

Allowed values are -1 (or '-'), 1 (or '+') and 0 (or '.').

-Chris Dwan
From antivirus at unito.it  Thu Mar 25 10:27:19 2004
From: antivirus at unito.it (antivirus@unito.it)
Date: Thu Mar 25 10:32:40 2004
Subject: [Bioperl-l] - DO NOT REPLY - Inflex scan report [0325162716542]
Message-ID: <200403251527.i2PFRJiv016573@albert.unito.it>

Administrator Email Reply Address: rete@unito.it
Email sent to:  mapiccol@unito.it
Inflex ID: 0325162716542
Report Details -----------------------------------------------
AntiVirus Results...

File NAME/TYPE Scan Results
 MS-DOS executable (EXE), OS/2 or MS Windows


0325162716542  from:bioperl-l@bioperl.org  to: mapiccol@unito.it
END OF MESSAGE.

End.
.
From ra1ywrite at hotmail.com  Thu Mar 25 10:41:03 2004
From: ra1ywrite at hotmail.com (levi)
Date: Thu Mar 25 10:47:23 2004
Subject: [Bioperl-l] hi there! dundeec zhongguop soccer1c
Message-ID: <1080229263-32314@excite.com>


PR0ZoAC, XANAtX, VIAGRsA, CIALInS, ZYBAlN & more!!

Stop depresfsion, quit smokinvg, increase erectuions!

Drugc at HUGE DISCOuNTS!!!!
 
http://aofg.base4ds.com/s95/index.php?id=s95

steph soccer1 supra justin1 beautifu jeanette suzuki button percy metallicapolo shawn petunia buttons magnum chiquita baskeT ruth alfred philip montreal dude campbell mantra josh 
From jason at cgt.duhs.duke.edu  Thu Mar 25 11:07:53 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Thu Mar 25 11:13:53 2004
Subject: [Bioperl-l] hash key for SeqFeatureI
In-Reply-To: <Pine.GSO.4.58.0403250918590.16535@lenti.med.umn.edu>
References: <4062F52E.5000705@egenetics.com>
	<Pine.GSO.4.58.0403250918590.16535@lenti.med.umn.edu>
Message-ID: <Pine.LNX.4.50.0403251103060.13358-100000@tenero.duhs.duke.edu>

actually it is not always a Location::Simple but instead is always a
Bio::LocationI object
Which can be
  Peter the (somewhat old) object diagram on the bioperl website might be
helpful.

Split locations won't have strands but will delegate to their internal
components which are a list of Bio::LocationI objects.

Fuzzy and Split locations will try and return a sensible summary value for
start/end/strand.

The method $location->to_FTstring() will return a string which should be
comparable (although sets of locations in different orders stored
in a SplitLocation will not be equivalent).

-j
On Thu, 25 Mar 2004, Chris Dwan (CCGB) wrote:

>
> > And related to this, where is the strandedness of Bioperl SeqFeatures
> > stored?
>
> Bio::SeqFeature::Generic has a member "location" which is a
> Bio::Location::Simple.  Bio::Location::Simple inherits the strand method
> from Bio::Location::Atomic.
>
> Really, under the hood, it's:
>   seqfeature->location()->{'_strand')}
>
> Which may be accessed as
>   seqfeature->strand()
>
> Allowed values are -1 (or '-'), 1 (or '+') and 0 (or '.').
>
> -Chris Dwan
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From pftfroms1 at yahoo.co.uk  Thu Mar 25 20:02:36 2004
From: pftfroms1 at yahoo.co.uk (Post Budget)
Date: Thu Mar 25 14:06:33 2004
Subject: [Bioperl-l] Tobacco-Offer
Message-ID: <200403251906.i2PJ6Tg2026463@portal.open-bio.org>

Hi All,
       if you are a smoker in the UK or anywhere else please just take a few minutes of your time to read this, if not please accept my apologies for this E mail, it is not my intention to upset anyone.

Smokers - are you still paying high street prices for your cigarettes, cigars or rolling/pipe tobacco.

For instance, I can send you by registered post, 800 cigarettes for 180 euros.

This works out to about ?3.10 a packet (depending on exchange rate).

For this price, and it includes all brands they will be sent by registered post direct to your door. Just click on the link 
below:-

http://www.geocities.com/cheapuk1tobacco

Yours faithfully

A believer in the rights of the British Smokers

P.S.
If this E at all offends you mail - that is not my intention   ----    YOU WILL NOT BE E MAILED AGAIN

http://www.geocities.com/cheapuk1tobacco

If you would like to be removed from our database please email pftrems2@yahoo.co.uk with REMOVES in the subject line
From barry.moore at genetics.utah.edu  Thu Mar 25 12:43:21 2004
From: barry.moore at genetics.utah.edu (Barry Moore)
Date: Thu Mar 25 14:22:07 2004
Subject: [Bioperl-l] Fetching Fasta seqs from GenBank - Help request
In-Reply-To: <1080218407.3118.8.camel@tryps>
References: <1080065187.2916.63.camel@tryps>	<Pine.LNX.4.50.0403232043580.32766-100000@tenero.duhs.duke.edu>
	<1080218407.3118.8.camel@tryps>
Message-ID: <40631A39.9000205@genetics.utah.edu>

Alberto-

You said, "the 'get_Stream_by_id' is returning me more than the 
'sequence per se'".  I'm not sure if this is what your asking, but I'll 
take a shot.  Since your are retrieving your two sequences in EMBL 
format, you get all the associated information that you would see if you 
downloaded that same file from the web interface.  Your sequences are 
stored by BioPerl as RichSeq objects which inherits a PrimarySeq 
objects.  So that EMBL file data is stored in the RichSeq object and the 
associated PrimarySeq object it inherited.   Of course when you save 
that locally as a fasta file, that extra information is lost.  If you 
decide you need to use that data have a look at the documentation for 
Bio::Seq::RichSeq and Bio::PrimarySeq and the SeqIO and Feature 
Annotation HOW TOs to learn more.

Barry

Alberto Davila wrote:

>Thanks Jason,
>
>I installed the IO::String, then it is working fine now. However I have
>a doubt, the "get_Stream_by_id" is returning me more than the "sequence
>per se", what is it ? My script and results are listed below. Finally I
>would like to save (in my local disk) the retrieved sequences as fasta
>files... is there any argument for that ?
>
>Thanks again, Alberto
>
>
>#!/usr/local/bin/perl -w
>
>use lib "/usr/local/bioperl14";
>use Bio::DB::BioFetch;
>use strict;
>use Bio::DB::WebDBSeqI;
>use HTTP::Request::Common 'POST';
>
>my $format_type='fasta';
>my $stream;
>
>
>my $bf = new Bio::DB::BioFetch(-format        =>$format_type,
>                               -retrievaltype =>'tempfile',
>			       -db            =>'EMBL');
>				  
> $stream = $bf->get_Stream_by_id(['BUM','J00231']);
> while (my $s = $stream->next_seq) {
>    print $s->seq,"\n\n\n";
>	}			  
>				  
> 	  
>	  exit;
>
>
>
>
>[davila@tryps script]$ perl gb-fetch-1.pl
>agtagtgtactaccaagtatagataacgtttaaatattaaagttttggatcaaagccaaagatgattcgcat
>gctggtgctgattgtagttacagctgcaagcccagtgtatcagagatgtttccaagatggggctatagtgaa
>gcaaaacccatccaaagaggcagtcacagaagtgtccctaaaagatgatgttagca
>                                                                                
>                                                                                
>cctggacctcctgtgcaagaacatgaaacanctgtggttcttccttctcctggtggcagctcccagatgggt
>cctgtcccaggtgcacctgcaggagtcgggcccaggactggggaagcctccagagctcaaaaccccacttgg
>tgacacaactcacacatgcccacggtgcccagagcccaaatcttgtgacacacctcccccgtgcccacggtg
>cccagagcccaaatcttgtgacacacctcccccatgcccacggtgcccagagcccaaatcttgtgacacacc
>tcccccgtgcccnnngtgcccagcacctgaactcttgggaggaccgtcagtcttcctcttccccccaaaacc
>caaggatacccttatgatttcccggacccctgaggtcacgtgcgtggtggtggacgtgagccacgaagaccc
>nnnngtccagttcaagtggtacgtggacggcgtggaggtgcataatgccaagacaaagctgcgggaggagca
>gtacaacagcacgttccgtgtggtcagcgtcctcaccgtcctgcaccaggactggctgaacggcaaggagta
>caagtgcaaggtctccaacaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggacagcc
>cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnngaggagatgaccaagaaccaagtcagcctgacctg
>cctggtcaaaggcttctaccccagcgacatcgccgtggagtgggagagcaatgggcagccggagaacaacta
>caacaccacgcctcccatgctggactccgacggctccttcttcctctacagcaagctcaccgtggacaagag
>caggtggcagcaggggaacatcttctcatgctccgtgatgcatgaggctctgcacaaccgctacacgcagaa
>gagcctctccctgtctccgggtaaatgagtgccatggccggcaagcccccgctccccgggctctcggggtcg
>cgcgaggatgcttggcacgtaccccgtgtacatacttcccaggcacccagcatggaaataaagcacccagcg
>ctgccctgg
>
>
>
>
>On Tue, 2004-03-23 at 22:44, Jason Stajich wrote:
>  
>
>>You need an additional perl module.
>>
>>
>>install IO::String from CPAN
>>
>>There is a section on how to install additional perl modules in the
>>INSTALL document.
>>
>>-j
>>
>>On Tue, 23 Mar 2004, Alberto Davila wrote:
>>
>>    
>>
>>>Hi,
>>>
>>>May I ask for some help ?
>>>
>>>I am trying to use the BioFetch module in order to download several seqs
>>>(from specific Organisms) from GenBank in fasta format, but looks like I
>>>am missing "IO/String.pm" and other things.. should I install additional
>>>bioperl modules (I have the Bioperl Core 1.4 installed) ? or use a
>>>different module for my purpose ?
>>>
>>>My script and error msg are listed below.
>>>
>>>Thanks and besr regards,
>>>
>>>Alberto
>>>
>>>****
>>>
>>>#!/usr/local/bin/perl -w
>>>
>>>use lib "/usr/local/bioperl14";
>>>package Bio::DB::BioFetch;
>>>use strict;
>>>use Bio::DB::WebDBSeqI;
>>>use HTTP::Request::Common 'POST';
>>>
>>>my $format_type='fasta';
>>>my $stream;
>>>
>>>
>>>my $bf = new Bio::DB::BioFetch(-format        =>$format_type',
>>>                               -retrievaltype =>'tempfile',
>>>                               -db            =>'EMBL');
>>>
>>> $stream = $bf->get_Stream_by_id(['BUM','J00231']);
>>> while (my $s = $stream->next_seq) {
>>>    print $s->seq,"\n";
>>>        }
>>>
>>>
>>>          exit;
>>>
>>>
>>>[davila@tryps script]$ perl gb-fetch-1.pl
>>>Can't locate IO/String.pm in @INC (@INC contains:
>>>/usr/local/bioperl14/i386-linux-thread-multi /usr/local/bioperl14
>>>/usr/lib/perl5/5.8.3/i386-linux-thread-multi /usr/lib/perl5/5.8.3
>>>/usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
>>>/usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi
>>>/usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi
>>>/usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
>>>/usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2
>>>/usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0
>>>/usr/lib/perl5/site_perl
>>>/usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
>>>/usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi
>>>/usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi
>>>/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
>>>/usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2
>>>/usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0
>>>/usr/lib/perl5/vendor_perl .) at
>>>/usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
>>>BEGIN failed--compilation aborted at
>>>/usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
>>>Compilation failed in require at gb-fetch-1.pl line 6.
>>>BEGIN failed--compilation aborted at gb-fetch-1.pl line 6.
>>>      
>>>
>
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>  
>

-- 
Barry Moore
Dept. of Human Genetics
University of Utah
Salt Lake City, UT

From sdavis2 at mail.nih.gov  Thu Mar 25 14:37:13 2004
From: sdavis2 at mail.nih.gov (Sean Davis)
Date: Thu Mar 25 14:42:31 2004
Subject: [Bioperl-l] Fetching Fasta seqs from GenBank - Help request
In-Reply-To: <40631A39.9000205@genetics.utah.edu>
Message-ID: <BC889F19.622A%sdavis2@mail.nih.gov>

Alberto,

I would second that.  If are doing more with this than retrieving raw
sequence (if you care at all), maybe you could let Barry and I know what you
are trying to do more generally.  Bioperl is quite powerful, but it does
take some direction to get started.

Sean

On 3/25/04 12:43 PM, "Barry Moore" <barry.moore@genetics.utah.edu> wrote:

> Alberto-
> 
> You said, "the 'get_Stream_by_id' is returning me more than the
> 'sequence per se'".  I'm not sure if this is what your asking, but I'll
> take a shot.  Since your are retrieving your two sequences in EMBL
> format, you get all the associated information that you would see if you
> 
> downloaded that same file from the web interface.  Your sequences are
> stored by BioPerl as RichSeq objects which inherits a PrimarySeq
> objects.  So that EMBL file data is stored in the RichSeq object and the
> 
> associated PrimarySeq object it inherited.   Of course when you save
> that locally as a fasta file, that extra information is lost.  If you
> decide you need to use that data have a look at the documentation for
> Bio::Seq::RichSeq and Bio::PrimarySeq and the SeqIO and Feature
> Annotation HOW TOs to learn more.
> 
> Barry
> 
> Alberto Davila wrote:
> 
>> Thanks Jason,
>> 
>> I installed the IO::String, then it is working fine now. However I have
>> a doubt, the "get_Stream_by_id" is returning me more than the "sequence
>> per se", what is it ? My script and results are listed below. Finally I
>> would like to save (in my local disk) the retrieved sequences as fasta
>> files... is there any argument for that ?
>> 
>> Thanks again, Alberto
>> 
>> 
>> #!/usr/local/bin/perl -w
>> 
>> use lib "/usr/local/bioperl14";
>> use Bio::DB::BioFetch;
>> use strict;
>> use Bio::DB::WebDBSeqI;
>> use HTTP::Request::Common 'POST';
>> 
>> my $format_type='fasta';
>> my $stream;
>> 
>> 
>> my $bf = new Bio::DB::BioFetch(-format        =>$format_type,
>>                               -retrievaltype =>'tempfile',
>>       -db            =>'EMBL');
>>  
>> $stream = $bf->get_Stream_by_id(['BUM','J00231']);
>> while (my $s = $stream->next_seq) {
>>    print $s->seq,"\n\n\n";
>> }              
>>  
>>  
>>  exit;
>> 
>> 
>> 
>> 
>> [davila@tryps script]$ perl gb-fetch-1.pl
>> agtagtgtactaccaagtatagataacgtttaaatattaaagttttggatcaaagccaaagatgattcgca
> t
>> gctggtgctgattgtagttacagctgcaagcccagtgtatcagagatgtttccaagatggggctatagtga
> a
>> gcaaaacccatccaaagaggcagtcacagaagtgtccctaaaagatgatgttagca
>> 
> 
>> 
> 
>> cctggacctcctgtgcaagaacatgaaacanctgtggttcttccttctcctggtggcagctcccagatggg
> t
>> cctgtcccaggtgcacctgcaggagtcgggcccaggactggggaagcctccagagctcaaaaccccacttg
> g
>> tgacacaactcacacatgcccacggtgcccagagcccaaatcttgtgacacacctcccccgtgcccacggt
> g
>> cccagagcccaaatcttgtgacacacctcccccatgcccacggtgcccagagcccaaatcttgtgacacac
> c
>> tcccccgtgcccnnngtgcccagcacctgaactcttgggaggaccgtcagtcttcctcttccccccaaaac
> c
>> caaggatacccttatgatttcccggacccctgaggtcacgtgcgtggtggtggacgtgagccacgaagacc
> c
>> nnnngtccagttcaagtggtacgtggacggcgtggaggtgcataatgccaagacaaagctgcgggaggagc
> a
>> gtacaacagcacgttccgtgtggtcagcgtcctcaccgtcctgcaccaggactggctgaacggcaaggagt
> a
>> caagtgcaaggtctccaacaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggacagc
> c
>> cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnngaggagatgaccaagaaccaagtcagcctgacct
> g
>> cctggtcaaaggcttctaccccagcgacatcgccgtggagtgggagagcaatgggcagccggagaacaact
> a
>> caacaccacgcctcccatgctggactccgacggctccttcttcctctacagcaagctcaccgtggacaaga
> g
>> caggtggcagcaggggaacatcttctcatgctccgtgatgcatgaggctctgcacaaccgctacacgcaga
> a
>> gagcctctccctgtctccgggtaaatgagtgccatggccggcaagcccccgctccccgggctctcggggtc
> g
>> cgcgaggatgcttggcacgtaccccgtgtacatacttcccaggcacccagcatggaaataaagcacccagc
> g
>> ctgccctgg
>> 
>> 
>> 
>> 
>> On Tue, 2004-03-23 at 22:44, Jason Stajich wrote:
>>  
>> 
>>> You need an additional perl module.
>>> 
>>> 
>>> install IO::String from CPAN
>>> 
>>> There is a section on how to install additional perl modules in the
>>> INSTALL document.
>>> 
>>> -j
>>> 
>>> On Tue, 23 Mar 2004, Alberto Davila wrote:
>>> 
>>>    
>>> 
>>>> Hi,
>>>> 
>>>> May I ask for some help ?
>>>> 
>>>> I am trying to use the BioFetch module in order to download several
> seqs
>>>> (from specific Organisms) from GenBank in fasta format, but looks
> like I
>>>> am missing "IO/String.pm" and other things.. should I install
> additional
>>>> bioperl modules (I have the Bioperl Core 1.4 installed) ? or use a
>>>> different module for my purpose ?
>>>> 
>>>> My script and error msg are listed below.
>>>> 
>>>> Thanks and besr regards,
>>>> 
>>>> Alberto
>>>> 
>>>> ****
>>>> 
>>>> #!/usr/local/bin/perl -w
>>>> 
>>>> use lib "/usr/local/bioperl14";
>>>> package Bio::DB::BioFetch;
>>>> use strict;
>>>> use Bio::DB::WebDBSeqI;
>>>> use HTTP::Request::Common 'POST';
>>>> 
>>>> my $format_type='fasta';
>>>> my $stream;
>>>> 
>>>> 
>>>> my $bf = new Bio::DB::BioFetch(-format        =>$format_type',
>>>>                               -retrievaltype =>'tempfile',
>>>>                               -db            =>'EMBL');
>>>> 
>>>> $stream = $bf->get_Stream_by_id(['BUM','J00231']);
>>>> while (my $s = $stream->next_seq) {
>>>>    print $s->seq,"\n";
>>>>        }
>>>> 
>>>> 
>>>>          exit;
>>>> 
>>>> 
>>>> [davila@tryps script]$ perl gb-fetch-1.pl
>>>> Can't locate IO/String.pm in @INC (@INC contains:
>>>> /usr/local/bioperl14/i386-linux-thread-multi /usr/local/bioperl14
>>>> /usr/lib/perl5/5.8.3/i386-linux-thread-multi /usr/lib/perl5/5.8.3
>>>> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
>>>> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi
>>>> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi
>>>> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
>>>> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2
>>>> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0
>>>> /usr/lib/perl5/site_perl
>>>> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
>>>> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi
>>>> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi
>>>> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
>>>> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2
>>>> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0
>>>> /usr/lib/perl5/vendor_perl .) at
>>>> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
>>>> BEGIN failed--compilation aborted at
>>>> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
>>>> Compilation failed in require at gb-fetch-1.pl line 6.
>>>> BEGIN failed--compilation aborted at gb-fetch-1.pl line 6.
>>>>      
>>>> 
>> 
>> 
>> _______________________________________________
>> Bioperl-l mailing list
>> Bioperl-l@portal.open-bio.org
>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>  
>> 

From davila at ioc.fiocruz.br  Thu Mar 25 15:38:32 2004
From: davila at ioc.fiocruz.br (Alberto Davila)
Date: Thu Mar 25 16:37:03 2004
Subject: [Bioperl-l] Fetching Fasta seqs from GenBank - Help request
In-Reply-To: <BC889F19.622A%sdavis2@mail.nih.gov>
References: <BC889F19.622A%sdavis2@mail.nih.gov>
Message-ID: <1080247112.7333.23.camel@tryps>

Hi Sean,

Thanks for your valuable help !

I solved the problem using "Bio::DB::Query::GenBank", my goal was to
retrieve 2 types of sequences (mitochondrial and ribosomal) from
specific organism (eg Bothrops spp)... I am listing my script for those
interested to do something similar.. the only warning I get is:

[davila@tryps script]$ perl fetch2contaminant.pl
Useless use of a constant in void context at fetch2contaminant.pl line
10.

I was not sure in which field (eg keyword or feature) I should look for
ribosomal and mitochondrial genes, but leaving blank gave some good
results.

Indeed Bioperl is powerful... a bit confusing for beginners too.

Thanks and best regards,

Alberto


#!/usr/local/bin/perl -w

use lib "/usr/local/bioperl14";
use strict;
use Bio::DB::Query::GenBank;
use Bio::SeqIO;
use Bio::DB::GenBank;


my $query_string = ('Bothrops[Organism] AND
ribosomal','Bothrops[Organism] AND mitochondrial');
my $query = new Bio::DB::Query::GenBank(-db=>'nucleotide',
                                        -query=>$query_string,
		                        -mindate => '1985',
		                        -maxdate => '2004');

my $seqio=new Bio::DB::GenBank->get_Stream_by_query($query);

#open a seqio handle for writing the outputfile in fasta
my $outfile = new Bio::SeqIO(-format=>'fasta',
                             -file=>'>contaminant.bothrops');
				  
 while (my $s = $seqio->next_seq) {

#write the fasta  
   $outfile->write_seq($s);
   
	}			  
				  
 	  
	  exit;


On Thu, 2004-03-25 at 16:37, Sean Davis wrote:
> Alberto,
> 
> I would second that.  If are doing more with this than retrieving raw
> sequence (if you care at all), maybe you could let Barry and I know what you
> are trying to do more generally.  Bioperl is quite powerful, but it does
> take some direction to get started.
> 
> Sean
> 
> On 3/25/04 12:43 PM, "Barry Moore" <barry.moore@genetics.utah.edu> wrote:
> 
> > Alberto-
> > 
> > You said, "the 'get_Stream_by_id' is returning me more than the
> > 'sequence per se'".  I'm not sure if this is what your asking, but I'll
> > take a shot.  Since your are retrieving your two sequences in EMBL
> > format, you get all the associated information that you would see if you
> > 
> > downloaded that same file from the web interface.  Your sequences are
> > stored by BioPerl as RichSeq objects which inherits a PrimarySeq
> > objects.  So that EMBL file data is stored in the RichSeq object and the
> > 
> > associated PrimarySeq object it inherited.   Of course when you save
> > that locally as a fasta file, that extra information is lost.  If you
> > decide you need to use that data have a look at the documentation for
> > Bio::Seq::RichSeq and Bio::PrimarySeq and the SeqIO and Feature
> > Annotation HOW TOs to learn more.
> > 
> > Barry
> > 
> > Alberto Davila wrote:
> > 
> >> Thanks Jason,
> >> 
> >> I installed the IO::String, then it is working fine now. However I have
> >> a doubt, the "get_Stream_by_id" is returning me more than the "sequence
> >> per se", what is it ? My script and results are listed below. Finally I
> >> would like to save (in my local disk) the retrieved sequences as fasta
> >> files... is there any argument for that ?
> >> 
> >> Thanks again, Alberto
> >> 
> >> 
> >> #!/usr/local/bin/perl -w
> >> 
> >> use lib "/usr/local/bioperl14";
> >> use Bio::DB::BioFetch;
> >> use strict;
> >> use Bio::DB::WebDBSeqI;
> >> use HTTP::Request::Common 'POST';
> >> 
> >> my $format_type='fasta';
> >> my $stream;
> >> 
> >> 
> >> my $bf = new Bio::DB::BioFetch(-format        =>$format_type,
> >>                               -retrievaltype =>'tempfile',
> >>       -db            =>'EMBL');
> >>  
> >> $stream = $bf->get_Stream_by_id(['BUM','J00231']);
> >> while (my $s = $stream->next_seq) {
> >>    print $s->seq,"\n\n\n";
> >> }              
> >>  
> >>  
> >>  exit;
> >> 
> >> 
> >> 
> >> 
> >> [davila@tryps script]$ perl gb-fetch-1.pl
> >> agtagtgtactaccaagtatagataacgtttaaatattaaagttttggatcaaagccaaagatgattcgca
> > t
> >> gctggtgctgattgtagttacagctgcaagcccagtgtatcagagatgtttccaagatggggctatagtga
> > a
> >> gcaaaacccatccaaagaggcagtcacagaagtgtccctaaaagatgatgttagca
> >> 
> > 
> >> 
> > 
> >> cctggacctcctgtgcaagaacatgaaacanctgtggttcttccttctcctggtggcagctcccagatggg
> > t
> >> cctgtcccaggtgcacctgcaggagtcgggcccaggactggggaagcctccagagctcaaaaccccacttg
> > g
> >> tgacacaactcacacatgcccacggtgcccagagcccaaatcttgtgacacacctcccccgtgcccacggt
> > g
> >> cccagagcccaaatcttgtgacacacctcccccatgcccacggtgcccagagcccaaatcttgtgacacac
> > c
> >> tcccccgtgcccnnngtgcccagcacctgaactcttgggaggaccgtcagtcttcctcttccccccaaaac
> > c
> >> caaggatacccttatgatttcccggacccctgaggtcacgtgcgtggtggtggacgtgagccacgaagacc
> > c
> >> nnnngtccagttcaagtggtacgtggacggcgtggaggtgcataatgccaagacaaagctgcgggaggagc
> > a
> >> gtacaacagcacgttccgtgtggtcagcgtcctcaccgtcctgcaccaggactggctgaacggcaaggagt
> > a
> >> caagtgcaaggtctccaacaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggacagc
> > c
> >> cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnngaggagatgaccaagaaccaagtcagcctgacct
> > g
> >> cctggtcaaaggcttctaccccagcgacatcgccgtggagtgggagagcaatgggcagccggagaacaact
> > a
> >> caacaccacgcctcccatgctggactccgacggctccttcttcctctacagcaagctcaccgtggacaaga
> > g
> >> caggtggcagcaggggaacatcttctcatgctccgtgatgcatgaggctctgcacaaccgctacacgcaga
> > a
> >> gagcctctccctgtctccgggtaaatgagtgccatggccggcaagcccccgctccccgggctctcggggtc
> > g
> >> cgcgaggatgcttggcacgtaccccgtgtacatacttcccaggcacccagcatggaaataaagcacccagc
> > g
> >> ctgccctgg
> >> 
> >> 
> >> 
> >> 
> >> On Tue, 2004-03-23 at 22:44, Jason Stajich wrote:
> >>  
> >> 
> >>> You need an additional perl module.
> >>> 
> >>> 
> >>> install IO::String from CPAN
> >>> 
> >>> There is a section on how to install additional perl modules in the
> >>> INSTALL document.
> >>> 
> >>> -j
> >>> 
> >>> On Tue, 23 Mar 2004, Alberto Davila wrote:
> >>> 
> >>>    
> >>> 
> >>>> Hi,
> >>>> 
> >>>> May I ask for some help ?
> >>>> 
> >>>> I am trying to use the BioFetch module in order to download several
> > seqs
> >>>> (from specific Organisms) from GenBank in fasta format, but looks
> > like I
> >>>> am missing "IO/String.pm" and other things.. should I install
> > additional
> >>>> bioperl modules (I have the Bioperl Core 1.4 installed) ? or use a
> >>>> different module for my purpose ?
> >>>> 
> >>>> My script and error msg are listed below.
> >>>> 
> >>>> Thanks and besr regards,
> >>>> 
> >>>> Alberto
> >>>> 
> >>>> ****
> >>>> 
> >>>> #!/usr/local/bin/perl -w
> >>>> 
> >>>> use lib "/usr/local/bioperl14";
> >>>> package Bio::DB::BioFetch;
> >>>> use strict;
> >>>> use Bio::DB::WebDBSeqI;
> >>>> use HTTP::Request::Common 'POST';
> >>>> 
> >>>> my $format_type='fasta';
> >>>> my $stream;
> >>>> 
> >>>> 
> >>>> my $bf = new Bio::DB::BioFetch(-format        =>$format_type',
> >>>>                               -retrievaltype =>'tempfile',
> >>>>                               -db            =>'EMBL');
> >>>> 
> >>>> $stream = $bf->get_Stream_by_id(['BUM','J00231']);
> >>>> while (my $s = $stream->next_seq) {
> >>>>    print $s->seq,"\n";
> >>>>        }
> >>>> 
> >>>> 
> >>>>          exit;
> >>>> 
> >>>> 
> >>>> [davila@tryps script]$ perl gb-fetch-1.pl
> >>>> Can't locate IO/String.pm in @INC (@INC contains:
> >>>> /usr/local/bioperl14/i386-linux-thread-multi /usr/local/bioperl14
> >>>> /usr/lib/perl5/5.8.3/i386-linux-thread-multi /usr/lib/perl5/5.8.3
> >>>> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
> >>>> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi
> >>>> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi
> >>>> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
> >>>> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2
> >>>> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0
> >>>> /usr/lib/perl5/site_perl
> >>>> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
> >>>> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi
> >>>> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi
> >>>> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
> >>>> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2
> >>>> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0
> >>>> /usr/lib/perl5/vendor_perl .) at
> >>>> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
> >>>> BEGIN failed--compilation aborted at
> >>>> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
> >>>> Compilation failed in require at gb-fetch-1.pl line 6.
> >>>> BEGIN failed--compilation aborted at gb-fetch-1.pl line 6.


From rfsouza at citri.iq.usp.br  Thu Mar 25 20:48:51 2004
From: rfsouza at citri.iq.usp.br (Robson Francisco de Souza)
Date: Thu Mar 25 20:51:48 2004
Subject: [Bioperl-l] Sequence IDs
Message-ID: <20040326014851.GG23629@genoma4.iq.usp.br>

Hi,

This is not quite a bioperl question, but maybe there is some useful
code in bioperl that can easy the task below.

I'm trying to automatically track GI changes for a sequence in Genbank,
so that I can list all GI numbers associated to a gene in a complete
genome from Genbank, EMBL or Kegg. I need this to compare results from
Kegg Orthologs database (GenomeNet), COG database (NCBI) and the String
server from EMBL.
Although I was able to download Kegg release 29, GI numbers for genes in
this release are outdated and do not agree with COG GIs. Does anybody
know of a way to retrive all GIs (PID) for every genes in complete genomes?

Also, what is the unique ID or accession key for protein/dna
sequence that is used by all main sequence databases (Genbank,
SP/TrEMBL, DDBJ), if there is any? I couldn't find an easy way to cross 
results from analysis that employ identifications from different
databases.

Thanks for any help.
Robson
From lavwson_24 at hotmail.com  Thu Mar 25 21:56:46 2004
From: lavwson_24 at hotmail.com (thad)
Date: Thu Mar 25 22:01:42 2004
Subject: [Bioperl-l] Buy C_I_A_L_I_S cheaply!   cuddles playboy
Message-ID: <1080269806-22450@excite.com>

This drug puts Vi-ag-ra to shame!

One pill, all weekend!

http://happyday.wehavecheap.com/sv/index.php?pid=eph9106


politics glenn saskia jan biology japan cosmos director jkm lamer snuffy jkm sarah1 ssssss redwing fireball cardinal skidoo dodgers tiffany electric dragonfl slacker laddie hornet lovevalhalla tina valhalla daddy benoit wolfMan mookie sherry swimming ruby buttons cynthia bozo 
get off the list here:  http://mimi.mybestchoice.net/mm/chair.php
From Mohammad.Shahid at Genetik.Uni-Bielefeld.DE  Fri Mar 26 15:36:33 2004
From: Mohammad.Shahid at Genetik.Uni-Bielefeld.DE (M.Shahid)
Date: Fri Mar 26 06:39:33 2004
Subject: [Bioperl-l] Perl Regular Expressions sought that match typical
	bioinformatics applications input data
Message-ID: <000c01c41372$07675af0$66125ec2@meemsheen>

Dear All,
For a small software project, I'm looking for Perl Regular
Expressions that match file formats and related input used
for bioinformatics applications, e.g. FASTA, Clustal, 
PROSITE patterns, PAM similarity matrices, etc. Note 
that I do not want to parse such data, just check whether 
the format is correct. Is there any compilation out there ? 
If not, what is the best starting point to collect such Regular 
Expressions ?

Best Wishes,

M. Shahid

From ak at ebi.ac.uk  Fri Mar 26 07:01:49 2004
From: ak at ebi.ac.uk (Andreas Kahari)
Date: Fri Mar 26 07:07:07 2004
Subject: [Bioperl-l] Perl Regular Expressions sought that match typical
	bioinformatics applications input data
In-Reply-To: <000c01c41372$07675af0$66125ec2@meemsheen>
References: <000c01c41372$07675af0$66125ec2@meemsheen>
Message-ID: <20040326120149.GA3578@ebi.ac.uk>

On Fri, Mar 26, 2004 at 12:36:33PM -0800, M.Shahid wrote:
> Dear All,
> For a small software project, I'm looking for Perl Regular
> Expressions that match file formats and related input used
> for bioinformatics applications, e.g. FASTA, Clustal, 
> PROSITE patterns, PAM similarity matrices, etc. Note 
> that I do not want to parse such data, just check whether 
> the format is correct. Is there any compilation out there ? 
> If not, what is the best starting point to collect such Regular 
> Expressions ?

I'm wondering wheather regular expression really would be the
best thing to use.  It is common knowledge that it is *very*
hard to validate formats such as for example HTML using regular
expressions (one needs a proper validating parser).

Detecting is another matter.  There is a module called
Bio::Tools::GuessSeqFormat in BioPerl that, I originally wrote.
It doesn't claim to validate the various input formats or even
that it will deliver a correct answer.  It will only apply
regular expressions to the header lines of data files and try to
determine the format of the file (or string).


I don't know what the best approach to format validation would
be, but it is my belief that regular expression can not be used
alone.


Regards,
Andreas

-- 
| {} | Andreas K?h?ri      EMBL, European Bioinformatics Institute
|{}{}|                     Wellcome Trust Genome Campus
| {} | DAS Project Leader  Hinxton, Cambridgeshire, CB10 1SD
|{}{}| Ensembl Developer   United Kingdom
From Zhongwu.Lai at ari.altanapharma.com  Thu Mar 25 18:28:19 2004
From: Zhongwu.Lai at ari.altanapharma.com (Zhongwu.Lai@ari.altanapharma.com)
Date: Fri Mar 26 08:37:59 2004
Subject: [Bioperl-l] A question about Bio::Graphics
Message-ID: <OF24E87A9E.9C497419-ON85256E62.00800A30-85256E62.0080EFA8@ari.altanapharma.com>

Hi,

I'm wondering whether someone can help me on how to show a description in 
Bio::Graphics on left, or right side of the Glyph, instead of below.  I'm 
using following code:

my $track = $panel->add_track(-glyph       => 'graded_segments',
                              -label       => 1,
                              -connector   => 'dashed',
                              -bgcolor     => 'blue',
                              -font2color  => 'red',
                              -sort_order  => 'high_score',
                              -description => sub {
                                 my $feature = shift;
                                 return unless 
$feature->has_tag('description');
                                 my ($description) = 
$feature->each_tag_value('description');
                                 my $score = $feature->score;
                                 "$description, score=$score";
                               });
However, the description is by default always below the glyph.  As UCSC 
genome browser has demonstrated, it can be put on side.  But I couldn't 
find the right option.

Your time and help is greatly appreciated!

Zhongwu
From khoueiry at ibsm.cnrs-mrs.fr  Fri Mar 26 07:14:10 2004
From: khoueiry at ibsm.cnrs-mrs.fr (KHOUEIRY pierre)
Date: Fri Mar 26 08:38:02 2004
Subject: [Bioperl-l] coloring subseq
Message-ID: <40641E92.4080803@ibsm.cnrs-mrs.fr>

Hello,
Anybody can help me to color a subseq in Bioperl.. Per example, I have a 
subseq of "NPYCEWKCPVHNYIPNWLKLA" in wich i like to color
H and C in blue and P and K in red...
Any suggestions would be appreciated
Thanks...
-- 

---------------------------------
Pierre Khoueiry	
khoueiry@ibsm.cnrs-mrs.fr
LCB - CNRS
31, Chemin Joseph Aiguier,
13402 Marseille CEDEX 20, France
---------------------------------
TIP : When the cup is half empty, make it half full

From ak at ebi.ac.uk  Fri Mar 26 08:52:46 2004
From: ak at ebi.ac.uk (Andreas Kahari)
Date: Fri Mar 26 08:58:12 2004
Subject: [Bioperl-l] coloring subseq
In-Reply-To: <40641E92.4080803@ibsm.cnrs-mrs.fr>
References: <40641E92.4080803@ibsm.cnrs-mrs.fr>
Message-ID: <20040326135246.GA5053@ebi.ac.uk>

On Fri, Mar 26, 2004 at 01:14:10PM +0100, KHOUEIRY pierre wrote:
> Hello,
> Anybody can help me to color a subseq in Bioperl.. Per example, I have a 
> subseq of "NPYCEWKCPVHNYIPNWLKLA" in wich i like to color
> H and C in blue and P and K in red...
> Any suggestions would be appreciated
> Thanks...

This would be colors in a Unix console window, in HTML, in a PNG
file, or where?

Here's a short way of doing it by applying HTML <font> tags with
a color attribute (I'm not sure this is the correct usage of the
tag in HTML, but it shouldn't be hard to modify):

my $seq = 'NPYCEWKCPVHNYIPNWLKLA';

$seq =~ s/([NC])/<font color="blue">\1<\/font>/g;
$seq =~ s/([PK])/<font color="red">\1<\/font>/g;

print "$seq\n";

-- 
|(==)| Andreas K?h?ri      EMBL, European Bioinformatics Institute
|=)(=|                     Wellcome Trust Genome Campus
|(==)| DAS Project Leader  Hinxton, Cambridgeshire, CB10 1SD
|=)(=| Ensembl Developer   United Kingdom
From heikki at ebi.ac.uk  Fri Mar 26 09:55:17 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Fri Mar 26 10:00:46 2004
Subject: [Bioperl-l] Sequence IDs
In-Reply-To: <20040326014851.GG23629@genoma4.iq.usp.br>
References: <20040326014851.GG23629@genoma4.iq.usp.br>
Message-ID: <200403261455.17296.heikki@ebi.ac.uk>


Robson,

In my best knowledge there is not yet a global system for gene idenfiers, so I 
can not help you there.

GenBank, EMBL and DDBJ have agreed on using accession numbers (AC line) as 
unique, global identifiers. All other IDs are unique only within a single 
database.

	-Heikki

On Friday 26 Mar 2004 01:48, Robson Francisco de Souza wrote:
> Hi,
>
> This is not quite a bioperl question, but maybe there is some useful
> code in bioperl that can easy the task below.
>
> I'm trying to automatically track GI changes for a sequence in Genbank,
> so that I can list all GI numbers associated to a gene in a complete
> genome from Genbank, EMBL or Kegg. I need this to compare results from
> Kegg Orthologs database (GenomeNet), COG database (NCBI) and the String
> server from EMBL.
> Although I was able to download Kegg release 29, GI numbers for genes in
> this release are outdated and do not agree with COG GIs. Does anybody
> know of a way to retrive all GIs (PID) for every genes in complete genomes?
>
> Also, what is the unique ID or accession key for protein/dna
> sequence that is used by all main sequence databases (Genbank,
> SP/TrEMBL, DDBJ), if there is any? I couldn't find an easy way to cross
> results from analysis that employ identifications from different
> databases.
>
> Thanks for any help.
> Robson
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From rnilsson at clarku.edu  Fri Mar 26 10:55:22 2004
From: rnilsson at clarku.edu (henrik nilsson)
Date: Fri Mar 26 11:01:12 2004
Subject: [Bioperl-l] Fetching > 500 sequences
Message-ID: <200403261055.22737.rnilsson@clarku.edu>

Hi,
	Thank you very much for your help.  I went through and took out any
references to "retmax" in both Bio::DB::Query::GenBank and Bio::DB::GenBank
and Bio::DB::NCBIHelper.  Our script first sends a query through
Bio::DB::Query::GenBank and that works fine (it only returns the count found,
and that count is > 7000).  However, we then actually query GenBank with
Bio::DB::GenBank and it only returns 500 despite the fact it should be
returning 7000+ (and we really want them all).  I compared both of the
scripts which look very similar in what was passed to GenBank.

	I was wondering if any other variables would cause the Entrez scripts to
 only return 500?  We do use mindate/maxdate and we fetch by query string (if
 that matters).  Any other ideas?

Thanks again for all of your help.

Rolf

> > > It seems that I have problems with fetching more than 500 sequences
> > > from Genbank using Bioperl. It looks like the script (attached below)
> > > fetches all the 7000+ sequences, but only 500 make it to the output
> > > file. Is there any way to get all these 7000+ sequences written to the
> > > file - that is, is it possible to sidestep the 500 seq. limit?
>
> I actually debugged and fixed this problem recently for Biopython --
> it looks like a change in the way EUtils works. If you pass 'retmax'
> to the eutils URL then it will only give you back at max 500
> sequences, no matter what you pass for this parameter. The fix I
> found that worked was to not pass 'retmax'.
>
> The attached patch to Bio/DB/Query/GenBank.pm should fix the
> problem, if similar symptoms equal similar fixes in this case. An
> actual Perl/BioPerl person should look at this, though, as I'm not
> to be trusted for coding Perl :-).
>
> Hope this helps.
> Brad

-------------------------------------------------------

From rrouse at biomail.ucsd.edu  Fri Mar 26 11:45:45 2004
From: rrouse at biomail.ucsd.edu (Richard Rouse)
Date: Fri Mar 26 11:50:50 2004
Subject: [Bioperl-l] IUB/IUPAC nucleic acid codes
Message-ID: <ADEFIHIJAHBGLBNJCIADIEAHCFAA.rrouse@biomail.ucsd.edu>


Does anyone know if bioperl has a tool that can generate all the possible
sequence combinations from a sequence like this:

VMARAYTYKHKRAAAASABKTCCA


From amackey at pcbi.upenn.edu  Fri Mar 26 12:55:02 2004
From: amackey at pcbi.upenn.edu (Aaron J. Mackey)
Date: Fri Mar 26 13:00:32 2004
Subject: [Bioperl-l] IUB/IUPAC nucleic acid codes
In-Reply-To: <ADEFIHIJAHBGLBNJCIADIEAHCFAA.rrouse@biomail.ucsd.edu>
References: <ADEFIHIJAHBGLBNJCIADIEAHCFAA.rrouse@biomail.ucsd.edu>
Message-ID: <B4C0BD76-7F4E-11D8-85B5-000A958C5008@pcbi.upenn.edu>


Try Bio::Tools::IUPAC, if you mean what I think you do.

-Aaron

On Mar 26, 2004, at 11:45 AM, Richard Rouse wrote:

>
>
> Does anyone know if bioperl has a tool that can generate all the 
> possible
> sequence combinations from a sequence like this:
>
> VMARAYTYKHKRAAAASABKTCCA
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

From amackey at pcbi.upenn.edu  Fri Mar 26 12:55:02 2004
From: amackey at pcbi.upenn.edu (Aaron J. Mackey)
Date: Fri Mar 26 13:00:45 2004
Subject: [Bioperl-l] IUB/IUPAC nucleic acid codes
In-Reply-To: <ADEFIHIJAHBGLBNJCIADIEAHCFAA.rrouse@biomail.ucsd.edu>
References: <ADEFIHIJAHBGLBNJCIADIEAHCFAA.rrouse@biomail.ucsd.edu>
Message-ID: <B4C0BD76-7F4E-11D8-85B5-000A958C5008@pcbi.upenn.edu>


Try Bio::Tools::IUPAC, if you mean what I think you do.

-Aaron

On Mar 26, 2004, at 11:45 AM, Richard Rouse wrote:

>
>
> Does anyone know if bioperl has a tool that can generate all the 
> possible
> sequence combinations from a sequence like this:
>
> VMARAYTYKHKRAAAASABKTCCA
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

From allenday at ucla.edu  Fri Mar 26 14:25:19 2004
From: allenday at ucla.edu (Allen Day)
Date: Fri Mar 26 14:30:42 2004
Subject: [Bioperl-l] IUB/IUPAC nucleic acid codes
In-Reply-To: <ADEFIHIJAHBGLBNJCIADIEAHCFAA.rrouse@biomail.ucsd.edu>
References: <ADEFIHIJAHBGLBNJCIADIEAHCFAA.rrouse@biomail.ucsd.edu>
Message-ID: <Pine.LNX.4.58.0403261123190.19494@sumo.genetics.ucla.edu>

no, i was looking for this recently too.  in my own dev libs i've already
written the code that can go from concrete (ie, ATCG) sequence to
degenerate if you supply the degenerate letters you want to allow.  i have
some code that does what you ask too.  i'll commit it into bioperl when
it's a little more mature.

can you give your use case, possibly with an example of how you'd like to
use the api?

-allen


On Fri, 26 Mar 2004, Richard Rouse wrote:

> 
> 
> Does anyone know if bioperl has a tool that can generate all the possible
> sequence combinations from a sequence like this:
> 
> VMARAYTYKHKRAAAASABKTCCA
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
From rrouse at biomail.ucsd.edu  Fri Mar 26 15:17:02 2004
From: rrouse at biomail.ucsd.edu (Richard Rouse)
Date: Fri Mar 26 15:22:08 2004
Subject: [Bioperl-l] IUB/IUPAC nucleic acid codes
In-Reply-To: <Pine.LNX.4.58.0403261123190.19494@sumo.genetics.ucla.edu>
Message-ID: <ADEFIHIJAHBGLBNJCIADGEALCFAA.rrouse@biomail.ucsd.edu>

Allen,

So perhaps the input can be a fasta formatted file containing these types of
bases and the output will also be fasta formatted having all the
combinations:

INPUT
> accession1
VVCCAGGAWC
> accession2
TGGCACRCWT
> accession3
TNGSNYTNWTV
_____________________________

OUTPUT
accession1
>1
sequence
>2
sequence
	etc .....
accession2
>1
sequence
>2
sequence
	etc .....

Richard


-----Original Message-----
From: Allen Day [mailto:allenday@ucla.edu]
Sent: Friday, March 26, 2004 11:25 AM
To: Richard Rouse
Cc: bioperl-l@bioperl.org
Subject: Re: [Bioperl-l] IUB/IUPAC nucleic acid codes


no, i was looking for this recently too.  in my own dev libs i've already
written the code that can go from concrete (ie, ATCG) sequence to
degenerate if you supply the degenerate letters you want to allow.  i have
some code that does what you ask too.  i'll commit it into bioperl when
it's a little more mature.

can you give your use case, possibly with an example of how you'd like to
use the api?

-allen


On Fri, 26 Mar 2004, Richard Rouse wrote:

>
>
> Does anyone know if bioperl has a tool that can generate all the possible
> sequence combinations from a sequence like this:
>
> VMARAYTYKHKRAAAASABKTCCA
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>


From jeffrey_chang at stanfordalumni.org  Fri Mar 26 16:53:41 2004
From: jeffrey_chang at stanfordalumni.org (Jeffrey Chang)
Date: Fri Mar 26 16:59:33 2004
Subject: [Bioperl-l] Perl Regular Expressions sought that match typical
	bioinformatics applications input data
In-Reply-To: <000c01c41372$07675af0$66125ec2@meemsheen>
References: <000c01c41372$07675af0$66125ec2@meemsheen>
Message-ID: <0B6F9996-7F70-11D8-961A-000A956845CE@stanfordalumni.org>

The closest to this that I know about is the set of Martel regular 
expressions in the Biopython package.  It has descriptions for many 
bioinformatics formats and can be used for verification, like you 
requested.  However, they are not Perl style regular expression.  You 
can probably convert them in an automated way, but then your small 
software project would be growing into a middling one...

Jeff


On Mar 26, 2004, at 3:36 PM, M.Shahid wrote:

> Dear All,
> For a small software project, I'm looking for Perl Regular
> Expressions that match file formats and related input used
> for bioinformatics applications, e.g. FASTA, Clustal,
> PROSITE patterns, PAM similarity matrices, etc. Note
> that I do not want to parse such data, just check whether
> the format is correct. Is there any compilation out there ?
> If not, what is the best starting point to collect such Regular
> Expressions ?
>
> Best Wishes,
>
> M. Shahid
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From paulo.david at netvisao.pt  Fri Mar 26 19:18:01 2004
From: paulo.david at netvisao.pt (Paulo Almeida)
Date: Fri Mar 26 19:23:19 2004
Subject: [Bioperl-l] Fetching > 500 sequences
In-Reply-To: <200403261055.22737.rnilsson@clarku.edu>
References: <200403261055.22737.rnilsson@clarku.edu>
Message-ID: <4064C839.4020208@netvisao.pt>

This is not Bioperl at all, but if you can't solve the problem and you 
really need all the sequences, you can try the WWW::Mechanize module. It 
allows you to submit forms and follow web links through Perl scripts, 
which means you should be able to submit the query and retrieve the 
results by following the "next page" link on each results page.You would 
have to develop your own way of parsing the results pages, so you 
probably don't want to try this method unless you fail to do it with 
Bioperl...

-Paulo Almeida

henrik nilsson wrote:

>Hi,
>	Thank you very much for your help.  I went through and took out any
>references to "retmax" in both Bio::DB::Query::GenBank and Bio::DB::GenBank
>and Bio::DB::NCBIHelper.  Our script first sends a query through
>Bio::DB::Query::GenBank and that works fine (it only returns the count found,
>and that count is > 7000).  However, we then actually query GenBank with
>Bio::DB::GenBank and it only returns 500 despite the fact it should be
>returning 7000+ (and we really want them all).  I compared both of the
>scripts which look very similar in what was passed to GenBank.
>
>	I was wondering if any other variables would cause the Entrez scripts to
> only return 500?  We do use mindate/maxdate and we fetch by query string (if
> that matters).  Any other ideas?
>
>Thanks again for all of your help.
>
>Rolf
>

From lstein at cshl.edu  Fri Mar 26 19:41:29 2004
From: lstein at cshl.edu (Lincoln Stein)
Date: Fri Mar 26 19:47:47 2004
Subject: [Bioperl-l] Fetching > 500 sequences
In-Reply-To: <4064C839.4020208@netvisao.pt>
References: <200403261055.22737.rnilsson@clarku.edu>
	<4064C839.4020208@netvisao.pt>
Message-ID: <200403261941.29100.lstein@cshl.edu>

There may be a bug in Bio::DB::GenBank, or Entrez may have changed its 
behavior again very slightly.  The interface between the two has always been 
quite fragile.  At some point I will work on this again and try to get it 
working, but first.... Have you tried the bioperl-live CVS version?

Lincoln


On Friday 26 March 2004 07:18 pm, Paulo Almeida wrote:
> This is not Bioperl at all, but if you can't solve the problem and you
> really need all the sequences, you can try the WWW::Mechanize module. It
> allows you to submit forms and follow web links through Perl scripts,
> which means you should be able to submit the query and retrieve the
> results by following the "next page" link on each results page.You would
> have to develop your own way of parsing the results pages, so you
> probably don't want to try this method unless you fail to do it with
> Bioperl...
>
> -Paulo Almeida
>
> henrik nilsson wrote:
> >Hi,
> >	Thank you very much for your help.  I went through and took out any
> >references to "retmax" in both Bio::DB::Query::GenBank and
> > Bio::DB::GenBank and Bio::DB::NCBIHelper.  Our script first sends a query
> > through
> >Bio::DB::Query::GenBank and that works fine (it only returns the count
> > found, and that count is > 7000).  However, we then actually query
> > GenBank with Bio::DB::GenBank and it only returns 500 despite the fact it
> > should be returning 7000+ (and we really want them all).  I compared both
> > of the scripts which look very similar in what was passed to GenBank.
> >
> >	I was wondering if any other variables would cause the Entrez scripts to
> > only return 500?  We do use mindate/maxdate and we fetch by query string
> > (if that matters).  Any other ideas?
> >
> >Thanks again for all of your help.
> >
> >Rolf
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
Lincoln Stein
lstein@cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)
From gllga at hotmail.com  Sat Mar 27 03:26:10 2004
From: gllga at hotmail.com (issac)
Date: Sat Mar 27 03:31:05 2004
Subject: [Bioperl-l] 
	Better than V_I_A_G_R_A and cheaper, too!      dan glenn
Message-ID: <1080375970-21093@excite.com>

C-i-a-l-i-s: Also known as Super V-i-a-g-r-a

One pill, all weekend!

http://mailman.medscustomer.com/sv/index.php?pid=eph9106


grandma memory minou nurse tequila design velvet miki cardinal trident abcd yoda mission bigmac snuffy venus tina carolina johnson tango valentin georgia abby sonny sylvie sunbirdmidori fugazi misha dead e-mail boogie concept concept sally cougars honda1 wolverin tattoo 
Get off: http://alpha1.wehavecheap.com/mm/chair.php
From sanges at biogem.it  Sat Mar 27 08:12:00 2004
From: sanges at biogem.it (sanges@biogem.it)
Date: Sat Mar 27 06:16:52 2004
Subject: [Bioperl-l] Fetching Fasta seqs from GenBank - Help request
In-Reply-To: <1080247112.7333.23.camel@tryps>
References: <BC889F19.622A%sdavis2@mail.nih.gov>
	<1080247112.7333.23.camel@tryps>
Message-ID: <1080393120.40657da06b94d@webmail.biogem.it>

: 

Alberto,
 
 you have an error in your code:
 
 my $query_string = ('Bothrops[Organism] AND
 ?ribosomal','Bothrops[Organism] AND mitochondrial');
 
 with this line you are putting an array into string,
 try to add this line
 
 print $query_string
 
 and see: you have only the last value in your query_string!
 
 If I understood well your need you should use a quesry like this:
 
 my $query_string = 'Bothrops[Organism] AND (ribosomal OR mitochondrial)';
 
 Remo
 
Quoting Alberto Davila <davila@ioc.fiocruz.br>:

> Hi Sean,
> 
> Thanks for your valuable help !
> 
> I solved the problem using "Bio::DB::Query::GenBank", my goal was to
> retrieve 2 types of sequences (mitochondrial and ribosomal) from
> specific organism (eg Bothrops spp)... I am listing my script for those
> interested to do something similar.. the only warning I get is:
> 
> [davila@tryps script]$ perl fetch2contaminant.pl
> Useless use of a constant in void context at fetch2contaminant.pl line
> 10.
> 
> I was not sure in which field (eg keyword or feature) I should look for
> ribosomal and mitochondrial genes, but leaving blank gave some good
> results.
> 
> Indeed Bioperl is powerful... a bit confusing for beginners too.
> 
> Thanks and best regards,
> 
> Alberto
> 
> 
> #!/usr/local/bin/perl -w
> 
> use lib "/usr/local/bioperl14";
> use strict;
> use Bio::DB::Query::GenBank;
> use Bio::SeqIO;
> use Bio::DB::GenBank;
> 
> 
> my $query_string = ('Bothrops[Organism] AND
> ribosomal','Bothrops[Organism] AND mitochondrial');
> my $query = new Bio::DB::Query::GenBank(-db=>'nucleotide',
>                                         -query=>$query_string,
> 		                        -mindate => '1985',
> 		                        -maxdate => '2004');
> 
> my $seqio=new Bio::DB::GenBank->get_Stream_by_query($query);
> 
> #open a seqio handle for writing the outputfile in fasta
> my $outfile = new Bio::SeqIO(-format=>'fasta',
>                              -file=>'>contaminant.bothrops');
> 				  
>  while (my $s = $seqio->next_seq) {
> 
> #write the fasta  
>    $outfile->write_seq($s);
>    
> 	}			  
> 				  
>  	  
> 	  exit;
> 
> 
> 
> 
> 
> 
> 
> On Thu, 2004-03-25 at 16:37, Sean Davis wrote:
> > Alberto,
> > 
> > I would second that.  If are doing more with this than retrieving raw
> > sequence (if you care at all), maybe you could let Barry and I know what
> you
> > are trying to do more generally.  Bioperl is quite powerful, but it does
> > take some direction to get started.
> > 
> > Sean
> > 
> > On 3/25/04 12:43 PM, "Barry Moore" <barry.moore@genetics.utah.edu> 
wrote:
> > 
> > > Alberto-
> > > 
> > > You said, "the 'get_Stream_by_id' is returning me more than the
> > > 'sequence per se'".  I'm not sure if this is what your asking, but 
I'll
> > > take a shot.  Since your are retrieving your two sequences in EMBL
> > > format, you get all the associated information that you would see if 
you
> > > 
> > > downloaded that same file from the web interface.  Your sequences are
> > > stored by BioPerl as RichSeq objects which inherits a PrimarySeq
> > > objects.  So that EMBL file data is stored in the RichSeq object and 
the
> > > 
> > > associated PrimarySeq object it inherited.   Of course when you save
> > > that locally as a fasta file, that extra information is lost.  If you
> > > decide you need to use that data have a look at the documentation for
> > > Bio::Seq::RichSeq and Bio::PrimarySeq and the SeqIO and Feature
> > > Annotation HOW TOs to learn more.
> > > 
> > > Barry
> > > 
> > > Alberto Davila wrote:
> > > 
> > >> Thanks Jason,
> > >> 
> > >> I installed the IO::String, then it is working fine now. However I 
have
> > >> a doubt, the "get_Stream_by_id" is returning me more than the 
"sequence
> > >> per se", what is it ? My script and results are listed below. Finally 
I
> > >> would like to save (in my local disk) the retrieved sequences as 
fasta
> > >> files... is there any argument for that ?
> > >> 
> > >> Thanks again, Alberto
> > >> 
> > >> 
> > >> #!/usr/local/bin/perl -w
> > >> 
> > >> use lib "/usr/local/bioperl14";
> > >> use Bio::DB::BioFetch;
> > >> use strict;
> > >> use Bio::DB::WebDBSeqI;
> > >> use HTTP::Request::Common 'POST';
> > >> 
> > >> my $format_type='fasta';
> > >> my $stream;
> > >> 
> > >> 
> > >> my $bf = new Bio::DB::BioFetch(-format        =>$format_type,
> > >>                               -retrievaltype =>'tempfile',
> > >>       -db            =>'EMBL');
> > >>  
> > >> $stream = $bf->get_Stream_by_id(['BUM','J00231']);
> > >> while (my $s = $stream->next_seq) {
> > >>    print $s->seq,"\n\n\n";
> > >> }              
> > >>  
> > >>  
> > >>  exit;
> > >> 
> > >> 
> > >> 
> > >> 
> > >> [davila@tryps script]$ perl gb-fetch-1.pl
> > >> 
agtagtgtactaccaagtatagataacgtttaaatattaaagttttggatcaaagccaaagatgattcgca
> > > t
> > >> 
gctggtgctgattgtagttacagctgcaagcccagtgtatcagagatgtttccaagatggggctatagtga
> > > a
> > >> gcaaaacccatccaaagaggcagtcacagaagtgtccctaaaagatgatgttagca
> > >> 
> > > 
> > >> 
> > > 
> > >> 
cctggacctcctgtgcaagaacatgaaacanctgtggttcttccttctcctggtggcagctcccagatggg
> > > t
> > >> 
cctgtcccaggtgcacctgcaggagtcgggcccaggactggggaagcctccagagctcaaaaccccacttg
> > > g
> > >> 
tgacacaactcacacatgcccacggtgcccagagcccaaatcttgtgacacacctcccccgtgcccacggt
> > > g
> > >> 
cccagagcccaaatcttgtgacacacctcccccatgcccacggtgcccagagcccaaatcttgtgacacac
> > > c
> > >> 
tcccccgtgcccnnngtgcccagcacctgaactcttgggaggaccgtcagtcttcctcttccccccaaaac
> > > c
> > >> 
caaggatacccttatgatttcccggacccctgaggtcacgtgcgtggtggtggacgtgagccacgaagacc
> > > c
> > >> 
nnnngtccagttcaagtggtacgtggacggcgtggaggtgcataatgccaagacaaagctgcgggaggagc
> > > a
> > >> 
gtacaacagcacgttccgtgtggtcagcgtcctcaccgtcctgcaccaggactggctgaacggcaaggagt
> > > a
> > >> 
caagtgcaaggtctccaacaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggacagc
> > > c
> > >> 
cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnngaggagatgaccaagaaccaagtcagcctgacct
> > > g
> > >> 
cctggtcaaaggcttctaccccagcgacatcgccgtggagtgggagagcaatgggcagccggagaacaact
> > > a
> > >> 
caacaccacgcctcccatgctggactccgacggctccttcttcctctacagcaagctcaccgtggacaaga
> > > g
> > >> 
caggtggcagcaggggaacatcttctcatgctccgtgatgcatgaggctctgcacaaccgctacacgcaga
> > > a
> > >> 
gagcctctccctgtctccgggtaaatgagtgccatggccggcaagcccccgctccccgggctctcggggtc
> > > g
> > >> 
cgcgaggatgcttggcacgtaccccgtgtacatacttcccaggcacccagcatggaaataaagcacccagc
> > > g
> > >> ctgccctgg
> > >> 
> > >> 
> > >> 
> > >> 
> > >> On Tue, 2004-03-23 at 22:44, Jason Stajich wrote:
> > >>  
> > >> 
> > >>> You need an additional perl module.
> > >>> 
> > >>> 
> > >>> install IO::String from CPAN
> > >>> 
> > >>> There is a section on how to install additional perl modules in the
> > >>> INSTALL document.
> > >>> 
> > >>> -j
> > >>> 
> > >>> On Tue, 23 Mar 2004, Alberto Davila wrote:
> > >>> 
> > >>>    
> > >>> 
> > >>>> Hi,
> > >>>> 
> > >>>> May I ask for some help ?
> > >>>> 
> > >>>> I am trying to use the BioFetch module in order to download several
> > > seqs
> > >>>> (from specific Organisms) from GenBank in fasta format, but looks
> > > like I
> > >>>> am missing "IO/String.pm" and other things.. should I install
> > > additional
> > >>>> bioperl modules (I have the Bioperl Core 1.4 installed) ? or use a
> > >>>> different module for my purpose ?
> > >>>> 
> > >>>> My script and error msg are listed below.
> > >>>> 
> > >>>> Thanks and besr regards,
> > >>>> 
> > >>>> Alberto
> > >>>> 
> > >>>> ****
> > >>>> 
> > >>>> #!/usr/local/bin/perl -w
> > >>>> 
> > >>>> use lib "/usr/local/bioperl14";
> > >>>> package Bio::DB::BioFetch;
> > >>>> use strict;
> > >>>> use Bio::DB::WebDBSeqI;
> > >>>> use HTTP::Request::Common 'POST';
> > >>>> 
> > >>>> my $format_type='fasta';
> > >>>> my $stream;
> > >>>> 
> > >>>> 
> > >>>> my $bf = new Bio::DB::BioFetch(-format        =>$format_type',
> > >>>>                               -retrievaltype =>'tempfile',
> > >>>>                               -db            =>'EMBL');
> > >>>> 
> > >>>> $stream = $bf->get_Stream_by_id(['BUM','J00231']);
> > >>>> while (my $s = $stream->next_seq) {
> > >>>>    print $s->seq,"\n";
> > >>>>        }
> > >>>> 
> > >>>> 
> > >>>>          exit;
> > >>>> 
> > >>>> 
> > >>>> [davila@tryps script]$ perl gb-fetch-1.pl
> > >>>> Can't locate IO/String.pm in @INC (@INC contains:
> > >>>> /usr/local/bioperl14/i386-linux-thread-multi /usr/local/bioperl14
> > >>>> /usr/lib/perl5/5.8.3/i386-linux-thread-multi /usr/lib/perl5/5.8.3
> > >>>> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
> > >>>> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi
> > >>>> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi
> > >>>> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
> > >>>> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2
> > >>>> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0
> > >>>> /usr/lib/perl5/site_perl
> > >>>> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
> > >>>> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi
> > >>>> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi
> > >>>> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
> > >>>> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2
> > >>>> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0
> > >>>> /usr/lib/perl5/vendor_perl .) at
> > >>>> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
> > >>>> BEGIN failed--compilation aborted at
> > >>>> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
> > >>>> Compilation failed in require at gb-fetch-1.pl line 6.
> > >>>> BEGIN failed--compilation aborted at gb-fetch-1.pl line 6.
> 
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 


From shak91 at comcast.net  Sat Mar 27 05:48:16 2004
From: shak91 at comcast.net (shak91@comcast.net)
Date: Sat Mar 27 08:38:08 2004
Subject: [Bioperl-l] Play George W. Bush Credibility Twister
Message-ID: <200403271338.i2RDc5g2020598@portal.open-bio.org>

Sonny Emerson <shak91@comcast.net> wants you to know about President Bush's deception on Iraq.

WHITE HOUSE PLAYS "TWISTER" WITH THE TRUTH -- NOW YOU CAN, TOO!

President Bush and his administration have been twisting the truth when it comes to Iraq. Can you figure out the truth?

Join in the truth-twisting action by playing our new game: George W. Bush Credibility Twister!

	http://www.democrats.org/truth/twister.html?s=taf

FACTS AND FUN FOR THE WHOLE FAMILY

George W. Bush Credibility Twister is an interactive, online game that exposes the facts with the click of a mouse: just how much the president, his advisers, and key agencies knew about the phony claim before it made it into Bush's State of the Union address.

Play George W. Bush Credibility Twister today to learn the facts about President Bush!

	http://www.democrats.org/truth/twister.html?s=taf

HOLD BUSH ACCOUNTABLE: DEMAND AN INVESTIGATION

After you've played the game, sign the online petition demanding an independent, bipartisan investigation into Bush's statement and the intelligence his administration used.

	http://www.democrats.org/truth/twister.html?s=taf

We will deliver your comments to Bush and Congressional leaders and tell them you want the full truth about Bush's deception.

	http://www.democrats.org/truth/twister.html?s=taf


From amackey at pcbi.upenn.edu  Sat Mar 27 09:27:18 2004
From: amackey at pcbi.upenn.edu (Aaron J. Mackey)
Date: Sat Mar 27 09:32:34 2004
Subject: [Bioperl-l] IUB/IUPAC nucleic acid codes
In-Reply-To: <ADEFIHIJAHBGLBNJCIADGEALCFAA.rrouse@biomail.ucsd.edu>
References: <ADEFIHIJAHBGLBNJCIADGEALCFAA.rrouse@biomail.ucsd.edu>
Message-ID: <DA13E509-7FFA-11D8-BD75-000A958C5008@pcbi.upenn.edu>


Bio::Tools::IUPAC already does all this.

-Aaron

On Mar 26, 2004, at 3:17 PM, Richard Rouse wrote:

> Allen,
>
> So perhaps the input can be a fasta formatted file containing these 
> types of
> bases and the output will also be fasta formatted having all the
> combinations:
>
> INPUT
>> accession1
> VVCCAGGAWC
>> accession2
> TGGCACRCWT
>> accession3
> TNGSNYTNWTV
> _____________________________
>
> OUTPUT
> accession1
>> 1
> sequence
>> 2
> sequence
> 	etc .....
> accession2
>> 1
> sequence
>> 2
> sequence
> 	etc .....
>
> Richard
>
>
> -----Original Message-----
> From: Allen Day [mailto:allenday@ucla.edu]
> Sent: Friday, March 26, 2004 11:25 AM
> To: Richard Rouse
> Cc: bioperl-l@bioperl.org
> Subject: Re: [Bioperl-l] IUB/IUPAC nucleic acid codes
>
>
> no, i was looking for this recently too.  in my own dev libs i've 
> already
> written the code that can go from concrete (ie, ATCG) sequence to
> degenerate if you supply the degenerate letters you want to allow.  i 
> have
> some code that does what you ask too.  i'll commit it into bioperl when
> it's a little more mature.
>
> can you give your use case, possibly with an example of how you'd like 
> to
> use the api?
>
> -allen
>
>
>
> On Fri, 26 Mar 2004, Richard Rouse wrote:
>
>>
>>
>> Does anyone know if bioperl has a tool that can generate all the 
>> possible
>> sequence combinations from a sequence like this:
>>
>> VMARAYTYKHKRAAAASABKTCCA
>>
>>
>> _______________________________________________
>> Bioperl-l mailing list
>> Bioperl-l@portal.open-bio.org
>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

From la1wson at hotmail.com  Sat Mar 27 07:21:52 2004
From: la1wson at hotmail.com (allen)
Date: Sat Mar 27 14:22:05 2004
Subject: [Bioperl-l] CAS1N0: New Deal!! 50% Sign Up Bonus! sweety rickys
Message-ID: <1080390112-26382@excite.com>

Come and play at the world's PREMIaERE ONLINqE CAScINO!
We are happy to offer you, in an elegant atmosphere, a 50% BONUeS for YOUR FIRST DEPOvSIT as a New Player.

Sign up now! Don't wait!

http://www.virtualcasinoes.net/_e4faa55afa1972493c43ac8a3f66f869/


We'll automatically CREDIqT your account with a 50% BONUcS of extra chips, to wager in our CASIxNO.
Don't hesiitate and try your luck!

CLICwK HERaE!

http://www.virtualcasinoes.net/_e4faa55afa1972493c43ac8a3f66f869/


zhongguo amanda1 bfi t-bone larry wright jeff cougars redwing ariane mars scarlett bfi philip lady lovely metallic cosmos goat blondie advil ladybug nimrod t-bone booster passion
From user121mailman at hotmail.com  Sat Mar 27 21:47:58 2004
From: user121mailman at hotmail.com (mel)
Date: Sat Mar 27 21:58:29 2004
Subject: [Bioperl-l] Biggest meedication site on net! Exceptional offers for
	you!
Message-ID: <1080442078-23032@excite.com>

Greatest VIAGcRA and CIiALiIS
Best price on ValiuVmm and XanaVhx
Quick weight loss (skidoo quebec) and antidepressant meedication!
Stupendous deals, 80 prcent off!
We ship worldwied!

Clik Here:
http://honda1.base4ds.com/s95/index.php?id=s95


wright october fool chaos petunia abby joel first 
grandma supra pacers active sally laura carolina memory 
mortimer hello1 china nautica racerx garnet passion shit beanie
From skirov at utk.edu  Sun Mar 28 13:35:14 2004
From: skirov at utk.edu (Stefan Kirov)
Date: Sun Mar 28 13:41:10 2004
Subject: [Bioperl-l] Is anyone attending RECOMB?
Message-ID: <40671AE2.6070108@utk.edu>

I was just wondering if anyone is attending RECOMB... We could just chat 
here about Bioperl.
Stefan Kirov
From liam at mmb.usyd.edu.au  Sun Mar 28 21:58:33 2004
From: liam at mmb.usyd.edu.au (Liam Elbourne)
Date: Sun Mar 28 22:03:11 2004
Subject: [Bioperl-l] Seq::IO/clustalw
Message-ID: <F74F0F38-812C-11D8-9135-0050E4A02235@mmb.usyd.edu.au>

Hi All,

Is there a quick way of substituting protein ids for accession codes. 
I've just done an clustalw alignment using DNA, and decided to use the  
protein sequences instead (not a very good alignment.......). Rather 
than substitute the protein ids manually, I wondered whether at the:

my $seqobj = $db->get_Stream_by_id(['x?????','af?????', etc, etc, etc);

step I could get the corresponding amino acid translation for the 
accession?. The sequences then get written to a file which is passed to 
clustalw.

     my $seq_out = Bio::SeqIO->new( -format => 'fasta',
                                  -file => "> $outfile");

     while( $aseq = $seqobj->next_seq() )
     {
        $seq_out->write_seq($aseq);
     }


Regards,
Liam Elbourne.

From postmaster at anudagroup.com  Sun Mar 28 23:51:14 2004
From: postmaster at anudagroup.com (Postmaster)
Date: Mon Mar 29 00:02:46 2004
Subject: [Bioperl-l] Undeliverable Mail
Message-ID: <10403291021.AA01400@anudagroup.com>

User mailbox exceeds allowed size: atulbhinge@anudagroup.com


Original message follows.

Received: from ip.eth.net [202.9.128.19] by anudagroup.com with ESMTP
  (SMTPD32-6.04) id AB3F42A01C0; Mon, 29 Mar 2004 10:21:11 +0530
Received: from mkcl.org ([203.129.231.13]) by ip.eth.net  with Microsoft SMTPSVC(5.5.1877.117.11);
	 Mon, 29 Mar 2004 10:28:43 +0530
From: bioperl-l@bioperl.org
To: llcpune4@mkcl.org
Subject: Re: Your software
Date: Mon, 29 Mar 2004 10:24:38 +0530
MIME-Version: 1.0
Content-Type: multipart/mixed;
	boundary="----=_NextPart_000_0004_00005ADF.000055E6"
X-Priority: 3
X-MSMail-Priority: Normal
Return-Path: bioperl-l@bioperl.org
Message-ID: <075f54358041d34PUNMAIL02@ip.eth.net>

This is a multi-part message in MIME format.

------=_NextPart_000_0004_00005ADF.000055E6
Content-Type: text/plain;
	charset="Windows-1252"
Content-Transfer-Encoding: 7bit

Please read the attached file.

------=_NextPart_000_0004_00005ADF.000055E6
Content-Type: application/octet-stream;
	name="application.pif"
Content-Transfer-Encoding: base64
Content-Disposition: attachment;
	filename="application.pif"

TVqQAAMAAAAEAAAA//8AALgAAAAAAAAAQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAuAAAAKvnXsbvhjCV74Ywle+GMJVsmj6V44YwlQeZOpX2hjCV74YxlbiGMJVsjm2V
4oYwlQeZO5XqhjCVV4A2le6GMJVSaWNo74YwlQAAAAAAAAAAQ29tcHJlc3NlZCBieSBQZXRp
dGUgKGMpMTk5OSBJYW4gTHVjay4AAFBFAABMAQMA6ZtBQAAAAAAAAAAA4AAPAQsBBgAASAAA
APAAAAAAAABCcAEAABAAAABgAAAAAEAAABAAAAACAAAEAAAAAAAAAAQAAAAAAAAAAIABAAAE
AAAAAAAAAgAAAAAAEAAAEAAAAAAQAAAQAAAAAAAAEAAAAAAAAAAAAAAA/HEBAK8BAAAAYAEA
EAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
LnBldGl0ZQAAUAEAABAAAAA8AAAACAAAAAAAAAAAAAAAAAAAYAAA4AAAAAAAAAAAABAAAABg
AQAQAAAAAEQAAAAAAAAAAAAAAAAAAEAAAEAAAAAAAAAAAKsDAAAAcAEAAAQAAAAEAAAAAAAA
AAAAAAAAAABgAADiAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

[message truncated]
From postmaster at ebi.ac.uk  Mon Mar 29 02:41:45 2004
From: postmaster at ebi.ac.uk (MailScanner)
Date: Mon Mar 29 02:46:57 2004
Subject: [Bioperl-l] Warning: E-mail containing attachment(s) blocked
Message-ID: <200403290741.i2T7fju13183@maui.ebi.ac.uk>

Our MailScanner has blocked a message that you sent:-
  To: birney@ebi.ac.uk
  Subject: Delivery (birney@ebi.ac.uk)
  Date: Mon Mar 29 08:41:45 2004

This is because it contains one or more attachments that we do not allow.
Please rename any attachments to be in the form: filename.clean and
resend the message (don't forget to tell the recipient the attachment type).

If you have not actually sent this email it is possible that it has been
forged (so that it appeared to be from you).

MailScanner Report:
 (data4865.zip)


-- 
MailScanner
Email Virus Scanner
www.mailscanner.info
From heikki at ebi.ac.uk  Mon Mar 29 05:33:04 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Mon Mar 29 05:38:09 2004
Subject: [Bioperl-l] Seq::IO/clustalw
In-Reply-To: <F74F0F38-812C-11D8-9135-0050E4A02235@mmb.usyd.edu.au>
References: <F74F0F38-812C-11D8-9135-0050E4A02235@mmb.usyd.edu.au>
Message-ID: <200403291133.05832.heikki@ebi.ac.uk>

Liam,

You do not say where you are getting the your sequences. If they are EMBL or 
GenBank files, you already have the translation as part of the annotation. 
You can use the following code to create protein sequence objects from them.

---------------------------------------------------------------------------
use Bio::SeqIO;
use Bio::PrimarySeq;
use strict;

my $in  = Bio::SeqIO->new(-file => shift);
my $seq = $in->next_seq;

my @translations;
foreach ( $seq->all_SeqFeatures ) {
    next unless $_->has_tag('translation');
    push @translations, $_->get_tag_values('translation');
}
# assuming that there is only one translation per sequence
die "ERROR..." unless @translations == 1;

my $prot = new Bio::PrimarySeq (-id => $seq->id,
                                -seq => @translations[0]);
---------------------------------------------------------------------------


The other possibility is to use EMBL sequences and fish out the Swiss-Prot 
corresponding ID from the feature table or dbxrefs, and do an other database 
query using it.

Yours,

	-Heikki
 
On Monday 29 Mar 2004 03:58, Liam Elbourne wrote:
> Hi All,
>
> Is there a quick way of substituting protein ids for accession codes.
> I've just done an clustalw alignment using DNA, and decided to use the
> protein sequences instead (not a very good alignment.......). Rather
> than substitute the protein ids manually, I wondered whether at the:
>
> my $seqobj = $db->get_Stream_by_id(['x?????','af?????', etc, etc, etc);
>
> step I could get the corresponding amino acid translation for the
> accession?. The sequences then get written to a file which is passed to
> clustalw.
>
>      my $seq_out = Bio::SeqIO->new( -format => 'fasta',
>                                   -file => "> $outfile");
>
>      while( $aseq = $seqobj->next_seq() )
>      {
>         $seq_out->write_seq($aseq);
>      }
>
>
> Regards,
> Liam Elbourne.
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From tina21electric at hotmail.com  Mon Mar 29 19:50:04 2004
From: tina21electric at hotmail.com (sheldon)
Date: Mon Mar 29 06:54:19 2004
Subject: [Bioperl-l] Be a GOD in the sack!      shit minou mike1
Message-ID: <1080607804-6986@excite.com>

Greatest deals on VIAGxRA and CI_AL_IS  (cosmos research)
As low as $3 a dose!!
Stupendous deals, 80 prcent off!
We ship worldwied!    (rux guess)

Clik Here:
http://river.wehavecheap.com/sv/index.php?pid=eph9106


depeche mantra abcd septembe tigre malcolm joel wright 
biology energy cccccc trumpet scarlett kiss fugazi moroni 
user1 flight redrum kleenex charlott wombat million light warriors
From dominique.vlieghe at dmbr.ugent.be  Mon Mar 29 07:20:55 2004
From: dominique.vlieghe at dmbr.ugent.be (Dominique Vlieghe)
Date: Mon Mar 29 07:26:08 2004
Subject: [Bioperl-l] "strand" line in bl2seq output
Message-ID: <406814A7.20909@dmbr.ugent.be>

Hello all,

I have a question regarding the parsing of bl2seq output (using 
Bio::Tools::BPbl2seq). In the bl2seq output there is a line for strand 
orientation (e.g. "Strand = Plus / Plus"). I looked at the modules that 
parse this file, but I don't see where those lines are processed. Is 
this correct? If it is, what's the alternative for getting this valuable 
information?

Regards

Dominique

-- 
-----------------------------
Save the Hubble petition: http://www.savethehubble.org
------------------------------
Dominique Vlieghe, Ph.D.,
Bioinformatics Core,
Department for Molecular Biomedical Research (DMBR)
VIB - Ghent University
'Fiers-Schell-Van Montagu' building
Technologiepark 927
B-9052 Ghent (Zwijnaarde), Belgium

Tel : +32-(0)9-33-13.693
Fax : +32-(0)9-33-13.609
email:dominique.vlieghe@dmbr.ugent.be
www:http://www.dmbr.ugent.be/
------------------------------

From brian_osborne at cognia.com  Mon Mar 29 08:16:20 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Mon Mar 29 08:24:48 2004
Subject: [Bioperl-l] "strand" line in bl2seq output
In-Reply-To: <406814A7.20909@dmbr.ugent.be>
Message-ID: <GAEDKMGOKFBLJPKCLKCCCEIFDJAA.brian_osborne@cognia.com>

Dominique,

I think this is discussed in the FAQ,
http://bioperl.org/Core/Latest/faq.html#Q3.5, and in the SearchIO HOW TO.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Dominique Vlieghe
Sent: Monday, March 29, 2004 7:21 AM
To: bioperl-l@portal.open-bio.org
Subject: [Bioperl-l] "strand" line in bl2seq output

Hello all,

I have a question regarding the parsing of bl2seq output (using
Bio::Tools::BPbl2seq). In the bl2seq output there is a line for strand
orientation (e.g. "Strand = Plus / Plus"). I looked at the modules that
parse this file, but I don't see where those lines are processed. Is
this correct? If it is, what's the alternative for getting this valuable
information?

Regards

Dominique

--
-----------------------------
Save the Hubble petition: http://www.savethehubble.org
------------------------------
Dominique Vlieghe, Ph.D.,
Bioinformatics Core,
Department for Molecular Biomedical Research (DMBR)
VIB - Ghent University
'Fiers-Schell-Van Montagu' building
Technologiepark 927
B-9052 Ghent (Zwijnaarde), Belgium

Tel : +32-(0)9-33-13.693
Fax : +32-(0)9-33-13.609
email:dominique.vlieghe@dmbr.ugent.be
www:http://www.dmbr.ugent.be/
------------------------------

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From dominique.vlieghe at dmbr.ugent.be  Mon Mar 29 08:28:20 2004
From: dominique.vlieghe at dmbr.ugent.be (Dominique Vlieghe)
Date: Mon Mar 29 08:33:30 2004
Subject: [Bioperl-l] And the correct answer is...
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCCEIFDJAA.brian_osborne@cognia.com>
References: <GAEDKMGOKFBLJPKCLKCCCEIFDJAA.brian_osborne@cognia.com>
Message-ID: <40682474.1040604@dmbr.ugent.be>

Hello all,

Thanks for your swift replies:  the correct way to do this is

use Bio::Tools::BPbl2seq;
my $report = *Bio::Tools::BPbl2seq*->new(-file => 'bl2seq.out', 
-report_type => 'blastn');

while(my $hsp = $report->next_feature) {
print
$hsp->query->strand."\n".
$hsp->sbjct->strand."\n";
}

(quite simple, in retrospect)

cheerz

D

Brian Osborne wrote:

>Dominique,
>
>I think this is discussed in the FAQ,
>http://bioperl.org/Core/Latest/faq.html#Q3.5, and in the SearchIO HOW TO.
>
>Brian O.
>
>-----Original Message-----
>From: bioperl-l-bounces@portal.open-bio.org
>[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Dominique Vlieghe
>Sent: Monday, March 29, 2004 7:21 AM
>To: bioperl-l@portal.open-bio.org
>Subject: [Bioperl-l] "strand" line in bl2seq output
>
>Hello all,
>
>I have a question regarding the parsing of bl2seq output (using
>Bio::Tools::BPbl2seq). In the bl2seq output there is a line for strand
>orientation (e.g. "Strand = Plus / Plus"). I looked at the modules that
>parse this file, but I don't see where those lines are processed. Is
>this correct? If it is, what's the alternative for getting this valuable
>information?
>
>Regards
>
>Dominique
>
>--
>-----------------------------
>Save the Hubble petition: http://www.savethehubble.org
>------------------------------
>Dominique Vlieghe, Ph.D.,
>Bioinformatics Core,
>Department for Molecular Biomedical Research (DMBR)
>VIB - Ghent University
>'Fiers-Schell-Van Montagu' building
>Technologiepark 927
>B-9052 Ghent (Zwijnaarde), Belgium
>
>Tel : +32-(0)9-33-13.693
>Fax : +32-(0)9-33-13.609
>email:dominique.vlieghe@dmbr.ugent.be
>www:http://www.dmbr.ugent.be/
>------------------------------
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
>  
>


-- 
-----------------------------
Save the Hubble petition: http://www.savethehubble.org
------------------------------
Dominique Vlieghe, Ph.D.,
Bioinformatics Core,
Department for Molecular Biomedical Research (DMBR)
VIB - Ghent University
'Fiers-Schell-Van Montagu' building
Technologiepark 927
B-9052 Ghent (Zwijnaarde), Belgium

Tel : +32-(0)9-33-13.693
Fax : +32-(0)9-33-13.609
email:dominique.vlieghe@dmbr.ugent.be
www:http://www.dmbr.ugent.be/
------------------------------

From sanges at biogem.it  Mon Mar 29 08:30:16 2004
From: sanges at biogem.it (Remo Sanges)
Date: Mon Mar 29 08:35:30 2004
Subject: [Bioperl-l] "strand" line in bl2seq output
In-Reply-To: <406814A7.20909@dmbr.ugent.be>
References: <406814A7.20909@dmbr.ugent.be>
Message-ID: <378263F2-8185-11D8-8E4B-000A95B95F88@biogem.it>

Hi Dominique,

I'm not using this module...
however you can access this information in such way

use Bio::Tools::BPbl2seq;
   my $report = Bio::Tools::BPbl2seq->new(-file => 'bl2seq.out', 
-report_type => 'blastn');

   while(my $hsp = $report->next_feature) {
	print
       $hsp->query->strand."\n".
       $hsp->sbjct->strand."\n";
  }

keep in mind that you must use report_type in order to have right 
parsing of strand

Remo


On Mar 29, 2004, at 2:20 PM, Dominique Vlieghe wrote:

> Hello all,
>
> I have a question regarding the parsing of bl2seq output (using 
> Bio::Tools::BPbl2seq). In the bl2seq output there is a line for strand 
> orientation (e.g. "Strand = Plus / Plus"). I looked at the modules 
> that parse this file, but I don't see where those lines are processed. 
> Is this correct? If it is, what's the alternative for getting this 
> valuable information?
>
> Regards
>
> Dominique
>
> -- 
> -----------------------------
> Save the Hubble petition: http://www.savethehubble.org
> ------------------------------
> Dominique Vlieghe, Ph.D.,
> Bioinformatics Core,
> Department for Molecular Biomedical Research (DMBR)
> VIB - Ghent University
> 'Fiers-Schell-Van Montagu' building
> Technologiepark 927
> B-9052 Ghent (Zwijnaarde), Belgium
>
> Tel : +32-(0)9-33-13.693
> Fax : +32-(0)9-33-13.609
> email:dominique.vlieghe@dmbr.ugent.be
> www:http://www.dmbr.ugent.be/
> ------------------------------
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

From brian_osborne at cognia.com  Mon Mar 29 08:33:26 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Mon Mar 29 08:38:49 2004
Subject: [Bioperl-l] Seq::IO/clustalw
In-Reply-To: <F74F0F38-812C-11D8-9135-0050E4A02235@mmb.usyd.edu.au>
Message-ID: <GAEDKMGOKFBLJPKCLKCCOEIFDJAA.brian_osborne@cognia.com>

Liam,

For the details on getting translations of nucleotide sequences when the
input files are EMBL or Genbank format please see the Feature-Annotation HOW
TO. There are different ways to do this but the basic idea is
"$feat_object->get_tag_values('translation'))". First get a SeqFeature
object, then extract data from it. If your input files aren't either of
these formats then it's a different problem.

Brian O.


-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Liam Elbourne
Sent: Sunday, March 28, 2004 9:59 PM
To: BioPerl
Subject: [Bioperl-l] Seq::IO/clustalw

Hi All,

Is there a quick way of substituting protein ids for accession codes.
I've just done an clustalw alignment using DNA, and decided to use the
protein sequences instead (not a very good alignment.......). Rather
than substitute the protein ids manually, I wondered whether at the:

my $seqobj = $db->get_Stream_by_id(['x?????','af?????', etc, etc, etc);

step I could get the corresponding amino acid translation for the
accession?. The sequences then get written to a file which is passed to
clustalw.

     my $seq_out = Bio::SeqIO->new( -format => 'fasta',
                                  -file => "> $outfile");

     while( $aseq = $seqobj->next_seq() )
     {
        $seq_out->write_seq($aseq);
     }


Regards,
Liam Elbourne.

_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From Gilles.Parmentier at ens-lyon.fr  Mon Mar 29 08:48:42 2004
From: Gilles.Parmentier at ens-lyon.fr (Gilles Parmentier)
Date: Mon Mar 29 08:56:33 2004
Subject: [Bioperl-l] desc in fasqta names
Message-ID: <4068293A.8010305@ens-lyon.fr>

Hello all,

I'm using SeqIO to transform a genbank file in a fasta file. I don't 
want to have the result of a desc call in the fasta name, how can I do 
that ? My problem seems to come from those lines in SeqIO/fasta.pm 
(write_seq) :

        if ($seq->can('desc') and my $desc = $seq->desc()) {
            $desc =~ s/\n//g;
            $top .= " $desc";
        }

How can I turn this feature off ?

Thanks in advance for your help,

Gilles
From sanges at biogem.it  Mon Mar 29 09:05:52 2004
From: sanges at biogem.it (Remo Sanges)
Date: Mon Mar 29 09:11:05 2004
Subject: [Bioperl-l] desc in fasqta names
In-Reply-To: <4068293A.8010305@ens-lyon.fr>
References: <4068293A.8010305@ens-lyon.fr>
Message-ID: <303C48B4-818A-11D8-8E4B-000A95B95F88@biogem.it>

Gilles,

function desc 'GET/SET description for your sequence'
so you can easily put a blank description in your script:

$seq->desc('');


Remo

On Mar 29, 2004, at 3:48 PM, Gilles Parmentier wrote:

> Hello all,
>
> I'm using SeqIO to transform a genbank file in a fasta file. I don't 
> want to have the result of a desc call in the fasta name, how can I do 
> that ? My problem seems to come from those lines in SeqIO/fasta.pm 
> (write_seq) :
>
>        if ($seq->can('desc') and my $desc = $seq->desc()) {
>            $desc =~ s/\n//g;
>            $top .= " $desc";
>        }
>
> How can I turn this feature off ?
>
> Thanks in advance for your help,
>
> Gilles
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

From mjohnson at watson.wustl.edu  Mon Mar 29 11:39:35 2004
From: mjohnson at watson.wustl.edu (Mark Johnson)
Date: Mon Mar 29 11:44:46 2004
Subject: [Bioperl-l] Bio::Index::Fasta, Bio::Index::Qual
Message-ID: <56765.10.0.1.216.1080578375.squirrel@watson.wustl.edu>

Bio::Index::Fasta is great, but there is no corresponding
Bio::Index::Qual...I created one via subsitution various permutations of
'qual' for 'fasta' on Bio::Index::Fasta.

Unless I'm the only person that thinks Bio::Index::Qual might be useful,
is there a chance of getting such a thing into the next release?

Motivation:
I maintain a piece of a makeshift submission pipeline for sending BAC end
reads to the NCBI trace archive (http://www.ncbi.nlm.nih.gov/Traces).  I
process files containing nucleotide sequence and quality values...the
nucleotide sequence being in a seperate file from the quality values.  The
access pattern is random with respect to the ordering of the sequences
within the individual files.  So Bio::Index::Fasta is very handy, as is my
cobbled together Bio::Index::Qual.  Which works fine...I just don't like
maintaining a seperate local module...

Thoughts, suggestions, comments, rotten fruit?
From brian_osborne at cognia.com  Mon Mar 29 11:50:35 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Mon Mar 29 11:56:12 2004
Subject: [Bioperl-l] Bio::Index::Fasta, Bio::Index::Qual
In-Reply-To: <56765.10.0.1.216.1080578375.squirrel@watson.wustl.edu>
Message-ID: <GAEDKMGOKFBLJPKCLKCCKEIODJAA.brian_osborne@cognia.com>

Mark,

If there are no objections you can submit this as a "bug" to our Bugzilla
with the attached file - I'd be happy to add it to the distribution for you.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Mark Johnson
Sent: Monday, March 29, 2004 11:40 AM
To: bioperl-l@portal.open-bio.org
Subject: [Bioperl-l] Bio::Index::Fasta, Bio::Index::Qual

Bio::Index::Fasta is great, but there is no corresponding
Bio::Index::Qual...I created one via subsitution various permutations of
'qual' for 'fasta' on Bio::Index::Fasta.

Unless I'm the only person that thinks Bio::Index::Qual might be useful,
is there a chance of getting such a thing into the next release?

Motivation:
I maintain a piece of a makeshift submission pipeline for sending BAC end
reads to the NCBI trace archive (http://www.ncbi.nlm.nih.gov/Traces).  I
process files containing nucleotide sequence and quality values...the
nucleotide sequence being in a seperate file from the quality values.  The
access pattern is random with respect to the ordering of the sequences
within the individual files.  So Bio::Index::Fasta is very handy, as is my
cobbled together Bio::Index::Qual.  Which works fine...I just don't like
maintaining a seperate local module...

Thoughts, suggestions, comments, rotten fruit?
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From davila at ioc.fiocruz.br  Sat Mar 27 10:08:18 2004
From: davila at ioc.fiocruz.br (Alberto Davila)
Date: Mon Mar 29 13:00:11 2004
Subject: [Bioperl-l] Fetching Fasta seqs from GenBank - Help request
In-Reply-To: <1080393120.40657da06b94d@webmail.biogem.it>
References: <BC889F19.622A%sdavis2@mail.nih.gov>
	<1080247112.7333.23.camel@tryps>
	<1080393120.40657da06b94d@webmail.biogem.it>
Message-ID: <1080400097.3129.208.camel@tryps>

Hi Remo,

Thanks for catching this, so Sean was right as well... I was confused
because the first "$query_string" was returning me sequences with
"ribosomal" words... I just realized they were retrieved because they
have the "mitochondrial" word together:

>AY223674 Bothrops jararacussu specimen-voucher DPL 104 16S ribosomal
RNA gene,
partial sequence; mitochondrial gene for mitochondrial product.

>AY223673 Bothrops alternatus specimen-voucher DPL 2879 16S ribosomal
RNA gene,
partial sequence; mitochondrial gene for mitochondrial product.

I am now "querying" "ribosomal" and "mitochondrial" genes in the
"[title]" field... then, things are working ok now. Thanks !

Alberto


On Sat, 2004-03-27 at 10:12, sanges@biogem.it wrote:
> : 
> 
> Alberto,
>  
>  you have an error in your code:
>  
>  my $query_string = ('Bothrops[Organism] AND
>   ribosomal','Bothrops[Organism] AND mitochondrial');
>  
>  with this line you are putting an array into string,
>  try to add this line
>  
>  print $query_string
>  
>  and see: you have only the last value in your query_string!
>  
>  If I understood well your need you should use a quesry like this:
>  
>  my $query_string = 'Bothrops[Organism] AND (ribosomal OR mitochondrial)';
>  
>  Remo
>  
> Quoting Alberto Davila <davila@ioc.fiocruz.br>:
> 
> > Hi Sean,
> > 
> > Thanks for your valuable help !
> > 
> > I solved the problem using "Bio::DB::Query::GenBank", my goal was to
> > retrieve 2 types of sequences (mitochondrial and ribosomal) from
> > specific organism (eg Bothrops spp)... I am listing my script for those
> > interested to do something similar.. the only warning I get is:
> > 
> > [davila@tryps script]$ perl fetch2contaminant.pl
> > Useless use of a constant in void context at fetch2contaminant.pl line
> > 10.
> > 
> > I was not sure in which field (eg keyword or feature) I should look for
> > ribosomal and mitochondrial genes, but leaving blank gave some good
> > results.
> > 
> > Indeed Bioperl is powerful... a bit confusing for beginners too.
> > 
> > Thanks and best regards,
> > 
> > Alberto
> > 
> > 
> > #!/usr/local/bin/perl -w
> > 
> > use lib "/usr/local/bioperl14";
> > use strict;
> > use Bio::DB::Query::GenBank;
> > use Bio::SeqIO;
> > use Bio::DB::GenBank;
> > 
> > 
> > my $query_string = ('Bothrops[Organism] AND
> > ribosomal','Bothrops[Organism] AND mitochondrial');
> > my $query = new Bio::DB::Query::GenBank(-db=>'nucleotide',
> >                                         -query=>$query_string,
> > 		                        -mindate => '1985',
> > 		                        -maxdate => '2004');
> > 
> > my $seqio=new Bio::DB::GenBank->get_Stream_by_query($query);
> > 
> > #open a seqio handle for writing the outputfile in fasta
> > my $outfile = new Bio::SeqIO(-format=>'fasta',
> >                              -file=>'>contaminant.bothrops');
> > 				  
> >  while (my $s = $seqio->next_seq) {
> > 
> > #write the fasta  
> >    $outfile->write_seq($s);
> >    
> > 	}			  
> > 				  
> >  	  
> > 	  exit;
> > 
> > 
> > 
> > 
> > 
> > 
> > 
> > On Thu, 2004-03-25 at 16:37, Sean Davis wrote:
> > > Alberto,
> > > 
> > > I would second that.  If are doing more with this than retrieving raw
> > > sequence (if you care at all), maybe you could let Barry and I know what
> > you
> > > are trying to do more generally.  Bioperl is quite powerful, but it does
> > > take some direction to get started.
> > > 
> > > Sean
> > > 
> > > On 3/25/04 12:43 PM, "Barry Moore" <barry.moore@genetics.utah.edu> 
> wrote:
> > > 
> > > > Alberto-
> > > > 
> > > > You said, "the 'get_Stream_by_id' is returning me more than the
> > > > 'sequence per se'".  I'm not sure if this is what your asking, but 
> I'll
> > > > take a shot.  Since your are retrieving your two sequences in EMBL
> > > > format, you get all the associated information that you would see if 
> you
> > > > 
> > > > downloaded that same file from the web interface.  Your sequences are
> > > > stored by BioPerl as RichSeq objects which inherits a PrimarySeq
> > > > objects.  So that EMBL file data is stored in the RichSeq object and 
> the
> > > > 
> > > > associated PrimarySeq object it inherited.   Of course when you save
> > > > that locally as a fasta file, that extra information is lost.  If you
> > > > decide you need to use that data have a look at the documentation for
> > > > Bio::Seq::RichSeq and Bio::PrimarySeq and the SeqIO and Feature
> > > > Annotation HOW TOs to learn more.
> > > > 
> > > > Barry
> > > > 
> > > > Alberto Davila wrote:
> > > > 
> > > >> Thanks Jason,
> > > >> 
> > > >> I installed the IO::String, then it is working fine now. However I 
> have
> > > >> a doubt, the "get_Stream_by_id" is returning me more than the 
> "sequence
> > > >> per se", what is it ? My script and results are listed below. Finally 
> I
> > > >> would like to save (in my local disk) the retrieved sequences as 
> fasta
> > > >> files... is there any argument for that ?
> > > >> 
> > > >> Thanks again, Alberto
> > > >> 
> > > >> 
> > > >> #!/usr/local/bin/perl -w
> > > >> 
> > > >> use lib "/usr/local/bioperl14";
> > > >> use Bio::DB::BioFetch;
> > > >> use strict;
> > > >> use Bio::DB::WebDBSeqI;
> > > >> use HTTP::Request::Common 'POST';
> > > >> 
> > > >> my $format_type='fasta';
> > > >> my $stream;
> > > >> 
> > > >> 
> > > >> my $bf = new Bio::DB::BioFetch(-format        =>$format_type,
> > > >>                               -retrievaltype =>'tempfile',
> > > >>       -db            =>'EMBL');
> > > >>  
> > > >> $stream = $bf->get_Stream_by_id(['BUM','J00231']);
> > > >> while (my $s = $stream->next_seq) {
> > > >>    print $s->seq,"\n\n\n";
> > > >> }              
> > > >>  
> > > >>  
> > > >>  exit;
> > > >> 
> > > >> 
> > > >> 
> > > >> 
> > > >> [davila@tryps script]$ perl gb-fetch-1.pl
> > > >> 
> agtagtgtactaccaagtatagataacgtttaaatattaaagttttggatcaaagccaaagatgattcgca
> > > > t
> > > >> 
> gctggtgctgattgtagttacagctgcaagcccagtgtatcagagatgtttccaagatggggctatagtga
> > > > a
> > > >> gcaaaacccatccaaagaggcagtcacagaagtgtccctaaaagatgatgttagca
> > > >> 
> > > > 
> > > >> 
> > > > 
> > > >> 
> cctggacctcctgtgcaagaacatgaaacanctgtggttcttccttctcctggtggcagctcccagatggg
> > > > t
> > > >> 
> cctgtcccaggtgcacctgcaggagtcgggcccaggactggggaagcctccagagctcaaaaccccacttg
> > > > g
> > > >> 
> tgacacaactcacacatgcccacggtgcccagagcccaaatcttgtgacacacctcccccgtgcccacggt
> > > > g
> > > >> 
> cccagagcccaaatcttgtgacacacctcccccatgcccacggtgcccagagcccaaatcttgtgacacac
> > > > c
> > > >> 
> tcccccgtgcccnnngtgcccagcacctgaactcttgggaggaccgtcagtcttcctcttccccccaaaac
> > > > c
> > > >> 
> caaggatacccttatgatttcccggacccctgaggtcacgtgcgtggtggtggacgtgagccacgaagacc
> > > > c
> > > >> 
> nnnngtccagttcaagtggtacgtggacggcgtggaggtgcataatgccaagacaaagctgcgggaggagc
> > > > a
> > > >> 
> gtacaacagcacgttccgtgtggtcagcgtcctcaccgtcctgcaccaggactggctgaacggcaaggagt
> > > > a
> > > >> 
> caagtgcaaggtctccaacaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggacagc
> > > > c
> > > >> 
> cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnngaggagatgaccaagaaccaagtcagcctgacct
> > > > g
> > > >> 
> cctggtcaaaggcttctaccccagcgacatcgccgtggagtgggagagcaatgggcagccggagaacaact
> > > > a
> > > >> 
> caacaccacgcctcccatgctggactccgacggctccttcttcctctacagcaagctcaccgtggacaaga
> > > > g
> > > >> 
> caggtggcagcaggggaacatcttctcatgctccgtgatgcatgaggctctgcacaaccgctacacgcaga
> > > > a
> > > >> 
> gagcctctccctgtctccgggtaaatgagtgccatggccggcaagcccccgctccccgggctctcggggtc
> > > > g
> > > >> 
> cgcgaggatgcttggcacgtaccccgtgtacatacttcccaggcacccagcatggaaataaagcacccagc
> > > > g
> > > >> ctgccctgg
> > > >> 
> > > >> 
> > > >> 
> > > >> 
> > > >> On Tue, 2004-03-23 at 22:44, Jason Stajich wrote:
> > > >>  
> > > >> 
> > > >>> You need an additional perl module.
> > > >>> 
> > > >>> 
> > > >>> install IO::String from CPAN
> > > >>> 
> > > >>> There is a section on how to install additional perl modules in the
> > > >>> INSTALL document.
> > > >>> 
> > > >>> -j
> > > >>> 
> > > >>> On Tue, 23 Mar 2004, Alberto Davila wrote:
> > > >>> 
> > > >>>    
> > > >>> 
> > > >>>> Hi,
> > > >>>> 
> > > >>>> May I ask for some help ?
> > > >>>> 
> > > >>>> I am trying to use the BioFetch module in order to download several
> > > > seqs
> > > >>>> (from specific Organisms) from GenBank in fasta format, but looks
> > > > like I
> > > >>>> am missing "IO/String.pm" and other things.. should I install
> > > > additional
> > > >>>> bioperl modules (I have the Bioperl Core 1.4 installed) ? or use a
> > > >>>> different module for my purpose ?
> > > >>>> 
> > > >>>> My script and error msg are listed below.
> > > >>>> 
> > > >>>> Thanks and besr regards,
> > > >>>> 
> > > >>>> Alberto
> > > >>>> 
> > > >>>> ****
> > > >>>> 
> > > >>>> #!/usr/local/bin/perl -w
> > > >>>> 
> > > >>>> use lib "/usr/local/bioperl14";
> > > >>>> package Bio::DB::BioFetch;
> > > >>>> use strict;
> > > >>>> use Bio::DB::WebDBSeqI;
> > > >>>> use HTTP::Request::Common 'POST';
> > > >>>> 
> > > >>>> my $format_type='fasta';
> > > >>>> my $stream;
> > > >>>> 
> > > >>>> 
> > > >>>> my $bf = new Bio::DB::BioFetch(-format        =>$format_type',
> > > >>>>                               -retrievaltype =>'tempfile',
> > > >>>>                               -db            =>'EMBL');
> > > >>>> 
> > > >>>> $stream = $bf->get_Stream_by_id(['BUM','J00231']);
> > > >>>> while (my $s = $stream->next_seq) {
> > > >>>>    print $s->seq,"\n";
> > > >>>>        }
> > > >>>> 
> > > >>>> 
> > > >>>>          exit;
> > > >>>> 
> > > >>>> 
> > > >>>> [davila@tryps script]$ perl gb-fetch-1.pl
> > > >>>> Can't locate IO/String.pm in @INC (@INC contains:
> > > >>>> /usr/local/bioperl14/i386-linux-thread-multi /usr/local/bioperl14
> > > >>>> /usr/lib/perl5/5.8.3/i386-linux-thread-multi /usr/lib/perl5/5.8.3
> > > >>>> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
> > > >>>> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi
> > > >>>> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi
> > > >>>> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
> > > >>>> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2
> > > >>>> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0
> > > >>>> /usr/lib/perl5/site_perl
> > > >>>> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
> > > >>>> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi
> > > >>>> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi
> > > >>>> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
> > > >>>> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2
> > > >>>> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0
> > > >>>> /usr/lib/perl5/vendor_perl .) at
> > > >>>> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
> > > >>>> BEGIN failed--compilation aborted at
> > > >>>> /usr/local/bioperl14/Bio/DB/WebDBSeqI.pm line 90.
> > > >>>> Compilation failed in require at gb-fetch-1.pl line 6.
> > > >>>> BEGIN failed--compilation aborted at gb-fetch-1.pl line 6.
> > 
> > 
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> > 


From sanges at biogem.it  Mon Mar 29 07:44:37 2004
From: sanges at biogem.it (Remo Sanges)
Date: Mon Mar 29 13:02:19 2004
Subject: [Bioperl-l] "strand" line in bl2seq output
In-Reply-To: <406814A7.20909@dmbr.ugent.be>
References: <406814A7.20909@dmbr.ugent.be>
Message-ID: <D6D7465C-817E-11D8-8E4B-000A95B95F88@biogem.it>

Hi Dominique,

I'm not using this module...
however you can access this information in such way

use Bio::Tools::BPbl2seq;
   my $report = Bio::Tools::BPbl2seq->new(-file => 'bl2seq.out', 
-report_type => 'blastn');

   while(my $hsp = $report->next_feature) {
	print
       $hsp->query->strand."\n".
       $hsp->sbjct->strand."\n";
  }

keep in mind that you must use report_type in order to have right 
parsing of strand

Remo

On Mar 29, 2004, at 2:20 PM, Dominique Vlieghe wrote:

> Hello all,
>
> I have a question regarding the parsing of bl2seq output (using 
> Bio::Tools::BPbl2seq). In the bl2seq output there is a line for strand 
> orientation (e.g. "Strand = Plus / Plus"). I looked at the modules 
> that parse this file, but I don't see where those lines are processed. 
> Is this correct? If it is, what's the alternative for getting this 
> valuable information?
>
> Regards
>
> Dominique
>
> -- 
> -----------------------------
> Save the Hubble petition: http://www.savethehubble.org
> ------------------------------
> Dominique Vlieghe, Ph.D.,
> Bioinformatics Core,
> Department for Molecular Biomedical Research (DMBR)
> VIB - Ghent University
> 'Fiers-Schell-Van Montagu' building
> Technologiepark 927
> B-9052 Ghent (Zwijnaarde), Belgium
>
> Tel : +32-(0)9-33-13.693
> Fax : +32-(0)9-33-13.609
> email:dominique.vlieghe@dmbr.ugent.be
> www:http://www.dmbr.ugent.be/
> ------------------------------
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: text/enriched
Size: 1718 bytes
Desc: not available
Url : http://portal.open-bio.org/pipermail/bioperl-l/attachments/20040329/e5b18675/attachment.bin
From mjohnson at watson.wustl.edu  Mon Mar 29 15:45:16 2004
From: mjohnson at watson.wustl.edu (Mark Johnson)
Date: Mon Mar 29 15:50:31 2004
Subject: [Bioperl-l] Bio::Index::Fasta, Bio::Index::Qual
In-Reply-To: <GAEDKMGOKFBLJPKCLKCCKEIODJAA.brian_osborne@cognia.com>
References: <56765.10.0.1.216.1080578375.squirrel@watson.wustl.edu>
	<GAEDKMGOKFBLJPKCLKCCKEIODJAA.brian_osborne@cognia.com>
Message-ID: <57612.10.0.1.216.1080593116.squirrel@watson.wustl.edu>

Done.  Thanks!

> Mark,
>
> If there are no objections you can submit this as a "bug" to our Bugzilla
> with the attached file - I'd be happy to add it to the distribution for
> you.
>
> Brian O.
>
> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org
> [mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of Mark Johnson
> Sent: Monday, March 29, 2004 11:40 AM
> To: bioperl-l@portal.open-bio.org
> Subject: [Bioperl-l] Bio::Index::Fasta, Bio::Index::Qual
>
> Bio::Index::Fasta is great, but there is no corresponding
> Bio::Index::Qual...I created one via subsitution various permutations of
> 'qual' for 'fasta' on Bio::Index::Fasta.
>
> Unless I'm the only person that thinks Bio::Index::Qual might be useful,
> is there a chance of getting such a thing into the next release?
>
> Motivation:
> I maintain a piece of a makeshift submission pipeline for sending BAC end
> reads to the NCBI trace archive (http://www.ncbi.nlm.nih.gov/Traces).  I
> process files containing nucleotide sequence and quality values...the
> nucleotide sequence being in a seperate file from the quality values.  The
> access pattern is random with respect to the ordering of the sequences
> within the individual files.  So Bio::Index::Fasta is very handy, as is my
> cobbled together Bio::Index::Qual.  Which works fine...I just don't like
> maintaining a seperate local module...
>
> Thoughts, suggestions, comments, rotten fruit?
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>

From bogus@does.not.exist.com  Mon Mar 29 16:12:48 2004
From: bogus@does.not.exist.com (Content Filter)
Date: Mon Mar 29 16:18:06 2004
Subject: [Bioperl-l] Undeliverable message returned to sender
Message-ID: <20040329211248.5A4B06BB2B@mail17-ash-R.bigfish.com>

This message was created automatically by mail delivery software.

Delivery failed for the following recipients(s):
 	some@more.com

The message you sent contained an attachment which the recipient has chosen to block.
Usually these sort of attachments are blocked to prevent malicious software from
being sent to the recipient in question.

The name(s) of the blocked file(s) follow:
	talk.zip

Please contact the intended recipient to determine alternate ways to deliver this file.

----- Original Message Header -----
Received: by mail17-ash (MessageSwitch) id 108059476821327_31682; Mon, 29 Mar 2004 21:12:48 +0000 (UCT)
Received: from more.com (unknown [193.140.61.101])
	by mail17-ash.bigfish.com (Postfix) with SMTP id 26C146BB2B
	for <some@more.com>; Mon, 29 Mar 2004 21:12:39 +0000 (UCT)
From: bioperl-l@bioperl.org
To: some@more.com
Subject: hi
Date: Mon, 29 Mar 2004 13:12:52 -0800
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="52021680"
Message-Id: <20040329211239.26C146BB2B@mail17-ash.bigfish.com>
From hlapp at gnf.org  Mon Mar 29 16:22:40 2004
From: hlapp at gnf.org (Hilmar Lapp)
Date: Mon Mar 29 16:27:44 2004
Subject: [Bioperl-l] hash key for SeqFeatureI
In-Reply-To: <4062F52E.5000705@egenetics.com>
Message-ID: <35C435B2-81C7-11D8-AB71-000A959EB4C4@gnf.org>


On Thursday, March 25, 2004, at 07:05  AM, Peter van Heusden wrote:

> for SeqFeatureI, I was wondering whether the following is adaquate: 
> primary_tag(), display_id(), location->start(), location->end()?

You will also need strand and source_tag, whereas display_id has no 
bearing on uniqueness.

You may want to check out ChrisM's recent generate_unique_ID code and 
hash the resulting string.

> The point of a hash_key() method is to return a string which uniquely 
> identifies the object - i.e. if two hash keys are identical, then the 
> objects are equivalent.

Be very careful with this one though. What is the hash function you're 
going to use? Almost all hash functions define inequality directly, but 
not equality. I.e., if two objects have different hash codes, they must 
compare unequal, but two objects returning the same hash code may still 
compare as unequal.

	-hilmar
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------


From pjun at cc.ucsf.edu  Mon Mar 29 18:31:39 2004
From: pjun at cc.ucsf.edu (Peter Jun)
Date: Mon Mar 29 18:36:55 2004
Subject: [Bioperl-l] Skipping invalid entries in GenBank
Message-ID: <200403292336.i2TNaqg2018578@portal.open-bio.org>

I'm trying to create a script that will take a list of sequence names, and
pull the fasta sequence.  Occasionally, I will get an error message like
this when the sequence that I'm trying to retrieve doesn't exist:

------------- EXCEPTION  -------------
MSG: WebDBSeqI Error - check query sequences!

STACK Bio::DB::WebDBSeqI::get_seq_stream
C:/Perl/site/lib/Bio/DB/WebDBSeqI.pm:476
STACK Bio::DB::NCBIHelper::get_Stream_by_query
C:/Perl/site/lib/Bio/DB/NCBIHelper.pm:249
STACK toplevel E:\perlprogs\pullseq\Search.pl:32

--------------------------------------

Anyway to get around this?

Thanks!

From support at e-press.com  Tue Mar 30 04:09:10 2004
From: support at e-press.com (Auto-reply from support@e-press.com)
Date: Tue Mar 30 04:09:21 2004
Subject: [Bioperl-l] Re: read it immediately
In-Reply-To: <20040330090907.LSJP2492.mxsmta01.dellhost.com@e-press.com>
Message-ID: <20040330090910.LSJU2492.mxsmta01.dellhost.com@mxsmta01>

Your message has been received by the Technical Support department at E-Press Corp. and will be processed as quickly as possible. If an immediate answer is possible it will be provided within a few hours -- ie, we will quickly write back to you. If a technical specialist in the company needs to be consulted in a particular domain, please allow 1 business day for a reply.

We care about our users. We write back to everyone, usually quite promptly. (If you don't get a reply, check our FAQ page at www.e-press.com -- it gives reasons why you would not have received a reply. Common reason -- your ISP is rejecting our e-mails. Another common reason -- your e-mail has been flagged as spam and deleted.)

 FOR REGISTRATION REQUESTS -- Please allow 1 business day for your registration to be processed and a Final Registration Code to be e-mailed back to you.
 
Useful resource: Go to our web page ( http://www.e-press.com/support.htm) and click the 'Forum' selection. You can ask questions on the Forum. Or perhaps someone has already written about the issue you are experiencing.

 Thank you. We care about you, the user of our software.
 
 E-Press Technical Support :)
 

From postmaster at 4cplus.com  Tue Mar 30 04:25:51 2004
From: postmaster at 4cplus.com (Postmaster)
Date: Tue Mar 30 04:53:45 2004
Subject: [Bioperl-l] Undeliverable Mail
Message-ID: <10403301455.AA03492@4cplus.com>

User mailbox exceeds allowed size: hoatari@atariinfo.com


Original message follows.

Received: from 4C.4cplus.com [203.200.89.66] by 4cplus.com
  (SMTPD32-6.06) id AD1DEF02BE; Tue, 30 Mar 2004 14:55:49 +0530
From: bioperl-l@bioperl.org
To: ho@atariinfo.com
Subject: Re: Your bill
Date: Tue, 16 Mar 2004 14:59:07 +0530
MIME-Version: 1.0
X-Priority: 3
X-MSMail-Priority: Normal
Content-Type: multipart/mixed;
	boundary="----=_NextPart_000_0005_00002729.0000444B"
Message-Id: <200403301455665.SM01164@4C.4cplus.com>

This is a multi-part message in MIME format.


------=_NextPart_000_0005_00002729.0000444B
Content-Type: text/plain;
	charset="Windows-1252"
Content-Transfer-Encoding: 7bit

Your file is attached.

------=_NextPart_000_0005_00002729.0000444B
Content-Type: text/plain;
	name="InterScan_SafeStamp.txt"
Content-Transfer-Encoding: 7bit
Content-Disposition: attachment;
	filename="InterScan_SafeStamp.txt"

****** Message from InterScan E-Mail VirusWall NT ******

** WARNING! Attached file your_bill.pif contains:

     WORM_NETSKY.D virus

   It has been deleted.
*****************     End of message     ***************


------=_NextPart_000_0005_00002729.0000444B--

        ---ooo---   Internet Confidentiality Statement   ---ooo--- 

The information contained in this communication is confidential and is 
intended only for the use of the recipient named above, and may be legally 
privileged and exempt from disclosure under applicable law.  If the reader 
of this message is not the intended recipient, you are hereby notified that 
any dissemination, distribution or copying of this communication is 
strictly prohibited.  If you have received this communication in error, 
please resend it to the sender and delete the original message and any copy 
of it from your computer system.  Opinions, conclusions and other 
information in this message that do not relate to our official business 
should be understood as neither given nor endorsed by this company. 
From heikki at ebi.ac.uk  Tue Mar 30 05:26:07 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Tue Mar 30 05:31:25 2004
Subject: [Bioperl-l] Skipping invalid entries in GenBank
In-Reply-To: <200403292336.i2TNaqg2018578@portal.open-bio.org>
References: <200403292336.i2TNaqg2018578@portal.open-bio.org>
Message-ID: <200403301126.07638.heikki@ebi.ac.uk>

Peter,

You can wrap your calls in eval statements to recover from error, e.g in a 
loop:

eval {
    $seq = $gb->get_Seq_by_acc('J00522');
};  # the ';' character is needed here
next if $@;
# proceed working with $seq ...


It would be cleaner for modules to do this automatically. Anyone care to start 
refactoring Bio::DB modules? One way would be make them to follow 
Bio::SimpleAnalysisI rules.

	-Heikki


On Tuesday 30 Mar 2004 00:31, Peter Jun wrote:
> I'm trying to create a script that will take a list of sequence names, and
> pull the fasta sequence.  Occasionally, I will get an error message like
> this when the sequence that I'm trying to retrieve doesn't exist:
>
> ------------- EXCEPTION  -------------
> MSG: WebDBSeqI Error - check query sequences!
>
> STACK Bio::DB::WebDBSeqI::get_seq_stream
> C:/Perl/site/lib/Bio/DB/WebDBSeqI.pm:476
> STACK Bio::DB::NCBIHelper::get_Stream_by_query
> C:/Perl/site/lib/Bio/DB/NCBIHelper.pm:249
> STACK toplevel E:\perlprogs\pullseq\Search.pl:32
>
> --------------------------------------
>
> Anyway to get around this?
>
> Thanks!
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From vamsi at warta.bio.psu.edu  Tue Mar 30 10:26:44 2004
From: vamsi at warta.bio.psu.edu (Vamsi)
Date: Tue Mar 30 10:31:54 2004
Subject: [Bioperl-l] Bio::TreeIO problems parsing bootstrap values in newick
	format trees
Message-ID: <200403301526.i2UFQiV14368@warta.bio.psu.edu>

Hi,

I am using the Bio::TreeIO module to parse trees but have some
problems with the bootstrap values. I used the following code to
place arbitrary bootstrap values at interior nodes of the tree

(a:1,(b:1,c:1):1);

Code used:
----------------------------------------------------------------
use Bio::TreeIO;

my $in = new Bio::TreeIO(-file => $ARGV[0], -format => 'newick');
my $tree = $in->next_tree();
foreach my $n ($tree->get_nodes()) {
    if (!($n->is_Leaf())) {
        $n->bootstrap(int(rand(100)));
    }
}
my $out = new Bio::TreeIO(-format => 'newick');
$out->write_tree($tree);
----------------------------------------------------------------

The tree created was:

((a:1,(b:1,c:1)56:1)93);

I then read the tree in again and tried to print the bootstrap values
using the code:

----------------------------------------------------------------
use Bio::TreeIO;

my $in = new Bio::TreeIO(-file => $ARGV[0], -format => 'newick');
my $tree = $in->next_tree();
foreach my $n ($tree->get_nodes()) {
    if (!($n->is_Leaf())) {
        print $n->bootstrap(),"\n";
    }
}
----------------------------------------------------------------

and it printed blank lines ...

I guess, it is either printing wrong format trees or parsing them incorrectly.
Any suggestions?

Thanks,
Vamsi

From grossman at molgen.mpg.de  Tue Mar 30 10:53:01 2004
From: grossman at molgen.mpg.de (Steffen Grossmann)
Date: Tue Mar 30 10:58:22 2004
Subject: [Bioperl-l] general design of the IO system
Message-ID: <406997DD.8090603@molgen.mpg.de>

Dear all,

I am at the moment having a hard time trying to understand the basic 
design of the IO system.

For example, in Bio::SeqIO I don't understand the trick which makes an 
object created with Bio::SeqIO->new look like a Bio::SeqIO object but 
behave like a, e.g., Bio::SeqIO::fasta object.

Actually, the 'new' method in Bio::SeqIO looks circular to me.

Please, if one of you wizards could explain this to me or give me a hint 
where to find the design documented?

Thanks a lot!

Steffen

-- 
%---------------------------------------------%
%            Steffen Grossmann                %
%                                             %
% Max Planck Institute for Molecular Genetics %
%      Computational Molecular Biology        %
%---------------------------------------------%
%              Ihnestrasse 73                 %
%               14195 Berlin                  %
%                 Germany                     %
%---------------------------------------------%
%         Tel: (++49 +30) 8413-1167           %
%         Fax: (++49 +30) 8413-1152           %
%---------------------------------------------%

From antivirus at unito.it  Tue Mar 30 11:00:18 2004
From: antivirus at unito.it (antivirus@unito.it)
Date: Tue Mar 30 11:05:33 2004
Subject: [Bioperl-l] - DO NOT REPLY - Inflex scan report [0330180010086]
Message-ID: <200403301600.i2UG0LvI010217@albert.unito.it>

Administrator Email Reply Address: rete@unito.it
Email sent to:  fgrasso@unito.it
Inflex ID: 0330180010086
Report Details -----------------------------------------------
AntiVirus Results...

File NAME/TYPE Scan Results
 MS-DOS executable (EXE), OS/2 or MS Windows


0330180010086  from:bioperl-l@bioperl.org  to: fgrasso@unito.it
END OF MESSAGE.

End.
.
From laure.quintric at etudiant.univ-rennes1.fr  Tue Mar 30 11:28:41 2004
From: laure.quintric at etudiant.univ-rennes1.fr (laure.quintric@etudiant.univ-rennes1.fr)
Date: Tue Mar 30 11:33:54 2004
Subject: [Bioperl-l] how to find the gene's name ?
Message-ID: <1080664121.4069a0395516b@webmail.etudiant.univ-rennes1.fr>

Hello,

I don't know how to find the gene's name using bioperl.
To find it, I have to use Bio::LiveSeq::Gene;
But when I try to affich the gene's name by writing 
	print FILE 'genus = '.$seq->LiveSeq::Gene->name()."\n";

it doesn't work.
So, how must I do to use a package which is in an under file of bioperl ?
Thanks

Laure Quintric


From mlinks at cs.usask.ca  Tue Mar 30 11:32:00 2004
From: mlinks at cs.usask.ca (Matthew Links)
Date: Tue Mar 30 11:37:11 2004
Subject: [Bioperl-l] A problem parsing BLASTX 2.2.8 reports
Message-ID: <1080664320.1264.44.camel@localhost.localdomain>

I have run into a problem parsing BLASTX reports (version 2.2.8). When I
ask for strand and frame on the Bio::Search::Hit::HitI object I am
getting back the wrong answer. 

I think this has to do with a slight formatting change in the BLAST
output.

--- BLASTX 2.2.5 ---

>gi|19879878|gb|AAM00191.1| guanine nucleotide-exchange protein GEP2
            [Oryza sativa]
          Length = 1789
 
 Score =  320 bits (819), Expect(2) = 7e-98
 Identities = 160/193 (82%), Positives = 173/193 (89%)
 Frame = +2

--- BLASTX 2.2.8 ---

>gi|38346787|emb|CAE02205.2| OSJNBa0095H06.12 [Oryza sativa (japonica
            cultivar-group)]
          Length = 1724
 
 Score =  102 bits (254), Expect = 5e-21
 Identities = 62/204 (30%), Positives = 103/204 (50%), Gaps = 30/204
(14%)
 Frame = +2

In my debugging it looks like everything is ok except for the
strand/frame data. Which when parsing 2.2.8 gets

hit->frame = 1
hit->strand('query') = 1
hit->strand('hit') = 0

Has anyone seen this problem before? 

Thanks in advance,

Matt

-- 
Matthew Links

Research Officer, Bioinformatics
Department of Computer Science
University of Saskatchewan
1C101 Engineering Building
57 Campus Drive
Saskatoon Saskatchewan
S7N 5A9
Canada

Tel:	(306) 966 2058
Fax:	(306) 966 2033
Email:	mlinks@cs.usask.ca
From postmaster at portal.open-bio.org  Tue Mar 30 04:25:37 2004
From: postmaster at portal.open-bio.org (postmaster@portal.open-bio.org)
Date: Tue Mar 30 11:38:37 2004
Subject: [Bioperl-l] InterScan NT Alert
Message-ID: <200403301638.i2UGcXg2025388@portal.open-bio.org>

Sender, InterScan has detected virus(es) in your e-mail attachment.

Date:  	Tue, 30 Mar 2004 14:55:37 +0530
Method:	Mail
From:  	<bioperl-l@bioperl.org>
To:    	ho@atariinfo.com
File:  	your_bill.pif
Action:	deleted
Virus: 	WORM_NETSKY.D 
From jason at cgt.duhs.duke.edu  Tue Mar 30 11:49:51 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar 30 11:55:03 2004
Subject: [Bioperl-l] general design of the IO system
In-Reply-To: <406997DD.8090603@molgen.mpg.de>
References: <406997DD.8090603@molgen.mpg.de>
Message-ID: <Pine.LNX.4.50.0403301142550.28387-100000@tenero.duhs.duke.edu>

In the new method seeing the format as defined by
-format => 'fasta'
then gets mapped to a modulename 'Bio::SeqIO::fasta' and it calls new on
this module, basically like this
 my $mod = 'Bio::SeqIO::fasta';
 return $mod->new( @args );

So Bio::SeqIO::fasta new is called, then it delegates to its superclass
(Bio::SeqIO).  But this time around things are different because $class is
now different.

When the inherited Bio::SeqIO::new is called there is a test for classname
if( $class =~ /Bio::SeqIO::(\S+)/ ) {
        my ($self) = $class->SUPER::new(@args);
        $self->_initialize(@args);
        return $self;
}

There is a set of subsequent hierarchical _initialize calls made as well.

All of this is in place so that we can
 - Inherit from Bio::Root::Root
 - have initialization arguments be handled by superclasses so that a
   method signature is respected and a superclass implementer doesn't have
   to add a bunch of copy+paste code every time we add a new
   initialization parameter to all the subclasses.

This is the forced way of doing OO in Perl unfortunately.

Yes it is a little bit inefficient, but the biggest hit we take seems
to be is Bio::Root::Root _rearrange method being recalled many times and
its conversions of arrays to hashes.

-jason
On Tue, 30 Mar 2004, Steffen Grossmann wrote:

> Dear all,
>
> I am at the moment having a hard time trying to understand the basic
> design of the IO system.
>
> For example, in Bio::SeqIO I don't understand the trick which makes an
> object created with Bio::SeqIO->new look like a Bio::SeqIO object but
> behave like a, e.g., Bio::SeqIO::fasta object.
>
> Actually, the 'new' method in Bio::SeqIO looks circular to me.
>
> Please, if one of you wizards could explain this to me or give me a hint
> where to find the design documented?
>
> Thanks a lot!
>
> Steffen
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From jason at cgt.duhs.duke.edu  Tue Mar 30 12:03:19 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar 30 12:08:30 2004
Subject: [Bioperl-l] Bio::TreeIO problems parsing bootstrap values in
	newick format trees
In-Reply-To: <200403301526.i2UFQiV14368@warta.bio.psu.edu>
References: <200403301526.i2UFQiV14368@warta.bio.psu.edu>
Message-ID: <Pine.LNX.4.50.0403301139300.28387-100000@tenero.duhs.duke.edu>

This is because bootstrap values is not an explicit type for newick
format the bootstrap value gets mapped to the id on the subsequence
re-read in.  This won't happen for nhx format.


#!/usr/bin/perl -w
use strict;
use Bio::TreeIO;
use IO::String;

my $in = new Bio::TreeIO(-fh => \*DATA, -format => 'newick');
my $tree = $in->next_tree();
foreach my $n ($tree->get_nodes()) {
    if (!($n->is_Leaf())) {
        $n->bootstrap(int(rand(100)));
    }
}
my $str = IO::String->new();
my $out = new Bio::TreeIO(-format => 'newick', -fh => $str);
$out->write_tree($tree);
print ${$str->string_ref}, "\n";
my $is = IO::String->new(${$str->string_ref});
$in = new Bio::TreeIO(-fh => $is, -format => 'newick');
$tree = $in->next_tree();
foreach my $n ($tree->get_nodes()) {
    if (!($n->is_Leaf()) && $n->id) {
        print "id=",$n->id, " blen=", $n->branch_length || '', " bs=",
	$n->bootstrap || '',"\n";
    }
}

__DATA__
(a:1,(b:1,c:1):1);

$ perl test_newick_bootstrap.pl
((a:1,(b:1,c:1)11:1)32);

id=11 blen=1 bs=
id=32 blen= bs=

I realize there an extra set of parens - this has to do with the
explicit rooting of these trees - I need to work on that at some point so
that trees can be stored as unrooted.

-j

On Tue, 30 Mar 2004, Vamsi wrote:

> Hi,
>
> I am using the Bio::TreeIO module to parse trees but have some
> problems with the bootstrap values. I used the following code to
> place arbitrary bootstrap values at interior nodes of the tree
>
> (a:1,(b:1,c:1):1);
>
> Code used:
> ----------------------------------------------------------------
> use Bio::TreeIO;
>
> my $in = new Bio::TreeIO(-file => $ARGV[0], -format => 'newick');
> my $tree = $in->next_tree();
> foreach my $n ($tree->get_nodes()) {
>     if (!($n->is_Leaf())) {
>         $n->bootstrap(int(rand(100)));
>     }
> }
> my $out = new Bio::TreeIO(-format => 'newick');
> $out->write_tree($tree);
> ----------------------------------------------------------------
>
> The tree created was:
>
> ((a:1,(b:1,c:1)56:1)93);
>
> I then read the tree in again and tried to print the bootstrap values
> using the code:
>
> ----------------------------------------------------------------
> use Bio::TreeIO;
>
> my $in = new Bio::TreeIO(-file => $ARGV[0], -format => 'newick');
> my $tree = $in->next_tree();
> foreach my $n ($tree->get_nodes()) {
>     if (!($n->is_Leaf())) {
>         print $n->bootstrap(),"\n";
>     }
> }
> ----------------------------------------------------------------
>
> and it printed blank lines ...
>
> I guess, it is either printing wrong format trees or parsing them incorrectly.
> Any suggestions?
>
> Thanks,
> Vamsi
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From adejo1keashaye at hotmail.com  Tue Mar 30 12:14:09 2004
From: adejo1keashaye at hotmail.com (lawrence)
Date: Tue Mar 30 12:19:18 2004
Subject: [Bioperl-l] AMAZINnG PRICEtS on F.D.A. DRUGiS pacersc octoberq abcdb
Message-ID: <1080666849-22507@excite.com>


PR0ZiAC, XANAbX, VIAGRhA, CIALIhS, ZYBAlN & more!!

Fight depreslsion, reduce your appetite and increase metabolism, make love SEVEN TIMES a NIGHT!

Drugh at HUGE DISCOpNTS!!!!

No herbahl suppljment BS.  The Real FDA APPROVEoD drugs that are proven to work!!
 

Go HERE and GET IT:
http://hvpr.base4ds.com/s95/index.php?id=s95


ruth campbell angels tequila moomoo angus goblue gray vicky laddiegocougs fountain basil carlos excalibu midori happyday christop pisces shit benoit racerx phoenix1 miranda fozzie 
From jason at cgt.duhs.duke.edu  Tue Mar 30 12:25:37 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar 30 12:30:48 2004
Subject: [Bioperl-l] A problem parsing BLASTX 2.2.8 reports
In-Reply-To: <1080664320.1264.44.camel@localhost.localdomain>
References: <1080664320.1264.44.camel@localhost.localdomain>
Message-ID: <Pine.LNX.4.50.0403301208010.28387-100000@tenero.duhs.duke.edu>

Matt - I'm having a little trouble understanding the problem - why aren't
the value you are reporting what you expect.

With BLASTX the query will have a frame (0,1,2) in GFF/bioperl not
(1,2,3). So in your example the query will have some frame [1] and some
strand [1] since frame is '+'.  The hit will have no strand since it is
protein [0].  Isn't that what you got?

$hit->frame is going to return something different depending on what type
of search you did also.  For TBLASTN or BLASTX it will return the valid
frame for whatever makes sense (hit or query) and will return an array for
TBLASTX.

Also, the Hit object will try and make a summary value for all the HSPs -
this will be the frame for all the HSPs (if they share the same frame
throughout) or just the frame of the first HSP if they differ.

I don't really like to rely on this personally.  Rather I would call it
explicitly for the HSP:

$hsp->query->frame or $hsp->hit->frame
 or
$hsp->frame('query'), $hsp->frame('hit')
 or
my ($qframe,$hframe) = $hsp->frame;

All in all it is hard to say without a copy of the report(s) and your
code.

-jason

On Tue, 30 Mar 2004, Matthew Links wrote:

> I have run into a problem parsing BLASTX reports (version 2.2.8). When I
> ask for strand and frame on the Bio::Search::Hit::HitI object I am
> getting back the wrong answer.
>
> I think this has to do with a slight formatting change in the BLAST
> output.
>
> --- BLASTX 2.2.5 ---
>
> >gi|19879878|gb|AAM00191.1| guanine nucleotide-exchange protein GEP2
>             [Oryza sativa]
>           Length = 1789
>
>  Score =  320 bits (819), Expect(2) = 7e-98
>  Identities = 160/193 (82%), Positives = 173/193 (89%)
>  Frame = +2
>
> --- BLASTX 2.2.8 ---
>
> >gi|38346787|emb|CAE02205.2| OSJNBa0095H06.12 [Oryza sativa (japonica
>             cultivar-group)]
>           Length = 1724
>
>  Score =  102 bits (254), Expect = 5e-21
>  Identities = 62/204 (30%), Positives = 103/204 (50%), Gaps = 30/204
> (14%)
>  Frame = +2
>
> In my debugging it looks like everything is ok except for the
> strand/frame data. Which when parsing 2.2.8 gets
>
> hit->frame = 1
> hit->strand('query') = 1
> hit->strand('hit') = 0
>
> Has anyone seen this problem before?
>
> Thanks in advance,
>
> Matt
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From Gilles.Parmentier at ens-lyon.fr  Tue Mar 30 13:40:54 2004
From: Gilles.Parmentier at ens-lyon.fr (Gilles Parmentier)
Date: Tue Mar 30 13:46:07 2004
Subject: [Bioperl-l] A problem parsing BLASTX 2.2.8 reports
In-Reply-To: <Pine.LNX.4.50.0403301208010.28387-100000@tenero.duhs.duke.edu>
References: <1080664320.1264.44.camel@localhost.localdomain>
	<Pine.LNX.4.50.0403301208010.28387-100000@tenero.duhs.duke.edu>
Message-ID: <4069BF36.5020307@ens-lyon.fr>

Hi all,

Using mac os X I noticed a lot of problems with 2.2.8. reports. In fact 
they look really buggy (strange query lengths among other things). 
Bioperl::SearIO is not able to parse them. To tackle this I downgraded 
my install to 2.2.6 :(

Gilles

Jason Stajich wrote:

>Matt - I'm having a little trouble understanding the problem - why aren't
>the value you are reporting what you expect.
>
>With BLASTX the query will have a frame (0,1,2) in GFF/bioperl not
>(1,2,3). So in your example the query will have some frame [1] and some
>strand [1] since frame is '+'.  The hit will have no strand since it is
>protein [0].  Isn't that what you got?
>
>$hit->frame is going to return something different depending on what type
>of search you did also.  For TBLASTN or BLASTX it will return the valid
>frame for whatever makes sense (hit or query) and will return an array for
>TBLASTX.
>
>Also, the Hit object will try and make a summary value for all the HSPs -
>this will be the frame for all the HSPs (if they share the same frame
>throughout) or just the frame of the first HSP if they differ.
>
>I don't really like to rely on this personally.  Rather I would call it
>explicitly for the HSP:
>
>$hsp->query->frame or $hsp->hit->frame
> or
>$hsp->frame('query'), $hsp->frame('hit')
> or
>my ($qframe,$hframe) = $hsp->frame;
>
>All in all it is hard to say without a copy of the report(s) and your
>code.
>
>-jason
>
>On Tue, 30 Mar 2004, Matthew Links wrote:
>
>  
>
>>I have run into a problem parsing BLASTX reports (version 2.2.8). When I
>>ask for strand and frame on the Bio::Search::Hit::HitI object I am
>>getting back the wrong answer.
>>
>>I think this has to do with a slight formatting change in the BLAST
>>output.
>>
>>--- BLASTX 2.2.5 ---
>>
>>    
>>
>>>gi|19879878|gb|AAM00191.1| guanine nucleotide-exchange protein GEP2
>>>      
>>>
>>            [Oryza sativa]
>>          Length = 1789
>>
>> Score =  320 bits (819), Expect(2) = 7e-98
>> Identities = 160/193 (82%), Positives = 173/193 (89%)
>> Frame = +2
>>
>>--- BLASTX 2.2.8 ---
>>
>>    
>>
>>>gi|38346787|emb|CAE02205.2| OSJNBa0095H06.12 [Oryza sativa (japonica
>>>      
>>>
>>            cultivar-group)]
>>          Length = 1724
>>
>> Score =  102 bits (254), Expect = 5e-21
>> Identities = 62/204 (30%), Positives = 103/204 (50%), Gaps = 30/204
>>(14%)
>> Frame = +2
>>
>>In my debugging it looks like everything is ok except for the
>>strand/frame data. Which when parsing 2.2.8 gets
>>
>>hit->frame = 1
>>hit->strand('query') = 1
>>hit->strand('hit') = 0
>>
>>Has anyone seen this problem before?
>>
>>Thanks in advance,
>>
>>Matt
>>
>>
>>    
>>
>
>--
>Jason Stajich
>Duke University
>jason at cgt.mc.duke.edu
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>  
>

-- 
"En passant la ligne sacr?e de la biens?ance, on donne des armes ? ceux qui n'ont pour eux que le respect de la biens?ance, qui fait la dignit? du corps des professionnels. Tous ces dignes dignitaires, ils ont ?a : leur petit corset de vertu n?gative." (P.B. in Yvette Delsaut et Marie-Christine Rivi?re, Bibliographie des travaux de Pierre Bourdieu, Le Temps des Cerises, 2002)


From jason at cgt.duhs.duke.edu  Tue Mar 30 14:03:15 2004
From: jason at cgt.duhs.duke.edu (Jason Stajich)
Date: Tue Mar 30 14:08:39 2004
Subject: [Bioperl-l] A problem parsing BLASTX 2.2.8 reports
In-Reply-To: <4069BF36.5020307@ens-lyon.fr>
References: <1080664320.1264.44.camel@localhost.localdomain>
	<Pine.LNX.4.50.0403301208010.28387-100000@tenero.duhs.duke.edu>
	<4069BF36.5020307@ens-lyon.fr>
Message-ID: <Pine.LNX.4.50.0403301344270.28387-100000@tenero.duhs.duke.edu>

That particular problem has been reported to NCBI and is listed in bug
#1598 as I posted to the list last week.
http://bugzilla.open-bio.org/show_bug.cgi?id=1598

I committed a patch to Bio::SearchIO::blast in CVS to handle this bug in
BLAST (but of course cannot get the # letters in the database correct
since this is mangled).

-jason
On Tue, 30 Mar 2004, Gilles Parmentier wrote:

> Hi all,
>
> Using mac os X I noticed a lot of problems with 2.2.8. reports. In fact
> they look really buggy (strange query lengths among other things).
> Bioperl::SearIO is not able to parse them. To tackle this I downgraded
> my install to 2.2.6 :(
>
> Gilles
>
> Jason Stajich wrote:
>
> >Matt - I'm having a little trouble understanding the problem - why aren't
> >the value you are reporting what you expect.
> >
> >With BLASTX the query will have a frame (0,1,2) in GFF/bioperl not
> >(1,2,3). So in your example the query will have some frame [1] and some
> >strand [1] since frame is '+'.  The hit will have no strand since it is
> >protein [0].  Isn't that what you got?
> >
> >$hit->frame is going to return something different depending on what type
> >of search you did also.  For TBLASTN or BLASTX it will return the valid
> >frame for whatever makes sense (hit or query) and will return an array for
> >TBLASTX.
> >
> >Also, the Hit object will try and make a summary value for all the HSPs -
> >this will be the frame for all the HSPs (if they share the same frame
> >throughout) or just the frame of the first HSP if they differ.
> >
> >I don't really like to rely on this personally.  Rather I would call it
> >explicitly for the HSP:
> >
> >$hsp->query->frame or $hsp->hit->frame
> > or
> >$hsp->frame('query'), $hsp->frame('hit')
> > or
> >my ($qframe,$hframe) = $hsp->frame;
> >
> >All in all it is hard to say without a copy of the report(s) and your
> >code.
> >
> >-jason
> >
> >On Tue, 30 Mar 2004, Matthew Links wrote:
> >
> >
> >
> >>I have run into a problem parsing BLASTX reports (version 2.2.8). When I
> >>ask for strand and frame on the Bio::Search::Hit::HitI object I am
> >>getting back the wrong answer.
> >>
> >>I think this has to do with a slight formatting change in the BLAST
> >>output.
> >>
> >>--- BLASTX 2.2.5 ---
> >>
> >>
> >>
> >>>gi|19879878|gb|AAM00191.1| guanine nucleotide-exchange protein GEP2
> >>>
> >>>
> >>            [Oryza sativa]
> >>          Length = 1789
> >>
> >> Score =  320 bits (819), Expect(2) = 7e-98
> >> Identities = 160/193 (82%), Positives = 173/193 (89%)
> >> Frame = +2
> >>
> >>--- BLASTX 2.2.8 ---
> >>
> >>
> >>
> >>>gi|38346787|emb|CAE02205.2| OSJNBa0095H06.12 [Oryza sativa (japonica
> >>>
> >>>
> >>            cultivar-group)]
> >>          Length = 1724
> >>
> >> Score =  102 bits (254), Expect = 5e-21
> >> Identities = 62/204 (30%), Positives = 103/204 (50%), Gaps = 30/204
> >>(14%)
> >> Frame = +2
> >>
> >>In my debugging it looks like everything is ok except for the
> >>strand/frame data. Which when parsing 2.2.8 gets
> >>
> >>hit->frame = 1
> >>hit->strand('query') = 1
> >>hit->strand('hit') = 0
> >>
> >>Has anyone seen this problem before?
> >>
> >>Thanks in advance,
> >>
> >>Matt
> >>
> >>
> >>
> >>
> >
> >--
> >Jason Stajich
> >Duke University
> >jason at cgt.mc.duke.edu
> >_______________________________________________
> >Bioperl-l mailing list
> >Bioperl-l@portal.open-bio.org
> >http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
> >
>
>

--
Jason Stajich
Duke University
jason at cgt.mc.duke.edu
From mlinks at cs.usask.ca  Tue Mar 30 14:34:40 2004
From: mlinks at cs.usask.ca (Matthew Links)
Date: Tue Mar 30 14:39:52 2004
Subject: [Bioperl-l] A problem parsing BLASTX 2.2.8 reports
In-Reply-To: <Pine.LNX.4.50.0403301208010.28387-100000@tenero.duhs.duke.edu>
References: <1080664320.1264.44.camel@localhost.localdomain>
	<Pine.LNX.4.50.0403301208010.28387-100000@tenero.duhs.duke.edu>
Message-ID: <1080675280.1264.69.camel@localhost.localdomain>

Sorry Jason,

Turns out I was going a little cross-eyed. I was running blast with the
default parameters for the number of one line descriptions and the
number of alignments (500 and 250 respectively). So for the 251st hit
there was no corresponding strand information and so the strand is set
to the empty string (but defined). I mis-interpreted the behaviour. I
then further confused things by grabbing the wrong example. Posting, as
you point out, an example of correct problem.

Sorry, the problem was all psychosomatic.

I think its time for a beer....

Thanks for humouring me,

Matt

-- 
Matthew Links

Research Officer, Bioinformatics
Department of Computer Science
University of Saskatchewan
1C101 Engineering Building
57 Campus Drive
Saskatoon Saskatchewan
S7N 5A9
Canada

Tel:	(306) 966 2058
Fax:	(306) 966 2033
Email:	mlinks@cs.usask.ca
From heikki at nildram.co.uk  Tue Mar 30 17:47:36 2004
From: heikki at nildram.co.uk (Heikki Lehvaslaiho)
Date: Tue Mar 30 17:52:46 2004
Subject: [Bioperl-l] how to find the gene's name ?
In-Reply-To: <1080664121.4069a0395516b@webmail.etudiant.univ-rennes1.fr>
References: <1080664121.4069a0395516b@webmail.etudiant.univ-rennes1.fr>
Message-ID: <200403302247.36919.heikki@nildram.co.uk>

Laure,

You really should not be using Bio::LiveSeq unless you want to mutate the 
sequence. It much too heavy handed for simply looking into the entry.

I suppose you have a GenBank or EMBL format file. If that is the case, you can 
access the gene name like this (if it is present in the file, which is not 
too often):

-----------------------------------------------------------------------------
use Bio::SeqIO;
use strict;

my $in  = Bio::SeqIO->new(-file => shift);
my $seq = $in->next_seq;

foreach ( $seq->all_SeqFeatures()) {
    print $_->get_tag_values('gene'), "\n" if $_->has_tag('gene');
}
-----------------------------------------------------------------------------

Yours,
	-Heikki

On Tuesday 30 Mar 2004 16:28, laure.quintric@etudiant.univ-rennes1.fr wrote:
> Hello,
>
> I don't know how to find the gene's name using bioperl.
> To find it, I have to use Bio::LiveSeq::Gene;
> But when I try to affich the gene's name by writing
> 	print FILE 'genus = '.$seq->LiveSeq::Gene->name()."\n";
>
> it doesn't work.
> So, how must I do to use a package which is in an under file of bioperl ?
> Thanks
>
> Laure Quintric
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From jweeks at rci.rutgers.edu  Tue Mar 30 20:53:21 2004
From: jweeks at rci.rutgers.edu (Jeff Weeks)
Date: Tue Mar 30 17:59:44 2004
Subject: [Bioperl-l] Interface directly to Unigene DB  at NCBI?
Message-ID: <3046DF57-82B6-11D8-B823-003065E01AEC@rci.rutgers.edu>

The only example I've seen is the cluster stream class reading from a 
dat file.  Sorry if my lingo is not correct.  We'd like to update our 
database by accessing the Unigene database directly without doing a 
file download first.

Is this possible?

Thanks in advance!

Jeffrey Weeks
SCI Project to Cure Paralysis
Rutgers University

From nandita at uga.edu  Tue Mar 30 18:19:06 2004
From: nandita at uga.edu (Nandita Mullapudi)
Date: Tue Mar 30 18:24:15 2004
Subject: [Bioperl-l] getting top blast hit
Message-ID: <a06020405bc8fb034aafd@[128.192.45.16]>

I am currently using  Bio::SearchIO to parse tons of blast files by 
setting limits on E- value, % identity etc. I now want to parse a 
huge blast job, by getting just the top hit out of each file. 
(Assuming blast was run and hits were returned with hit with lowest E 
value first)
Is there a quick way to do this?
thanks,
-Nandita
From sdavis2 at mail.nih.gov  Tue Mar 30 18:41:19 2004
From: sdavis2 at mail.nih.gov (Sean Davis)
Date: Tue Mar 30 18:42:13 2004
Subject: [Bioperl-l] getting top blast hit
In-Reply-To: <a06020405bc8fb034aafd@[128.192.45.16]>
Message-ID: <BC8F6FCF.6648%sdavis2@mail.nih.gov>

Nandita,

You can do something like (from Bio::SearchIO):

   use Bio::SearchIO;
   # format can be 'fasta', 'blast', 'exonerate', ...
   my $searchio = new Bio::SearchIO( -format => 'blastxml',
                                     -file   => 'blastout.xml' );
   while ( my $result = $searchio->next_result() ) {
       #get first hit and do something with it.
       my $top_hit = $result->next_hit;
       #do some stuff such as saving a it in a hash
       #move on to next result (next query)
   }

Sean

On 3/30/04 6:19 PM, "Nandita Mullapudi" <nandita@uga.edu> wrote:

> I am currently using  Bio::SearchIO to parse tons of blast files by
> setting limits on E- value, % identity etc. I now want to parse a
> huge blast job, by getting just the top hit out of each file.
> (Assuming blast was run and hits were returned with hit with lowest E
> value first)
> Is there a quick way to do this?
> thanks,
> -Nandita
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 

From deejoe at iastate.edu  Tue Mar 30 19:24:44 2004
From: deejoe at iastate.edu (D. Joe Anderson)
Date: Tue Mar 30 19:29:55 2004
Subject: [Bioperl-l] Bioperl 1.4 on Debian
In-Reply-To: <20040309190612.GB9004@bioinfo.ucr.edu>
References: <404E10F4.5010300@burnham.org>
	<20040309190612.GB9004@bioinfo.ucr.edu>
Message-ID: <20040331002444.GG14934@iastate.edu>

On Tue, Mar 09, 2004 at 11:06:12AM -0800, Josh Lauricha wrote:
> On Tue 03/09/04 10:46, Kutbuddin Doctor wrote:

> > I wish to go directly to bioperl 1.4 . I see my options as CPAN for 1.4 
> > or the test debian package "bioperl 1.2.1-2". Our happy-go-lucky system 
> > admin prefers debian packages :)

As well s/he should!  :-)

I've filed a wishlist bug against the Debian bioperl package, in hopes that
the Debian maintainer can find the time to get 1.4 in:

http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=241146

What would be especially sweet is if it makes it into sarge before it goes
stable.
 
> I've got an unofficial 1.4 package that I could make availible, but its
> not likely to ever get into debian (I'm not a maintainer) and when 1.4
> is in, its not likely to be structured like mine (which isn't really a
> big deal.)

In the meantime, I've been dabbling with the Debian dh-make-perl tool for
building those sysadmin-pleasing Debian packages from CPAN packages, which
attempts you can read more about here:

http://www.etrumeus.com/wiki/DebianBioinformatics#7204beef1acd1862531845cb0f7e3ca243126b45

Since these days I'm more sysadmin than heavy bioperl user, this is
about as far as I've gotten.  I don't really have the experience or suite of
personal scripts to evaluate it much further.  

If someone running Debian stable wants to download that and give it a try
and let me know how well it works, that'd be great.  My next venture in this
vein will be to try making 1.4 with dh-make-perl on an updated sarge
(presently "unstable") box.

-- 
D. Joe Anderson, Asst. Sci.                    2252 Molecular Biology Bldg.
BBMB Research Computing Support 
www.public.iastate.edu/~deejoe

From andrew at arginine.murdoch.edu.au  Tue Mar 30 22:29:43 2004
From: andrew at arginine.murdoch.edu.au (Andrew Macgregor)
Date: Tue Mar 30 22:35:02 2004
Subject: [Bioperl-l] Interface directly to Unigene DB  at NCBI?
In-Reply-To: <3046DF57-82B6-11D8-B823-003065E01AEC@rci.rutgers.edu>
References: <3046DF57-82B6-11D8-B823-003065E01AEC@rci.rutgers.edu>
Message-ID: <A67E76F8-82C3-11D8-ADEC-000A95E00B2E@cbbc.murdoch.edu.au>

Hi Jeffrey,

As NCBI only updates their unigene build every three weeks or so I've 
always found it just as easy to download the appropriate *.data file 
and load it into a local database and search that. That speeds it up 
for me and takes the load off NCBI servers.

Not really an answer sorry, but I hope this helps.

-- Andrew.


On 31/03/2004, at 9:53 AM, Jeff Weeks wrote:

> The only example I've seen is the cluster stream class reading from a 
> dat file.  Sorry if my lingo is not correct.  We'd like to update our 
> database by accessing the Unigene database directly without doing a 
> file download first.
>
> Is this possible?
>
> Thanks in advance!
>
> Jeffrey Weeks
> SCI Project to Cure Paralysis
> Rutgers University
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From heikki at ebi.ac.uk  Wed Mar 31 06:42:26 2004
From: heikki at ebi.ac.uk (Heikki Lehvaslaiho)
Date: Wed Mar 31 06:48:09 2004
Subject: [Bioperl-l] how to find the gene's name ?
In-Reply-To: <1080717605.406a71258ec6d@webmail.etudiant.univ-rennes1.fr>
References: <1080664121.4069a0395516b@webmail.etudiant.univ-rennes1.fr>
	<200403302247.36919.heikki@nildram.co.uk>
	<1080717605.406a71258ec6d@webmail.etudiant.univ-rennes1.fr>
Message-ID: <200403311142.26881.heikki@ebi.ac.uk>


Laure,

If you are dealing with Swiss-prot entries, gene name(s)  is accessed by:

    for ($seq->annotation()->get_Annotations('gene_name')) {
        print $_->value;
    }

See Bio::AnnotationI for more information about annotations. (Gene name is a 
standard feature a swiss-prot entry, but occurs only occasionally in 
nucleotide sequence entries as part of the feature table, so it makes sense 
to keep it in generic storage space offered by annotations.)


Molecular weight is not stored in the object, but recalculated from the 
sequence using the following code:

  my $mw = ${Bio::Tools::SeqStats->get_mol_wt($seq->primary_seq)}[0];

It might actually be wise to save the molecule weight in an annotation object 
and retrieve it there and calculate it only it is not present. That should 
speed up the processing.

The keywords are easily accessed (maybe you example sequence does not have 
keywords?):

  my $kw_string = $seq->keywords;


Lastly, you miss "Org Crossref (RX)" line. I am not sure what you mean by it.
RX lines are for reference crosrefs which are an other type of annotation 
objects:

    foreach my $ref ( $_->annotation->get_Annotations('reference') ) {
        print "MEDLINE=". $ref->medline. "\n" if $ref->medline;
    }

If, on the other hand, you are after the taxid like:

OX   NCBI_TaxID=9606;

This code will print it out:

    print "taxid= ", $seq->species->ncbi_taxid, "\n";


The best way to find out how swiss-prot fields are treated, look into the  
Bio::SeqIO::swiss module.

Yours,

	-Heikki

On Wednesday 31 Mar 2004 07:20, you wrote:
> Hello, it's me again, sorry to annoy you, but I'm blocked :(
>
> So as I said, I try to extract the gene's name. I need to add some
> precisions. First, I work on Swissprot Database, I've downloaded it to work
> in local. The thing I want to do is to keep only the informations that I
> need to resolve my project that is to create a web interface so as we could
> find a protein by entrying the composition in acid amino.
> So I must create my own database by extracting information from swissprot,
> I need to find:
> -Entry name (it's ok)
> -Primary accession number (it's ok)
> -Seq length (it's ok)
> -Seq weight (I can't find it... :( )
> -Seq protein (it's ok)
> -Protein name (it's ok)
> -Gene name (Can't find it)
> -Organelle (ok)
> -Species name (ok)
> -Taxonomy (ok)
> -Org crossref (can't find it)
> -keyword list (the fonction $seq->keywords() doesn't work or I don't know
> how to use it...)
>
> I've tried the algorithm you send me but it tells that it can't call the
> method has_tag without package or object reference at line
> print $_->get_tag_values('gene'), "\n" if $_->has_tag('gene');
>
> So The thing I try to do is to open the local database of Swissprot and
> write in my own file those informations, but I can't find the Seq weight,
> the Gene's name, the Org crossref (RX) and the keywords...
> You can find my algorithm in the joint file.
>
> Thank you very much.
>
> Laure.

-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho    heikki_at_ebi ac uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________
From brian_osborne at cognia.com  Wed Mar 31 07:39:33 2004
From: brian_osborne at cognia.com (Brian Osborne)
Date: Wed Mar 31 07:44:45 2004
Subject: [SPAM] [Bioperl-l] how to find the gene's name ?
In-Reply-To: <1080664121.4069a0395516b@webmail.etudiant.univ-rennes1.fr>
Message-ID: <GAEDKMGOKFBLJPKCLKCCCEKNDJAA.brian_osborne@cognia.com>

Laure,

You followed up by saying that you were using Swissprot. You may want to
take a look the Feature-Annotation HOWTO, since most of the information in
Swissprot is stored as Annotations in Bioperl.

Also, you mentioned that you'd created your own database for Swissprot data.
This is not necessary, the combination of Bioperl and the Bioperl package
bioperl-db and Biosql creates a database (Mysql, Oracle, or Postgres) that
you can load Swissprot into, the loading scripts are available as well. A
number of Bioperl users have been talking using these tools for Swissprot.

Brian O.

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org
[mailto:bioperl-l-bounces@portal.open-bio.org]On Behalf Of
laure.quintric@etudiant.univ-rennes1.fr
Sent: Tuesday, March 30, 2004 11:29 AM
To: bioperl-l@bioperl.org
Subject: [SPAM] [Bioperl-l] how to find the gene's name ?

Hello,

I don't know how to find the gene's name using bioperl.
To find it, I have to use Bio::LiveSeq::Gene;
But when I try to affich the gene's name by writing
        print FILE 'genus = '.$seq->LiveSeq::Gene->name()."\n";

it doesn't work.
So, how must I do to use a package which is in an under file of bioperl ?
Thanks

Laure Quintric


_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l


From "M BATES ALANN" at netscape.net  Wed Mar 31 20:14:48 2004
From: "M BATES ALANN" at netscape.net ("M BATES ALANN"@netscape.net)
Date: Wed Mar 31 09:58:45 2004
Subject: [Bioperl-l] REPLY BACK
Message-ID: <0HVG00CRJWSMBW@btic.dbt.gov.in>

Dear Friend,

As you read this, I don't want you to feel sorry for me, because, I believe everyone will die someday. 
My name is BATES ALAN a merchant in Dubai, in the U.A.E.I have been diagnosed with Esophageal cancer.
It has defiled all forms of medical treatment, and right now I have only about a few months to live, according to medical experts. 
I have not particularly lived my life so well, as I never really cared for anyone(not even myself)but my business. Though I am very rich, I was never generous, I was always hostile to people and only focused on my business as that was the only thing I cared for. But now I regret all this as I now know that there is more to life than just wanting to have or make all the money in the world. 
I believe when God gives me a second chance to come to this world I would live my life a different way from how I have lived it. Now that God has called me, I have willed and given most of my property and assets to my immediate and extended family members as well as a few close friends. 
I want God to be merciful to me and accept my soul so, I have decided to give alms to charity 
organizations, as I want this to be one of the last good deeds I do on earth. So far, I have distributed money to some charity organizations in the U.A.E, Algeria and Malaysia. 
Now that my health has deteriorated so badly, I cannot do this myself anymore. I once asked members of my family to close one of my accounts and distribute the money which I have there to charity organization in Bulgaria and Pakistan, they refused and kept the money to themselves. Hence, I do not trust them anymore, as they seem not to be contended with what I have left 
for them. 
The last of my money which no one knows of is the huge cash deposit of eighteen million dollars $18,000,000,00 that I have with a finance/Security Company abroad. I will want you to help me collect this deposit and dispatched it to charity organizations.
I have set aside 10% for you and for your time.

God be with you. 

BATES ALAN

From antivirus at unito.it  Wed Mar 31 11:51:14 2004
From: antivirus at unito.it (antivirus@unito.it)
Date: Wed Mar 31 11:56:37 2004
Subject: [Bioperl-l] - DO NOT REPLY - Inflex scan report [0331185120902]
Message-ID: <200403311651.i2VGpHAH021388@albert.unito.it>

Administrator Email Reply Address: rete@unito.it
Email sent to:  pmarsico@ircc.it
Inflex ID: 0331185120902
Report Details -----------------------------------------------
AntiVirus Results...
Scanning /space/inflex/tmp/inf_0331185120902/unpacked/*
Scanning file /space/inflex/tmp/inf_0331185120902/unpacked/_headers_
Scanning file /space/inflex/tmp/inf_0331185120902/unpacked/textfile0
Scanning file /space/inflex/tmp/inf_0331185120902/unpacked/textfile1
Scanning file /space/inflex/tmp/inf_0331185120902/unpacked/file.zip
/space/inflex/tmp/inf_0331185120902/unpacked/file.zip
        Found the W32/Netsky.p@MM!zip virus !!!
Scanning file /space/inflex/tmp/inf_0331185120902/unpacked/textfile2

Summary report on /space/inflex/tmp/inf_0331185120902/unpacked/*
File(s)
        Total files: ...........       5
        Clean: .................       4
        Possibly Infected: .....       1

File NAME/TYPE Scan Results


0331185120902  from:bioperl-l@bioperl.org  to: pmarsico@ircc.itText scanning off.

END OF MESSAGE.

End.
.
From deejoe at iastate.edu  Wed Mar 31 12:05:43 2004
From: deejoe at iastate.edu (D. Joe Anderson)
Date: Wed Mar 31 12:10:52 2004
Subject: maxstrengthmail confirmation [Re: [Bioperl-l] get cds from
	genbank record]
In-Reply-To: <200401202325.34812.heikki@nildram.co.uk>
References: <4006F978.1040902@research.dfci.harvard.edu>
	<200401162012.01215.heikki@nildram.co.uk>
	<20040120201145.GC4195@hgt.mcb.uconn.edu>
	<200401202325.34812.heikki@nildram.co.uk>
Message-ID: <20040331170543.GG25502@iastate.edu>

On Tue, Jan 20, 2004 at 11:25:34PM +0000, Heikki Lehvaslaiho wrote:
> Andreas, Pedro and others,
> 
> Here is what Chris had to say:
> """"""""""""""""""""""""""""""""""""""""""
snip
> """"""""""""""""""""""""""""""""""""""""""
> 
> Please keep reporting these annoyances. If they get too common, we'll have to 
> unsubscribe any members who use such systems.

I just got a maxstrength challenge message.

If there is a ready way for list admins to find out which subscribed
addresses generate these challenges and then to unsubscribe those addresses,
as a matter of policy, I'd highly recommend that.

Challenge-response is an flawed approach. It often leads to the obnoxious
result of the challenge-response system dragging an innocent third party
into their clients' spam- and virus-related woes, which compounds the
problems triggered by spam and email-generating viruses.

IMO the use of such shoddy approaches shouldn't be tolerated in the commons:

 Challenge-Response Anti-Spam Systems Considered Harmful
 http://kmself.home.netcom.com/Rants/challenge-response.html

--Joe

> On Tuesday 20 Jan 2004 20:11, Andreas Bernauer wrote:
> > Hi Heikki!
> >
> > Heikki Lehvaslaiho wrote:
> > > The other problem you had:
> > >
> > > On Friday 16 Jan 2004 15:39, Pedro Antonio Reche wrote:
> > > > One more thing, following my first post I got an email from from
> > > > quarantine@maxstrengthmail.com on behalf of the bioperl mailing list
> > > > saying that my e-mail was in quarantine, and asking me press a link to
> > > > get the message deliver.
> > >
> > > I 've pointed to our server maintainer, Chris,  your problem, but I
> > > suspect those warnings about potential spam in your mails do not come
> > > from bioperl servers or the way they are set up.
> >
> > I've got a similar email as Pedro when I sent an email to the
> > Bioperl-List and was also directed to a website where I should confirm
> > that I want to set the email.  For me it looked it came from one of
> > the bioperl mailing list servers.
From gabriele.bucci at unimi.it  Wed Mar 31 12:43:00 2004
From: gabriele.bucci at unimi.it (gabriele.bucci@unimi.it)
Date: Wed Mar 31 12:48:11 2004
Subject: [Bioperl-l] overlapping features
Message-ID: <39ff3539d913.39d91339ff35@unimi.it>

Hi all.
Need some help about the use of the method overlap (ensembl API). Say I would like to know if a slice in the genome overlaps a gene and where (exon, intron, UTRs), how could I achieve this? I was able to retrieve the gene,transcript and exon position relative to my slice, but I need something more direct like : "Yes your slice is located in the (Upstream,UTR,Exon,Intron,3'UTR,Downstream) this gene".

Can you tell me something useful?
Thanks a lot.

Gab


From Mailer-Daemon at fergflor.k12.mo.us  Wed Mar 31 16:02:01 2004
From: Mailer-Daemon at fergflor.k12.mo.us (Mailer-Daemon@fergflor.k12.mo.us)
Date: Wed Mar 31 16:06:36 2004
Subject: [Bioperl-l] NDN: Re: Protected Mail Delivery
Message-ID: <fc.00870b21008929fe3b9aca00dc95e79b.892a01@fergflor.k12.mo.us>

Sorry. Your message could not be delivered to:

6256d19.0057167b,Ferguson-Florissant SD (The name was not found at the
remote site. Check that the name has been entered correctly.)

From richard at uwc.net  Tue Mar 30 13:39:46 2004
From: richard at uwc.net (Richard Holland)
Date: Wed Mar 31 22:37:55 2004
Subject: [Bioperl-l] Singapore
Message-ID: <200403310639.46455.richard@uwc.net>

Hi guys,

Does anyone know anything about finding database/software/web development 
(bioinformatics-related or otherwise) work in Singapore? I'm finding it 
rather hard to pin down any concrete information, and am wondering if I 
should just go there and start knocking on doors rather than rely on e-mail 
to get my CV to the right people.

I understand Elia Stupka used to work there so maybe he knows something? I'm 
already in touch with Mark Schreiber but obviously the more the merrier. It's 
turning out not to be the easiest of tasks.

cheers,
Richard


From desmond_lim at bti.a-star.edu.sg  Wed Mar 31 21:53:35 2004
From: desmond_lim at bti.a-star.edu.sg (Desmond LIM)
Date: Thu Apr  1 12:53:32 2004
Subject: [Bioperl-l] Parsing a Fasta file
Message-ID: <0015DB1919DAA4469056B78E996ED9FA22FB07@BIONIC.biopolis.one-north.com>

I have a file that contains protein sequences in fasta format.
 
I need to be able to read each sequence individually, with their
accessions (to check against a database). I was wondering if there is a
module that could do that, giving me 2 variables (the accession and
sequence). I have gone throught the bioperl modules and I found modules
that either just take in 1 sequence and parse it or take in the sequence
if it is in 1 line and not broken into a few like mine. Am I right?
 
Thanks.
 
Desmond