From bmoore at genetics.utah.edu Thu Dec 1 08:14:44 2005
From: bmoore at genetics.utah.edu (Barry Moore)
Date: Thu Dec 1 08:11:54 2005
Subject: [Bioperl-l] clustalw.pm: could not open sequence file error
Message-ID:

Olena,

Does the filename for the file in question have any spaces anywhere in the path? I know clustalx won't open files with a space in the path even though Windows allows that. Don't know for sure on clustalw, but seems like it might behave the same way.

Barry

-----Original Message-----
From: bioperl-l-bounces@portal.open-bio.org [mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Olena Morozova
Sent: Tuesday, November 29, 2005 3:34 PM
To: bioperl-ml List
Subject: [Bioperl-l] clustalw.pm: could not open sequence file error

Hi all,

I am trying to use this script

use Bio::Tools::Run::Alignment::Clustalw;
$ENV{CLUSTALDIR} = 'C:/perl/clustalw1.8/';
my @params = ('ktuple' => 2, 'matrix' => 'BLOSUM', 'outfile' => 'al_mouse.txt');
my $factory = Bio::Tools::Run::Alignment::Clustalw->new(@params);
$inputfilename = 'c:/perl/mouse_unique.txt';
my $aln = $factory->align($inputfilename);

to do a MSA, and it works for a test file with 2 or 3 sequences. However, when I try to run it on my actual file (has 97 sequences) which is in exactly the same format as the test file (fasta), I get a "could not open the sequence file" error. Is this because the file is too big and is there a way to fix this?

Thanks a lot for your help!
Olena

On 11/29/05, Jason Stajich wrote:
>
>
> Begin forwarded message:
>
> > From: neeti somaiya
> > Date: November 29, 2005 1:27:27 AM EST
> > To: Jason Stajich
> > Subject: Re: [Bioperl-l] need BLAT parse code
> >
> > I use the following code :
> >
> > open(FH,"output.psl");
> > while(<FH>)
> > {
> >   if( /^psLayout/ )
> >   {
> >     for( 1..4 ) { <> }
> >   }
> >   my @line = split;
> >   my ( $matches,$mismatches,$rep_matches,$n_count,
> >        $q_num_insert,$q_base_insert,
> >        $t_num_insert, $t_base_insert,
> >        $strand, $q_name, $q_length, $q_start,
> >        $q_end, $t_name, $t_length,$t_start, $t_end, $block_count,
> >        $block_sizes, $q_starts, $t_starts
> >      ) = split;
> >
> >
> >   print $t_start;
> >   print "\n";
> >   print $t_end;
> >
> > }
> >
> > for output.psl file :
> >
> > match mis- rep. N's Q gap Q gap T gap T gap
> > strand Q Q Q Q T
> > T T T block blockSizes qStarts tStarts
> > match match count bases count
> > bases name size start end
> > name size start end count
> > ----------------------------------------------------------------------
> > ----------------------------------------------------------------------
> > -------------------
> > 27025 0 0 0 0 0 0 0
> > + query_sequence3 27025 0 27025
> > database_sequence3 57701691 132995 160020 1
> > 27025, 0, 132995,
> > ~
> >
> >
> > It gave me output :
> >
> > Q
> > Q
> >
> > 132995
> > 160020
> >
> > What is the Q? Can't I obtain the coordinates (132995, 160020) alone?
> >
> > Please let me know.
> > Thanks.
> >
> > On 11/28/05, Jason Stajich wrote:
> > Bio::SearchIO::psl can parse psl output.
> > > > or more simply: > > > > while(<>) { > > if( /^psLayout/ ) { # if there is a header > > for( 1..4 ) { <> } # take next 4 lines to skip the header > > } > > my @line = split; > > my ( $matches,$mismatches,$rep_matches,$n_count, > > $q_num_insert,$q_base_insert, > > $t_num_insert, $t_base_insert, > > $strand, $q_name, $q_length, $q_start, > > $q_end, $t_name, $t_length,$t_start, $t_end, > > $block_count, > > $block_sizes, $q_starts, $t_starts > > ) = split; > > > > # query aln vals are $q_start, and $q_end values > > # hit aln vals are $t_start, $t_end > > } > > > > On Nov 28, 2005, at 8:06 AM, neeti somaiya wrote: > > > > > Hi, > > > > > > I am using BLAT in a project.I am having simple .psl output files > > > after > > > running BLAT of a gene sequences against full chromosomal > > > sequences.Doesanyone have a simple BLAT parse code. I am only > > > interested in obtaining the > > > alignment start and end positions on the target. > > > -- > > > -Neeti > > > Even my blood says, B positive > > > > > > _______________________________________________ > > > Bioperl-l mailing list > > > Bioperl-l@portal.open-bio.org > > > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > > > -- > > Jason Stajich > > Duke University > > http://www.duke.edu/~jes12 > > > > > > > > > > > > -- > > -Neeti > > Even my blood says, B positive > > -- > Jason Stajich > Duke University > http://www.duke.edu/~jes12 > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > _______________________________________________ Bioperl-l mailing list Bioperl-l@portal.open-bio.org http://portal.open-bio.org/mailman/listinfo/bioperl-l From qfdong at iastate.edu Thu Dec 1 15:04:44 2005 From: qfdong at iastate.edu (Qunfeng) Date: Thu Dec 1 15:02:33 2005 Subject: solved Re: [Bioperl-l] Deep recursion on subroutine In-Reply-To: <438E2A2D.6010305@med.usyd.edu.au> References: 
<6.1.2.0.2.20051130154054.04161190@qfdong.mail.iastate.edu> <438E2A2D.6010305@med.usyd.edu.au>
Message-ID: <6.1.2.0.2.20051201140243.040b25c8@qfdong.mail.iastate.edu>

Hello Jason and Jonathan,

Thanks for your help. Jason figured out the problem:

"This is just Perl complaining because the recursion is deep because there are so many levels in your tree (449 deep). It thinks it hit a snag because it doesn't expect to usually have a recursive call go that many levels. You can make the warnings go away by adding this

no warnings 'recursion';"

Qunfeng

At 04:39 PM 11/30/2005, Jonathan Arthur wrote:
>Hello Qunfeng,
>
>I have not seen this specifically with bioperl, but have had it occur once
>or twice in my own code and have always traced the problem back to an
>error in the tree where one node is its own ancestor, thereby causing an
>infinite recursion when you attempt to find all descendants from that node.
>
>If each node has a unique identifier, and if the tree is not too large,
>you could find the offending node with a small script to traverse the
>tree, testing the unique identifier of each node against a list of all the
>nodes seen before and dying when it sees the offending node again.
>
>Cheers,
>
>Jonathan
>
>Qunfeng wrote:
>
>>Hi,
>>
>>I am using bioperl (5.8.0, linux) to work on a UPGMA tree (newick format,
>>generated by PHYLIP). My code works well on a small tree. However, when
>>I applied it to a big (ugly) tree, it produces the following error msg.
>>Has anybody encountered a similar problem? Is this triggered by any
>>invalid part of my tree? Thanks!
>>
>>Qunfeng
>>===========Error message begins =====================
>>Deep recursion on subroutine "Bio::Tree::NodeI::get_all_Descendents" at
>>/usr/lib/perl5/site_perl/5.8.0/Bio/Tree/NodeI.pm line 172, line 1.
>>Deep recursion on subroutine "Bio::Tree::NodeI::get_all_Descendents" at
>>/usr/lib/perl5/site_perl/5.8.0/Bio/Tree/NodeI.pm line 172, line 1.
>>Deep recursion on subroutine "Bio::Tree::Node::each_Descendent" at
>>/usr/lib/perl5/site_perl/5.8.0/Bio/Tree/Node.pm line 495, line 1.
>>Deep recursion on subroutine "Bio::Tree::Node::height" at
>>/usr/lib/perl5/site_perl/5.8.0/Bio/Tree/Node.pm line 201, line 1.
>>Deep recursion on subroutine "Bio::Tree::Node::height" at
>>/usr/lib/perl5/site_perl/5.8.0/Bio/Tree/Node.pm line 496, line 1.
>>Deep recursion on subroutine "Bio::Root::Root::DESTROY" at
>>/usr/lib/perl5/site_perl/5.8.0/Bio/Root/Root.pm line 407, line 1.
>>=========Error msg ends==============================================
>
>--
>Dr Jonathan Arthur
>Sesqui Lecturer in Bioinformatics
>Central Clinical School, Faculty of Medicine and SUBIT
>Medical Foundation Building, K25
>University of Sydney
>Ph: +61 2 9036 3132
>Email: jarthur@med.usyd.edu.au

From chen_li3 at yahoo.com Fri Dec 2 00:00:10 2005
From: chen_li3 at yahoo.com (chen li)
Date: Fri Dec 2 00:04:32 2005
Subject: [Bioperl-l] bioperl-db and MySQL
In-Reply-To: <545bf9a43c75b54463ffd2077e9b562e@gmx.net>
Message-ID: <20051202050010.58169.qmail@web36812.mail.mud.yahoo.com>

Very special thanks to Hilmar and Barry, who helped me out with installing bioperl-db. The following is my experience installing bioperl-db:

1) Have a Linux operating system (in my case)
2) Use anonymous CVS to install bioperl and follow the HOWTO
3) The CPAN method doesn't work in my case

Li

From hubert.prielinger at gmx.at Thu Dec 1 17:49:21 2005
From: hubert.prielinger at gmx.at (Hubert Prielinger)
Date: Fri Dec 2 10:44:48 2005
Subject: [Bioperl-l] remoteblast doesn't save the Output File
Message-ID: <438F7DF1.5060000@gmx.at>

Hi,

I'm quite desperate: for two days I have been trying to save my remoteblast output and it doesn't work. Here we go....
#!/usr/bin/perl -w
use strict;
use warnings;
use Bio::SeqIO;
use Bio::Tools::Run::RemoteBlast;
use Bio::Seq;
use IO::String;
use Bio::SearchIO;

my $prog = 'blastp';
my $db = 'swissprot';
my $e_val= '20000';
my $matrix = 'PAM30';
#my $outfile = 'Output';
my @data;
my $line_dataArray;
my $rid;
my $count = 1;

my @params = ( '-prog' => $prog,
               '-data' => $db,
               '-expect' => $e_val,
               '-matrix' => $matrix);

my $seqio_obj = Bio::SeqIO->new(-file => "Perm.txt", -format => "raw", );
print "entering blast....";
my $factory = Bio::Tools::Run::RemoteBlast->new (@params);
print "Blast entered successfully \n";

while( my $query = $seqio_obj->next_seq ) {
    print "submit Sequence...just do it....\n";
    my $r = $factory->submit_blast($query);
    print $query->seq;
    print "\n";

    # Wait for the reply and save the output file
    print "entering while loop for saving Output.... \n";
    while ( my @rids = $factory->each_rid ) {
        foreach my $rid ( @rids ) {
            my $rc = $factory->retrieve_blast($rid);
            print "retrieved Results successfully \n";
            if( !ref($rc) ) {
                if( $rc < 0 ) {
                    $factory->remove_rid($rid);
                }
                sleep 5;
            } else {
                #my $result = $rc->next_result;
                print $rid;
                print "\n";
                my $filename = " $count .out";
                $factory->save_output($filename);
                print "File saved successfully \n";
                $count++;
                $factory->remove_rid($rid);
                # }
            }
        }
    }
    print "\n";
    print "\n";
}
}

I hope somebody can help me, thanks in advance

sincerely
Hubert

From erik.sjolund at gmail.com Fri Dec 2 11:47:22 2005
From: erik.sjolund at gmail.com (Erik Sjölund)
Date: Fri Dec 2 19:33:32 2005
Subject: [Bioperl-l] abi2xml a new parser for abi trace files
Message-ID:

Bioperl contains a module to parse abi trace files, called Bio::SeqIO::abi
http://doc.bioperl.org/releases/bioperl-1.4/Bio/SeqIO/abi.html

So you might be interested to know that a new command line utility has been released
http://abi2xml.sourceforge.net
that converts abi trace files to xml files. This bioinformatics utility is written in C++ and released under the GPL license.
A perl programmer could first convert the abi files to xml files and then access the information over a DOM interface or over XPATH. The advantage with this over Bio::SeqIO::abi is to get access to more information of the abi file. Like for instance the time when the experiment was done. I don't think that is possible with Bio::SeqIO:abi ( correct me if I'm wrong ). cheers, Erik Sj?lund From saldroubi at yahoo.com Sun Dec 4 15:17:34 2005 From: saldroubi at yahoo.com (Sam Al-Droubi) Date: Sun Dec 4 15:21:57 2005 Subject: [Bioperl-l] Bio:Seq $seq_obj->accession_number not returning accession number? Message-ID: <20051204201734.20504.qmail@web34305.mail.mud.yahoo.com> The fasta format for this sequence AF410462 from NCBI looks like this >gi|17066572|gb|AF410462.1|AF410462 Mus musculus PEM homeobox (Pem) gene, promoter region and partial cds ATGCGTGTGGGCATGCGCTCATGCCCACTTGCTTGAGCACATGTGTGCTCACATGGACGTTAGAGGCAAC TTTCAGGAGTTATTTTTTTCCCTTCTAACTTGAGTTCCTGGACCTCAGACTTGTATAATAGGTACTTTCC CAACTTAAGTCTTACTGGCTCCAGGGTATCTGGTATACTCTTCTAGCCTCCAAGGGCAGCCACTCATGCT TCTTCAGGTGTGAAGAGGTGAGCCAGATACAACGGTGGGAGGCAGTGTGCCCTCAGTGTGTAGACTCTTT ATGCCCTTGGGGATTAGCGCCTCTAGCTGCCAGTCGGGTCTCTGGGTCCCTCCTGCTAAGGCCACTCTCG TCATGGTTCCTCTTGTCCTGGTGAGCCATTACGACCCTCTCACTTCCTTGTGTTCTCTTCCCTGTGTTCT CTCTCTGCTGCTGTGGCCATTCTAGCTCCCTGCACAGTCCTTCAAGCTCACCTCCTGCCTTCCGTGGACA AGAGGAAGCACAAAGAATCATCCAGTATGTATGCTCATGGCATAAGGGGATCCTGGGGAAGGGCTGAAGC CTGAGCCGGGCTGGTCAACAGAATCTCCCTCTCCCTAACTCCATCTCCCTCTCCTTCCCTCTTCCTCTCT CTATCCCTCCCCCCTCTCTCCCCCCACCACCGCATGTTTTGGGTCAGCTGACTGCTCTAGCCTTGATGAG ATATCTTCCCAGGAAGAGTTGGTGCTGACTGTACAGATTGAGTTAGAGGGAGGGAAGAAAGCTCCTGTTT GATCACTGGAGATCTTTATGCCTAGCTACATGTCTTACCAAAGCCAGGGGAGTCAGCTGAGCTGTAACTG GGCACCCTAAGTTCTGCACACCCACATGCCCATGAACTGTGTCCATCTTGCAAGCACATCGTGCTCATTA CATCCCCAAACTGCTATCACTTGTGTACCCCAAAGGCTCGGCCCACAGGAACGTCCTGTGAGCAAATCAC AAAGACCAGCTTAGGGCTGGAAACATTGTAACCTGAAGTAGGCCAGAGGAGATCCCTGCCAGGTTGAGCA TCACAGATCTCATTCTGTTCCCGGGGACACCAGGGGCCCAAGCTCAGAATCTGCCGAAGCATAACTTCAT 
CATTGATCCTATTCAGGGTATGGAAGCTGAGGGTTCCAGCCGCAAGGTCACCAGGCTACTCCGCCTGGGA GTCAAGGAAG When I read this from a file as a sequence object using Bio::Seq I get accession_number unknow. The accession number is in the header of the fasta file. Anyone knows why this happens. My code looks like this: print "primary id is: ",$seq_obj->primary_id."\n"; print "Description is ",$seq_obj->desc."\n"; print "Accession Number is ",$seq_obj->accession_number."\n"; Output looks like this: primary id is: gi|17066572|gb|AF410462.1|AF410462 Description is Mus musculus PEM homeobox (Pem) gene, promoter region and partial cds Accession Number is unknown Thank you. Sincerely, Sam Al-Droubi, M.S. saldroubi@yahoo.com From bmoore at genetics.utah.edu Sun Dec 4 16:23:48 2005 From: bmoore at genetics.utah.edu (Barry Moore) Date: Sun Dec 4 16:32:49 2005 Subject: [Bioperl-l] Bio:Seq $seq_obj->accession_number not returningaccession number? Message-ID: Sam- The fasta parser makes no attempt to parse the fasta header since there is no standard format for what should be in a fasta header. Parse the accession out of the primary_id field with a regular expression in your script or use GenBank or ENSEMBL format sequences to get all the goodies parsed for you. Google on "accession fasta parse site:bioperl.org" to read other posts on this topic. Barry -----Original Message----- From: bioperl-l-bounces@portal.open-bio.org [mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Sam Al-Droubi Sent: Sunday, December 04, 2005 1:18 PM To: BioPerl list BioPerl list Subject: [Bioperl-l] Bio:Seq $seq_obj->accession_number not returningaccession number? 
The fasta format for this sequence AF410462 from NCBI looks like this >gi|17066572|gb|AF410462.1|AF410462 Mus musculus PEM homeobox (Pem) gene, promoter region and partial cds ATGCGTGTGGGCATGCGCTCATGCCCACTTGCTTGAGCACATGTGTGCTCACATGGACGTTAGAGGCAAC TTTCAGGAGTTATTTTTTTCCCTTCTAACTTGAGTTCCTGGACCTCAGACTTGTATAATAGGTACTTTCC CAACTTAAGTCTTACTGGCTCCAGGGTATCTGGTATACTCTTCTAGCCTCCAAGGGCAGCCACTCATGCT TCTTCAGGTGTGAAGAGGTGAGCCAGATACAACGGTGGGAGGCAGTGTGCCCTCAGTGTGTAGACTCTTT ATGCCCTTGGGGATTAGCGCCTCTAGCTGCCAGTCGGGTCTCTGGGTCCCTCCTGCTAAGGCCACTCTCG TCATGGTTCCTCTTGTCCTGGTGAGCCATTACGACCCTCTCACTTCCTTGTGTTCTCTTCCCTGTGTTCT CTCTCTGCTGCTGTGGCCATTCTAGCTCCCTGCACAGTCCTTCAAGCTCACCTCCTGCCTTCCGTGGACA AGAGGAAGCACAAAGAATCATCCAGTATGTATGCTCATGGCATAAGGGGATCCTGGGGAAGGGCTGAAGC CTGAGCCGGGCTGGTCAACAGAATCTCCCTCTCCCTAACTCCATCTCCCTCTCCTTCCCTCTTCCTCTCT CTATCCCTCCCCCCTCTCTCCCCCCACCACCGCATGTTTTGGGTCAGCTGACTGCTCTAGCCTTGATGAG ATATCTTCCCAGGAAGAGTTGGTGCTGACTGTACAGATTGAGTTAGAGGGAGGGAAGAAAGCTCCTGTTT GATCACTGGAGATCTTTATGCCTAGCTACATGTCTTACCAAAGCCAGGGGAGTCAGCTGAGCTGTAACTG GGCACCCTAAGTTCTGCACACCCACATGCCCATGAACTGTGTCCATCTTGCAAGCACATCGTGCTCATTA CATCCCCAAACTGCTATCACTTGTGTACCCCAAAGGCTCGGCCCACAGGAACGTCCTGTGAGCAAATCAC AAAGACCAGCTTAGGGCTGGAAACATTGTAACCTGAAGTAGGCCAGAGGAGATCCCTGCCAGGTTGAGCA TCACAGATCTCATTCTGTTCCCGGGGACACCAGGGGCCCAAGCTCAGAATCTGCCGAAGCATAACTTCAT CATTGATCCTATTCAGGGTATGGAAGCTGAGGGTTCCAGCCGCAAGGTCACCAGGCTACTCCGCCTGGGA GTCAAGGAAG When I read this from a file as a sequence object using Bio::Seq I get accession_number unknow. The accession number is in the header of the fasta file. Anyone knows why this happens. My code looks like this: print "primary id is: ",$seq_obj->primary_id."\n"; print "Description is ",$seq_obj->desc."\n"; print "Accession Number is ",$seq_obj->accession_number."\n"; Output looks like this: primary id is: gi|17066572|gb|AF410462.1|AF410462 Description is Mus musculus PEM homeobox (Pem) gene, promoter region and partial cds Accession Number is unknown Thank you. Sincerely, Sam Al-Droubi, M.S. 
saldroubi@yahoo.com
_______________________________________________
Bioperl-l mailing list
Bioperl-l@portal.open-bio.org
http://portal.open-bio.org/mailman/listinfo/bioperl-l

From jason.stajich at duke.edu Sun Dec 4 16:49:40 2005
From: jason.stajich at duke.edu (Jason Stajich)
Date: Sun Dec 4 16:47:21 2005
Subject: [Bioperl-l] Bio:Seq $seq_obj->accession_number not returningaccession number?
In-Reply-To:
References:
Message-ID: <539A294D-C541-4BF2-A501-17F69BCA34C6@duke.edu>

Sam -

Yeah, what Barry said. It doesn't get set when reading fasta files - see Hilmar's link below for more info - all the info is in the display id, available in $seq->display_id

my ($gi,$acc,$locus);
(undef,$gi,undef,$acc,$locus) = split(/\|/,$seq->display_id);
$seq->accession_number($acc);

I thought there was a function already to do this for you, but I guess not. There is something in Search::Hit objects to parse the accession number, so maybe we can consolidate this if someone volunteers to do it.

See also Hilmar's response about this:
http://bioperl.org/pipermail/bioperl-l/2005-August/019579.html

I've added it as a Q&A to the new wiki FAQ which we'll roll out soon.

-jason

On Dec 4, 2005, at 4:23 PM, Barry Moore wrote:

> Sam-
>
> The fasta parser makes no attempt to parse the fasta header since there
> is no standard format for what should be in a fasta header. Parse the
> accession out of the primary_id field with a regular expression in your
> script or use GenBank or ENSEMBL format sequences to get all the goodies
> parsed for you. Google on "accession fasta parse site:bioperl.org" to
> read other posts on this topic.
>
> Barry
>
> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org
> [mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Sam Al-Droubi
> Sent: Sunday, December 04, 2005 1:18 PM
> To: BioPerl list BioPerl list
> Subject: [Bioperl-l] Bio:Seq $seq_obj->accession_number not returningaccession number?
> > The fasta format for this sequence AF410462 from NCBI looks like this > > >> gi|17066572|gb|AF410462.1|AF410462 Mus musculus PEM homeobox (Pem) > gene, promoter region and partial cds > ATGCGTGTGGGCATGCGCTCATGCCCACTTGCTTGAGCACATGTGTGCTCACATGGACGTTAGAGGCAAC > TTTCAGGAGTTATTTTTTTCCCTTCTAACTTGAGTTCCTGGACCTCAGACTTGTATAATAGGTACTTTCC > CAACTTAAGTCTTACTGGCTCCAGGGTATCTGGTATACTCTTCTAGCCTCCAAGGGCAGCCACTCATGCT > TCTTCAGGTGTGAAGAGGTGAGCCAGATACAACGGTGGGAGGCAGTGTGCCCTCAGTGTGTAGACTCTTT > ATGCCCTTGGGGATTAGCGCCTCTAGCTGCCAGTCGGGTCTCTGGGTCCCTCCTGCTAAGGCCACTCTCG > TCATGGTTCCTCTTGTCCTGGTGAGCCATTACGACCCTCTCACTTCCTTGTGTTCTCTTCCCTGTGTTCT > CTCTCTGCTGCTGTGGCCATTCTAGCTCCCTGCACAGTCCTTCAAGCTCACCTCCTGCCTTCCGTGGACA > AGAGGAAGCACAAAGAATCATCCAGTATGTATGCTCATGGCATAAGGGGATCCTGGGGAAGGGCTGAAGC > CTGAGCCGGGCTGGTCAACAGAATCTCCCTCTCCCTAACTCCATCTCCCTCTCCTTCCCTCTTCCTCTCT > CTATCCCTCCCCCCTCTCTCCCCCCACCACCGCATGTTTTGGGTCAGCTGACTGCTCTAGCCTTGATGAG > ATATCTTCCCAGGAAGAGTTGGTGCTGACTGTACAGATTGAGTTAGAGGGAGGGAAGAAAGCTCCTGTTT > GATCACTGGAGATCTTTATGCCTAGCTACATGTCTTACCAAAGCCAGGGGAGTCAGCTGAGCTGTAACTG > GGCACCCTAAGTTCTGCACACCCACATGCCCATGAACTGTGTCCATCTTGCAAGCACATCGTGCTCATTA > CATCCCCAAACTGCTATCACTTGTGTACCCCAAAGGCTCGGCCCACAGGAACGTCCTGTGAGCAAATCAC > AAAGACCAGCTTAGGGCTGGAAACATTGTAACCTGAAGTAGGCCAGAGGAGATCCCTGCCAGGTTGAGCA > TCACAGATCTCATTCTGTTCCCGGGGACACCAGGGGCCCAAGCTCAGAATCTGCCGAAGCATAACTTCAT > CATTGATCCTATTCAGGGTATGGAAGCTGAGGGTTCCAGCCGCAAGGTCACCAGGCTACTCCGCCTGGGA > GTCAAGGAAG > > When I read this from a file as a sequence object using Bio::Seq I > get > accession_number unknow. The > accession number is in the header of the fasta file. Anyone knows > why > this happens. 
> > My code looks like this: > > print "primary id is: ",$seq_obj->primary_id."\n"; > print "Description is ",$seq_obj->desc."\n"; > print "Accession Number is ",$seq_obj->accession_number."\n"; > > Output looks like this: > > primary id is: gi|17066572|gb|AF410462.1|AF410462 > Description is Mus musculus PEM homeobox (Pem) gene, promoter region > and partial cds > Accession Number is unknown > > > Thank you. > > > > > > Sincerely, > Sam Al-Droubi, M.S. > saldroubi@yahoo.com > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l -- Jason Stajich Duke University http://www.duke.edu/~jes12 From angshu96 at gmail.com Sun Dec 4 20:32:20 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Sun Dec 4 22:11:49 2005 Subject: [Bioperl-l] parsing a BLAST output Message-ID: Hi, To begin with, I'm new to Bioperl. Now, I've written the following simple piece of code to parse a WU-Blast output which filters data *for a given e-value and >50% overlap*. I'm writing the main algorithm here: my $blast_report = $ARG[1]; my $threshold_evalue = $ARG[2]; my $in = new Bio::SearchIO(-format => 'blast', -file => $blast_report); while (my $result = $in -> next_result) { while(my $hit = $result->next_hit) { if(($line{$hit->name} == $line{$result->query_accession})) { next; } if($hit->hsp->evalue <= $threshold_evalue) { if($hit->hsp->frac_indentical>=0.5) { print $line{$result->query_accession} . "\t" . $line{$hit->name} . "\t" . $hit->hsp-evalue . "\n"; } } } } My questions are: 1. does the frac_identical gives the measure of % overlap? Or, are there any other methods? 2. 
Now, I don't have any BLAST data sets to test my code upon. Could any of the experienced users let me know whether the algorithm is fine? Any tip-offs on any point (from optimization to syntactical errors) are heartily welcome.
3. Could anyone please let me know if I can find sample WU-BLAST outputs to test my script upon?

Appreciate your guidance.

Thanks,
Angshu

From jason.stajich at duke.edu Sun Dec 4 23:00:06 2005
From: jason.stajich at duke.edu (Jason Stajich)
Date: Sun Dec 4 22:57:57 2005
Subject: [Bioperl-l] parsing a BLAST output
In-Reply-To:
References:
Message-ID: <82B9FC7C-BE37-4961-85BB-6014CEADE8E7@duke.edu>

frac_identical gives the fraction of bases in the HSP that are identical. Overlap would be calculated from (length of the query aligned) / (length of query) or (length of hit aligned) / (length of hit).

So for an HSP you can calculate this:

$fracqaligned = $hsp->query->length / $result->query_length;
$frachaligned = $hsp->hit->length / $hit->length;

But remember there may be multiple HSPs, so you may need to merge the HSP information to get the total length aligned. If there is a repeated domain or multiple high scoring suboptimal alignments this can cause things to get a little tricky. There are two methods provided in the HitI interface, frac_aligned_query() and frac_aligned_hit(), that try to take all of this into account for you, but I admit this is less well tested code. But do give them a try:

$fracqaligned = $hit->frac_aligned_query();
$frachaligned = $hit->frac_aligned_hit();

If you are using WU-BLASTP, add the -postsw option to get a refined alignment which will merge HSPs where appropriate, so you should use that. You can also use the -links option to get WU-BLAST to report the logical ordering and a consistent path through the HSPs.

On Dec 4, 2005, at 8:32 PM, Angshu Kar wrote:

> Hi,
>
> To begin with, I'm new to Bioperl.
> Now, I've written the following simple piece of code to parse a WU- > Blast > output which filters data *for a given e-value and >50% overlap*. > > I'm writing the main algorithm here: > > my $blast_report = $ARG[1]; > my $threshold_evalue = $ARG[2]; > > my $in = new Bio::SearchIO(-format => 'blast', -file => > $blast_report); > > while (my $result = $in -> next_result) > { > while(my $hit = $result->next_hit) > { > if(($line{$hit->name} == $line{$result->query_accession})) > { > next; > } > if($hit->hsp->evalue <= $threshold_evalue) > { > if($hit->hsp->frac_indentical>=0.5) > { > print $line{$result->query_accession} . "\t" . > $line{$hit->name} . "\t" . $hit->hsp-evalue . "\n"; > } > } > } > } > > My questions are: > > 1. does the frac_identical gives the measure of % overlap? Or, are > there any > other methods? > 2. now, i don't have any blast data sets to test my code upon.could > any of > the experienced users let me know whether the algorithm is fine?any > tip-offs on any point (from optimization to syntactical errors) are > heartily > welcome. > 3. could any one please let me know if i can find sample wu-blast > outputs to > test my script upon? http://fungal.genome.duke.edu/~jes12/BGT203.2005/sample_reports/ Also checkout the biodata repository from bioperl and look in the DB_Searching directory, we had started a project cataloging example reports in all the different formats. This sort of fizzled out, but could still use some volunteers to better organize things and incorporate more examples. http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biodata/ DB_Searching/ > Appreciate your guidance. 
> > Thanks, > Angshu > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l -- Jason Stajich Duke University http://www.duke.edu/~jes12 From osborne1 at optonline.net Mon Dec 5 09:46:06 2005 From: osborne1 at optonline.net (Brian Osborne) Date: Mon Dec 5 09:50:22 2005 Subject: [Bioperl-l] parsing a BLAST output In-Reply-To: Message-ID: Angshu, It looks like there's WU Blast output used in various tests: t/data/dnaEbsub_ecoli.wublastx t/data/ecolitst.wublastp t/data/ecolitst.noseqs.wublastp Brian O. On 12/4/05 8:32 PM, "Angshu Kar" wrote: > Hi, > > To begin with, I'm new to Bioperl. > Now, I've written the following simple piece of code to parse a WU-Blast > output which filters data *for a given e-value and >50% overlap*. > > I'm writing the main algorithm here: > > my $blast_report = $ARG[1]; > my $threshold_evalue = $ARG[2]; > > my $in = new Bio::SearchIO(-format => 'blast', -file => $blast_report); > > while (my $result = $in -> next_result) > { > while(my $hit = $result->next_hit) > { > if(($line{$hit->name} == $line{$result->query_accession})) > { > next; > } > if($hit->hsp->evalue <= $threshold_evalue) > { > if($hit->hsp->frac_indentical>=0.5) > { > print $line{$result->query_accession} . "\t" . > $line{$hit->name} . "\t" . $hit->hsp-evalue . "\n"; > } > } > } > } > > My questions are: > > 1. does the frac_identical gives the measure of % overlap? Or, are there any > other methods? > 2. now, i don't have any blast data sets to test my code upon.could any of > the experienced users let me know whether the algorithm is fine?any > tip-offs on any point (from optimization to syntactical errors) are heartily > welcome. > 3. could any one please let me know if i can find sample wu-blast outputs to > test my script upon? > > Appreciate your guidance. 
> > Thanks, > Angshu > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l From rsultana at jimmy.harvard.edu Mon Dec 5 13:27:13 2005 From: rsultana at jimmy.harvard.edu (Razvan Sultana) Date: Mon Dec 5 13:45:50 2005 Subject: [Bioperl-l] remote location support in BioSQL (bioperl-db) /Oracle Message-ID: <43948681.1050606@jimmy.harvard.edu> Hi ! I have a question regarding the bioperl-db implementation in Oracle. I have created a BioSQL schema in Oracle and populated with the latest version of Genbank. I used the "bleeding edge" version of bioperl, bioperl-db and bioperl-schema from the CVS repository. When I try to extract the spliced sequences of the CDS features of entries that have remote locations (e.g., for AF327267: join(4..190,AF327268.1:89..275,AF327268.1:780..930, AF327268.1:1049..1196,AF327269.1:63..229, AF327269.1:522..659,AF327269.1:784..917, AF327269.1:1461..1582,AF327270.1:100..173, AF327270.1:349..417) the appended code complains that it Can't locate object method "get_Seq_by_acc" via package "Bio::DB::BioSQL::DBAdaptor" and rightly so, because the above object doesn't implement the Bio::DB::RandomAccessI interface. If I omit $db from the spliced_seq() method call, I obviously get the message: MSG: cannot get remote location for AF327268.1 without a valid Bio::DB::RandomAccessI database handle (like Bio::DB::GenBank) My question is: is there an object to the BioSQL schema that implements Bio::DB::RandomAccessI ? If there is, how do I get a handle to it? If there isn't, I would be willing to help implementing it. 
Thank you,
Razvan Sultana

#!/usr/local/bin/perl
use strict;
use Bio::DB::BioDB;
use Bio::Seq::RichSeq;
use Bio::DB::Query::BioQuery;
use Getopt::Long;

my $host = 'kevin';
my $dbuser = 'biosql';
my $dbpass;
my $dbname = 'biocompd';
my $driver = 'Oracle';
my $acc;
my $biodbname = 'genbank';

&GetOptions( 'host=s'      => \$host,
             'driver=s'    => \$driver,
             'dbuser=s'    => \$dbuser,
             'dbpass=s'    => \$dbpass,
             'dbname=s'    => \$dbname,
             'accession=s' => \$acc);
$acc = 'AF327270' unless $acc;

my $db = Bio::DB::BioDB->new(-database => "biosql",
                             -host     => $host,
                             -dbname   => $dbname,
                             -driver   => $driver,
                             -user     => $dbuser,
                             -pass     => $dbpass,
                             );

my $seqadaptor = $db->get_object_adaptor('Bio::SeqI');
my $query = Bio::DB::Query::BioQuery->new(
    -datacollections => ['Bio::PrimarySeqI e',
                         'BioNamespace=>Bio::PrimarySeqI db'],
    -where           => ["e.accession_number = '$acc'",
                         "db.namespace= '$biodbname'"]);
my $seq_object = Bio::Seq::RichSeq->new();
my $query_result = $seqadaptor->find_by_query($query);

while ($seq_object = $query_result->next_object()) {
    for my $feature ($seq_object->get_SeqFeatures()) {
        my $full_sequence = $seq_object->seq();
        my $cds_seq = $feature->spliced_seq($db)->seq() if ($feature->primary_tag() eq 'CDS');
    }
}

From dyhyun at rda.go.kr Mon Dec 5 09:26:21 2005
From: dyhyun at rda.go.kr (=?ks_c_5601-1987?B?x/a1tcCx?=)
Date: Mon Dec 5 18:52:07 2005
Subject: [Bioperl-l] trouble to retrieve sequence.. I'm a BEGINNER...
Message-ID: <000a01c5f9a7$dd9a2a90$28661e0a@NIAB102040>

Hi,

I'm a bioperl beginner and studying bioperl code according to Beginners HOWTO.
I have trouble with section 9, Retrieving a sequence from a database.
How can I solve this problem? Please, help me~~

1. ver. of Bioperl : 1.4

2. OS : linux

3. I am trying to retrieve seq. from GenBank.

4.
code

#!/usr/bin/perl -w

use Bio::DB::GenBank;
use Bio::DB::Query::GenBank;

$query = "Arabidopsis[ORGN] AND topoisomerase[TITL] an 0:3000[SLEN]";
$query_obj = Bio::DB::Query::GenBank->new(-db => 'nucleotide',
                                          -query => $query);

$gb_obj = Bio::DB::GenBank->new;

$stream_obj = $gb_obj->get_Stream_by_query($query_obj);

while ($seq_obj = $stream_obj->next_seq){
    #do something with the sequence object
    print $seq_obj->display_id,"\t", $seq_obj->length, "\n";
}

5. error messages

Can't locate IO/String.pm in @INC (@INC contains: /usr/lib/perl5/5.6.0/i386-linux /usr/lib/perl5/5.6.0 /usr/lib/perl5/site_perl/5.6.0/i386-linux /usr/lib/perl5/site_perl/5.6.0 /usr/lib/perl5/site_perl .) at /usr/lib/perl5/site_perl/5.6.0/Bio/DB/WebDBSeqI.pm line 90.
BEGIN failed--compilation aborted at /usr/lib/perl5/site_perl/5.6.0/Bio/DB/WebDBSeqI.pm line 90.
Compilation failed in require at /usr/lib/perl5/site_perl/5.6.0/Bio/DB/NCBIHelper.pm line 82.
BEGIN failed--compilation aborted at /usr/lib/perl5/site_perl/5.6.0/Bio/DB/NCBIHelper.pm line 82.
Compilation failed in require at /usr/lib/perl5/site_perl/5.6.0/Bio/DB/GenBank.pm line 124.
BEGIN failed--compilation aborted at /usr/lib/perl5/site_perl/5.6.0/Bio/DB/GenBank.pm line 124.
Compilation failed in require at query.pl line 3.
BEGIN failed--compilation aborted at query.pl line 3.

ps. I read the FAQ pages. I found this answer.
"NCBI changed the web CGI script that provided this access. You must be
using bioperl <= 0.7.2. The developer release 0.9.3 contains this fix as
does the 1.0 release."
I can't understand what it means....

From torsten.seemann at infotech.monash.edu.au Mon Dec 5 19:56:33 2005
From: torsten.seemann at infotech.monash.edu.au (Torsten Seemann)
Date: Mon Dec 5 20:12:57 2005
Subject: [Bioperl-l] remoteblast doesn't save the Output File
In-Reply-To: <438F7DF1.5060000@gmx.at>
References: <438F7DF1.5060000@gmx.at>
Message-ID: <1133830593.20822.13.camel@chauvel.csse.monash.edu.au>

Hubert,

> I'm quite desperated, because since two days I'm trying to save my
> remoteblast Output and it doesn't work
> here we go....

I tried your script and it works for me. Here is the output:

% ./hubert.pl
Name "Bio::Tools::Run::RemoteBlast::OUT" used only once: possible typo at [garbled path] line 613.
entering blast....Blast entered successfully
submit Sequence...just do it....
MMVVVVVVVVVVVVVVVVVVVVGGGGGRGYHTHHGHHPLQWWWW
entering while loop for saving Output....
retrieved Results successfully
retrieved Results successfully
retrieved Results successfully
1133830353-27473-43968273674.BLASTQ4
File saved successfully

% ls
40 1 .out   4 hubert.pl*   4 Perm.txt

% head 1\ .out
BLASTP 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

RID: 1133830353-27473-43968273674.BLASTQ4

Database: Non-redundant SwissProt sequences

--
Torsten Seemann
Victorian Bioinformatics Consortium

From jason.stajich at duke.edu Mon Dec 5 19:02:41 2005
From: jason.stajich at duke.edu (Jason Stajich)
Date: Mon Dec 5 20:14:09 2005
Subject: [Bioperl-l] trouble to retrieve sequence.. I'm a BEGINNER...
In-Reply-To: <000a01c5f9a7$dd9a2a90$28661e0a@NIAB102040>
References: <000a01c5f9a7$dd9a2a90$28661e0a@NIAB102040>
Message-ID: <124F8CE0-A0D0-40DD-9772-2C337801D53C@duke.edu>

You need IO::String and LWP::UserAgent perl modules installed to access
GenBank remote database.
The FAQ answer you are referring to only has to do with bioperl versions before 0.7 - more than 3 years ago so it isn't relevant anymore. -jason On Dec 5, 2005, at 9:26 AM, ??? wrote: > Hi, > > I'm a bioperl beginner and studying bioperl code according to > Beginners HOWTO. > I have a trouble in section 9. Retrievin a sequence from a database. > How can I solve this promblem? Please, help me~~ > > 1. ver. of Bioperl : 1.4 > > 2. OS : linux > > 3. I am trying to retrieve seq. from GenBank. > > 4. code > #!/usr/bin/perl -w > > use Bio::DB::GenBank; > use Bio::DB::Query::GenBank; > > $query = "Arabidopsis[ORGN] AND topoisomerase[TITL] an 0:3000[SLEN]"; > $query_obj = Bio::DB::Query::GenBank->new(-db => 'nucleotide', > -query => $query); > > $gb_obj = Bio::DB::GenBank->new; > > $stream_obj = $gb_obj->get_Stream_by_query($query_obj); > > while ($seq_obj = $stream_obj->next_seq){ > #do something with the sequence object > print $seq_obj->display_id,"\t", $seq_obj->length, "\n"; > } > > 5. error messages > > Can't locate IO/String.pm in @INC (@INC contains: /usr/lib/ > perl5/5.6.0/i386-linux /usr/lib/perl5/5.6.0 /usr/lib/perl5/ > site_perl/5.6.0/i386-linux /usr/lib/perl5/site_perl/5.6.0 /usr/lib/ > perl5/site_perl .) at /usr/lib/perl5/site_perl/5.6.0/Bio/DB/ > WebDBSeqI.pm line 90. > BEGIN failed--compilation aborted at /usr/lib/perl5/site_perl/5.6.0/ > Bio/DB/WebDBSeqI.pm line 90. > Compilation failed in require at /usr/lib/perl5/site_perl/5.6.0/Bio/ > DB/NCBIHelper.pm line 82. > BEGIN failed--compilation aborted at /usr/lib/perl5/site_perl/5.6.0/ > Bio/DB/NCBIHelper.pm line 82. > Compilation failed in require at /usr/lib/perl5/site_perl/5.6.0/Bio/ > DB/GenBank.pm line 124. > BEGIN failed--compilation aborted at /usr/lib/perl5/site_perl/5.6.0/ > Bio/DB/GenBank.pm line 124. > Compilation failed in require at query.pl line 3. > BEGIN failed--compilation aborted at query.pl line 3. > > ps. I read the FAQ pages. I found this answer. 
> "NCBI changed the web CGI script that provided this access. You > must be using bioperl <= 0.7.2. The developer release 0.9.3 > contains this fix as does the 1.0 release." > I cant't understand what it means.... > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l -- Jason Stajich Duke University http://www.duke.edu/~jes12 From torsten.seemann at infotech.monash.edu.au Mon Dec 5 22:05:53 2005 From: torsten.seemann at infotech.monash.edu.au (Torsten Seemann) Date: Mon Dec 5 22:04:53 2005 Subject: [Bioperl-l] remoteblast doesn't save the Output File In-Reply-To: <4394DC21.90401@gmx.at> References: <438F7DF1.5060000@gmx.at> <1133830593.20822.13.camel@chauvel.csse.monash.edu.au> <4394DC21.90401@gmx.at> Message-ID: <1133838353.20822.32.camel@chauvel.csse.monash.edu.au> On Mon, 2005-12-05 at 18:32 -0600, Hubert Prielinger wrote: > Hello Torsten, > thanks for your Response, I have already managed to run the program, > thank you very much. > but I still have the Problem, that I don't know how set the Gap > Parameters, because I want to use PAM30, but > it doesn't work with the Default Parameters...... > If you know how to change them, I would really appreciate telling me. I > have tried "gap" and "ext", but it doesn't work. > and I can't find anything online... If you read the module manual page ie. 

perldoc Bio::Tools::Run::RemoteBlast

there is an example - just set:

$Bio::Tools::Run::RemoteBlast::HEADER{'MATRIX_NAME'} = 'PAM30';

'MATRIX_NAME' is just one possibility, the perldoc refers to this

http://www.ncbi.nlm.nih.gov/BLAST/Doc/urlapi.html

which has the 'GAP_COSTS' you are looking for:

http://www.ncbi.nlm.nih.gov/BLAST/Doc/node28.html

--
Torsten Seemann
Victorian Bioinformatics Consortium

From torsten.seemann at infotech.monash.edu.au Mon Dec 5 19:49:17 2005
From: torsten.seemann at infotech.monash.edu.au (Torsten Seemann)
Date: Tue Dec 6 00:26:46 2005
Subject: [Bioperl-l] trouble to retrieve sequence.. I'm a BEGINNER...
In-Reply-To: <000a01c5f9a7$dd9a2a90$28661e0a@NIAB102040>
References: <000a01c5f9a7$dd9a2a90$28661e0a@NIAB102040>
Message-ID: <1133830157.20822.9.camel@chauvel.csse.monash.edu.au>

> I'm a bioperl beginner and studying bioperl code according to Beginners HOWTO.
> I have a trouble in section 9. Retrievin a sequence from a database.
> How can I solve this promblem? Please, help me~~
> 1. ver. of Bioperl : 1.4
> 2. OS : linux
> 3. I am trying to retrieve seq. from GenBank.
> 4. code
> 5. error messages
> Can't locate IO/String.pm in @INC (@INC contains: /usr/lib/perl5/5.6.0/i386-linux /usr/lib/perl5/5.6.0 /usr/lib/perl5/site_perl/5.6.0/i386-linux /usr/lib/perl5/site_perl/5.6.0 /usr/lib/perl5/site_perl .) at /usr/lib/perl5/site_perl/5.6.0/Bio/DB/WebDBSeqI.pm line 90.

When you get a lot of error messages, you should look at the first one
and try and solve that first. It says it is trying to load the Perl
IO::String module (via @INC search path) but can't locate it.

You need to install the IO::String module.

How to install Perl modules:
http://perl.about.com/od/perlmodule1/l/aa030500a.htm

IO::String source code:
http://search.cpan.org/CPAN/authors/id/G/GA/GAAS/IO-String-1.07.tar.gz

Then try your script again. You may find there are other modules which
need to be installed too.
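
One way to find out up front which modules are still missing is to try
loading each of them before running the real script. The following is
only a sketch in plain Perl: the helper name module_available is made up
for this example and is not part of BioPerl, and the list contains just
the two modules named in this thread (IO::String and LWP::UserAgent), so
other modules may also be needed on a given installation.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not part of BioPerl): returns 1 if the named
# module can be loaded from the current @INC search path, 0 otherwise.
sub module_available {
    my ($module) = @_;
    # Turn Foo::Bar into Foo/Bar.pm, the form require() looks up in @INC.
    (my $file = "$module.pm") =~ s{::}{/}g;
    return eval { require $file; 1 } ? 1 : 0;
}

# The two modules named in this thread for remote GenBank access.
for my $module ('IO::String', 'LWP::UserAgent') {
    printf "%-16s %s\n", $module,
        module_available($module) ? 'installed' : 'MISSING';
}
```

Any module reported as MISSING would then be installed in the same way as
IO::String before trying the script again.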

--
Torsten Seemann
Victorian Bioinformatics Consortium

From torsten.seemann at infotech.monash.edu.au Tue Dec 6 01:28:00 2005
From: torsten.seemann at infotech.monash.edu.au (Torsten Seemann)
Date: Tue Dec 6 02:59:18 2005
Subject: [Bioperl-l] trouble to retrieve sequence.. I'm a BEGINNER...
In-Reply-To: <001601c5fa26$277816d0$28661e0a@NIAB102040>
References: <000a01c5f9a7$dd9a2a90$28661e0a@NIAB102040> <1133830157.20822.9.camel@chauvel.csse.monash.edu.au> <001601c5fa26$277816d0$28661e0a@NIAB102040>
Message-ID: <1133850480.20822.58.camel@chauvel.csse.monash.edu.au>

Hyun,

> Thank you for your help.
> I solved the problem by installing IO:String module...but
> there are another problems like this;
> ------------- EXCEPTION -------------
> MSG: Error from Genbank: Your query produced warning/error messages - check flagged link below.
> STACK Bio::DB::Query::GenBank::_parse_response /usr/lib/perl5/site_perl/5.6.0/Bio/DB/Query/GenBank.pm:267
> STACK Bio::DB::Query::WebQuery::_run_query /usr/lib/perl5/site_perl/5.6.0/Bio/DB/Query/WebQuery.pm:268
> STACK Bio::DB::Query::GenBank::cookie /usr/lib/perl5/site_perl/5.6.0/Bio/DB/Query/GenBank.pm:177
> STACK Bio::DB::NCBIHelper::get_request /usr/lib/perl5/site_perl/5.6.0/Bio/DB/NCBIHelper.pm:187
> STACK Bio::DB::WebDBSeqI::get_seq_stream /usr/lib/perl5/site_perl/5.6.0/Bio/DB/WebDBSeqI.pm:438
> STACK Bio::DB::NCBIHelper::get_Stream_by_query /usr/lib/perl5/site_perl/5.6.0/Bio/DB/NCBIHelper.pm:248
> STACK toplevel query.pl:12
> Could you help me one more time, please?

It seems the MSG: line says that your Genbank query was invalid.
Your original query was:

$query = "Arabidopsis[ORGN] AND topoisomerase[TITL] an 0:3000[SLEN]";

Should that be " AND 0:3000[SLEN]" ?
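
If the stray "an" is indeed what GenBank objects to, the script from the
earlier message would look something like the sketch below once the
operator is spelled out as AND. It uses only the BioPerl calls already
shown in this thread; the "use strict", the "my" declarations and the
eval wrapper are additions made for illustration. It still needs BioPerl,
IO::String, LWP::UserAgent and network access to NCBI to run, and it has
not been tested against the current server.

```perl
#!/usr/bin/perl
use strict;
use warnings;

use Bio::DB::GenBank;
use Bio::DB::Query::GenBank;

# Same query as before, but with the boolean operator spelled out as AND.
my $query = "Arabidopsis[ORGN] AND topoisomerase[TITL] AND 0:3000[SLEN]";

# BioPerl reports errors by dying, so the eval lets us print the message
# and carry on (an illustrative choice, not a requirement).
eval {
    my $query_obj = Bio::DB::Query::GenBank->new(-db    => 'nucleotide',
                                                 -query => $query);
    my $gb_obj     = Bio::DB::GenBank->new;
    my $stream_obj = $gb_obj->get_Stream_by_query($query_obj);

    while (my $seq_obj = $stream_obj->next_seq) {
        # Print the ID and length of each sequence that matched the query.
        print $seq_obj->display_id, "\t", $seq_obj->length, "\n";
    }
    1;
} or print "GenBank request failed: $@\n";
```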
-- Torsten Seemann Victorian Bioinformatics Consortium From hlapp at gmx.net Tue Dec 6 11:44:41 2005 From: hlapp at gmx.net (Hilmar Lapp) Date: Tue Dec 6 11:49:08 2005 Subject: [Bioperl-l] remote location support in BioSQL (bioperl-db) /Oracle In-Reply-To: <43948681.1050606@jimmy.harvard.edu> References: <43948681.1050606@jimmy.harvard.edu> Message-ID: Good point. No there is no such module, i.e., one that would implement RandomAccessI for BioSQL. Shouldn't be hard to write though; would you be inclined to volunteer? -hilmar On Dec 5, 2005, at 10:27 AM, Razvan Sultana wrote: > Hi ! > I have a question regarding the bioperl-db implementation in Oracle. > I have created a BioSQL schema in Oracle and populated with the latest > version of Genbank. > I used the "bleeding edge" version of bioperl, bioperl-db and > bioperl-schema from the CVS repository. > > When I try to extract the spliced sequences of the CDS features of > entries that have remote locations (e.g., for AF327267: > join(4..190,AF327268.1:89..275,AF327268.1:780..930, > AF327268.1:1049..1196,AF327269.1:63..229, > AF327269.1:522..659,AF327269.1:784..917, > AF327269.1:1461..1582,AF327270.1:100..173, > AF327270.1:349..417) > the appended code complains that it > Can't locate object method "get_Seq_by_acc" via package > "Bio::DB::BioSQL::DBAdaptor" > and rightly so, because the above object doesn't implement the > Bio::DB::RandomAccessI interface. > If I omit $db from the spliced_seq() method call, I obviously get the > message: > MSG: cannot get remote location for AF327268.1 without a valid > Bio::DB::RandomAccessI database handle (like Bio::DB::GenBank) > > My question is: is there an object to the BioSQL schema that > implements Bio::DB::RandomAccessI ? > If there is, how do I get a handle to it? > If there isn't, I would be willing to help implementing it. 
> > Thank you, > Razvan Sultana > > #!/usr/local/bin/perl > use strict; > use Bio::DB::BioDB; > use Bio::Seq::RichSeq; > use Bio::DB::Query::BioQuery; > use Getopt::Long; > > my $host = 'kevin'; > my $dbuser = 'biosql'; > my $dbpass; > my $dbname = 'biocompd'; > my $driver = 'Oracle'; > my $acc; > my $biodbname = 'genbank'; > > &GetOptions( 'host=s' => \$host, > 'driver=s' => \$driver, > 'dbuser=s' => \$dbuser, > 'dbpass=s' => \$dbpass, > 'dbname=s' => \$dbname, > 'accession=s' => \$acc); > $acc = 'AF327270' unless $acc; > > my $db = Bio::DB::BioDB->new(-database => "biosql", > -host => $host, > -dbname => $dbname, > -driver => $driver, > -user => $dbuser, > -pass => $dbpass, > ); > > my $seqadaptor = $db->get_object_adaptor('Bio::SeqI'); > my $query = Bio::DB::Query::BioQuery->new(-datacollections => > ['Bio::PrimarySeqI e', > > 'BioNamespace=>Bio::PrimarySeqI db'], > -where => > ["e.accession_number = '$acc'", > "db.namespace= > '$biodbname'"]); > my $seq_object = Bio::Seq::RichSeq->new(); > my $query_result = $seqadaptor->find_by_query($query); > > while ($seq_object = $query_result->next_object()) { > for my $feature ($seq_object->get_SeqFeatures()) { > my $full_sequence = $seq_object->seq(); > my $cds_seq = $feature->spliced_seq($db)->seq() if > ($feature->primary_tag() eq 'CDS'); > } > } > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 
92121 phone: +1-858-812-1757 ------------------------------------------------------------- From olenka.m at gmail.com Tue Dec 6 12:13:53 2005 From: olenka.m at gmail.com (Olena Morozova) Date: Tue Dec 6 12:11:35 2005 Subject: [Bioperl-l] (no subject) Message-ID: <259a224c0512060913m71f41da7q5791a4b9bd6270c7@mail.gmail.com> Hi, I am trying to extract sequences from a fasta file using the script below (the fasta file has been indexed). I keep getting the message: failure to provide a valid PrimarySeqI object. Why is this? Thanks a lot for all your help!! Olena #! /usr/bin/perl -w use Bio::Index::Fasta; use strict; my $Index_File_Name = "c:/perl/known.fa"; my $inx = Bio::Index::Fasta->new('-filename' => $Index_File_Name); my $out = Bio::SeqIO->new('-format' => 'Fasta','-fh' => \*STDOUT); foreach my $id ('puffer_tf_id1.txt') { my $seq = $inx->fetch($id); # Returns Bio::Seq object $out->write_seq($seq); } On 12/6/05, Hilmar Lapp wrote: > Good point. No there is no such module, i.e., one that would implement > RandomAccessI for BioSQL. Shouldn't be hard to write though; would you > be inclined to volunteer? > > -hilmar > > On Dec 5, 2005, at 10:27 AM, Razvan Sultana wrote: > > > Hi ! > > I have a question regarding the bioperl-db implementation in Oracle. > > I have created a BioSQL schema in Oracle and populated with the latest > > version of Genbank. > > I used the "bleeding edge" version of bioperl, bioperl-db and > > bioperl-schema from the CVS repository. 
> > > > When I try to extract the spliced sequences of the CDS features of > > entries that have remote locations (e.g., for AF327267: > > join(4..190,AF327268.1:89..275,AF327268.1:780..930, > > AF327268.1:1049..1196,AF327269.1:63..229, > > AF327269.1:522..659,AF327269.1:784..917, > > AF327269.1:1461..1582,AF327270.1:100..173, > > AF327270.1:349..417) > > the appended code complains that it > > Can't locate object method "get_Seq_by_acc" via package > > "Bio::DB::BioSQL::DBAdaptor" > > and rightly so, because the above object doesn't implement the > > Bio::DB::RandomAccessI interface. > > If I omit $db from the spliced_seq() method call, I obviously get the > > message: > > MSG: cannot get remote location for AF327268.1 without a valid > > Bio::DB::RandomAccessI database handle (like Bio::DB::GenBank) > > > > My question is: is there an object to the BioSQL schema that > > implements Bio::DB::RandomAccessI ? > > If there is, how do I get a handle to it? > > If there isn't, I would be willing to help implementing it. 
> > > > Thank you, > > Razvan Sultana > > > > #!/usr/local/bin/perl > > use strict; > > use Bio::DB::BioDB; > > use Bio::Seq::RichSeq; > > use Bio::DB::Query::BioQuery; > > use Getopt::Long; > > > > my $host = 'kevin'; > > my $dbuser = 'biosql'; > > my $dbpass; > > my $dbname = 'biocompd'; > > my $driver = 'Oracle'; > > my $acc; > > my $biodbname = 'genbank'; > > > > &GetOptions( 'host=s' => \$host, > > 'driver=s' => \$driver, > > 'dbuser=s' => \$dbuser, > > 'dbpass=s' => \$dbpass, > > 'dbname=s' => \$dbname, > > 'accession=s' => \$acc); > > $acc = 'AF327270' unless $acc; > > > > my $db = Bio::DB::BioDB->new(-database => "biosql", > > -host => $host, > > -dbname => $dbname, > > -driver => $driver, > > -user => $dbuser, > > -pass => $dbpass, > > ); > > > > my $seqadaptor = $db->get_object_adaptor('Bio::SeqI'); > > my $query = Bio::DB::Query::BioQuery->new(-datacollections => > > ['Bio::PrimarySeqI e', > > > > 'BioNamespace=>Bio::PrimarySeqI db'], > > -where => > > ["e.accession_number = '$acc'", > > "db.namespace= > > '$biodbname'"]); > > my $seq_object = Bio::Seq::RichSeq->new(); > > my $query_result = $seqadaptor->find_by_query($query); > > > > while ($seq_object = $query_result->next_object()) { > > for my $feature ($seq_object->get_SeqFeatures()) { > > my $full_sequence = $seq_object->seq(); > > my $cds_seq = $feature->spliced_seq($db)->seq() if > > ($feature->primary_tag() eq 'CDS'); > > } > > } > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l@portal.open-bio.org > > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > > > > -- > ------------------------------------------------------------- > Hilmar Lapp email: lapp at gnf.org > GNF, San Diego, Ca. 
92121 phone: +1-858-812-1757 > ------------------------------------------------------------- > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > From osborne1 at optonline.net Tue Dec 6 12:28:50 2005 From: osborne1 at optonline.net (Brian Osborne) Date: Tue Dec 6 12:33:06 2005 Subject: [Bioperl-l] (no subject) In-Reply-To: <259a224c0512060913m71f41da7q5791a4b9bd6270c7@mail.gmail.com> Message-ID: Olena, This is _probably_ the problem described in the FAQ: http://bioperl.org/Core/Latest/faq.html#Q2.5 Brian O. On 12/6/05 12:13 PM, "Olena Morozova" wrote: > Hi, > > I am trying to extract sequences from a fasta file using the script > below (the fasta file has been indexed). I keep getting the message: > failure to provide a valid PrimarySeqI object. Why is this? > > Thanks a lot for all your help!! > Olena > > > #! /usr/bin/perl -w > > use Bio::Index::Fasta; > use strict; > > my $Index_File_Name = "c:/perl/known.fa"; > my $inx = Bio::Index::Fasta->new('-filename' => $Index_File_Name); > my $out = Bio::SeqIO->new('-format' => 'Fasta','-fh' => \*STDOUT); > > foreach my $id ('puffer_tf_id1.txt') { > my $seq = $inx->fetch($id); # Returns Bio::Seq object > $out->write_seq($seq); > } > On 12/6/05, Hilmar Lapp wrote: >> Good point. No there is no such module, i.e., one that would implement >> RandomAccessI for BioSQL. Shouldn't be hard to write though; would you >> be inclined to volunteer? >> >> -hilmar >> >> On Dec 5, 2005, at 10:27 AM, Razvan Sultana wrote: >> >>> Hi ! >>> I have a question regarding the bioperl-db implementation in Oracle. >>> I have created a BioSQL schema in Oracle and populated with the latest >>> version of Genbank. >>> I used the "bleeding edge" version of bioperl, bioperl-db and >>> bioperl-schema from the CVS repository. 
>>> >>> When I try to extract the spliced sequences of the CDS features of >>> entries that have remote locations (e.g., for AF327267: >>> join(4..190,AF327268.1:89..275,AF327268.1:780..930, >>> AF327268.1:1049..1196,AF327269.1:63..229, >>> AF327269.1:522..659,AF327269.1:784..917, >>> AF327269.1:1461..1582,AF327270.1:100..173, >>> AF327270.1:349..417) >>> the appended code complains that it >>> Can't locate object method "get_Seq_by_acc" via package >>> "Bio::DB::BioSQL::DBAdaptor" >>> and rightly so, because the above object doesn't implement the >>> Bio::DB::RandomAccessI interface. >>> If I omit $db from the spliced_seq() method call, I obviously get the >>> message: >>> MSG: cannot get remote location for AF327268.1 without a valid >>> Bio::DB::RandomAccessI database handle (like Bio::DB::GenBank) >>> >>> My question is: is there an object to the BioSQL schema that >>> implements Bio::DB::RandomAccessI ? >>> If there is, how do I get a handle to it? >>> If there isn't, I would be willing to help implementing it. 
>>> >>> Thank you, >>> Razvan Sultana >>> >>> #!/usr/local/bin/perl >>> use strict; >>> use Bio::DB::BioDB; >>> use Bio::Seq::RichSeq; >>> use Bio::DB::Query::BioQuery; >>> use Getopt::Long; >>> >>> my $host = 'kevin'; >>> my $dbuser = 'biosql'; >>> my $dbpass; >>> my $dbname = 'biocompd'; >>> my $driver = 'Oracle'; >>> my $acc; >>> my $biodbname = 'genbank'; >>> >>> &GetOptions( 'host=s' => \$host, >>> 'driver=s' => \$driver, >>> 'dbuser=s' => \$dbuser, >>> 'dbpass=s' => \$dbpass, >>> 'dbname=s' => \$dbname, >>> 'accession=s' => \$acc); >>> $acc = 'AF327270' unless $acc; >>> >>> my $db = Bio::DB::BioDB->new(-database => "biosql", >>> -host => $host, >>> -dbname => $dbname, >>> -driver => $driver, >>> -user => $dbuser, >>> -pass => $dbpass, >>> ); >>> >>> my $seqadaptor = $db->get_object_adaptor('Bio::SeqI'); >>> my $query = Bio::DB::Query::BioQuery->new(-datacollections => >>> ['Bio::PrimarySeqI e', >>> >>> 'BioNamespace=>Bio::PrimarySeqI db'], >>> -where => >>> ["e.accession_number = '$acc'", >>> "db.namespace= >>> '$biodbname'"]); >>> my $seq_object = Bio::Seq::RichSeq->new(); >>> my $query_result = $seqadaptor->find_by_query($query); >>> >>> while ($seq_object = $query_result->next_object()) { >>> for my $feature ($seq_object->get_SeqFeatures()) { >>> my $full_sequence = $seq_object->seq(); >>> my $cds_seq = $feature->spliced_seq($db)->seq() if >>> ($feature->primary_tag() eq 'CDS'); >>> } >>> } >>> _______________________________________________ >>> Bioperl-l mailing list >>> Bioperl-l@portal.open-bio.org >>> http://portal.open-bio.org/mailman/listinfo/bioperl-l >>> >>> >> -- >> ------------------------------------------------------------- >> Hilmar Lapp email: lapp at gnf.org >> GNF, San Diego, Ca. 
92121 phone: +1-858-812-1757 >> ------------------------------------------------------------- >> >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l@portal.open-bio.org >> http://portal.open-bio.org/mailman/listinfo/bioperl-l >> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l From olenka.m at gmail.com Tue Dec 6 11:57:31 2005 From: olenka.m at gmail.com (Olena Morozova) Date: Tue Dec 6 12:51:56 2005 Subject: [Bioperl-l] Reciprocal best blast Message-ID: <259a224c0512060857r519a897fpa4ab2f33287a0f44@mail.gmail.com> Hello everyone, Does anyone know where I can quickly get a perl script to identify reciprocal best blast matches for two fasta files? Thanks a lot for your help, Olena From olenka.m at gmail.com Tue Dec 6 12:34:21 2005 From: olenka.m at gmail.com (Olena Morozova) Date: Tue Dec 6 13:37:07 2005 Subject: [Bioperl-l] Re: (no subject) In-Reply-To: References: <259a224c0512060913m71f41da7q5791a4b9bd6270c7@mail.gmail.com> Message-ID: <259a224c0512060934k3c27d88fw3b035bd2b4c7371a@mail.gmail.com> Thank you very much for your reply, Brian. The script works when I write out the actual ID like this: my $seq = $inx->fetch('GSTENP00038948001') However, it does not work when I try to fetch the sequence by the same ID written in a file (I tried just putting this one ID in the file)... On 12/6/05, Brian Osborne wrote: > Olena, > > This is _probably_ the problem described in the FAQ: > > http://bioperl.org/Core/Latest/faq.html#Q2.5 > > > Brian O. > > > On 12/6/05 12:13 PM, "Olena Morozova" wrote: > > > Hi, > > > > I am trying to extract sequences from a fasta file using the script > > below (the fasta file has been indexed). I keep getting the message: > > failure to provide a valid PrimarySeqI object. Why is this? > > > > Thanks a lot for all your help!! > > Olena > > > > > > #! 
/usr/bin/perl -w > > > > use Bio::Index::Fasta; > > use strict; > > > > my $Index_File_Name = "c:/perl/known.fa"; > > my $inx = Bio::Index::Fasta->new('-filename' => $Index_File_Name); > > my $out = Bio::SeqIO->new('-format' => 'Fasta','-fh' => \*STDOUT); > > > > foreach my $id ('puffer_tf_id1.txt') { > > my $seq = $inx->fetch($id); # Returns Bio::Seq object > > $out->write_seq($seq); > > } > > On 12/6/05, Hilmar Lapp wrote: > >> Good point. No there is no such module, i.e., one that would implement > >> RandomAccessI for BioSQL. Shouldn't be hard to write though; would you > >> be inclined to volunteer? > >> > >> -hilmar > >> > >> On Dec 5, 2005, at 10:27 AM, Razvan Sultana wrote: > >> > >>> Hi ! > >>> I have a question regarding the bioperl-db implementation in Oracle. > >>> I have created a BioSQL schema in Oracle and populated with the latest > >>> version of Genbank. > >>> I used the "bleeding edge" version of bioperl, bioperl-db and > >>> bioperl-schema from the CVS repository. > >>> > >>> When I try to extract the spliced sequences of the CDS features of > >>> entries that have remote locations (e.g., for AF327267: > >>> join(4..190,AF327268.1:89..275,AF327268.1:780..930, > >>> AF327268.1:1049..1196,AF327269.1:63..229, > >>> AF327269.1:522..659,AF327269.1:784..917, > >>> AF327269.1:1461..1582,AF327270.1:100..173, > >>> AF327270.1:349..417) > >>> the appended code complains that it > >>> Can't locate object method "get_Seq_by_acc" via package > >>> "Bio::DB::BioSQL::DBAdaptor" > >>> and rightly so, because the above object doesn't implement the > >>> Bio::DB::RandomAccessI interface. > >>> If I omit $db from the spliced_seq() method call, I obviously get the > >>> message: > >>> MSG: cannot get remote location for AF327268.1 without a valid > >>> Bio::DB::RandomAccessI database handle (like Bio::DB::GenBank) > >>> > >>> My question is: is there an object to the BioSQL schema that > >>> implements Bio::DB::RandomAccessI ? 
> >>> If there is, how do I get a handle to it? > >>> If there isn't, I would be willing to help implementing it. > >>> > >>> Thank you, > >>> Razvan Sultana > >>> > >>> #!/usr/local/bin/perl > >>> use strict; > >>> use Bio::DB::BioDB; > >>> use Bio::Seq::RichSeq; > >>> use Bio::DB::Query::BioQuery; > >>> use Getopt::Long; > >>> > >>> my $host = 'kevin'; > >>> my $dbuser = 'biosql'; > >>> my $dbpass; > >>> my $dbname = 'biocompd'; > >>> my $driver = 'Oracle'; > >>> my $acc; > >>> my $biodbname = 'genbank'; > >>> > >>> &GetOptions( 'host=s' => \$host, > >>> 'driver=s' => \$driver, > >>> 'dbuser=s' => \$dbuser, > >>> 'dbpass=s' => \$dbpass, > >>> 'dbname=s' => \$dbname, > >>> 'accession=s' => \$acc); > >>> $acc = 'AF327270' unless $acc; > >>> > >>> my $db = Bio::DB::BioDB->new(-database => "biosql", > >>> -host => $host, > >>> -dbname => $dbname, > >>> -driver => $driver, > >>> -user => $dbuser, > >>> -pass => $dbpass, > >>> ); > >>> > >>> my $seqadaptor = $db->get_object_adaptor('Bio::SeqI'); > >>> my $query = Bio::DB::Query::BioQuery->new(-datacollections => > >>> ['Bio::PrimarySeqI e', > >>> > >>> 'BioNamespace=>Bio::PrimarySeqI db'], > >>> -where => > >>> ["e.accession_number = '$acc'", > >>> "db.namespace= > >>> '$biodbname'"]); > >>> my $seq_object = Bio::Seq::RichSeq->new(); > >>> my $query_result = $seqadaptor->find_by_query($query); > >>> > >>> while ($seq_object = $query_result->next_object()) { > >>> for my $feature ($seq_object->get_SeqFeatures()) { > >>> my $full_sequence = $seq_object->seq(); > >>> my $cds_seq = $feature->spliced_seq($db)->seq() if > >>> ($feature->primary_tag() eq 'CDS'); > >>> } > >>> } > >>> _______________________________________________ > >>> Bioperl-l mailing list > >>> Bioperl-l@portal.open-bio.org > >>> http://portal.open-bio.org/mailman/listinfo/bioperl-l > >>> > >>> > >> -- > >> ------------------------------------------------------------- > >> Hilmar Lapp email: lapp at gnf.org > >> GNF, San Diego, Ca. 
92121 phone: +1-858-812-1757 > >> ------------------------------------------------------------- > >> > >> > >> _______________________________________________ > >> Bioperl-l mailing list > >> Bioperl-l@portal.open-bio.org > >> http://portal.open-bio.org/mailman/listinfo/bioperl-l > >> > > > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l@portal.open-bio.org > > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > > From akholloway at ucdavis.edu Tue Dec 6 16:24:34 2005 From: akholloway at ucdavis.edu (Alisha Holloway) Date: Tue Dec 6 17:15:01 2005 Subject: [Bioperl-l] what am I doing wrong with $alignio->next_aln Message-ID: An HTML attachment was scrubbed... URL: http://portal.open-bio.org/pipermail/bioperl-l/attachments/20051206/57fef8b3/attachment.htm From jbmatthewgeisler at yahoo.ca Tue Dec 6 09:40:57 2005 From: jbmatthewgeisler at yahoo.ca (matt geisler) Date: Tue Dec 6 17:15:25 2005 Subject: [Bioperl-l] help with HOWTO script example Message-ID: <20051206144057.83846.qmail@web32911.mail.mud.yahoo.com> Hello, I am learning perl and bioperl (I am a mol biologist turning bioinformatics). I tried the example script in the beginners HOWTO tutorial and encountered errors. System Windows XP with Activeperl 5.8.7 Writing the following script in notepad #!C:\perl\bin\perl.exe -w use Bio::Seq; $seq_obj = Bio::Seq->new(-seq =>"aaaatttggggggggcccccgtt" -alphabet => 'dna'); print $seq_obj->desc(); error: arguement isn't numeric in subtraction line 3 If I remove the -alphabet =>'dna' it works. I assume that it thinks I am trying to do a subtraction with the '-' symbol. The example text in beginners HOWTO insists that multiple properties of the object ($seq_obj) are assigned with dashes. Is this true? If so, why is it thinking I am trying to subtract? --------------------------------- Find your next car at Yahoo! 
Canada Autos From jbmatthewgeisler at yahoo.ca Tue Dec 6 10:28:18 2005 From: jbmatthewgeisler at yahoo.ca (matt geisler) Date: Tue Dec 6 17:15:26 2005 Subject: [Bioperl-l] re: help with HOWTO script example Message-ID: <20051206152818.2049.qmail@web32904.mail.mud.yahoo.com> I found an error in the manual "beginners HOWTO' page 5 The second script is missing a comma between the sequence and -alphabet. This caused the error reported earlier by me. --------------------------------- Find your next car at Yahoo! Canada Autos From osborne1 at optonline.net Tue Dec 6 17:41:18 2005 From: osborne1 at optonline.net (Brian Osborne) Date: Tue Dec 6 17:44:10 2005 Subject: [Bioperl-l] re: help with HOWTO script example In-Reply-To: <20051206152818.2049.qmail@web32904.mail.mud.yahoo.com> Message-ID: Matt, Thanks for pointing that out, I'll correct it. Brian O. On 12/6/05 10:28 AM, "matt geisler" wrote: > I found an error in the manual "beginners HOWTO' > > page 5 > > The second script is missing a comma between the sequence and -alphabet. > This caused the error reported earlier by me. > > > > --------------------------------- > Find your next car at Yahoo! 
Canada Autos > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l From torsten.seemann at infotech.monash.edu.au Tue Dec 6 19:08:47 2005 From: torsten.seemann at infotech.monash.edu.au (Torsten Seemann) Date: Tue Dec 6 19:09:25 2005 Subject: [Bioperl-l] remoteblast doesn't save the Output File In-Reply-To: <439614FE.60104@gmx.at> References: <438F7DF1.5060000@gmx.at> <1133830593.20822.13.camel@chauvel.csse.monash.edu.au> <4394DC21.90401@gmx.at> <1133838353.20822.32.camel@chauvel.csse.monash.edu.au> <4395D85D.20501@gmx.at> <1133909153.25750.14.camel@chauvel.csse.monash.edu.au> <439614FE.60104@gmx.at> Message-ID: <1133914128.25750.24.camel@chauvel.csse.monash.edu.au> > >The error is clear: > >'9 1' fails because it does not match the regular expression > >/-?\d+(\.\d+)\s+i-?\d+(\.\d+)/ > >The $NUM='-?\d+(\.\d+)' parts are trying to match a floating point > >number or integer, eg. "3.41" or "6" or "-78.1", which simplifies to > >/$NUM\s+i$NUM/x which suggests it wants it in the form: > >'GAPCOSTS' = '9 i1' > >which looks unusual but would match the pattern? > Thanks, I have tried your Example: > 'GAPCOSTS' = '9 i1' ...... it wasn't working, but if you took float > instead of integer the gap existence will work but not the Gap extend, > like 'GAPCOSTS' = '9.0 i1.0' > It never minds, if I take 'GAPCOSTS' = '9.0 i1.0' or 'GAPCOSTS' = '9.0 > i-1.0' or instead of 1.0 any other float value, I always get the same > Message: > Gap existence and extension values of 10 and 0 not supported for BLOSUM80 > it always sets the extension Value to 0. If I was awake when I wrote this I would have realised that it was $NUM='-?\d+(\.\d+)' which means you MUST have the number with a decimal point and a zero after it. i.e. integers are NOT accepted. They would be accepted if it was $NUM='-?\d+(\.\d+)?' - the fact that they have bracketed it suggests they forgot to put the ? 
after it, other wise their isn't much point to bracketing it. So you are right that "9.0 i1.0" matches the expression. -- Torsten Seemann Victorian Bioinformatics Consortium From torsten.seemann at infotech.monash.edu.au Tue Dec 6 17:45:53 2005 From: torsten.seemann at infotech.monash.edu.au (Torsten Seemann) Date: Tue Dec 6 19:10:54 2005 Subject: [Bioperl-l] remoteblast doesn't save the Output File In-Reply-To: <4395D85D.20501@gmx.at> References: <438F7DF1.5060000@gmx.at> <1133830593.20822.13.camel@chauvel.csse.monash.edu.au> <4394DC21.90401@gmx.at> <1133838353.20822.32.camel@chauvel.csse.monash.edu.au> <4395D85D.20501@gmx.at> Message-ID: <1133909153.25750.14.camel@chauvel.csse.monash.edu.au> (Hubert - please send your emails to bioperl-l@bioperl.org so others can help too) > Sorry for the Misunderstanding, it was my fault. I have already read the > Module, and the Matrix is working well. But if I want to change the > Gapcost Parameter, I don't know > how to format, because if I do it like that 'GAPCOSTS' = '9 1'; I will > get the following Message: > > MSG: Value 9 1 for PUT parameter GAPCOSTS does not match expression > -?\d+(\.\d+)\s+i-?\d+(\.\d+). Rejecting. > > if I do it like 'GAPCOSTS' = '9'; I will get the same Message like above. > I have tried it with "" too, but there was no Impact on it, i got the > same Message: The error is clear: '9 1' fails because it does not match the regular expression /-?\d+(\.\d+)\s+i-?\d+(\.\d+)/ The $NUM='-?\d+(\.\d+)' parts are trying to match a floating point number or integer, eg. "3.41" or "6" or "-78.1", which simplifies to /$NUM\s+i$NUM/x which suggests it wants it in the form: 'GAPCOSTS' = '9 i1' which looks unusual but would match the pattern? The documentation says '9 1' should work? http://www.ncbi.nlm.nih.gov/BLAST/Doc/node28.html Perhaps a bug in NCBI's code, the 'i' might be a typo? 
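The pattern quoted in the error message can be tried on its own, outside RemoteBlast. The sketch below is only an illustration: the variable names, the helper matches(), and the relaxed variant with a trailing '?' are assumptions made for the comparison, not code taken from the module.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Number pattern as quoted in the error message: the fractional part
# (\.\d+) is mandatory, so plain integers cannot match.
my $num_strict = qr/-?\d+(\.\d+)/;

# Hypothetical relaxed variant: a trailing '?' makes the fraction optional.
my $num_relaxed = qr/-?\d+(\.\d+)?/;

# Returns 1 if $value has the shape "<number> i<number>", else 0.
sub matches {
    my ($num, $value) = @_;
    return $value =~ /$num\s+i$num/ ? 1 : 0;
}

for my $value ('9 1', '9 i1', '9.0 i1.0') {
    printf "%-10s strict=%d relaxed=%d\n", $value,
        matches($num_strict,  $value),
        matches($num_relaxed, $value);
}
```

With the strict pattern only '9.0 i1.0' is accepted. The relaxed variant would also accept '9 i1', while '9 1' fails in both cases because the literal 'i' is required.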
-- Torsten Seemann Victorian Bioinformatics Consortium From chen_li3 at yahoo.com Wed Dec 7 01:28:23 2005 From: chen_li3 at yahoo.com (chen li) Date: Wed Dec 7 01:32:37 2005 Subject: [Bioperl-l] Bioperl and Mysql In-Reply-To: <1f0b7c7937e41d7f285fcccabb612c2a@gnf.org> Message-ID: <20051207062823.52871.qmail@web36805.mail.mud.yahoo.com> Hi Hilmar, I download a small fasta-format sequence file from NCBI and populate the database. But I can't retrieve them within mysql as a root user. The biosql database is empty. What is going on? Thanks, Li [alex@cpe-65-189-147-4 biosql]$ perl load_seqdatabase.pl --host localhost --dbname biosql /home/alex/DB/RNA.fasta Loading /home/alex/DB/RNA.fasta ... A small part of my file: >gi|21237774|ref|NM_139124.1| Homo sapiens mitogen-activated protein kinase 8 interacting protein 2 (MAPK8IP2), transcript variant 3, mRNA CGCGGGGCGGACGCCGCAGGGCGTGTCACGAGGTGAGCGGGGCGGGCCGAGCGCCGGCGCGGGGCGCGGCGAGGCTCCCG ........................... __________________________________________ Yahoo! DSL ? Something to write home about. Just $16.99/mo. or less. dsl.yahoo.com From avilella at ub.edu Wed Dec 7 01:46:12 2005 From: avilella at ub.edu (Albert Vilella) Date: Wed Dec 7 04:03:04 2005 Subject: [Bioperl-l] what am I doing wrong with $alignio->next_aln In-Reply-To: References: Message-ID: <1133937972.9277.4.camel@localhost.localdomain> Quick guess, can you try changing: > my $alignio = new Bio::AlignIO('-format' => 'fasta', > -interleaved => 0, > -file => $tempfile); > to: > my $alignio = new Bio::AlignIO('-format' => 'fasta', > -file => $tempfile); > the "interleaved" bit is for phylip file format... not sure if that is really the problem here. if you don't get a proper $aln object after: my $aln = $alignio->next_aln; then you should double-check the place where you are getting the alignment files from. 
To be sure it is not a bioperl-run/PAML problem, you can run the test: cd /where/bioperl/and/bioperl-run/are/placed/ perl t/PAML.t should give you 18 ok's, Albert. El dt 06 de 12 del 2005 a les 13:24 -0800, en/na Alisha Holloway va escriure: > I'm trying to run PAML through BioPerl. I get this error message > > > -------------------- WARNING --------------------- > MSG: must have supplied a valid alignment file in order to run codeml > --------------------------------------------------- > > because I'm not getting a return value from "my $aln = > $alignio->next_aln". I just can't quite figure out why. Here's the > code bit. Any help would be greatly appreciated. I have a feeling > this has to do with the input file, but the format of the file is > fine. Is there something about the path that I'm missing? > > > > > if($sepfiles[$x] =~ /.fa$/){ > my $tempfile = './tempfiles/'.$sepfiles[$x]; > print "Datafile is - $allfiles[$a], $tempfile\n"; > ##my $tempfile= shift @ARGV; ## to load infile on command line > my $alignio = new Bio::AlignIO('-format' => 'fasta', > -interleaved => 0, > -file => $tempfile); > > my $aln = $alignio->next_aln; > my $codeml = new Bio::Tools::Run::Phylo::PAML::Codeml(); > #$codeml->no_param_checks(1); > $codeml->set_parameter('runmode',0); > $codeml->set_parameter('seqtype',1); > $codeml->set_parameter('CodonFreq',2); > $codeml->set_parameter('model', 1); > $codeml->set_parameter('NSsites', 0); > $codeml->set_parameter('fix_omega', 0); > $codeml->set_parameter('omega',0.4); > $codeml->set_parameter('fix_kappa', 0); > $codeml->set_parameter('kappa',2); > $codeml->set_parameter('cleandata',1); > > print $codeml->executable(), " is codeml\n"; > > $codeml->alignment($aln); ## it gets to here and tries to use > $aln and gives the warning > > > Thanks, > > > Alisha > > > > > -- > Alisha Holloway > > Postdoctoral Fellow > Section of Evolution & Ecology > 3347 Storer Hall > University of California > Davis, CA 95616 > > 530-754-9551 Office > 
512-297-3958 Cell > 530-752-1449 Fax > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l From andrej.kastrin at siol.net Wed Dec 7 07:42:32 2005 From: andrej.kastrin at siol.net (Andrej Kastrin) Date: Wed Dec 7 07:57:34 2005 Subject: [Bioperl-l] Extract fields from Medline Message-ID: <4396D8B8.3040706@siol.net> Hello all, big problem for me, small for you (while I'm noob in perl). I have a list of terms (i.e. genes, gene products) in row data format. Now I have to parse Medline (standard Medline format) and extract PMID, TI and AB (ID number, Title and Abstract) fields which involve any term in my term list. I already transform Medline "multiline" format to "single" line, so there is only one line per each field. How to start? Thanks for any suggesstion. Best, Andrej From andrej.kastrin at siol.net Wed Dec 7 07:40:05 2005 From: andrej.kastrin at siol.net (Andrej Kastrin) Date: Wed Dec 7 07:57:36 2005 Subject: [Bioperl-l] Extract field from Medline Message-ID: <4396D825.5050901@siol.net> Hello all, big problem for me, small for you (while I'm noob in perl). I have a list of terms (i.e. genes, gene products) in row data format. Now I have to parse Medline (standard Medline format) and extract PMID, TI and AB (ID number, Title and Abstract) fields which involve any term in my term list. I already transform Medline "multiline" format to "single" line, so there is only one line per each field. How to start? Thanks for any suggesstion. Best, Andrej From amackey at pcbi.upenn.edu Wed Dec 7 08:13:05 2005 From: amackey at pcbi.upenn.edu (Aaron J. 
Mackey) Date: Wed Dec 7 08:50:09 2005 Subject: [Bioperl-l] Kyte Doolittle hydropathy plot: Bio::Graphics::Glyph::protein Message-ID: <580CCD33-D413-4038-9C62-B57B236C71B1@pcbi.upenn.edu> I've just checked in to BioPerl the above mentioned glyph, which (like Glyph::dna) is a "global feature" glyph that at high magnification shows raw protein sequence, but at lower magnification provides the standard Kyte-Doolittle hydropathy plot (adding an option to use Hopp-Woods scale is left as an exercise to the reader). You can see an example of using this glyph at: http://v5-0.plasmodb.org/cgi-bin/gbrowse_img/plasmodbaa/?q=PF10_0238 Enjoy, -Aaron -- Aaron J. Mackey, Ph.D. Project Manager, ApiDB Bioinformatics Resource Center Penn Genomics Institute, University of Pennsylvania email: amackey@pcbi.upenn.edu office: 215-898-1205 (Biology, 212 Goddard Labs) 215-746-7018 (PCBI, 1428 Blockley Hall) fax: 215-746-6697 (Penn Genomics Institute) postal: Penn Genomics Institute Goddard Labs 212 415 S. University Avenue Philadelphia, PA 19104-6017 From osborne1 at optonline.net Wed Dec 7 09:06:09 2005 From: osborne1 at optonline.net (Brian Osborne) Date: Wed Dec 7 09:08:42 2005 Subject: [Bioperl-l] Extract fields from Medline In-Reply-To: <4396D8B8.3040706@siol.net> Message-ID: Andrej, For a start take a look at the scripts in examples/biblio, these scripts show how one can access an OpenBQS service ("soap") or PubMed ("eutils") using Bio::Biblio. Brian O. On 12/7/05 7:42 AM, "Andrej Kastrin" wrote: > Hello all, > > big problem for me, small for you (while I'm noob in perl). I have a > list of terms (i.e. genes, gene products) in row data format. Now I have > to parse Medline (standard Medline format) and extract PMID, TI and AB > (ID number, Title and Abstract) fields which involve any term in my term > list. I already transform Medline "multiline" format to "single" line, > so there is only one line per each field. > > How to start? Thanks for any suggesstion. 
> Best, Andrej > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l From bmoore at genetics.utah.edu Wed Dec 7 09:13:15 2005 From: bmoore at genetics.utah.edu (Barry Moore) Date: Wed Dec 7 09:10:10 2005 Subject: [Bioperl-l] Extract field from Medline Message-ID: Andrej- Doesn't really sound like you need Bioperl for this one - just some loops and regular expressions. Can't offer too much help without seeing your file formats, but a boiler plate might look like this: #!/usr/bin/perl use strict; use warnings; my $file_terms = shift; my $file_medline = shift; open (TERM, $file_term) or die "Can't open TERM"; open (MEDL, $file_medline) or die "Can't open MEDL"; my @terms = ; while (my ($pmid, $ti, $ab) = split ) { for my $term (@terms) { if (/$term/ for ($pmid, $ti, $ab)) { print "$pmid\t$ti\t$ab"; } } } -----Original Message----- From: bioperl-l-bounces@portal.open-bio.org [mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Andrej Kastrin Sent: Wednesday, December 07, 2005 5:40 AM To: bioperl-l@portal.open-bio.org Subject: [Bioperl-l] Extract field from Medline Hello all, big problem for me, small for you (while I'm noob in perl). I have a list of terms (i.e. genes, gene products) in row data format. Now I have to parse Medline (standard Medline format) and extract PMID, TI and AB (ID number, Title and Abstract) fields which involve any term in my term list. I already transform Medline "multiline" format to "single" line, so there is only one line per each field. How to start? Thanks for any suggesstion. 
Best, Andrej _______________________________________________ Bioperl-l mailing list Bioperl-l@portal.open-bio.org http://portal.open-bio.org/mailman/listinfo/bioperl-l From Richard.Adams at ed.ac.uk Wed Dec 7 09:01:49 2005 From: Richard.Adams at ed.ac.uk (Richard Adams) Date: Wed Dec 7 09:31:28 2005 Subject: [Bioperl-l] need info Message-ID: <4396EB4D.30109@ed.ac.uk> Hi Heikki, Regarding looking for untested modules/tests, how about parsing the *.t files, parsing out the lines such as ok $myvar->method1 and getting the class of the variable and making a hash of which public methods in each class are tested. In this way the class of IO type instances will be the correct subclass and you can get this info independent of parsing the 'use module' statements If a method is not in that class (ie is in a superclass) then the inheritance hierarchy can be searched until the method is found and that method in the superclass ticked as 'tested' If you like I could work up some pseudocode for it for any comments, Richard -- Dr Richard Adams Psychiatric Genetics Group, Medical Genetics, Molecular Medicine Centre, Western General Hospital, Crewe Rd West, Edinburgh UK EH4 2XU Tel: 44 131 651 1084 richard.adams@ed.ac.uk From andrej.kastrin at siol.net Wed Dec 7 09:57:26 2005 From: andrej.kastrin at siol.net (Andrej Kastrin) Date: Wed Dec 7 09:54:39 2005 Subject: [Bioperl-l] Extract field from Medline In-Reply-To: References: Message-ID: <4396F856.3020303@siol.net> Barry Moore wrote: >Andrej- > >Doesn't really sound like you need Bioperl for this one - just some >loops and regular expressions. 
Can't offer too much help without seeing
>your file formats, but a boiler plate might look like this:
>
>#!/usr/bin/perl
>
>use strict;
>use warnings;
>
>my $file_terms = shift;
>my $file_medline = shift;
>open (TERM, $file_term) or die "Can't open TERM";
>open (MEDL, $file_medline) or die "Can't open MEDL";
>
>my @terms = <TERM>;
>
>while (my ($pmid, $ti, $ab) = split <MEDL>) {
>    for my $term (@terms) {
>        if (/$term/ for ($pmid, $ti, $ab)) {
>            print "$pmid\t$ti\t$ab";
>        }
>    }
>}
>
>-----Original Message-----
>From: bioperl-l-bounces@portal.open-bio.org
>[mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Andrej
>Kastrin
>Sent: Wednesday, December 07, 2005 5:40 AM
>To: bioperl-l@portal.open-bio.org
>Subject: [Bioperl-l] Extract field from Medline
>
>Hello all,
>
>big problem for me, small for you (while I'm noob in perl). I have a
>list of terms (i.e. genes, gene products) in row data format. Now I have
>
>to parse Medline (standard Medline format) and extract PMID, TI and AB
>(ID number, Title and Abstract) fields which involve any term in my term
>
>list. I already transform Medline "multiline" format to "single" line,
>so there is only one line per each field.
>
>How to start? Thanks for any suggesstion.
>Best, Andrej
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l@portal.open-bio.org
>http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
>
>
Hi,
I try this but something wrong, due to compilation problem (lines 15
and 19); and also, how to include input files?
From hubert.prielinger at gmx.at Tue Dec 6 17:47:18 2005 From: hubert.prielinger at gmx.at (Hubert Prielinger) Date: Wed Dec 7 13:07:29 2005 Subject: [Bioperl-l] the Parameter GAPCOSTS for the remoteblast Modul doesn't work Message-ID: <439614F6.3080500@gmx.at> Hi, I have tried to set the GAPCOSTS Parameter: 'GAPCOSTS' = '9 i1' ...... it wasn't working, but if I took float instead of integer the gap existence will work but not the Gap extend, like 'GAPCOSTS' = '9.0 i1.0' It never minds, if I take 'GAPCOSTS' = '9.0 i1.0' or 'GAPCOSTS' = '9.0 i-1.0' or instead of 1.0 any other float value, I always get the same Message: Gap existence and extension values of 10 and 0 not supported for BLOSUM80 (for example) it always sets the extension Value to 0. regards Hubert From hlapp at gnf.org Wed Dec 7 12:37:16 2005 From: hlapp at gnf.org (Hilmar Lapp) Date: Wed Dec 7 15:42:55 2005 Subject: [Bioperl-l] Bioperl and Mysql In-Reply-To: <20051207062823.52871.qmail@web36805.mail.mud.yahoo.com> References: <20051207062823.52871.qmail@web36805.mail.mud.yahoo.com> Message-ID: <428b7198c15ed6fb961cdba547db83ba@gnf.org> You probably got a lot of errors when you uploaded the file didn't you? Always post error messages. The problem with fasta files is that the Bioperl SeqIO parser (for legitimate reasons) doesn't parse out an accession number, which, however, is part of the unique key constraint. So you probably you do have one sequence in the database; the only way you can get an empty database after running a non-empty file is by failure to connect, failure to load (find) the bioperl/bioperl-db modules, or by providing --testonly to the load_seqdatabase.pl script. Only the latter will be silent. See http://bioperl.org/pipermail/bioperl-l/2005-August/019579.html for what you need to do in order to load fasta-formatted files into biosql. If none of this can be the problem, run the bioperl-db tests and report the outcome. 
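To make the accession point concrete: the identifier is present in the FASTA header Li posted, it is just not split out into an accession field. A minimal sketch in plain Perl, assuming NCBI-style headers of the form shown in the original post; parse_ncbi_defline is a hypothetical helper written for this illustration and is not part of Bioperl or of load_seqdatabase.pl.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper: split an NCBI-style FASTA header of the form
#   >gi|<gi number>|<db>|<accession>.<version>| description
# Returns an empty list when the line does not follow that layout.
sub parse_ncbi_defline {
    my ($line) = @_;
    if ($line =~ /^>gi\|(\d+)\|(\w+)\|([^|.\s]+)(?:\.(\d+))?\|\s*(.*)$/) {
        return (
            gi        => $1,
            db        => $2,
            accession => $3,
            version   => $4,   # undef if the header carries no version
            desc      => $5,
        );
    }
    return;
}

# Header taken from the message above.
my $defline = '>gi|21237774|ref|NM_139124.1| Homo sapiens mitogen-activated '
            . 'protein kinase 8 interacting protein 2 (MAPK8IP2), '
            . 'transcript variant 3, mRNA';

my %id = parse_ncbi_defline($defline);
print "accession=$id{accession} version=$id{version} gi=$id{gi}\n";
```

This only shows where the accession sits in the header; how to hand it to the loader is covered by the post linked above.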
-hilmar On Dec 6, 2005, at 10:28 PM, chen li wrote: > Hi Hilmar, > > I download a small fasta-format sequence file from > NCBI and populate the database. But I can't retrieve > them within mysql as a root user. The biosql database > is empty. What is going on? > > Thanks, > > Li > > > > [alex@cpe-65-189-147-4 biosql]$ perl > load_seqdatabase.pl --host localhost --dbname biosql > /home/alex/DB/RNA.fasta > Loading /home/alex/DB/RNA.fasta ... > > A small part of my file: >> gi|21237774|ref|NM_139124.1| Homo sapiens > mitogen-activated protein kinase 8 interacting protein > 2 (MAPK8IP2), transcript variant 3, mRNA > CGCGGGGCGGACGCCGCAGGGCGTGTCACGAGGTGAGCGGGGCGGGCCGAGCGCCGGCGCGGGGCGCGGCG > AGGCTCCCG > ........................... > > > > > > > > __________________________________________ > Yahoo! DSL ? Something to write home about. > Just $16.99/mo. or less. > dsl.yahoo.com > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 ------------------------------------------------------------- From bmoore at genetics.utah.edu Wed Dec 7 17:44:07 2005 From: bmoore at genetics.utah.edu (Barry Moore) Date: Wed Dec 7 17:40:42 2005 Subject: [Bioperl-l] Extract field from Medline Message-ID: Andrej- I didn't run that script, it was just meant to give you a start. I've cleaned it up and it works for me - without knowing your file formats that's about all I can do to help you. You'll need to debug it and adapt it to your needs. If you are very new to perl and unfamiliar with debugging scripts, then you will certainly want to consult one of the many fine texts on the subject such as "Learning Perl" from O'Reilly. You can use the script below as a starting point. 
Barry

#!/usr/bin/perl

use strict;
use warnings;

my $file_terms = shift;
my $file_medline = shift;
open (TERM, $file_terms) or die "Can't open TERM";
open (MEDL, $file_medline) or die "Can't open MEDL";

my @terms = <TERM>;
my @lines = <MEDL>;

for my $line (@lines) {
    my ($pmid, $ti, $ab) = split /\t/, $line;
    for my $term (@terms) {
        chomp $term;
        for ($pmid, $ti, $ab) {
            if (/$term/) {
                print "$pmid\t$ti\t$ab";
            }
        }
    }
}

> -----Original Message-----
> From: Andrej Kastrin [mailto:andrej.kastrin@siol.net]
> Sent: Wednesday, December 07, 2005 7:57 AM
> To: Barry Moore
> Cc: bioperl-l@portal.open-bio.org
> Subject: Re: [Bioperl-l] Extract field from Medline
>
> Barry Moore wrote:
>
> >Andrej-
> >
> >Doesn't really sound like you need Bioperl for this one - just some
> >loops and regular expressions. Can't offer too much help without seeing
> >your file formats, but a boiler plate might look like this:
> >
> >#!/usr/bin/perl
> >
> >use strict;
> >use warnings;
> >
> >my $file_terms = shift;
> >my $file_medline = shift;
> >open (TERM, $file_term) or die "Can't open TERM";
> >open (MEDL, $file_medline) or die "Can't open MEDL";
> >
> >my @terms = <TERM>;
> >
> >while (my ($pmid, $ti, $ab) = split <MEDL>) {
> >    for my $term (@terms) {
> >        if (/$term/ for ($pmid, $ti, $ab)) {
> >            print "$pmid\t$ti\t$ab";
> >        }
> >    }
> >}
> >
> >-----Original Message-----
> >From: bioperl-l-bounces@portal.open-bio.org
> >[mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Andrej
> >Kastrin
> >Sent: Wednesday, December 07, 2005 5:40 AM
> >To: bioperl-l@portal.open-bio.org
> >Subject: [Bioperl-l] Extract field from Medline
> >
> >Hello all,
> >
> >big problem for me, small for you (while I'm noob in perl). I have a
> >list of terms (i.e. genes, gene products) in row data format. Now I have
> >
> >to parse Medline (standard Medline format) and extract PMID, TI and AB
> >(ID number, Title and Abstract) fields which involve any term in my term
> >
> >list.
I already transform Medline "multiline" format to "single" line, > >so there is only one line per each field. > > > >How to start? Thanks for any suggesstion. > >Best, Andrej > > > >_______________________________________________ > >Bioperl-l mailing list > >Bioperl-l@portal.open-bio.org > >http://portal.open-bio.org/mailman/listinfo/bioperl-l > > > >_______________________________________________ > >Bioperl-l mailing list > >Bioperl-l@portal.open-bio.org > >http://portal.open-bio.org/mailman/listinfo/bioperl-l > > > > > > > > > Hi, > I try this but something wrong, due to compilation problem (lines 15 > and 19); and also, how to include input files? From angshu96 at gmail.com Wed Dec 7 21:12:36 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Wed Dec 7 21:38:48 2005 Subject: [Bioperl-l] loading data to biosql tables Message-ID: Hi, I've created 5 tables (taxon, taxon name, bioentry, biosequence, biodatabase) in my postgresql database (linux box) using the biosql schema ddl from http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/sql/biosqldb-pg.sql?rev=1.29&cvsroot=biosql&content-type=text/vnd.viewcvs-markup . Now I want to load the tables with arabidopsis data. Could you please let me know where can I find such scripts for pgsql? And also I find at http://bio.perl.org/Core/Latest/index.shtml that the DB module has not been updated since 2001. Do I need to install that? Or are there some new releases? I'll be obliged if you can guide. Thanks, Angshu From hlapp at gmx.net Thu Dec 8 11:55:21 2005 From: hlapp at gmx.net (Hilmar Lapp) Date: Thu Dec 8 11:59:34 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: References: Message-ID: <3e86c394e54677a898eab8de11d8e002@gmx.net> Any reason you didn't instantiate the rest of the schema? Any scripts and software that have been written against BioSQL will certainly expect the rest of the schema be present ... 
Bioperl-db is the BioSQL language binding for Bioperl, so that's what you will want to use. It comes with a script load_seqdatabase.pl to load any format supported by Bioperl. However, bioperl-db does expect all of Biosql to be present ... -hilmar On Dec 7, 2005, at 6:12 PM, Angshu Kar wrote: > Hi, > > I've created 5 tables (taxon, taxon name, bioentry, biosequence, > biodatabase) in my postgresql database (linux box) using the biosql > schema > ddl from > http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/sql/ > biosqldb-pg.sql?rev=1.29&cvsroot=biosql&content-type=text/vnd.viewcvs- > markup > . > Now I want to load the tables with arabidopsis data. Could you please > let me > know where can I find such scripts for pgsql? And also I find at > http://bio.perl.org/Core/Latest/index.shtml that the DB module has not > been > updated since 2001. Do I need to install that? Or are there some new > releases? > > I'll be obliged if you can guide. > > Thanks, > Angshu > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 ------------------------------------------------------------- From kevin.mcmahon at ttuhsc.edu Thu Dec 8 12:36:57 2005 From: kevin.mcmahon at ttuhsc.edu (kevin.mcmahon@ttuhsc.edu) Date: Thu Dec 8 12:49:30 2005 Subject: [Bioperl-l] PrimarySeq object question Message-ID: <8CA078F2E9F09043B48141DD6195E99D08A752C0@marfa.ttuhsc.edu> Everyone, I'm new to this, so please bear with me. I'm having some trouble with a scf to fasta converting program I'm writing. my $in = Bio::SeqIO->new(-file => $infile , '-format' => 'scf', -alphabet => 'dna'); my $seq = $in->next_seq(); print "My sequence is: " . $seq->seq() . "\n"; Above is the code in discussion. 
The $in object contains information from a file ($infile) in scf format. Here's my problem. When we get to $in->next_seq(), if the file is empty, the program dies and returns: "MSG: If you want me to create a PrimarySeq object for your empty sequence you must specify a -alphabet to satisfy the constructor requirements for a Bio::PrimarySeq object with no sequence. Read the POD for it, luke." I guess what I need to know is: if this $in->next_seq() doesn't work, how can I test for this before I get this reply. Thanks in advance, Wyatt From angshu96 at gmail.com Thu Dec 8 13:09:15 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Thu Dec 8 13:06:46 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: <3e86c394e54677a898eab8de11d8e002@gmx.net> References: <3e86c394e54677a898eab8de11d8e002@gmx.net> Message-ID: Thank you Hilmar. Actually we want only those tables for our tests! Also at first, do I need to install the Bioperl-db module ( But is it the one that was updated 4 years back or are there any new releases)? And then run the script suggested by you in the box? Can't we just edit the script and keep those parts that correspond to only the tables that I've created? Thanks, Angshu On 12/8/05, Hilmar Lapp wrote: > > Any reason you didn't instantiate the rest of the schema? Any scripts > and software that have been written against BioSQL will certainly > expect the rest of the schema be present ... > > Bioperl-db is the BioSQL language binding for Bioperl, so that's what > you will want to use. It comes with a script load_seqdatabase.pl to > load any format supported by Bioperl. > > However, bioperl-db does expect all of Biosql to be present ... 
>
> -hilmar
>
> On Dec 7, 2005, at 6:12 PM, Angshu Kar wrote:
>
> > Hi,
> >
> > I've created 5 tables (taxon, taxon name, bioentry, biosequence,
> > biodatabase) in my postgresql database (linux box) using the biosql
> > schema ddl from
> > http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/sql/biosqldb-pg.sql?rev=1.29&cvsroot=biosql&content-type=text/vnd.viewcvs-markup
> > .
> > Now I want to load the tables with arabidopsis data. Could you please
> > let me know where can I find such scripts for pgsql? And also I find at
> > http://bio.perl.org/Core/Latest/index.shtml that the DB module has not
> > been updated since 2001. Do I need to install that? Or are there some
> > new releases?
> >
> > I'll be obliged if you can guide.
> >
> > Thanks,
> > Angshu
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
>
> --
> -------------------------------------------------------------
> Hilmar Lapp                            email: lapp at gnf.org
> GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
> -------------------------------------------------------------
>

From saldroubi at yahoo.com Thu Dec 8 14:51:09 2005
From: saldroubi at yahoo.com (Sam Al-Droubi)
Date: Thu Dec 8 14:48:39 2005
Subject: [Bioperl-l] PrimarySeq object question
In-Reply-To: <8CA078F2E9F09043B48141DD6195E99D08A752C0@marfa.ttuhsc.edu>
Message-ID: <20051208195109.18775.qmail@web34305.mail.mud.yahoo.com>

Kevin,

I am fairly new to bioperl as well, but I have written a program that reads fasta format files. I gave my program an empty file and it worked. The difference I see is that you are reading a different format. I am wondering if you found a bug. Did you try a different format, like fasta? If you did find a bug, maybe you can call "ls" from Perl and check the file size before you try to open it. I am assuming you are running this on UNIX/Linux.
Snippet of my code is below $seqio_obj = Bio::SeqIO->new(-file => $fname,-format => "fasta" ); while ($seq_obj = $seqio_obj->next_seq){ .... } kevin.mcmahon@ttuhsc.edu wrote: Everyone, I'm new to this, so please bear with me. I'm having some trouble with a scf to fasta converting program I'm writing. my $in = Bio::SeqIO->new(-file => $infile , '-format' => 'scf', -alphabet => 'dna'); my $seq = $in->next_seq(); print "My sequence is: " . $seq->seq() . "\n"; Above is the code in discussion. The $in object contains information from a file ($infile) in scf format. Here's my problem. When we get to $in->next_seq(), if the file is empty, the program dies and returns: "MSG: If you want me to create a PrimarySeq object for your empty sequence you must specify a -alphabet to satisfy the constructor requirements for a Bio::PrimarySeq object with no sequence. Read the POD for it, luke." I guess what I need to know is: if this $in->next_seq() doesn't work, how can I test for this before I get this reply. Thanks in advance, Wyatt _______________________________________________ Bioperl-l mailing list Bioperl-l@portal.open-bio.org http://portal.open-bio.org/mailman/listinfo/bioperl-l Sincerely, Sam Al-Droubi, M.S. saldroubi@yahoo.com From angshu96 at gmail.com Thu Dec 8 18:55:00 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Thu Dec 8 19:00:28 2005 Subject: [Bioperl-l] parsing a BLAST output In-Reply-To: <82B9FC7C-BE37-4961-85BB-6014CEADE8E7@duke.edu> References: <82B9FC7C-BE37-4961-85BB-6014CEADE8E7@duke.edu> Message-ID: Thanks Jason...So, if I get you right, if I use the -postsw option I can use any of the $fracqaligned = $hsp->query->length / $result->query_length; $frachaligned = $hsp->hit->length / $hit->length; formulae and merge the result for all HSPs to get the % overlap.? I mean then do I have to worry about repeated domain or multiple high scoring suboptimal alignments ? Appreciate your guidance. 
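
A minimal sketch of the merging step asked about here, in plain Perl with no BioPerl dependency. The helper name frac_covered and the example coordinates are hypothetical; with BioPerl the ranges would presumably be collected from $hsp->start('query') and $hsp->end('query') for each HSP of a hit. The point is only that overlapping HSP regions are counted once, which a simple sum of per-HSP lengths gets wrong. It makes no attempt to resolve repeated domains or to pick a consistent path through suboptimal alignments.

```perl
use strict;
use warnings;

# Fraction of a sequence covered by a set of HSP ranges, counting
# overlapping regions only once. Each range is [start, end],
# 1-based and inclusive. (Hypothetical helper, not a BioPerl method.)
sub frac_covered {
    my ($seq_length, @ranges) = @_;
    my ($covered, $cur_start, $cur_end) = (0);
    for my $r (sort { $a->[0] <=> $b->[0] } @ranges) {
        my ($start, $end) = @$r;
        if (defined $cur_end && $start <= $cur_end + 1) {
            # overlaps or abuts the current block: extend it
            $cur_end = $end if $end > $cur_end;
        } else {
            # gap before this range: close the current block, start a new one
            $covered += $cur_end - $cur_start + 1 if defined $cur_end;
            ($cur_start, $cur_end) = ($start, $end);
        }
    }
    $covered += $cur_end - $cur_start + 1 if defined $cur_end;
    return $covered / $seq_length;
}

# Two overlapping HSPs (1-50, 40-100) and one separate HSP (151-200)
# on a 200-residue query: 100 + 50 residues covered.
printf "%.2f\n", frac_covered(200, [1, 50], [40, 100], [151, 200]);   # 0.75
```

Summing the three HSP lengths instead would give 50 + 61 + 50 = 161 residues, i.e. 0.805, because residues 40-50 are counted twice.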
Thanks, Angshu On 12/4/05, Jason Stajich wrote: > > frac_identical gives the fraction of bases in the HSP that are > identical. > > overlap would be calculated from (length of the query aligned) / > (length of query) or (length of hit aligned) / (length of hit). > > So for an HSP you can calculate this > $fracqaligned = $hsp->query->length / $result->query_length; > $frachaligned = $hsp->hit->length / $hit->length; > > But remember there may be multiple HSPs so you may need to merge the > HSP information to get the total length aligned. If there is a > repeated domain or multiple high scoring suboptimal alignments this > can cause things to get a little tricky. > > There are two methods provided in the HitI interface called > frac_aligned_query() and frac_aligned_hit() do try and take all of > this into account for you, but I admit this is less well tested > code. But do give them a try: > $fracqaligned = $hit->frac_aligned_query(); > $frachaligned = $hit->frac_aligned_hit(); > > > If you are using WU-BLASTP add the -postsw option to get a refined > alignment which will merge HSPs where appropriate so you should use > that. > > You can also use the -links option to get WU-BLAST to get the logical > ordering and a consistent path through the HSPs. > > > On Dec 4, 2005, at 8:32 PM, Angshu Kar wrote: > > > Hi, > > > > To begin with, I'm new to Bioperl. > > Now, I've written the following simple piece of code to parse a WU- > > Blast > > output which filters data *for a given e-value and >50% overlap*. 
> > > > I'm writing the main algorithm here: > > > > my $blast_report = $ARG[1]; > > my $threshold_evalue = $ARG[2]; > > > > my $in = new Bio::SearchIO(-format => 'blast', -file => > > $blast_report); > > > > while (my $result = $in -> next_result) > > { > > while(my $hit = $result->next_hit) > > { > > if(($line{$hit->name} == $line{$result->query_accession})) > > { > > next; > > } > > if($hit->hsp->evalue <= $threshold_evalue) > > { > > if($hit->hsp->frac_indentical>=0.5) > > { > > print $line{$result->query_accession} . "\t" . > > $line{$hit->name} . "\t" . $hit->hsp-evalue . "\n"; > > } > > } > > } > > } > > > > My questions are: > > > > 1. does the frac_identical gives the measure of % overlap? Or, are > > there any > > other methods? > > 2. now, i don't have any blast data sets to test my code upon.could > > any of > > the experienced users let me know whether the algorithm is fine?any > > tip-offs on any point (from optimization to syntactical errors) are > > heartily > > welcome. > > 3. could any one please let me know if i can find sample wu-blast > > outputs to > > test my script upon? > > http://fungal.genome.duke.edu/~jes12/BGT203.2005/sample_reports/ > > Also checkout the biodata repository from bioperl and look in the > DB_Searching directory, we had started a project cataloging example > reports in all the different formats. This sort of fizzled out, but > could still use some volunteers to better organize things and > incorporate more examples. > > http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biodata/ > DB_Searching/ > > > > Appreciate your guidance. 
> > > > Thanks, > > Angshu > > > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l@portal.open-bio.org > > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > -- > Jason Stajich > Duke University > http://www.duke.edu/~jes12 > > > From angshu96 at gmail.com Thu Dec 8 18:59:12 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Thu Dec 8 20:01:28 2005 Subject: [Bioperl-l] parsing a BLAST output In-Reply-To: References: Message-ID: Thanks for the url. I'll go through it and let you know if I face any problem. And have you had any code pieces using those functions to calculate % overlap? It will be great if you can provide them to me. Thank you so much, Angshu On 12/4/05, Barry Moore wrote: > > Angshu- > > 1) No. From the docs (online at > http://doc.bioperl.org/releases/bioperl-1.4/Bio/Search/HSP/BlastHSP.html > ): > > Different versions of Blast report different values for the total length > of the alignment. This is the number reported in the denominators in the > stats section: "Identical = 34/120 Positives = 67/120". NCBI-BLAST uses > the total length of the alignment (with gaps) WU-BLAST uses the length > of the query sequence (without gaps). Therefore, when called without an > argument or an argument of 'total', this method will report different > values depending on the version of BLAST used. > > To get the fraction identical among only the aligned residues, ignoring > the gaps, call this method with an argument of 'query' or 'sbjct' > ('sbjct' is synonymous with 'hit'). > > 2) If I understand your question correctly I think you are looking for > frac_aligned_hit and/or frac_aligned_query called on you hit object. > See > (http://doc.bioperl.org/releases/bioperl-1.4/Bio/Search/Hit/GenericHit.h > tml) for discussion. > > 3) Try the files in the bioperl test/data directory for lots of program > output samples. 
For wu-blast have a look at: > > bioperl-live/t/data/brassica_ATH.WUBLASTN > > which can be found on the web at: > > http://cvs.bioperl.org/cgi-bin/viewcvs/viewcvs.cgi/*checkout*/bioperl-li > ve/t/data/brassica_ATH.WUBLASTN?rev=HEAD&cvsroot=bioperl&content-type=te > xt/plain. > > Barry > > -----Original Message----- > From: bioperl-l-bounces@portal.open-bio.org > [mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Angshu Kar > Sent: Sunday, December 04, 2005 6:32 PM > To: bioperl-l@bioperl.org > Subject: [Bioperl-l] parsing a BLAST output > > Hi, > > To begin with, I'm new to Bioperl. > Now, I've written the following simple piece of code to parse a WU-Blast > output which filters data *for a given e-value and >50% overlap*. > > I'm writing the main algorithm here: > > my $blast_report = $ARG[1]; > my $threshold_evalue = $ARG[2]; > > my $in = new Bio::SearchIO(-format => 'blast', -file => $blast_report); > > while (my $result = $in -> next_result) > { > while(my $hit = $result->next_hit) > { > if(($line{$hit->name} == $line{$result->query_accession})) > { > next; > } > if($hit->hsp->evalue <= $threshold_evalue) > { > if($hit->hsp->frac_indentical>=0.5) > { > print $line{$result->query_accession} . "\t" . > $line{$hit->name} . "\t" . $hit->hsp-evalue . "\n"; > } > } > } > } > > My questions are: > > 1. does the frac_identical gives the measure of % overlap? Or, are there > any > other methods? > 2. now, i don't have any blast data sets to test my code upon.could any > of > the experienced users let me know whether the algorithm is fine?any > tip-offs on any point (from optimization to syntactical errors) are > heartily > welcome. > 3. could any one please let me know if i can find sample wu-blast > outputs to > test my script upon? > > Appreciate your guidance. 
> > Thanks, > Angshu > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > From angshu96 at gmail.com Thu Dec 8 18:55:32 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Thu Dec 8 20:35:51 2005 Subject: [Bioperl-l] parsing a BLAST output In-Reply-To: References: Message-ID: Thanks Brian. On 12/5/05, Brian Osborne wrote: > > Angshu, > > It looks like there's WU Blast output used in various tests: > > t/data/dnaEbsub_ecoli.wublastx > t/data/ecolitst.wublastp > t/data/ecolitst.noseqs.wublastp > > > Brian O. > > > On 12/4/05 8:32 PM, "Angshu Kar" wrote: > > > Hi, > > > > To begin with, I'm new to Bioperl. > > Now, I've written the following simple piece of code to parse a WU-Blast > > output which filters data *for a given e-value and >50% overlap*. > > > > I'm writing the main algorithm here: > > > > my $blast_report = $ARG[1]; > > my $threshold_evalue = $ARG[2]; > > > > my $in = new Bio::SearchIO(-format => 'blast', -file => $blast_report); > > > > while (my $result = $in -> next_result) > > { > > while(my $hit = $result->next_hit) > > { > > if(($line{$hit->name} == $line{$result->query_accession})) > > { > > next; > > } > > if($hit->hsp->evalue <= $threshold_evalue) > > { > > if($hit->hsp->frac_indentical>=0.5) > > { > > print $line{$result->query_accession} . "\t" . > > $line{$hit->name} . "\t" . $hit->hsp-evalue . "\n"; > > } > > } > > } > > } > > > > My questions are: > > > > 1. does the frac_identical gives the measure of % overlap? Or, are there > any > > other methods? > > 2. now, i don't have any blast data sets to test my code upon.could any > of > > the experienced users let me know whether the algorithm is fine?any > > tip-offs on any point (from optimization to syntactical errors) are > heartily > > welcome. > > 3. could any one please let me know if i can find sample wu-blast > outputs to > > test my script upon? 
> > > > Appreciate your guidance. > > > > Thanks, > > Angshu > > > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l@portal.open-bio.org > > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > > From angshu96 at gmail.com Thu Dec 8 21:00:35 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Thu Dec 8 21:04:09 2005 Subject: [Bioperl-l] Urgent: DB module installation Message-ID: Hi, Could anyone please let me know how to install Bioperl-db in WindowsXP as well as a linux machine? Thanks, Angshu From bmoore at genetics.utah.edu Thu Dec 8 23:02:55 2005 From: bmoore at genetics.utah.edu (Barry Moore) Date: Thu Dec 8 23:06:44 2005 Subject: [Bioperl-l] Urgent: DB module installation Message-ID: Angshu- The package you downloaded should have come with a file named INSTALL. Did you have specific problems following the instructions in that file? Barry -----Original Message----- From: bioperl-l-bounces@portal.open-bio.org [mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Angshu Kar Sent: Thursday, December 08, 2005 7:01 PM To: bioperl-l Subject: [Bioperl-l] Urgent: DB module installation Hi, Could anyone please let me know how to install Bioperl-db in WindowsXP as well as a linux machine? Thanks, Angshu _______________________________________________ Bioperl-l mailing list Bioperl-l@portal.open-bio.org http://portal.open-bio.org/mailman/listinfo/bioperl-l From bmoore at genetics.utah.edu Thu Dec 8 22:58:07 2005 From: bmoore at genetics.utah.edu (Barry Moore) Date: Thu Dec 8 23:06:46 2005 Subject: [Bioperl-l] parsing a BLAST output Message-ID: Angshu- I have not used those functions in any of my code, you might grep through some of the test scripts to look for examples. Barry -----Original Message----- From: Angshu Kar [mailto:angshu96@gmail.com] Sent: Thursday, December 08, 2005 4:59 PM To: Barry Moore Cc: bioperl-l@bioperl.org Subject: Re: [Bioperl-l] parsing a BLAST output Thanks for the url. 
I'll go through it and let you know if I face any problem. And have you had any code pieces using those functions to calculate % overlap? It will be great if you can provide them to me. Thank you so much, Angshu On 12/4/05, Barry Moore wrote: Angshu- 1) No. From the docs (online at http://doc.bioperl.org/releases/bioperl-1.4/Bio/Search/HSP/BlastHSP.html ): Different versions of Blast report different values for the total length of the alignment. This is the number reported in the denominators in the stats section: "Identical = 34/120 Positives = 67/120". NCBI-BLAST uses the total length of the alignment (with gaps) WU-BLAST uses the length of the query sequence (without gaps). Therefore, when called without an argument or an argument of 'total', this method will report different values depending on the version of BLAST used. To get the fraction identical among only the aligned residues, ignoring the gaps, call this method with an argument of 'query' or 'sbjct' ('sbjct' is synonymous with 'hit'). 2) If I understand your question correctly I think you are looking for frac_aligned_hit and/or frac_aligned_query called on you hit object. See ( http://doc.bioperl.org/releases/bioperl-1.4/Bio/Search/Hit/GenericHit.h tml) for discussion. 3) Try the files in the bioperl test/data directory for lots of program output samples. For wu-blast have a look at: bioperl-live/t/data/brassica_ATH.WUBLASTN which can be found on the web at: http://cvs.bioperl.org/cgi-bin/viewcvs/viewcvs.cgi/*checkout*/bioperl-li ve/t/data/brassica_ATH.WUBLASTN?rev=HEAD&cvsroot=bioperl&content-type=te xt/plain. Barry -----Original Message----- From: bioperl-l-bounces@portal.open-bio.org [mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Angshu Kar Sent: Sunday, December 04, 2005 6:32 PM To: bioperl-l@bioperl.org Subject: [Bioperl-l] parsing a BLAST output Hi, To begin with, I'm new to Bioperl. 
Now, I've written the following simple piece of code to parse a WU-Blast output which filters data *for a given e-value and >50% overlap*. I'm writing the main algorithm here: my $blast_report = $ARG[1]; my $threshold_evalue = $ARG[2]; my $in = new Bio::SearchIO(-format => 'blast', -file => $blast_report); while (my $result = $in -> next_result) { while(my $hit = $result->next_hit) { if(($line{$hit->name} == $line{$result->query_accession})) { next; } if($hit->hsp->evalue <= $threshold_evalue) { if($hit->hsp->frac_indentical>=0.5) { print $line{$result->query_accession} . "\t" . $line{$hit->name} . "\t" . $hit->hsp-evalue . "\n"; } } } } My questions are: 1. does the frac_identical gives the measure of % overlap? Or, are there any other methods? 2. now, i don't have any blast data sets to test my code upon.could any of the experienced users let me know whether the algorithm is fine?any tip-offs on any point (from optimization to syntactical errors) are heartily welcome. 3. could any one please let me know if i can find sample wu-blast outputs to test my script upon? Appreciate your guidance. Thanks, Angshu _______________________________________________ Bioperl-l mailing list Bioperl-l@portal.open-bio.org http://portal.open-bio.org/mailman/listinfo/bioperl-l From angshu96 at gmail.com Thu Dec 8 23:10:19 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Thu Dec 8 23:07:46 2005 Subject: [Bioperl-l] Urgent: DB module installation In-Reply-To: References: Message-ID: I'm away from my lab now. I'll surely send you the error messages. I got 16 errors when i ran the make test command and all in the .t files. Also could you please let me know if I can test whether the db module has been properly installed using some command? Thanks, Angshu On 12/8/05, Barry Moore wrote: > > Angshu- > > The package you downloaded should have come with a file named INSTALL. > Did you have specific problems following the instructions in that file? 
>
> Barry
>
> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org
> [mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Angshu Kar
> Sent: Thursday, December 08, 2005 7:01 PM
> To: bioperl-l
> Subject: [Bioperl-l] Urgent: DB module installation
>
> Hi,
>
> Could anyone please let me know how to install Bioperl-db in WindowsXP
> as well as a linux machine?
>
> Thanks,
> Angshu
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

From MEC at stowers-institute.org Fri Dec 9 00:25:57 2005
From: MEC at stowers-institute.org (Cook, Malcolm)
Date: Fri Dec 9 00:38:51 2005
Subject: [Bioperl-l] patch to Bio::Location::Split - split locations whose strand is -1 don't print with complement when only one sublocation present
Message-ID:

Jason,

When a "split location" really isn't, and has only one sublocation, the 'complement' was missing from the string produced by to_FTstring.

So, I moved the code that wraps the FTstring in 'complement' outside of the test for the number of sublocations, and it works, at least better for me in my hands.

Fix is below. I'm tracking the head.

sub to_FTstring {
    my ($self) = @_;
    my @strs;
    my $strand;
    if( ($strand = ($self->strand || 0)) < 0 ) {
        $self->flip_strand; # this will recursively set the strand
                            # to +1 for all the sub locations
    }
    foreach my $loc ( $self->sub_Location() ) {
        my $str = $loc->to_FTstring();
        # we only append the remote seq_id if it hasn't been done already
        # by the sub-location (which it should if it knows it's remote)
        # (and of course only if it's necessary)
        if( (! $loc->is_remote) &&
            defined($self->seq_id) && defined($loc->seq_id) &&
            ($loc->seq_id ne $self->seq_id) ) {
            $str = sprintf("%s:%s", $loc->seq_id, $str);
        }
        push @strs, $str;
    }
    $self->flip_strand if $strand < 0;
    my $str;
    if( @strs == 1 ) {
        ($str) = @strs;
    } elsif( @strs == 0 ) {
        $self->warn("no Sublocations for this splitloc, so not returning anything\n");
    } else {
        $str = sprintf("%s(%s)",lc $self->splittype, join(",", @strs));
    }
    if( $strand < 0 ) { # wrap this in a complement if it was unrolled
        $str = sprintf("%s(%s)",'complement',$str);
    } # mec fixed: this had erroneously not occurred previously when @strs == 1
    return $str;
}

Also, while we're here, what do you understand the semantics of strand in split sublocations to be? Your logic of temporarily flipping the sublocations' strand seems to suggest that you expect that the strand of the sublocation should in practice agree with that of the superior split location. I'm building split locations 'by hand' and seem forced to set the strand in both the parent and all subs. Is this what you expect?

thanks.

As an aside but related issue, I've got the bioperl source head cvs checked out as anonymous with a bunch of edits (such as this) that I'd like now to commit given that I now have privs (I'm user mcook). Do you know a way to edit my source tree to change them to be checked out by mcook instead of anonymous? Or do I have to recheck out a fresh source tree and make my edits there for commit?
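
The effect of the to_FTstring change in this message can be sketched without BioPerl. The function below is a hypothetical stand-in for the tail of the method only: ft_string, its argument order, and the example coordinates are assumptions made for illustration, not part of Bio::Location::Split. It shows the complement wrapper being applied after the sublocation count is checked, so a lone sublocation on the minus strand is wrapped as well.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the end of to_FTstring: takes the strand,
# the split type, and the already-rendered sublocation strings.
sub ft_string {
    my ($strand, $splittype, @strs) = @_;
    return unless @strs;                 # no sublocations: nothing to render
    my $str = @strs == 1
        ? $strs[0]                       # a lone sublocation is printed bare
        : sprintf("%s(%s)", lc $splittype, join(",", @strs));
    # Wrap after the count check, so the single-sublocation case is covered too.
    $str = sprintf("complement(%s)", $str) if $strand < 0;
    return $str;
}

print ft_string(-1, 'JOIN', '10..20'), "\n";            # complement(10..20)
print ft_string(-1, 'JOIN', '10..20', '30..40'), "\n";  # complement(join(10..20,30..40))
print ft_string( 1, 'JOIN', '10..20', '30..40'), "\n";  # join(10..20,30..40)
```

Unlike the patch as posted, this sketch returns early when there are no sublocations, so it never wraps an undefined string; whether the real method should do the same is a separate question.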

Thanks

Cheers,

Malcolm Cook - mec@stowers-institute.org - 816-926-4449
Database Applications Manager - Bioinformatics
Stowers Institute for Medical Research - Kansas City, MO USA

From heikki at sanbi.ac.za Fri Dec 9 01:52:48 2005
From: heikki at sanbi.ac.za (Heikki Lehvaslaiho)
Date: Fri Dec 9 02:32:39 2005
Subject: [Bioperl-l] PrimarySeq object question
In-Reply-To: <8CA078F2E9F09043B48141DD6195E99D08A752C0@marfa.ttuhsc.edu>
References: <8CA078F2E9F09043B48141DD6195E99D08A752C0@marfa.ttuhsc.edu>
Message-ID: <200512090852.48878.heikki@sanbi.ac.za>

Kevin,

The message you get comes from the Bio::Seq::SeqWithQuality module. It could be argued that it is overzealous to throw an exception if it cannot find a sequence. Also, the message comes after it has concluded that you have not set the alphabet, which, according to your code snippet, you have.

Report this as a bug to bugzilla.bioperl.org and make sure you attach an example scf file.

Meanwhile, you can always avoid exiting from your code by wrapping the code in an eval block.

eval {
    # your code
};   # note the ';'
if ($@) {
    # do something else
}

Yours,
-Heikki

On Thursday 08 December 2005 19:36, kevin.mcmahon@ttuhsc.edu wrote:
> Everyone,
>
> I'm new to this, so please bear with me.
>
> I'm having some trouble with a scf to fasta converting program I'm writing.
>
> my $in = Bio::SeqIO->new(-file => $infile , '-format' => 'scf',
>                          -alphabet => 'dna');
>
> my $seq = $in->next_seq();
> print "My sequence is: " . $seq->seq() . "\n";
>
> Above is the code in discussion. The $in object contains information from
> a file ($infile) in scf format.
>
> Here's my problem. When we get to $in->next_seq(), if the file is empty,
> the program dies and returns:
>
> "MSG: If you want me to create a PrimarySeq object for your empty sequence
> you must specify a -alphabet to satisfy the constructor
> requirements for a Bio::PrimarySeq object with no sequence. Read the POD
> for it, luke."
>
> I guess what I need to know is: if this $in->next_seq() doesn't work, how
> can I test for this before I get this reply.
>
> Thanks in advance,
>
> Wyatt
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

--
______ _/      _/_____________________________________________________
      _/      _/
     _/  _/  _/  Heikki Lehvaslaiho    heikki at_sanbi _ac _za
    _/_/_/_/_/  Associate Professor    skype: heikki_lehvaslaiho
   _/  _/  _/  SANBI, South African National Bioinformatics Institute
  _/  _/  _/  University of the Western Cape, South Africa
     _/      Phone: +27 21 959 2096   FAX: +27 21 959 2512
___ _/_/_/_/_/________________________________________________________

From heikki at sanbi.ac.za Fri Dec 9 04:36:32 2005
From: heikki at sanbi.ac.za (Heikki Lehvaslaiho)
Date: Fri Dec 9 04:34:20 2005
Subject: [Bioperl-l] need info
In-Reply-To: <4396EB4D.30109@ed.ac.uk>
References: <4396EB4D.30109@ed.ac.uk>
Message-ID: <200512091136.33101.heikki@sanbi.ac.za>

Richard,

The modules.pl really could check for public methods, and it could try to find from the t files which ones have been tested. That would be quite useful, but only after we can clean up the module level testing output sufficiently.

I've been thinking of extending the current code in modules.pl to recursively tick as tested all modules that are used or inherited from, but have not written any code yet.

Please go ahead and try to get something together along the lines you've been thinking.

-Heikki

On Wednesday 07 December 2005 16:01, Richard Adams wrote:
> Hi Heikki,
> Regarding looking for untested modules/tests,
> how about parsing the *.t files, parsing out the lines such as
>
> ok $myvar->method1
>
> and getting the class of the variable and making a hash of which public
> methods in each class are tested.
> In this way the class of IO type instances will be the correct subclass > and you can get this info independent of parsing the 'use module' > statements If a method is not in that class (ie is in a superclass) then > the inheritance hierarchy can be searched until the method is found and > that method in the superclass ticked as 'tested' > > If you like I could work up some pseudocode for it for any comments, > > Richard -- ______ _/ _/_____________________________________________________ _/ _/ _/ _/ _/ Heikki Lehvaslaiho heikki at_sanbi _ac _za _/_/_/_/_/ Associate Professor skype: heikki_lehvaslaiho _/ _/ _/ SANBI, South African National Bioinformatics Institute _/ _/ _/ University of the Western Cape, South Africa _/ Phone: +27 21 959 2096 FAX: +27 21 959 2512 ___ _/_/_/_/_/________________________________________________________ From ron at ron.dk Fri Dec 9 04:36:47 2005 From: ron at ron.dk (Rasmus Ory Nielsen) Date: Fri Dec 9 05:31:45 2005 Subject: [Bioperl-l] Missing parameters in Bio::Tools::Run::Primer3 Message-ID: <1134121007.14008.28.camel@gbi-pc-128036.djf.agrsci.dk> Hi list I wish to use the Bio::Tools::Run::Primer3 module (branch 1-5-1). I found that the PRIMER_LIB_AMBIGUITY_CODES_CONSENSUS parameter is missing from the module. This is the warning I get: MSG: Parameter PRIMER_LIB_AMBIGUITY_CODES_CONSENSUS is not a valid Primer3 parameter I then searched the module for more missing parameters. Below is a list of what I found. 

PRIMER_DEFAULT_PRODUCT
PRIMER_DEFAULT_SIZE
PRIMER_INSIDE_PENALTY
PRIMER_INTERNAL_OLIGO_MAX_TEMPLATE_MISHYB
PRIMER_LIB_AMBIGUITY_CODES_CONSENSUS
PRIMER_MAX_TEMPLATE_MISPRIMING
PRIMER_OUTSIDE_PENALTY
PRIMER_PAIR_MAX_TEMPLATE_MISPRIMING
PRIMER_PAIR_WT_TEMPLATE_MISPRIMING
PRIMER_WT_TEMPLATE_MISPRIMING

Best regards
Rasmus Ory Nielsen

From heikki at sanbi.ac.za Fri Dec 9 07:26:56 2005
From: heikki at sanbi.ac.za (Heikki Lehvaslaiho)
Date: Fri Dec 9 07:24:45 2005
Subject: [Bioperl-l] throw, not die
Message-ID: <200512091426.57161.heikki@sanbi.ac.za>

Brian,

I picked up the idea from your recent commits, did a search for 'die' commands in bioperl cvs head and fixed quite a few. I did not touch the Bio::Graphics name space as those do not inherit from Bio::Root::Root.

I hope I was not overly zealous. At least all tests pass.

-Heikki

--
______ _/      _/_____________________________________________________
      _/      _/
     _/  _/  _/  Heikki Lehvaslaiho    heikki at_sanbi _ac _za
    _/_/_/_/_/  Associate Professor    skype: heikki_lehvaslaiho
   _/  _/  _/  SANBI, South African National Bioinformatics Institute
  _/  _/  _/  University of the Western Cape, South Africa
     _/      Phone: +27 21 959 2096   FAX: +27 21 959 2512
___ _/_/_/_/_/________________________________________________________

From angshu96 at gmail.com Fri Dec 9 10:10:12 2005
From: angshu96 at gmail.com (Angshu Kar)
Date: Fri Dec 9 10:08:42 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To: <3e86c394e54677a898eab8de11d8e002@gmx.net>
References: <3e86c394e54677a898eab8de11d8e002@gmx.net>
Message-ID:

Hi Hilmar,

In the load_seqdatabase.pl script could you please tell me where you are inserting the data into the db tables, so that I can try and modify that part to insert data only to the tables that I need?

Thanks,
Angshu

On 12/8/05, Hilmar Lapp wrote:
>
> Any reason you didn't instantiate the rest of the schema? Any scripts
> and software that have been written against BioSQL will certainly
> expect the rest of the schema be present ...
> > Bioperl-db is the BioSQL language binding for Bioperl, so that's what > you will want to use. It comes with a script load_seqdatabase.pl to > load any format supported by Bioperl. > > However, bioperl-db does expect all of Biosql to be present ... > > -hilmar > > On Dec 7, 2005, at 6:12 PM, Angshu Kar wrote: > > > Hi, > > > > I've created 5 tables (taxon, taxon name, bioentry, biosequence, > > biodatabase) in my postgresql database (linux box) using the biosql > > schema > > ddl from > > http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/sql/ > > biosqldb-pg.sql?rev=1.29&cvsroot=biosql&content-type=text/vnd.viewcvs- > > markup > > . > > Now I want to load the tables with arabidopsis data. Could you please > > let me > > know where can I find such scripts for pgsql? And also I find at > > http://bio.perl.org/Core/Latest/index.shtml that the DB module has not > > been > > updated since 2001. Do I need to install that? Or are there some new > > releases? > > > > I'll be obliged if you can guide. > > > > Thanks, > > Angshu > > > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l@portal.open-bio.org > > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > > > > -- > ------------------------------------------------------------- > Hilmar Lapp email: lapp at gnf.org > GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 > ------------------------------------------------------------- > > > From sdavis2 at mail.nih.gov Fri Dec 9 10:56:14 2005 From: sdavis2 at mail.nih.gov (Sean Davis) Date: Fri Dec 9 10:59:48 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: Message-ID: On 12/9/05 10:10 AM, "Angshu Kar" wrote: > Hi Hilmar, > > In the load_seqdatabase.pl script could you please tell me where you are > inserting the data into the db tables, so that I can try and modify that > part to insert data only to the tables that I need ? 
Angshu, In general, that isn't going to be a fruitful exercise, as the code probably isn't divided nicely into an "insert" (but Hilmar can comment directly). I would do as Hilmar suggests and instantiate the entire schema. Only then can you expect the tools for loading and querying the database to be useful. If you want to do something differently, you are probably better off starting with your own schema (or using BioSQL) and then writing your own tools to load and query it. However, doing so can be EXTREMELY challenging for genomics data. Sean From osborne1 at optonline.net Fri Dec 9 11:36:23 2005 From: osborne1 at optonline.net (Brian Osborne) Date: Fri Dec 9 11:38:51 2005 Subject: [Bioperl-l] Missing parameters in Bio::Tools::Run::Primer3 In-Reply-To: <1134121007.14008.28.camel@gbi-pc-128036.djf.agrsci.dk> Message-ID: Rasmus, Thanks for pointing this out. Please record this in Bugzilla (http://bugzilla.bioperl.org) as an "Enhancement", this way your suggestion won't get lost. Brian O. On 12/9/05 4:36 AM, "Rasmus Ory Nielsen" wrote: > Hi list > > I wish to use the Bio::Tools::Run::Primer3 module (branch 1-5-1). I > found that the PRIMER_LIB_AMBIGUITY_CODES_CONSENSUS parameter is missing > from the module. This is the warning I get: > > MSG: Parameter PRIMER_LIB_AMBIGUITY_CODES_CONSENSUS is not a valid > Primer3 parameter > > I then searched the module for more missing parameters. Below is a list > of what I found. 
> > PRIMER_DEFAULT_PRODUCT > PRIMER_DEFAULT_SIZE > PRIMER_INSIDE_PENALTY > PRIMER_INTERNAL_OLIGO_MAX_TEMPLATE_MISHYB > PRIMER_LIB_AMBIGUITY_CODES_CONSENSUS > PRIMER_MAX_TEMPLATE_MISPRIMING > PRIMER_OUTSIDE_PENALTY > PRIMER_PAIR_MAX_TEMPLATE_MISPRIMING > PRIMER_PAIR_WT_TEMPLATE_MISPRIMING > PRIMER_WT_TEMPLATE_MISPRIMING > > Best regards > Rasmus Ory Nielsen > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l From hlapp at gmx.net Fri Dec 9 12:22:51 2005 From: hlapp at gmx.net (Hilmar Lapp) Date: Fri Dec 9 12:27:01 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: References: <3e86c394e54677a898eab8de11d8e002@gmx.net> Message-ID: <25db4b5c66443458715cf08b488f538f@gmx.net> You need to install bioperl-db from CVS. There's is no good way to remove code that addresses those tables you don't want to instantiate, and frankly I don't understand why you want to spend any time on accommodating a truncated Biosql schema (i.e., I don't understand why there would be a price to instantiating the rest of the schema too). -hilmar On Dec 8, 2005, at 10:09 AM, Angshu Kar wrote: > Thank you Hilmar. Actually we want only those tables for our tests! > Also at first, do I need to install the Bioperl-db module ( But is it > the > one that was updated 4 years back or are there any new releases)? And > then > run the script suggested by you in the box? Can't we just edit the > script > and keep those parts that correspond to only the tables that I've > created? > > Thanks, > Angshu > > > On 12/8/05, Hilmar Lapp wrote: >> >> Any reason you didn't instantiate the rest of the schema? Any scripts >> and software that have been written against BioSQL will certainly >> expect the rest of the schema be present ... >> >> Bioperl-db is the BioSQL language binding for Bioperl, so that's what >> you will want to use. 
It comes with a script load_seqdatabase.pl to >> load any format supported by Bioperl. >> >> However, bioperl-db does expect all of Biosql to be present ... >> >> -hilmar >> >> On Dec 7, 2005, at 6:12 PM, Angshu Kar wrote: >> >>> Hi, >>> >>> I've created 5 tables (taxon, taxon name, bioentry, biosequence, >>> biodatabase) in my postgresql database (linux box) using the biosql >>> schema >>> ddl from >>> http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/ >>> sql/ >>> biosqldb-pg.sql?rev=1.29&cvsroot=biosql&content-type=text/ >>> vnd.viewcvs- >>> markup >>> . >>> Now I want to load the tables with arabidopsis data. Could you please >>> let me >>> know where can I find such scripts for pgsql? And also I find at >>> http://bio.perl.org/Core/Latest/index.shtml that the DB module has >>> not >>> been >>> updated since 2001. Do I need to install that? Or are there some new >>> releases? >>> >>> I'll be obliged if you can guide. >>> >>> Thanks, >>> Angshu >>> >>> _______________________________________________ >>> Bioperl-l mailing list >>> Bioperl-l@portal.open-bio.org >>> http://portal.open-bio.org/mailman/listinfo/bioperl-l >>> >>> >> -- >> ------------------------------------------------------------- >> Hilmar Lapp email: lapp at gnf.org >> GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 >> ------------------------------------------------------------- >> >> >> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 
92121 phone: +1-858-812-1757 ------------------------------------------------------------- From hlapp at gmx.net Fri Dec 9 12:45:02 2005 From: hlapp at gmx.net (Hilmar Lapp) Date: Fri Dec 9 12:42:27 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: References: <3e86c394e54677a898eab8de11d8e002@gmx.net> Message-ID: <2725905b99cf48d0d0740376a1156082@gmx.net> Angshu, load_seqdatabase.pl is a script that utilizes the language binding library bioperl-db to load sequences and annotation into Biosql. The object-relational mapping code is all over bioperl-db. I'm sorry, but if you believe it is worth fiddling with that object-relational code to 'save' instantiating a few more tables then you're welcome to do so but you're essentially on your own. Also, if you'd like to work with a schema the language binding of which allows you to arbitrarily drop tables from the schema and still work then Biosql/bioperl-db may not be for you. -hilmar On Dec 9, 2005, at 7:10 AM, Angshu Kar wrote: > Hi Hilmar, > > In the load_seqdatabase.pl script could you please tell me where you > are inserting the data into the db tables, so that I can try and > modify that part to insert data only to the tables that I need ? > > Thanks, > Angshu > > > On 12/8/05, Hilmar Lapp wrote: Any reason you didn't > instantiate the rest of the schema? Any scripts >> and software that have been written against BioSQL will certainly >> expect the rest of the schema be present ... >> >> Bioperl-db is the BioSQL language binding for Bioperl, so that's what >> you will want to use. It comes with a script load_seqdatabase.pl to >> load any format supported by Bioperl. >> >> However, bioperl-db does expect all of Biosql to be present ... >> >>
-hilmar >> >> On Dec 7, 2005, at 6:12 PM, Angshu Kar wrote: >> >> > Hi, >> > >> > I've created 5 tables (taxon, taxon name, bioentry, biosequence, >> > biodatabase) in my postgresql database (linux box) using the biosql >> > schema >> > ddl from >> > >> http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/ >> sql/ >> > >> biosqldb-pg.sql?rev=1.29&cvsroot=biosql&content-type=text/ >> vnd.viewcvs- >> > markup >> > . >> > Now I want to load the tables with arabidopsis data. Could you >> please >> > let me >> > know where can I find such scripts for pgsql? And also I find at >> > http://bio.perl.org/Core/Latest/index.shtml that the DB module has >> not >> > been >> > updated since 2001. Do I need to install that? Or are there some new >> > releases? >> > >> > I'll be obliged if you can guide. >> > >> > Thanks, >> > Angshu >> > >> > _______________________________________________ >> > Bioperl-l mailing list >> > Bioperl-l@portal.open-bio.org >> > http://portal.open-bio.org/mailman/listinfo/bioperl-l >> > >> > >> -- >> ------------------------------------------------------------- >> Hilmar Lapp email: lapp at gnf.org >> GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 >> ------------------------------------------------------------- >> >> >> -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 ------------------------------------------------------------- From kevin.mcmahon at ttuhsc.edu Fri Dec 9 12:56:29 2005 From: kevin.mcmahon at ttuhsc.edu (kevin.mcmahon@ttuhsc.edu) Date: Fri Dec 9 12:54:05 2005 Subject: [Bioperl-l] PrimarySeq object question Message-ID: <6846AE140C394A4E9F91E177190F875204874564@alamo.ttuhsc.edu> Wow, what a great response! I'm very thankful for what everyone has done for me. First, to answer some questions: Thanks to Matt and Sam (my fellow beginners) for the help.
The file converter you showed me works great, but (as mentioned below in Heikki's response) this bug is in the Bio::Seq::SeqWithQuality module. It only throws this exception when you're working with these objects. So while your code works great for someone converting Genbank to Fasta, it won't help me read an empty scf file. Instead, this exception will be thrown and the program will stop. Now, on to Heikki and Brian. Thanks a ton guys. Here's what I eventually did: if (eval {$in->next_seq()}){ [Do some really cool stuff] }else{ next; } And now it works. If there are still problems with this code, let us all know for future reference and general education purposes. As Heikki said, "It could be argued that it is overzealous to throw an exception if it can not find a sequence." But, if you know the eval trick (which I should have been using all along) you can get around this. Thanks to everyone for helping. I'll submit the Bugzilla report after lunch. Thanks a ton, Wyatt -----Original Message----- From: Heikki Lehvaslaiho [mailto:heikki@sanbi.ac.za] Sent: Friday, December 09, 2005 12:53 AM To: bioperl-l@portal.open-bio.org Cc: McMahon, Kevin Subject: Re: [Bioperl-l] PrimarySeq object question Kevin, The message you get comes from the Bio::Seq::SeqWithQuality module. It could be argued that it is overzealous to throw an exception if it can not find a sequence. Also, the message comes after it has come to the conclusion that you have not set the alphabet that, according to your code snippet, you have. Report this as a bug to bugzilla.bioperl.org and make sure you attach an example scf file. Meanwhile, you can always avoid exiting from your code by wrapping the code within the eval statement. eval { # your code }; # note the ';' if ($@) { # do something else } Yours, -Heikki On Thursday 08 December 2005 19:36, kevin.mcmahon@ttuhsc.edu wrote: > Everyone, > > I'm new to this, so please bear with me.
> > I'm having some trouble with a scf to fasta converting program I'm writing. > > my $in = Bio::SeqIO->new(-file => $infile , '-format' => 'scf', > -alphabet => 'dna'); > > my $seq = $in->next_seq(); > print "My sequence is: " . $seq->seq() . "\n"; > > Above is the code in discussion. The $in object contains information from > a file ($infile) in scf format. > > Here's my problem. When we get to $in->next_seq(), if the file is empty, > the program dies and returns: > > "MSG: If you want me to create a PrimarySeq object for your empty sequence > you must specify a -alphabet to satisfy the constructor > requirements for a Bio::PrimarySeq object with no sequence. Read the POD > for it, luke." > > I guess what I need to know is: if this $in->next_seq() doesn't work, how > can I test for this before I get this reply. > > Thanks in advance, > > Wyatt > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l -- ______ _/ _/_____________________________________________________ _/ _/ _/ _/ _/ Heikki Lehvaslaiho heikki at_sanbi _ac _za _/_/_/_/_/ Associate Professor skype: heikki_lehvaslaiho _/ _/ _/ SANBI, South African National Bioinformatics Institute _/ _/ _/ University of the Western Cape, South Africa _/ Phone: +27 21 959 2096 FAX: +27 21 959 2512 ___ _/_/_/_/_/________________________________________________________ From hlapp at gmx.net Fri Dec 9 13:00:14 2005 From: hlapp at gmx.net (Hilmar Lapp) Date: Fri Dec 9 12:57:44 2005 Subject: [Bioperl-l] Urgent: DB module installation In-Reply-To: References: Message-ID: If you ran the tests against your 5-table version of Biosql then for sure they are going to fail. On Dec 8, 2005, at 8:10 PM, Angshu Kar wrote: > I'm away from my lab now. I'll surely send you the error messages. I > got 16 > errors when i ran the make test command and all in the .t files. 
Also > could > you please let me know if I can test whether the db module has been > properly > installed using some command? > > Thanks, > Angshu > > > On 12/8/05, Barry Moore wrote: >> >> Angshu- >> >> The package you downloaded should have come with a file named INSTALL. >> Did you have specific problems following the instructions in that >> file? >> >> Barry >> >> -----Original Message----- >> From: bioperl-l-bounces@portal.open-bio.org >> [mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Angshu Kar >> Sent: Thursday, December 08, 2005 7:01 PM >> To: bioperl-l >> Subject: [Bioperl-l] Urgent: DB module installation >> >> Hi, >> >> Could anyone please let me know how to install Bioperl-db in WindowsXP >> as >> well as a linux machine? >> >> Thanks, >> Angshu >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l@portal.open-bio.org >> http://portal.open-bio.org/mailman/listinfo/bioperl-l >> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 ------------------------------------------------------------- From angshu96 at gmail.com Fri Dec 9 13:06:33 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Fri Dec 9 13:04:04 2005 Subject: [Bioperl-l] Urgent: DB module installation In-Reply-To: References: Message-ID: Hi Hilmar, But this was while I was trying to install the Bioperl-db module. Has it got anything to do with my existing schema ? I'll be obliged if you could explain. Thanks, Angshu On 12/9/05, Hilmar Lapp wrote: > > If you ran the tests against your 5-table version of Biosql then for > sure they are going to fail. > > On Dec 8, 2005, at 8:10 PM, Angshu Kar wrote: > > > I'm away from my lab now. I'll surely send you the error messages. 
I > > got 16 > > errors when i ran the make test command and all in the .t files. Also > > could > > you please let me know if I can test whether the db module has been > > properly > > installed using some command? > > > > Thanks, > > Angshu > > > > > > On 12/8/05, Barry Moore wrote: > >> > >> Angshu- > >> > >> The package you downloaded should have come with a file named INSTALL. > >> Did you have specific problems following the instructions in that > >> file? > >> > >> Barry > >> > >> -----Original Message----- > >> From: bioperl-l-bounces@portal.open-bio.org > >> [mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Angshu Kar > >> Sent: Thursday, December 08, 2005 7:01 PM > >> To: bioperl-l > >> Subject: [Bioperl-l] Urgent: DB module installation > >> > >> Hi, > >> > >> Could anyone please let me know how to install Bioperl-db in WindowsXP > >> as > >> well as a linux machine? > >> > >> Thanks, > >> Angshu > >> > >> _______________________________________________ > >> Bioperl-l mailing list > >> Bioperl-l@portal.open-bio.org > >> http://portal.open-bio.org/mailman/listinfo/bioperl-l > >> > > > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l@portal.open-bio.org > > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > > > > -- > ------------------------------------------------------------- > Hilmar Lapp email: lapp at gnf.org > GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 > ------------------------------------------------------------- > > > From hlapp at gmx.net Fri Dec 9 13:09:54 2005 From: hlapp at gmx.net (Hilmar Lapp) Date: Fri Dec 9 13:07:21 2005 Subject: [Bioperl-l] Urgent: DB module installation In-Reply-To: References: Message-ID: The tests need a Biosql schema (otherwise testing bioperl-db is meaningless). If you didn't configure one in t/DBHarness.biosql.conf then they'll also fail .. On Dec 9, 2005, at 10:06 AM, Angshu Kar wrote: > Hi Hilmar, > ? 
> But this was while I was trying to install the Bioperl-db module. Has > it got anything to do with my existing schema ? I'll be obliged if you > could explain. > ? > Thanks, > Angshu > > ? > On 12/9/05, Hilmar Lapp wrote: If you ran the tests > against your 5-table version of Biosql then for >> sure they are going to fail. >> >> On Dec 8, 2005, at 8:10 PM, Angshu Kar wrote: >> >> > I'm away from my lab now. I'll surely send you the error messages. I >> > got 16 >> > errors when i ran the make test command and all in the .t files. >> Also >> > could >> > you please let me know if I can test whether the db module has been >> > properly >> > installed using some command? >> > >> > Thanks, >> > Angshu >> > >> > >> > On 12/8/05, Barry Moore < bmoore@genetics.utah.edu> wrote: >> >> >> >> Angshu- >> >> >> >> The package you downloaded should have come with a file named >> INSTALL. >> >> Did you have specific problems following the instructions in that >> >> file? >> >> >> >> Barry >> >> >> >> -----Original Message----- >> >> From: bioperl-l-bounces@portal.open-bio.org >> >> [mailto: bioperl-l-bounces@portal.open-bio.org] On Behalf Of >> Angshu Kar >> >> Sent: Thursday, December 08, 2005 7:01 PM >> >> To: bioperl-l >> >> Subject: [Bioperl-l] Urgent: DB module installation >> >> >> >> Hi, >> >> >> >> Could anyone please let me know how to install Bioperl-db in >> WindowsXP >> >> as >> >> well as a linux machine? 
>> >> >> >> Thanks, >> >> Angshu >> >> >> >> _______________________________________________ >> >> Bioperl-l mailing list >> >> Bioperl-l@portal.open-bio.org >> >> http://portal.open-bio.org/mailman/listinfo/bioperl-l >> >> >> > >> > _______________________________________________ >> > Bioperl-l mailing list >> > Bioperl-l@portal.open-bio.org >> > http://portal.open-bio.org/mailman/listinfo/bioperl-l >> > >> > >> -- >> ------------------------------------------------------------- >> Hilmar Lapp????????????????????????????email: lapp at gnf.org >> GNF, San Diego, Ca. 92121??????????????phone: +1-858-812-1757 >> ------------------------------------------------------------- >> >> >> -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 ------------------------------------------------------------- From hlapp at gmx.net Fri Dec 9 13:22:41 2005 From: hlapp at gmx.net (Hilmar Lapp) Date: Fri Dec 9 13:20:07 2005 Subject: [Bioperl-l] patch to Bio::Location::Split - split locations whose strand is -1 don't print with complement when only one sublocation present In-Reply-To: References: Message-ID: On Dec 8, 2005, at 9:25 PM, Cook, Malcolm wrote: > Also, while we're here, what do you understand the semantics of strand > in split sublocations to be? Your logic of termporarily flipping the > sublocations strand seems to suggest that you expect that the strand > of the sublocation should in practice agree with that of the superiour > split location. I'm building split locations 'by hand' and seem > forced to set the strand in both the parent and all subs. Is this > what you expect? thanks. Setting the strand on the container location should propagate to all its sub-locations. I.e., you should only have to set a single strand, unless you want sub-locations on different strands. In that case, any manipulation of the container strand will override the manually set sub-location strands. 
At least that's how it should be (i.e., was meant to be). > > As an aside but related issue, I've got the bioperl source head cvs > checked out as anonymous with a bunch of edits (such as this) that I'd > like now to commit given that I now have privs (I'm user mcook). Do > you know a way to edit my source tree to change them to be checked out > by mcook instead of anonymous? Or do I have to recheck out a fresh > source tree and make my edits there for commit? Thanks It might be easiest to get a fresh checkout and then copy over your changed sources (just be careful to only copy .pm files, so as not to copy the CVS/* files used by cvs ...). Actually, I'm going to retract this somewhat - it is really only a good idea if you're very sure that the repository wasn't updated from the time you made your changes. You can convince yourself whether a module was updated in the repository or not by comparing the revision IDs. If you have a series of modules you could skip that step, take a diff of your changed modules to the repository ('cvs diff') and apply the resulting patch(es) to your new checkout. -hilmar > > Cheers, > > Malcolm Cook - mec@stowers-institute.org - 816-926-4449 > Database Applications Manager - Bioinformatics > Stowers Institute for Medical Research - Kansas City, MO USA > > > > > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 ------------------------------------------------------------- From hlapp at gmx.net Fri Dec 9 13:26:19 2005 From: hlapp at gmx.net (Hilmar Lapp) Date: Fri Dec 9 13:23:44 2005 Subject: [Bioperl-l] Urgent: DB module installation In-Reply-To: References: Message-ID: <2cc8cd28502f68811af5e8870b1a0e4e@gmx.net> Read t/DBHarness.conf.example and follow the instructions.
Also, this is explained in the INSTALL file. Let me know if you have questions not addressed there. -hilmar On Dec 9, 2005, at 10:13 AM, Angshu Kar wrote: > Thank you so much Hilmar...and could you please explain this .conf > part (how/what to do)? > > Gratefully, > Angshu > > > On 12/9/05, Hilmar Lapp wrote: The tests need a Biosql > schema (otherwise testing bioperl-db is >> meaningless). If you didn't configure one in t/DBHarness.biosql.conf >> then they'll also fail .. >> >> On Dec 9, 2005, at 10:06 AM, Angshu Kar wrote: >> >> > Hi Hilmar, >> > >> > But this was while I was trying to install the Bioperl-db module. >> Has >> > it got anything to do with my existing schema ? I'll be obliged if >> you >> > could explain. >> > >> > Thanks, >> > Angshu >> > >> > >> > On 12/9/05, Hilmar Lapp wrote: If you ran the tests >> > against your 5-table version of Biosql then for >> >> sure they are going to fail. >> >> >> >> On Dec 8, 2005, at 8:10 PM, Angshu Kar wrote: >> >> >> >> > I'm away from my lab now. I'll surely send you the error >> messages. I >> >> > got 16 >> >> > errors when i ran the make test command and all in the .t files. >> >> Also >> >> > could >> >> > you please let me know if I can test whether the db module has >> been >> >> > properly >> >> > installed using some command? >> >> > >> >> > Thanks, >> >> > Angshu >> >> > >> >> > >> >> > On 12/8/05, Barry Moore < bmoore@genetics.utah.edu> wrote: >> >> >> >> >> >> Angshu- >> >> >> >> >> >> The package you downloaded should have come with a file named >> >> INSTALL. >> >> >> Did you have specific problems following the instructions in >> that >> >> >> file?
>> >> >> >> >> >> Barry >> >> >> >> >> >> -----Original Message----- >> >> >> From: bioperl-l-bounces@portal.open-bio.org >> >> >> [mailto: bioperl-l-bounces@portal.open-bio.org] On Behalf Of >> >> Angshu Kar >> >> >> Sent: Thursday, December 08, 2005 7:01 PM >> >> >> To: bioperl-l >> >> >> Subject: [Bioperl-l] Urgent: DB module installation >> >> >> >> >> >> Hi, >> >> >> >> >> >> Could anyone please let me know how to install Bioperl-db in >> >> WindowsXP >> >> >> as >> >> >> well as a linux machine? >> >> >> >> >> >> Thanks, >> >> >> Angshu >> >> >> >> >> >> _______________________________________________ >> >> >> Bioperl-l mailing list >> >> >> Bioperl-l@portal.open-bio.org >> >> >> http://portal.open-bio.org/mailman/listinfo/bioperl-l >> >> >> >> >> > >> >> > _______________________________________________ >> >> > Bioperl-l mailing list >> >> > Bioperl-l@portal.open-bio.org >> >> > http://portal.open-bio.org/mailman/listinfo/bioperl-l >> >> > >> >> > >> >> -- >> >> ------------------------------------------------------------- >> >> Hilmar Lappemail: lapp at gnf.org >> >> GNF, San Diego, Ca. 92121phone: +1-858-812-1757 >> >> ------------------------------------------------------------- >> >> >> >> >> >> >> -- >> ------------------------------------------------------------- >> Hilmar Lapp????????????????????????????email: lapp at gnf.org >> GNF, San Diego, Ca. 92121??????????????phone: +1-858-812-1757 >> ------------------------------------------------------------- >> >> >> -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 
92121 phone: +1-858-812-1757 ------------------------------------------------------------- From angshu96 at gmail.com Fri Dec 9 13:13:40 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Fri Dec 9 13:42:06 2005 Subject: [Bioperl-l] Urgent: DB module installation In-Reply-To: References: Message-ID: Thank you so much Hilmar...and could you please explain this .conf part (how/what to do)? Gratefully, Angshu On 12/9/05, Hilmar Lapp wrote: > > The tests need a Biosql schema (otherwise testing bioperl-db is > meaningless). If you didn't configure one in t/DBHarness.biosql.conf > then they'll also fail .. > > On Dec 9, 2005, at 10:06 AM, Angshu Kar wrote: > > > Hi Hilmar, > > > > But this was while I was trying to install the Bioperl-db module. Has > > it got anything to do with my existing schema ? I'll be obliged if you > > could explain. > > > > Thanks, > > Angshu > > > > > > On 12/9/05, Hilmar Lapp wrote: If you ran the tests > > against your 5-table version of Biosql then for > >> sure they are going to fail. > >> > >> On Dec 8, 2005, at 8:10 PM, Angshu Kar wrote: > >> > >> > I'm away from my lab now. I'll surely send you the error messages. I > >> > got 16 > >> > errors when i ran the make test command and all in the .t files. > >> Also > >> > could > >> > you please let me know if I can test whether the db module has been > >> > properly > >> > installed using some command? > >> > > >> > Thanks, > >> > Angshu > >> > > >> > > >> > On 12/8/05, Barry Moore < bmoore@genetics.utah.edu> wrote: > >> >> > >> >> Angshu- > >> >> > >> >> The package you downloaded should have come with a file named > >> INSTALL. > >> >> Did you have specific problems following the instructions in that > >> >> file? 
> >> >> > >> >> Barry > >> >> > >> >> -----Original Message----- > >> >> From: bioperl-l-bounces@portal.open-bio.org > >> >> [mailto: bioperl-l-bounces@portal.open-bio.org] On Behalf Of > >> Angshu Kar > >> >> Sent: Thursday, December 08, 2005 7:01 PM > >> >> To: bioperl-l > >> >> Subject: [Bioperl-l] Urgent: DB module installation > >> >> > >> >> Hi, > >> >> > >> >> Could anyone please let me know how to install Bioperl-db in > >> WindowsXP > >> >> as > >> >> well as a linux machine? > >> >> > >> >> Thanks, > >> >> Angshu > >> >> > >> >> _______________________________________________ > >> >> Bioperl-l mailing list > >> >> Bioperl-l@portal.open-bio.org > >> >> http://portal.open-bio.org/mailman/listinfo/bioperl-l > >> >> > >> > > >> > _______________________________________________ > >> > Bioperl-l mailing list > >> > Bioperl-l@portal.open-bio.org > >> > http://portal.open-bio.org/mailman/listinfo/bioperl-l > >> > > >> > > >> -- > >> ------------------------------------------------------------- > >> Hilmar Lappemail: lapp at gnf.org > >> GNF, San Diego, Ca. 92121phone: +1-858-812-1757 > >> ------------------------------------------------------------- > >> > >> > >> > -- > ------------------------------------------------------------- > Hilmar Lapp email: lapp at gnf.org > GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 > ------------------------------------------------------------- > > > From osborne1 at optonline.net Fri Dec 9 15:36:22 2005 From: osborne1 at optonline.net (Brian Osborne) Date: Fri Dec 9 15:38:54 2005 Subject: [Bioperl-l] throw, not die In-Reply-To: <200512091426.57161.heikki@sanbi.ac.za> Message-ID: Heikki, Torsten's idea! Brian O. On 12/9/05 7:26 AM, "Heikki Lehvaslaiho" wrote: > Brian, > > I picked up the idea from your recent commits, did a search for 'die' commands > in bioperl cvs head and fixed quite a few. I did not touch the Bio::Graphics > name space as those do not inherit from Bio::Root::Root. > > I hope I was not overly zealous. 
At least all tests pass. > > -Heikki From angshu96 at gmail.com Fri Dec 9 16:58:42 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Fri Dec 9 17:22:20 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: <2725905b99cf48d0d0740376a1156082@gmx.net> References: <3e86c394e54677a898eab8de11d8e002@gmx.net> <2725905b99cf48d0d0740376a1156082@gmx.net> Message-ID: Hi Hilmar, I'm obliged that you showed me the light. I reanalyzed the schema and found that its no use working with a truncated version of biosql-schema and now I'm planning to install the entire schema. Could you please let me know where can I find the script for that for a Pg db? Thank you so much. Else I would have to face a lots of problem in the later half of my project. Gratefully, Angshu On 12/9/05, Hilmar Lapp wrote: > > Angshu, load_seqdatabase.pl is a script that utilizes the language > binding library bioperl-db to load sequences and annotation into > Biosql. The object-relational mapping code is all over bioperl-db. > > I'm sorry, but if you believe it is worth fiddling with that > object-relational code to 'save' instantiating a few more tables then > you're welcome to do so but you're essentially on your own. > > Also, if you'd like to work with a schema the language binding of which > allows to arbitrarily drop tables from the schema and still work then > Biosql/bioperl-db may not be for you. > > -hilmar > > On Dec 9, 2005, at 7:10 AM, Angshu Kar wrote: > > > Hi Hilmar, > > > > In the load_seqdatabase.pl script could you please tell me where you > > are inserting the data into the db tables, so that I can try and > > modify that part to insert data only to the tables that I need ? > > > > Thanks, > > Angshu > > > > > > On 12/8/05, Hilmar Lapp wrote: Any reason you didn't > > instantiate the rest of the schema? Any scripts > >> and software that have been written against BioSQL will certainly > >> expect the rest of the schema be present ... 
> >> > >> Bioperl-db is the BioSQL language binding for Bioperl, so that's what > >> you will want to use. It comes with a script load_seqdatabase.pl to > >> load any format supported by Bioperl. > >> > >> However, bioperl-db does expect all of Biosql to be present ... > >> > >> -hilmar > >> > >> On Dec 7, 2005, at 6:12 PM, Angshu Kar wrote: > >> > >> > Hi, > >> > > >> > I've created 5 tables (taxon, taxon name, bioentry, biosequence, > >> > biodatabase) in my postgresql database (linux box) using the biosql > >> > schema > >> > ddl from > >> > > >> http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/ > >> sql/ > >> > > >> biosqldb-pg.sql?rev=1.29&cvsroot=biosql&content-type=text/ > >> vnd.viewcvs- > >> > markup > >> >. > >> > Now I want to load the tables with arabidopsis data. Could you > >> please > >> > let me > >> > know where can I find such scripts for pgsql? And also I find at > >> > http://bio.perl.org/Core/Latest/index.shtml that the DB module has > >> not > >> > been > >> > updated since 2001. Do I need to install that? Or are there some new > >> > releases? > >> > > >> > I'll be obliged if you can guide. > >> > > >> > Thanks, > >> > Angshu > >> > > >> > _______________________________________________ > >> > Bioperl-l mailing list > >> > Bioperl-l@portal.open-bio.org > >> > http://portal.open-bio.org/mailman/listinfo/bioperl-l > >> > > >> > > >> -- > >> ------------------------------------------------------------- > >> Hilmar Lappemail: lapp at gnf.org > >> GNF, San Diego, Ca. 92121phone: +1-858-812-1757 > >> ------------------------------------------------------------- > >> > >> > >> > -- > ------------------------------------------------------------- > Hilmar Lapp email: lapp at gnf.org > GNF, San Diego, Ca. 
92121 phone: +1-858-812-1757 > ------------------------------------------------------------- > > > From sdavis2 at mail.nih.gov Fri Dec 9 18:07:36 2005 From: sdavis2 at mail.nih.gov (Sean Davis) Date: Fri Dec 9 18:05:08 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: Message-ID: On 12/9/05 4:58 PM, "Angshu Kar" wrote: > Hi Hilmar, > > I'm obliged that you showed me the light. I reanalyzed the schema and found > that its no use working with a truncated version of biosql-schema and now > I'm planning to install the entire schema. Could you please let me know > where can I find the script for that for a Pg db? > > Thank you so much. Else I would have to face a lots of problem in the later > half of my project. > > Gratefully, > Angshu Angshu, Unfortunately (I hate to read documentation, also), you need to read the documentation and installation instructions with things that you download. If you got the biosql-schema, there is an INSTALL file in it that gives details. Hilmar and others have pointed out the INSTALL files on several occasions as a source of answers to most installation questions. I would suggest going through the INSTALL file located here: http://cvs.bioperl.org/cgi-bin/viewcvs/viewcvs.cgi/*checkout*/biosql-schema/ INSTALL?rev=HEAD&cvsroot=bioperl&content-type=text/plain If you have a problem, then you need to provide the exact commands that you executed prior to getting that problem, the exact error message you see, and the operating system and version numbers for any relevant software. That will save you much time and grief when asking for help. Sean From angshu96 at gmail.com Fri Dec 9 18:14:03 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Fri Dec 9 18:39:09 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: References: Message-ID: Thanks Sean. I've found all documents needed. I've also installed the biosql schema (by the way are there 28 tables? to confirm if I've missed something). 
My taxon loading script has also run successfully. Now I'll be running the
load_seq script.
Thank you so much for helping this novice out.

Thanks,
Angshu

On 12/9/05, Sean Davis wrote:
>
>
>
>
> On 12/9/05 4:58 PM, "Angshu Kar" wrote:
>
> > Hi Hilmar,
> >
> > I'm obliged that you showed me the light. I reanalyzed the schema and
> found
> > that its no use working with a truncated version of biosql-schema and
> now
> > I'm planning to install the entire schema. Could you please let me know
> > where can I find the script for that for a Pg db?
> >
> > Thank you so much. Else I would have to face a lots of problem in the
> later
> > half of my project.
> >
> > Gratefully,
> > Angshu
>
> Angshu,
>
> Unfortunately (I hate to read documentation, also), you need to read the
> documentation and installation instructions with things that you download.
> If you got the biosql-schema, there is an INSTALL file in it that gives
> details. Hilmar and others have pointed out the INSTALL files on several
> occasions as a source of answers to most installation questions.
>
> I would suggest going through the INSTALL file located here:
>
>
> http://cvs.bioperl.org/cgi-bin/viewcvs/viewcvs.cgi/*checkout*/biosql-schema/INSTALL?rev=HEAD&cvsroot=bioperl&content-type=text/plain
>
> If you have a problem, then you need to provide the exact commands that
> you
> executed prior to getting that problem, the exact error message you see,
> and
> the operating system and version numbers for any relevant software. That
> will save you much time and grief when asking for help.
>
> Sean
>
>
>

From angshu96 at gmail.com Fri Dec 9 19:21:49 2005
From: angshu96 at gmail.com (Angshu Kar)
Date: Fri Dec 9 19:19:13 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To:
References:
Message-ID:

Hi Sean,

A small help I need before I run the load_seqdatabase.pl. I've downloaded my
datafile which is ATH1_cds_cm_20040228 from TAIR. What's the namespace and
format for this?

Thanks,
Angshu

From bmoore at genetics.utah.edu Fri Dec 9 19:35:34 2005
From: bmoore at genetics.utah.edu (Barry Moore)
Date: Fri Dec 9 19:32:00 2005
Subject: [Bioperl-l] loading data to biosql tables
Message-ID:

Angshu-

Make the namespace whatever you want it to be. This is useful if you
want to load sequence from different sources into the same database. As
for the format - you tell us what format is the file in? You could just
let bioperl guess, but looking at the file and deciding yourself would
be your best bet.

Barry

> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org [mailto:bioperl-l-
> bounces@portal.open-bio.org] On Behalf Of Angshu Kar
> Sent: Friday, December 09, 2005 5:22 PM
> To: Sean Davis
> Cc: bioperl-l
> Subject: Re: [Bioperl-l] loading data to biosql tables
>
> Hi Sean,
>
> A small help I need before I run the load_seqdatabase.pl. I've downloaded
> my
> datafile which is ATH1_cds_cm_20040228 from TAIR. What's the namespace and
> format for this?
>
> Thanks,
> Angshu
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From angshu96 at gmail.com Fri Dec 9 19:45:41 2005
From: angshu96 at gmail.com (Angshu Kar)
Date: Fri Dec 9 19:43:07 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To:
References:
Message-ID:

Thanks a lot Barry.

Now I'm getting this error while trying to run the load_seqdatabase.pl in a
linux box (I used:
perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228)


Can't locate Bio/Root/Root.pm in @INC (@INC contains:
/usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5
/usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4
/usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2
/usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0
/usr/lib/perl5/site_perl
/usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4
/usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2
/usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0
/usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7.
BEGIN failed--compilation aborted at load_seqdatabase.pl line 7.

Please guide.

Thanks,
Angshu

On 12/9/05, Barry Moore wrote:
>
> Angshu-
>
> Make the namespace whatever you want it to be. This is useful if you
> want to load sequence from different sources into the same database. As
> for the format - you tell us what format is the file in? You could just
> let bioperl guess, but looking at the file and deciding yourself would
> be your best bet.
>
> Barry
>
> > -----Original Message-----
> > From: bioperl-l-bounces@portal.open-bio.org [mailto:bioperl-l-
> > bounces@portal.open-bio.org] On Behalf Of Angshu Kar
> > Sent: Friday, December 09, 2005 5:22 PM
> > To: Sean Davis
> > Cc: bioperl-l
> > Subject: Re: [Bioperl-l] loading data to biosql tables
> >
> > Hi Sean,
> >
> > A small help I need before I run the load_seqdatabase.pl. I've
> downloaded
> > my
> > datafile which is ATH1_cds_cm_20040228 from TAIR. What's the namespace
> and
> > format for this?
> >
> > Thanks,
> > Angshu
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
>

From angshu96 at gmail.com Fri Dec 9 19:54:59 2005
From: angshu96 at gmail.com (Angshu Kar)
Date: Fri Dec 9 19:52:26 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To:
References:
Message-ID:

One thing I missed was that my Root.pm resides in a different path...How to
specify that?

On 12/9/05, Angshu Kar wrote:
>
> Thanks a lot Barry.
> > Now I'm getting this error while tryin to run the load_seqdatabase.pl in a > linux box (I used : > perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228) > > > Can't locate Bio/Root/Root.pm in @INC (@INC contains: > /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 > /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi > /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi > /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi > /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi > /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi > /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi > /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 > /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 > /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 > /usr/lib/perl5/site_perl > /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi > /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi > /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi > /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi > /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi > /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi > /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 > /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 > /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 > /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7. > BEGIN failed--compilation aborted at load_seqdatabase.pl line 7. > > Please guide. > > Thanks, > Angshu > > On 12/9/05, Barry Moore < bmoore@genetics.utah.edu> wrote: > > > > Angshu- > > > > Make the namespace whatever you want it to be. This is useful if you > > want to load sequence from different sources into the same database. As > > for the format - you tell us what format is the file in? You could just > > let bioperl guess, but looking at the file and deciding yourself would > > be your best bet. 
> >
> > Barry
> >
> > > -----Original Message-----
> > > From: bioperl-l-bounces@portal.open-bio.org [mailto: bioperl-l-
> > > bounces@portal.open-bio.org] On Behalf Of Angshu Kar
> > > Sent: Friday, December 09, 2005 5:22 PM
> > > To: Sean Davis
> > > Cc: bioperl-l
> > > Subject: Re: [Bioperl-l] loading data to biosql tables
> > >
> > > Hi Sean,
> > >
> > > A small help I need before I run the load_seqdatabase.pl. I've
> > downloaded
> > > my
> > > datafile which is ATH1_cds_cm_20040228 from TAIR. What's the namespace
> >
> > and
> > > format for this?
> > >
> > > Thanks,
> > > Angshu
> > >
> > > _______________________________________________
> > > Bioperl-l mailing list
> > > Bioperl-l@portal.open-bio.org
> > > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
> >

From jason.stajich at duke.edu Fri Dec 9 21:09:35 2005
From: jason.stajich at duke.edu (Jason Stajich)
Date: Fri Dec 9 21:43:35 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To:
References:
Message-ID: <3CCC9485-FF57-4C6F-81AF-3B9C9AFDF197@duke.edu>

Follow the install instructions for bioperl first; you need bioperl
to run bioperl-db.
The options include setting your PERL5LIB, installing bioperl on your
system, or running the load script with -I PATH/TO/BIOPERL


On Dec 9, 2005, at 7:54 PM, Angshu Kar wrote:

> One thing I missed was that my Root.pm resides in a different
> path...How to
> specify that?
>
> On 12/9/05, Angshu Kar wrote:
>>
>> Thanks a lot Barry.
>> >> Now I'm getting this error while tryin to run the >> load_seqdatabase.pl in a >> linux box (I used : >> perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228) >> >> >> Can't locate Bio/Root/Root.pm in @INC (@INC contains: >> /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 >> /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi >> /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi >> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi >> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi >> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi >> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi >> /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 >> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 >> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 >> /usr/lib/perl5/site_perl >> /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi >> /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi >> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi >> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi >> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi >> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi >> /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 >> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 >> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 >> /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7. >> BEGIN failed--compilation aborted at load_seqdatabase.pl line 7. >> >> Please guide. >> >> Thanks, >> Angshu >> >> On 12/9/05, Barry Moore < bmoore@genetics.utah.edu> wrote: >>> >>> Angshu- >>> >>> Make the namespace whatever you want it to be. This is useful if >>> you >>> want to load sequence from different sources into the same >>> database. As >>> for the format - you tell us what format is the file in? 
You
>>> could just
>>> let bioperl guess, but looking at the file and deciding yourself
>>> would
>>> be your best bet.
>>>
>>> Barry
>>>
>>>> -----Original Message-----
>>>> From: bioperl-l-bounces@portal.open-bio.org [mailto: bioperl-l-
>>>> bounces@portal.open-bio.org] On Behalf Of Angshu Kar
>>>> Sent: Friday, December 09, 2005 5:22 PM
>>>> To: Sean Davis
>>>> Cc: bioperl-l
>>>> Subject: Re: [Bioperl-l] loading data to biosql tables
>>>>
>>>> Hi Sean,
>>>>
>>>> A small help I need before I run the load_seqdatabase.pl. I've
>>> downloaded
>>>> my
>>>> datafile which is ATH1_cds_cm_20040228 from TAIR. What's the
>>>> namespace
>>>
>>> and
>>>> format for this?
>>>>
>>>> Thanks,
>>>> Angshu
>>>>
>>>> _______________________________________________
>>>> Bioperl-l mailing list
>>>> Bioperl-l@portal.open-bio.org
>>>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>>
>>
>>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

--
Jason Stajich
Duke University
http://www.duke.edu/~jes12

From angshu96 at gmail.com Fri Dec 9 22:21:27 2005
From: angshu96 at gmail.com (Angshu Kar)
Date: Fri Dec 9 22:27:12 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To: <3CCC9485-FF57-4C6F-81AF-3B9C9AFDF197@duke.edu>
References: <3CCC9485-FF57-4C6F-81AF-3B9C9AFDF197@duke.edu>
Message-ID:

Thanks Jason...
I'm sorry but I didn't get you.
I've installed bioperl as well as bioperl-db module in my system...
Now what should be my next step to resolve this problem?
I'm sorry again, but as I said, I'm a novice in this domain.

Thanks,
Angshu


On 12/9/05, Jason Stajich wrote:
>
>
> Follow the install instructions for bioperl first, you need bioperl
> to run bioperl-db.
> These include, set your PERL5LIB or install bioperl on your system or > run the load script with -I PATH/TO/BIOPERL > > > On Dec 9, 2005, at 7:54 PM, Angshu Kar wrote: > > > One thing I missed was that my Root.pm resides in a different > > path...How to > > specify that? > > > > On 12/9/05, Angshu Kar wrote: > >> > >> Thanks a lot Barry. > >> > >> Now I'm getting this error while tryin to run the > >> load_seqdatabase.pl in a > >> linux box (I used : > >> perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228) > >> > >> > >> Can't locate Bio/Root/Root.pm in @INC (@INC contains: > >> /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 > >> /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 > >> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 > >> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 > >> /usr/lib/perl5/site_perl > >> /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 > >> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 > >> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 > >> /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7. > >> BEGIN failed--compilation aborted at load_seqdatabase.pl line 7. > >> > >> Please guide. 
> >>
> >> Thanks,
> >> Angshu
> >>
> >> On 12/9/05, Barry Moore < bmoore@genetics.utah.edu> wrote:
> >>>
> >>> Angshu-
> >>>
> >>> Make the namespace whatever you want it to be. This is useful if
> >>> you
> >>> want to load sequence from different sources into the same
> >>> database. As
> >>> for the format - you tell us what format is the file in? You
> >>> could just
> >>> let bioperl guess, but looking at the file and deciding yourself
> >>> would
> >>> be your best bet.
> >>>
> >>> Barry
> >>>
> >>>> -----Original Message-----
> >>>> From: bioperl-l-bounces@portal.open-bio.org [mailto: bioperl-l-
> >>>> bounces@portal.open-bio.org] On Behalf Of Angshu Kar
> >>>> Sent: Friday, December 09, 2005 5:22 PM
> >>>> To: Sean Davis
> >>>> Cc: bioperl-l
> >>>> Subject: Re: [Bioperl-l] loading data to biosql tables
> >>>>
> >>>> Hi Sean,
> >>>>
> >>>> A small help I need before I run the load_seqdatabase.pl. I've
> >>> downloaded
> >>>> my
> >>>> datafile which is ATH1_cds_cm_20040228 from TAIR. What's the
> >>>> namespace
> >>>
> >>> and
> >>>> format for this?
> >>>>
> >>>> Thanks,
> >>>> Angshu
> >>>>
> >>>> _______________________________________________
> >>>> Bioperl-l mailing list
> >>>> Bioperl-l@portal.open-bio.org
> >>>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >>>
> >>
> >>
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
> --
> Jason Stajich
> Duke University
> http://www.duke.edu/~jes12
>
>
>

From angshu96 at gmail.com Fri Dec 9 23:46:12 2005
From: angshu96 at gmail.com (Angshu Kar)
Date: Fri Dec 9 23:43:47 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To: <20051210044021.95719.qmail@web36809.mail.mud.yahoo.com>
References: <20051210044021.95719.qmail@web36809.mail.mud.yahoo.com>
Message-ID:

Thanks Chen. I'm running it on linux (red hat) too... but did you work with
biosql-db module and face that problem?

On 12/9/05, chen li wrote:
>
> Based on my personal experience Bioperl is meant to
> run much much smoothly on linux system (or Unix).
> Which operation system are you using? If it is windows
> I expect you have a lot of troubles (now and in the
> future) for a novice even you install an ActivePerl. I
> have the same probelm before. But everything is fine
> after I install a red hat linxu Fedora core 1 on my
> computer. Now I have dual OS on my computer. I follow
> the HOWTO INSTALL coming with Biosql and everything is
> fine.
>
> Li
>
> --- Angshu Kar wrote:
>
> > Thanks Jason...
> > I'm sorry but I didn't get you.
> > I've installed bioperl as well as bioperl-db module
> > in my system...
> > Now what should be my next step to resolve this
> > problem?
> > I'm sorry again, but as I told that I'm a novice in
> > this domain.
> >
> > Thanks,
> > Angshu
> >
> >
> > On 12/9/05, Jason Stajich
> > wrote:
> > >
> > >
> > > Follow the install instructions for bioperl first,
> > you need bioperl
> > > to run bioperl-db.
> > > These include, set your PERL5LIB or install
> > bioperl on your system or
> > > run the load script with -I PATH/TO/BIOPERL
> > >
> > >
> > > On Dec 9, 2005, at 7:54 PM, Angshu Kar wrote:
> > >
> > > > One thing I missed was that my Root.pm resides
> > in a different
> > > > path...How to
> > > > specify that?
> > > >
> > > > On 12/9/05, Angshu Kar
> > wrote:
> > > >>
> > > >> Thanks a lot Barry.
> > > >> > > > >> Now I'm getting this error while tryin to run > > the > > > >> load_seqdatabase.pl in a > > > >> linux box (I used : > > > >> perl load_seqdatabase.pl > > /akar/seq/ATH1_cds_cm_20040228) > > > >> > > > >> > > > >> Can't locate Bio/Root/Root.pm in @INC (@INC > > contains: > > > >> /usr/lib/perl5/5.8.5/i386-linux-thread-multi > > /usr/lib/perl5/5.8.5 > > > >> > > > /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi > > > >> > > > /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi > > > >> > > > /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi > > > >> > > > /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi > > > >> > > > /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi > > > >> > > > /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi > > > >> /usr/lib/perl5/site_perl/5.8.5 > > /usr/lib/perl5/site_perl/5.8.4 > > > >> /usr/lib/perl5/site_perl/5.8.3 > > /usr/lib/perl5/site_perl/5.8.2 > > > >> /usr/lib/perl5/site_perl/5.8.1 > > /usr/lib/perl5/site_perl/5.8.0 > > > >> /usr/lib/perl5/site_perl > > > >> > > > /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi > > > >> > > > /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi > > > >> > > > /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi > > > >> > > > /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi > > > >> > > > /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi > > > >> > > > /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi > > > >> /usr/lib/perl5/vendor_perl/5.8.5 > > /usr/lib/perl5/vendor_perl/5.8.4 > > > >> /usr/lib/perl5/vendor_perl/5.8.3 > > /usr/lib/perl5/vendor_perl/5.8.2 > > > >> /usr/lib/perl5/vendor_perl/5.8.1 > > /usr/lib/perl5/vendor_perl/5.8.0 > > > >> /usr/lib/perl5/vendor_perl .) at > > load_seqdatabase.pl line 7. > > > >> BEGIN failed--compilation aborted at > > load_seqdatabase.pl line 7. > > > >> > > > >> Please guide. 
> > > >> > > > >> Thanks, > > > >> Angshu > > > >> > > > >> On 12/9/05, Barry Moore < > > bmoore@genetics.utah.edu> wrote: > > > >>> > > > >>> Angshu- > > > >>> > > > >>> Make the namespace whatever you want it to be. > > This is useful if > > > >>> you > > > >>> want to load sequence from different sources > > into the same > > > >>> database. As > > > >>> for the format - you tell us what format is > > the file in? You > > > >>> could just > > > >>> let bioperl guess, but looking at the file and > > deciding yourself > > > >>> would > > > >>> be your best bet. > > > >>> > > > >>> Barry > > > >>> > > > >>>> -----Original Message----- > > > >>>> From: bioperl-l-bounces@portal.open-bio.org > > [mailto: bioperl-l- > > > >>>> bounces@portal.open-bio.org] On Behalf Of > > Angshu Kar > > > >>>> Sent: Friday, December 09, 2005 5:22 PM > > > >>>> To: Sean Davis > > > >>>> Cc: bioperl-l > > > >>>> Subject: Re: [Bioperl-l] loading data to > > biosql tables > > > >>>> > > > >>>> Hi Sean, > > > >>>> > > > >>>> A small help I need before I run the > > load_seqdatabase.pl. I've > > > >>> downloaded > > > >>>> my > > > >>>> datafile which is ATH1_cds_cm_20040228 from > > TAIR. What's the > > > >>>> namespace > > > >>> > > > >>> and > > > >>>> format for this? 
> > > >>>>
> > > >>>> Thanks,
> > > >>>> Angshu
> > > >>>>
> > > >>>>
> > _______________________________________________
> > > >>>> Bioperl-l mailing list
> > > >>>> Bioperl-l@portal.open-bio.org
> > > >>>>
> >
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> > > >>>
> > > >>
> > > >>
> > > >
> > > > _______________________________________________
> > > > Bioperl-l mailing list
> > > > Bioperl-l@portal.open-bio.org
> > > >
> >
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> > >
> > > --
> > > Jason Stajich
> > > Duke University
> > > http://www.duke.edu/~jes12
> > >
> > >
> > >
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@portal.open-bio.org
> >
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
>
>
> __________________________________________________
> Do You Yahoo!?
> Tired of spam? Yahoo! Mail has the best spam protection around
> http://mail.yahoo.com
>

From osborne1 at optonline.net Sat Dec 10 00:11:21 2005
From: osborne1 at optonline.net (Brian Osborne)
Date: Sat Dec 10 00:13:50 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To:
Message-ID:

Li,

Cygwin on Windows is a great platform for Bioperl, I used it happily for
many years. When I was using Windows and Cygwin I recall getting bioperl-db
and Biosql up and running in 10 minutes using the postgres package from
Cygwin, and this was the first time I tried to install bioperl-db and Biosql
with postgres. Since Cygwin is a Unix emulator all the INSTALL instructions
work perfectly without any amendments.

Brian O.


On 12/9/05 11:46 PM, "Angshu Kar" wrote:

> Thanks Chen. I'm running it on linux (red hat) too... but did you work with
> biosql-db module and face that problem?
>
> On 12/9/05, chen li wrote:
>>
>> Based on my personal experience Bioperl is meant to
>> run much much smoothly on linux system (or Unix).
>> Which operation system are you using?
If it is windows >> I expect you have a lot of troubles (now and in the >> future) for a novice even you install an ActivePerl. I >> have the same probelm before. But everything is fine >> after I install a red hat linxu Fedora core 1 on my >> computer. Now I have dual OS on my computer. I follow >> the HOWTO INSTALL coming with Biosql and everything is >> fine. >> >> Li >> >> --- Angshu Kar wrote: >> >>> Thanks Jason... >>> I'm sorry but I didn't get you. >>> I've installed bioperl as well as bioperl-db module >>> in my system... >>> Now what should be my next step to resolve this >>> problem? >>> I'm sorry again, but as I told that I'm a novice in >>> this domain. >>> >>> Thanks, >>> Angshu >>> >>> >>> On 12/9/05, Jason Stajich >>> wrote: >>>> >>>> >>>> Follow the install instructions for bioperl first, >>> you need bioperl >>>> to run bioperl-db. >>>> These include, set your PERL5LIB or install >>> bioperl on your system or >>>> run the load script with -I PATH/TO/BIOPERL >>>> >>>> >>>> On Dec 9, 2005, at 7:54 PM, Angshu Kar wrote: >>>> >>>>> One thing I missed was that my Root.pm resides >>> in a different >>>>> path...How to >>>>> specify that? >>>>> >>>>> On 12/9/05, Angshu Kar >>> wrote: >>>>>> >>>>>> Thanks a lot Barry. 
>>>>>> >>>>>> Now I'm getting this error while tryin to run >>> the >>>>>> load_seqdatabase.pl in a >>>>>> linux box (I used : >>>>>> perl load_seqdatabase.pl >>> /akar/seq/ATH1_cds_cm_20040228) >>>>>> >>>>>> >>>>>> Can't locate Bio/Root/Root.pm in @INC (@INC >>> contains: >>>>>> /usr/lib/perl5/5.8.5/i386-linux-thread-multi >>> /usr/lib/perl5/5.8.5 >>>>>> >>> >> /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi >>>>>> >>> >> /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi >>>>>> >>> >> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi >>>>>> >>> >> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi >>>>>> >>> >> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi >>>>>> >>> >> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi >>>>>> /usr/lib/perl5/site_perl/5.8.5 >>> /usr/lib/perl5/site_perl/5.8.4 >>>>>> /usr/lib/perl5/site_perl/5.8.3 >>> /usr/lib/perl5/site_perl/5.8.2 >>>>>> /usr/lib/perl5/site_perl/5.8.1 >>> /usr/lib/perl5/site_perl/5.8.0 >>>>>> /usr/lib/perl5/site_perl >>>>>> >>> >> /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi >>>>>> >>> >> /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi >>>>>> >>> >> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi >>>>>> >>> >> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi >>>>>> >>> >> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi >>>>>> >>> >> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi >>>>>> /usr/lib/perl5/vendor_perl/5.8.5 >>> /usr/lib/perl5/vendor_perl/5.8.4 >>>>>> /usr/lib/perl5/vendor_perl/5.8.3 >>> /usr/lib/perl5/vendor_perl/5.8.2 >>>>>> /usr/lib/perl5/vendor_perl/5.8.1 >>> /usr/lib/perl5/vendor_perl/5.8.0 >>>>>> /usr/lib/perl5/vendor_perl .) at >>> load_seqdatabase.pl line 7. >>>>>> BEGIN failed--compilation aborted at >>> load_seqdatabase.pl line 7. >>>>>> >>>>>> Please guide. 
>>>>>> >>>>>> Thanks, >>>>>> Angshu >>>>>> >>>>>> On 12/9/05, Barry Moore < >>> bmoore@genetics.utah.edu> wrote: >>>>>>> >>>>>>> Angshu- >>>>>>> >>>>>>> Make the namespace whatever you want it to be. >>> This is useful if >>>>>>> you >>>>>>> want to load sequence from different sources >>> into the same >>>>>>> database. As >>>>>>> for the format - you tell us what format is >>> the file in? You >>>>>>> could just >>>>>>> let bioperl guess, but looking at the file and >>> deciding yourself >>>>>>> would >>>>>>> be your best bet. >>>>>>> >>>>>>> Barry >>>>>>> >>>>>>>> -----Original Message----- >>>>>>>> From: bioperl-l-bounces@portal.open-bio.org >>> [mailto: bioperl-l- >>>>>>>> bounces@portal.open-bio.org] On Behalf Of >>> Angshu Kar >>>>>>>> Sent: Friday, December 09, 2005 5:22 PM >>>>>>>> To: Sean Davis >>>>>>>> Cc: bioperl-l >>>>>>>> Subject: Re: [Bioperl-l] loading data to >>> biosql tables >>>>>>>> >>>>>>>> Hi Sean, >>>>>>>> >>>>>>>> A small help I need before I run the >>> load_seqdatabase.pl. I've >>>>>>> downloaded >>>>>>>> my >>>>>>>> datafile which is ATH1_cds_cm_20040228 from >>> TAIR. What's the >>>>>>>> namespace >>>>>>> >>>>>>> and >>>>>>>> format for this? 
>>>>>>>>
>>>>>>>> Thanks,
>>>>>>>> Angshu
>>>>>>>>
>>>>>>>>
>>> _______________________________________________
>>>>>>>> Bioperl-l mailing list
>>>>>>>> Bioperl-l@portal.open-bio.org
>>>>>>>>
>>>
>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> Bioperl-l mailing list
>>>>> Bioperl-l@portal.open-bio.org
>>>>>
>>>
>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>>>
>>>> --
>>>> Jason Stajich
>>>> Duke University
>>>> http://www.duke.edu/~jes12
>>>>
>>>>
>>>>
>>>
>>> _______________________________________________
>>> Bioperl-l mailing list
>>> Bioperl-l@portal.open-bio.org
>>>
>> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>>>
>>
>>
>> __________________________________________________
>> Do You Yahoo!?
>> Tired of spam? Yahoo! Mail has the best spam protection around
>> http://mail.yahoo.com
>>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

From angshu96 at gmail.com Sat Dec 10 00:17:03 2005
From: angshu96 at gmail.com (Angshu Kar)
Date: Sat Dec 10 00:14:34 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To:
References:
Message-ID:

Hi Brian,

That sounds great! But in our labs they don't have any win boxes! Could you
please let me know how to get my problem resolved in a linux box?

Thanks,
Angshu

On 12/10/05, Brian Osborne wrote:
>
> Li,
>
> Cygwin on Windows is a great platform for Bioperl, I used it happily for
> many years. When I was using Windows and Cygwin I recall getting
> bioperl-db
> and Biosql up and running in 10 minutes using the postgres package from
> Cygwin, and this was the first time I tried to install bioperl-db and
> Biosql
> with postgres. Since Cygwin is a Unix emulator all the INSTALL
> instructions
> work perfectly without any amendments.
>
> Brian O.
> > > On 12/9/05 11:46 PM, "Angshu Kar" wrote: > > > Thanks Chen. I'm running it on linux (red hat) too... but did you work > with > > biosql-db module and face that problem? > > > > On 12/9/05, chen li wrote: > >> > >> Based on my personal experience Bioperl is meant to > >> run much much smoothly on linux system (or Unix). > >> Which operation system are you using? If it is windows > >> I expect you have a lot of troubles (now and in the > >> future) for a novice even you install an ActivePerl. I > >> have the same probelm before. But everything is fine > >> after I install a red hat linxu Fedora core 1 on my > >> computer. Now I have dual OS on my computer. I follow > >> the HOWTO INSTALL coming with Biosql and everything is > >> fine. > >> > >> Li > >> > >> --- Angshu Kar wrote: > >> > >>> Thanks Jason... > >>> I'm sorry but I didn't get you. > >>> I've installed bioperl as well as bioperl-db module > >>> in my system... > >>> Now what should be my next step to resolve this > >>> problem? > >>> I'm sorry again, but as I told that I'm a novice in > >>> this domain. > >>> > >>> Thanks, > >>> Angshu > >>> > >>> > >>> On 12/9/05, Jason Stajich > >>> wrote: > >>>> > >>>> > >>>> Follow the install instructions for bioperl first, > >>> you need bioperl > >>>> to run bioperl-db. > >>>> These include, set your PERL5LIB or install > >>> bioperl on your system or > >>>> run the load script with -I PATH/TO/BIOPERL > >>>> > >>>> > >>>> On Dec 9, 2005, at 7:54 PM, Angshu Kar wrote: > >>>> > >>>>> One thing I missed was that my Root.pm resides > >>> in a different > >>>>> path...How to > >>>>> specify that? > >>>>> > >>>>> On 12/9/05, Angshu Kar > >>> wrote: > >>>>>> > >>>>>> Thanks a lot Barry. 
> >>>>>> > >>>>>> Now I'm getting this error while tryin to run > >>> the > >>>>>> load_seqdatabase.pl in a > >>>>>> linux box (I used : > >>>>>> perl load_seqdatabase.pl > >>> /akar/seq/ATH1_cds_cm_20040228) > >>>>>> > >>>>>> > >>>>>> Can't locate Bio/Root/Root.pm in @INC (@INC > >>> contains: > >>>>>> /usr/lib/perl5/5.8.5/i386-linux-thread-multi > >>> /usr/lib/perl5/5.8.5 > >>>>>> > >>> > >> /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi > >>>>>> > >>> > >> /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi > >>>>>> > >>> > >> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi > >>>>>> > >>> > >> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi > >>>>>> > >>> > >> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi > >>>>>> > >>> > >> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi > >>>>>> /usr/lib/perl5/site_perl/5.8.5 > >>> /usr/lib/perl5/site_perl/5.8.4 > >>>>>> /usr/lib/perl5/site_perl/5.8.3 > >>> /usr/lib/perl5/site_perl/5.8.2 > >>>>>> /usr/lib/perl5/site_perl/5.8.1 > >>> /usr/lib/perl5/site_perl/5.8.0 > >>>>>> /usr/lib/perl5/site_perl > >>>>>> > >>> > >> /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi > >>>>>> > >>> > >> /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi > >>>>>> > >>> > >> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi > >>>>>> > >>> > >> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi > >>>>>> > >>> > >> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi > >>>>>> > >>> > >> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi > >>>>>> /usr/lib/perl5/vendor_perl/5.8.5 > >>> /usr/lib/perl5/vendor_perl/5.8.4 > >>>>>> /usr/lib/perl5/vendor_perl/5.8.3 > >>> /usr/lib/perl5/vendor_perl/5.8.2 > >>>>>> /usr/lib/perl5/vendor_perl/5.8.1 > >>> /usr/lib/perl5/vendor_perl/5.8.0 > >>>>>> /usr/lib/perl5/vendor_perl .) at > >>> load_seqdatabase.pl line 7. > >>>>>> BEGIN failed--compilation aborted at > >>> load_seqdatabase.pl line 7. > >>>>>> > >>>>>> Please guide. 
> >>>>>> > >>>>>> Thanks, > >>>>>> Angshu > >>>>>> > >>>>>> On 12/9/05, Barry Moore < > >>> bmoore@genetics.utah.edu> wrote: > >>>>>>> > >>>>>>> Angshu- > >>>>>>> > >>>>>>> Make the namespace whatever you want it to be. > >>> This is useful if > >>>>>>> you > >>>>>>> want to load sequence from different sources > >>> into the same > >>>>>>> database. As > >>>>>>> for the format - you tell us what format is > >>> the file in? You > >>>>>>> could just > >>>>>>> let bioperl guess, but looking at the file and > >>> deciding yourself > >>>>>>> would > >>>>>>> be your best bet. > >>>>>>> > >>>>>>> Barry > >>>>>>> > >>>>>>>> -----Original Message----- > >>>>>>>> From: bioperl-l-bounces@portal.open-bio.org > >>> [mailto: bioperl-l- > >>>>>>>> bounces@portal.open-bio.org] On Behalf Of > >>> Angshu Kar > >>>>>>>> Sent: Friday, December 09, 2005 5:22 PM > >>>>>>>> To: Sean Davis > >>>>>>>> Cc: bioperl-l > >>>>>>>> Subject: Re: [Bioperl-l] loading data to > >>> biosql tables > >>>>>>>> > >>>>>>>> Hi Sean, > >>>>>>>> > >>>>>>>> A small help I need before I run the > >>> load_seqdatabase.pl. I've > >>>>>>> downloaded > >>>>>>>> my > >>>>>>>> datafile which is ATH1_cds_cm_20040228 from > >>> TAIR. What's the > >>>>>>>> namespace > >>>>>>> > >>>>>>> and > >>>>>>>> format for this? 
> >>>>>>>>
> >>>>>>>> Thanks,
> >>>>>>>> Angshu
> >>>>
> >>>> --
> >>>> Jason Stajich
> >>>> Duke University
> >>>> http://www.duke.edu/~jes12
> >>>>
> >
> >

From MEC at stowers-institute.org Sat Dec 10 02:06:03 2005
From: MEC at stowers-institute.org (Cook, Malcolm)
Date: Sat Dec 10 02:20:25 2005
Subject: [Bioperl-l] HOWTO: take a slice of a split location
Message-ID:

Fellow Bioperlers,

I was in need of extracting the 3'-most 1000 bp from multiple genomic CDS regions (designing 70mer u-array probes).

I looked in vain for Bio::Location->splice($from,$to);

So I wrote one, which works but suffers from actually materializing the list of integer indices into the sequence for every base.

Has anyone a better approach they'd care to share?

Malcolm Cook - mec@stowers-institute.org
Stowers Institute for Medical Research - Kansas City, MO USA

P.S.
Here's what I wrote:

package Bio::LocationI;   # Code in the interface so it works
                          # with both ::Split and ::Simple
                          # Bio::Locations

# Loaded explicitly because slice() constructs both classes.
use Bio::Location::Simple;
use Bio::Location::Split;

sub _intspans {
    # Purpose: for a (presumably) monotonically increasing list of
    # integers, return list of arrays each holding min and max of
    # the list's internal contiguous spans.
    #
    # Example: 1..5,10..20,30 => ([1,5],[10,20],[30,30])
    my @i = @_;
    die "nothing passed to intspans" unless @i;
    my @s = ([$i[0], shift(@i)]);
    foreach (@i) {
        if ($_ == 1 + $s[0][1]) {
            $s[0][1] = $_;          # extend the current span
        } else {
            unshift @s, [$_, $_];   # start a new span
        }
    }
    return reverse @s;
}

sub slice {
    # Purpose: compute a slice of the Location, using Perl's normal slice
    # semantics, except that it trims out-of-range values.
    my ($self, $from, $to) = @_;
    # Expand every sub-location into its individual positions.
    my @int = map { $_->start .. $_->end } $self->each_Location;
    @int = @int[$from..$to];
    @int = grep { $_ } @int;   # Removing undefs (in case $from/$to out of bounds).
    my @intspans = _intspans(@int);
    Bio::Location::Split->new(
        -strand    => $self->strand,
        -locations => [
            map {
                Bio::Location::Simple->new(-start  => $_->[0],
                                           -end    => $_->[1],
                                           -strand => $self->strand)
            } @intspans
        ],
    );
}

From jason.stajich at duke.edu Sat Dec 10 11:24:25 2005
From: jason.stajich at duke.edu (Jason Stajich)
Date: Sat Dec 10 11:22:23 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To: References: <3CCC9485-FF57-4C6F-81AF-3B9C9AFDF197@duke.edu>
Message-ID: <2E2A564F-116D-497B-86B1-421CCF368437@duke.edu>

You have not installed it so that your perl knows to find it. Did you do 'make install'?

On Dec 9, 2005, at 10:21 PM, Angshu Kar wrote:

> Thanks Jason...
> I'm sorry but I didn't get you.
> I've installed bioperl as well as bioperl-db module in my system...
> Now what should be my next step to resolve this problem?
> I'm sorry again, but as I told that I'm a novice in this domain.
> > Thanks, > Angshu > > > On 12/9/05, Jason Stajich wrote: > > Follow the install instructions for bioperl first, you need bioperl > to run bioperl-db. > These include, set your PERL5LIB or install bioperl on your system or > run the load script with -I PATH/TO/BIOPERL > > > On Dec 9, 2005, at 7:54 PM, Angshu Kar wrote: > > > One thing I missed was that my Root.pm resides in a different > > path...How to > > specify that? > > > > On 12/9/05, Angshu Kar wrote: > >> > >> Thanks a lot Barry. > >> > >> Now I'm getting this error while tryin to run the > >> load_seqdatabase.pl in a > >> linux box (I used : > >> perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228) > >> > >> > >> Can't locate Bio/Root/Root.pm in @INC (@INC contains: > >> /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 > >> /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 > >> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 > >> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 > >> /usr/lib/perl5/site_perl > >> /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 > >> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 > >> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 > >> 
/usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7. > >> BEGIN failed--compilation aborted at load_seqdatabase.pl line 7. > >> > >> Please guide. > >> > >> Thanks, > >> Angshu > >> > >> On 12/9/05, Barry Moore < bmoore@genetics.utah.edu> wrote: > >>> > >>> Angshu- > >>> > >>> Make the namespace whatever you want it to be. This is useful if > >>> you > >>> want to load sequence from different sources into the same > >>> database. As > >>> for the format - you tell us what format is the file in? You > >>> could just > >>> let bioperl guess, but looking at the file and deciding yourself > >>> would > >>> be your best bet. > >>> > >>> Barry > >>> > >>>> -----Original Message----- > >>>> From: bioperl-l-bounces@portal.open-bio.org [mailto: bioperl-l- > >>>> bounces@portal.open-bio.org] On Behalf Of Angshu Kar > >>>> Sent: Friday, December 09, 2005 5:22 PM > >>>> To: Sean Davis > >>>> Cc: bioperl-l > >>>> Subject: Re: [Bioperl-l] loading data to biosql tables > >>>> > >>>> Hi Sean, > >>>> > >>>> A small help I need before I run the load_seqdatabase.pl. I've > >>> downloaded > >>>> my > >>>> datafile which is ATH1_cds_cm_20040228 from TAIR. What's the > >>>> namespace > >>> > >>> and > >>>> format for this? 
> >>>> > >>>> Thanks, > >>>> Angshu > >>>> > >>>> _______________________________________________ > >>>> Bioperl-l mailing list > >>>> Bioperl-l@portal.open-bio.org > >>>> http://portal.open-bio.org/mailman/listinfo/bioperl-l > >>> > >> > >> > > > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l@portal.open-bio.org > > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > -- > Jason Stajich > Duke University > http://www.duke.edu/~jes12 > > > -- Jason Stajich Duke University http://www.duke.edu/~jes12 From angshu96 at gmail.com Sat Dec 10 11:51:19 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Sat Dec 10 11:55:33 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: <2E2A564F-116D-497B-86B1-421CCF368437@duke.edu> References: <3CCC9485-FF57-4C6F-81AF-3B9C9AFDF197@duke.edu> <2E2A564F-116D-497B-86B1-421CCF368437@duke.edu> Message-ID: Yes Jason. But what I've done is, instead of putting the .pm and .pl files in default locations I've used the LIB and PREFIX arguments to place them in my local directory. This I've done for bioperl as well as bioperl-db modules. Now could you please help me in how to make perl find it? Thanks, Angshu On 12/10/05, Jason Stajich wrote: > > you have not installed it so that your perl knows to find it. Did you do > 'make install'? > On Dec 9, 2005, at 10:21 PM, Angshu Kar wrote: > > Thanks Jason... > I'm sorry but I didn't get you. > I've installed bioperl as well as bioperl-db module in my system... > Now what should be my next step to resolve this problem? > I'm sorry again, but as I told that I'm a novice in this domain. > > Thanks, > Angshu > > > On 12/9/05, Jason Stajich wrote: > > > > > > Follow the install instructions for bioperl first, you need bioperl > > to run bioperl-db. 
> > These include, set your PERL5LIB or install bioperl on your system or > > run the load script with -I PATH/TO/BIOPERL > > > > > > On Dec 9, 2005, at 7:54 PM, Angshu Kar wrote: > > > > > One thing I missed was that my Root.pm resides in a > > different > > > path...How to > > > specify that? > > > > > > On 12/9/05, Angshu Kar wrote: > > >> > > >> Thanks a lot Barry. > > >> > > >> Now I'm getting this error while tryin to run the > > >> load_seqdatabase.pl in a > > >> linux box (I used : > > >> perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228) > > >> > > >> > > >> Can't locate Bio/Root/Root.pm in @INC (@INC contains: > > >> /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 > > >> /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi > > >> /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi > > >> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi > > >> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi > > >> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi > > >> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi > > >> /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 > > >> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 > > >> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 > > >> /usr/lib/perl5/site_perl > > >> /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi > > >> /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi > > >> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi > > >> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi > > >> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi > > >> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi > > >> /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 > > >> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 > > >> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 > > >> /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7. 
> > >> BEGIN failed--compilation aborted at load_seqdatabase.pl line 7. > > >> > > >> Please guide. > > >> > > >> Thanks, > > >> Angshu > > >> > > >> On 12/9/05, Barry Moore < bmoore@genetics.utah.edu> wrote: > > >>> > > >>> Angshu- > > >>> > > >>> Make the namespace whatever you want it to be. This is useful if > > >>> you > > >>> want to load sequence from different sources into the same > > >>> database. As > > >>> for the format - you tell us what format is the file in? You > > >>> could just > > >>> let bioperl guess, but looking at the file and deciding yourself > > >>> would > > >>> be your best bet. > > >>> > > >>> Barry > > >>> > > >>>> -----Original Message----- > > >>>> From: bioperl-l-bounces@portal.open-bio.org [mailto: bioperl-l- > > >>>> bounces@portal.open-bio.org] On Behalf Of Angshu Kar > > >>>> Sent: Friday, December 09, 2005 5:22 PM > > >>>> To: Sean Davis > > >>>> Cc: bioperl-l > > >>>> Subject: Re: [Bioperl-l] loading data to biosql tables > > >>>> > > >>>> Hi Sean, > > >>>> > > >>>> A small help I need before I run the load_seqdatabase.pl. I've > > >>> downloaded > > >>>> my > > >>>> datafile which is ATH1_cds_cm_20040228 from TAIR. What's the > > >>>> namespace > > >>> > > >>> and > > >>>> format for this? 
> > >>>>
> > >>>> Thanks,
> > >>>> Angshu
> >
> > --
> > Jason Stajich
> > Duke University
> > http://www.duke.edu/~jes12
>
> --
> Jason Stajich
> Duke University
> http://www.duke.edu/~jes12

From jason.stajich at duke.edu Sat Dec 10 12:05:33 2005
From: jason.stajich at duke.edu (Jason Stajich)
Date: Sat Dec 10 12:02:58 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To: References: <3CCC9485-FF57-4C6F-81AF-3B9C9AFDF197@duke.edu> <2E2A564F-116D-497B-86B1-421CCF368437@duke.edu>
Message-ID: <6CE9663B-2666-418C-8322-AC48A7036143@duke.edu>

From the INSTALL document in the Bioperl distribution:

You can explicitly tell perl where to look for modules by using the
lib module which comes standard with perl.

Example:

    #!/usr/bin/perl

    use lib "/home/users/dag/My_Local_Perl_Modules/";
    use Bio::Seq;

    <...insert whizzy perl code here...>

Or, you can set the environmental variable PERL5LIB:

csh or tcsh:

    setenv PERL5LIB /home/users/dag/My_Local_Perl_Modules/

bash or sh:

    export PERL5LIB=/home/users/dag/My_Local_Perl_Modules/

On Dec 10, 2005, at 11:51 AM, Angshu Kar wrote:

> Yes Jason. But what I've done is, instead of putting the .pm
> and .pl files in default locations I've used the LIB and PREFIX
> arguments to place them in my local directory. This I've done for
> bioperl as well as bioperl-db modules.
> Now could you please help me in how to make perl find it?
>
> Thanks,
> Angshu
>
>
> On 12/10/05, Jason Stajich wrote:
> you have not installed it so that your perl knows to find it. Did
> you do 'make install'?
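
The PERL5LIB route from the INSTALL excerpt can be sketched for bash or sh roughly as follows. This is only a sketch under stated assumptions: the function name add_perl5lib is hypothetical, and the directory is the example path from the excerpt. Substitute the directory that actually contains the Bio/ folder (for a LIB/PREFIX install, whatever was passed as LIB).

```shell
#!/bin/sh
# Hypothetical helper (the name add_perl5lib is illustrative only):
# prepend a directory to PERL5LIB unless it is already listed.
add_perl5lib() {
    dir=$1
    case ":${PERL5LIB:-}:" in
        *":$dir:"*) ;;                                 # already present, nothing to do
        *) PERL5LIB="$dir${PERL5LIB:+:$PERL5LIB}" ;;   # prepend, keeping any old value
    esac
    export PERL5LIB
}

# Example path from the INSTALL excerpt; replace it with the directory
# that holds your Bio/ tree.
add_perl5lib /home/users/dag/My_Local_Perl_Modules
echo "PERL5LIB=$PERL5LIB"
```

With PERL5LIB exported this way, running perl -MBio::Root::Root -e 1 should exit silently if perl can now find the module. The -I switch mentioned earlier in the thread (perl -I /path/to/modules load_seqdatabase.pl ...) does the same for a single run without touching the environment.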
> > On Dec 9, 2005, at 10:21 PM, Angshu Kar wrote: > >> Thanks Jason... >> I'm sorry but I didn't get you. >> I've installed bioperl as well as bioperl-db module in my system... >> Now what should be my next step to resolve this problem? >> I'm sorry again, but as I told that I'm a novice in this domain. >> >> Thanks, >> Angshu >> >> >> On 12/9/05, Jason Stajich wrote: >> >> Follow the install instructions for bioperl first, you need bioperl >> to run bioperl-db. >> These include, set your PERL5LIB or install bioperl on your system or >> run the load script with -I PATH/TO/BIOPERL >> >> >> On Dec 9, 2005, at 7:54 PM, Angshu Kar wrote: >> >> > One thing I missed was that my Root.pm resides in a different >> > path...How to >> > specify that? >> > >> > On 12/9/05, Angshu Kar < angshu96@gmail.com> wrote: >> >> >> >> Thanks a lot Barry. >> >> >> >> Now I'm getting this error while tryin to run the >> >> load_seqdatabase.pl in a >> >> linux box (I used : >> >> perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228) >> >> >> >> >> >> Can't locate Bio/Root/Root.pm in @INC (@INC contains: >> >> /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 >> >> /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi >> >> /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi >> >> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi >> >> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi >> >> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi >> >> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi >> >> /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 >> >> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 >> >> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 >> >> /usr/lib/perl5/site_perl >> >> /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi >> >> /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi >> >> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi >> >> 
/usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi >> >> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi >> >> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi >> >> /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 >> >> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 >> >> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 >> >> /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7. >> >> BEGIN failed--compilation aborted at load_seqdatabase.pl line 7. >> >> >> >> Please guide. >> >> >> >> Thanks, >> >> Angshu >> >> >> >> On 12/9/05, Barry Moore < bmoore@genetics.utah.edu> wrote: >> >>> >> >>> Angshu- >> >>> >> >>> Make the namespace whatever you want it to be. This is useful if >> >>> you >> >>> want to load sequence from different sources into the same >> >>> database. As >> >>> for the format - you tell us what format is the file in? You >> >>> could just >> >>> let bioperl guess, but looking at the file and deciding yourself >> >>> would >> >>> be your best bet. >> >>> >> >>> Barry >> >>> >> >>>> -----Original Message----- >> >>>> From: bioperl-l-bounces@portal.open-bio.org [mailto: bioperl-l- >> >>>> bounces@portal.open-bio.org ] On Behalf Of Angshu Kar >> >>>> Sent: Friday, December 09, 2005 5:22 PM >> >>>> To: Sean Davis >> >>>> Cc: bioperl-l >> >>>> Subject: Re: [Bioperl-l] loading data to biosql tables >> >>>> >> >>>> Hi Sean, >> >>>> >> >>>> A small help I need before I run the load_seqdatabase.pl. I've >> >>> downloaded >> >>>> my >> >>>> datafile which is ATH1_cds_cm_20040228 from TAIR. What's the >> >>>> namespace >> >>> >> >>> and >> >>>> format for this? 
>> >>>> >> >>>> Thanks, >> >>>> Angshu >> >>>> >> >>>> _______________________________________________ >> >>>> Bioperl-l mailing list >> >>>> Bioperl-l@portal.open-bio.org >> >>>> http://portal.open-bio.org/mailman/listinfo/bioperl-l >> >>> >> >> >> >> >> > >> > _______________________________________________ >> > Bioperl-l mailing list >> > Bioperl-l@portal.open-bio.org >> > http://portal.open-bio.org/mailman/listinfo/bioperl-l >> >> -- >> Jason Stajich >> Duke University >> http://www.duke.edu/~jes12 >> >> >> > > -- > Jason Stajich > Duke University > http://www.duke.edu/~jes12 > > > > -- Jason Stajich Duke University http://www.duke.edu/~jes12 From angshu96 at gmail.com Sat Dec 10 12:17:09 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Sat Dec 10 12:14:33 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: <6CE9663B-2666-418C-8322-AC48A7036143@duke.edu> References: <3CCC9485-FF57-4C6F-81AF-3B9C9AFDF197@duke.edu> <2E2A564F-116D-497B-86B1-421CCF368437@duke.edu> <6CE9663B-2666-418C-8322-AC48A7036143@duke.edu> Message-ID: You are marvellous Jason...Thanks a lot for using the lingo thats for a freshie like me. I'll apply this today to the load_seqdatabase.pl and let you know if any problem arises. Thanks, Angshu On 12/10/05, Jason Stajich wrote: > > From the INSTALL document in the Bioperl distribution > > > You can explicitly tell perl where to look for modules by using the > lib module which comes standard with perl. > > > Example: > > > #!/usr/bin/perl > > > use lib "/home/users/dag/My_Local_Perl_Modules/"; > use Bio::Seq; > > > <...insert whizzy perl code here...> > > > Or, you can set the environmental variable PERL5LIB: > > > csh or tcsh: > > > setenv PERL5LIB /home/users/dag/My_Local_Perl_Modules/ > > bash or sh: > > > export PERL5LIB=/home/users/dag/My_Local_Perl_Modules/ > > > > On Dec 10, 2005, at 11:51 AM, Angshu Kar wrote: > > Yes Jason. 
But what I've done is, instead of putting the .pm and .pl > files in default locations I've used the LIB and PREFIX arguments to > place them in my local directory. This I've done for bioperl as well as > bioperl-db modules. > Now could you please help me in how to make perl find it? > > Thanks, > Angshu > > > On 12/10/05, Jason Stajich wrote: > > > > you have not installed it so that your perl knows to find it. Did you > > do 'make install'? > > On Dec 9, 2005, at 10:21 PM, Angshu Kar wrote: > > > > Thanks Jason... > > I'm sorry but I didn't get you. > > I've installed bioperl as well as bioperl-db module in my system... > > Now what should be my next step to resolve this problem? > > I'm sorry again, but as I told that I'm a novice in this domain. > > > > Thanks, > > Angshu > > > > > > On 12/9/05, Jason Stajich wrote: > > > > > > > > > Follow the install instructions for bioperl first, you need bioperl > > > to run bioperl-db. > > > These include, set your PERL5LIB or install bioperl on your system or > > > run the load script with -I PATH/TO/BIOPERL > > > > > > > > > On Dec 9, 2005, at 7:54 PM, Angshu Kar wrote: > > > > > > > One thing I missed was that my Root.pm resides in > > > a different > > > > path...How to > > > > specify that? > > > > > > > > On 12/9/05, Angshu Kar < angshu96@gmail.com> wrote: > > > >> > > > >> Thanks a lot Barry. 
> > > >> > > > >> Now I'm getting this error while tryin to run the > > > >> load_seqdatabase.pl in a > > > >> linux box (I used : > > > >> perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228) > > > >> > > > >> > > > >> Can't locate Bio/Root/Root.pm in @INC (@INC contains: > > > >> /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 > > > >> /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi > > > >> /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi > > > >> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi > > > >> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi > > > >> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi > > > >> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi > > > >> /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 > > > >> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 > > > >> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 > > > >> /usr/lib/perl5/site_perl > > > >> /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi > > > >> /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi > > > >> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi > > > >> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi > > > >> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi > > > >> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi > > > >> /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 > > > >> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 > > > >> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 > > > >> /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7. > > > >> BEGIN failed--compilation aborted at load_seqdatabase.pl line 7. > > > >> > > > >> Please guide. > > > >> > > > >> Thanks, > > > >> Angshu > > > >> > > > >> On 12/9/05, Barry Moore < bmoore@genetics.utah.edu> wrote: > > > >>> > > > >>> Angshu- > > > >>> > > > >>> Make the namespace whatever you want it to be. 
This is useful if > > > >>> you > > > >>> want to load sequence from different sources into the same > > > >>> database. As > > > >>> for the format - you tell us what format is the file in? You > > > >>> could just > > > >>> let bioperl guess, but looking at the file and deciding yourself > > > >>> would > > > >>> be your best bet. > > > >>> > > > >>> Barry > > > >>> > > > >>>> -----Original Message----- > > > >>>> From: bioperl-l-bounces@portal.open-bio.org [mailto: bioperl-l- > > > >>>> bounces@portal.open-bio.org ] On Behalf Of Angshu Kar > > > >>>> Sent: Friday, December 09, 2005 5:22 PM > > > >>>> To: Sean Davis > > > >>>> Cc: bioperl-l > > > >>>> Subject: Re: [Bioperl-l] loading data to biosql tables > > > >>>> > > > >>>> Hi Sean, > > > >>>> > > > >>>> A small help I need before I run the load_seqdatabase.pl. I've > > > >>> downloaded > > > >>>> my > > > >>>> datafile which is ATH1_cds_cm_20040228 from TAIR. What's the > > > >>>> namespace > > > >>> > > > >>> and > > > >>>> format for this? 
> > > >>>>
> > > >>>> Thanks,
> > > >>>> Angshu
> > >
> > > --
> > > Jason Stajich
> > > Duke University
> > > http://www.duke.edu/~jes12
> >
> > --
> > Jason Stajich
> > Duke University
> > http://www.duke.edu/~jes12
>
> --
> Jason Stajich
> Duke University
> http://www.duke.edu/~jes12

From jason.stajich at duke.edu Sat Dec 10 13:11:57 2005
From: jason.stajich at duke.edu (Jason Stajich)
Date: Sat Dec 10 13:09:23 2005
Subject: [Bioperl-l] HOWTO: take a slice of a split location
In-Reply-To: References:
Message-ID:

Hi Malcolm -

Don't have a chance to look at your code, but my approach to this
problem would be to first splice the sequence out from the genome

    my $feature = Bio::SeqFeature::Generic->new(-location => $splitlocation);
    my $cdsseq = $feature->spliced_seq;

then just retrieve the last 1000 bases of this sequence.

    my $threeprime = $cdsseq->subseq($cdsseq->length - 999, $cdsseq->length);

(subseq coordinates are 1-based and inclusive, hence 999 rather than 1000)

There is also a module to map between coordinates -
Bio::Coordinate::GeneMapper if you need to go from transcript to
genomic coordinates.

-jason

On Dec 10, 2005, at 2:06 AM, Cook, Malcolm wrote:

> Fellow Bioperlers,
>
> I was in need of extracting the 3'-most 1000 bp from multiple
> genomic CDS regions (designing 70mer u-array probes).
>
> I looked in vain for Bio::Location->splice($from,$to);
>
> So I wrote one which works but suffers from actually materializing
> the list of integer indices into the sequence for every base.
> > Has anyone a better approach they'd care to share? > > Malcolm Cook - mec@stowers-institute.org > Stowers Institute for Medical Research - Kansas City, MO USA > > P.S. Here' what I wrote: > > package Bio::LocationI; # Code in the interface so it works > # with both ::Split and ::Simple > # Bio::Locations > > sub _intspans { > # Purpose: for a (presumably) monotonically increasing list of > # integers, return list of arrays each holding min and max of > # the list's internal contiguous spans. > # > # Example: 1..5,10..20,30 => ([1,5],[10,20],[30,30]) > my @i = @_; > die "nothing passed to intspans" unless @i; > my @s = ([$i[0],shift(@i)]); > foreach (@i) { > if ($_ == 1 + $s[0][1]) { > $s[0][1] = $_; > } else { > unshift @s, [$_, $_] > }} > reverse @s; > } > > sub slice { > # Purpose: compute a slice of the Location, using perls normal slice > # semantics, expect that it trims out of range values. > my ($self, $from, $to) = @_; > my @int = eval (join ',', map {$_->start . '..' . $_->end} $self- > >each_Location); # build perl expression using the range (..) and > list (,) operators. > @int = @int[$from..$to]; > @int = grep {$_} @int; # Removing undefs (in case $from/$to out > of bounds). 
> my @intspans = _intspans(@int);
> new Bio::Location::Split (-strand => $self->strand,
> -locations => [map {new Bio::Location::Simple(-start => $_->[0],
> -end => $_->[1],
> -strand => $self->strand,
> )
> } @intspans],
> );
> }
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

--
Jason Stajich
Duke University
http://www.duke.edu/~jes12

From angshu96 at gmail.com Sat Dec 10 17:59:19 2005
From: angshu96 at gmail.com (Angshu Kar)
Date: Sat Dec 10 18:21:04 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To: References: <3CCC9485-FF57-4C6F-81AF-3B9C9AFDF197@duke.edu> <2E2A564F-116D-497B-86B1-421CCF368437@duke.edu> <6CE9663B-2666-418C-8322-AC48A7036143@duke.edu>
Message-ID:

Hi,

Now I'm getting this new error:

Can't locate Bio/DB/BioDB.pm in @INC (@INC contains:
/home/akar/local/perl//i386-linux-thread-multi /home/akar/local/perl/
/usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5
/usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4
/usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2
/usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0
/usr/lib/perl5/site_perl
/usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4
/usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2
/usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0
/usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 8.
BEGIN failed--compilation aborted at load_seqdatabase.pl line 8.

I've checked the Bio/DB/ folder but it doesn't contain the BioDB.pm module - it contains the following:

Ace.pm       DBFetch.pm    Flat        GFF               MeSH.pm           RefSeq.pm     Taxonomy.pm        XEMBLService.pm
Biblio       EMBL.pm       Flat.pm     GFF.pm            NCBIHelper.pm     Registry.pm   Universal.pm
BiblioI.pm   Failover.pm   GDB.pm      InMemoryCache.pm  Query             SeqI.pm       UpdateableSeqI.pm
BioFetch.pm  Fasta.pm      GenBank.pm  Makefile.PL       QueryI.pm         SwissProt.pm  WebDBSeqI.pm
CUTG.pm      FileCache.pm  GenPept.pm  MANIFEST          RandomAccessI.pm  Taxonomy      XEMBL.pm

Have I installed some wrong version of bioperl-db? (I've used http://bio.perl.org/Core/Latest/index.shtml)

Could anyone please let me know what I've missed?

Thanks,
Angshu

On 12/10/05, Angshu Kar wrote:
>
> You are marvellous Jason...Thanks a lot for using the lingo thats for a
> freshie like me.
> I'll apply this today to the load_seqdatabase.pl and let you know if any
> problem arises.
>
> Thanks,
> Angshu
>
>
> On 12/10/05, Jason Stajich wrote:
> >
> > From the INSTALL document in the Bioperl distribution
> >
> >
> > You can explicitly tell perl where to look for modules by using the
> > lib module which comes standard with perl.
> >
> >
> > Example:
> >
> >
> > #!/usr/bin/perl
> >
> >
> > use lib "/home/users/dag/My_Local_Perl_Modules/";
> > use Bio::Seq;
> >
> >
> > <...insert whizzy perl code here...>
> >
> >
> > Or, you can set the environmental variable PERL5LIB:
> >
> >
> > csh or tcsh:
> >
> >
> > setenv PERL5LIB /home/users/dag/My_Local_Perl_Modules/
> >
> > bash or sh:
> >
> >
> > export PERL5LIB=/home/users/dag/My_Local_Perl_Modules/
> >
> >
> >
> > On Dec 10, 2005, at 11:51 AM, Angshu Kar wrote:
> >
> > Yes Jason.
But what I've done is, instead of putting the .pm and .pl > > files in default locations I've used the LIB and PREFIX arguments to > > place them in my local directory. This I've done for bioperl as well as > > bioperl-db modules. > > Now could you please help me in how to make perl find it? > > > > Thanks, > > Angshu > > > > > > On 12/10/05, Jason Stajich wrote: > > > > > > you have not installed it so that your perl knows to find it. Did you > > > do 'make install'? > > > On Dec 9, 2005, at 10:21 PM, Angshu Kar wrote: > > > > > > Thanks Jason... > > > I'm sorry but I didn't get you. > > > I've installed bioperl as well as bioperl-db module in my system... > > > Now what should be my next step to resolve this problem? > > > I'm sorry again, but as I told that I'm a novice in this domain. > > > > > > Thanks, > > > Angshu > > > > > > > > > On 12/9/05, Jason Stajich wrote: > > > > > > > > > > > > Follow the install instructions for bioperl first, you need bioperl > > > > to run bioperl-db. > > > > These include, set your PERL5LIB or install bioperl on your system > > > > or > > > > run the load script with -I PATH/TO/BIOPERL > > > > > > > > > > > > On Dec 9, 2005, at 7:54 PM, Angshu Kar wrote: > > > > > > > > > One thing I missed was that my Root.pm resides > > > > in a different > > > > > path...How to > > > > > specify that? > > > > > > > > > > On 12/9/05, Angshu Kar < angshu96@gmail.com> wrote: > > > > >> > > > > >> Thanks a lot Barry. 
> > > > >> > > > > >> Now I'm getting this error while tryin to run the > > > > >> load_seqdatabase.pl in a > > > > >> linux box (I used : > > > > >> perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228) > > > > >> > > > > >> > > > > >> Can't locate Bio/Root/Root.pm in @INC (@INC contains: > > > > >> /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 > > > > > > > > >> /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi > > > > >> /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi > > > > >> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi > > > > >> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi > > > > >> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi > > > > >> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi > > > > >> /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 > > > > >> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 > > > > >> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 > > > > >> /usr/lib/perl5/site_perl > > > > >> /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi > > > > >> /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi > > > > >> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi > > > > >> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi > > > > >> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi > > > > >> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi > > > > >> /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 > > > > >> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 > > > > > > > > >> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 > > > > > > > > >> /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7. > > > > >> BEGIN failed--compilation aborted at load_seqdatabase.pl line 7. > > > > >> > > > > >> Please guide. 
> > > > >> > > > > >> Thanks, > > > > >> Angshu > > > > >> > > > > >> On 12/9/05, Barry Moore < bmoore@genetics.utah.edu> wrote: > > > > >>> > > > > >>> Angshu- > > > > >>> > > > > >>> Make the namespace whatever you want it to be. This is useful > > > > if > > > > >>> you > > > > >>> want to load sequence from different sources into the same > > > > >>> database. As > > > > >>> for the format - you tell us what format is the file in? You > > > > >>> could just > > > > >>> let bioperl guess, but looking at the file and deciding yourself > > > > >>> would > > > > >>> be your best bet. > > > > >>> > > > > >>> Barry > > > > >>> > > > > >>>> -----Original Message----- > > > > >>>> From: bioperl-l-bounces@portal.open-bio.org [mailto: bioperl-l- > > > > >>>> bounces@portal.open-bio.org ] On Behalf Of Angshu Kar > > > > >>>> Sent: Friday, December 09, 2005 5:22 PM > > > > >>>> To: Sean Davis > > > > >>>> Cc: bioperl-l > > > > >>>> Subject: Re: [Bioperl-l] loading data to biosql tables > > > > >>>> > > > > >>>> Hi Sean, > > > > >>>> > > > > >>>> A small help I need before I run the load_seqdatabase.pl. I've > > > > >>> downloaded > > > > >>>> my > > > > >>>> datafile which is ATH1_cds_cm_20040228 from TAIR. What's the > > > > >>>> namespace > > > > >>> > > > > >>> and > > > > >>>> format for this? 
> > > > >>>> > > > > >>>> Thanks, > > > > >>>> Angshu > > > > >>>> > > > > >>>> _______________________________________________ > > > > >>>> Bioperl-l mailing list > > > > >>>> Bioperl-l@portal.open-bio.org > > > > >>>> http://portal.open-bio.org/mailman/listinfo/bioperl-l > > > > >>> > > > > >> > > > > >> > > > > > > > > > > _______________________________________________ > > > > > Bioperl-l mailing list > > > > > Bioperl-l@portal.open-bio.org > > > > > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > > > > > > > -- > > > > Jason Stajich > > > > Duke University > > > > http://www.duke.edu/~jes12 > > > > > > > > > > > > > > > > > > -- > > > Jason Stajich > > > Duke University > > > http://www.duke.edu/~jes12 > > > > > > > > > > > > > > > > > > -- > > Jason Stajich > > Duke University > > http://www.duke.edu/~jes12 > > > > > > > > > > From hlapp at gmx.net Sat Dec 10 21:34:36 2005 From: hlapp at gmx.net (Hilmar Lapp) Date: Sat Dec 10 21:32:20 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: References: <3e86c394e54677a898eab8de11d8e002@gmx.net> <2725905b99cf48d0d0740376a1156082@gmx.net> Message-ID: <0a732bf7655c118d12c4c548d2897aea@gmx.net> You should download the current biosql-schema from CVS. If my recollection is correct then you are on Postgres, so you will want to run the file biosqldb-pg.sql through psql. You do not need the other Postgres related files there. The rest of the repository, aside from support for other RDBMSs (mysql, Oracle, and HSQL), contains documentation, an ERD of the schema, and the script for loading the NCBI taxonomy database. -hilmar On Dec 9, 2005, at 1:58 PM, Angshu Kar wrote: > Hi Hilmar, > > I'm obliged that you showed me the light. I reanalyzed the schema and > found that its no use working with a truncated version of > biosql-schema and now I'm planning to install the entire schema. Could > you please let me know where can I find the script for that for a Pg > db? > > Thank you so much. 
Else I would have to face a lots of problem in the > later half of my project. > > Gratefully, > Angshu > > On 12/9/05, Hilmar Lapp wrote: Angshu, > load_seqdatabase.pl is a script that utilizes the language >> binding library bioperl-db to load sequences and annotation into >> Biosql. The object-relational mapping code is all over bioperl-db. >> >> I'm sorry, but if you believe it is worth fiddling with that >> object-relational code to 'save' instantiating a few more tables then >> you're welcome to do so but you're essentially on your own. >> >> Also, if you'd like to work with a schema the language binding of >> which >> allows to arbitrarily drop tables from the schema and still work then >> Biosql/bioperl-db may not be for you. >> >> -hilmar >> >> On Dec 9, 2005, at 7:10 AM, Angshu Kar wrote: >> >> > Hi Hilmar, >> > >> > In the load_seqdatabase.pl script could you please tell me where you >> > are inserting the data into the db tables, so that I can try and >> > modify that part to insert data only to the tables that I need ? >> > >> > Thanks, >> > Angshu >> > >> > >> > On 12/8/05, Hilmar Lapp wrote: Any reason you didn't >> > instantiate the rest of the schema? Any scripts >> >> and software that have been written against BioSQL will certainly >> >> expect the rest of the schema be present ... >> >> >> >> Bioperl-db is the BioSQL language binding for Bioperl, so that's >> what >> >> you will want to use. It comes with a script load_seqdatabase.pl to >> >> load any format supported by Bioperl. >> >> >> >> However, bioperl-db does expect all of Biosql to be present ...
>> >> >> >> -hilmar >> >> >> >> On Dec 7, 2005, at 6:12 PM, Angshu Kar wrote: >> >> >> >> > Hi, >> >> > >> >> > I've created 5 tables (taxon, taxon name, bioentry, biosequence, >> >> > biodatabase) in my postgresql database (linux box) using the >> biosql >> >> > schema >> >> > ddl from >> >> > >> >> http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/ >> >> sql/ >> >> > >> >> biosqldb-pg.sql?rev=1.29&cvsroot=biosql&content-type=text/ >> >> vnd.viewcvs- >> >> > markup >> >> >. >> >> > Now I want to load the tables with arabidopsis data. Could you >> >> please >> >> > let me >> >> > know where can I find such scripts for pgsql? And also I find at >> >> > http://bio.perl.org/Core/Latest/index.shtml that the DB module >> has >> >> not >> >> > been >> >> > updated since 2001. Do I need to install that? Or are there some >> new >> >> > releases? >> >> > >> >> > I'll be obliged if you can guide. >> >> > >> >> > Thanks, >> >> > Angshu >> >> > >> >> > _______________________________________________ >> >> > Bioperl-l mailing list >> >> > Bioperl-l@portal.open-bio.org >> >> > http://portal.open-bio.org/mailman/listinfo/bioperl-l >> >> > >> >> > >> >> -- >> >> ------------------------------------------------------------- >> >> Hilmar Lappemail: lapp at gnf.org >> >> GNF, San Diego, Ca. 92121phone: +1-858-812-1757 >> >> ------------------------------------------------------------- >> >> >> >> >> >> >> -- >> ------------------------------------------------------------- >> Hilmar Lapp????????????????????????????email: lapp at gnf.org >> GNF, San Diego, Ca. 92121??????????????phone: +1-858-812-1757 >> ------------------------------------------------------------- >> >> >> -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 
92121 phone: +1-858-812-1757 ------------------------------------------------------------- From angshu96 at gmail.com Sat Dec 10 21:38:12 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Sat Dec 10 21:42:46 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: <0a732bf7655c118d12c4c548d2897aea@gmx.net> References: <3e86c394e54677a898eab8de11d8e002@gmx.net> <2725905b99cf48d0d0740376a1156082@gmx.net> <0a732bf7655c118d12c4c548d2897aea@gmx.net> Message-ID: Hi Hilmar, I've run the biosqldb-pg.sql through psql successfully. It has created 28 tables. :( Are there any other possibilities? Thanks, Angshu On 12/10/05, Hilmar Lapp wrote: > > You should download the current biosql-schema from CVS. If my > recollection is correct then you are on Postgres, so you will want to > run the file biosqldb-pg.sql through psql. You do not need the other > Postgres related files there. The rest of the repository, aside from > support for other RDBMSs (mysql, Oracle, and HSQL), contains > documentation, an ERD of the schema, and the script for loading the > NCBI taxonomy database. > > -hilmar > > On Dec 9, 2005, at 1:58 PM, Angshu Kar wrote: > > > Hi Hilmar, > > > > I'm obliged that you showed me the light. I reanalyzed the schema and > > found that its no use working with a truncated version of > > biosql-schema and now I'm planning to install the entire schema. Could > > you please let me know where can I find the script for that for a Pg > > db? > > > > Thank you so much. Else I would have to face a lots of problem in the > > later half of my project. > > > > Gratefully, > > Angshu > > > > On 12/9/05, Hilmar Lapp wrote:Angshu, > > load_seqdatabase.pl is a script that utilizes the language > >> binding library bioperl-db to load sequences and annotation into > >> Biosql. The object-relational mapping code is all over bioperl-db. 
> >> > >> I'm sorry, but if you believe it is worth fiddling with that > >> object-relational code to 'save' instantiating a few more tables then > >> you're welcome to do so but you're essentially on your own. > >> > >> Also, if you'd like to work with a schema the language binding of > >> which > >> allows to arbitrarily drop tables from the schema and still work then > >> Biosql/bioperl-db may not be for you. > >> > >> -hilmar > >> > >> On Dec 9, 2005, at 7:10 AM, Angshu Kar wrote: > >> > >> > Hi Hilmar, > >> > > >> > In the load_seqdatabase.pl script could you please tell me where you > >> > are inserting the data into the db tables, so that I can try and > >> > modify that part to insert data only to the tables that I need ? > >> > > >> > Thanks, > >> > Angshu > >> > > >> > > >> > On 12/8/05, Hilmar Lapp wrote: Any reason you didn't > >> > instantiate the rest of the schema? Any scripts > >> >> and software that have been written against BioSQL will certainly > >> >> expect the rest of the schema be present ... > >> >> > >> >> Bioperl-db is the BioSQL language binding for Bioperl, so that's > >> what > >> >> you will want to use. It comes with a script load_seqdatabase.pl to > >> >> load any format supported by Bioperl. > >> >> > >> >> However, bioperl-db does expect all of Biosql to be present ... > >> >> > >> >> -hilmar > >> >> > >> >> On Dec 7, 2005, at 6:12 PM, Angshu Kar wrote: > >> >> > >> >> > Hi, > >> >> > > >> >> > I've created 5 tables (taxon, taxon name, bioentry, biosequence, > >> >> > biodatabase) in my postgresql database (linux box) using the > >> biosql > >> >> > schema > >> >> > ddl from > >> >> > > >> >> http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/ > >> >> sql/ > >> >> > > >> >> biosqldb-pg.sql?rev=1.29&cvsroot=biosql&content-type=text/ > >> >> vnd.viewcvs- > >> >> > markup > >> >> >. > >> >> > Now I want to load the tables with arabidopsis data. 
Could you > >> >> please > >> >> > let me > >> >> > know where can I find such scripts for pgsql? And also I find at > >> >> > http://bio.perl.org/Core/Latest/index.shtml that the DB module > >> has > >> >> not > >> >> > been > >> >> > updated since 2001. Do I need to install that? Or are there some > >> new > >> >> > releases? > >> >> > > >> >> > I'll be obliged if you can guide. > >> >> > > >> >> > Thanks, > >> >> > Angshu > >> >> > > >> >> > _______________________________________________ > >> >> > Bioperl-l mailing list > >> >> > Bioperl-l@portal.open-bio.org > >> >> > http://portal.open-bio.org/mailman/listinfo/bioperl-l > >> >> > > >> >> > > >> >> -- > >> >> ------------------------------------------------------------- > >> >> Hilmar Lapp email: lapp at gnf.org > >> >> GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 > >> >> ------------------------------------------------------------- > >> >> > >> >> > >> >> > >> -- > >> ------------------------------------------------------------- > >> Hilmar Lapp email: lapp at gnf.org > >> GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 > >> ------------------------------------------------------------- > >> > >> > >> > -- > ------------------------------------------------------------- > Hilmar Lapp email: lapp at gnf.org > GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 > ------------------------------------------------------------- > > >
From hlapp at gmx.net Sat Dec 10 21:48:32 2005
From: hlapp at gmx.net (Hilmar Lapp)
Date: Sat Dec 10 21:45:52 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To:
References: <3e86c394e54677a898eab8de11d8e002@gmx.net> <2725905b99cf48d0d0740376a1156082@gmx.net> <0a732bf7655c118d12c4c548d2897aea@gmx.net>
Message-ID: <5955ab1684c84ab9e31035685c9d6aa8@gmx.net>

On Dec 10, 2005, at 6:38 PM, Angshu Kar wrote:
> Hi Hilmar,
>
> I've run the biosqldb-pg.sql through psql successfully. It has created
> 28 tables. :(

That sounds good. Why are you unhappy about it?
> Are there any other possibilities? To achieve what? -hilmar > > Thanks, > Angshu > > > On 12/10/05, Hilmar Lapp wrote:You should download the > current biosql-schema from CVS. If my >> recollection is correct then you are on Postgres, so you will want to >> run the file biosqldb-pg.sql through psql. You do not need the other >> Postgres related files there. The rest of the repository, aside from >> support for other RDBMSs (mysql, Oracle, and HSQL), contains >> documentation, an ERD of the schema, and the script for loading the >> NCBI taxonomy database. >> >> ????????-hilmar >> >> On Dec 9, 2005, at 1:58 PM, Angshu Kar wrote: >> >> > Hi Hilmar, >> > >> > I'm obliged that you showed me the light. I reanalyzed the schema >> and >> > found that its no use working with a truncated version of >> > biosql-schema and now I'm planning to install the entire schema. >> Could >> > you please let me know where can I find the script for that for a Pg >> > db? >> > >> > Thank you so much. Else I would have to face a lots of problem in >> the >> > later half of my project. >> > >> > Gratefully, >> > Angshu >> > >> > On 12/9/05, Hilmar Lapp wrote:Angshu, >> > load_seqdatabase.pl is a script that utilizes the language >> >> binding library bioperl-db to load sequences and annotation into >> >> Biosql. The object-relational mapping code is all over bioperl-db. >> >> >> >> I'm sorry, but if you believe it is worth fiddling with that >> >> object-relational code to 'save' instantiating a few more tables >> then >> >> you're welcome to do so but you're essentially on your own. >> >> >> >> Also, if you'd like to work with a schema the language binding of >> >> which >> >> allows to arbitrarily drop tables from the schema and still work >> then >> >> Biosql/bioperl-db may not be for you. 
>> >> >> >> -hilmar >> >> >> >> On Dec 9, 2005, at 7:10 AM, Angshu Kar wrote: >> >> >> >> > Hi Hilmar, >> >> > >> >> > In the load_seqdatabase.pl script could you please tell me where >> you >> >> > are inserting the data into the db tables, so that I can try and >> >> > modify that part to insert data only to the tables that I need ? >> >> > >> >> > Thanks, >> >> > Angshu >> >> > >> >> > >> >> > On 12/8/05, Hilmar Lapp wrote: Any reason you >> didn't >> >> > instantiate the rest of the schema? Any scripts >> >> >> and software that have been written against BioSQL will >> certainly >> >> >> expect the rest of the schema be present ... >> >> >> >> >> >> Bioperl-db is the BioSQL language binding for Bioperl, so that's >> >> what >> >> >> you will want to use. It comes with a script >> load_seqdatabase.pl to >> >> >> load any format supported by Bioperl. >> >> >> >> >> >> However, bioperl-db does expect all of Biosql to be present ... >> >>??>> >> >> >> -hilmar >> >> >> >> >> >> On Dec 7, 2005, at 6:12 PM, Angshu Kar wrote: >> >> >> >> >> >> > Hi, >> >> >> > >> >> >> > I've created 5 tables (taxon, taxon name, bioentry, >> biosequence, >> >> >> > biodatabase) in my postgresql database (linux box) using the >> >> biosql >> >> >> > schema >> >> >> > ddl from >> >> >> > >> >> >> >> http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/ >> >> >> sql/ >> >> >> > >> >> >> biosqldb-pg.sql?rev=1.29&cvsroot=biosql&content-type=text/ >> >> >> vnd.viewcvs- >> >> >> > markup >> >> >> >. >> >> >> > Now I want to load the tables with arabidopsis data. Could you >> >> >> please >> >> >> > let me >> >> >> > know where can I find such scripts for pgsql? And also I find >> at >> >> >> > http://bio.perl.org/Core/Latest/index.shtml that the DB module >> >> has >> >> >> not >> >> >> > been >> >> >> > updated since 2001. Do I need to install that? Or are there >> some >> >> new >> >> >> > releases? >> >> >> > >> >> >> > I'll be obliged if you can guide. 
>> >> >> > >> >> >> > Thanks, >> >> >> > Angshu >> >> >> > >> >> >> > _______________________________________________ >> >> >> > Bioperl-l mailing list >> >> >> > Bioperl-l@portal.open-bio.org >> >> >> > http://portal.open-bio.org/mailman/listinfo/bioperl-l >> >> >> > >> >> >> > >> >> >> -- >> >> >> ------------------------------------------------------------- >> >> >> Hilmar Lappemail: lapp at gnf.org >> >> >> GNF, San Diego, Ca. 92121phone: +1-858-812-1757 >> >> >> ------------------------------------------------------------- >> >> >> >> >> >> >> >> >> >> >> -- >> >> ------------------------------------------------------------- >> >> Hilmar Lappemail: lapp at gnf.org >> >> GNF, San Diego, Ca. 92121phone: +1-858-812-1757 >> >> ------------------------------------------------------------- >> >> >> >> >> >> >> -- >> ------------------------------------------------------------- >> Hilmar Lapp????????????????????????????email: lapp at gnf.org >> GNF, San Diego, Ca. 92121??????????????phone: +1-858-812-1757 >> ------------------------------------------------------------- >> >> >> -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 ------------------------------------------------------------- From angshu96 at gmail.com Sat Dec 10 21:51:53 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Sat Dec 10 21:49:18 2005 Subject: [Bioperl-l] loading data to biosql tables In-Reply-To: <5955ab1684c84ab9e31035685c9d6aa8@gmx.net> References: <3e86c394e54677a898eab8de11d8e002@gmx.net> <2725905b99cf48d0d0740376a1156082@gmx.net> <0a732bf7655c118d12c4c548d2897aea@gmx.net> <5955ab1684c84ab9e31035685c9d6aa8@gmx.net> Message-ID: Oh...may be u missed my last mail... I'm sedning it again... 
Now I'm getting this new error: Can't locate Bio/DB/BioDB.pm in @INC (@INC contains: /home/akar/local/perl//i386-linux-thread-multi /home/akar/local/perl/ /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 /usr/lib/perl5/site_perl /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 8. BEGIN failed--compilation aborted at load_seqdatabase.pl line 8. I've checked the Bio/DB/ folder but it doesn't contain the BioDB.pm module - it contains the following: Ace.pm DBFetch.pm Flat GFF MeSH.pm RefSeq.pm Taxonomy.pm XEMBLService.pm Biblio EMBL.pm Flat.pm GFF.pm NCBIHelper.pm Registry.pm Universal.pm BiblioI.pm Failover.pm GDB.pm InMemoryCache.pm Query SeqI.pm UpdateableSeqI.pm BioFetch.pm Fasta.pm GenBank.pm Makefile.PL QueryI.pm SwissProt.pm WebDBSeqI.pm CUTG.pm FileCache.pm GenPept.pm MANIFEST RandomAccessI.pm Taxonomy XEMBL.pm Have I installed some wrong version of bioperl-db ? 
(I've used http://bio.perl.org/Core/Latest/index.shtml) Thanks, Angshu On 12/10/05, Hilmar Lapp wrote: > > > On Dec 10, 2005, at 6:38 PM, Angshu Kar wrote: > > > Hi Hilmar, > > > > I've run the biosqldb-pg.sql through psql successfully. It has created > > 28 tables. :( > > That sounds good. Why are you unhappy about it? > > > Are there any other possibilities? > > To achieve what? > > -hilmar > > > > > Thanks, > > Angshu > > > > > > On 12/10/05, Hilmar Lapp wrote:You should download the > > current biosql-schema from CVS. If my > >> recollection is correct then you are on Postgres, so you will want to > >> run the file biosqldb-pg.sql through psql. You do not need the other > >> Postgres related files there. The rest of the repository, aside from > >> support for other RDBMSs (mysql, Oracle, and HSQL), contains > >> documentation, an ERD of the schema, and the script for loading the > >> NCBI taxonomy database. > >> > >> -hilmar > >> > >> On Dec 9, 2005, at 1:58 PM, Angshu Kar wrote: > >> > >> > Hi Hilmar, > >> > > >> > I'm obliged that you showed me the light. I reanalyzed the schema > >> and > >> > found that its no use working with a truncated version of > >> > biosql-schema and now I'm planning to install the entire schema. > >> Could > >> > you please let me know where can I find the script for that for a Pg > >> > db? > >> > > >> > Thank you so much. Else I would have to face a lots of problem in > >> the > >> > later half of my project. > >> > > >> > Gratefully, > >> > Angshu > >> > > >> > On 12/9/05, Hilmar Lapp wrote:Angshu, > >> > load_seqdatabase.pl is a script that utilizes the language > >> >> binding library bioperl-db to load sequences and annotation into > >> >> Biosql. The object-relational mapping code is all over bioperl-db. 
> >> >> > >> >> I'm sorry, but if you believe it is worth fiddling with that > >> >> object-relational code to 'save' instantiating a few more tables > >> then > >> >> you're welcome to do so but you're essentially on your own. > >> >> > >> >> Also, if you'd like to work with a schema the language binding of > >> >> which > >> >> allows to arbitrarily drop tables from the schema and still work > >> then > >> >> Biosql/bioperl-db may not be for you. > >> >> > >> >> -hilmar > >> >> > >> >> On Dec 9, 2005, at 7:10 AM, Angshu Kar wrote: > >> >> > >> >> > Hi Hilmar, > >> >> > > >> >> > In the load_seqdatabase.pl script could you please tell me where > >> you > >> >> > are inserting the data into the db tables, so that I can try and > >> >> > modify that part to insert data only to the tables that I need ? > >> >> > > >> >> > Thanks, > >> >> > Angshu > >> >> > > >> >> > > >> >> > On 12/8/05, Hilmar Lapp wrote: Any reason you > >> didn't > >> >> > instantiate the rest of the schema? Any scripts > >> >> >> and software that have been written against BioSQL will > >> certainly > >> >> >> expect the rest of the schema be present ... > >> >> >> > >> >> >> Bioperl-db is the BioSQL language binding for Bioperl, so that's > >> >> what > >> >> >> you will want to use. It comes with a script > >> load_seqdatabase.pl to > >> >> >> load any format supported by Bioperl. > >> >> >> > >> >> >> However, bioperl-db does expect all of Biosql to be present ... 
> >> >>>> > >> >> >> -hilmar > >> >> >> > >> >> >> On Dec 7, 2005, at 6:12 PM, Angshu Kar wrote: > >> >> >> > >> >> >> > Hi, > >> >> >> > > >> >> >> > I've created 5 tables (taxon, taxon name, bioentry, > >> biosequence, > >> >> >> > biodatabase) in my postgresql database (linux box) using the > >> >> biosql > >> >> >> > schema > >> >> >> > ddl from > >> >> >> > > >> >> >> > >> http://cvs.open-bio.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/ > >> >> >> sql/ > >> >> >> > > >> >> >> biosqldb-pg.sql?rev=1.29&cvsroot=biosql&content-type=text/ > >> >> >> vnd.viewcvs- > >> >> >> > markup > >> >> >> >. > >> >> >> > Now I want to load the tables with arabidopsis data. Could you > >> >> >> please > >> >> >> > let me > >> >> >> > know where can I find such scripts for pgsql? And also I find > >> at > >> >> >> > http://bio.perl.org/Core/Latest/index.shtml that the DB module > >> >> has > >> >> >> not > >> >> >> > been > >> >> >> > updated since 2001. Do I need to install that? Or are there > >> some > >> >> new > >> >> >> > releases? > >> >> >> > > >> >> >> > I'll be obliged if you can guide. > >> >> >> > > >> >> >> > Thanks, > >> >> >> > Angshu > >> >> >> > > >> >> >> > _______________________________________________ > >> >> >> > Bioperl-l mailing list > >> >> >> > Bioperl-l@portal.open-bio.org > >> >> >> > http://portal.open-bio.org/mailman/listinfo/bioperl-l > >> >> >> > > >> >> >> > > >> >> >> -- > >> >> >> ------------------------------------------------------------- > >> >> >> Hilmar Lappemail: lapp at gnf.org > >> >> >> GNF, San Diego, Ca. 92121phone: +1-858-812-1757 > >> >> >> ------------------------------------------------------------- > >> >> >> > >> >> >> > >> >> >> > >> >> -- > >> >> ------------------------------------------------------------- > >> >> Hilmar Lappemail: lapp at gnf.org > >> >> GNF, San Diego, Ca. 
92121 phone: +1-858-812-1757 > >> >> ------------------------------------------------------------- > >> >> > >> >> > >> >> > >> -- > >> ------------------------------------------------------------- > >> Hilmar Lapp email: lapp at gnf.org > >> GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 > >> ------------------------------------------------------------- > >> > >> > >> > -- > ------------------------------------------------------------- > Hilmar Lapp email: lapp at gnf.org > GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 > ------------------------------------------------------------- > > >
From hlapp at gmx.net Sat Dec 10 21:51:57 2005
From: hlapp at gmx.net (Hilmar Lapp)
Date: Sat Dec 10 21:49:22 2005
Subject: [Bioperl-l] loading data to biosql tables
In-Reply-To:
References: <3CCC9485-FF57-4C6F-81AF-3B9C9AFDF197@duke.edu> <2E2A564F-116D-497B-86B1-421CCF368437@duke.edu> <6CE9663B-2666-418C-8322-AC48A7036143@duke.edu>
Message-ID:

Well, this means that the place where you installed Bioperl-db is different from the one where you installed Bioperl, and you didn't include it in your PERL5LIB setting.
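PERL5LIB is a colon-separated list of directories, so one way to see which entry (if any) supplies a given module file is a small shell check. The following is only a sketch under stated assumptions: `find_module` is a hypothetical helper name, not part of Bioperl or bioperl-db, and the two /path/to/... directories are placeholders for wherever the LIB and PREFIX arguments put the modules.

```shell
#!/bin/sh
# Sketch only: find_module is a hypothetical helper, not a Bioperl tool.
# It prints the first directory in a colon-separated search path
# (for example the value of PERL5LIB) that contains the given module file,
# and returns non-zero if no directory contains it.
find_module() {
    module_file=$1   # path relative to a library root, e.g. Bio/DB/BioDB.pm
    search_path=$2   # colon-separated directories, e.g. "$PERL5LIB"
    old_ifs=$IFS
    IFS=:
    for dir in $search_path; do
        if [ -f "$dir/$module_file" ]; then
            IFS=$old_ifs
            printf '%s\n' "$dir"
            return 0
        fi
    done
    IFS=$old_ifs
    return 1
}

# Placeholder install locations (assumptions); substitute your own.
PERL5LIB=/path/to/bioperl:/path/to/bioperl-db
export PERL5LIB

# Check the two module files named in the error messages in this thread.
for wanted in Bio/Root/Root.pm Bio/DB/BioDB.pm; do
    if found_in=$(find_module "$wanted" "$PERL5LIB"); then
        echo "$wanted found in $found_in"
    else
        echo "$wanted not found in PERL5LIB"
    fi
done
```

If Bio/Root/Root.pm is found but Bio/DB/BioDB.pm is not, that would be consistent with the bioperl-db directory missing from PERL5LIB, which is the situation described in this message.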
You can have multiple directories in PERL5LIB, separated by colons:

$ setenv PERL5LIB /path/to/where/I/installed/bioperl:/path/to/where/I/installed/bioperl-db

	-hilmar

On Dec 10, 2005, at 2:59 PM, Angshu Kar wrote: > Hi, > > Now I'm getting this new error: > > Can't locate Bio/DB/BioDB.pm in @INC (@INC contains: > /home/akar/local/perl//i386-linux-thread-multi /home/akar/local/perl/ > /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 > /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi > /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi > /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi > /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi > /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi > /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi > /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 > /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 > /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 > /usr/lib/perl5/site_perl > /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi > /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi > /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi > /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi > /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi > /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi > /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 > /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 > /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 > /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 8. > BEGIN failed--compilation aborted at load_seqdatabase.pl line 8.
> > > I've checked the Bio/DB/ folder but it doesn't contain the BioDB.pm > module - > it contains the following: > > > Ace.pm DBFetch.pm Flat GFF MeSH.pm > RefSeq.pm Taxonomy.pm XEMBLService.pm > Biblio EMBL.pm Flat.pm GFF.pm NCBIHelper.pm > Registry.pm Universal.pm > BiblioI.pm Failover.pm GDB.pm InMemoryCache.pm Query > SeqI.pm UpdateableSeqI.pm > BioFetch.pm Fasta.pm GenBank.pm Makefile.PL QueryI.pm > SwissProt.pm WebDBSeqI.pm > CUTG.pm FileCache.pm GenPept.pm MANIFEST > RandomAccessI.pm > Taxonomy XEMBL.pm > > Have I installed some wrong version of bioperl-db ? (I've used > http://bio.perl.org/Core/Latest/index.shtml) > > Could anyone please let me know what I've missed? > > Thanks, > Angshu > > > On 12/10/05, Angshu Kar wrote: >> >> You are marvellous Jason...Thanks a lot for using the lingo thats for >> a >> freshie like me. >> I'll apply this today to the load_seqdatabase.pl and let you know if >> any >> problem arises. >> >> Thanks, >> Angshu >> >> >> On 12/10/05, Jason Stajich wrote: >>> >>> From the INSTALL document in the Bioperl distribution >>> >>> >>> You can explicitly tell perl where to look for modules by using >>> the >>> lib module which comes standard with perl. >>> >>> >>> Example: >>> >>> >>> #!/usr/bin/perl >>> >>> >>> use lib "/home/users/dag/My_Local_Perl_Modules/"; >>> use Bio::Seq; >>> >>> >>> <...insert whizzy perl code here...> >>> >>> >>> Or, you can set the environmental variable PERL5LIB: >>> >>> >>> csh or tcsh: >>> >>> >>> setenv PERL5LIB /home/users/dag/My_Local_Perl_Modules/ >>> >>> bash or sh: >>> >>> >>> export PERL5LIB=/home/users/dag/My_Local_Perl_Modules/ >>> >>> >>> >>> On Dec 10, 2005, at 11:51 AM, Angshu Kar wrote: >>> >>> Yes Jason. But what I've done is, instead of putting the .pm and .pl >>> files in default locations I've used the LIB and PREFIX arguments to >>> place them in my local directory. This I've done for bioperl as well >>> as >>> bioperl-db modules. 
>>> Now could you please help me in how to make perl find it? >>> >>> Thanks, >>> Angshu >>> >>> >>> On 12/10/05, Jason Stajich wrote: >>>> >>>> you have not installed it so that your perl knows to find it. Did >>>> you >>>> do 'make install'? >>>> On Dec 9, 2005, at 10:21 PM, Angshu Kar wrote: >>>> >>>> Thanks Jason... >>>> I'm sorry but I didn't get you. >>>> I've installed bioperl as well as bioperl-db module in my system... >>>> Now what should be my next step to resolve this problem? >>>> I'm sorry again, but as I told that I'm a novice in this domain. >>>> >>>> Thanks, >>>> Angshu >>>> >>>> >>>> On 12/9/05, Jason Stajich wrote: >>>>> >>>>> >>>>> Follow the install instructions for bioperl first, you need bioperl >>>>> to run bioperl-db. >>>>> These include, set your PERL5LIB or install bioperl on your system >>>>> or >>>>> run the load script with -I PATH/TO/BIOPERL >>>>> >>>>> >>>>> On Dec 9, 2005, at 7:54 PM, Angshu Kar wrote: >>>>> >>>>>> One thing I missed was that my Root.pm resides >>>>> in a different >>>>>> path...How to >>>>>> specify that? >>>>>> >>>>>> On 12/9/05, Angshu Kar < angshu96@gmail.com> wrote: >>>>>>> >>>>>>> Thanks a lot Barry. 
>>>>>>> >>>>>>> Now I'm getting this error while tryin to run the >>>>>>> load_seqdatabase.pl in a >>>>>>> linux box (I used : >>>>>>> perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228) >>>>>>> >>>>>>> >>>>>>> Can't locate Bio/Root/Root.pm in @INC (@INC contains: >>>>>>> /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 >>>>> >>>>>>> /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi >>>>>>> /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi >>>>>>> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi >>>>>>> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi >>>>>>> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi >>>>>>> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi >>>>>>> /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 >>>>>>> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 >>>>>>> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 >>>>>>> /usr/lib/perl5/site_perl >>>>>>> /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi >>>>>>> /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi >>>>>>> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi >>>>>>> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi >>>>>>> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi >>>>>>> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi >>>>>>> /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 >>>>>>> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 >>>>> >>>>>>> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 >>>>> >>>>>>> /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7. >>>>>>> BEGIN failed--compilation aborted at load_seqdatabase.pl line 7. >>>>>>> >>>>>>> Please guide. >>>>>>> >>>>>>> Thanks, >>>>>>> Angshu >>>>>>> >>>>>>> On 12/9/05, Barry Moore < bmoore@genetics.utah.edu> wrote: >>>>>>>> >>>>>>>> Angshu- >>>>>>>> >>>>>>>> Make the namespace whatever you want it to be. 
This is useful >>>>> if >>>>>>>> you >>>>>>>> want to load sequence from different sources into the same >>>>>>>> database. As >>>>>>>> for the format - you tell us what format is the file in? You >>>>>>>> could just >>>>>>>> let bioperl guess, but looking at the file and deciding yourself >>>>>>>> would >>>>>>>> be your best bet. >>>>>>>> >>>>>>>> Barry >>>>>>>> >>>>>>>>> -----Original Message----- >>>>>>>>> From: bioperl-l-bounces@portal.open-bio.org [mailto: bioperl-l- >>>>>>>>> bounces@portal.open-bio.org ] On Behalf Of Angshu Kar >>>>>>>>> Sent: Friday, December 09, 2005 5:22 PM >>>>>>>>> To: Sean Davis >>>>>>>>> Cc: bioperl-l >>>>>>>>> Subject: Re: [Bioperl-l] loading data to biosql tables >>>>>>>>> >>>>>>>>> Hi Sean, >>>>>>>>> >>>>>>>>> A small help I need before I run the load_seqdatabase.pl. I've >>>>>>>> downloaded >>>>>>>>> my >>>>>>>>> datafile which is ATH1_cds_cm_20040228 from TAIR. What's the >>>>>>>>> namespace >>>>>>>> >>>>>>>> and >>>>>>>>> format for this? >>>>>>>>> >>>>>>>>> Thanks, >>>>>>>>> Angshu >>>>>>>>> >>>>>>>>> _______________________________________________ >>>>>>>>> Bioperl-l mailing list >>>>>>>>> Bioperl-l@portal.open-bio.org >>>>>>>>> http://portal.open-bio.org/mailman/listinfo/bioperl-l >>>>>>>> >>>>>>> >>>>>>> >>>>>> >>>>>> _______________________________________________ >>>>>> Bioperl-l mailing list >>>>>> Bioperl-l@portal.open-bio.org >>>>>> http://portal.open-bio.org/mailman/listinfo/bioperl-l >>>>> >>>>> -- >>>>> Jason Stajich >>>>> Duke University >>>>> http://www.duke.edu/~jes12 >>>>> >>>>> >>>>> >>>> >>>> -- >>>> Jason Stajich >>>> Duke University >>>> http://www.duke.edu/~jes12 >>>> >>>> >>>> >>>> >>> >>> >>> -- >>> Jason Stajich >>> Duke University >>> http://www.duke.edu/~jes12 >>> >>> >>> >>> >> >> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > -- 
------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 ------------------------------------------------------------- From hlapp at gmx.net Sat Dec 10 22:22:48 2005 From: hlapp at gmx.net (Hilmar Lapp) Date: Sat Dec 10 22:22:16 2005 Subject: [Bioperl-l] newbies In-Reply-To: References: Message-ID: Angshu, somebody else pointed out already that you really need to take a look at the documentation. This wasn't a joke - I can only recommend you take that advice pretty literally or soon you'll find yourself posting questions and being ignored. You're not the first newbie ever to hit bioperl or bioperl-db, and you're not the first one either to face the problems you're facing - that's why we have INSTALL documents and FAQs and such. They were written exactly for newbie people like you - not for me or Barry or Jason - so that most newbie questions DO NOT have to be addressed over the mailing list. People responding to you do so in their spare time out of very busy lives. If you don't read documentation you're essentially saying that you don't respect other people's time, because you're happily wasting that time for questions you could have easily answered yourself had you bothered to read the documentation, instead of using people's time to assist with problems which are NOT readily addressed in the documentation. If you don't respect somebody's time, why should people think you respect them themselves? I recommend you think about that next time you're going to hit the keyboard as soon as something doesn't work as expected. -hilmar On Dec 9, 2005, at 4:54 PM, Angshu Kar wrote: > One thing I missed was that my Root.pm resides in a different > path...How to > specify that? > > On 12/9/05, Angshu Kar wrote: >> >> Thanks a lot Barry. 
>> >> Now I'm getting this error while tryin to run the load_seqdatabase.pl >> in a >> linux box (I used : >> perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228) >> >> >> Can't locate Bio/Root/Root.pm in @INC (@INC contains: >> /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 >> /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi >> /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi >> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi >> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi >> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi >> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi >> /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 >> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 >> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 >> /usr/lib/perl5/site_perl >> /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi >> /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi >> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi >> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi >> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi >> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi >> /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 >> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 >> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 >> /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7. >> BEGIN failed--compilation aborted at load_seqdatabase.pl line 7. >> >> Please guide. >> >> Thanks, >> Angshu >> >> On 12/9/05, Barry Moore < bmoore@genetics.utah.edu> wrote: >>> >>> Angshu- >>> >>> Make the namespace whatever you want it to be. This is useful if you >>> want to load sequence from different sources into the same database. >>> As >>> for the format - you tell us what format is the file in? 
You could >>> just >>> let bioperl guess, but looking at the file and deciding yourself >>> would >>> be your best bet. >>> >>> Barry >>> >>>> -----Original Message----- >>>> From: bioperl-l-bounces@portal.open-bio.org [mailto: bioperl-l- >>>> bounces@portal.open-bio.org] On Behalf Of Angshu Kar >>>> Sent: Friday, December 09, 2005 5:22 PM >>>> To: Sean Davis >>>> Cc: bioperl-l >>>> Subject: Re: [Bioperl-l] loading data to biosql tables >>>> >>>> Hi Sean, >>>> >>>> A small help I need before I run the load_seqdatabase.pl. I've >>> downloaded >>>> my >>>> datafile which is ATH1_cds_cm_20040228 from TAIR. What's the >>>> namespace >>> >>> and >>>> format for this? >>>> >>>> Thanks, >>>> Angshu >>>> >>>> _______________________________________________ >>>> Bioperl-l mailing list >>>> Bioperl-l@portal.open-bio.org >>>> http://portal.open-bio.org/mailman/listinfo/bioperl-l >>> >> >> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 ------------------------------------------------------------- From angshu96 at gmail.com Sat Dec 10 22:30:08 2005 From: angshu96 at gmail.com (Angshu Kar) Date: Sat Dec 10 22:27:51 2005 Subject: [Bioperl-l] newbies In-Reply-To: References: Message-ID: I'm extremely apologetic about the inconvenience caused by me to you and other people. Actually I was getting more gulping and greedy by each of the ready and easily comprehensive answers that you bright people were posting. Thanks for pointing out. I really feel self-accusing. I'm sorry Hilmar. This wouldn't be repeated in future. Thanks, Angshu On 12/10/05, Hilmar Lapp wrote: > > Angshu, somebody else pointed out already that you really need to take > a look at the documentation. 
This wasn't a joke - I can only recommend > you take that advice pretty literally or soon you'll find yourself > posting questions and being ignored. > > You're not the first newbie ever to hit bioperl or bioperl-db, and > you're not the first one either to face the problems you're facing - > that's why we have INSTALL documents and FAQs and such. They were > written exactly for newbie people like you - not for me or Barry or > Jason - so that most newbie questions DO NOT have to be addressed over > the mailing list. > > People responding to you do so in their spare time out of very busy > lives. If you don't read documentation you're essentially saying that > you don't respect other people's time, because you're happily wasting > that time for questions you could have easily answered yourself had you > bothered to read the documentation, instead of using people's time to > assist with problems which are NOT readily addressed in the > documentation. If you don't respect somebody's time, why should people > think you respect them themselves? > > I recommend you think about that next time you're going to hit the > keyboard as soon as something doesn't work as expected. > > -hilmar > > On Dec 9, 2005, at 4:54 PM, Angshu Kar wrote: > > > One thing I missed was that my Root.pm resides in a different > > path...How to > > specify that? > > > > On 12/9/05, Angshu Kar wrote: > >> > >> Thanks a lot Barry. 
> >> > >> Now I'm getting this error while tryin to run the load_seqdatabase.pl > >> in a > >> linux box (I used : > >> perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228) > >> > >> > >> Can't locate Bio/Root/Root.pm in @INC (@INC contains: > >> /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 > >> /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi > >> /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 > >> /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 > >> /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 > >> /usr/lib/perl5/site_perl > >> /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi > >> /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 > >> /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 > >> /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 > >> /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7. > >> BEGIN failed--compilation aborted at load_seqdatabase.pl line 7. > >> > >> Please guide. > >> > >> Thanks, > >> Angshu > >> > >> On 12/9/05, Barry Moore < bmoore@genetics.utah.edu> wrote: > >>> > >>> Angshu- > >>> > >>> Make the namespace whatever you want it to be. This is useful if you > >>> want to load sequence from different sources into the same database. > >>> As > >>> for the format - you tell us what format is the file in? 
You could > >>> just > >>> let bioperl guess, but looking at the file and deciding yourself > >>> would > >>> be your best bet. > >>> > >>> Barry > >>> > >>>> -----Original Message----- > >>>> From: bioperl-l-bounces@portal.open-bio.org [mailto: bioperl-l- > >>>> bounces@portal.open-bio.org] On Behalf Of Angshu Kar > >>>> Sent: Friday, December 09, 2005 5:22 PM > >>>> To: Sean Davis > >>>> Cc: bioperl-l > >>>> Subject: Re: [Bioperl-l] loading data to biosql tables > >>>> > >>>> Hi Sean, > >>>> > >>>> A small help I need before I run the load_seqdatabase.pl. I've > >>> downloaded > >>>> my > >>>> datafile which is ATH1_cds_cm_20040228 from TAIR. What's the > >>>> namespace > >>> > >>> and > >>>> format for this? > >>>> > >>>> Thanks, > >>>> Angshu > >>>> > >>>> _______________________________________________ > >>>> Bioperl-l mailing list > >>>> Bioperl-l@portal.open-bio.org > >>>> http://portal.open-bio.org/mailman/listinfo/bioperl-l > >>> > >> > >> > > > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l@portal.open-bio.org > > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > > > > -- > ------------------------------------------------------------- > Hilmar Lapp email: lapp at gnf.org > GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 > ------------------------------------------------------------- > > > From bmoore at genetics.utah.edu Sun Dec 11 08:47:06 2005 From: bmoore at genetics.utah.edu (Barry Moore) Date: Sun Dec 11 08:43:43 2005 Subject: [Bioperl-l] loading data to biosql tables Message-ID: Angshu- You can either add 'use lib "/path/to/your/lib";' at the top of each of your Perl scripts, or set PERL5LIB to that same directory. In both cases the path is the directory that holds the Bio/ directory (so that Bio/Root/Root.pm is found beneath it), not the path to Root.pm itself. For example, I use the bash shell, and in my .bashrc file I have the line 'export PERL5LIB=/home/bmoore/perl/lib'. 
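
A minimal sketch of the lookup behind the "Can't locate Bio/Root/Root.pm in @INC" error, using only core Perl. The helper name find_module_dir and the throwaway directory tree are hypothetical, made up for illustration; no BioPerl installation is assumed. It suggests why the entry given to `use lib` or PERL5LIB should be the directory that contains the Bio/ tree rather than the path to Root.pm.

```perl
use strict;
use warnings;
use File::Spec;
use File::Temp qw(tempdir);
use File::Path qw(make_path);

# Hypothetical helper: return the first directory in @dirs under which the
# relative module path (e.g. "Bio/Root/Root.pm") exists as a file.
# This mirrors, in simplified form, how perl walks @INC for "use Bio::Root::Root".
sub find_module_dir {
    my ($relpath, @dirs) = @_;
    for my $dir (@dirs) {
        return $dir if -f File::Spec->catfile($dir, split m{/}, $relpath);
    }
    return;
}

# Build a throwaway library tree so the sketch runs without BioPerl installed.
my $libdir = tempdir(CLEANUP => 1);
make_path(File::Spec->catdir($libdir, 'Bio', 'Root'));
my $module = File::Spec->catfile($libdir, 'Bio', 'Root', 'Root.pm');
open my $fh, '>', $module or die "cannot write $module: $!";
print {$fh} "package Bio::Root::Root;\n1;\n";
close $fh;

# Search entry is the directory that contains Bio/: the module is found.
my $found = find_module_dir('Bio/Root/Root.pm', $libdir);

# Search entry is the path to Root.pm itself: nothing can be found beneath a file.
my $missed = find_module_dir('Bio/Root/Root.pm', $module);

print defined $found  ? "found under $found\n"  : "not found\n";
print defined $missed ? "found under $missed\n" : "not found via the file path\n";
```

On a real installation, `perl -MBio::Root::Root -e 1` exits silently when the module can be located, and `perl -e 'print join("\n", @INC), "\n"'` lists the directories perl actually searches.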
Barry -----Original Message----- From: Angshu Kar [mailto:angshu96@gmail.com] Sent: Friday, December 09, 2005 5:55 PM To: Barry Moore Cc: bioperl-l Subject: Re: [Bioperl-l] loading data to biosql tables One thing I missed was that my Root.pm resides in a different path...How to specify that? On 12/9/05, Angshu Kar < angshu96@gmail.com > wrote: Thanks a lot Barry. Now I'm getting this error while tryin to run the load_seqdatabase.pl in a linux box (I used : perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228) Can't locate Bio/Root/Root.pm in @INC (@INC contains: /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 /usr/lib/perl5/site_perl /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7. BEGIN failed--compilation aborted at load_seqdatabase.pl line 7. Please guide. Thanks, Angshu On 12/9/05, Barry Moore < bmoore@genetics.utah.edu > wrote: Angshu- Make the namespace whatever you want it to be. 
This is useful if you want to load sequence from different sources into the same database. As for the format - you tell us what format is the file in? You could just let bioperl guess, but looking at the file and deciding yourself would be your best bet. Barry > -----Original Message----- > From: bioperl-l-bounces@portal.open-bio.org [mailto: bioperl-l- > bounces@portal.open-bio.org] On Behalf Of Angshu Kar > Sent: Friday, December 09, 2005 5:22 PM > To: Sean Davis > Cc: bioperl-l > Subject: Re: [Bioperl-l] loading data to biosql tables > > Hi Sean, > > A small help I need before I run the load_seqdatabase.pl. I've downloaded > my > datafile which is ATH1_cds_cm_20040228 from TAIR. What's the namespace and > format for this? > > Thanks, > Angshu > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l From chen_li3 at yahoo.com Sun Dec 11 15:47:02 2005 From: chen_li3 at yahoo.com (chen li) Date: Sun Dec 11 15:51:05 2005 Subject: [Bioperl-l] load sequence files into Mysql Message-ID: <20051211204702.14515.qmail@web36805.mail.mud.yahoo.com> Hi all, I already install Biosql on my computer and everything works fine. Now I want to download some mouse EST database files which can be read into mysql correctly without anymore modifications. Where can I download these files or other sequence files? And I know it still has some problems to read the fasta files from NCBI(as Hilmar already points out). The link provided in the HOWTO is broken. Any suggestion will be appreciated and thanks in advance. Li __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! 
Mail has the best spam protection around http://mail.yahoo.com From hlapp at gmx.net Sun Dec 11 18:51:19 2005 From: hlapp at gmx.net (Hilmar Lapp) Date: Sun Dec 11 18:48:51 2005 Subject: [Bioperl-l] load sequence files into Mysql In-Reply-To: <20051211204702.14515.qmail@web36805.mail.mud.yahoo.com> References: <20051211204702.14515.qmail@web36805.mail.mud.yahoo.com> Message-ID: Have you tried the GenBank EST database flat files (in GenBank format)? On Dec 11, 2005, at 12:47 PM, chen li wrote: > Hi all, > > I already install Biosql on my computer and everything > works fine. Now I want to download some mouse EST > database files which can be read into mysql correctly > without anymore modifications. Where can I download > these files or other sequence files? And I know it > still has some problems to read the fasta files from > NCBI(as Hilmar already points out). The link provided > in the HOWTO is broken. > > Any suggestion will be appreciated and thanks in > advance. > > > > Li > > __________________________________________________ > Do You Yahoo!? > Tired of spam? Yahoo! Mail has the best spam protection around > http://mail.yahoo.com > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > -- ------------------------------------------------------------- Hilmar Lapp email: lapp at gnf.org GNF, San Diego, Ca. 92121 phone: +1-858-812-1757 ------------------------------------------------------------- From chen_li3 at yahoo.com Sun Dec 11 20:29:08 2005 From: chen_li3 at yahoo.com (chen li) Date: Sun Dec 11 20:33:10 2005 Subject: [Bioperl-l] load sequence files into Mysql In-Reply-To: Message-ID: <20051212012908.77276.qmail@web36811.mail.mud.yahoo.com> Hi Hilmar, Thanks for the replay. Do you mean the files in this link: ftp://ftp.ncbi.nih.gov/repository/dbEST/ There are many files out in this folder. I am not sure which one belongs to mouse. 
Li --- Hilmar Lapp wrote: > Have you tried the GenBank EST database flat files > (in GenBank format)? > > On Dec 11, 2005, at 12:47 PM, chen li wrote: > > > Hi all, > > > > I already install Biosql on my computer and > everything > > works fine. Now I want to download some mouse EST > > database files which can be read into mysql > correctly > > without anymore modifications. Where can I > download > > these files or other sequence files? And I know > it > > still has some problems to read the fasta files > from > > NCBI(as Hilmar already points out). The link > provided > > in the HOWTO is broken. > > > > Any suggestion will be appreciated and thanks in > > advance. > > > > > > > > Li > > > > __________________________________________________ > > Do You Yahoo!? > > Tired of spam? Yahoo! Mail has the best spam > protection around > > http://mail.yahoo.com > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l@portal.open-bio.org > > > http://portal.open-bio.org/mailman/listinfo/bioperl-l > > > > > -- > ------------------------------------------------------------- > Hilmar Lapp email: lapp > at gnf.org > GNF, San Diego, Ca. 92121 phone: > +1-858-812-1757 > ------------------------------------------------------------- > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l > __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com From heroen.verbruggen at ugent.be Mon Dec 12 04:37:15 2005 From: heroen.verbruggen at ugent.be (Heroen Verbruggen) Date: Mon Dec 12 05:04:27 2005 Subject: [Bioperl-l] TreeIO::nexus - multiple trees Message-ID: <000901c5feff$a2f555b0$f2a4c19d@bikini> Dear all, There seems to be a bug in the TreeIO/nexus.pm module. When I tried parsing a file with multiple trees, I only got every other tree. 
The problem is that the parser matches whitespace after each semicolon and requires whitespace before the subsequent tree (regex on lines 166-167). I've solved it by removing the terminal whitespace from the regex. Cheers, Heroen From sdavis2 at mail.nih.gov Mon Dec 12 08:24:40 2005 From: sdavis2 at mail.nih.gov (Sean Davis) Date: Mon Dec 12 08:22:10 2005 Subject: [Bioperl-l] load sequence files into Mysql In-Reply-To: <20051212012908.77276.qmail@web36811.mail.mud.yahoo.com> Message-ID: On 12/11/05 8:29 PM, "chen li" wrote: > Hi Hilmar, > > Thanks for the replay. > > Do you mean the files in this link: > > ftp://ftp.ncbi.nih.gov/repository/dbEST/ > > There are many files out in this folder. I am not sure > which one belongs to mouse. I'm not sure what you need, but UCSC maintains fasta of the mouse for all mouse genbank records. See here: http://hgdownload.cse.ucsc.edu/goldenPath/mm6/bigZips/ They have tables in their database that contain much of the actual genbank records, but in a relational form. Sean From jason.stajich at duke.edu Mon Dec 12 08:35:47 2005 From: jason.stajich at duke.edu (Jason Stajich) Date: Mon Dec 12 08:53:44 2005 Subject: [Bioperl-l] TreeIO::nexus - multiple trees In-Reply-To: <000901c5feff$a2f555b0$f2a4c19d@bikini> References: <000901c5feff$a2f555b0$f2a4c19d@bikini> Message-ID: <1FCC3F1A-9083-482E-8155-14AFB0EC2E47@duke.edu> Hmm what version of bioperl are you using? That whitespace is optional now. while( $trees =~ /\s+tree\s+(\S+)\s*\=\s*(?:\[\S+\])?\s*([^\;]+;)\s*/ig) On Dec 12, 2005, at 4:37 AM, Heroen Verbruggen wrote: > Dear all, > > There seems to be a bug in the TreeIO/nexus.pm module. When I tried > parsing > a file with multiple trees, I only got every other tree. The > problem is that > the parser matches whitespace after each semicolon and requires > whitespace > before the subsequent tree (regex on lines 166-167). I've solved it by > removing the terminal whitespace from the regex. 
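
The skipped-tree behaviour described above can be reproduced with plain Perl regexes. This is only a sketch on a made-up, simplified trees string, not the code in TreeIO/nexus.pm; the helper tree_names and the sample data are assumptions for illustration, and whether the module itself is affected depends on the version in use. The point is that when a global match also consumes the whitespace after each semicolon, the next `tree` statement no longer has the leading whitespace that `\s+tree` requires.

```perl
use strict;
use warnings;

# Made-up, simplified trees block (not taken from a real NEXUS file).
my $trees = "\n tree t1 = (A,B);\n tree t2 = (C,D);\n tree t3 = (E,F);\n tree t4 = (G,H);\n";

# Hypothetical helper: collect the tree names matched by repeated /g matching.
sub tree_names {
    my ($text, $re) = @_;
    my @names;
    while ($text =~ /$re/g) {
        push @names, $1;
    }
    return @names;
}

# Pattern that also consumes whitespace after the closing semicolon.
my $consuming = qr/\s+tree\s+(\S+)\s*\=\s*(?:\[\S+\])?\s*([^\;]+;)\s*/i;

# Same pattern with the terminal whitespace removed.
my $trimmed = qr/\s+tree\s+(\S+)\s*\=\s*(?:\[\S+\])?\s*([^\;]+;)/i;

my @with_trailing    = tree_names($trees, $consuming);
my @without_trailing = tree_names($trees, $trimmed);

print "with trailing whitespace:    ", join(',', @with_trailing), "\n";     # t1,t3
print "without trailing whitespace: ", join(',', @without_trailing), "\n";  # t1,t2,t3,t4
```

An alternative that keeps the leading `\s+` requirement would be to leave the separator unconsumed with a lookahead; removing the terminal whitespace, as reported above, is the smaller change.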
> > Cheers, > > Heroen > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l -- Jason Stajich Duke University http://www.duke.edu/~jes12 From MEC at stowers-institute.org Mon Dec 12 10:45:06 2005 From: MEC at stowers-institute.org (Cook, Malcolm) Date: Mon Dec 12 10:43:33 2005 Subject: [Bioperl-l] HOWTO: take a slice of a split location Message-ID: Thanks Jason, If I simply wanted the 3'-most 1 kbp of CDS I would have taken your approach, but I needed instead to focus my algorithm on this location, which may be a split location, and then select the 'best' sublocation (exon) which doesn't span a splice site, so I needed to preserve the gaps. I guess I could have simply collected sequence starting at the 3'-most sublocation and stopped when I got my fill... I guess I found the expressivity of taking an (appropriately oriented) `slice` of the entire CDS too compelling to avoid adding it to LocationI, which I hope to do once I have my implementation's performance details worked out (film at 11). Any objections to this pending addition? I think it is in spirit, do you? Looking at Bio::Coordinate::GeneMapper, I don't see how this could address this specifically. Pointers welcomed! Cheers, Malcolm -----Original Message----- From: Jason Stajich [mailto:jason.stajich@duke.edu] Sent: Saturday, December 10, 2005 12:12 PM To: Cook, Malcolm Cc: bioperl-l; Seidel, Christopher Subject: Re: [Bioperl-l] HOWTO: take a slice of a split location Hi Malcolm - Don't have a chance to look at your code, but my approach to this problem would be to first splice the sequence out from the genome my $feature = Bio::SeqFeature::Generic->new(-location => $splitlocation); my $cdsseq = $feature->spliced_seq; then just retrieve the last 1000 bases of this sequence. my $threeprime = $cdsseq->subseq($cdsseq->length - 1000, $cdsseq->length); (this might be off-by-one?) 
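
On the off-by-one question: a small sketch with plain strings, assuming that subseq(start, end) takes 1-based, inclusive coordinates, as BioPerl sequence objects do. The helpers subseq_1based and last_n and the sample sequence are hypothetical stand-ins, not BioPerl calls. Under that assumption, the range from length - 1000 to length covers 1001 bases, and starting at length - 1000 + 1 gives exactly 1000.

```perl
use strict;
use warnings;

# Stand-in for a 1-based, inclusive subseq(start, end) on a plain string.
sub subseq_1based {
    my ($seq, $start, $end) = @_;
    return substr($seq, $start - 1, $end - $start + 1);
}

# Last $n bases: start at length - $n + 1, clamped to 1 for short sequences.
sub last_n {
    my ($seq, $n) = @_;
    my $len   = length $seq;
    my $start = $len - $n + 1;
    $start = 1 if $start < 1;
    return subseq_1based($seq, $start, $len);
}

my $cds = 'ACGT' x 500;    # made-up 2000-base sequence

# Range as written in the message above: length - 1000 .. length.
my $as_quoted = subseq_1based($cds, length($cds) - 1000, length($cds));

# Adjusted range: length - 1000 + 1 .. length.
my $adjusted = last_n($cds, 1000);

printf "length - 1000 .. length     : %d bases\n", length $as_quoted;   # 1001
printf "length - 1000 + 1 .. length : %d bases\n", length $adjusted;    # 1000
```

The clamp in last_n also covers a CDS shorter than the requested length, in which case the whole sequence is returned.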
There is also a module to map between coordinates - Bio::Coordinate::GeneMapper if you need to go from transcript to genomic coordinates. -jason On Dec 10, 2005, at 2:06 AM, Cook, Malcolm wrote: > Fellow Bioperlers, > > I was in need of extracting the 3'-most 1000 bp from multiple > genomic CDS regions (designing 70mer u-array probes). > > I looked in vain for Bio::Location->splice($from,$to); > > So I wrote one which works but suffers from actually materializing > the list of integer indices into the sequence for every base. > > Has anyone a better approach they'd care to share? > > Malcolm Cook - mec@stowers-institute.org > Stowers Institute for Medical Research - Kansas City, MO USA > > P.S. Here's what I wrote: > > package Bio::LocationI; # Code in the interface so it works > # with both ::Split and ::Simple > # Bio::Locations > > sub _intspans { > # Purpose: for a (presumably) monotonically increasing list of > # integers, return list of arrays each holding min and max of > # the list's internal contiguous spans. > # > # Example: 1..5,10..20,30 => ([1,5],[10,20],[30,30]) > my @i = @_; > die "nothing passed to intspans" unless @i; > my @s = ([$i[0],shift(@i)]); > foreach (@i) { > if ($_ == 1 + $s[0][1]) { > $s[0][1] = $_; > } else { > unshift @s, [$_, $_] > }} > reverse @s; > } > > sub slice { > # Purpose: compute a slice of the Location, using Perl's normal slice > # semantics, except that it trims out-of-range values. > my ($self, $from, $to) = @_; > my @int = eval (join ',', map {$_->start . '..' . $_->end} $self->each_Location); # build perl expression using the range (..) and list (,) operators. > @int = @int[$from..$to]; > @int = grep {$_} @int; # Removing undefs (in case $from/$to out of bounds). 
> my @intspans = _intspans(@int); > new Bio::Location::Split (-strand => $self->strand, > -locations => [map {new Bio::Location::Simple(-start => $_-> > [0], > -end => $_->[1], > -strand => $self->strand, > ) > } @intspans], > ); > } > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l@portal.open-bio.org > http://portal.open-bio.org/mailman/listinfo/bioperl-l -- Jason Stajich Duke University http://www.duke.edu/~jes12 From hlapp at gmx.net Mon Dec 12 13:09:17 2005 From: hlapp at gmx.net (Hilmar Lapp) Date: Mon Dec 12 13:06:44 2005 Subject: [Bioperl-l] load sequence files into Mysql In-Reply-To: <20051212012908.77276.qmail@web36811.mail.mud.yahoo.com> References: <20051212012908.77276.qmail@web36811.mail.mud.yahoo.com> Message-ID: <5e252a0be2ad0b3dbfdc3252b9a1b6f8@gmx.net> Li, you might want to consider to subscribe to the Bulletin Board at bioinformatics.org. The genbank database is at ftp://ftp.ncbi.nih.gov/genbank. gbest*.seq.gz is the EST database. It is not segregated by species, but it is easy to screen out by species using the --seqfilter option of load_seqdatabase.pl. -hilmar On Dec 11, 2005, at 5:29 PM, chen li wrote: > Hi Hilmar, > > Thanks for the replay. > > Do you mean the files in this link: > > ftp://ftp.ncbi.nih.gov/repository/dbEST/ > > There are many files out in this folder. I am not sure > which one belongs to mouse. > > > Li > > --- Hilmar Lapp wrote: > >> Have you tried the GenBank EST database flat files >> (in GenBank format)? >> >> On Dec 11, 2005, at 12:47 PM, chen li wrote: >> >>> Hi all, >>> >>> I already install Biosql on my computer and >> everything >>> works fine. Now I want to download some mouse EST >>> database files which can be read into mysql >> correctly >>> without anymore modifications. Where can I >> download >>> these files or other sequence files? And I know >> it >>> still has some problems to read the fasta files >> from >>> NCBI(as Hilmar already points out). 
>>> The link provided in the HOWTO is broken.
>>>
>>> Any suggestion will be appreciated and thanks in
>>> advance.
>>>
>>> Li
>>
>> --
>> -------------------------------------------------------------
>> Hilmar Lapp                            email: lapp at gnf.org
>> GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
>> -------------------------------------------------------------

--
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------

From bmoore at genetics.utah.edu Mon Dec 12 13:53:10 2005
From: bmoore at genetics.utah.edu (Barry Moore)
Date: Mon Dec 12 13:49:31 2005
Subject: FW: [Bioperl-l] loading data to biosql tables
Message-ID:

Angshu-

I support Hilmar's previous post about "Newbie's". This is the kind of question that you should be able to answer for yourself with some effort on your part.

Barry

________________________________________
From: Angshu Kar [mailto:angshu96@gmail.com]
Sent: Sunday, December 11, 2005 8:59 AM
To: Barry Moore
Subject: Re: [Bioperl-l] loading data to biosql tables

But Barry my problem is that despite installing bioperl and bioperl-db module I cannot find the BioDB.pm.
Did I install anything wrong?

Thanks,
Angshu

On 12/11/05, Barry Moore wrote:

Angshu-

You can either add: 'use lib "/path/to/your/lib";' at the top of all of your perl scripts, or set PERL5LIB to that same path. In both cases the path is the directory that contains the Bio/ tree (so that Bio/Root/Root.pm sits beneath it), not the path to Root.pm itself. For example, I use the bash shell, and in my .bashrc file I have a line 'export PERL5LIB=/home/bmoore/perl/lib'.

Barry

-----Original Message-----
From: Angshu Kar [mailto:angshu96@gmail.com]
Sent: Friday, December 09, 2005 5:55 PM
To: Barry Moore
Cc: bioperl-l
Subject: Re: [Bioperl-l] loading data to biosql tables

One thing I missed was that my Root.pm resides in a different path... How do I specify that?

On 12/9/05, Angshu Kar <angshu96@gmail.com> wrote:

Thanks a lot Barry. Now I'm getting this error while trying to run load_seqdatabase.pl on a linux box (I used: perl load_seqdatabase.pl /akar/seq/ATH1_cds_cm_20040228):

Can't locate Bio/Root/Root.pm in @INC (@INC contains: /usr/lib/perl5/5.8.5/i386-linux-thread-multi /usr/lib/perl5/5.8.5 /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.2/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.1/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl/5.8.4 /usr/lib/perl5/site_perl/5.8.3 /usr/lib/perl5/site_perl/5.8.2 /usr/lib/perl5/site_perl/5.8.1 /usr/lib/perl5/site_perl/5.8.0 /usr/lib/perl5/site_perl /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.2/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.1/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl/5.8.4 /usr/lib/perl5/vendor_perl/5.8.3 /usr/lib/perl5/vendor_perl/5.8.2 /usr/lib/perl5/vendor_perl/5.8.1 /usr/lib/perl5/vendor_perl/5.8.0 /usr/lib/perl5/vendor_perl .) at load_seqdatabase.pl line 7.
BEGIN failed--compilation aborted at load_seqdatabase.pl line 7.

Please guide.

Thanks,
Angshu

On 12/9/05, Barry Moore <bmoore@genetics.utah.edu> wrote:

Angshu-

Make the namespace whatever you want it to be. This is useful if you want to load sequences from different sources into the same database. As for the format - you tell us what format is the file in? You could just let bioperl guess, but looking at the file and deciding yourself would be your best bet.

Barry

> -----Original Message-----
> From: bioperl-l-bounces@portal.open-bio.org [mailto:bioperl-l-bounces@portal.open-bio.org] On Behalf Of Angshu Kar
> Sent: Friday, December 09, 2005 5:22 PM
> To: Sean Davis
> Cc: bioperl-l
> Subject: Re: [Bioperl-l] loading data to biosql tables
>
> Hi Sean,
>
> I need a little help before I run load_seqdatabase.pl. I've downloaded my
> datafile which is ATH1_cds_cm_20040228 from TAIR. What's the namespace and
> format for this?
>
> Thanks,
> Angshu

From chen_li3 at yahoo.com Mon Dec 12 15:55:42 2005
From: chen_li3 at yahoo.com (chen li)
Date: Mon Dec 12 15:59:46 2005
Subject: [Bioperl-l] load sequence files into Mysql
In-Reply-To: <5e252a0be2ad0b3dbfdc3252b9a1b6f8@gmx.net>
Message-ID: <20051212205542.72957.qmail@web36812.mail.mud.yahoo.com>

Thanks all for the information.

Li

--- Hilmar Lapp wrote:

> Li, you might want to consider subscribing to the
> Bulletin Board at bioinformatics.org. The genbank database is at
> ftp://ftp.ncbi.nih.gov/genbank. gbest*.seq.gz is the
> EST database.
>
> It is not segregated by species, but it is easy to
> screen out by species using the --seqfilter option of
> load_seqdatabase.pl.
>
> -hilmar

From angshu96 at gmail.com Mon Dec 12 20:57:07 2005
From: angshu96 at gmail.com (Angshu Kar)
Date: Mon Dec 12 20:54:28 2005
Subject: [Bioperl-l] urgent:load_seqdatabase.pl doesn't seem to like fasta...
Message-ID:

Hi,

This is the format that I have:

>At1g01010.1 68414.m00001 no apical meristem (NAM) family protein contains Pfam PF02365: No apical meristem (NAM) domain; similar to NAC domain protein NAM GB: AAD17313 GI:4325282 from [Arabidopsis thaliana]

And with regard to the following discussion:
http://bioperl.org/pipermail/bioperl-l/2004-June/016198.html

Could anyone please let me know whether any fixes have been done for this?

Appreciate your guidance.

Thanks,
Angshu

From torsten.seemann at infotech.monash.edu.au Mon Dec 12 18:27:10 2005
From: torsten.seemann at infotech.monash.edu.au (Torsten Seemann)
Date: Mon Dec 12 21:59:04 2005
Subject: [Bioperl-l] throw, not die
In-Reply-To:
References: