[Bioperl-l] Fetching genbank sequences

S.Paul s.paul at surrey.ac.uk
Mon Jun 14 01:36:20 EDT 2004


Hi Everyone:

I am trying to retrieve the sequences for alpha-synuclein in GenBank format. When I run the program, I get the following error (warning) message, and the output file is not generated:

***************************************************************
ERROR
***************************************************************
C:\research\perl\perlcodes>perl alpha_synuclein.pl
Query string
Homo sapiens[Organism] AND
 alpha-synuclein

-------------------- WARNING ---------------------
MSG: trouble dissecting "TGTTACCAATCTGAAGTGGGAGCGGCCGCATATCTTTTTTTTTTTTTTTTTTTTT
TTT T 3'];  double-stranded cDNA was ligated to Eco RI adaptors (Pharmacia),  di
gested with Not I and cloned into the Not I and Eco RI sites of  the modified pT
7T3 vector. Library is normalized, and was constructed  by Bento Soares and M.Fa
tima Bonaldo."" into chunks of 57 chars or less - this tag won't print right

***************************************************************

I am not sure how to fix this; I am enclosing the code:
***************************************************************
CODE
***************************************************************

use strict;
use Bio::DB::Query::GenBank;
use Bio::SeqIO;
use Bio::DB::GenBank;

# Entrez query; note that the string literal spans two lines
my $query_string = 'Homo sapiens[Organism] AND
 alpha-synuclein';

print "Query string \n", $query_string;
print "\n";

# Build the Entrez query against the nucleotide database
my $query = Bio::DB::Query::GenBank->new(-db    => 'nucleotide',
                                         -query => $query_string);

# Stream the matching records from GenBank
my $seqio = Bio::DB::GenBank->new->get_Stream_by_query($query);
print "Number of results:\n", $query->count, "\n";

# Open a SeqIO handle for writing the output file in GenBank format
my $outfile = Bio::SeqIO->new(-format => 'genbank',
                              -file   => '>synuclein.gbk');

while (my $s = $seqio->next_seq) {
    # Write each record to the .gbk file
    $outfile->write_seq($s);
}
***************************************************************
If I do the same thing but retrieve the sequences in FASTA format, the program seems to run but does not generate any output file. I also noticed that after I run the program, files with names like ".filename.pl.swp" appear in the directory. Is it trying to use the memory of the operating system?

Thanks in advance for the help.

Sujoy


---------------------------------------------------
Sujoy Paul, PRISE Centre, UniS, s.paul at surrey.ac.uk

