[Bioperl-l] get_sequence() gets some sequences but not others

Chris Fields cjfields at uiuc.edu
Wed Jun 20 20:11:34 UTC 2007


I'm assuming you are using the Bio::Perl exported sub get_sequence 
().  I am able to reproduce the issue using bioperl-live; it's an odd  
issue as direct use of Bio::DB::GenPept works fine:

use Bio::DB::GenPept;

my $factory = Bio::DB::GenPept->new();

my @accs = qw(SKG3_YEAST NEM1_YEAST);

my $io = $factory->get_Stream_by_acc(\@accs);

while (my $seq = $io->next_seq) {
     print "Accession:",$seq->accession,"\n";
}

chris


On Jun 20, 2007, at 1:11 PM, Wollenberg, Kurt (NIH/NIAID) wrote:

> Greetings:
>
> I am working on a script to take a list of sequence IDs, extract the
> sequences from GenPept, and then run a BLAST search for each of the
> retrieved sequences. I am having a problem with the sequence  
> retrieval,
> where some sequences are found and others are not and it's not  
> obvious to me
> why this is.
>
> For example, using a text file containing the two following IDs as  
> input:
> SKG3_YEAST
> NEM1_YEAST
>
> My script
>
> while( <IN> ) {
>   chomp;
>   my $seqid = $_;
>   my $seq_obj = get_sequence( 'genpept', $seqid );
> }
>
> will create a sequence object for the first ID, (print "Accession of
> ",$seqid," is ",$seq_obj->accession, "\n"; gives me the correct  
> accession
> number) but for the second I am told
>
> -------------------- WARNING ---------------------
> MSG: id (NEM1_YEAST) does not exist
> ---------------------------------------------------
>
> When I pull up these records using the Entrez cross-databse search  
> in my web
> browser I find genpept records for both SKG3_YEAST and NEM1_YEAST  
> (using
> these search terms). In both records these IDs reside in the same  
> field
> ("DBSOURCE    swissprot: locus") so I'm mystified why get_sequence  
> finds one
> but not the other. Any advice would be greatly appreciated.
>
> Cheers,
> Kurt Wollenberg, Ph.D.
> Phylogenetics and Sequence Analysis Consultant
> Biocomputing Research Consulting Section
> Bioinformatics and Scientific IT Program (BSIP)
> NIH/NIAID/OTIS
> Contractor, Lockheed Martin
> http://bioinformatics.niaid.nih.gov
>
> Disclaimer:
> The information in this e-mail and any of its attachments is  
> confidential
> and may contain sensitive information. It should not be used by  
> anyone who
> is not the original intended recipient. If you have received this e- 
> mail in
> error please inform the sender and delete it from your mailbox or  
> any other
> storage devices. National Institute of Allergy and Infectious  
> Diseases shall
> not accept liability for any statements made that are sender's own  
> and not
> expressly made on behalf of the NIAID by one of its representatives.
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l

Christopher Fields
Postdoctoral Researcher
Lab of Dr. Robert Switzer
Dept of Biochemistry
University of Illinois Urbana-Champaign






More information about the Bioperl-l mailing list