[Bioperl-l] GI identifier missing when using Bio::Index::GenBank?

Brian Osborne osborne1 at optonline.net
Thu Apr 27 16:29:43 UTC 2006


Todd,

Can't you go directly from the daily update to the database?

Brian O.


On 4/26/06 9:47 PM, "Todd Richmond" <richmond.todd at gmail.com> wrote:

> I've got an application where I grab the daily updates from NCBI, pull
> out just the plant sequences and store them in a separate flat file.
> Then I use Bio::Index::GenBank to index the plant flat file so I can
> pull out my sequences of interest. I'm in the midst of converting my
> scripts to using bioperl-db/biosql so I can push those sequences into
> the database. The problem is that the NCBI GI identifier isn't
> returned when using the index file.
> 
> When I run the following test script:
> ***
> use Bio::Index::GenBank;
> use Bio::SeqIO;
> use strict;
> my $Index_File_Name = 'nc0425.idx';
> my $inx = Bio::Index::GenBank->new('-filename' => $Index_File_Name);
> 
> my $seqio = new Bio::SeqIO( '-format' => 'genbank' );
> my $seq = $inx->get_Seq_by_acc('CJ521890');
> $seqio->write_seq($seq);
> ***
> 
> Diffing to the original GenBank record, the only difference is the GI
> identifier:
> 
> diff CJ521890_orig.out CJ521890_seqio.out
> 5c5
> < VERSION     CJ521890.1  GI:93266243
> ---
>> VERSION     CJ521890.1
> 
> Is this expected behaviour? If so, is there a workaround that will
> allow me to retrieve the GI from the index file so I can store it in
> the bioentry table?
> 
> Thanks, Todd
> 
> 
> --
> Todd Richmond
> richmond.todd at gmail.com
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l





More information about the Bioperl-l mailing list