[Bioperl-l] parse genbank file
    Andrew Walsh 
    walsh at cenix-bioscience.com
       
    Wed Aug  3 03:13:47 EDT 2005
    
    
  
Hello,
There is only 1 'sequence' in the file (namely, NC_003212).  The genes 
are actually features on the sequence.  So, you would have to get the 
'gene' sequence features for the sequence.
e.g.
my $gene_seq_feats = get_list_seq_feats_by_primary_tag($seq_obj, 'gene');
sub get_list_seq_feats_by_primary_tag {
     my ($seq_obj, $tag) = @_;
     ref $seq_obj or
         confess "Seq obj not defined!";
     my @features = $seq_obj->top_SeqFeatures();
     my @list = ();
     for my $feat (@features) {
         if ($feat->primary_tag eq $tag) {
             push @list, $feat;
         }
     }
     return \@list
}
HTH,
Andrew
Guido Dieterich wrote:
> Hi,
> 
> I want to parse a genbank file (Listeria Innocua)!
> 
> this is a part of the code ...
> <code>
> 
> my $file = "NC_003212.gbk";
> 
> my $stream = Bio::SeqIO->new(-file => $file, -format => 'GenBank');
> 
>     while( my $seq = $stream->next_seq ) {
> 
>         print $seq->display_id;
> 
> }
> 
> </code>
> 
> 
> output:
> 
> NC_003212
> 
> I just get the NC ID for this file, but not for the genes within ...
> 
> 
> ?????
> 
> Greetings
> 
> Guido
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
-- 
------------------------------------------------------------------
Andrew Walsh, M.Sc.
Bioinformatics Software Engineer
IT Unit
Cenix BioScience GmbH
Tatzberg 47
01307 Dresden
Germany
Tel. +49-351-4173 137
Fax  +49-351-4173 109
public key: http://www.cenix-bioscience.com/public_keys/walsh.gpg
------------------------------------------------------------------
    
    
More information about the Bioperl-l
mailing list