[Bioperl-l] Converting Genbank to fasta via SeqIO

Wes Barris wes.barris at csiro.au
Mon Jul 21 16:36:00 EDT 2003


Hi,

I am using the following code to convert a genbank file into a fasta
file:

my $seq_in = Bio::SeqIO->new('-file' => "<$infile", '-format' => 'genbank');
my $seq_out = Bio::SeqIO->new('-file' => ">$outfile", '-format' => 'fasta');

while ( my $inseq = $seq_in->next_seq ) {
    if ($seq->accession =~ '_') {
       $seq_out->write_seq($inseq);
    }
}

The genbank entry (NM_174198) results in the following defline
in the fasta file:

 >TLR4 Bos taurus toll-like receptor 4 (TLR4), mRNA.

However, I prefer to have a defline containing the accession number similar to
what is shown at the NCBI site:

 >gi|31342611|ref|NM_174198.2| Bos taurus toll-like receptor 4 (TLR4), mRNA

Is there a way to have the SeqIO routines do this?

-- 
Wes Barris
E-Mail: Wes.Barris at csiro.au



More information about the Bioperl-l mailing list