[Bioperl-l] How to merge mulitple genbank records into one record

Haiming Wang hwang at uga.edu
Mon Apr 24 15:02:02 EDT 2006


Hi,

I am wondering if there is a script or tool can merge several genbank 
records into one record with all features' coordinates updated 
accordingly. For example, I have multiple Fugu scaffold_1 genbank files 
which are arbitrarily cut by 1000000 bps. I'd like to merge them into 
one big scaffold_1 genbank file.

Thanks in advance!

-Haiming

p.s. example data
genbank record 1:
LOCUS   scaffold_1 1000000 bp DNA HTG 8-FEB-2006
DEFINITION  Fugu rubripes scaffold scaffold_1 FUGU4 partial sequence 
1..1000000  reannotated via EnsEMBL
ACCESSION   scaffold:FUGU4:scaffold_1:1:1000000:1
......
//

genbank record 2:
LOCUS  scaffold_1 1000000 bp DNA HTG 8-FEB-2006
DEFINITION  Fugu rubripes scaffold scaffold_1 FUGU4 partial 
sequence1000001..2000000 reannotated via EnsEMBL
ACCESSION   scaffold:FUGU4:scaffold_1:1000001:2000000:1
......
//



More information about the Bioperl-l mailing list