[Bioperl-l] How to merge mulitple genbank records into one record

Brian Osborne osborne1 at optonline.net
Mon Apr 24 20:35:01 UTC 2006


Haiming,

Do the locations of the features refer to the individual 1000000 bp
sub-sequences or are they actually locations on the merged sequence, the
"chromosome"?

Brian O.


On 4/24/06 3:02 PM, "Haiming Wang" <hwang at uga.edu> wrote:

> Hi,
> 
> I am wondering if there is a script or tool can merge several genbank
> records into one record with all features' coordinates updated
> accordingly. For example, I have multiple Fugu scaffold_1 genbank files
> which are arbitrarily cut by 1000000 bps. I'd like to merge them into
> one big scaffold_1 genbank file.
> 
> Thanks in advance!
> 
> -Haiming
> 
> p.s. example data
> genbank record 1:
> LOCUS   scaffold_1 1000000 bp DNA HTG 8-FEB-2006
> DEFINITION  Fugu rubripes scaffold scaffold_1 FUGU4 partial sequence
> 1..1000000  reannotated via EnsEMBL
> ACCESSION   scaffold:FUGU4:scaffold_1:1:1000000:1
> ......
> //
> 
> genbank record 2:
> LOCUS  scaffold_1 1000000 bp DNA HTG 8-FEB-2006
> DEFINITION  Fugu rubripes scaffold scaffold_1 FUGU4 partial
> sequence1000001..2000000 reannotated via EnsEMBL
> ACCESSION   scaffold:FUGU4:scaffold_1:1000001:2000000:1
> ......
> //
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l





More information about the Bioperl-l mailing list