[Bioperl-l] modify sequence name

yang liu yang.liu0508 at gmail.com
Fri Mar 9 19:25:50 UTC 2012


Dear colleagues,



When I do Sanger sequencing, I get hundreds of sequences named by DNA
Numbers, and for several genes. I need to add taxon name manually for each
sequence. I wonder is there a way to change the names automatically?


I have two .txt files.

file 1, with seqeucens named by DNA Number:
>2863
AGGATTAAAAATCAACGCTATGAATCTGGTGTAATTCCATATGCTAAAATGGGCTATTGGGATCCTAATT
ATGCAATTAAAGAAACTGATGTATTAGCATTATTTC

>2864
AGGATTAAAAATCAACGCTATGAATCTGGTGTAATTCCATATGCTAAAATGGGCTATTGGGATCCTAATT
ATGCAATTAAAGAAACTGATGTATTAGCATTATTTCGTATTACTCCACAACCAGGTGTAGAT
........


file 2, with DNA Number and taxa names, seperated by tabs
2863 Gelidium
2864 Poa
........

I hope the final file to be like this,
>Gelidium-2863
AGGATTAAAAATCAACGCTATGAATCTGGTGTAATTCCATATGCTAAAATGGGCTATTGGGATCCTAATT
ATGCAATTAAAGAAACTGATGTATTAGCATTATTTC

>Poa-2864
AGGATTAAAAATCAACGCTATGAATCTGGTGTAATTCCATATGCTAAAATGGGCTATTGGGATCCTAATT
ATGCAATTAAAGAAACTGATGTATTAGCATTATTTCGTATTACTCCACAACCAGGTGTAGAT
Any ideas? Anything help would be appreciated.

Yang.



More information about the Bioperl-l mailing list