[Biopython] Biopython function for operating multiple homologous sequences in a single file
Edson Ishengoma
ishengomae at nm-aist.ac.tz
Tue Jan 21 11:42:58 UTC 2014
Hi all,
I have a single large file containing many (thousands) coding sequence
pairs according to their homologs as so:
> >ENSBTAT00000048342_species1
> sequences
> >ENSBTAT00000048342_species2
> sequences
> >ENSBTAT00000009085_species1
> sequences
> >ENSBTAT00000009085_species2
> sequences
> >ENSBTAT00000009212_species1
> sequences
> >ENSBTAT00000009212_species2
> sequences
> ......
> ......
> ......
>
Now I want to produce a clustalw alignment for each cds pair. Is there a
way to use the biopython commandline function for clustalw to treat each
gene pair separately for all pairs, run alignment and produce an ouput
(alignments + trees file)?
I appreciate your time and look forward to hear from you,
With regards,
Edson B. Ishengoma
PhD-Candidate
*School of Life Sciences and Engineering
Nelson Mandela African Institute of Science and Technology
Nelson Mandela Road
P. O. Box 447, Arusha
Tanzania (255)
*
*ishengomae at nm-aist.ac.tz *ebarongo82 at yahoo.co.uk
*
<http://www.nm-aist.ac.tz/>Mobile: +255 762 348 037, +255 714 789 360,
Website: www.nm-aist.ac.tz
Skype: edson.ishengoma
*
*
**
More information about the Biopython
mailing list