[Biopython-dev] Clustal alignment format header line
Cymon Cox
cy at cymon.org
Tue May 12 11:07:59 UTC 2009
Both Muscle (-clw) and ProbCons (-clustalw) output a program-specific
header line for the Clustal format alignment:
"MUSCLE (3.7) multiple sequence alignment
AK1H_ECOLI/1-378 CPDSINAALICRGEKMSIAIMAGVLEAR etc"
"PROBCONS version 1.12 multiple sequence alignment
AK1H_ECOLI/1-378 CPDSINAALICRGEKMSIAIMA etc"
Bio.AlignIO will not read these alignments, because the Clustal parser
requires the first line to start with "CLUSTAL":
Bio/AlignIO/ClustalIO.py:94
    if line[:7] != 'CLUSTAL':
        raise ValueError("Did not find CLUSTAL header")
Muscle does have a -clwstrict flag (which writes a standard CLUSTAL W
header), but ProbCons doesn't.
Would it be a good idea to relax the header parsing?
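Something along these lines is what I have in mind. This is only a rough
sketch, not a patch against ClustalIO.py: the names KNOWN_HEADERS and
check_header are made up for illustration, and the list of accepted
prefixes is just the two programs mentioned above plus CLUSTAL itself.

```python
# Hypothetical helper, not part of Biopython. It keeps the existing
# prefix test but accepts the MUSCLE and PROBCONS header lines as well.
KNOWN_HEADERS = ("CLUSTAL", "MUSCLE", "PROBCONS")


def check_header(line):
    """Return the program name the header line starts with.

    Raises ValueError if the line starts with none of the known names,
    mirroring the behaviour of the current strict check.
    """
    for name in KNOWN_HEADERS:
        if line.startswith(name):
            return name
    raise ValueError(
        "Did not find a known header (expected one of: %s)"
        % ", ".join(KNOWN_HEADERS)
    )


if __name__ == "__main__":
    # The two header lines quoted earlier in this message.
    print(check_header("MUSCLE (3.7) multiple sequence alignment"))
    print(check_header("PROBCONS version 1.12 multiple sequence alignment"))
```

Keeping an explicit list of known names still rejects files that are
clearly not Clustal-style alignments, at the cost of needing an update
whenever another program writes its own header.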
C.