[Biojava-l] How to read a protein clustal alignment?
Thomas Down
td2@sanger.ac.uk
Mon, 1 Oct 2001 16:23:36 +0100
On Mon, Oct 01, 2001 at 12:15:37PM -0300, Damian D'Onia wrote:
> Hi,
>
> I'm trying to read a clustal file (.ALN) that contains protein
> sequences. Now I have a subclass of DefaultHandler which can read DNA
> sequences from .aln files, but I don't know how to fill a SymbolList from
> String.
>
> But, ProteinTools don't have a method like createDNA.
What version of BioJava are you using? Certainly, the current
CVS version has a method ProteinTools.createProtein(String), which
seems to do what you want.
However, it would seem cleaner to remove references to particular
alphabets from your code completely. You can do something like:
Alphabet alpha = DNATools.getDNA();
SymbolParser parser = alpha.getParser("token");
SymbolList sl = parser.parse(someString);
This is effectively what's happening inside DNATools.createDNA,
and means you can just parameterize your code with a particular
Alphabet object.
Hope this helps,
Thomas.