[Biojava-l] handling gap symbols
Wim De Smet
Wim.DeSmet at UGent.be
Thu May 20 14:58:55 UTC 2010
Hello all,
I've been trying to figure out how to determine the location of gap
symbols in an alignment, but I keep running into trouble determining
what is a gap symbol. Apparently there are two different possible gap
symbols and they can both appear in the same alignment?
An example might make it clearer, suppose I perform the following
alignment (matrix is the EDNA matrix):
SequenceAlignment aligner = new NeedlemanWunsch((short) 0, (short) 3,
(short) 10, (short) 10, (short) 1, matrix);
Sequence first = DNATools.createDNASequence("ACT", "query");
Sequence second = DNATools.createDNASequence("AACTA", "target");
Alignment alignment = aligner.getAlignment(first, second);
And Obtain the symbollist for "query", which should look like "-ACT-", I
get the following Symbols:
AlphabetManager$GapSymbol
AlphabetManager$WellKnownAtomicSymbol
AlphabetManager$WellKnownAtomicSymbol
AlphabetManager$WellKnownAtomicSymbol
AlphabetManager$WellKnownGapSymbol
AlphabetManager.getGapSymbol() returns AlphabetManager$GapSymbol, while
symbolList.getAlphabet().getGapSymbol() returns
AlphabetManager$WellKnownGapSymbol. Am I supposed to test against both
or is there a bug here somewhere? I'm using biojava 1.7.1.
regards,
Wim
--
Wim De Smet
http://www.straininfo.net/
More information about the Biojava-l
mailing list