[Biojava-l] Ambiguity consensus string search
Charles Danko
dankoc at gmail.com
Sun Mar 11 20:22:48 UTC 2007
Hi,
I'm trying to search a Sequence or SymbolList object for a consensus
sequence that contains IUPac ambiguity codes.
Without ambiguity codes, I could write a function that breaks the sequence
into "windows" the size of the consensus, and checks each window for a
match. Am I missing a simple function that does this for me?
Next, adding ambiguity codes ... do I have to define my own alphabet for the
DNA IUPac codes, or are these already included in the distribution
somewhere?
I have found the weight matrix class, and realize that I could create one of
these objects and calculate a threshold that will work in the same manner as
a consensus, but this seems like a bit of a hack for some functionality I am
most likely overlooking.
Thanks very much!
Charles
More information about the Biojava-l
mailing list