[Biojava-l] Cookbook entry - feedback please
Andreas Prlic
andreas at sdsc.edu
Thu Dec 22 15:26:44 UTC 2011
> A question concerning consensus sequences: how do you handle gaps? N
> (as I understood it) does not allow a gap. Is there a way to encode a
> "might be gap here, or A or G"?
If the input is a multiple sequence alignment then you could count
frequencies at each position and take the most frequently occurring
nucleotide. For each position you could count a conservation score.
Andreas
More information about the Biojava-l
mailing list