[Bioperl-l] Bio::SimpleAlign - Meaning of overall_percentage_identity?

Dave Messina David.Messina at sbc.su.se
Thu Nov 3 09:22:21 EDT 2011


Hi Giuseppe,

> If I understand correctly, the method works by considering only aminoacids
> that are identical over all the members of the alignment


Yes.


> , and then averaging over the total number of aminoacids in the sequence.
> Is this correct?
>

Almost.

By default, the denominator is the alignment length, i.e. the length of
the MSA including gaps. With the 'short' or 'long' option, the denominator
is instead the ungapped length of the shortest or the longest sequence in
the alignment, respectively.
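
To make the arithmetic concrete, here is a minimal sketch in Python. It is
not BioPerl's code, only an illustration of the calculation described above.
The function name, the gap characters ('-' and '.'), and the use of 'align'
as the name of the default measure are assumptions made for the example.

```python
def overall_percentage_identity(seqs, length_measure="align"):
    """Percent of columns that are identical across all aligned sequences.

    seqs: list of equal-length aligned strings (gaps as '-' or '.').
    length_measure: 'align' (alignment length, gaps included),
                    'short' (ungapped length of the shortest sequence),
                    'long'  (ungapped length of the longest sequence).
    """
    if not seqs:
        raise ValueError("alignment is empty")
    aln_len = len(seqs[0])
    if any(len(s) != aln_len for s in seqs):
        raise ValueError("all sequences must have the same aligned length")

    gap_chars = set("-.")

    # Numerator: columns where every sequence has the same residue.
    # A column that contains a gap never counts as identical.
    identical = 0
    for column in zip(*seqs):
        first = column[0].upper()
        if first not in gap_chars and all(c.upper() == first for c in column):
            identical += 1

    # Denominator: depends on the chosen length measure.
    ungapped = [sum(1 for c in s if c not in gap_chars) for s in seqs]
    if length_measure == "align":
        denominator = aln_len
    elif length_measure == "short":
        denominator = min(ungapped)
    elif length_measure == "long":
        denominator = max(ungapped)
    else:
        raise ValueError("length_measure must be 'align', 'short' or 'long'")

    if denominator == 0:
        raise ValueError("denominator is zero (no residues to compare)")
    return 100.0 * identical / denominator


# Toy alignment: 7 columns, 3 of them identical in all sequences (M, L, V).
# Ungapped lengths are 5, 6 and 5.
toy = ["MKT-LV-",
       "MKTALV-",
       "MR--LVA"]

by_alignment = overall_percentage_identity(toy)           # 3 / 7
by_shortest = overall_percentage_identity(toy, "short")   # 3 / 5
by_longest = overall_percentage_identity(toy, "long")     # 3 / 6
```

The numerator is the same in all three cases; only the denominator changes,
so 'short' always gives the highest value and the default gives the lowest.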


Dave

