[Biopython-dev] Normalization of Distance Matrices
Jordan Willis
jwillis0720 at gmail.com
Thu Oct 15 06:02:47 UTC 2015
Hi,
I have a question about the pairwise distance calculation in the DistanceCalculator under Phylo.TreeConstruction.
It pertains to the normalization.
Line 456.
return 1 - (score * 1.0 / max_score)
I understand that the goal here is to have a distance between 0 and 1. But couldn’t this be greater than 1 if say score was negative as is allowed in PAM and BLOSUM lookup matrices. Perhaps the entire distance matrix should be computed, then a minimum and maximum derived, and then normalized between 0 and 1 rather than do it on a pairwise basis. What do you think?
Jordan
PS.
I apologize if you are seeing this in duplicate. I sent it to the regular list instead of the dev list which seems more applicable.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.open-bio.org/pipermail/biopython-dev/attachments/20151014/87037b8a/attachment.html>
More information about the Biopython-dev
mailing list