[Biopython] bio.motifs P-value on pssm searches
Bartek Wilczynski
bartek at rezolwenta.eu.org
Thu Nov 14 13:14:00 UTC 2013
Dear Marco,
the score you mention is in fact a log-odds score. it represents a
logarithm of the ratio between the probability of the sequence in question
being generated from the motif or from a random generator.
If you want to get some analog of a p-value (the probability of obtaining a
score of x or higher), you need to look into the score distributions in the
thresholds package. For example if you want to know what score corresponds
to a p-value of 0.05 for motif M you can do
thresholds.ScoreDistribution(M).threshold_fpr(0.05)
Please remember that the thresholds are computed approximately to a given
precision (in the scoreDistribution constructor).
Naturally, if you are searching in a sequence of length 1000, you should
expect ~20 cases, for this given fpr.
Hope that helps
Bartek
On Thu, Nov 14, 2013 at 1:30 PM, Marco Galardini
<marco.galardini at unifi.it>wrote:
> Dear biopythoners,
>
> the Bio.motifs search of PSSM is a really effective tool when dealing with
> regulatory motifs. When searching a pssm in a DNA sequence, a bit score is
> associated with each position; I was wondering if you have any gotchas to
> obtain a P- or E-value from such scores. I couldn't find any method in the
> package that does that but maybe I've missed something.
>
> Thanks for your help,
> Marco
>
> --
> -------------------------------------------------
> Marco Galardini, PhD
> Dipartimento di Biologia
> Via Madonna del Piano, 6 - 50019 Sesto Fiorentino (FI)
>
> e-mail: marco.galardini at unifi.it
> www: http://www.unifi.it/dblage/CMpro-v-p-51.html
> phone: +39 055 4574737
> mobile: +39 340 2808041
> -------------------------------------------------
>
> _______________________________________________
> Biopython mailing list - Biopython at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/biopython
>
>
--
Bartek Wilczynski
==================
Institute of Informatics
University of Warsaw
http://www.mimuw.edu.pl/~bartek
More information about the Biopython
mailing list