[Biojava-l] position weight matrix
Schreiber, Mark
mark.schreiber at agresearch.co.nz
Wed Sep 17 23:30:25 EDT 2003
Brian,
I think we should change WeightMatrix so that N only scores a quater match of A where as R would score a half match.
Do others feel this is sensible?
- Mark
-----Original Message-----
From: Brian Cox [mailto:cox at mshri.on.ca]
Sent: Thu 18/09/2003 2:35 p.m.
To: Schreiber, Mark
Cc:
Subject: RE: [Biojava-l] position weight matrix
Thanks,
That sounds like what I was looking for, I wanted to penalize the use of an
N in the sequence. Not sure yet how to implement this but I'll give it a
shot.
thanks for the reply,
BRian
-----Original Message-----
From: Schreiber, Mark
To: Brian Cox; biojava-l at biojava.org
Sent: 9/17/03 7:45 PM
Subject: RE: [Biojava-l] position weight matrix
Hi Brian,
Technically this is correct as N or X do actually match everything. Are
wanting to rule out any motif with an N or are you wanting to penalize a
motif with an N (or other ambiguity)?
If you are working with DNA you could use
org.biojava.bio.seq.NucleotideTools, this class can be used to access
the nucleotide alphabet that treats all symbols as Atomic, even if they
are normally IUPAC ambiguity symbols. If you did this and set the weight
of N in the marix to 0.0 it would exclude those motifs.
- Mark
-----Original Message-----
From: Brian Cox [mailto:cox at mshri.on.ca]
Sent: Friday, 12 September 2003 11:07 a.m.
To: biojava-l at biojava.org
Subject: [Biojava-l] position weight matrix
I wrote a program to find TF binding sites using a
WeightMatrixAnnotator, but when I try to annotate a sequence if the
sequences has any N or X then everything matches. How do I get the
WeightMatrixAnnotator to ignore the Ns or Xs?
thanks,
Brian Cox
Samuel Lunenfeld Research Institute
Mount Sinai Hospital, Rm 884
Toronto, Ontario
Canada
416-586-8266
=======================================================================
Attention: The information contained in this message and/or attachments
from AgResearch Limited is intended only for the persons or entities
to which it is addressed and may contain confidential and/or privileged
material. Any review, retransmission, dissemination or other use of, or
taking of any action in reliance upon, this information by persons or
entities other than the intended recipients is prohibited by AgResearch
Limited. If you have received this message in error, please notify the
sender immediately.
=======================================================================
=======================================================================
Attention: The information contained in this message and/or attachments
from AgResearch Limited is intended only for the persons or entities
to which it is addressed and may contain confidential and/or privileged
material. Any review, retransmission, dissemination or other use of, or
taking of any action in reliance upon, this information by persons or
entities other than the intended recipients is prohibited by AgResearch
Limited. If you have received this message in error, please notify the
sender immediately.
=======================================================================
More information about the Biojava-l
mailing list