[BioPython] How to Bio.cluster Module
Bruno Santos
bsantos at biocant.pt
Thu Mar 27 17:25:52 UTC 2008
Hi,
This question is a little bit more generic so I really dont know if anyone
in the mailing may help me.
I have a fasta file with thousands of reads obtained by a sequencing run, in
this fasta file I know I have several copies of the same sequences but their
size and some nucleotides inside it can change. So I need to group them
together using clustering so then I can create a consensus sequence for each
group.
I am trying to achieve this by align all the sequences using clustalw-mpi
and the I run dnadist from phylip to obtain a matrix of distances between
the sequences. Now I need to use clustering to group the sequences based on
these values and for that I am trying to use Bio.cluster to achieve this.
Can anyone help me to choose the clustering method I should use and how can
I submit this kind of data to that method?
Sincerely,
Bruno Santos
logo_biocant
Bioinformatics Unit
Biocantpark, Núcleo 04, Lote 3
3060-197 Cantanhede
Tel: 231 410 892 E-mail: bsantos at biocant.pt
http://bioinformatics.biocant.pt <http://bioinformatics.biocant.pt/>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.gif
Type: image/gif
Size: 2271 bytes
Desc: not available
URL: <http://lists.open-bio.org/pipermail/biopython/attachments/20080327/c6f92584/attachment-0002.gif>
More information about the Biopython
mailing list