[Biopython] Request from help

Paulo Nuin nuin at genedrift.org
Thu Apr 11 17:05:57 UTC 2013


On 2013-04-11, at 12:49 PM, Eric Ma <ericmajinglong at gmail.com> wrote:

> Hello everybody,
> 
> I'm new to the mailing list here, though I've been playing with BioPython
> for quite a while.
> 
> I'm having some trouble here. I wanted to display a tree of sequences for
> which I had done a multiple sequence alignment. I tried going through the
> pipeline example here (http://biopython.org/wiki/Phylo#Example_pipeline).
> Because I'm still in the testing phase, instead of writing it as a single
> script, I wrote it as a series of scripts that I would execute in order.
> 
> The problem I run into is at step 4 in the example, where I "feed the
> alignment to PhyML". My data set is 70 protein sequences, and the trouble I
> run into is that it takes a very, very long time at the "feeding alignment
> to PhyML" step. I tried running the script on my MacBook Pro overnight, and
> even the next morning it was not done. Am I missing something here?
> 

Hi

With 70 OTUs you have 5.00 E115 possible trees. Guaranteed it will take a long time, independent to what parameters you are using in PhyML. Try with a smaller number of taxa, just for testing purposes and depending on the complexity of your protein phylogeny, give your computer some weeks to actually generate some result.

This is not a BioPython issue, is more a phylogenetics one.

Cheers
Paulo





> Just to be clear here, aligning the sequences using Muscle was successful,
> and I also managed to output a distance matrix from sample to sample, which
> I used in another downstream pipeline to display the clustering of the
> sequences on a 2D euclidean plane. However, I wanted to have a tree
> representation to validate the clustering results; the trouble is, I can't
> get the _phyml_tree.txt file to be created, which I would then use to draw
> the tree.
> 
> Thanks in advance for any help!
> 
> Cheers,
> Eric
> -----------------------------------------------------------------------
> Please consider the environment before printing this e-mail. Do you really
> need to print it?
> 
> http://about.me/ericmjl
> _______________________________________________
> Biopython mailing list  -  Biopython at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/biopython





More information about the Biopython mailing list