[Biopython] Divergent sequence data set

Christian Schäfer schafer at rostlab.org
Wed Nov 18 13:14:06 UTC 2009


There are stand-alone tools such as CD-HIT or UniqueProt for creating
sequence-unique subsets at a chosen identity threshold. If you want to
use them from within your Python code, it's easy to run them via
commands.getoutput() (or the subprocess module) and then parse the
result.
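
As a rough sketch of that approach, assuming CD-HIT is installed and its
executable is on your PATH as "cd-hit": the function names below are made
up for illustration, and only the -i, -o and -c options are used. CD-HIT
writes the representative sequences to the output file and a cluster
listing to a second file with ".clstr" appended.

```python
import subprocess


def run_cd_hit(fasta_in, out_prefix, identity=0.9, cd_hit="cd-hit"):
    """Cluster fasta_in with CD-HIT at the given identity threshold.

    Assumes the cd-hit executable is installed and reachable as `cd_hit`.
    Lower thresholds may also need a smaller word size (-n); check the
    CD-HIT user guide before going much below the default.
    """
    cmd = [cd_hit, "-i", fasta_in, "-o", out_prefix, "-c", str(identity)]
    subprocess.run(cmd, check=True, stdout=subprocess.DEVNULL)
    # Representatives (FASTA) and the cluster listing written by CD-HIT.
    return out_prefix, out_prefix + ".clstr"


def parse_clstr(lines):
    """Return {cluster_id: (representative, [other members])}."""
    clusters = {}
    current = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">Cluster"):
            current = int(line.split()[1])
            clusters[current] = [None, []]
            continue
        # Member lines look like: "0<TAB>120aa, >seqA... *"
        name = line.split(">", 1)[1].split("...", 1)[0]
        if line.endswith("*"):
            clusters[current][0] = name  # "*" marks the representative
        else:
            clusters[current][1].append(name)
    return {cid: (rep, members) for cid, (rep, members) in clusters.items()}
```

Typical use would be run_cd_hit("input.fasta", "nr70", identity=0.7),
then opening the ".clstr" file and passing it to parse_clstr(). The
representatives file is ordinary FASTA, so it can be read back with
Bio.SeqIO.parse(path, "fasta"). Note that sequence names in the ".clstr"
file may be truncated, so matching on the FASTA output is safer.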

Chris

Animesh Agrawal wrote:
> Hi,
> 
> I have been trying to develop a divergent sequence data set for a
> phylogenetic analysis. Is there something in Biopython where, for a given
> set of sequences, we can choose an identity threshold to reduce redundancy
> in the dataset?
> 
>  
> 
> Cheers,
> 
> Animesh 
> 
> _______________________________________________
> Biopython mailing list  -  Biopython at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/biopython


