[Biopython] Calculating NS and S over a given sequence

Riesgo Ferreiro, Pablo Pablo.RiesgoFerreiro at TrOn-Mainz.DE
Thu Sep 30 04:07:53 EDT 2021


Hi all,





I am new to this mailing list. First of all, many thanks for your work; I have happily used Biopython in several projects before.



I need to compute the dN/dS ratio over a set of samples of the same species. I know this is not ideal (see doi:10.1371/journal.pgen.1000304), but still. I have found a Biopython function that calculates dN/dS between two sequences: https://biopython.org/docs/1.76/api/Bio.codonalign.codonseq.html#Bio.codonalign.codonseq.cal_dn_ds, but it does not cover my needs.



What I need is to compute dN/dS from the counts of mutations observed over a set of samples, as explained at https://bioinformatics.cvr.ac.uk/calculating-dnds-for-ngs-datasets/
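To make the idea concrete, here is a minimal sketch of the count-based ratio. It assumes the uncorrected form, (n / N) / (s / S), where n and s are the observed nonsynonymous and synonymous mutation counts and N and S are the numbers of nonsynonymous and synonymous sites in the reference. The function name `dn_ds_from_counts` is hypothetical and not part of Biopython, and the linked page may apply further corrections that are not shown here.

```python
import math


def dn_ds_from_counts(n_nonsyn, n_syn, n_sites, s_sites):
    """Return (pN, pS, pN/pS) from mutation counts and site counts.

    Assumption: uncorrected proportions, with no multiple-hit correction.
    n_nonsyn, n_syn: observed nonsynonymous / synonymous mutation counts.
    n_sites, s_sites: nonsynonymous / synonymous sites in the reference.
    """
    if n_sites <= 0 or s_sites <= 0:
        raise ValueError("site counts must be positive")
    p_n = n_nonsyn / n_sites
    p_s = n_syn / s_sites
    # The ratio is undefined when no synonymous mutation was observed.
    ratio = p_n / p_s if p_s > 0 else math.nan
    return p_n, p_s, ratio
```

For example, `dn_ds_from_counts(6, 3, 300.0, 100.0)` gives a ratio of about 0.67, since 6/300 = 0.02 and 3/100 = 0.03.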



[Inline image not preserved in the archive]



N and S depend on the reference sequence and are independent of the samples. N and S can be calculated over different genomic regions (e.g. coding region, transcript, exon, domain). The simplest input for this tool would be a given ORF sequence; more complete inputs, such as a GFF file, could be considered later.
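As an illustration of why N and S depend only on the reference, here is a rough sketch that counts them for a single ORF. It assumes a Nei-Gojobori style site count with the standard genetic code: for each codon position, the fraction of the three possible single-base changes that keep the amino acid is added to S, and N is the remainder. Changes that create a stop codon are counted as nonsynonymous, and stop or ambiguous codons in the reference are skipped; both are choices made for this sketch. The name `count_n_s_sites` is hypothetical and not an existing Biopython function.

```python
BASES = "TCAG"
# Standard genetic code, in the order produced by iterating TCAG at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def count_n_s_sites(orf):
    """Return (N, S): nonsynonymous and synonymous site counts for an ORF."""
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    s_sites = 0.0
    n_codons = 0
    for start in range(0, len(orf), 3):
        codon = orf[start:start + 3]
        amino_acid = CODON_TABLE.get(codon)
        if amino_acid is None or amino_acid == "*":
            continue  # assumption: skip ambiguous and stop codons
        n_codons += 1
        for pos in range(3):
            synonymous = 0
            for base in BASES:
                if base == codon[pos]:
                    continue
                mutant = codon[:pos] + base + codon[pos + 1:]
                # A change to a stop codon never matches, so it counts as nonsynonymous.
                if CODON_TABLE[mutant] == amino_acid:
                    synonymous += 1
            s_sites += synonymous / 3
    n_sites = 3 * n_codons - s_sites
    return n_sites, s_sites
```

The same function could be applied per region (exon, domain, and so on) by passing the corresponding in-frame subsequence, since nothing in it depends on the samples.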



It is a small thing, but unless anyone knows of an existing implementation, I think it may be useful to others. Do you think this would be a valuable contribution to Biopython?







Best wishes,

Pablo Riesgo Ferreiro
Computational Medicine


TRON
Translationale Onkologie an der Universitätsmedizin der
Johannes Gutenberg-Universität Mainz gemeinnützige GmbH

-------------- next part --------------
A non-text attachment was scrubbed...
Name: Screenshot from 2021-09-30 09-55-58.png
Type: image/png
Size: 100309 bytes
Desc: Screenshot from 2021-09-30 09-55-58.png
URL: <http://mailman.open-bio.org/pipermail/biopython/attachments/20210930/96484e77/attachment-0001.png>

