[Biopython] CAI confusion
Björn Johansson
bjorn_johansson at bio.uminho.pt
Sun Feb 16 19:23:45 UTC 2014
Hi,
I am trying to use the Bio.SeqUtils.CodonUsage module to calculate CAI for
S. cerevisiae genes.
Biopython comes with the SharpEcoliIndex from
Bio.SeqUtils.CodonUsageIndices, but none for S. cerevisiae.
I found one here:
http://downloads.yeastgenome.org/unpublished_data/codon/s_cerevisiae-codonusage.txt
and here:
http://downloads.yeastgenome.org/unpublished_data/codon/ysc.orf.cod
I parsed the first table which have the following format, unfortunately w/o
headers:
Gly GGG 17673 6.05 0.12
Gly GGA 32723 11.20 0.23
Gly GGT 66198 22.66 0.46
Gly GGC 28522 9.76 0.20
Glu GAG 57046 19.52 0.30
...
ĩ believe the last column is the fraction. I think biopython expects
instead relative adaptedness w as indata for each codon.
<https://paperpile.com/shared/V6j5yN>see
http://www.ncbi.nlm.nih.gov/pubmed/3547335
How do I calculate w from the frequency? Are there any examples or
code avaliable? I googled, but could not find anything.
grateful for help!
/bjorn
--
______O_________oO________oO______o_______oO__
Björn Johansson
Assistant Professor
Departament of Biology
University of Minho
Campus de Gualtar
4710-057 Braga
PORTUGAL
www.bio.uminho.pt
Google profile <https://profiles.google.com/bjornjobb>
Google Scholar Profile<http://scholar.google.com/citations?user=7AiEuJ4AAAAJ>
my group <https://sites.google.com/site/metabolicengineeringgroup/>
Office (direct) +351-253 601517 | (PT) mob. +351-967 147 704 | (SWE) mob.
+46 739 792 968
Dept of Biology (secr) +351-253 60 4310 | fax +351-253 678980
More information about the Biopython
mailing list