Back

Third-nucleotide codon bias and synonymous codon bias define functional translational programs that shape human tissue and cancer proteomes.

Rashad, S.; Niizuma, K.

2025-11-03 genomics
10.1101/2025.10.31.685942 bioRxiv
Show abstract

BackgroundCodon usage bias is a universal feature of the genetic code, yet how synonymous codon bias or third-nucleotide codon bias (A/T-vs G/C-ending) shape translation and proteome composition across tissues and cancer remain unclear. ResultsUsing comparative genomics between human and rodent coding sequences, we uncovered a conserved codon-bias axis. A/T-ending codons consistently marked genes involved in proliferation and RNA processing, whereas G/C-ending codons were enriched for differentiation and neuronal functions. While GC3 scores, measuring the third-nucleotide codon bias, showed differences between humans and rodents due to recombination events, the functional dichotomy was conserved. Isoacceptors frequencies, measuring gene synonymous codon bias, was conserved from rodents to humans. Synonymous codons exhibited distinct functional enrichment patterns, demonstrating functional divergence at the codon level. Two new indices; the ANN-index and mG-index, reflecting codons decoded by the tA and mG tRNA modifications, linked tRNA modification biology to translation. Both indices correlated with proliferative, A/T-biased programs, providing a universal basis for their roles in cancer. Tissue proteomes showed strong RNA-protein discordance and distinct codon biases. Analysis of 21 cancer types revealed a global A/T-ending codon bias in cancer. Analysis of 2,600 cancer cell lines revealed codon bias heterogeneity in cell lines from the same cancer subtype that is not observable between cancer patients. ConclusionsOur results define synonymous codon divergence and tRNA-modification indices as determinants of translational reprogramming. This work establishes a unified framework connecting codon usage, tRNA modifications, and proteome remodeling, providing a basis for rational design of mRNA and gene therapeutics. Graphical abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=80 SRC="FIGDIR/small/685942v1_ufig1.gif" ALT="Figure 1"> View larger version (35K): org.highwire.dtl.DTLVardef@161ca0borg.highwire.dtl.DTLVardef@117c3f8org.highwire.dtl.DTLVardef@142d26borg.highwire.dtl.DTLVardef@483f6_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.1%
26.0%
2
Genome Biology
555 papers in training set
Top 0.6%
8.3%
3
Nucleic Acids Research
1128 papers in training set
Top 3%
6.9%
4
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.6%
3.6%
5
Cell Genomics
162 papers in training set
Top 1%
3.6%
6
Scientific Reports
3102 papers in training set
Top 42%
2.9%
50% of probability mass above
7
Frontiers in Genetics
197 papers in training set
Top 3%
2.8%
8
PLOS Computational Biology
1633 papers in training set
Top 13%
2.4%
9
BMC Biology
248 papers in training set
Top 0.6%
2.4%
10
Nature Communications
4913 papers in training set
Top 46%
2.1%
11
Bioinformatics
1061 papers in training set
Top 6%
2.1%
12
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 3%
2.1%
13
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 31%
1.8%
14
iScience
1063 papers in training set
Top 13%
1.8%
15
Cell Reports
1338 papers in training set
Top 23%
1.7%
16
Communications Biology
886 papers in training set
Top 8%
1.7%
17
PLOS ONE
4510 papers in training set
Top 54%
1.7%
18
RNA Biology
70 papers in training set
Top 0.2%
1.7%
19
Molecular Systems Biology
142 papers in training set
Top 0.7%
1.5%
20
BMC Bioinformatics
383 papers in training set
Top 5%
1.4%
21
PLOS Biology
408 papers in training set
Top 15%
1.0%
22
Genomics
60 papers in training set
Top 2%
1.0%
23
eLife
5422 papers in training set
Top 53%
0.9%
24
PLOS Genetics
756 papers in training set
Top 14%
0.8%
25
Genes
126 papers in training set
Top 3%
0.7%
26
PeerJ
261 papers in training set
Top 17%
0.7%
27
NAR Cancer
36 papers in training set
Top 0.2%
0.7%
28
Molecular Biology and Evolution
488 papers in training set
Top 5%
0.7%
29
EMBO reports
136 papers in training set
Top 8%
0.5%
30
Nature Structural & Molecular Biology
218 papers in training set
Top 6%
0.5%