Back

Evaluating codon optimization strategies for mammalian glycoprotein production with an open-source expression vector

Yang, C.; Soni, R.; Visconti, S. E.; Abdollahi, M.; Belay, F.; Ghosh, A.; Duvall, S. W.; Walton, C. J. W.; Meijers, R.; Zhu, H.

2026-03-20 molecular biology
10.64898/2026.03.18.712111 bioRxiv
Show abstract

Efficient production of human proteins for the development of tool compounds and biologics depends on a detailed understanding of the protein expression machinery in mammalian cells. Codon optimization is widely believed to enhance protein yield, yet its impact in homologous mammalian systems remains poorly defined. Here, we systematically compare five codon usage strategies reflecting common assumptions about rare codons, RNA stability, and synthesis efficiency. We developed pTipi, an efficient open-source mammalian expression vector, and evaluated its performance in antibody production. We generated plasmids for common epitope tag antibodies such as V5, anti-biotin and anti-His for distribution by Addgene. To compare codon usage schemes, we performed a bake-off of 18 human and murine Wnt pathway glycoproteins in mammalian cells. Small-scale expression screens revealed that codon optimization did not provide a general advantage over native coding sequences, while strategies prioritizing RNA stability consistently reduced expression. Interestingly, a skewed codon scheme using the most abundant codons produced yields comparable to native sequences and occasionally enhanced protein output. To enable flexible evaluation of codon strategies, we implemented a Golden Gate-compatible pTipi platform for efficient synthetic gene incorporation. We conclude that native codons are sufficient for robust homologous mammalian expression of glycoproteins, while selective codon skewing can be beneficial for some targets.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Frontiers in Bioengineering and Biotechnology
88 papers in training set
Top 0.1%
18.5%
2
ACS Synthetic Biology
256 papers in training set
Top 0.3%
17.4%
3
Protein Science
221 papers in training set
Top 0.1%
8.4%
4
PLOS ONE
4510 papers in training set
Top 28%
6.3%
50% of probability mass above
5
Nucleic Acids Research
1128 papers in training set
Top 6%
3.6%
6
Scientific Reports
3102 papers in training set
Top 37%
3.6%
7
ACS Omega
90 papers in training set
Top 0.6%
3.6%
8
mAbs
28 papers in training set
Top 0.1%
3.2%
9
Cell Reports Methods
141 papers in training set
Top 1%
3.1%
10
Journal of Molecular Biology
217 papers in training set
Top 1%
2.1%
11
Protein Expression and Purification
11 papers in training set
Top 0.1%
1.9%
12
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.7%
13
BMC Bioinformatics
383 papers in training set
Top 6%
0.9%
14
Journal of Biological Chemistry
641 papers in training set
Top 3%
0.9%
15
Nature Communications
4913 papers in training set
Top 61%
0.8%
16
Bioconjugate Chemistry
17 papers in training set
Top 0.2%
0.8%
17
BMC Biotechnology
10 papers in training set
Top 0.1%
0.7%
18
Current Protocols
13 papers in training set
Top 0.2%
0.7%
19
International Journal of Biological Macromolecules
65 papers in training set
Top 4%
0.7%
20
FEBS Open Bio
29 papers in training set
Top 0.6%
0.7%
21
Biomacromolecules
25 papers in training set
Top 0.5%
0.7%
22
iScience
1063 papers in training set
Top 35%
0.7%
23
Open Biology
95 papers in training set
Top 3%
0.6%
24
FEBS Letters
42 papers in training set
Top 0.5%
0.6%
25
PeerJ
261 papers in training set
Top 18%
0.6%