Back

Impact of different synonymous codon substitution strategies on SARS-CoV-2 nucleocapsid protein expression in Escherichia coli

Bello, A. J.; Omotuyi, A. O.; Oladapo, O. B.; Adekunle, A. A.; Udechime, K. U.; Akinwande, A. B.; Odewale, A. F.; Adamson, O. O.; Eban, D. O.; Folarin, O.; Okpuzor, J.; Minari, J. B.

2025-07-22 microbiology
10.1101/2024.11.06.622014 bioRxiv
Show abstract

Synonymous codon substitution, a gene engineering approach in synthetic biology, has been effective in improving the codon composition of recombinant genes of interest based on various criteria without altering the amino acid sequence. The SARS-CoV-2 virus nucleocapsid (N) protein is a stable, conserved and highly immunogenic that is less prone to mutation during infection, making it a key antigen in in vitro diagnosis, vaccine development, immunological and structural studies. While reports have focused on applying optimized N protein for different applications, the basic parameters used by different optimization tools for choosing the best approach for the N gene synonymous codon substitution are often neglected. Here, we analyzed the influence of different synonymous codon substitution strategies on SARS-CoV-2 N-protein expression in E. coli. Using different codon optimization (CO) and harmonization (CH) tools, we predicted and compared how parameters such as GC content, Codon Adaptation Index, codon quality and number of rare codons present in these sequences affect the N-protein expression. Our results also show that Minimum Free Energy (MFE) and RNA structure of N-term and C-tail of the N-protein coding sequence influence protein folding. We then predicted that the SR-rich region of the N-protein may contribute to slowing down the elongation rate during translation. This work presents a fundamental analysis of how different optimization tools affect SARS-CoV-2 N-protein expression and folding and suggests a basic approach to choosing the best strategy for optimal expression and folding of the protein for further studies. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=60 SRC="FIGDIR/small/622014v3_ufig1.gif" ALT="Figure 1"> View larger version (15K): org.highwire.dtl.DTLVardef@1fbd7f8org.highwire.dtl.DTLVardef@11fe81borg.highwire.dtl.DTLVardef@1bf787borg.highwire.dtl.DTLVardef@17edc57_HPS_FORMAT_FIGEXP M_FIG C_FIG HighlightsCodon substitution affects SARS-CoV-2 nucleocapsid (N) protein expression. SARS-CoV-2 N-protein expression varies with different codon optimization tools. RNA structures of N- and C-term impact RNA stability, protein expression and folding. SR-rich region of the N-protein may slow down elongation rate during translation.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Computational Biology and Chemistry
23 papers in training set
Top 0.1%
12.6%
2
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.2%
10.0%
3
ACS Omega
90 papers in training set
Top 0.1%
9.1%
4
PLOS Computational Biology
1633 papers in training set
Top 6%
6.3%
5
Pharmaceuticals
33 papers in training set
Top 0.2%
3.6%
6
PeerJ
261 papers in training set
Top 3%
3.6%
7
Biochemical and Biophysical Research Communications
78 papers in training set
Top 0.2%
2.6%
8
PLOS ONE
4510 papers in training set
Top 46%
2.4%
50% of probability mass above
9
Biochimie
23 papers in training set
Top 0.1%
2.1%
10
Molecules
37 papers in training set
Top 0.7%
1.8%
11
Scientific Reports
3102 papers in training set
Top 59%
1.7%
12
Viruses
318 papers in training set
Top 3%
1.7%
13
International Journal of Molecular Sciences
453 papers in training set
Top 8%
1.7%
14
Biochemistry
130 papers in training set
Top 0.9%
1.7%
15
Protein Science
221 papers in training set
Top 0.9%
1.7%
16
Journal of Molecular Biology
217 papers in training set
Top 2%
1.7%
17
Nucleic Acids Research
1128 papers in training set
Top 12%
1.5%
18
Virus Research
36 papers in training set
Top 0.6%
1.5%
19
The Journal of Physical Chemistry Letters
58 papers in training set
Top 1.0%
1.3%
20
ACS Synthetic Biology
256 papers in training set
Top 2%
1.2%
21
FEBS Letters
42 papers in training set
Top 0.2%
1.1%
22
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
1.1%
23
RNA Biology
70 papers in training set
Top 0.4%
0.9%
24
Biomedicines
66 papers in training set
Top 2%
0.9%
25
Frontiers in Microbiology
375 papers in training set
Top 8%
0.8%
26
Journal of Molecular Graphics and Modelling
16 papers in training set
Top 0.3%
0.7%
27
Chemical Communications
24 papers in training set
Top 1%
0.7%
28
F1000Research
79 papers in training set
Top 5%
0.7%
29
Frontiers in Bioengineering and Biotechnology
88 papers in training set
Top 3%
0.6%
30
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.6%