Back

Genetic code expansion enables programmable covalent protein design

de Puig, H.; Kuru, E.; Moret, M.; Flores, A.; Karunakaran, S.; Sayfullina, D.; Rout, S.; Escobedo-Lucea, C.; Collins, J. J.; Church, G. M.

2026-05-16 synthetic biology
10.64898/2026.05.15.725538 bioRxiv
Show abstract

Covalent chemistry has transformed small-molecule drug discovery, yet analogous strategies for proteins remain largely inaccessible because covalent warheads cannot be readily integrated into biologics. Conventional genetic code expansion requires engineering a dedicated aminoacyl-tRNA synthetase for each new amino acid, rendering broad warhead screening impractical. Here we introduce AminoX, a platform that bypasses this limitation through direct tRNA acylation, enabling site-specific incorporation of chemically diverse non-standard amino acids (nsAAs), including covalent warhead nsAAs compatible with scalable biologic manufacturing and multifunctional nsAAs. Using a pooled mRNA display workflow, we screened more than 2,000 warhead-position combinations in machine learning-designed de novo miniproteins targeting CTLA-4, enabling parallel interrogation of covalent chemistry, linker geometry, and incorporation site. We confirmed covalent engagement on cells together with enhanced functional blockade. Finally, we demonstrate multifunctional nsAAs that combine covalent warheads with fluorogenic reporters for real-time detection of target engagement, as well as dual nsAA incorporation for macrocyclization and fluorescent imaging of covalent binding on cell surfaces. By uniting synthetic biology, chemical biology, generative protein design, and high-throughput functional selection, AminoX compresses covalent protein engineering timelines by orders of magnitude, accelerating the development of next-generation therapeutics, biosensors, and chemical probes.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nature Chemical Biology
104 papers in training set
Top 0.1%
17.7%
2
Science
429 papers in training set
Top 2%
13.7%
3
Nature Communications
4913 papers in training set
Top 19%
9.9%
4
Nature Biotechnology
147 papers in training set
Top 2%
6.1%
5
Cell
370 papers in training set
Top 3%
6.0%
50% of probability mass above
6
Nature Materials
21 papers in training set
Top 0.2%
3.8%
7
Nature
575 papers in training set
Top 7%
3.5%
8
Advanced Science
249 papers in training set
Top 6%
3.4%
9
Nature Nanotechnology
30 papers in training set
Top 0.4%
3.4%
10
ACS Central Science
66 papers in training set
Top 0.5%
3.4%
11
Nature Methods
336 papers in training set
Top 3%
3.1%
12
Journal of the American Chemical Society
199 papers in training set
Top 2%
2.6%
13
Nature Biomedical Engineering
42 papers in training set
Top 0.6%
2.0%
14
Angewandte Chemie International Edition
81 papers in training set
Top 2%
1.8%
15
Science Advances
1098 papers in training set
Top 16%
1.8%
16
Nucleic Acids Research
1128 papers in training set
Top 12%
1.6%
17
ACS Nano
99 papers in training set
Top 2%
1.6%
18
Cell Chemical Biology
81 papers in training set
Top 3%
1.2%
19
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 41%
0.9%
20
ACS Synthetic Biology
256 papers in training set
Top 3%
0.8%
21
Cancer Cell
38 papers in training set
Top 2%
0.8%
22
Cell Systems
167 papers in training set
Top 12%
0.8%
23
Advanced Materials
53 papers in training set
Top 2%
0.7%
24
Nature Chemistry
34 papers in training set
Top 1%
0.7%
25
Circulation
66 papers in training set
Top 3%
0.7%
26
Cell Discovery
54 papers in training set
Top 6%
0.6%
27
Nano Letters
63 papers in training set
Top 3%
0.6%