Back

Enhancing ML-based binder design with high-throughput screening: a comparison of mRNA and yeast display technologies

Yao, Z.; Metts, J. M.; Huber, A. K.; Li, J.; Kinjo, T.; Dieckhaus, H.; Nallathambi, A.; Bowers, A.; Kuhlman, B.

2026-02-14 bioengineering
10.64898/2026.02.12.705611 bioRxiv
Show abstract

Recent advances in machine learning (ML)-based protein design methods have enabled the rapid in silico generation of large libraries of miniprotein binders with minimal manual input. While computational design capacity has scaled rapidly, experimental validation methods have lagged, creating a bottleneck in binder discovery pipelines. Here, we apply mRNA display to screen an ML-designed miniprotein binder library and directly compare its performance with the more widely used yeast surface display platform using a single shared DNA library. We screened 2,009 designs targeting the platelet receptor TLT-1 and 3,159 designs targeting the immune receptor B7-H3 across both platforms. While both selection methods reliably identified functional binders, we found that mRNA display preferentially enriched binders with slower dissociation rates. In addition, mRNA display achieved higher library coverage than yeast display, likely rescuing functional designs that are penalized in a cell-based expression system. Biophysical characterization of selected binders from both platforms revealed strong binding affinities and high thermal stabilities. These results showcase the power of integrating ML-based computational design tools with rapid in vitro selection technologies, providing a scalable framework for therapeutic miniprotein discovery. IMPORTANCEMiniprotein binders offer major advantages as next-generation therapeutics, including small size, high stability, and efficient production. In this work, we conduct a side-by-side comparison of mRNA and yeast display as platforms for high-throughput evaluation of de novo miniprotein binders. The binders generated here serve as starting points for therapeutics targeting TLT-1 or B7-H3, two clinically relevant molecules.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
Protein Engineering, Design and Selection
14 papers in training set
Top 0.1%
8.4%
2
Cell Chemical Biology
81 papers in training set
Top 0.2%
8.2%
3
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 10%
6.8%
4
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.9%
6.4%
5
ACS Chemical Biology
150 papers in training set
Top 0.2%
6.4%
6
Scientific Reports
3102 papers in training set
Top 34%
3.7%
7
Molecular Therapy
71 papers in training set
Top 0.8%
3.3%
8
Nature Communications
4913 papers in training set
Top 43%
2.7%
9
Advanced Science
249 papers in training set
Top 7%
2.6%
10
Cell Systems
167 papers in training set
Top 5%
2.6%
50% of probability mass above
11
Journal of the American Chemical Society
199 papers in training set
Top 2%
2.4%
12
Molecular Pharmaceutics
16 papers in training set
Top 0.2%
2.4%
13
Protein Science
221 papers in training set
Top 0.8%
1.8%
14
PLOS ONE
4510 papers in training set
Top 52%
1.8%
15
Structure
175 papers in training set
Top 2%
1.7%
16
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.7%
17
Cell Reports Medicine
140 papers in training set
Top 4%
1.7%
18
mAbs
28 papers in training set
Top 0.2%
1.7%
19
PLOS Computational Biology
1633 papers in training set
Top 17%
1.7%
20
Journal of Medicinal Chemistry
68 papers in training set
Top 0.8%
1.3%
21
Antibody Therapeutics
16 papers in training set
Top 0.3%
1.2%
22
Angewandte Chemie International Edition
81 papers in training set
Top 3%
1.0%
23
Cellular and Molecular Bioengineering
21 papers in training set
Top 0.2%
1.0%
24
ChemMedChem
15 papers in training set
Top 0.5%
1.0%
25
Chemical Science
71 papers in training set
Top 2%
1.0%
26
iScience
1063 papers in training set
Top 25%
1.0%
27
Cell Genomics
162 papers in training set
Top 5%
0.9%
28
eLife
5422 papers in training set
Top 53%
0.9%
29
ACS Omega
90 papers in training set
Top 4%
0.8%
30
Communications Biology
886 papers in training set
Top 24%
0.7%