Back

Crowdsourced Protein Design: Lessons From the Adaptyv EGFR Binder Competition

Cotet, T.-S.; Krawczuk, I.; Pacesa, M.; Nickel, L.; Correia, B. E.; Haas, N.; Qamar, A.; Challacombe, C. A.; Kidger, P.; Ferragu, C.; Naka, A.; Castorina, L. V.; Subr, K.; Kluonis, T.; Stam, M. J.; Unal, S. M.; Wood, C. W.; Stocco, F.; Ferruz, N.; Kurumida, Y.; Calia, C. N.; Paesani, F.; Machado, L. d. A.; Belot, E.; Gitter, A.; Campbell, M. J.; Hallee, L.; Adaptyv Competition Organizers,

2025-04-23 bioengineering
10.1101/2025.04.17.648362 bioRxiv
Show abstract

In this report, we summarize and analyze the 2024 Adaptyv protein design competition. Participants used computational and Machine Learning (ML) methods of their choice to design proteins that bind the Epidermal Growth Factor Receptor (EGFR), a key drug target involved in cell growth, differentiation, and cancer development. Over 1,800 designs were submitted across two rounds. Of these, 601 proteins were selected and characterized for expression and binding affinity to EGFR, with competitors both optimizing existing binders (KD = 1.21 nM) and creating de novo binders (KD = 82 nM). All selected designs were experimentally validated using Adaptyvs automated Bio-Layer Interferometry (BLI) pipeline. This competition illustrates the potential of crowdsourcing to drive creativity and innovation in protein design. However, it also exposed key challenges, such as the lack of standardized benchmarks, experimental design targets, and robust computational metrics for method comparison. We anticipate that future competitions will address these gaps and further motivate progress in computational protein design.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Cell Systems
167 papers in training set
Top 0.6%
14.3%
2
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.7%
8.4%
3
Nature Methods
336 papers in training set
Top 1%
8.4%
4
Protein Engineering, Design and Selection
14 papers in training set
Top 0.1%
7.2%
5
Protein Science
221 papers in training set
Top 0.1%
6.8%
6
Nature Communications
4913 papers in training set
Top 35%
4.3%
7
PLOS Computational Biology
1633 papers in training set
Top 12%
2.7%
50% of probability mass above
8
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 24%
2.7%
9
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
2.6%
10
Nature Biotechnology
147 papers in training set
Top 4%
2.1%
11
Scientific Reports
3102 papers in training set
Top 54%
1.9%
12
Nature Machine Intelligence
61 papers in training set
Top 2%
1.7%
13
Nucleic Acids Research
1128 papers in training set
Top 11%
1.7%
14
Frontiers in Bioinformatics
45 papers in training set
Top 0.2%
1.7%
15
Structure
175 papers in training set
Top 2%
1.7%
16
Bioinformatics
1061 papers in training set
Top 7%
1.7%
17
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.5%
1.7%
18
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.5%
19
PLOS ONE
4510 papers in training set
Top 58%
1.3%
20
Chemical Science
71 papers in training set
Top 1%
1.2%
21
BMC Bioinformatics
383 papers in training set
Top 6%
1.1%
22
eLife
5422 papers in training set
Top 50%
1.1%
23
Nature
575 papers in training set
Top 13%
0.9%
24
Journal of Cheminformatics
25 papers in training set
Top 0.4%
0.9%
25
International Journal of Molecular Sciences
453 papers in training set
Top 13%
0.9%
26
Angewandte Chemie International Edition
81 papers in training set
Top 3%
0.9%
27
Communications Chemistry
39 papers in training set
Top 0.8%
0.9%
28
Communications Biology
886 papers in training set
Top 19%
0.9%
29
Science Advances
1098 papers in training set
Top 30%
0.7%
30
Advanced Science
249 papers in training set
Top 19%
0.7%