More Structures, Less Accuracy: ESM3's Binding Prediction Paradox

Loux, T.; Wang, D.; Shakhnovich, E.

2024-12-09 molecular biology
10.1101/2024.12.09.627585 bioRxiv
This paper investigates the impact of incorporating structural information into the protein-protein interaction predictions made by ESM3, a multimodal protein language model (pLM). We used various structural variants as inputs and compared three widely used structure acquisition pipelines (EvoEF2, Gromacs, and Rosetta Relax) to assess their effects on ESM3's performance. Our findings reveal that using a single, identical structure for all variants, regardless of whether it is relaxed or variant, consistently enhances model performance across various datasets. This improvement is striking in few-shot learning. However, performance deteriorates when a different relaxed mutant structure is used for each variant. Based on these results, we advise caution when integrating distinct mutant structures into ESM3 and similar models. This study highlights the critical need for careful consideration of structural inputs in protein binding affinity prediction.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1. PLOS Computational Biology: 1633 papers in training set, Top 3%, 10.4%
2. PLOS ONE: 4510 papers in training set, Top 19%, 10.1%
3. Bioinformatics: 1061 papers in training set, Top 3%, 8.4%
4. Journal of Chemical Information and Modeling: 207 papers in training set, Top 0.9%, 6.3%
5. Computational and Structural Biotechnology Journal: 216 papers in training set, Top 0.6%, 6.3%
6. Frontiers in Bioinformatics: 45 papers in training set, Top 0.1%, 6.3%
7. BMC Bioinformatics: 383 papers in training set, Top 2%, 3.9%

50% of probability mass above

8. Computers in Biology and Medicine: 120 papers in training set, Top 0.7%, 3.9%
9. PeerJ: 261 papers in training set, Top 3%, 3.1%
10. Scientific Reports: 3102 papers in training set, Top 41%, 3.1%
11. Journal of Structural Biology: 58 papers in training set, Top 0.6%, 2.1%
12. Journal of Molecular Biology: 217 papers in training set, Top 1%, 2.1%
13. International Journal of Molecular Sciences: 453 papers in training set, Top 5%, 2.1%
14. Protein Science: 221 papers in training set, Top 0.7%, 1.9%
15. Briefings in Bioinformatics: 326 papers in training set, Top 4%, 1.8%
16. Bioinformatics Advances: 184 papers in training set, Top 3%, 1.7%
17. Biology Methods and Protocols: 53 papers in training set, Top 1.0%, 1.7%
18. Biomolecules: 95 papers in training set, Top 0.7%, 1.5%
19. ACS Omega: 90 papers in training set, Top 3%, 1.2%
20. FEBS Letters: 42 papers in training set, Top 0.1%, 1.2%
21. Journal of Computational Chemistry: 11 papers in training set, Top 0.2%, 0.8%
22. Entropy: 20 papers in training set, Top 0.4%, 0.7%
23. Heliyon: 146 papers in training set, Top 7%, 0.7%
24. Journal of Molecular Evolution: 21 papers in training set, Top 0.4%, 0.7%
25. Biochemistry and Biophysics Reports: 28 papers in training set, Top 2%, 0.6%
26. Acta Crystallographica Section D Structural Biology: 54 papers in training set, Top 0.5%, 0.6%
27. Journal of Chemical Theory and Computation: 126 papers in training set, Top 0.9%, 0.6%
28. Journal of Biosciences: 12 papers in training set, Top 0.2%, 0.6%
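
The statement that the top 7 journals account for 50% of the predicted probability mass can be checked by accumulating the listed percentages in rank order. The sketch below is only an illustration: the function name `journals_to_reach` is hypothetical, and the inputs are the rounded percentages shown in the list, so the result may differ slightly from whatever unrounded values the site uses.

```python
from itertools import accumulate

# Predicted probabilities (percent) for the ten highest-ranked journals,
# copied from the list above. These are rounded display values (assumption:
# the underlying unrounded values are not available here).
probs = [10.4, 10.1, 8.4, 6.3, 6.3, 6.3, 3.9, 3.9, 3.1, 3.1]


def journals_to_reach(probabilities, threshold):
    """Return the smallest k whose first k probabilities sum to >= threshold.

    Returns None if the running total never reaches the threshold.
    """
    for k, running_total in enumerate(accumulate(probabilities), start=1):
        if running_total >= threshold:
            return k
    return None


k = journals_to_reach(probs, 50.0)
print(k)
```

With these values the running total is about 47.8% after six journals and about 51.7% after seven, which is consistent with the cutoff falling after BMC Bioinformatics.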