Back

KinConfBench: A Curated Benchmark for Cofolding Models on Kinase Conformational States

Sun, K.; Head-Gordon, T.

2026-04-10 biophysics
10.64898/2026.04.07.716788 bioRxiv
Show abstract

Protein kinases are critical drug targets, requiring therapeutics that can modulate their active and inactive conformational states. While cofolding models can generate global folds directly from kinase sequences and ligand SMILES strings, these models have not yet been tested on their ability to recover ligand induced-fit conformational states of the kinase proteins. Here, we introduce KinConfBench, a curated benchmark of 2,225 high-quality human kinase chains to evaluate the ability of three state-of-the-art cofolding models--Boltz-2, Chai-1, and Protenix--to recover both canonical and rare conformational states. We show that geometric success metrics of a ligand pose in the active site does not correlate strongly with the correct kinase conformational state, motivating a new set of dynamical benchmarks for assessing cofolding models. While all three cofolding models achieve [~]65-75% prediction accuracy for kinase conformational classification, they exhibit severe mode collapse when performing multiple inferences, show negligible structural diversity in sampling induced-fit motions, and display a prevalent "apo-drift" in which all three cofolding models predominately predict the kinase to be in its ligand-free state. Our results highlight that capturing ligand-induced protein conformational diversity, not just geometric fit, is critical for next-generation structure-based drug discovery.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.4%
14.7%
2
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 7%
9.2%
3
Nature Communications
4913 papers in training set
Top 25%
7.2%
4
PLOS Computational Biology
1633 papers in training set
Top 6%
6.4%
5
Structure
175 papers in training set
Top 0.5%
4.9%
6
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.3%
4.0%
7
Biophysical Journal
545 papers in training set
Top 2%
3.7%
8
Scientific Reports
3102 papers in training set
Top 41%
3.1%
50% of probability mass above
9
Cell Systems
167 papers in training set
Top 4%
3.1%
10
Chemical Science
71 papers in training set
Top 0.6%
2.4%
11
The Journal of Physical Chemistry Letters
58 papers in training set
Top 0.6%
2.1%
12
Nature Methods
336 papers in training set
Top 4%
1.9%
13
Nature Computational Science
50 papers in training set
Top 0.5%
1.8%
14
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
15
Communications Biology
886 papers in training set
Top 8%
1.7%
16
The Journal of Physical Chemistry B
158 papers in training set
Top 1%
1.5%
17
Advanced Science
249 papers in training set
Top 12%
1.5%
18
Nucleic Acids Research
1128 papers in training set
Top 12%
1.5%
19
Science
429 papers in training set
Top 15%
1.5%
20
eLife
5422 papers in training set
Top 45%
1.5%
21
IUCrJ
29 papers in training set
Top 0.2%
1.5%
22
PLOS ONE
4510 papers in training set
Top 57%
1.5%
23
Nature Machine Intelligence
61 papers in training set
Top 2%
1.3%
24
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
1.2%
25
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.1%
26
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.7%
1.1%
27
Protein Science
221 papers in training set
Top 1%
1.1%
28
Journal of the American Chemical Society
199 papers in training set
Top 4%
0.9%
29
Communications Chemistry
39 papers in training set
Top 1%
0.7%
30
Bioinformatics
1061 papers in training set
Top 9%
0.7%