Back

Results of a large scale study of the binding of 50 type IIinhibitors to 348 kinases: The role of protein reorganization

Vardanyan, V. H.; Haldane, A.; Hwang, H.; Coskun, D.; Lihan, M.; Miller, E. B.; Friesner, R. A.; Levy, R. M.

2026-02-08 biophysics
10.64898/2026.02.05.704068 bioRxiv
Show abstract

Kinase family proteins constitute the second largest protein class targeted in drug development efforts, most prominently to treat cancer, but also several other diseases associated with kinase dysfunction. In this work we focus on type II kinase inhibitors which bind to the "classical" inactive conformation of the protein kinase catalytic domain where the DFG motif has a "DFG-out" orientation and the activation loop is folded. Many Tyrosine kinases (TKs) exhibit strong binding affinity with a wide spectrum of type II inhibitors while serine/threonine kinases (STKs) often bind more weakly. Recent work suggests this difference is largely due to differences in the folded to extended conformational equilibrium of the activation loop between TKs vs. STKs. The binding affinity of a type II inhibitor to its kinase target can be decomposed into a sum of two contributions: (1) the free energy cost to reorganize the protein from the active to inactive state, and (2) the binding affinity of the type II inhibitor to the inactive kinase conformation. In previous work we used a Potts statistical energy potential based on sequence co-variation to thread sequences over ensembles of active and inactive kinase structures. The threading function was used to estimate the free energy cost to reorganize kinases from the active to classical inactive conformation, and we showed that this estimator is consistent with the results of molecular dynamics free energy simulations for a small set of STKs and TKs. In the current study, we analyze the results of a large-scale study of the binding affinities of 50 type II inhibitors to 348 kinases, of which the results for 16 of the 50 type II inhibitors were reported in an earlier study (the "Davis dataset"); the binding data for the remaining 34 type II inhibitors to the panel of 348 kinases were recently obtained (the "Schrodinger dataset"). We use the Potts statistical energy model to investigate the contribution of protein reorganization to the selectivity of the large kinase panel against the set of 50 type II inhibitors, and find that protein reorganization makes a significant contribution to the selectivity. The AUC of the receiver-operator characteristic curve is [~]0.8. We report the results of an internal "blind test", that shows how Potts threading energies can provide more accurate estimates of kinase selectivity than corresponding predictions using experimental results of small sample size. We discuss why two STK phylogenetic kinase families, STE and CMGC, appear to contain many outliers, and how to improve the ability to predict kinase selectivity with a more complete analysis of the kinase conformational landscape. We compare the performance of Potts threading for predicting binding properties of the large set of (50) Type II inhibitors to 348 kinases, with those of a sequence-based purely machine learning model, DeepDTAGen, a publicly available machine learning model that was trained on the complete Davis dataset, including both Type I and Type II kinase inhibitors. We observe that DeepDTAGen performs well on binding predictions for the 16 type II inhibitors in the Davis dataset, but performs poorly on binding predictions for the 34 type II inhibitors against 348 kinases in the Schrodinger dataset.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 0.7%
22.6%
2
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.7%
8.4%
3
Physical Biology
43 papers in training set
Top 0.1%
7.2%
4
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.2%
4.4%
5
Scientific Reports
3102 papers in training set
Top 27%
4.3%
6
PLOS ONE
4510 papers in training set
Top 36%
4.0%
50% of probability mass above
7
Frontiers in Molecular Biosciences
100 papers in training set
Top 0.4%
3.6%
8
The Journal of Physical Chemistry B
158 papers in training set
Top 0.7%
2.9%
9
Biophysical Journal
545 papers in training set
Top 2%
2.7%
10
Journal of Theoretical Biology
144 papers in training set
Top 0.6%
2.4%
11
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.3%
2.1%
12
BMC Bioinformatics
383 papers in training set
Top 4%
2.1%
13
Briefings in Bioinformatics
326 papers in training set
Top 3%
1.9%
14
Molecules
37 papers in training set
Top 0.7%
1.8%
15
Journal of Biomolecular Structure and Dynamics
43 papers in training set
Top 0.7%
1.7%
16
Entropy
20 papers in training set
Top 0.2%
1.5%
17
Journal of Computational Chemistry
11 papers in training set
Top 0.1%
1.5%
18
Bioinformatics
1061 papers in training set
Top 8%
1.3%
19
Journal of Molecular Biology
217 papers in training set
Top 2%
1.2%
20
International Journal of Molecular Sciences
453 papers in training set
Top 11%
1.2%
21
Journal of Molecular Graphics and Modelling
16 papers in training set
Top 0.2%
0.9%
22
Communications Biology
886 papers in training set
Top 24%
0.7%
23
Physica A: Statistical Mechanics and its Applications
10 papers in training set
Top 0.3%
0.7%
24
Physical Review Research
46 papers in training set
Top 0.9%
0.7%
25
The European Physical Journal E
15 papers in training set
Top 0.1%
0.7%
26
Biochemistry and Biophysics Reports
28 papers in training set
Top 2%
0.7%
27
Biophysics and Physicobiology
10 papers in training set
Top 0.2%
0.6%
28
Journal of Biosciences
12 papers in training set
Top 0.2%
0.6%
29
Journal of Molecular and Cellular Cardiology
39 papers in training set
Top 0.9%
0.6%
30
Computational and Structural Biotechnology Journal
216 papers in training set
Top 12%
0.5%