Back

Integrating computational chemistry and machine learning to predict KRAS mutation-induced resistance

Mizgalska, K.; Urbaniak, K.; Imbody, D. J.; Haura, E. B.; Guida, W. C.; Branciamore, S.; Karolak, A.

2026-04-11 biophysics
10.64898/2026.04.10.717640 bioRxiv
Show abstract

Mutation-induced drug resistance is a major contributor to the failure of targeted cancer therapies, particularly in tumors driven by mutations in the KRAS oncogene. Although covalent inhibitors effectively target KRAS G12C, secondary mutations such as G12C/Y96C, G12C/Y96S, and G12C/Y96D lead to resistance despite leaving the covalent attachment site intact. To predict these resistance outcomes, we developed a computational framework that integrates molecular dynamics-derived structural, energetic, thermodynamic, and contact-based descriptors with machine learning. Features extracted from simulations of treatment-sensitive and treatment-resistant KRAS mutants were used to train logistic regression, random forest, support vector machine, and Bayesian Network classifiers, achieving average accuracies above 90%. Solvent-accessible surface area variability, Lennard-Jones 1,4 energy, mean square displacement, and root mean square fluctuation emerged as the most discriminatory features. Residues G10, E62, and H95 showed the highest predictive value. This approach highlights conformational and solvent-exposure changes as central drivers of KRAS drug resistance and provides a generalizable workflow for other clinically relevant mutant targets. Author SummaryMutation-induced resistance is a common challenge across many cancer types and is often associated with aggressive tumor progression and poor therapeutic response. Investigating the dynamic properties of proteins harboring such mutations provides valuable insights into the structural and functional consequences of these alterations, thereby helping to elucidate the mechanisms of drug resistance. Machine learning algorithms are particularly effective at uncovering complex patterns within high-dimensional data, such as molecular dynamics simulation trajectories. Integrating these algorithms with analysis of protein dynamics holds significant potential to aid in drug discovery challenges by reducing both time and resource demands while increasing the likelihood of identifying effective therapeutic candidates. As a proof of concept, we developed a computational framework that integrates molecular dynamics-derived molecular features with machine learning to distinguish treatment-sensitive from treatment-resistant KRAS mutants. KRAS is known for drug resistance arising from secondary mutations that disrupt inhibitor binding despite intact covalent attachment sites. The models achieved over 90% accuracy and identified solvent-exposure and conformational changes at residues G10, E62, and H95 as key predictors of treatment resistance. This workflow offers a generalizable strategy for understanding and forecasting mutation-induced resistance.

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.1%
52.3%
50% of probability mass above
2
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.1%
10.2%
3
PLOS Computational Biology
1633 papers in training set
Top 8%
4.3%
4
Frontiers in Molecular Biosciences
100 papers in training set
Top 0.4%
3.6%
5
The Journal of Physical Chemistry B
158 papers in training set
Top 0.6%
3.1%
6
Chemical Science
71 papers in training set
Top 0.5%
2.8%
7
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 31%
1.8%
8
Scientific Reports
3102 papers in training set
Top 57%
1.7%
9
Physical Biology
43 papers in training set
Top 1%
1.7%
10
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.3%
11
PLOS ONE
4510 papers in training set
Top 58%
1.3%
12
eLife
5422 papers in training set
Top 51%
1.0%
13
Protein Science
221 papers in training set
Top 1%
0.9%
14
The Journal of Chemical Physics
49 papers in training set
Top 0.4%
0.8%
15
The Journal of Physical Chemistry Letters
58 papers in training set
Top 1%
0.8%
16
Structure
175 papers in training set
Top 3%
0.8%
17
Biophysical Journal
545 papers in training set
Top 5%
0.8%
18
ACS Omega
90 papers in training set
Top 5%
0.6%
19
Communications Chemistry
39 papers in training set
Top 2%
0.5%
20
Journal of Biomolecular Structure and Dynamics
43 papers in training set
Top 2%
0.5%