Back

Towards chemical accuracy for alchemical free energy calculations with hybrid physics-based machine learning / molecular mechanics potentials

Rufa, D. A.; Bruce Macdonald, H. E.; Fass, J.; Wieder, M.; Grinaway, P. B.; Roitberg, A. E.; Isayev, O.; Chodera, J. D.

2020-07-30 biophysics
10.1101/2020.07.29.227959 bioRxiv
Show abstract

Alchemical free energy methods with molecular mechanics (MM) force fields are now widely used in the prioritization of small molecules for synthesis in structure-enabled drug discovery projects because of their ability to deliver 1-2 kcal mol-1 accuracy in well-behaved protein-ligand systems. Surpassing this accuracy limit would significantly reduce the number of compounds that must be synthesized to achieve desired potencies and selectivities in drug design campaigns. However, MM force fields pose a challenge to achieving higher accuracy due to their inability to capture the intricate atomic interactions of the physical systems they model. A major limitation is the accuracy with which ligand intramolecular energetics--especially torsions--can be modeled, as poor modeling of torsional profiles and coupling with other valence degrees of freedom can have a significant impact on binding free energies. Here, we demonstrate how a new generation of hybrid machine learning / molecular mechanics (ML/MM) potentials can deliver significant accuracy improvements in modeling protein-ligand binding affinities. Using a nonequilibrium perturbation approach, we can correct a standard, GPU-accelerated MM alchemical free energy calculation in a simple post-processing step to efficiently recover ML/MM free energies and deliver a significant accuracy improvement with small additional computational effort. To demonstrate the utility of ML/MM free energy calculations, we apply this approach to a benchmark system for predicting kinase:inhibitor binding affinities--a congeneric ligand series for non-receptor tyrosine kinase TYK2 (Tyk2)--wherein state-of-the-art MM free energy calculations (with OPLS2.1) achieve inaccuracies of 0.93{+/-}0.12 kcal mol-1 in predicting absolute binding free energies. Applying an ML/MM hybrid potential based on the ANI2x ML model and AMBER14SB/TIP3P with the OpenFF 1.0.0 ("Parsley") small molecule force field as an MM model, we show that it is possible to significantly reduce the error in absolute binding free energies from 0.97 [95% CI: 0.68, 1.21] kcal mol-1 (MM) to 0.47 [95% CI: 0.31, 0.63] kcal mol-1 (ML/MM).

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.1%
33.8%
2
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.1%
26.6%
50% of probability mass above
3
Chemical Science
71 papers in training set
Top 0.2%
5.0%
4
The Journal of Physical Chemistry Letters
58 papers in training set
Top 0.2%
5.0%
5
Biophysical Journal
545 papers in training set
Top 2%
3.7%
6
The Journal of Physical Chemistry B
158 papers in training set
Top 0.5%
3.7%
7
eLife
5422 papers in training set
Top 37%
1.9%
8
The Journal of Chemical Physics
49 papers in training set
Top 0.3%
1.4%
9
Scientific Reports
3102 papers in training set
Top 69%
1.0%
10
Journal of the American Chemical Society
199 papers in training set
Top 4%
1.0%
11
Journal of Computational Chemistry
11 papers in training set
Top 0.1%
1.0%
12
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 40%
0.9%
13
Biophysical Reports
36 papers in training set
Top 0.5%
0.8%
14
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%
15
Bioinformatics
1061 papers in training set
Top 9%
0.8%
16
Bioinformatics Advances
184 papers in training set
Top 5%
0.7%
17
ACS Omega
90 papers in training set
Top 5%
0.7%
18
PLOS ONE
4510 papers in training set
Top 71%
0.7%
19
Protein Science
221 papers in training set
Top 2%
0.5%
20
Computational and Structural Biotechnology Journal
216 papers in training set
Top 12%
0.5%