Towards chemical accuracy for alchemical free energy calculations with hybrid physics-based machine learning / molecular mechanics potentials

Dominic A. Rufa; Hannah E. Bruce Macdonald; Josh Fass; Marcus Wieder; Patrick B. Grinaway; Adrian E. Roitberg; Olexandr Isayev; John D. Chodera

doi:10.1101/2020.07.29.227959

Abstract

Alchemical free energy methods with molecular mechanics (MM) force fields are now widely used in the prioritization of small molecules for synthesis in structure-enabled drug discovery projects because of their ability to deliver 1–2 kcal mol⁻¹ accuracy in well-behaved protein-ligand systems. Surpassing this accuracy limit would significantly reduce the number of compounds that must be synthesized to achieve desired potencies and selectivities in drug design campaigns. However, MM force fields pose a challenge to achieving higher accuracy due to their inability to capture the intricate atomic interactions of the physical systems they model. A major limitation is the accuracy with which ligand intramolecular energetics—especially torsions—can be modeled, as poor modeling of torsional profiles and coupling with other valence degrees of freedom can have a significant impact on binding free energies. Here, we demonstrate how a new generation of hybrid machine learning / molecular mechanics (ML/MM) potentials can deliver significant accuracy improvements in modeling protein-ligand binding affinities. Using a nonequilibrium perturbation approach, we can correct a standard, GPU-accelerated MM alchemical free energy calculation in a simple post-processing step to efficiently recover ML/MM free energies and deliver a significant accuracy improvement with small additional computational effort. To demonstrate the utility of ML/MM free energy calculations, we apply this approach to a benchmark system for predicting kinase:inhibitor binding affinities—a congeneric ligand series for non-receptor tyrosine kinase TYK2 (Tyk2)—wherein state-of-the-art MM free energy calculations (with OPLS2.1) achieve inaccuracies of 0.93±0.12 kcal mol⁻¹ in predicting absolute binding free energies. Applying an ML/MM hybrid potential based on the ANI2x ML model and AMBER14SB/TIP3P with the OpenFF 1.0.0 (“Parsley”) small molecule force field as an MM model, we show that it is possible to significantly reduce the error in absolute binding free energies from 0.97 [95% CI: 0.68, 1.21] kcal mol⁻¹ (MM) to 0.47 [95% CI: 0.31, 0.63] kcal mol⁻¹ (ML/MM).

Introduction

MM force fields are widely used in structure-enabled drug discovery

Alchemical free energy calculations are now widely used in structure-enabled drug discovery programs to optimize or maintain potency [1–5]. Typically, relative alchemical free energy methods can predict affinities with accuracies of 1–2 kcal mol⁻¹ in prospective use in well-behaved, structure-enabled programs [3, 6]. While an accuracy of 1 kcal mol⁻¹ is already sufficient to greatly reduce the number of compounds that must be synthesized to achieve desired potency gains, the ability to further improve this accuracy to 0.5 kcal mol⁻¹ (“chemical accuracy” [7]) would deliver significant benefits at least as large as the improvement achieved by accuracy improvements from 2 kcal mol⁻¹ to 1 kcal mol⁻¹ for optimization of potency [8, 9] and selectivity [10, 11]. To achieve “chemical accuracy”, improvements are required to the computational model of the protein-ligand system (the force field) while constraining any increase in computational cost to ensure results can be produced on a timescale viable for active drug discovery projects, as alchemical free energy calculations typically require generating tens to hundreds of nanoseconds of simulation data within a few hours [5, 12].

Surpassing 1 kcal mol⁻¹ accuracy requires model improvements that are difficult to generalize

Relative free energy methods almost universally utilize fixed-charge MM force fields to model small, organic, drug-like molecules and interactions with their respective receptors and aqueous environments, such as GAFF [13, 14], CGenFF [15, 16, 16], or OPLS [17]. Importantly, these popular class I [18, 19] MM force fields have well-characterized drawbacks, in part, because they omit a number of important energetic contributions known to limit their ability to achieve chemical accuracy [7, 20, 21]. For example, while moving to more complex electrostatics models which include fixed multipoles and polarizable dipoles [22] are promising, the development of polarizable force fields that broadly deliver accuracy gains has proven challenging [23–25].

Deficiencies in the modeling of torsions that accurately account for local chemical environment is also a difficult challenge for MM force fields [26]. Indeed, many MM force fields recommend refitting torsion potentials directly to quantum chemical calculations for individual molecules in a bespoke manner, a process considered essential to achieving a 1–2 kcal mol⁻¹ level of accuracy in binding free energy calculations [1, 27]. Even so, the environment-dependent coupling between torsions and other valence degrees of freedom [7, 20, 21, 28] (including adjacent torsions [26, 29–32]) makes it difficult for this simple refitting approach to accurately capture often significant ligand conformational reorganization effects [33].

QM/MM offers a parameterization-free alternative, but at significantly increased cost

Another approach attempts to avoid the parameterization issue altogether by modeling the ligand using QM levels of theory, while treating the remaining atomic environment with an MM force field in a hybrid QM/MM potential [34–38]. QM/MM calculations are orders of magnitude more expensive than the equivalent calculation at the MM level, which has led to attempts to speed up these calculations by either using a low level of QM theory, or reducing the number of QM-level evaluations that are performed. QM/MM simulations for ligand binding are yet limited in accuracy due to the low level of QM theory (often semi-empirical or DFT with limited basis sets) that are computationally practical. This has driven the development of methods that attempt to minimize the amount of QM/MM simulation data that must be generated by computing perturbative corrections to MM alchemical free energy calculations [39–41].

Machine learning (ML) potentials can reproduce QM energies at greatly reduced cost

Recently, quantum machine learning potentials (ML or QML) [42]—such as those based on neural networks like ANI [43]—have seen success in reproducing QM energetics with orders of magnitude less computational cost than the QM methods they aim to reproduce. The ANI-1x neural network potential [43], for example, is able to reproduce DFT-level energies (ωB97X functional with 6-31G* basis set) with a 10⁶ speed up. Indeed, the ANI models are so fast and reproduce quantum chemical data so well that recent approaches have integrated them into bespoke torsion refitting schemes as an alternative to costly QM torsion scans [44]. Notably, the recently-developed ANI-2x supports molecular systems including element types C, H, N, O, as well as F, Cl, and S—ideal for applications to receptor-ligand systems as they cover 90% of drug-like molecules. In particular, for the purpose of this study, ANI-2x covers 100% of the ligands included in the Schrödinger benchmark set for alchemical free energy calculations [45].

Hybrid ML/MM potentials provide improved ligand energetics

While the notion of simulating an entire solvated protein-ligand system with quantum chemical accuracy—in a manner that avoids molecular mechanics parameterization altogether—with orders of magnitude less effort is immediately appealing, several limitations stand in the way to this: First, the number of elements covered by potentials like ANI [49] are so far rather limited, precluding their application to ions and cofactors; Second, while ML models are orders of magnitude faster than any reasonable QM levels of theory, they are currently still orders of magnitude slower than GPU-accelerated MM simulations, though this gap is expected to close rapidly with both software and hardware improvements. Third, ML models have not yet been parameterized on systems that would ensure well-behaved condensed-phase properties, so that MM may yet provide superior results for treating intermolecular interactions in large, extended systems.

However, hybrid ML/MM models—wherein ligand interactions are treated with ML and the environment and ligand-environment interactions with MM (in analogy to QM/MM [34–38])—could provide a convenient and efficient path to improving accuracy by capturing complex molecular interactions that classical force fields fail to do. Recently, Lahey et. al. [52] demonstrated that by using the ANI-1ccx [53] ML potential (a variant of ANI-1x refit to coupled-cluster calculations) to represent intramolecular interactions of small molecule ligands, accurate binding poses and conformational energies could be afforded to the EGFR inhibitor, erlotinib. Notably, they reported significant discrepancies among torsional energy profiles between the MM and ANI-1ccx potentials [52].

Here, we wondered whether incorporating this higher-accuracy ML treatment of small molecule ligand intramolecular energetics within an MM scheme would lead to quantitative improvements in absolute and relative binding free energies. A particularly convenient hybrid ML/MM formulation corresponds to the functional form given in Equation 1 (and visualized in Figure 1) wherein the potential energy function for a environment (receptor and/or solvent)/ligand system takes the form with as all non-ligand coordinates (receptor and/or solvent), as the ligand coordinates, and indicating the MM/ML potential energy function for the ligand and U_MM(X_P, X_L) the MM potential energy function for the environment/ligand system. The formulation given in Equation 1 treats intramolecular ligand interactions with an ML potential while intermolecular and environmental (receptor and/or solvent) atomic interactions are treated with MM force fields [52]. Although other formulations are possible—such as including short-range ligand-environment interactions within the ML region—we will demonstrate that the simple formulation of Equation 1 is sufficient to realize significant improvements in the accuracy of computed binding free energies for a challenging kinase:inhibitor benchmark system (Figure 2).

Figure 1. A hybrid ML/MM potential can treat intramolecular ligand forces with high accuracy. ML potentials can treat intramolecular ligand forces with high accuracy, as part of a hybrid ML/MM scheme.

Top: We can construct a hybrid machine learning / molecular mechanics (ML/MM) potential that treats ligand intramolecular interactions with higher accuracy than achievable by MM potentials by subtracting the MM energy of the ligand in vacuum and adding the more accurate ML energy of the ligand in vacuum. Here, the MM model uses the Open Force Field Initiative [http://openforcefield.org] OpenFF 1.0.0 (“Parsley”) small molecule force field [46], AMBER14SB [47], and TIP3P [48] while the ML model uses the ANI-2x [49] neural network potential parameterized using DFT ωB97X/6-31G* QM calculations. Bottom: The ANI-2x [49] ML potential first computes radial and angular features for each atom and then sums energetic contributions by atom using deep learning models specific to each element-element pair.

Figure 2. Tyk2 is a challenging test set for predicting kinase:inhibitor binding free energies.

The Tyk2 congeneric ligand benchmark series was taken from the Schrödinger JACS benchmark set [45], which is challenging for both commercial force fields (OPLS2.1 achieves a ΔG RMSE of 0.93±0.12 kcal mol⁻¹ [45]) and public force fields (GAFF 1.8 achieves a ΔG RMSE of 1.13 kcal mol⁻¹, and ΔΔG RMSE of 1.27 kcal mol⁻¹ [50]). Left: Illustration of the X-ray structure used for all calculations. Right: 2D structures of all ligands in the benchmark set, showing common scaffold and substituents. The Schrodinger Tyk2 benchmark set contains a congeneric series selected from [51, 51] where experimental errors in K_i are reported to have δK_i/K_i < 0.3, yielding δΔG ≈ 0.18 kcal mol⁻¹ and δΔΔG ≈ 0.25 kcal mol⁻¹.

Figure 3. A simple nonequilibrium switching scheme can eficiently correct standard MM alchemical free energy calculations to ML/MM accuracy.

(A) Augmented thermodynamic cycle used to estimate ML/MM free energies. The blue-bracketed, four-state thermodynamic cycle represents a typical MM relative free energy calculation where . The orange bracketed thermodynamic augmentations represent respective ML/MM hybrid states such that . (B) Illustration of the nonequilibrium switching perturbation approach to estimating free energy corrections. The blue MM and orange ML/MM arrows represent equilibria at respective thermodynamic states. First, N configurations are sampled from equilibrium at the MM state and short MM→ML/MM nonequilibrium (NEQ) trajectories are generated and the dimensionless work w_MM→ML/MM recorded. Subsequently, the last configuration of each NEQ trajectory is resampled (with replacement) with probability proportional to . Resampled configurations are decorrelated with 10 ps of ML/MM equilibrium molecular dynamics before backward ML/MM→MM nonequilibrium trajectories are generated and the corresponding dimensionless work stored. The Bennett acceptance ratio (BAR) [54–56] is used to estimate the free energy difference, which corresponds to the crossing of the true p(w_MM→ML/MM) and p(−w_ML/MM→MM) work distributions [55, 57].

Nonequilibrium perturbations can eficiently compute MM to ML/MM corrections

Current implementations of ML potentials do not permit an entire alchemical free energy calculation to be carried out with hybrid ML/MM potentials in a practical timescale. Instead, we aim for an approach that post-processes traditional MM alchemical free energy calculations [5, 12, 58]—such as the relative free energy calculation used here—to compute a correction ΔG^MM→ML/MM to the free energy of binding. Alchemical free energy calculations are now both routine and efficient, and available in a wide variety of software packages, often with GPU acceleration [6, 45, 50, 59–65].

While it may be tempting to simply sample from equilibrium MM states where the ligand is in complex or solution and estimate ΔG^MM→ML/MM based on the instantaneous dimensionless work w[X] ≡ β[U_ML/MM(X) − U_MM(X)], unless the MM models have been specifically parameterized to minimize the variance in w [41], small differences in the equilibrium valence degrees of freedom ensure that the variance of this weight is so large as to make this approach impractical (Figure 4).

Figure 4. Short 10 ps nonequilibrium trajectories are suficient to reliably estimate MM→ML/MM free energy corrections.

Left: Forward nonequilibrium work distribution (MM→ML/MM) for each of three switching times for both complex and solvated ligand phases for a representative ligand (1). The evolution of the work distributions demonstrates a reduction in variance and converge to more negative work values upon longer annealing time [69]. Right: Standard deviations of work distributions with respect to nonequilibrium protocol length for complex and solvent phases and bootstrapped 68% CIs. The complex phase has consistently lower work standard deviations, likely due comparatively lower ligand entropy than in the solvent phase.

As an alternative, we propose a convenient approach that uses short, nonequilibrium (NEQ) simulations to reduce the variance sufficiently to enable practical free energy estimates. Importantly, we aim to avoid two pitfalls: First, we aim to avoid the significant bias that arises in attempting to estimate free energy differences from short, unidirectional MM→ML/MM nonequilibrium switching trajectories [66], as well as the costly long nonequilibrium trajectories that would be required to minimize that bias; instead, we aim to use bidirectional protocols (MM→ML/MM and ML/MM→MM) and the optimal Bennett acceptance ratio estimator, which minimizes this bias [54, 55, 66]. Second, we aim to minimize the amount of simulation data that must be generated from the ML/MM state since it is so slow to sample from.

We construct an alchemical protocol that connects the easily-sampled MM thermodynamic states to the ML/MM thermodynamic states (in solvent and complex) via a linear interpolation of the potential (geometric interpolation of the sampled probability density function [67]) wherein the potential takes the form

Here, the alchemical parameter λ ∈ [0, 1] interpolates between the MM and hybrid ML/MM endstates, corresponds to the configuration of the receptor (and all other environment) atoms, and corresponds to the configuration of the ligand atoms. The NEQ free energy correction can be computed using four sequential steps, each performed independently for the solvated phase and the complex phase:

Extract N iid equilibrium samples from each MM thermodynamic state (ligand in complex or solvent) and perform MM→ML/MM nonequilibrium switching (NEQ) simulations for a fixed trajectory length T using the alchemical potential Eq. 2 with λ = t/T, recording the dimensionless protocol work w_MM→ML/MM [68].
Resample the final snapshots from each trajectory (with replacement) using the weight to generate an ensemble of N snapshots sampled from equilibrium for the ML/MM state.
For each resampled configuration, perform a short (10 ps) MD simulation with the ML/MM potential to decorrelate the resampled configurations.
For each of these ML/MM configurations, perform NEQ switching with the time-reversed protocol (ML/MM→MM) and record the dimensionless work w_ML/MM→MM.

The Bennett acceptance ratio (BAR) [54–56] is then used to estimate the free energy correction ΔG^MM→ML/MM to the absolute free energy of the MM endstate for each phase (complex, solvent) of the alchemical free energy calculation.

We stress that resampling after forward NEQ switching (step 2) is a critically important part of the procedure. Once forward NEQ trajectories are collected, each final configuration is approximately Boltzmann distributed with respect to the ML/MM thermodynamic state as [70]. Omitting the resampling step and simply retaining all of the conformations generated by the forward NEQ switch step would not recover an approximately Boltzmann-distributed sample size. Omitting the resampling step would be particularly problematic in the solvent phase—where the forward work distributions generally span several k_BT (see Fig. S.I.1)—since ligand conformations that are exponentially disfavored at the ML/MM state (compared to the MM state) would undergo backward NEQ switching, rendering prohibitively biased backward work distribution and free energy estimates. The resampling step is followed by a short equilibration (step 3) to decorrelate configurations that are resampled multiple times and recover from any collapse in effective sample size that occured during the resampling step.

An analysis of the unidirectional work distribution for several nonequilibrium protocol lengths (sometimes referred to as “annealing times” in the annealed importance sampling (AIS) literature [71]) suggests that 10 ps switching times are sufficient to produce useful free energy estimates (Figure 4). Indeed, performing the bidirectional switching scheme for several ligands confirms that the forward and backward work distributions overlap and BAR can produce useful estimates of the free energy corrections (Figure 5). Notably, solvent phase NEQ perturbations consistently yield higher work variances (Figure S.I.1) in their work distributions than their complex-phase counterparts, presumably indicative of the conformationally constrained nature of bound (but not solvated) ligands.

Figure 5. Bidirectional work distributions from nonequilibrium switching show sufficient overlap to compute precise free energy corrections from MM to ML/MM in both solvent and complex phases.

Forward (blue) and negative backward (orange) work distributions from 10 ps nonequilibrium switching trajectories for MM to ML/MM perturbations are shown for complex (top) and solvent (bottom) phases for ligands 1 (left) and 14 (right). The Bennett acceptance ratio (BAR) estimates of the free energies and the uncertainty thereof are shown as vertical black lines and vertical gray dotted lines, respectively.

ML/MM significantly improves accuracy on a kinase:inhibitor benchmark

We applied this nonequilibrium switching free energy correction scheme to a benchmark set of a well-studied congeneric series of inhibitors for non-receptor tyrosine-protein kinase (Tyk2) from the Schrödinger JACS benchmark set (Figure 2) [45]. This benchmark set is challenging for both commercial force fields (OPLS2.1 achieves a ΔG RMSE of 0.93±0.12 kcal mol⁻¹ [45]) and public force fields (GAFF 1.8 achieves a ΔG RMSE 1.13 kcal/mol⁻¹ and ΔΔG RMSE of 1.27 kcal/mol [50]). We consider a purely MM binding free energy baseline by using the ANI-2x [49] ML model (parameterized from DFT ωB97X/6-31G*) with AMBER14SB [47], TIP3P [48], and the Open Force Field Initiative OpenFF 1.0.0 (“Parsley”) small molecule force field [46].

The OpenFF 1.0.0 (“Parsley”) MM free energy calculations, shown in Figure 6 (A) and (C), achieve an accuracy that is statistically indistinguishable from other public and commercial MM force field benchmarks in terms of both root-mean squared error (RMSE; OPLS2.1 [45] and GAFF 1.8 [50]) and mean unsigned error (MUE; OPLS3.1, GAFF 2.1, and CGenFF 4.1 [64]). When the MM free energy calculation is corrected to ML/MM level of theory, Figure 6 (B) and (D), we recover experimental free energies with an RMSE of 0.47 [95% CI: 0.30, 0.67] kcal mol⁻¹, a large and statistically significant improvement from MM (RMSE 0.97 [95% CI: 0.70, 1.22] kcal mol⁻¹). Due to the naïve formulation of the ML/MM potential, this improvement in the experimental agreement can only be a consequence of an improved intramolecular potential for the ligands in this system. The particular formulation was chosen as it allows for rapid calculation of the per-ligand corrections to be performed post hoc. More advanced definitions of the ML/MM potential may lead to further improvements, by modelling additional interactions with higher levels of theory, however this would require concerted efforts to implement efficient interoperability between MM and ML packages—an area that requires further work.

Figure 6. ML/MM free energy calculations show significant improvement over MM in reproducing absolute and relative Tyk2 inhibitor binding free energies.

(A) Absolute binding free energies for the MM small molecule OpenFF 1.0.0 (“Parsley”) force field used with AMBER14SB and TIP3P water computed from relative free energy calculations estimated using perses 0.7.1 [http://github.com/choderalab/perses] and the maximum-likelihood estimator [72] to integrate estimates from redundant transformations in the relative alchemical transformation network. The same redundant network of relative alchemical transformations used in [45] was used here. (B) Absolute free energies (ΔG) corrected to ML/MM (using ANI-2x [49] for the ML model) using the nonequilibrium correction scheme depicted in Figure 3. (C) Relative MM binding free energies (ΔΔG) for computed relative free energy transformation edges, with correction using MLE. (D) Relative ML/MM binding free energies obtained from differences in the corrected absolute binding free energy estimates (top right). Blue scatter points are MM results, and orange are ML/MM results. Dark and light grey shaded regions indicate the region of ±0.5 and ±1.0 kcal mol⁻¹ error respectively. Vertical error bars (which appear smaller than the symbols) show one standard deviation in the free energy, calculated by MBAR, while the experimental error bar of 0.18 kcal mol⁻¹ is used [51]. Statistical analysis was performed using the Arsenic package [http://github.com/openforcefield/arsenic], with 95% confidence intervals calculated by bootstrap analysis. For all plots, an additive constant was added to all computed values, such that the mean computed value is equal to the mean experimental value, such as to minimise the RMSE as in [45].

The nature of these corrections is somewhat surprising: All MM→ML/MM corrections are positive, disfavorin binding. There is a notable trend in the magnitude of the correction, as illustrated by Figure 7. The smallest ΔG^MM→ML/MM corrections, on the order of 0.5–1.5 kcal mol⁻¹ are the conformationally-strained cyclopropane moieties. Aliphatic groups with more conformational degrees of freedom, such as larger rings and acyclic groups, show larger corrections, with some of the largest MM→ML/MM corrections containing functional groups that will conjugate with the amide group. The subtleties of the electronics of these conjugated molecules are unlikely to be captured by MM forcefields, particularly small molecule torsion parameters [26].

Figure 7. ML/MM corrections to MM binding free energies can be up to 4 kcal mol⁻¹ in magnitude.

The signed ΔG^MM→ML/MM corrections for each ligand (with R-group shown) are shown, ordered from least positive (slightly disfavoring binding) to most positive (strongly disfavoring binding).

Improved torsion energetics appear to drive accuracy improvements

It is well-appreciated that general small molecule MM force fields often fail to accurately describe torsion energy profiles observed with higher-level quantum chemical calculations [73, 74], a phenomenon driven by the significant effect substituents can have on torsion profiles via electronic effects [26, 75]. To overcome this limitation, many MM force fields recommend refitting torsion potentials directly to quantum chemical calculations for individual molecules in a bespoke manner [1, 27].

It is plausible that the improved accuracy demonstrated by ML/MM in Figure 6 arises primarily from improved modeling of torsion energetics or torsion-torsion coupling. To investigate this, we examined the torsion probability density functions for conformations sampled by the ligand in solvent and in complex. Figure 8 depicts the 1D torsion (top) and 2D torsion-torsion (bottom) probability density functions for ligand 1, focusing on the torsions associated with the amide linker to the substituted R-groups in the ligand series. The ML/MM potential samples a notably more peaked distribution with significantly perturbed equilibrium rotamer probabilities in solvent, and a tighter shifted torsion range in complex (Figure 8, top). 2D couplings are also surprisingly different between MM and ML/MM in complex.

Figure 8. Torsion profiles and couplings differ between MM and hybrid ML/MM models

1D and 2D torsion profiles of coupled amide torsions connecting the substituent R-group for ligand 1 (center) are shown. Top row: 1D torsion profiles for bonds highlighted purple and green for MM (blue) and MM/ML (orange) solvent and for complex. Bottom row: 2D torsion-torsion profiles for both solvent and complex are shown for both MM and ML/MM ensembles using a bivariate kernel density estimate, with red scatter points indicating observed samples.

Discussion

In this work, we demonstrated that a hybrid ML/MM model of a challenging, pharmaceutically-relevant benchmark receptor:ligand system dramatically outperforms the both commercially and publicly-available MM force fields in its ability to recover experimental binding free energies of a congeneric series of inhibitors. The ability to halve the current state-of-the-art free energy uncertainty to ~0.5 kcal mol⁻¹ using a simple post-processing procedure along with publicly-available software and ML models is promising. In particular, it suggests that there is significant potential yet to predict ligand:target binding affinities for prospective drug design campaigns.

The NEQ protocol suggests the potential for further efficiency improvements

Among the insights made in the course of this study, we make special note of the NEQ procedure and its prospect for optimization. It is clear from Figures 4, 5, and the Supporting Information that a fixed-length NEQ protocol may indeed be wasteful in the complex phase considering the consistency of high work distribution overlap and BAR precision; in fact, exponential averaging results for all of the forward complex phase NEQ protocols yield free energies within 0.1 kcal mol⁻¹ of the calculated BAR free energy correction. Indeed, reallocating the effort of conducting an ML/MM equilibration and backward protocol from complex to the solvent NEQ protocol might afford a more robust, lower variance free energy correction.

ML/MM and MM torsion distribution discrepancies prompt further investigation

The substantial differences between torsion profiles from the ML/MM and MM models in Fig. 8 suggests that the significant free energy corrections afforded by the ML/MM model may be largely a consequence of poorly parameterized torsions in the MM model, or perhaps the existence of nonneglibible torsion couplings. Further experiments could distinguish between these scenarios by refitting torsion profiles or reweighting using only 1D or 2D torsion profiles rather than using the complete replacement of ligand intra molecular energetics as was considered here.

The fact that the most pronounced discrepancies in torsion profiles for both solvent and complex phases were observed about amide functional groups is particularly notable. If it is indeed the case that current MM force fields fail to recover appropriate conformational energetics of amides, then there is certainly a potential for free energy accuracy improvement in a large subset of druglike molecules, especially among protease inhibitors, which are characterized by several amide groups to mimic peptide backbones.

Binding free energy improvements afforded by ML/MM show promise for other systems

In this study, we were fortuitous in that the OpenFF 1.0.0 provided heavy-tailed phase-space distributions, particular in the solvent phase (see Figure 8), that overlapped sufficiently with the ML/MM model. Had this not been the case, it is likely that the NEQ correction procedure would have failed to recover free energies with sufficient precision at the annealing times employed in this study. Further study will indicate whether other MM force fields—including the GAFF force field [13, 14] and more recent iterations of the OpenFF force fields—generally provide sufficient phase-space overlap with ML models for ML/MM corrections to remain computationally convenient and accurate with respect to experiment.

While these results illustrate the notable improvement to relative free energy calculations for this Tyk2 protein:ligand system, more extensive studies will be needed to determine how robustly this accuracy improvement manifests for a broad range of congeneric series. While the current implementation involves post-processing of pre-generated MM data, the implementation of the method could be improved by integrating ML potential models such as ANI into extensible simulation packages such as OpenMM [76], perhaps via a plugin architecture. Improved interoperability would increase ease of adoption for computational efforts in drug design projects. The definition of the hybrid ML/MM potential could be improved through expanding the terms in the system that are computed with ML by using ML methods such as AIMNet [77], SchNet [78], PhysNet [79], or AP-Net [80] that allow for decomposition of electrostatics and long-range dispersion from short-term valence energies.

Machine learning will likely permeate all aspects of alchemical free energy calculations

More broadly, machine learning will play various roles in all aspects of alchemical free energy calculations. As more calculations are performed, machine learning models (such as graph convolutional or message passing networks [81]) will undoubtedly be used to learn the difficulty (statistical efficiency) of relative transformations in a manner that can be used to design optimal transformation networks [72] or optimal alchemical protocols. Scheen et al. [82] recently demonstrated how ligand-based ML models can be used to correct MM alchemical free energy calculations based on experimental training data, applying it to hydration free energy computation. Ghanakota et al. [83] also demonstrated how ligand-based ML models could be trained to learn more expensive free energy calculations to permit evaluation of large compound spaces with free energy accuracy. These few applications are just the beginning of how machine learning will transform physical modeling in the biosciences.

Code and data availability

• Input files and setup scripts: https://github.com/choderalab/qmlify

Author Contributions

Conceptualization: JDC, DAR, HEBM, JF, and MW; Methodology: JDC, HBM, DAR, JF, and MW; Software: PBG, DAR, HBM, JF, and MW; Investigation: DAR, HEBM, JF, and MW; Writing–Original Draft: DAR, JDC, HBM, JF, and MW; Writing–Review&Editing: AER, OI and JDC; Funding Acquisition: JDC; Resources: JDC; Supervision: JDC, AER, and OI.

Disclosures

JDC is a current member of the Scientific Advisory Board of OpenEye Scientific Software and a consultant to Foresite Laboratories. The Chodera laboratory receives or has received funding from multiple sources, including the National Institutes of Health, the National Science Foundation, the Parker Institute for Cancer Immunotherapy, Relay Therapeutics, Entasis Therapeutics, Silicon Therapeutics, EMD Serono (Merck KGaA), AstraZeneca, Vir Biotechnology, Bayer, XtalPi, the Molecular Sciences Software Institute, the Starr Cancer Consortium, the Open Force Field Consortium, Cycle for Survival, a Louis V. Gerstner Young Investigator Award, and the Sloan Kettering Institute. A complete funding history for the Chodera lab can be found at http://choderalab.org/funding AER is a current member of the Science Advisory Board of Schrödinger Co. and receives funding from Genentech Co.

Detailed methods

MM Relative Free Energy Calculations

MM relative free energy calculations were performed using the open-source software Perses 0.7.1 [http://github/choderalab/perses]. The set of 24 pairwise comparisons for the set of 16 ligands were performed, as specified in [45]. Simulations were performed for both the bound-complex phase and solvent phase to afford relative binding free energies using OpenMM 7.4.2 [76]. Protein and ligand files were adapted from [50] and are available as part of the openmm-forcefields 0.7.1 package [https://github.com/openmm/openmmforcefields]. Simulations were performed with AMBER14SB [47] protein forcefield and the OpenFF 1.0.0 “Parsley” small molecule forcefield [46]. The system was solvated with a 9.0 Å padding using TIP3P water [48] and 150 mM NaCl [84]. Simulations were performed without hydrogen bond constraints using 4 amu hydrogen massses following mass repartitioning. A timestep of 2 fs with a 1 ps⁻¹ collision rate at 300 K was performed using a BAOAB Langevin integration scheme [85, 86], using an NPT ensemble at 1.0 atm sampled using a Monte Carlo barostat with molecular scaling. Nonbonded interactions were handled using a 9 Å cutoff using Particle Mesh Ewald (PME) with a tolerance of 2.5 × 10⁻⁴ and long-range dispersion corrections.

The alchemical perturbation was performed using a single topology protocol. The mapping protocol to generate the single hybrid ligand is performed using the maximum common substructure search (MCSS) algorithm from the OpenEye Toolkits 2020.0.4 to identify the common ‘core’ of the molecule. Atoms not common between the two molecules (not contained in the core) are included as ‘unique-old’ or ‘unique-new’ atoms. The interaction perturbation scheme is described in Figure S.I.2. Softcore steric potentials (see [87]) with an α parameter of 0.85.

11 equally-spaced λ-windows were used for these calculations, for both the solvent and the complex phases. 1000 cycles of 250 integration steps (2 fs timestep) were performed, resulting in a total of 5 ns of sampling per λ-state (55 ns sampling per-phase, per-ligand pair). All-to-all Hamiltonian replica exchange was attempted every cycle. MBAR was performed on decorrelated replica exchange samples to recover MM relative free energies and associated uncertainties. The maximum likelihood estimator (MLE) DiffNet [72] was used to compute absolute binding free energies for the set of ligands, and shifted using a single experimental value. The aforementioned calculations are self-contained operations in the [https://github.com/openforcefield/arsenic] as a graph representation (nodes correspond to ligands and edges correspond to relative alchemical transformations). Results are shown in Figure 6 (A and C) and were used as the basis for the MM→MM/ML corrected energies, described in the following.

Bidirectional nonequilibrium switching and ML/MM free energy corrections

The bidirectional nonequilibrium protocol was parameterized in accordance with Eq. 2 with 5000 annealing steps. MM model and simulation parameters are consistent with those described above.

The forward NEQ procedure is as follows for each phase:

Algorithm 1:

Nonequilibrium (NEQ) switching protocol

To conduct decorrelation and equilibration at the ML/MM thermodynamic states, {x_i} returned from Algorithm 1 is resampled N_A times (with replacement) w.r.t. ) from {work_i}. The algorithm is performed again (in this case, setting N_neq to 5000) whilst maintaining λ = 1; in this case, work_i is identically 0. Subsequently, Algorithm 1 is conducted in the backward direction (i.e. in the “lambda update” step).

The work sets {work_i,forward} from Algorithm 1 and {work_i,backward} from the aforementioned modification were then passed to the BAR estimator [https://github.com/choderalab/pymbar/blob/master/pymbar/bar.py] to maximize the likelihood of the ΔG_ML/MM. To correct the MM free energy calculations, the complex/solvent differences were added in-place to the MM ligand network nodes, and absolute binding free energies were recomputed with DiffNett [72].

For both the complex and solvent phase, 100 decorrelated equilibrium configurations (N_A = 100) were extracted from replica exchange checkpoint files from each relative calculation edge. This treatment resulted in duplicate and independent repeats of ensemble ML/MM bidirectional switching. This resulted in 200 to 700 independent forward/backward work samples for each phase for the 16 ligands, depending on the degree of each ligand in the relative free energy calculation network. The aggregated bidirectional work distributions for each ligand and each phase is shown in Figure S.I.1.

The forward, resampling, ML/MM endstate simulation (for 10 ps), and backward annealing procedures, described previously, were used to recover bidirectional work distributions along with the BAR-estimated free energy correction.

The ligand configuration was extracted from the solvent and complex phases and modelled with the OpenFF 1.0.0 forcefield as used in the MM simulations; however the nonbonded (i.e. steric and electrostatic) interactions were treated as non-periodic without a cutoff. The ANI2x ligand model was computed using TorchANI package [https://github.com/aiqm/torchani] (ANI2x version 2.1.1)[88].

In order to propagate dynamics through the protocol, each velocity update steps are preceded by a force update step wherein the forces of the MM and ML vacuum ligand systems are computed, scaled appropriately by the value of λ, and added to the appropriate sub-block matrix of the full ligand-and-environment force matrix.

Supplementary Information

Appendix 0 Figure S.I.1. Bidirectional NEQ work distributions for Tyk2 congeneric inhibitor series.

Forward work distributions (blue) and (negative) backward work distributions (orange) show sufficient overlap for a BAR free energy calculation. BAR ΔG_MM→MM/ML are shown as vertical black lines, and uncertainties thereof are shown as gray, dotted lines. Both complex and solvent phases are shown.

Appendix 0 Figure S.I.2. Perses relative free energy calculations default alchemical protocol.

The interaction potentials are perturbed linearly in two stages: between λ: 0.0 → 0.5 the sterics of the unique-new atoms are turned on while the electrostatics of the unique-old atoms are turned off, followed by the turning on of the electrostatics of the unique-new atoms simultaneously with the steric terms of the unique-old atoms being turned off between λ: 0.5 → 1.0. The parameters of the core atoms are linearly perturbed from λ: 0. → 1.0.

Acknowledgments

DAR acknowledges support from the Tri-Institutaional PhD Program in Chemical Biology and the Sloan Kettering Institute. HEBM acknowledges support from a Molecular Sciences Software Institute Investment Fellowship and Relay Therapeutics. JF acknowledges support from NSF CHE-1738979 and the Sloan Kettering Institute. MW acknowledges support from a FWF Erwin Schrödinger Postdoctoral Fellowship J 4245-N28. JDC acknowledges support from NIH grant P30 CA008748, NIH grant R01 GM121505, NIH grant R01 GM132386, and the Sloan Kettering Institute. OI acknowledges support from NSF CHE-1802789 and Carnegie Mellon University. AER acknowledges support from NSF CHE-1802831

The authors thank Christopher Rowley (ORCID: 0000-0002-0205-952X) for sharing early work and discussions that inspired this study; Christopher I. Bayly (ORCID: 0000-0001-9145-6457) and David L. Mobley (ORCID: 0000-0002-1083-5533) for sharing insights on nonequilibrium free energy calculations; Gianni de Fabritiis (ORCID: 0000-0003-3913-4877) for discussions and inspirational work in motivating the utility of hybrid ML/MM models; Peter K. Eastman (ORCID: 0000-0002-9566-9684) for providing extensive support for OpenMM and implementing numerous features of use in this work; and the scientists and software scientists from the Open Force Field Initiative [http://openforcefield.org/members] and Consortium [http://openforcefield.org/consortium] for their contributions to both the science behind OpenFF 1.0.0 and the highly usable software infrastructure that made this work possible.

The authors are extremely grateful to OpenEye Scientific for granting an academic license for use of the OpenEye Toolkit for this work.

Footnotes

https://github.com/choderalab/qmlify

References

[1].↵
Abel R, Wang L, Harder ED, Berne B, Friesner RA. Advancing drug discovery through enhanced free energy calculations. Accounts of chemical research. 2017; 50(7):1625–1632.
OpenUrl CrossRef
[2].
Wang L, Chambers J, Abel R. Protein–Ligand Binding Free Energy Calculations with FEP+. In: Biomolecular Simulations Springer; 2019.p. 201–232.
[3].↵
Schindler C, Baumann H, Blum A, Böse D, Buchstaller HP, Burgdorf L, Cappel D, Chekler E, Czodrowski P, Dorsch D, et al. Large-scale assessment of binding free energy calculations in active drug discovery projects. ChemRxiv. 2020;.
[4].
Sherborne B, Shanmugasundaram V, Cheng AC, Christ CD, DesJarlais RL, Duca JS, Lewis RA, Loughney DA, Manas ES, McGaughey GB, et al. Collaborating to improve the use of free-energy and other quantitative methods in drug discovery. Journal of computer-aided molecular design. 2016; 30(12):1139–1141.
OpenUrl CrossRef
[5].↵
Cournia Z, Allen B, Sherman W. Relative binding free energy calculations in drug discovery: recent advances and practical considerations. Journal of chemical information and modeling. 2017; 57(12):2911–2937.
OpenUrl CrossRef
[6].↵
Abel R, Wang L, Mobley DL, Friesner RA. A critical review of validation, blind testing, and real-world use of alchemical protein-ligand binding free energy calculations. Current topics in medicinal chemistry. 2017; 17(23):2577–2585.
OpenUrl CrossRef
[7].↵
Ponder JW, Case DA. Force fields for protein simulations. In: Advances in protein chemistry, vol. 66 Elsevier; 2003.p. 27–85.
OpenUrl CrossRef PubMed Web of Science
[8].↵
Shirts MR, Mobley DL, Brown SP. Free-energy calculations in structure-based drug design. Drug design: structure- and ligand-based approaches. 2010; p. 61–86.
[9].↵
Abel R, Manas ES, Friesner RA, Farid RS, Wang L. Modeling the value of predictive affinity scoring in preclinical drug discovery. Current opinion in structural biology. 2018; 52:103–110.
OpenUrl
[10].↵
Moraca F, Negri A, de Oliveira C, Abel R. Application of Free Energy Perturbation (FEP+) to Understanding Ligand Selectivity: A Case Study to Assess Selectivity Between Pairs of Phosphodiesterases (PDE’s). Journal of chemical information and modeling. 2019; 59(6):2729–2740.
OpenUrl CrossRef PubMed
[11].↵
Albanese SK, Chodera JD, Volkamer A, Keng S, Abel R, Wang L. Is structure based drug design ready for selectivity optimization? BioRxiv. 2020;.
[12].↵
Chodera JD, Mobley DL, Shirts MR, Dixon RW, Branson K, Pande VS. Alchemical free energy methods for drug discovery: progress and challenges. Current opinion in structural biology. 2011; 21(2):150–160.
OpenUrl CrossRef PubMed
[13].↵
Wang J, Wolf RM, Caldwell JW, Kollman PA, Case DA. Development and testing of a general amber force field. Journal of computational chemistry. 2004; 25(9):1157–1174.
OpenUrl CrossRef PubMed Web of Science
[14].↵
Wang J, Wang W, Kollman PA, Case DA. Automatic atom type and bond type perception in molecular mechanical calculations. Journal of molecular graphics and modelling. 2006; 25(2):247–260.
OpenUrl
[15].↵
Vanommeslaeghe K, Hatcher E, Acharya C, Kundu S, Zhong S, Shim J, Darian E, Guvench O, Lopes P, Vorobyov I, et al. CHARMM general force field: A force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields. Journal of computational chemistry. 2010; 31(4):671–690.
OpenUrl CrossRef PubMed Web of Science
[16].↵
Vanommeslaeghe K, Raman EP, MacKerell Jr AD. Automation of the CHARMM General Force Field (CGenFF) II: assignment of bonded parameters and partial atomic charges. Journal of chemical information and modeling. 2012; 52(12):3155–3168.
OpenUrl CrossRef PubMed
[17].↵
Roos K, Wu C, Damm W, Reboul M, Stevenson JM, Lu C, Dahlgren MK, Mondal S, Chen W, Wang L, et al. OPLS3e: Extending force field coverage for drug-like small molecules. Journal of chemical theory and computation. 2019; 15(3):1863–1874.
OpenUrl
[18].↵
Maple JR, Hwang MJ, Stockfisch TP, Dinur U, Waldman M, Ewig CS, Hagler AT. Derivation of class II force fields. I. Methodology and quantum force field for the alkyl functional group and alkane molecules. Journal of Computational Chemistry. 1994; 15(2):162–182.
OpenUrl
[19].↵
Hwang MJ, Stockfisch T, Hagler A. Derivation of class II force fields. 2. Derivation and characterization of a class II force field, CFF93, for the alkyl functional group and alkane molecules. Journal of the American Chemical Society. 1994; 116(6):2515–2525.
OpenUrl
[20].↵
Dauber-Osguthorpe P, Hagler AT. Biomolecular force fields: where have we been, where are we now, where do we need to go and how do we get there? Journal of computer-aided molecular design. 2019; 33(2):133–203.
OpenUrl
[21].↵
Hagler AT. Force field development phase II: Relaxation of physics-based criteria…or inclusion of more rigorous physics into the representation of molecular energetics. Journal of computer-aided molecular design. 2019; 33(2):205–264.
OpenUrl
[22].↵
Shi Y, Jiao D, Schnieders MJ, Ren P. Trypsin-ligand binding free energy calculation with AMOEBA. In: 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society IEEE; 2009. p. 2328–2331.
[23].↵
Bell DR, Qi R, Jing Z, Xiang JY, Mejias C, Schnieders MJ, Ponder JW, Ren P. Calculating binding free energies of host–guest systems using the AMOEBA polarizable force field. Physical Chemistry Chemical Physics. 2016; 18(44):30261–30269.
OpenUrl CrossRef
[24].
Laury ML, Wang Z, Gordon AS, Ponder JW. Absolute binding free energies for the SAMPL6 cucurbit [8] uril host–guest challenge via the AMOEBA polarizable force field. Journal of computer-aided molecular design. 2018; 32(10):1087–1095.
OpenUrl
[25].↵
Lin FY, MacKerell AD. Force fields for small molecules. In: Biomolecular Simulations Springer; 2019.p. 21–54.
[26].↵
Stern C, Capturing non-local through-bond effects when fragmenting molecules for quantum chemical torsion scans; 2020. Online; accessed 23 July 2020. https://chayast.github.io/fragmenter-manuscript/.
[27].↵
Harder E, Damm W, Maple J, Wu C, Reboul M, Xiang JY, Wang L, Lupyan D, Dahlgren MK, Knight JL, et al. OPLS3: a force field providing broad coverage of drug-like small molecules and proteins. Journal of chemical theory and computation. 2016; 12(1):281–296.
OpenUrl
[28].↵
Maple JR, Dinur U, Hagler AT. Derivation of force fields for molecular mechanics and dynamics from ab initio energy surfaces. Proceedings of the National Academy of Sciences. 1988; 85(15):5350–5354.
OpenUrl Abstract/FREE Full Text
[29].↵
Mackerell Jr AD, Feig M, Brooks III CL. Extending the treatment of backbone energetics in protein force fields: Limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. Journal of computational chemistry. 2004; 25(11):1400–1415.
OpenUrl CrossRef PubMed Web of Science
[30].
Wang LP, McKiernan KA, Gomes J, Beauchamp KA, Head-Gordon T, Rice JE, Swope WC, Martínez TJ, Pande VS. Building a more predictive protein force field: a systematic and reproducible route to AMBER-FB15. The Journal of Physical Chemistry B. 2017; 121(16):4023–4039.
OpenUrl
[31].
Kang W, Jiang F, Wu YD. Universal Implementation of a Residue-Specific Force Field Based on CMAP Potentials and Free Energy Decomposition. Journal of chemical theory and computation. 2018; 14(8):4474–4486.
OpenUrl
[32].↵
Best RB, Mittal J, Feig M, MacKerell Jr AD. Inclusion of many-body effects in the additive CHARMM protein CMAP potential results in enhanced cooperativity of α-helix and β-hairpin formation. Biophysical journal. 2012; 103(5):1045–1051.
OpenUrl CrossRef PubMed
[33].↵
Perola E, Charifson PS. Conformational analysis of drug-like molecules bound to proteins: an extensive study of ligand reorganization upon binding. Journal of medicinal chemistry. 2004; 47(10):2499–2510.
OpenUrl CrossRef PubMed
[34].↵
Beierlein FR, Michel J, Essex JW. A simple QM/MM approach for capturing polarization effects in protein-ligand binding free energy calculations. The Journal of Physical Chemistry B. 2011; 115(17):4911–4926.
OpenUrl
[35].
Dubey KD, Ojha RP. Binding free energy calculation with QM/MM hybrid methods for Abl-Kinase inhibitor. Journal of biological physics. 2011; 37(1):69–78.
OpenUrl
[36].
Rathore R, Sumakanth M, Reddy MS, Reddanna P, Rao AA, Erion MD, Reddy M. Advances in binding free energies calculations: QM/MM-based free energy perturbation method for drug design. Current pharmaceutical design. 2013; 19(26):4674–4686.
OpenUrl
[37].
König G, Hudson PS, Boresch S, Woodcock HL. Multiscale free energy simulations: An efficient method for connecting classical MD simulations to QM or QM/MM free energies using Non-Boltzmann Bennett reweighting schemes. Journal of chemical theory and computation. 2014; 10(4):1406–1419.
OpenUrl
[38].↵
Steinmann C, Olsson MA, Ryde U. Relative ligand-binding free energies calculated from multiple short QM/MM MD simulations. Journal of chemical theory and computation. 2018; 14(6):3228–3237.
OpenUrl
[39].↵
Yang W, Cui Q, Min D, Li H. QM/MM alchemical free energy simulations: Challenges and recent developments. In: Annual Reports in Computational Chemistry, vol. 6 Elsevier; 2010.p. 51–62.
OpenUrl
[40].
Hudson PS, Boresch S, Rogers DM, Woodcock HL. Accelerating QM/MM Free Energy Computations via Intramolecular Force Matching. Journal of Chemical Theory and Computation. 2018 Dec; 14(12):6327–6335. https://doi.org/10.1021/acs.jctc.8b00517, doi: 10.1021/acs.jctc.8b00517, publisher: American Chemical Society.
OpenUrl CrossRef
[41].↵
Giese TJ, York DM. Development of a Robust Indirect Approach for MM→QM Free Energy Calculations That Combines Force-Matched Reference Potential and Bennett’s Acceptance Ratio Methods. Journal of chemical theory and computation. 2019; 15(10):5543–5562.
OpenUrl
[42].↵
Von Lilienfeld OA. Quantum machine learning in chemical compound space. Angewandte Chemie International Edition. 2018; 57(16):4164–4169.
OpenUrl
[43].↵
Smith JS, Isayev O, Roitberg AE. ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost. Chemical science. 2017; 8(4):3192–3203.
OpenUrl
[44].↵
Galvelis R, Doerr S, Damas JM, Harvey MJ, De Fabritiis G. A Scalable Molecular Force Field Parameterization Method Based on Density Functional Theory and Quantum-Level Machine Learning. Journal of chemical information and modeling. 2019; 59(8):3485–3493.
OpenUrl
[45].↵
Wang L, Wu Y, Deng Y, Kim B, Pierce L, Krilov G, Lupyan D, Robinson S, Dahlgren MK, Greenwood J, et al. Accurate and reliable prediction of relative ligand binding potency in prospective drug discovery by way of a modern free-energy calculation protocol and force field. Journal of the American Chemical Society. 2015; 137(7):2695–2703.
OpenUrl CrossRef PubMed
[46].↵
Qiu Y, Smith DGA, Boothroyd S, Wagner J, Bannan CC, Gokey T, Jang H, Lim VT, Lucas X, Tjanaka B, et al., openforce-field/openforcefields: Version 1.0.0 “Parsley”. Zenodo; 2019. doi: 10.5281/zenodo.3483227.
OpenUrl CrossRef
[47].↵
Maier JA, Martinez C, Kasavajhala K, Wickstrom L, Hauser KE, Simmerling C. ff14SB: improving the accuracy of protein side chain and backbone parameters from ff99SB. Journal of chemical theory and computation. 2015; 11(8):3696–3713.
OpenUrl
[48].↵
Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. The Journal of chemical physics. 1983; 79(2):926–935.
OpenUrl CrossRef PubMed Web of Science
[49].↵
Devereux C, Smith JS, Davis KK, Barros K, Zubatyuk R, Isayev O, Roitberg AE. Extending the Applicability of the ANI Deep Learning Molecular Potential to Sulfur and Halogens. Journal of Chemical Theory and Computation. 2020; 16(7):4192–4202. https://doi.org/10.1021/acs.jctc.0c00121, doi: 10.1021/acs.jctc.0c00121, publisher: American Chemical Society.
OpenUrl CrossRef
[50].↵
Song LF, Lee TS, Zhu C, York DM, Merz Jr KM. Using AMBER18 for Relative Free Energy Calculations. Journal of chemical information and modeling. 2019; 59(7):3128–3135.
OpenUrl
[51].↵
Liang J, van Abbema A, Balazs M, Barrett K, Berezhkovsky L, Blair W, Chang C, Delarosa D, DeVoss J, Driscoll J, et al. Lead optimization of a 4-aminopyridine benzamide scaffold to identify potent, selective, and orally bioavailable TYK2 inhibitors. Journal of medicinal chemistry. 2013; 56(11):4521–4536.
OpenUrl CrossRef PubMed
[52].↵
Lahey SLJ, Rowley CN. Simulating protein–ligand binding with neural network potentials. Chemical Science. 2020; 11(9):2362–2368.
OpenUrl
[53].↵
Smith JS, Nebgen BT, Zubatyuk R, Lubbers N, Devereux C, Barros K, Tretiak S, Isayev O, Roitberg AE. Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning. Nature communications. 2019; 10(1):1–8.
OpenUrl
[54].↵
Bennett CH. Efficient estimation of free energy differences from Monte Carlo data. Journal of Computational Physics. 1976; 22(2):245–268.
OpenUrl CrossRef
[55].↵
Crooks GE. Excursions in statistical dynamics. PhD thesis, Citeseer; 1999.
[56].↵
Shirts MR, Bair E, Hooker G, Pande VS. Equilibrium free energies from nonequilibrium measurements using maximum-likelihood methods. Physical review letters. 2003; 91(14):140601.
OpenUrl CrossRef PubMed
[57].↵
Crooks GE. Nonequilibrium measurements of free energy differences for microscopically reversible Markovian systems. Journal of Statistical Physics. 1998; 90(5-6):1481–1487.
OpenUrl
[58].↵
Shirts MR, Mobley DL, Chodera JD. Alchemical free energy calculations: ready for prime time? Annual reports in computational chemistry. 2007; 3:41–59.
OpenUrl
[59].↵
Wan S, Bhati AP, Zasada SJ, Wall I, Green D, Bamborough P, Coveney PV. Rapid and reliable binding affinity prediction of bromodomain inhibitors: a computational study. Journal of chemical theory and computation. 2017; 13(2):784–795.
OpenUrl
[60].
Harger M, Li D, Wang Z, Dalby K, Lagardère L, Piquemal JP, Ponder J, Ren P. Tinker-OpenMM: Absolute and relative alchemical free energies using AMOEBA on GPUs. Journal of computational chemistry. 2017; 38(23):2047–2055.
OpenUrl
[61].
Lee TS, Cerutti DS, Mermelstein D, Lin C, LeGrand S, Giese TJ, Roitberg A, Case DA, Walker RC, York DM. GPU-accelerated molecular dynamics and free energy methods in Amber18: performance enhancements and new features. Journal of chemical information and modeling. 2018; 58(10):2043–2050.
OpenUrl CrossRef
[62].
Jespers W, Esguerra M, Åqvist J, Gutiérrez-de Terán H. QligFEP: an automated workflow for small molecule free energy calculations in Q. Journal of cheminformatics. 2019; 11(1):26.
OpenUrl
[63].
Suruzhon M, Senapathi T, Bodnarchuk MS, Viner R, Wall ID, Barnett CB, Naidoo KJ, Essex JW. ProtoCaller: Robust Automation of Binding Free Energy Calculations. Journal of Chemical Information and Modeling. 2020; 60(4):1917–1921.
OpenUrl
[64].↵
Gapsys V, Pérez-Benito L, Aldeghi M, Seeliger D, Van Vlijmen H, Tresadern G, de Groot BL. Large scale relative protein ligand binding affinities using non-equilibrium alchemy. Chemical Science. 2020; 11(4):1140–1152.
OpenUrl
[65].↵
Kuhn M, Firth-Clark S, Tosco P, Mey AS, Mackey MD, Michel J. Assessment of Binding Affinity via Alchemical Free Energy Calculations. Journal of Chemical Information and Modeling. 2020;.
[66].↵
Shirts MR, Pande VS. Comparison of efficiency and bias of free energies computed by exponential averaging, the Bennett acceptance ratio, and thermodynamic integration. The Journal of chemical physics. 2005; 122(14):144107.
OpenUrl CrossRef PubMed
[67].↵
Neal R. Annealed importance sampling (Technical Report 9805 (revised)). Department of Statistics, University of Toronto. 1998;.
[68].↵
Nilmeier JP, Crooks GE, Minh DD, Chodera JD. Nonequilibrium candidate Monte Carlo is an efficient tool for equilibrium simulation. Proceedings of the National Academy of Sciences. 2011; 108(45):E1009–E1018.
OpenUrl Abstract/FREE Full Text
[69].↵
Jarzynski C. Nonequilibrium equality for free energy differences. Physical Review Letters. 1997; 78(14):2690.
OpenUrl CrossRef Web of Science
[70].↵
Del Moral P, Doucet A, Jasra A. Sequential monte carlo samplers. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 2006; 68(3):411–436.
OpenUrl
[71].↵
Neal RM. Estimating ratios of normalizing constants using linked importance sampling. arXiv preprint math/0511216. 2005;.
[72].↵
Xu H. Optimal measurement network of pairwise differences. Journal of Chemical Information and Modeling. 2019; 59(11):4720–4728.
OpenUrl
[73].↵
Sellers BD, James NC, Gobbi A. A comparison of quantum and molecular mechanical methods to estimate strain energy in druglike fragments. Journal of chemical information and modeling. 2017; 57(6):1265–1275.
OpenUrl
[74].↵
Rai BK, Sresht V, Yang Q, Unwalla R, Tu M, Mathiowetz AM, Bakken GA. Comprehensive Assessment of Torsional Strain in Crystal Structures of Small Molecules and Protein–Ligand Complexes using ab Initio Calculations. Journal of Chemical Information and Modeling. 2019; 59(10):4195–4208.
OpenUrl
[75].↵
Wei W, Champion C, Liu Z, Barigye SJ, Labute P, Moitessier N. Torsional Energy Barriers of Biaryls Could Be Predicted by Electron Richness/Deficiency of Aromatic Rings; Advancement of Molecular Mechanics toward Atom-Type Independence. Journal of chemical information and modeling. 2019; 59(11):4764–4777.
OpenUrl
[76].↵
Eastman P, Swails J, Chodera JD, McGibbon RT, Zhao Y, Beauchamp KA, Wang LP, Simmonett AC, Harrigan MP, Stern CD, et al. OpenMM 7: Rapid development of high performance algorithms for molecular dynamics. PLoS computational biology. 2017; 13(7):e1005659.
OpenUrl
[77].↵
Zubatyuk R, Smith JS, Leszczynski J, Isayev O. Accurate and transferable multitask prediction of chemical properties with an atoms-in-molecules neural network. Science advances. 2019; 5(8):eaav6490.
OpenUrl FREE Full Text
[78].↵
Schütt KT, Sauceda HE, Kindermans PJ, Tkatchenko A, Müller KR. SchNet–A deep learning architecture for molecules and materials. The Journal of Chemical Physics. 2018; 148(24):241722.
OpenUrl
[79].↵
Unke OT, Meuwly M. PhysNet: A neural network for predicting energies, forces, dipole moments, and partial charges. Journal of chemical theory and computation. 2019; 15(6):3678–3693.
OpenUrl
[80].↵
Glick ZL, Metcalf DP, Koutsoukas A, Spronk SA, Cheney DL, Sherrill CD. AP-Net: An atomic-pairwise neural network for smooth and transferable interaction potentials.. 2020;.
[81].↵
Gilmer J, Schoenholz SS, Riley PF, Vinyals O, Dahl GE. Neural message passing for quantum chemistry. arXiv preprint arXiv:170401212. 2017;.
[82].↵
Scheen J, Wu W, Mey AS, Tosco P, Mackey MD, Michel J. A hybrid Alchemical Free Energy/Machine Learning Methodology for the Computation of Hydration Free Energies. Journal of Chemical Information and Modeling. 2020;.
[83].↵
Ghanakota P, Bos PH, Konze K, Staker J, Marques G, Marshall K, Leswing K, Abel R, Bhat S. Combining Cloud-Based Free Energy Calculations, Synthetically Aware Enumerations and Goal-Directed Generative Machine Learning for Rapid Large Scale Chemical Exploration and Optimization. Journal of Chemical Information and Modeling. 2020;.
[84].↵
Joung IS, Cheatham III TE. Determination of alkali and halide monovalent ion parameters for use in explicitly solvated biomolecular simulations. The journal of physical chemistry B. 2008; 112(30):9020–9041.
OpenUrl
[85].↵
Leimkuhler B, Matthews C. Robust and efficient configurational molecular sampling via Langevin dynamics. The Journal of chemical physics. 2013; 138(17):05B601_1.
OpenUrl
[86].↵
Fass J, Sivak DA, Crooks GE, Beauchamp KA, Leimkuhler B, Chodera JD. Quantifying configuration-sampling error in Langevin simulations of complex molecular systems. Entropy. 2018; 20(5):318.
OpenUrl
[87].↵
Gapsys V, Seeliger D, de Groot BL. New soft-core potential function for molecular dynamics based alchemical free energy calculations. Journal of Chemical Theory and Computation. 2012; 8(7):2373–2382.
OpenUrl
[88].↵
Gao X, Ramezanghorbani F, Isayev O, Smith JS, Roitberg AE. TorchANI: A Free and Open Source PyTorch-Based Deep Learning Implementation of the ANI Neural Network Potentials. Journal of Chemical Information and Modeling. 2020; 60(7):3408–3415. https://doi.org/10.1021/acs.jcim.0c00451, doi: 10.1021/acs.jcim.0c00451, publisher: American Chemical Society.
OpenUrl CrossRef

View the discussion thread.

Posted July 30, 2020.

Download PDF

Data/Code

Citation Tools

Subject Area

Biophysics

Subject Areas

All Articles

Animal Behavior and Cognition (5209)
Biochemistry (11730)
Bioengineering (8743)
Bioinformatics (29179)
Biophysics (14964)
Cancer Biology (12080)
Cell Biology (17399)
Clinical Trials (138)
Developmental Biology (9417)
Ecology (14174)
Epidemiology (2067)
Evolutionary Biology (18294)
Genetics (12233)
Genomics (16791)
Immunology (11858)
Microbiology (28051)
Molecular Biology (11575)
Neuroscience (60919)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4955)
Plant Biology (10422)
Scientific Communication and Education (1682)
Synthetic Biology (2881)
Systems Biology (7338)
Zoology (1650)

[1] [1].↵
Abel R, Wang L, Harder ED, Berne B, Friesner RA. Advancing drug discovery through enhanced free energy calculations. Accounts of chemical research. 2017; 50(7):1625–1632.
OpenUrl CrossRef

[2] [2].
Wang L, Chambers J, Abel R. Protein–Ligand Binding Free Energy Calculations with FEP+. In: Biomolecular Simulations Springer; 2019.p. 201–232.

[3] [3].↵
Schindler C, Baumann H, Blum A, Böse D, Buchstaller HP, Burgdorf L, Cappel D, Chekler E, Czodrowski P, Dorsch D, et al. Large-scale assessment of binding free energy calculations in active drug discovery projects. ChemRxiv. 2020;.

[4] [4].
Sherborne B, Shanmugasundaram V, Cheng AC, Christ CD, DesJarlais RL, Duca JS, Lewis RA, Loughney DA, Manas ES, McGaughey GB, et al. Collaborating to improve the use of free-energy and other quantitative methods in drug discovery. Journal of computer-aided molecular design. 2016; 30(12):1139–1141.
OpenUrl CrossRef

[5] [5].↵
Cournia Z, Allen B, Sherman W. Relative binding free energy calculations in drug discovery: recent advances and practical considerations. Journal of chemical information and modeling. 2017; 57(12):2911–2937.
OpenUrl CrossRef

[6] [6].↵
Abel R, Wang L, Mobley DL, Friesner RA. A critical review of validation, blind testing, and real-world use of alchemical protein-ligand binding free energy calculations. Current topics in medicinal chemistry. 2017; 17(23):2577–2585.
OpenUrl CrossRef

[7] [7].↵
Ponder JW, Case DA. Force fields for protein simulations. In: Advances in protein chemistry, vol. 66 Elsevier; 2003.p. 27–85.
OpenUrl CrossRef PubMed Web of Science

[8] [8].↵
Shirts MR, Mobley DL, Brown SP. Free-energy calculations in structure-based drug design. Drug design: structure- and ligand-based approaches. 2010; p. 61–86.

[9] [9].↵
Abel R, Manas ES, Friesner RA, Farid RS, Wang L. Modeling the value of predictive affinity scoring in preclinical drug discovery. Current opinion in structural biology. 2018; 52:103–110.
OpenUrl

[10] [10].↵
Moraca F, Negri A, de Oliveira C, Abel R. Application of Free Energy Perturbation (FEP+) to Understanding Ligand Selectivity: A Case Study to Assess Selectivity Between Pairs of Phosphodiesterases (PDE’s). Journal of chemical information and modeling. 2019; 59(6):2729–2740.
OpenUrl CrossRef PubMed

[11] [11].↵
Albanese SK, Chodera JD, Volkamer A, Keng S, Abel R, Wang L. Is structure based drug design ready for selectivity optimization? BioRxiv. 2020;.

[12] [12].↵
Chodera JD, Mobley DL, Shirts MR, Dixon RW, Branson K, Pande VS. Alchemical free energy methods for drug discovery: progress and challenges. Current opinion in structural biology. 2011; 21(2):150–160.
OpenUrl CrossRef PubMed

[13] [13].↵
Wang J, Wolf RM, Caldwell JW, Kollman PA, Case DA. Development and testing of a general amber force field. Journal of computational chemistry. 2004; 25(9):1157–1174.
OpenUrl CrossRef PubMed Web of Science

[14] [14].↵
Wang J, Wang W, Kollman PA, Case DA. Automatic atom type and bond type perception in molecular mechanical calculations. Journal of molecular graphics and modelling. 2006; 25(2):247–260.
OpenUrl

[15] [15].↵
Vanommeslaeghe K, Hatcher E, Acharya C, Kundu S, Zhong S, Shim J, Darian E, Guvench O, Lopes P, Vorobyov I, et al. CHARMM general force field: A force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields. Journal of computational chemistry. 2010; 31(4):671–690.
OpenUrl CrossRef PubMed Web of Science

[16] [16].↵
Vanommeslaeghe K, Raman EP, MacKerell Jr AD. Automation of the CHARMM General Force Field (CGenFF) II: assignment of bonded parameters and partial atomic charges. Journal of chemical information and modeling. 2012; 52(12):3155–3168.
OpenUrl CrossRef PubMed

[17] [17].↵
Roos K, Wu C, Damm W, Reboul M, Stevenson JM, Lu C, Dahlgren MK, Mondal S, Chen W, Wang L, et al. OPLS3e: Extending force field coverage for drug-like small molecules. Journal of chemical theory and computation. 2019; 15(3):1863–1874.
OpenUrl

[18] [18].↵
Maple JR, Hwang MJ, Stockfisch TP, Dinur U, Waldman M, Ewig CS, Hagler AT. Derivation of class II force fields. I. Methodology and quantum force field for the alkyl functional group and alkane molecules. Journal of Computational Chemistry. 1994; 15(2):162–182.
OpenUrl

[19] [19].↵
Hwang MJ, Stockfisch T, Hagler A. Derivation of class II force fields. 2. Derivation and characterization of a class II force field, CFF93, for the alkyl functional group and alkane molecules. Journal of the American Chemical Society. 1994; 116(6):2515–2525.
OpenUrl

[20] [20].↵
Dauber-Osguthorpe P, Hagler AT. Biomolecular force fields: where have we been, where are we now, where do we need to go and how do we get there? Journal of computer-aided molecular design. 2019; 33(2):133–203.
OpenUrl

[21] [21].↵
Hagler AT. Force field development phase II: Relaxation of physics-based criteria…or inclusion of more rigorous physics into the representation of molecular energetics. Journal of computer-aided molecular design. 2019; 33(2):205–264.
OpenUrl

[22] [22].↵
Shi Y, Jiao D, Schnieders MJ, Ren P. Trypsin-ligand binding free energy calculation with AMOEBA. In: 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society IEEE; 2009. p. 2328–2331.

[23] [23].↵
Bell DR, Qi R, Jing Z, Xiang JY, Mejias C, Schnieders MJ, Ponder JW, Ren P. Calculating binding free energies of host–guest systems using the AMOEBA polarizable force field. Physical Chemistry Chemical Physics. 2016; 18(44):30261–30269.
OpenUrl CrossRef

[24] [24].
Laury ML, Wang Z, Gordon AS, Ponder JW. Absolute binding free energies for the SAMPL6 cucurbit [8] uril host–guest challenge via the AMOEBA polarizable force field. Journal of computer-aided molecular design. 2018; 32(10):1087–1095.
OpenUrl

[25] [25].↵
Lin FY, MacKerell AD. Force fields for small molecules. In: Biomolecular Simulations Springer; 2019.p. 21–54.

[26] [26].↵
Stern C, Capturing non-local through-bond effects when fragmenting molecules for quantum chemical torsion scans; 2020. Online; accessed 23 July 2020. https://chayast.github.io/fragmenter-manuscript/.

[27] [27].↵
Harder E, Damm W, Maple J, Wu C, Reboul M, Xiang JY, Wang L, Lupyan D, Dahlgren MK, Knight JL, et al. OPLS3: a force field providing broad coverage of drug-like small molecules and proteins. Journal of chemical theory and computation. 2016; 12(1):281–296.
OpenUrl

[28] [28].↵
Maple JR, Dinur U, Hagler AT. Derivation of force fields for molecular mechanics and dynamics from ab initio energy surfaces. Proceedings of the National Academy of Sciences. 1988; 85(15):5350–5354.
OpenUrl Abstract/FREE Full Text

[29] [29].↵
Mackerell Jr AD, Feig M, Brooks III CL. Extending the treatment of backbone energetics in protein force fields: Limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. Journal of computational chemistry. 2004; 25(11):1400–1415.
OpenUrl CrossRef PubMed Web of Science

[30] [30].
Wang LP, McKiernan KA, Gomes J, Beauchamp KA, Head-Gordon T, Rice JE, Swope WC, Martínez TJ, Pande VS. Building a more predictive protein force field: a systematic and reproducible route to AMBER-FB15. The Journal of Physical Chemistry B. 2017; 121(16):4023–4039.
OpenUrl

[31] [31].
Kang W, Jiang F, Wu YD. Universal Implementation of a Residue-Specific Force Field Based on CMAP Potentials and Free Energy Decomposition. Journal of chemical theory and computation. 2018; 14(8):4474–4486.
OpenUrl

[32] [32].↵
Best RB, Mittal J, Feig M, MacKerell Jr AD. Inclusion of many-body effects in the additive CHARMM protein CMAP potential results in enhanced cooperativity of α-helix and β-hairpin formation. Biophysical journal. 2012; 103(5):1045–1051.
OpenUrl CrossRef PubMed

[33] [33].↵
Perola E, Charifson PS. Conformational analysis of drug-like molecules bound to proteins: an extensive study of ligand reorganization upon binding. Journal of medicinal chemistry. 2004; 47(10):2499–2510.
OpenUrl CrossRef PubMed

[34] [34].↵
Beierlein FR, Michel J, Essex JW. A simple QM/MM approach for capturing polarization effects in protein-ligand binding free energy calculations. The Journal of Physical Chemistry B. 2011; 115(17):4911–4926.
OpenUrl

[35] [35].
Dubey KD, Ojha RP. Binding free energy calculation with QM/MM hybrid methods for Abl-Kinase inhibitor. Journal of biological physics. 2011; 37(1):69–78.
OpenUrl

[36] [36].
Rathore R, Sumakanth M, Reddy MS, Reddanna P, Rao AA, Erion MD, Reddy M. Advances in binding free energies calculations: QM/MM-based free energy perturbation method for drug design. Current pharmaceutical design. 2013; 19(26):4674–4686.
OpenUrl

[37] [37].
König G, Hudson PS, Boresch S, Woodcock HL. Multiscale free energy simulations: An efficient method for connecting classical MD simulations to QM or QM/MM free energies using Non-Boltzmann Bennett reweighting schemes. Journal of chemical theory and computation. 2014; 10(4):1406–1419.
OpenUrl

[38] [38].↵
Steinmann C, Olsson MA, Ryde U. Relative ligand-binding free energies calculated from multiple short QM/MM MD simulations. Journal of chemical theory and computation. 2018; 14(6):3228–3237.
OpenUrl

[39] [39].↵
Yang W, Cui Q, Min D, Li H. QM/MM alchemical free energy simulations: Challenges and recent developments. In: Annual Reports in Computational Chemistry, vol. 6 Elsevier; 2010.p. 51–62.
OpenUrl

[40] [40].
Hudson PS, Boresch S, Rogers DM, Woodcock HL. Accelerating QM/MM Free Energy Computations via Intramolecular Force Matching. Journal of Chemical Theory and Computation. 2018 Dec; 14(12):6327–6335. https://doi.org/10.1021/acs.jctc.8b00517, doi: 10.1021/acs.jctc.8b00517, publisher: American Chemical Society.
OpenUrl CrossRef

[41] [41].↵
Giese TJ, York DM. Development of a Robust Indirect Approach for MM→QM Free Energy Calculations That Combines Force-Matched Reference Potential and Bennett’s Acceptance Ratio Methods. Journal of chemical theory and computation. 2019; 15(10):5543–5562.
OpenUrl

[42] [42].↵
Von Lilienfeld OA. Quantum machine learning in chemical compound space. Angewandte Chemie International Edition. 2018; 57(16):4164–4169.
OpenUrl

[43] [43].↵
Smith JS, Isayev O, Roitberg AE. ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost. Chemical science. 2017; 8(4):3192–3203.
OpenUrl

[44] [44].↵
Galvelis R, Doerr S, Damas JM, Harvey MJ, De Fabritiis G. A Scalable Molecular Force Field Parameterization Method Based on Density Functional Theory and Quantum-Level Machine Learning. Journal of chemical information and modeling. 2019; 59(8):3485–3493.
OpenUrl

[45] [45].↵
Wang L, Wu Y, Deng Y, Kim B, Pierce L, Krilov G, Lupyan D, Robinson S, Dahlgren MK, Greenwood J, et al. Accurate and reliable prediction of relative ligand binding potency in prospective drug discovery by way of a modern free-energy calculation protocol and force field. Journal of the American Chemical Society. 2015; 137(7):2695–2703.
OpenUrl CrossRef PubMed

[46] [46].↵
Qiu Y, Smith DGA, Boothroyd S, Wagner J, Bannan CC, Gokey T, Jang H, Lim VT, Lucas X, Tjanaka B, et al., openforce-field/openforcefields: Version 1.0.0 “Parsley”. Zenodo; 2019. doi: 10.5281/zenodo.3483227.
OpenUrl CrossRef

[47] [47].↵
Maier JA, Martinez C, Kasavajhala K, Wickstrom L, Hauser KE, Simmerling C. ff14SB: improving the accuracy of protein side chain and backbone parameters from ff99SB. Journal of chemical theory and computation. 2015; 11(8):3696–3713.
OpenUrl

[48] [48].↵
Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. The Journal of chemical physics. 1983; 79(2):926–935.
OpenUrl CrossRef PubMed Web of Science

[49] [49].↵
Devereux C, Smith JS, Davis KK, Barros K, Zubatyuk R, Isayev O, Roitberg AE. Extending the Applicability of the ANI Deep Learning Molecular Potential to Sulfur and Halogens. Journal of Chemical Theory and Computation. 2020; 16(7):4192–4202. https://doi.org/10.1021/acs.jctc.0c00121, doi: 10.1021/acs.jctc.0c00121, publisher: American Chemical Society.
OpenUrl CrossRef

[50] [50].↵
Song LF, Lee TS, Zhu C, York DM, Merz Jr KM. Using AMBER18 for Relative Free Energy Calculations. Journal of chemical information and modeling. 2019; 59(7):3128–3135.
OpenUrl

[51] [51].↵
Liang J, van Abbema A, Balazs M, Barrett K, Berezhkovsky L, Blair W, Chang C, Delarosa D, DeVoss J, Driscoll J, et al. Lead optimization of a 4-aminopyridine benzamide scaffold to identify potent, selective, and orally bioavailable TYK2 inhibitors. Journal of medicinal chemistry. 2013; 56(11):4521–4536.
OpenUrl CrossRef PubMed

[52] [52].↵
Lahey SLJ, Rowley CN. Simulating protein–ligand binding with neural network potentials. Chemical Science. 2020; 11(9):2362–2368.
OpenUrl

[53] [53].↵
Smith JS, Nebgen BT, Zubatyuk R, Lubbers N, Devereux C, Barros K, Tretiak S, Isayev O, Roitberg AE. Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning. Nature communications. 2019; 10(1):1–8.
OpenUrl

[54] [54].↵
Bennett CH. Efficient estimation of free energy differences from Monte Carlo data. Journal of Computational Physics. 1976; 22(2):245–268.
OpenUrl CrossRef

[55] [55].↵
Crooks GE. Excursions in statistical dynamics. PhD thesis, Citeseer; 1999.

[56] [56].↵
Shirts MR, Bair E, Hooker G, Pande VS. Equilibrium free energies from nonequilibrium measurements using maximum-likelihood methods. Physical review letters. 2003; 91(14):140601.
OpenUrl CrossRef PubMed

[57] [57].↵
Crooks GE. Nonequilibrium measurements of free energy differences for microscopically reversible Markovian systems. Journal of Statistical Physics. 1998; 90(5-6):1481–1487.
OpenUrl

[58] [58].↵
Shirts MR, Mobley DL, Chodera JD. Alchemical free energy calculations: ready for prime time? Annual reports in computational chemistry. 2007; 3:41–59.
OpenUrl

[59] [59].↵
Wan S, Bhati AP, Zasada SJ, Wall I, Green D, Bamborough P, Coveney PV. Rapid and reliable binding affinity prediction of bromodomain inhibitors: a computational study. Journal of chemical theory and computation. 2017; 13(2):784–795.
OpenUrl

[60] [60].
Harger M, Li D, Wang Z, Dalby K, Lagardère L, Piquemal JP, Ponder J, Ren P. Tinker-OpenMM: Absolute and relative alchemical free energies using AMOEBA on GPUs. Journal of computational chemistry. 2017; 38(23):2047–2055.
OpenUrl

[61] [61].
Lee TS, Cerutti DS, Mermelstein D, Lin C, LeGrand S, Giese TJ, Roitberg A, Case DA, Walker RC, York DM. GPU-accelerated molecular dynamics and free energy methods in Amber18: performance enhancements and new features. Journal of chemical information and modeling. 2018; 58(10):2043–2050.
OpenUrl CrossRef

[62] [62].
Jespers W, Esguerra M, Åqvist J, Gutiérrez-de Terán H. QligFEP: an automated workflow for small molecule free energy calculations in Q. Journal of cheminformatics. 2019; 11(1):26.
OpenUrl

[63] [63].
Suruzhon M, Senapathi T, Bodnarchuk MS, Viner R, Wall ID, Barnett CB, Naidoo KJ, Essex JW. ProtoCaller: Robust Automation of Binding Free Energy Calculations. Journal of Chemical Information and Modeling. 2020; 60(4):1917–1921.
OpenUrl

[64] [64].↵
Gapsys V, Pérez-Benito L, Aldeghi M, Seeliger D, Van Vlijmen H, Tresadern G, de Groot BL. Large scale relative protein ligand binding affinities using non-equilibrium alchemy. Chemical Science. 2020; 11(4):1140–1152.
OpenUrl

[65] [65].↵
Kuhn M, Firth-Clark S, Tosco P, Mey AS, Mackey MD, Michel J. Assessment of Binding Affinity via Alchemical Free Energy Calculations. Journal of Chemical Information and Modeling. 2020;.

[66] [66].↵
Shirts MR, Pande VS. Comparison of efficiency and bias of free energies computed by exponential averaging, the Bennett acceptance ratio, and thermodynamic integration. The Journal of chemical physics. 2005; 122(14):144107.
OpenUrl CrossRef PubMed

[67] [67].↵
Neal R. Annealed importance sampling (Technical Report 9805 (revised)). Department of Statistics, University of Toronto. 1998;.

[68] [68].↵
Nilmeier JP, Crooks GE, Minh DD, Chodera JD. Nonequilibrium candidate Monte Carlo is an efficient tool for equilibrium simulation. Proceedings of the National Academy of Sciences. 2011; 108(45):E1009–E1018.
OpenUrl Abstract/FREE Full Text

[69] [69].↵
Jarzynski C. Nonequilibrium equality for free energy differences. Physical Review Letters. 1997; 78(14):2690.
OpenUrl CrossRef Web of Science

[70] [70].↵
Del Moral P, Doucet A, Jasra A. Sequential monte carlo samplers. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 2006; 68(3):411–436.
OpenUrl

[71] [71].↵
Neal RM. Estimating ratios of normalizing constants using linked importance sampling. arXiv preprint math/0511216. 2005;.

[72] [72].↵
Xu H. Optimal measurement network of pairwise differences. Journal of Chemical Information and Modeling. 2019; 59(11):4720–4728.
OpenUrl

[73] [73].↵
Sellers BD, James NC, Gobbi A. A comparison of quantum and molecular mechanical methods to estimate strain energy in druglike fragments. Journal of chemical information and modeling. 2017; 57(6):1265–1275.
OpenUrl

[74] [74].↵
Rai BK, Sresht V, Yang Q, Unwalla R, Tu M, Mathiowetz AM, Bakken GA. Comprehensive Assessment of Torsional Strain in Crystal Structures of Small Molecules and Protein–Ligand Complexes using ab Initio Calculations. Journal of Chemical Information and Modeling. 2019; 59(10):4195–4208.
OpenUrl

[75] [75].↵
Wei W, Champion C, Liu Z, Barigye SJ, Labute P, Moitessier N. Torsional Energy Barriers of Biaryls Could Be Predicted by Electron Richness/Deficiency of Aromatic Rings; Advancement of Molecular Mechanics toward Atom-Type Independence. Journal of chemical information and modeling. 2019; 59(11):4764–4777.
OpenUrl

[76] [76].↵
Eastman P, Swails J, Chodera JD, McGibbon RT, Zhao Y, Beauchamp KA, Wang LP, Simmonett AC, Harrigan MP, Stern CD, et al. OpenMM 7: Rapid development of high performance algorithms for molecular dynamics. PLoS computational biology. 2017; 13(7):e1005659.
OpenUrl

[77] [77].↵
Zubatyuk R, Smith JS, Leszczynski J, Isayev O. Accurate and transferable multitask prediction of chemical properties with an atoms-in-molecules neural network. Science advances. 2019; 5(8):eaav6490.
OpenUrl FREE Full Text

[78] [78].↵
Schütt KT, Sauceda HE, Kindermans PJ, Tkatchenko A, Müller KR. SchNet–A deep learning architecture for molecules and materials. The Journal of Chemical Physics. 2018; 148(24):241722.
OpenUrl

[79] [79].↵
Unke OT, Meuwly M. PhysNet: A neural network for predicting energies, forces, dipole moments, and partial charges. Journal of chemical theory and computation. 2019; 15(6):3678–3693.
OpenUrl

[80] [80].↵
Glick ZL, Metcalf DP, Koutsoukas A, Spronk SA, Cheney DL, Sherrill CD. AP-Net: An atomic-pairwise neural network for smooth and transferable interaction potentials.. 2020;.

[81] [81].↵
Gilmer J, Schoenholz SS, Riley PF, Vinyals O, Dahl GE. Neural message passing for quantum chemistry. arXiv preprint arXiv:170401212. 2017;.

[82] [82].↵
Scheen J, Wu W, Mey AS, Tosco P, Mackey MD, Michel J. A hybrid Alchemical Free Energy/Machine Learning Methodology for the Computation of Hydration Free Energies. Journal of Chemical Information and Modeling. 2020;.

[83] [83].↵
Ghanakota P, Bos PH, Konze K, Staker J, Marques G, Marshall K, Leswing K, Abel R, Bhat S. Combining Cloud-Based Free Energy Calculations, Synthetically Aware Enumerations and Goal-Directed Generative Machine Learning for Rapid Large Scale Chemical Exploration and Optimization. Journal of Chemical Information and Modeling. 2020;.

[84] [84].↵
Joung IS, Cheatham III TE. Determination of alkali and halide monovalent ion parameters for use in explicitly solvated biomolecular simulations. The journal of physical chemistry B. 2008; 112(30):9020–9041.
OpenUrl

[85] [85].↵
Leimkuhler B, Matthews C. Robust and efficient configurational molecular sampling via Langevin dynamics. The Journal of chemical physics. 2013; 138(17):05B601_1.
OpenUrl

[86] [86].↵
Fass J, Sivak DA, Crooks GE, Beauchamp KA, Leimkuhler B, Chodera JD. Quantifying configuration-sampling error in Langevin simulations of complex molecular systems. Entropy. 2018; 20(5):318.
OpenUrl

[87] [87].↵
Gapsys V, Seeliger D, de Groot BL. New soft-core potential function for molecular dynamics based alchemical free energy calculations. Journal of Chemical Theory and Computation. 2012; 8(7):2373–2382.
OpenUrl

[88] [88].↵
Gao X, Ramezanghorbani F, Isayev O, Smith JS, Roitberg AE. TorchANI: A Free and Open Source PyTorch-Based Deep Learning Implementation of the ANI Neural Network Potentials. Journal of Chemical Information and Modeling. 2020; 60(7):3408–3415. https://doi.org/10.1021/acs.jcim.0c00451, doi: 10.1021/acs.jcim.0c00451, publisher: American Chemical Society.
OpenUrl CrossRef