Markovian Weighted Ensemble Milestoning (M-WEM): Long-time Kinetics from Short Trajectories

Dhiman Ray; Sharon Emily Stone; Ioan Andricioaei

doi:10.1101/2021.06.26.450057

Abstract

We introduce a rare-event sampling scheme, named Markovian Weighted Ensemble Milestoning (M-WEM), which inlays a weighted ensemble framework within a Markovian milestoning theory to efficiently calculate thermodynamic and kinetic properties of long-timescale biomolecular processes from short atomistic molecular dynamics simulations. M-WEM is tested on the Müller-Brown potential model, the conformational switching in alanine dipeptide, and the millisecond timescale protein-ligand unbinding in a trypsin-benzamidine complex. Not only can M-WEM predict the kinetics of these processes with quantitative accuracy, but it also allows for a scheme to reconstruct a multidimensional free energy landscape along additional degrees of freedom which are not part of the milestoning progress coordinate. For the ligand-receptor system, the experimental residence time, association and dissociation kinetics, and binding free energy could be reproduced using M-WEM within a simulation time of a few hundreds of nanoseconds, which is a fraction of the computational cost of other currently available methods, and close to four orders of magnitude less than the experimental residence time. Due to the high accuracy and low computational cost, the M-WEM approach can find potential application in kinetics and free-energy based computational drug design.

Introduction

It is a challenge to quantify with accuracy the kinetics of rare events in molecular biophysics via computational means. Molecular dynamics (MD) simulations provide atomistically detailed movies of the structural and functional dynamics of biological macro-molecules. However, the majority of important dynamic processes in the cell involve broad length and time scales. A large fraction of such processes are rare over the timescale of the simulation. Energy barriers higher than thermal energy trap the simulated system in conformational basins of attraction, impeding proper sampling of all relevant states. Examples of rare processes include protein folding, conformational transitions, ligand binding and unbinding etc., which in most cases involve ~ 10⁴ – 10⁶ atoms including the natural solution environment. Despite phenomenal advances in computing hardware, atomistic MD simulations of such large systems still go, typically, up to multiple microseconds only. This is many orders of magnitude smaller than the timescale relevant to biological function, which is often in the range of seconds to hours.

The current study focuses on protein-ligand interactions; its adequate sampling is pivotal to computer-aided drug design. Molecular dynamics (MD) simulations provide mechanistic insights into such interactions at atomistic detail, and is one of the essential tools in the repertoire of the pharmaceutical research community. A wide range of alchemical free energy calculation methods^1–3 and enhanced sampling methods (involving external biasing force)^4–7 have been developed over the past few decades to calculate the binding free energy of a protein-ligand complex. Although the virtual screening of potential inhibitors is currently based on the binding free energy, the efficacy of a drug molecule is often dependent on the binding and unbinding kinetics or the residence time.^{8, 9} It is difficult to compute the kinetic properties from traditional enhanced sampling simulations, as the dynamics become non-physical due to the application of artificial biases (although there are methods to recover kinetics from simple constant force or constant velocity steered molecular dynamics^10–13). On the other hand, using brute force MD simulation, one needs to sample multiple binding and unbinding events to obtain converged results for kinetic properties. This requires a simulation time many times higher than the timescale of one event, which itself is beyond the reach of even the most powerful supercomputers. This results in a dire need to develop theoretical methods and computational algorithms to make quantitative predictions about the kinetics of long timescale processes such as rare events from short timescale trajectories.

A category of methods involves transition path sampling (TPS), a concept introduced by Pratt¹⁴ and later developed by Chandler and co-workers,^{15, 16} to simulate transitions across energy barriers. Instead of applying external bias, path sampling methods utilize the statistical properties of the unbiased trajectory ensemble to compute experimental observables such as the kinetics of conformational transition or ligand unbinding, as well as molecular scale properties like ligand release pathways and mechanism.¹⁷ A different path sampling approach is the weighted ensemble (WE) method of Huber and Kim,¹⁸ which belongs to a broader category of variance-reduction algorithms that use “splitting” in the framework of Monte Carlo sampling (see, e.g., Kahn¹⁹). The WE method was further developed by Zuckerman, Chong and collaborators (see e.g., Ref.²⁰); it also was established that the weighted ensemble is statistically exact.²¹ In this approach, the conformational space between the initial and final state is discretized into multiple bins and a number of short trajectories are propagated from the starting bin. Trajectory segments are split or merged when they enter a new bin to keep an equal number of trajectories in each bin. Appropriate weights are assigned to the new set of trajectories to conserve the total probability. It allows for the sampling of fast moving but low-weight trajectories that reach the final state well before the mean first passage time; this facilitates the calculation of converged kinetics, free energy and pathways at a relatively low computational cost. With the implementation in the open source software WESTPA, ²² the weighted ensemble method has seen a wide range of applications including folding and conformational transitions in proteins,^23–25 formation of host-guest complexes,²⁶ protein-peptide²⁷ and protein-protein binding,²⁸ ion permeation through protein channels, ²⁹ viral capsid assembly³⁰ and many others. Many new variants, as well as new analysis schemes for the traditional WE approach, have emerged in recent years, including WExplore,³¹ resampling of ensembles by variation optimization (REVO),³² history augmented Markov State Modeling (haMSM),³³ the RED scheme,³⁴ minimal adaptive binning (MAB),³⁵ and micro-bin analysis.³⁶ Particularly, the WExplore and REVO algorithms have been successfully applied to study the pathways and kinetics of protein-ligand dissociation,^{32, 37, 38} even for systems with residence times as high as seconds to minutes.^{39, 40}

Another popular approach to study the kinetics of biophysical rare events is milestoning,^41–43 which belongs to the larger category of trajectory stratification.^44–47 In milestoning, multiple interfaces are placed along a reaction coordinate, and short MD trajectories are propagated in between the interfaces, which thus serve as milestones for the progress of the transition of interest. Analyzing the milestone-to-milestone transition statistics via a statistical framework, ⁴³ the kinetics and free energy profile are estimated. This method has also been implemented in the software tools miles,⁴⁸ ScMile⁴⁹ and SEEKR,⁵⁰ and has been used to study a variety of complex biological problems including protein allosteric transitions,⁵¹ membrane permeation by small molecules,^52–55 protein small molecule interaction,^{50, 56–58} simple ligand-receptor binding, ⁵⁹ peptide transport through protein channels,^{60, 61} DNA protein interaction,⁶² protein conformational dynamics⁶³ etc. Apart from the necessity of having a predefined reaction coordinate, the milestones need to be placed far apart to preserve the assumption of Markovianity. ⁴² This itself increases the computational cost significantly, leaving aside the fact that two independent studies have shown that the majority of the total computational effort in milestoning simulation is spent on sampling along the milestone interfaces to generate starting structures in accordance with the equilibrium distribution.^{60, 64}

A different variant of the milestoning approach has been developed: Markovian Milestoning with Voronoi Tesselation (MMVT),^{55, 65} which removes the necessity of performing additional sampling along milestone interface, reducing the overall computational cost to a large extent. The application of MMVT remained rather limited, being used primarily for studying small molecule transport through transmembrane proteins,^66–69 substrate translocation through ATPase motor, ⁷⁰ and the CO entry in myoglobin. ⁷¹ Only recently, the Markovian milestoning approach has been tested on ligand-receptor binding for crown-ether host-guest complexes and for the dissociation of a benzamidine ligand from the trypsin protein. ⁶⁴ Despite cutting down the computational cost in sampling at the milestone interface, this approach still suffers from the Markovian assumption and can be significantly expensive for complex systems. ⁶⁴

In our previous work, we attempted to improve the milestoning scheme by accelerating transitions between distant milestones via the application of directed wind forces. ⁷² This technique did increase the number of energetically uphill transitions, but the statistical properties of the computed observables were not significantly better. ⁷³ More recently, we proposed the combined Weighted Ensemble Milestoning (WEM) scheme, where we performed WE simulations in between milestones to accelerate the convergence of the transition between adequately spaced milestones. ⁷⁴ The WEM method not only produced accurate prediction of kinetics, free energy and time correlation function for small molecular systems like alanine dipeptide, ⁷⁴ but we could also reproduce protein-ligand binding and unbinding rate constants and binding affinity, previously obtained from 30 μs equilibrium simulation,⁷⁵ in less than 100 ns of WEM simulation. ⁷⁶

Yet, the current methodology and the implementation of WEM have a few drawbacks. First, the sampling of the degrees of freedom perpendicular to the reaction coordinate (RC) is significantly poor, particularly in situations where slow conformational changes of the protein are coupled to the ligand unbinding. ⁷⁶ This can potentially be rectified by using multiple starting states on the milestone interface sampled from long umbrella-sampling simulations at the expense of a manifold increase in the computational cost similar to traditional milestoning. Second, the choice of the milestoning reaction coordinate (RC) is arbitrary and can possibly impact the quality of the results, depending on the complexity of the underlying free energy landscape. Moreover, a major hindrance of the large scale application of WEM technique is the complexity of the simulation protocol, which requires propagating many short trajectories and stopping them upon reaching a nearby milestone. ⁷⁶ It requires frequent monitoring of the trajectory as well as frequent communication to the dynamics engine to stop the propagation if the progress coordinate reaches a particular value; this makes the WEM algorithm particularly inefficient to implement in Graphical Processing Unit (GPU) hardware.

We, thereby, present a novel Markovian Weighted Ensemble Milestoning (M-WEM) approach, in which we combine weighted ensemble with soft-wall⁶⁵ based Markovian Milestoning, in an attempt to mitigate the deficiencies and improve the performance of the weighted ensemble milestoning technique. We first provide a detailed description of the theory of Markovian milestoning and the M-WEM approach. We then show the application of this method to the two-dimensional Müller-Brown potential, the conformational transition of alanine dipeptide, and the dissociation and association of the trypsin-benzamidine complex, a protein ligand system with a residence time beyond millisecond. The choice of the trypsin-benzamidine complex is inspired by the fact that many existing path sampling and enhanced sampling methods have been applied on this system, including Markov State Modeling (MSM),^{77, 78} Metadynamics,⁷⁹ Adaptive Multilevel Splitting (AMS),⁸⁰ Milestoning,⁵⁰ MMVT, ⁶⁴ WExplore, ³⁸ and REVO. ³² So, we compare the accuracy of the results and the performance of M-WEM with these existing techniques, as well as with the experimental rate constants and free energy values obtained by Guillian and Thusius. ⁸¹ We also discuss a new approach to construct multidimensional free energy landscapes via post-analysis of MMVT and M-WEM trajectories obtained using a one-dimensional reaction coordinate, with a potential application in systems were orthogonal degrees of freedom are strongly coupled with the reaction coordinate.

Theory

Markovian Milestoning

The theoretical details of the Markovian milestoning with Voronoi tessellation (MMVT) approach is described elsewhere.^{55, 64, 65} Here, we provide only a brief description relevant to the current work.

In MMVT, the configurational space is discretized into Voronoi cells. A flat bottom potential is applied to each cell with half-harmonic walls placed at each milestone interface, preventing the trajectories from escaping out of the Voronoi cells. For a 1-dimensional reaction coordinate, used here, the flat bottom potential has the expression: where α is the cell index, is the value of the reaction coordinate at the milestone i at the boundary of the cell α, and is the force constant; the total number of cells is Λ and the total number of milestones is M. One or more unbiased trajectories are propagated in each cell. The trajectories which cross the milestone interface are reflected back into the cell by the half harmonic restraint. As a result, the trajectories remain confined into one cell and perform many transitions between the milestones interfaces constituting the boundaries of the cell. The portions of the trajectory outside the cell are to be discarded before performing further analysis. This protocol is referred to as the soft wall restraint^{65, 66} which we adopt in the current work. Alternatively, a hard wall restraint^{55, 64} can also be used where the direction of velocity is switched when a trajectory crosses a milestone.

From these confined trajectories, the transition counts between milestones are recorded. A flux matrix is constructed whose elements are given by: where N_α,β is the number of transitions from cell α to cell β recorded from a trajectory propagated for time T_α in the cell α. The equilibrium probability for each cell (π_α) is the obtained by iteratively solving the linear equation (Eq. 3) in a self-consistent manner under the constraint of a constant total probability of one:

The free energy profile at each cell is then computed as:

For calculating kinetics, the transition matrix and the lifetime vector are constructed, whose elements are computed as follows: where is the number of times a trajectory in cell α collides with milestone j after having last visited milestone i; is the cumulative time the trajectory spends in cell α visiting milestone i and before reaching any other milestone. T is a constant for dimensional consistency, which is not necessary to compute because it cancels out at a later stage. A rate matrix is then defined as:

Considering milestone M is the target milestone, the mean first passage time of the process can be computed as:

is the matrix obtained by deleting the last row and column of Q. 1 is a unit vector with M – 1 elements, and is the vector with entries that are the MFPTs from milestone i to milestone M.

Markovian Weighted Ensemble Milestoning (M-WEM)

In the current work we introduce the Markovian Weighted Ensemble Milestoning (M-WEM) approach, where the conventional MD trajectories in the Markovian milestoning framework are replaced by weighted ensemble simulation. A schematic representation of the M-WEM protocol is depicted in Fig. 1. WE bins are placed along the reaction coordinate in-between the milestone interfaces, as well as along a different coordinate to accelerate sampling along the milestone interface. The additional non-RC coordinate should ideally be locally orthogonal to the RC, but this is not a necessary condition. WE simulation is performed in this 2D progress coordinate space using the recently developed minimal adaptive binning (MAB) scheme. ³⁵ As opposed to the traditional fixed binning scheme, the MAB approach adaptively changes the bin boundaries during the course of simulation, avoiding the requirement of an arbitrarily chosen predefined set of bins. It also provides an increased sampling of the conformational space. As the total number of occupied bins remains virtually unchanged throughout the simulation, the maximum amount of computational resources needed for the simulation can be easily estimated beforehand.³⁵ A stochastic integrator, e.g., for the Langevin equation, is needed to propagate the dynamics to ensure that the new set of trajectories, generated after a splitting event, follow different paths despite emerging from a single parent trajectory.

Figure 1:

A schematic representation of the M-WEM simulation protocol. The thick lines indicate milestones (labeled as milestone index i and i + 1). The dotted lines indicate a WE bin boundaries (which are adapted during the simulation but in this figure we show fixed bins for clarity). Trajectories for different WE iteration is shown in different color scheme: Iteration 1: blue, iteration 2: green, iteration 3: red, and iteration 4: pink. (a) First, WE simulation is performed with harmonic walls placed at the milestone interfaces allowing for the trajectory to bounce back and forth. (b) The propagation history of individual trajectories are traced back from the last iteration (an example trace is highlighted with gray dashed line). (c) The milestone crossing events (gray circles) are recorded from each trace, and are used in subsequent analysis.

Unlike the MMVT approach with conventional MD, the WE trajectories hitting the milestones will have different weights. To properly take into account this effect, we take all the trajectory segments at the last iteration and trace them back to the first iteration to obtain separate trajectory traces. The weight of each trajectory trace is set equal to the weight of the corresponding trajectory segment in the last iteration.

The total number of trajectory traces in cell α (M_α) is equal to the number of occupied bins × the number of trajectories per bin in the final iteration. The elements of the flux matrix k in this formalism are given by: where w_J is the weight of the Jth trajectory trace. , and have similar definitions as in Eq. 2 except that they are computed just from the Jth trace. The equilibrium probability distribution and the free energy profile are computed from the elements of the flux matrix obtained from Eq. 8 using Eq. 3 and 4, respectively.

For calculating kinetics, the N_i,j and the R_i matrix elements are to be constructed taking into account the different weights of the trajectory traces. The new transition matrix element becomes: where has the same definition as in Eq. 5 except it is for the Jth trajectory trace. Now we define a pseudo transition matrix , which is identical to the transition matrix N but computed only from one (Jth) trajectory trace in the cell α. Its elements are given by:

Now the total transition matrix N can be obtained as

Similarly, the lifetime vector R is given by: and is equivalent to but computed only for the Jth trajectory segment. Using the N and R, the rate matrix Q is computed using Eqs. 6 and 7.

Error Analysis

Error analysis of milestoning-based simulations can be performed in a few different ways, primarily by generating an ensemble of rate matrices (Q). Then the desired properties (such as the MFPT) are calculated from many sample matrices and the uncertainty is estimated. When working with transition matrices (as in traditional milestoning), the kernels can be sampled from a beta distribution. ⁶⁰ Similar to our previous work,^{74, 76} we generated the ensemble of rate matrices by sampling from a Bayesian type conditional probability,^{56, 82, 83} given by where p(Q) is a uniform prior, N_ij is the number of trajectories transiting from milestone i to j and , where is the set of all milestones. We sampled the Q matrices from the distribution in Eq. 13 using a non-reversible element exchange Monte-Carlo scheme.⁸⁴ One randomly chosen off-diagonal element and the diagonal element of the corresponding row of Q are updated to generate a new rate matrix Q′. where Δ is a random number sampled from an exponential distribution of range [–Q_ij, ∞) with mean zero. The new matrix Q′ is accepted with a probability equal to min(1, p_accept) where:

As each step modifies only one element, the sampled matrices are highly correlated.⁸⁴ So we only considered the matrices sampled every 50 steps for uncertainty estimation. We recomputed the MFPT obtained from them using Eq. 7 and calculated mean and 95% confidence intervals for all MMVT simulations.

For M-WEM simulations, estimating errors using non-reversible element exchange Monte-Carlo can be difficult. The expression for p_accept (Eq. 15) has exponential dependence on the number of transitions N_ij and number of trajectories reflected from milestone i (N_i). In the case of MMVT simulations all such transitions have equal weight, unlike in M-WEM where the weights of transiting trajectories can vary to a large extent, often over many orders of magnitude. So using the expression in Eq. 15, which counts all transitions with equal importance, one is unable to sample the ensemble of transition matrices accurately in our M-WEM approach. For each observable (MFPT, free energy, etc.), we computed its estimate over an M-WEM simulation of increasing number of total iterations for each cell. We then compute the mean and 95% confidence interval of the sampled observables after the simulation is converged. The uncertainties obtained using this procedure capture the effect of the fluctuations of an observable around the mean after the simulation has converged. Such approach is commonly used in the literature to compute the error estimates of free energy profiles, as discussed, for example, in our previous work Ref.⁸⁵ A more elaborate version of this technique is called block averaging, ⁸⁶ which is a statistically rigorous way to compute error estimates in molecular dynamics simulations.⁸⁷

Specific details of error analysis for each test system is mentioned in the Computational Methods section. For the M-WEM technique, the derivation of a more rigorous approach for error analysis, similar to the one described in Eqs. (13) - (15) will be addressed in future work.

Efficiency Analysis

Because of the difference in the uncertainties of the measured values from different methods, a direct comparison between the simulation time or convergence time may not always correctly represent the relative efficiency between different approaches. To address this issue, we used the algorithmic efficiency metric, η, which takes into account the variance of the data apart from the total simulation time. ⁷⁴ This metric is evaluated as: where A is the measured quantity and ΔA is the uncertainty of the measurement. N_s is the number of force evaluations performed to calculate A (although it can be thought of as the simulation time if expressed in units of time instead of time steps). In brief, η⁻¹ is the number of force evaluations required to measure our observable A with an uncertainty equal to its mean.¹⁸ The lower the value of η⁻¹, the better the efficiency. For the first two systems, the Müller-Brown potential and the alanine dipeptide, we reported the value of η⁻¹ where we calculated the error bars from multiple runs. We did not report η⁻¹ for each individual run because the error of MMVT calculations, computed using Bayesian analysis, and the error bars of the M-WEM approach computed from averaging over iterations are not directly comparable. For the same reason, we do not report an algorithmic efficiency for the trypsin-benzamidine system, for which η will also depend on the quantity we are measuring.

Calculation of k_off and k_on

For the protein-ligand binding problem considered in this paper, we computed the unbinding rate constant (k_off) as where τ is the ligand residence time, which is equivalent to the MFPT of transitioning from the bound state milestone to the unbound state milestone, and which, in turn, can be computed from the milestoning or M-WEM framework described above.

We noted in our previous work that the ligand-binding kinetics is often diffusion dominant. ⁷⁶ So the rate determining step can be the arrival of the ligand on the outermost milestone surface, rather than going from the outermost milestone to the binding pocket. (Whether this is true, for a specific system, should be carefully evaluated on a case-by-case basis.) In our previous work, we delineated an analytical method to combine diffusion theory with milestoning transition kernel integration to compute the binding rate constant k_on.⁷⁶ We begin with the expression of the diffusion-dependent arrival rate of a small molecule on the surface of a sphere of radius r:⁸⁸ where D is the diffusion constant of the ligand in water. We make two modifications to this expression. First, we assume that the only conformations that can lead to binding are in the space explored by the ligand in the outermost milestone. So we scale the rate by the surface coverage factor α. The concept of coverage factor has been described more elaborately in our previous work. ⁷⁶ To mention it only briefly here, α is the fraction of the surface area of the outermost milestone interface that is accessible to the ligand. Details of the calculation of α in this work are slightly different from our previous work. Instead of performing a Monte-Carlo simulation, here we create a 2D grid on the interface of the outermost milestone based on the solid angles. Then we calculate how many of those bins are occupied by the center of mass of the ligand in the outermost milestoning cell. The coverage factor α is then computed as where δ_occ(θ, ϕ) = 1, if the surface element ΔθΔϕ contains the angular coordinate of the ligand, and zero otherwise.

Second, we consider that a ligand can only reach the bound state if it moves towards the protein from the outermost milestone surface. So we further scale the arrival rate by K_M,M–1, the transition probability of going from the last (Mth) milestone to the previous milestone. But as the transitional kernel is not directly available in the current Markovian milestoning framework, we compute this inward transition probability from the N matrix.

As N_M,M+1 becomes undefined when M is the last milestone, we perform this analysis with the last but one milestone. Including these factors, the flux of binding trajectories through that milestone becomes from which k_on is calculated in M⁻¹s⁻¹ units by multiplying the flux with Avogadro’s number N_av

In the last step we used the diffusion constant of small molecule in cm² s⁻¹ unit and the r is provided in Å.

Reconstruction of free energy landscape

The trajectories confined in the different Voronoi cells can be used to construct a higher dimensional free energy landscape, both for Markovian milestoning and M-WEM simulations. The trajectory data is first histogrammed in appropriate collective variables to obtain different independent histograms for each individual cell Λ. A weighted sum of these histograms are then performed to obtain the equilibrium high dimensional probability distribution where p(R) is the probability density along the collective variable space R, and p^α(R) is the same obtained from the histogram only in the cell α. In case of M-WEM the p^α(R) is obtained from multiple (M_α) trajectories of different weights as: where is the histogram of the Jth trajectory in cell α in the CV space R. The free energy landscape is then reconstructed using,

We demonstrate this approach for alanine dipeptide in the Results section.

Committor

The committor of a point in a conformational space is the probability of a trajectory starting from that point to reach the final state before visiting the initial state.⁸⁹ Recent work by Elber et al. established that it is possible to calculate the committor at the milestone interfaces.⁴⁸ But, to the best of our knowledge, such approach has not been applied to Markovian milestoning techniques so far. We performed the committor calculation in the following way. First, we constructed a transition kernel (K), equivalent to conventional milestoning, from the N matrix:

The transition kernel K is then modified with a boundary condition which ensures that the flux through the final state will remain “absorbed” there and will not return to the previous milestones. This is achieved by replacing the last row of the K matrix with zeros, except for the element corresponding to the last milestone for which the value is one.^{48, 90} For a three-milestone model, this can be illustrated as:

The vector containing the committor values of each milestone, C, is then calculated as: where e_p is a unit vector, all elements of which, are zero except for the one corresponding to the final milestone. Multiple powers of K_C are computed numerically until the committor converges. The results are considered to be converged when the change in the norm of the C vector to be less than 10⁻³.

Computational Methods

We tested the Markovian Weighted Ensemble Milestoning approach on a toy model system of 2D Müller-Brown potential, conformational transition in alanine dipeptide and on the millisecond timescale protein-ligand unbinding in the trypsin-benzamidine system. In the first two systems, we performed long equilibrium simulation and Markovian milestoning simulation to compare with our M-WEM results.

Müller-Brown Potential

The two-dimensional Müller-Brown potential⁹¹ is defined as where A ∈ {−200, −100, −170, 15}, α ∈ { −1, – 1, −6.5, 0.7}, b ∈ {0, 0, 11, 0.6}, c ∈ { – 10, −10, −6.5, 0.7}, x₀ ∈ {1, 0, −0.5, −1}, y₀ ∈ {0, 0.5, 1.5, 1}, and h = 0.04. This system has a non-linear transition path with barrier height about 4.5 k_BT. For the purpose of this model, we set k_BT = 1.

To obtain a benchmark of the kinetics, a long overdamped Langevin dynamics simulation was propagated starting from (x, y) = (−0.5, 1.5), which corresponds to the minimum A in Figure 2. The minimum B is chosen to be the target state. The simulation was propagated for 10⁷ time steps, capturing 347 back and forth transitions. The free energy landscape obtained from this equilibrium trajectory is depicted in Figure 2. The mean first passage time (MFPT) to go from minima A to B is depicted in Table 1.

Figure 2:

The free energy landscape of the Müller Brown potential explored using 10⁷ steps of over-damped Langevin dynamics simulation. The position of the milestones, used in MMVT and M-WEM calculations, are shown in black lines. The two minima relevant to this study are marked as A and B.

View this table:

Table 1:

Results of conventional Langevin dynamics, MMVT and M-WEM simulations for the Müller Brown potential

For milestoning simulations, 8 milestones are placed at y = {−0.3, 0.0, 0.3, 0.6, 0.9, 1.2, 1.5, 1.8} at equal intervals. The reaction coordinate is chosen to be parallel to the y axis. This poor choice of RC was made intentionally to represent realistic situations where the arbitrarily chosen empirical RCs are used to study complex biomolecular processes with many coupled degrees of freedom. MFPTs were computed for transition from y = 1.5 to y = 0.0 for both the milestoning based methods.

For Markovian milestoning (MMVT) simulations, an overdamped Langevin dynamics simulation is propagated in 2D in all the seven cells in the spacing between 8 milestones. A half-harmonic wall is applied at both ends of the cell (milestones) with a force constant , to confine the trajectories within the cell. Each simulation is propagated for 1.5 × 10⁶ steps in each cell. Transition statistics between each milestone pair is computed using the method described in theory section, and the MFPT has been computed.

For the M-WEM method, the procedure remained identical to the MMVT approach, except weighted ensemble simulation was performed in between the milestone interfaces as opposed to conventional dynamics. The 2D adaptive binning (MAB)³⁵ was employed along x and y directions with 5 bins per dimension with 4 trajectory segments per bin. This led to 33 bins in total, including the additional bins in each direction for the most forward, backward and the bottleneck trajectories.³⁵ A total of 300 iterations of WE simulation are performed in each cell, with a recycling interval of 10 steps. The transition rate matrices and MFPTs were computed every 10 iterations of WE simulation in each cell between iteration 260 and 300. The mean and error estimates were performed on the 5 sampled data points for the MFPT values between iteration 260 and 300.

Alanine Dipeptide

The conformational transition of alanine dipeptide was simulated in the gas phase. The system was set up following the protocol described by Wei and Elber. ⁴⁹ The 22-atom system is modelled using the CHARMM22 force field⁹² with a 10 Å cut-off distance for the inter-atomic interactions. A time step of 0.5 fs was used and all the bonds between heavy atoms and hydrogen were constrained using the SHAKE algorithm.⁹³ All simulations were performed using the NAMD 2.13 package⁹⁴ with the colvars⁹⁵ module. The conformational change can be described adequately in the 2D coarse space of two backbone torsion angles Φ and Ψ. Half harmonic walls with a mild force constant (0.04 kcal mol⁻¹ deg⁻²) are placed at the value of ±175 for both the Φ and Ψ angles to avoid transitions along the edges of the free energy surface (i.e., to remove periodicity).⁴⁹ This will ensure that we observe the transition along the center of the free energy map.

The barrier height for the conformational transition of gas-phase alanine dipeptide is very high (> 10 kcal/mol) and it is very difficult to observe direct transitions at room temperature. Following the earlier work⁴⁹ we performed the simulations at 600K temperature which allowed us to sample 285 transitions between the two free energy minima in 500 ns equilibrium simulation.

For both MMVT and M-WEM simulation, the reaction coordinate was chosen to be the Φ dihedral angle and milestones were placed at Φ = −80°, −60°, −40°, −20°, 0°, 20°, 40°, 60°, and 80°. The initial and final states were chosen to be the milestones at Φ = −80° and Φ = 60°. In case of MMVT simulation, 5 ns of conventional MD simulation was propagated in each cell confined between the two consecutive milestones, leading to a total computational effort of 40 ns. (The trajectories were extended to 10 ns with no difference in results. So the result of 5 ns simulation is reported). A force constant of 4 kcal mol⁻¹ deg⁻² were applied in the harmonic walls placed at the milestones, to confine the trajectories in between the milestones. The portion of the trajectories outside the cell has been removed prior to further analysis.

In case of M-WEM simulations, 5 WE bins were placed for each cell in Φ and Ψ coordinates leading up to 33 bins in total including separate bins for the forward, backward and the bottleneck trajectories. Four trajectory segments were propagated in each occupied bin. The progress coordinate values were recorded at very frequent interval (10 fs) to record the time of milestone crossings as accurately as possible. A total of 100 iterations of WE simulation are performed in each cell, with a recycling time of 1 ps. The transition rate matrices and MFPTs were computed every 2 iterations of WE simulation in each cell between iteration 2 and 10 and every 10 iterations between iteration 10 and 100, to monitor the convergence of the results. The convergence plots and related discussion is provided in the supporting information. The mean and error estimates were performed on the 5 sampled data points for the MFPT values between iteration 60 and 100.

Trypsin-Benzamidine Complex

The system setup for the trypsin-benzamidine complex is identical to the work by Votapka et al. ⁵⁰ The structure, parameter and topology files were obtained from the authors of Ref. 50. We point the reader to their original publication ⁵⁰ for more details. To mention briefly, the atomic coordinates of the protein-ligand complex were obtained from Protein Data Bank (PBD) PDB ID: 3PTB.⁹⁶ The protonation states of ASP, GLU and HIS residues were determined at pH 7.7 which was used in this study to replicate the experimental condition. ⁸¹ The protein was modelled using AMBER ff14SB force field⁹⁷ and Generalized Amber Force Field (GAFF)⁹⁸ parameters were used for the ligand. The structure was solvated in a truncated octahedron box of TIP4Pew⁹⁹ water molecules and 8 Cl⁻ ions were added to neutralize the system. Overall the system contains ~23000 atoms. All MD simulations were performed using NAMD 2.14b2 package⁹⁴ with a time step of 2 fs. A Langevin integrator with a damping coefficient of 5 ps⁻¹ was used to keep the temperature constant at 298 K. A Langevin piston was used to maintain the pressure at 1 atm.

The bound state structure was equilibrated for 10 ns in NPT ensemble. From the end point of this simulation, the ligand was pulled out of the binding pocket using a 10 ns steered molecular dynamics (SMD) simulation. The reaction coordinate (RC) description is identical to previous work,⁵⁰ i.e. the center of mass distance between the benzamidine ligand and the C_α atoms of the following residues near the binding pocket: 190, 191, 192, 195, 213, 215, 216, 219, 220, 224, and 228 (numbered according to PDB: 3PTB). During the SMD simulation, a moving harmonic restraint of 1 kcal mol⁻¹ Å⁻² was applied on the RC with a pulling velocity of ~ 1.5 Å/ns. The collective variables were biased and monitored using the colvars module. ⁹⁵ Representative structures for seeding the milestoning simulations were sampled from the SMD trajectory.

Concentric spherical milestones were placed at the following values of the RC: 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 5.0, 6.0, 8.0, 10.0, 12.0, and 13.0 Å. These values are similar to previous studies^{50, 64} except for a few additional milestones, as we were unable to observe energetically uphill transitions otherwise. The separation between milestones should be such that the transition timescales between one milestone to the other should be larger than the decay time of the velocity auto-correlation function of the RC. We checked this condition in our system, as discussed in detail in the Supporting Information.

The following distinction is worth noting at this point. As in Markov State Modeling (which assumes the formalism of continuous-time Markov chains) the transitions between conformational macrostates¹⁰⁰ are to be Markovian. However, the dynamics inside the macrostates delimited by the milestones may not necessarily be Markovian, and it is for that dynamics that we check for the decay of velocity autocorrelation. Milestoning theory is built upon two key assumptions: the reaction channels are localized and the committor can be represented as a function of the reaction coordinate only. A sufficient condition to make these assumptions valid is that the dynamics be overdamped. To ensure this condition, milestones need to be placed sufficiently far from each other such that the timescale of transitions between milestones is higher than the decorrelation time of the velocity along the reaction coordinate.⁸³

A total of 12 cells were constructed in the spacing between 13 milestones. For each cell, a flat bottom potential (Eqn. 1) is applied with a force constant of 100 kcal mol⁻¹ Å⁻² for the harmonic walls present at the milestones. First, the representative structure (sampled from SMD) is equilibrated at the center of the cell for 1 ns by restraining the RC via a harmonic potential. The force constant was gradually increased to 500 kcal mol⁻¹ Å⁻² over the first 500 ps and kept constant over the last 500 ps. From the end point of the 1 ns equilibration Weighted ensemble (WE) simulations were propagated for 300 iterations with a recycle time δt of 2 ps. A two-dimensional MAB scheme was used for the binning. The two progress coordinates were the RC and the RMSD of the ligand with respect to the representative structure (sampled from SMD) corresponding to the specific cell. The progress coordinates were recorded using the colvars module. ⁹⁵ The total computational cost of the M-WEM simulation was approximately 734 ns. The simulation was stopped at multiple points, at an interval of 10 WE iterations between 30 and 300 iterations for each cell. For each set, the trajectory traces were computed using which equilibrium probabilities, free energy profiles and MFPTs between the first milestone (at 1 Å) and the last milestone (at 13 Å) (residence time) was computed. This allowed us to monitor the convergence of residence time over the course of the simulation. The unbinding rate constant k_off was calculated as the inverse of residence time. Free energy profile, binding rate constant k_on and committors were calculated following the procedure described in the Theory section. The error bars for all quantities were calculated from the last five iterations sampled (i.e. iteration 160-200 for values reported at iteration 200 and iteration 260-300 for values reported at iteration 300). Before any calculation, the probabilities of the voronoi cells are modified to take into account the Jacobian factor appearing due to the different surface area of milestones with different radius: where r_α is the radius of the cell α which we choose to be the radius of a sphere equidistant from the two milestones surrounding the cell.