ABSTRACT
We use a combination of Brownian dynamics (BD) simulation results and Deep Learning (DL) strategies for rapid identification of large structural changes caused by missense mutations in intrinsically disordered proteins (IDPs). 2000 IDP sequences from DisProt database of length 20 −300 are used to obtain gyration radii from BD simulation on a coarse-grained single bead amino acid model (HPS model) used by us and others [Seth et al. J. Chem. Phys. 160, 014902 (2024), Dignon et al. PLOS Comp. Biology, 14, 2018, Tesei et al. PNAS, 118, 2021] to generate the training sets for the DL algorithm. Using the gyration radii ⟨Rg ⟩ of the simulated IDPs as the training set, we develop a multilayer perceptron neural net (NN) architecture that predicts the gyration radii of 33 IDPs previously studied using BD simulation with 95% accuracy from the sequence and the corresponding parameters from the HPS model. We now utilize this NN to predict gyration radii of every permutation of missense mutations in IDPs. Our approach successfully identifies mutation-prone regions that induce significant alterations in the radius of gyration when compared to the wild-type IDP sequence. We further validate the prediction by running BD simulations on the subset of identified mutants. The neural network yields a (104 − 105)-fold faster computation in the search space for potentially harmful mutations. Our findings have substantial implications for rapid identification and understanding diseases related to missense mutations in IDPs and for the development of potential therapeutic interventions. The method can be extended to accurate predictions of other mutation effects in disordered proteins.
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
We revised the abstract slightly, added a few references (based on the feed back from the current version) and added a couple of sentences about the new references.