Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Machine learning assisted ligand binding energy prediction for in silico generated glycosyl hydrolase enzyme combinatorial mutant library

Igor Guranovic, Mohit Kumar, View ORCID ProfileChandra K. Bandi, View ORCID ProfileShishir P. S. Chundawat
doi: https://doi.org/10.1101/2022.11.29.518414
Igor Guranovic
aDepartment of Biomedical Engineering, Rutgers The State University of New Jersey, Piscataway, New Jersey 08854, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mohit Kumar
bDepartment of Chemical and Biochemical Engineering, Rutgers The State University of New Jersey, Piscataway, New Jersey 08854, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chandra K. Bandi
bDepartment of Chemical and Biochemical Engineering, Rutgers The State University of New Jersey, Piscataway, New Jersey 08854, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Chandra K. Bandi
Shishir P. S. Chundawat
bDepartment of Chemical and Biochemical Engineering, Rutgers The State University of New Jersey, Piscataway, New Jersey 08854, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Shishir P. S. Chundawat
  • For correspondence: shishir.chundawat@rutgers.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Molecular docking is a computational method used to predict the preferred binding orientation of one molecule to another when bound to each other to form an energetically stable complex. This approach has been widely used for early-stage small-molecule drug design as well as identifying suitable protein-based macromolecule residues for mutagenesis. Estimating binding free energy, based on docking interactions of protein to its ligand based on an appropriate scoring function is often critical for protein mutagenesis studies to improve the activity or alter the specificity of targeted enzymes. However, calculating docking free energy for a large number of protein mutants is computationally challenging and time-consuming. Here, we showcase an end-to-end computational workflow for predicting the binding energy of pNP-Xylose substrate docked within the substrate binding site for a large library of combinatorial mutants of an alpha-L-fucosidase (TmAfc, PDB ID-2ZWY) belonging to Thermotoga maritima glycosyl hydrolase (GH) family 29. Briefly, in silico combinatorial mutagenesis was performed for the top conserved residues in TmAfc as determined by running multiple sequence alignment against all GH29 family enzyme sequences downloaded from an in-house developed Carbohydrate-Active enZyme (CAZy) database retriever program. The binding energy was calculated through Autodock Vina with pNP-Xylose ligand docking with energy minimized TmAfc mutants, and the data was then used to train a neural network model which was also validated for model predictions using data from Autodock Vina. The current workflow can be adopted for any family of CAZymes to rapidly identify the effect of different mutations within the active site on substrate binding free energy to identify suitable targets for mutagenesis. We anticipate that this workflow could also serve as the starting point for performing more sophisticated and computationally intensive binding free energy calculations to identify targets for mutagenesis and hence optimize use of wet lab resources.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • https://github.com/IgorGuranovic/binding_energy_predictor

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted December 02, 2022.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Machine learning assisted ligand binding energy prediction for in silico generated glycosyl hydrolase enzyme combinatorial mutant library
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Machine learning assisted ligand binding energy prediction for in silico generated glycosyl hydrolase enzyme combinatorial mutant library
Igor Guranovic, Mohit Kumar, Chandra K. Bandi, Shishir P. S. Chundawat
bioRxiv 2022.11.29.518414; doi: https://doi.org/10.1101/2022.11.29.518414
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Machine learning assisted ligand binding energy prediction for in silico generated glycosyl hydrolase enzyme combinatorial mutant library
Igor Guranovic, Mohit Kumar, Chandra K. Bandi, Shishir P. S. Chundawat
bioRxiv 2022.11.29.518414; doi: https://doi.org/10.1101/2022.11.29.518414

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioengineering
Subject Areas
All Articles
  • Animal Behavior and Cognition (4087)
  • Biochemistry (8763)
  • Bioengineering (6479)
  • Bioinformatics (23341)
  • Biophysics (11750)
  • Cancer Biology (9149)
  • Cell Biology (13255)
  • Clinical Trials (138)
  • Developmental Biology (7417)
  • Ecology (11369)
  • Epidemiology (2066)
  • Evolutionary Biology (15087)
  • Genetics (10399)
  • Genomics (14009)
  • Immunology (9121)
  • Microbiology (22040)
  • Molecular Biology (8779)
  • Neuroscience (47368)
  • Paleontology (350)
  • Pathology (1420)
  • Pharmacology and Toxicology (2482)
  • Physiology (3704)
  • Plant Biology (8050)
  • Scientific Communication and Education (1431)
  • Synthetic Biology (2208)
  • Systems Biology (6016)
  • Zoology (1249)