Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Structure prediction of protein-ligand complexes from sequence information with Umol

View ORCID ProfilePatrick Bryant, Atharva Kelkar, Andrea Guljas, Cecilia Clementi, View ORCID ProfileFrank Noé
doi: https://doi.org/10.1101/2023.11.03.565471
Patrick Bryant
1Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Patrick Bryant
  • For correspondence: [email protected]
Atharva Kelkar
1Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Andrea Guljas
2Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Cecilia Clementi
2Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Frank Noé
1Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
2Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
3Microsoft Research AI4Science, Karl-Liebknecht Str. 32, 10178 Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Frank Noé
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Protein-ligand docking is an established tool in drug discovery and development to narrow down potential therapeutics for experimental testing. However, a high-quality protein structure is required and often the protein is treated as fully or partially rigid. Here we develop an AI system that can predict the fully flexible all-atom structure of protein-ligand complexes directly, given a multiple sequence alignment representation of the protein and a SMILES string representing the ligand. At a high accuracy threshold, unseen protein-ligand complexes can be predicted more accurately than for RoseTTAFold-AA, and at medium accuracy even classical docking methods that use known protein structures as input are surpassed. The high accuracy presented here suggests that the goal of AI-based drug discovery is one step closer, but there is still a way to go to fully grasp the complexity of protein-ligand interactions. Umol is available at: https://github.com/patrickbryant1/Umol

Competing Interest Statement

The authors have declared no competing interest.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted November 05, 2023.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Structure prediction of protein-ligand complexes from sequence information with Umol
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Structure prediction of protein-ligand complexes from sequence information with Umol
Patrick Bryant, Atharva Kelkar, Andrea Guljas, Cecilia Clementi, Frank Noé
bioRxiv 2023.11.03.565471; doi: https://doi.org/10.1101/2023.11.03.565471
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Structure prediction of protein-ligand complexes from sequence information with Umol
Patrick Bryant, Atharva Kelkar, Andrea Guljas, Cecilia Clementi, Frank Noé
bioRxiv 2023.11.03.565471; doi: https://doi.org/10.1101/2023.11.03.565471

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (6022)
  • Biochemistry (13708)
  • Bioengineering (10436)
  • Bioinformatics (33157)
  • Biophysics (17109)
  • Cancer Biology (14173)
  • Cell Biology (20106)
  • Clinical Trials (138)
  • Developmental Biology (10868)
  • Ecology (16018)
  • Epidemiology (2067)
  • Evolutionary Biology (20346)
  • Genetics (13395)
  • Genomics (18634)
  • Immunology (13750)
  • Microbiology (32164)
  • Molecular Biology (13392)
  • Neuroscience (70069)
  • Paleontology (526)
  • Pathology (2190)
  • Pharmacology and Toxicology (3741)
  • Physiology (5864)
  • Plant Biology (12020)
  • Scientific Communication and Education (1814)
  • Synthetic Biology (3367)
  • Systems Biology (8166)
  • Zoology (1841)