Abstract
Protein-ligand docking is an established tool in drug discovery and development to narrow down potential therapeutics for experimental testing. However, a high-quality protein structure is required and often the protein is treated as fully or partially rigid. Here we develop an AI system that can predict the fully flexible all-atom structure of protein-ligand complexes directly, given a multiple sequence alignment representation of the protein and a SMILES string representing the ligand. At a high accuracy threshold, unseen protein-ligand complexes can be predicted more accurately than for RoseTTAFold-AA, and at medium accuracy even classical docking methods that use known protein structures as input are surpassed. The high accuracy presented here suggests that the goal of AI-based drug discovery is one step closer, but there is still a way to go to fully grasp the complexity of protein-ligand interactions. Umol is available at: https://github.com/patrickbryant1/Umol
Competing Interest Statement
The authors have declared no competing interest.