Abstract
We present a novel combination of generative and predictive machine learning models for discovering unique protein inhibitors. The new method is assessed on its ability to generate unique inhibitors for the cancer associated protein kinase, CDK9. We validate our method by performing biochemical assays, attaining a hit rate of more than 10%, demonstrating the method to be a notable improvement upon a more standard, and somewhat naive approach. Moreover, we imposed the additional challenge of finding inhibitors that are readily synthesized. Importantly, two new inhibitors are found, with one being distinct from reported CDK9 inhibitors. We discuss the results in the context of modern machine learning principles and the desire expressed by the rational drug design community to secure molecules that are structurally different, yet with high binding affinities, to structurally determined protein targets.