Abstract
Currently, bioimaging databases cannot be queried by chemical structures that induce the phenotypic effects captured by the image. We present a novel retrieval system based on contrastive learning that is able to identify the chemical structure inducing the phenotype out of ~2,000 candidates with a top-1 accuracy >70 times higher than a random baseline.
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
Updated Acknowledgments.
Copyright
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.