PT - JOURNAL ARTICLE AU - Benjamin M. David AU - Ryan M. Wyllie AU - Ramdane Harouaka AU - Paul A. Jensen TI - A Reinforcement Learning Framework for Pooled Oligonucleotide Design AID - 10.1101/2021.08.25.455853 DP - 2021 Jan 01 TA - bioRxiv PG - 2021.08.25.455853 4099 - http://biorxiv.org/content/early/2021/08/27/2021.08.25.455853.short 4100 - http://biorxiv.org/content/early/2021/08/27/2021.08.25.455853.full AB - The goal of oligonucleotide (oligo) design is to select oligos that optimize a set of design criteria. Oligo design problems are combinatorial in nature and require computationally intensive models to evaluate design criteria. Even relatively small problems can be intractable for brute-force approaches that test every possible combination of oligos, so heuristic approaches must be used to find near-optimal solutions. We present a general reinforcement learning framework, called OligoRL, to solve oligo design problems with complex constraints. OligoRL allows “black-box” design criteria and can be adapted to solve many oligo design problems. We highlight the flexibility of OligoRL by building tools to solve three distinct design problems: 1.) finding pools of random DNA barcodes that lack restriction enzyme recognition sequences (CutFreeRL); 2.) compressing large, non-degenerate oligo pools into smaller degenerate ones (OligoCompressor); and 3.) finding Not-So-Random hexamer primer pools that avoid rRNA and other unwanted transcripts during RNA-seq library preparation (NSR-RL). OligoRL demonstrates how reinforcement learning offers a general solution for complex oligo design problems. OligoRL and its associated software tools are available as a Julia package at http://jensenlab.net/tools.Competing Interest StatementThe authors have declared no competing interest.