PT - JOURNAL ARTICLE AU - Jonas Koeppel AU - Elin Madli Peets AU - Juliane Weller AU - Ananth Pallaseni AU - Fabio Liberante AU - Leopold Parts TI - Predicting efficiency of writing short sequences into the genome using prime editing AID - 10.1101/2021.11.10.468024 DP - 2021 Jan 01 TA - bioRxiv PG - 2021.11.10.468024 4099 - http://biorxiv.org/content/early/2021/11/10/2021.11.10.468024.short 4100 - http://biorxiv.org/content/early/2021/11/10/2021.11.10.468024.full AB - Short sequences can be precisely written into a selected genomic target using prime editing. This ability facilitates protein tagging, correction of pathogenic deletions, and many other exciting applications. However, it remains unclear what types of sequences prime editors can easily insert, and how to choose optimal reagents for a desired outcome. To characterize features that influence insertion efficiency, we designed a library of 2,666 sequences up to 69 nt in length and measured the frequency of their insertion into four genomic sites in three human cell lines, using different prime editor systems. We discover that insertion sequence length, nucleotide composition and secondary structure all affect insertion rates, and that mismatch repair proficiency is a strong determinant for the shortest insertions. Combining the sequence and repair features into a machine learning model, we can predict insertion frequency for new sequences with R = 0.69. The tools we provide allow users to choose optimal constructs for DNA insertion using prime editing.Competing Interest StatementThe authors have declared no competing interest.