Abstract
R2 retrotransposons are model site-specific eukaryotic non-LTR retrotransposons that copy-and-paste into gene loci encoding ribosomal RNAs. Recently we demonstrated that avian A-clade R2 proteins achieve efficient and precise insertion of transgenes into their native safe-harbor loci in human cells. The features of A-clade R2 proteins that support gene insertion are not characterized. Here, we report high resolution cryo-electron microscopy structures of two vertebrate A-clade R2 proteins, avian and testudine, at the initiation of target-primed reverse transcription and one structure after cDNA synthesis and second strand nicking. Using biochemical and cellular assays we discover the basis for high selectivity of template use and unique roles for each of the expanded A-clade zinc-finger domains in nucleic acid recognition. Reverse transcriptase active site architecture is reinforced by an unanticipated insertion motif in vertebrate A-clade R2 proteins. Our work brings first insights to A-clade R2 protein structure during gene insertion and enables further improvement and adaptation of R2-based systems for precise transgene insertion.
Competing Interest Statement
K.C. is an equity holder and scientific advisor for Addition Therapeutics, Inc., using a retrotransposon-based genome engineering technology.
Footnotes
Revision includes the Methods section and Supplementary Information