TY  - JOUR
T1  - Comprehensive scanning mutagenesis of human retrotransposon LINE-1 identifies motifs essential for function
JF  - bioRxiv
DO  - 10.1101/721357
SP  - 721357
AU  - Emily M. Adney
AU  - Matthias T. Ochmann
AU  - Srinjoy Sil
AU  - David M. Truong
AU  - Paolo Mita
AU  - Xuya Wang
AU  - David J. Kahler
AU  - David Fenyö
AU  - Liam J. Holt
AU  - Jef D. Boeke
Y1  - 2019/01/01
UR  - http://biorxiv.org/content/early/2019/08/01/721357.abstract
N2  - Long Interspersed Nuclear Element-1 (LINE-1, L1) is the only autonomous active transposable element in the human genome. The L1-encoded proteins ORF1p and ORF2p enable the element to jump from one locus to another via a “copy and paste” mechanism. ORF1p is an RNA-binding protein and ORF2p has endonuclease and reverse transcriptase activities. The huge number of truncated L1 remnants in the human genome suggests that the host has likely evolved mechanisms to prevent full L1 replication and thereby decrease the proliferation of active elements and reduce the mutagenic potential of L1. In turn, L1 appears to have a minimized length to increase the probability of successful full-length replication. This streamlining would be expected to lead to high information density. Here, we describe the construction and initial characterization of a library of 538 consecutive trialanine substitutions that scan along ORF1p and ORF2p to identify functionally important regions. In accordance with the streamlining hypothesis, retrotransposition was overall very sensitive to mutations in ORF1p and ORF2p; only 16% of trialanine mutants retained near-wild-type activity. All ORF1p mutants formed near-wild-type levels of mRNA transcripts and 75% formed near-wild-type levels of protein. Two ORF1p mutants present a unique nucleolar-relocalization phenotype. Regions of ORF2p that are sensitive to mutagenesis but lack phylogenetic conservation were also identified. We provide comprehensive information on the regions most critical to retrotransposition. This resource will guide future studies of intermolecular interactions that form with RNA, proteins and target DNA throughout the L1 life cycle.
ER  - 