Saturation mutagenesis reveals manifold determinants of exon definition

  1. Lawrence A. Chasin1
  1. 1Department of Biological Sciences, Columbia University, New York, New York 10027, USA
  2. 2Department of Chemical Engineering, Columbia University, New York, New York 10027, USA
  1. 3 These authors are joint first authors and contributed equally to this work.

  • Present addresses: 4The Ke Lab of Quantitative RNA Biology, The Jackson Laboratory, Bar Harbor, ME 04609, USA; 5INSERM U1127, ICM, Hôpital Pitié-Salpêtrière, 75013 Paris, France; 6School of Medicine, New York Medical College, Valhalla, NY 10595, USA; 7Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, USA; 8Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA

  • Corresponding author: lac2{at}columbia.edu
  • Abstract

    To illuminate the extent and roles of exonic sequences in the splicing of human RNA transcripts, we conducted saturation mutagenesis of a 51-nt internal exon in a three-exon minigene. All possible single and tandem dinucleotide substitutions were surveyed. Using high-throughput genetics, 5560 minigene molecules were assayed for splicing in human HEK293 cells. Up to 70% of mutations produced substantial (greater than twofold) phenotypes of either increased or decreased splicing. Of all predicted secondary structural elements, only a single 15-nt stem–loop showed a strong correlation with splicing, acting negatively. The in vitro formation of exon-protein complexes between the mutant molecules and proteins associated with spliceosome formation (U2AF35, U2AF65, U1A, and U1-70K) correlated with splicing efficiencies, suggesting exon definition as the step affected by most mutations. The measured relative binding affinities of dozens of human RNA binding protein domains as reported in the CISBP-RNA database were found to correlate either positively or negatively with splicing efficiency, more than could fit on the 51-nt test exon simultaneously. The large number of these functional protein binding correlations point to a dynamic and heterogeneous population of pre-mRNA molecules, each responding to a particular collection of binding proteins.

    Footnotes

    • Received December 14, 2016.
    • Accepted November 27, 2017.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    | Table of Contents

    Preprint Server