TRUB1 is the predominant pseudouridine synthase acting on mammalian mRNA via a predictable and conserved code

  Schraga Schwartz (1)

  1. Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel;
  2. Broad Institute, Cambridge, Massachusetts 02142, USA;
  3. Department of Computer Science and Applied Math, Weizmann Institute of Science, Rehovot 76100, Israel;
  4. Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot 76100, Israel
  5. These authors contributed equally to this work.

  • Corresponding author: schwartz{at}weizmann.ac.il
  • Abstract

    Following synthesis, RNA can carry over 100 chemically distinct modifications, which can potentially regulate RNA expression post-transcriptionally. Pseudouridine (Ψ) was recently established to be widespread and dynamically regulated on yeast mRNA, but less is known about Ψ presence, regulation, and biogenesis in mammalian mRNA. Here, we sought to characterize the Ψ landscape on mammalian mRNA, to identify the main Ψ-synthases (PUSs) catalyzing Ψ formation, and to understand the factors governing their specificity toward selected targets. We first developed a framework allowing analysis, evaluation, and integration of Ψ mappings, which we applied to >2.5 billion reads from 30 human samples. These maps, complemented with genetic perturbations, allowed us to uncover TRUB1 and PUS7 as the two key PUSs acting on mammalian mRNA and to computationally model the sequence and structural elements governing the specificity of TRUB1, achieving near-perfect prediction of its substrates (AUC = 0.974). We then validated and extended these maps and the inferred specificity of TRUB1 using massively parallel reporter assays in which we monitored Ψ levels at thousands of synthetically designed sequence variants comprising either the sequences surrounding pseudouridylation targets or systematically designed mutants perturbing RNA sequence and structure. Our findings provide an extensive and high-quality characterization of the transcriptome-wide distribution of pseudouridine in humans and of the factors governing it, and they offer an important resource for the community, paving the path toward functional and mechanistic dissection of this emerging layer of post-transcriptional regulation.
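For readers unfamiliar with the AUC figure quoted above, the following minimal sketch shows how an area under the ROC curve can be computed for a binary substrate classifier. It is an illustration only: the function name `roc_auc`, the toy labels, and the toy scores are hypothetical and are not taken from the study, and the pairwise (rank-based) formulation used here is a standard definition of AUC, not necessarily the procedure the authors followed.

```python
def roc_auc(labels, scores):
    """Rank-based AUC: the probability that a randomly chosen positive
    receives a higher score than a randomly chosen negative.
    Tied scores count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = site is a substrate, 0 = site is not.
toy_labels = [1, 1, 1, 0, 0, 0, 0, 1]
# Hypothetical model scores (higher = more likely to be a substrate).
toy_scores = [0.95, 0.80, 0.60, 0.70, 0.30, 0.20, 0.10, 0.90]

toy_auc = roc_auc(toy_labels, toy_scores)
print(f"toy AUC = {toy_auc:.4f}")
```

An AUC of 1.0 means every true substrate outranks every non-substrate, and 0.5 corresponds to random ranking, so a value of 0.974 indicates that substrates are ranked above non-substrates in almost all pairs.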

    Footnotes

    • Received March 27, 2016.
    • Accepted December 15, 2016.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
