Genome-wide search for novel human uORFs and N-terminal protein extensions using ribosomal footprinting

  1. Mario Brosch1,4
  1. 1Department of Internal Medicine I, University Hospital Schleswig Holstein, 24105 Kiel, Germany;
  2. 2Institute for Medical Informatics and Statistics, Christian-Albrechts University Kiel, 24118 Kiel, Germany;
  3. 3Genome Analysis, Fritz Lipmann Institute for Age Research, 07745 Jena, Germany
    1. 4 These authors contributed equally to this work.

    Abstract

    So far, the annotation of translation initiation sites (TISs) has been based mostly upon bioinformatics rather than experimental evidence. We adapted ribosomal footprinting to puromycin-treated cells to generate a transcriptome-wide map of TISs in a human monocytic cell line. A neural network was trained on the ribosomal footprints observed at previously annotated AUG translation initiation codons (TICs), and used for the ab initio prediction of TISs in 5062 transcripts with sufficient sequence coverage. Functional interpretation suggested 2994 novel upstream open reading frames (uORFs) in the 5′ UTR, 1406 uORFs overlapping with the coding sequence, and 546 N-terminal protein extensions. The TIS detection method was validated on the basis of previously published alternative TISs and uORFs. Among primates, TICs in newly annotated TISs were significantly more conserved than control codons, both for AUGs and near-cognate codons. The transcriptome-wide map of novel candidate TISs derived as part of the study will shed further light on the way in which human proteome diversity is influenced by alternative translation initiation and regulation.

    Footnotes

    • Received February 22, 2012.
    • Accepted August 8, 2012.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported License), as described at http://creativecommons.org/licenses/by-nc/3.0/.

    | Table of Contents

    Preprint Server