Novel determinants of mammalian primary microRNA processing revealed by systematic evaluation of hairpin-containing transcripts and human genetic variation

  1. Jun Lu1,2,3,8
  1. 1Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06510, USA;
  2. 2Yale Stem Cell Center and Yale Cancer Center, Yale University, New Haven, Connecticut 06520, USA;
  3. 3Yale Center for RNA Science and Medicine, New Haven, Connecticut 06520, USA;
  4. 4Graduate Program in Biological and Biomedical Sciences, Yale University, New Haven, Connecticut 06510, USA;
  5. 5School of Medicine, Yale University, New Haven, Connecticut 06510, USA;
  6. 6Wadsworth Center, New York State Department of Health, Albany, New York 12208, USA;
  7. 7Department of Neurology, Yale School of Medicine, New Haven, Connecticut 06511, USA;
  8. 8Yale Cooperative Center of Excellence in Hematology, Yale University, New Haven, Connecticut 06520, USA
  1. Corresponding author: jun.lu{at}yale.edu
  1. 9 These authors contributed equally to this work.

Abstract

Mature microRNAs (miRNAs) are processed from hairpin-containing primary miRNAs (pri-miRNAs). However, rules that distinguish pri-miRNAs from other hairpin-containing transcripts in the genome are incompletely understood. By developing a computational pipeline to systematically evaluate 30 structural and sequence features of mammalian RNA hairpins, we report several new rules that are preferentially utilized in miRNA hairpins and govern efficient pri-miRNA processing. We propose that a hairpin stem length of 36 ± 3 nt is optimal for pri-miRNA processing. We identify two bulge-depleted regions on the miRNA stem, located ∼16–21 nt and ∼28–32 nt from the base of the stem, that are less tolerant of unpaired bases. We further show that the CNNC primary sequence motif selectively enhances the processing of optimal-length hairpins. We predict that a small but significant fraction of human single-nucleotide polymorphisms (SNPs) alter pri-miRNA processing, and confirm several predictions experimentally including a disease-causing mutation. Our study enhances the rules governing mammalian pri-miRNA processing and suggests a diverse impact of human genetic variation on miRNA biogenesis.

Footnotes

  • Received April 25, 2016.
  • Accepted January 6, 2017.

This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

| Table of Contents

Preprint Server