A large fraction of extragenic RNA pol II transcription sites overlap enhancers

PLoS Biol. 2010 May 11;8(5):e1000384. doi: 10.1371/journal.pbio.1000384.

Abstract

Mammalian genomes are pervasively transcribed outside mapped protein-coding genes. One class of extragenic transcription products is represented by long non-coding RNAs (lncRNAs), some of which result from Pol II transcription of bona fide RNA genes. Whether all lncRNAs described so far are products of RNA genes, however, is still unclear. Here we have characterized transcription sites located outside protein-coding genes in a highly regulated response, macrophage activation by endotoxin. Using chromatin signatures, we could unambiguously classify extragenic Pol II binding sites as belonging to either canonical RNA genes or transcribed enhancers. Unexpectedly, 70% of extragenic Pol II peaks were associated with genomic regions with a canonical chromatin signature of enhancers. Enhancer-associated extragenic transcription was frequently adjacent to inducible inflammatory genes, was regulated in response to endotoxin stimulation, and generated very low abundance transcripts. Moreover, transcribed enhancers were under purifying selection and contained binding sites for inflammatory transcription factors, thus suggesting their functionality. These data demonstrate that a large fraction of extragenic Pol II transcription sites can be ascribed to cis-regulatory genomic regions. Discrimination between lncRNAs generated by canonical RNA genes and products of transcribed enhancers will provide a framework for experimental approaches to lncRNAs and help complete the annotation of mammalian genomes.
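
The classification step summarized above can be illustrated with a minimal sketch. The abstract does not state which histone marks or thresholds define the chromatin signatures, so everything specific here is an assumption made for illustration: the use of H3K4me1 and H3K4me3 signal, the `Peak` type, the `classify` and `enhancer_fraction` helpers, the `ratio_cutoff` parameter, and the toy values. It is not the authors' pipeline.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Peak:
    """An extragenic Pol II peak with hypothetical normalized histone-mark signals."""
    name: str
    h3k4me1: float  # assumed enhancer-associated mark
    h3k4me3: float  # assumed promoter-associated mark


def classify(peak: Peak, ratio_cutoff: float = 1.0) -> str:
    """Label a peak by comparing the two assumed marks.

    A peak is called enhancer-like when its H3K4me1 signal exceeds
    ratio_cutoff times its H3K4me3 signal, gene-like otherwise.
    Peaks with no signal for either mark are left unclassified.
    """
    if peak.h3k4me1 <= 0 and peak.h3k4me3 <= 0:
        return "unclassified"
    if peak.h3k4me1 > ratio_cutoff * peak.h3k4me3:
        return "enhancer-like"
    return "gene-like"


def enhancer_fraction(peaks: List[Peak], ratio_cutoff: float = 1.0) -> float:
    """Fraction of classified peaks that carry the enhancer-like signature."""
    labels = [classify(p, ratio_cutoff) for p in peaks]
    classified = [label for label in labels if label != "unclassified"]
    if not classified:
        return 0.0
    return classified.count("enhancer-like") / len(classified)


# Toy data only; these values are invented and carry no biological meaning.
toy_peaks = [
    Peak("peak_a", h3k4me1=5.0, h3k4me3=1.0),
    Peak("peak_b", h3k4me1=1.0, h3k4me3=8.0),
    Peak("peak_c", h3k4me1=0.0, h3k4me3=0.0),
    Peak("peak_d", h3k4me1=3.0, h3k4me3=1.0),
]

for p in toy_peaks:
    print(p.name, classify(p))
print("enhancer-like fraction:", enhancer_fraction(toy_peaks))
```

A ratio-based rule keeps the sketch short; a real analysis would more plausibly work from replicate-aware peak calls and empirically chosen thresholds.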

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Binding Sites
  • Female
  • Gene Expression Regulation
  • Humans
  • Lipopolysaccharides / immunology
  • Macrophage Activation / immunology
  • Mice
  • Promoter Regions, Genetic / genetics*
  • RNA Polymerase II / genetics*
  • RNA Polymerase II / metabolism
  • RNA, Untranslated / genetics*
  • Regulatory Sequences, Nucleic Acid*
  • Transcription, Genetic*

Substances

  • Lipopolysaccharides
  • RNA, Untranslated
  • RNA Polymerase II