Identification of active transcriptional regulatory elements from GRO-seq data

Nat Methods. 2015 May;12(5):433-8. doi: 10.1038/nmeth.3329. Epub 2015 Mar 23.

Abstract

Modifications to the global run-on and sequencing (GRO-seq) protocol that enrich for 5'-capped RNAs can be used to reveal active transcriptional regulatory elements (TREs) with high accuracy. Here, we introduce discriminative regulatory-element detection from GRO-seq (dREG), a sensitive machine learning method that uses support vector regression to identify active TREs from GRO-seq data without requiring cap-based enrichment (https://github.com/Danko-Lab/dREG/). This approach allows TREs to be assayed together with gene expression levels and other transcriptional features in a single experiment. Predicted TREs are more enriched for several marks of transcriptional activation—including expression quantitative trait loci, disease-associated polymorphisms, acetylated histone 3 lysine 27 (H3K27ac) and transcription factor binding—than those identified by alternative functional assays. Using dREG, we surveyed TREs in eight human cell types and provide new insights into global patterns of TRE function.
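The regression step mentioned in the abstract can be pictured with a small sketch. It is not dREG's implementation: the feature design, training labels and kernel used by dREG are not described in this record, so everything below is an assumption made for illustration. The two features (scaled plus-strand and minus-strand read signal near a candidate position), the toy training values and the function names `train_linear_svr` and `predict` are all hypothetical. The model is a linear regressor with the epsilon-insensitive loss that underlies support vector regression, fitted by plain subgradient descent rather than by a dedicated SVM solver.

```python
# Illustrative sketch only: NOT dREG's actual features, kernel, or training data.

def predict(w, b, x):
    """Return the linear score w.x + b for one feature vector."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def train_linear_svr(X, y, epsilon=0.1, lam=0.01, lr=0.05, epochs=500):
    """Fit a linear model with epsilon-insensitive loss and L2 regularization.

    Errors smaller than `epsilon` cost nothing; larger errors are penalized
    linearly. Training uses per-sample subgradient steps with a fixed rate.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            err = predict(w, b, xi) - yi
            # Subgradient of the epsilon-insensitive loss w.r.t. the prediction.
            if err > epsilon:
                g = 1.0
            elif err < -epsilon:
                g = -1.0
            else:
                g = 0.0
            w = [wj - lr * (lam * wj + g * xj) for wj, xj in zip(w, xi)]
            b -= lr * g
    return w, b


# Hypothetical features: (plus-strand signal, minus-strand signal), scaled to [0, 1].
X_train = [
    (1.0, 1.0), (0.9, 0.9), (0.8, 1.0), (1.0, 0.8),  # made-up "active" sites
    (0.0, 0.1), (0.1, 0.0), (0.1, 0.1), (0.0, 0.0),  # made-up background
]
y_train = [1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0]

w, b = train_linear_svr(X_train, y_train)

# Score two unseen, made-up positions.
score_active = predict(w, b, (0.95, 0.90))
score_background = predict(w, b, (0.05, 0.05))
print("active-like score:", score_active)
print("background-like score:", score_background)
```

In this toy setting a position with strong signal on both strands scores higher than a background position. A real analysis would use the dREG package linked in the abstract, not this sketch.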

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence*
  • Cell Line
  • Gene Expression Regulation / physiology*
  • Genome-Wide Association Study
  • Histones
  • Humans
  • K562 Cells
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci
  • Regulatory Elements, Transcriptional / genetics
  • Regulatory Elements, Transcriptional / physiology*
  • Software

Substances

  • Histones

Associated data

  • GEO/GSE66031