Constrained transcription factor spacing is prevalent and important for transcriptional control of mouse blood cells

Nucleic Acids Res. 2014 Dec 16;42(22):13513-24. doi: 10.1093/nar/gku1254. Epub 2014 Nov 26.

Abstract

Combinatorial transcription factor (TF) binding is essential for cell-type-specific gene regulation. However, much remains to be learned about the mechanisms of TF interactions, including to what extent constrained spacing and orientation of interacting TFs are critical for regulatory element activity. To examine the relative prevalence of the 'enhanceosome' versus the 'TF collective' model of combinatorial TF binding, a comprehensive analysis of TF binding site sequences in large scale datasets is necessary. We developed a motif-pair discovery pipeline to identify motif co-occurrences with preferential distance(s) between motifs in TF-bound regions. Utilizing a compendium of 289 mouse haematopoietic TF ChIP-seq datasets, we demonstrate that haematopoietic-related motif-pairs commonly occur with highly conserved constrained spacing and orientation between motifs. Furthermore, motif clustering revealed specific associations for both heterotypic and homotypic motif-pairs with particular haematopoietic cell types. We also showed that disrupting the spacing between motif-pairs significantly affects transcriptional activity in a well-known motif-pair-E-box and GATA, and in two previously unknown motif-pairs with constrained spacing-Ets and Homeobox as well as Ets and E-box. In this study, we provide evidence for widespread sequence-specific TF pair interaction with DNA that conforms to the 'enhanceosome' model, and furthermore identify associations between specific haematopoietic cell-types and motif-pairs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Binding Sites
  • Blood Cells / metabolism
  • Chromatin Immunoprecipitation
  • DNA / chemistry
  • DNA / metabolism
  • Hematopoiesis / genetics*
  • Mice
  • Nucleotide Motifs
  • Regulatory Elements, Transcriptional*
  • Sequence Analysis, DNA
  • Transcription Factors / metabolism*
  • Transcription, Genetic*

Substances

  • Transcription Factors
  • DNA