Effect of natural genetic variation on enhancer selection and function

Nature. 2013 Nov 28;503(7477):487-92. doi: 10.1038/nature12615. Epub 2013 Oct 13.

Abstract

The mechanisms by which genetic variation affects transcription regulation and phenotypes at the nucleotide level are incompletely understood. Here we use natural genetic variation as an in vivo mutagenesis screen to assess the genome-wide effects of sequence variation on lineage-determining and signal-specific transcription factor binding, epigenomics and transcriptional outcomes in primary macrophages from different mouse strains. We find substantial genetic evidence to support the concept that lineage-determining transcription factors define epigenetic and transcriptomic states by selecting enhancer-like regions in the genome in a collaborative fashion and facilitating binding of signal-dependent factors. This hierarchical model of transcription factor function suggests that limited sets of genomic data for lineage-determining transcription factors and informative histone modifications can be used for the prioritization of disease-associated regulatory variants.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs / genetics
  • Animals
  • Base Sequence
  • Cell Lineage / genetics
  • DNA-Binding Proteins / metabolism
  • Enhancer Elements, Genetic / genetics*
  • Gene Expression Regulation / genetics*
  • Genetic Variation / genetics*
  • Histones / chemistry
  • Histones / metabolism
  • Macrophages / metabolism
  • Male
  • Mice
  • Mice, Inbred BALB C
  • Mice, Inbred C57BL
  • Models, Biological
  • Mutation / genetics
  • NF-kappa B / metabolism
  • Protein Binding
  • Reproducibility of Results
  • Selection, Genetic / genetics*
  • Transcription Factor RelA / metabolism
  • Transcription Factors / metabolism*

Substances

  • DNA-Binding Proteins
  • Histones
  • NF-kappa B
  • Rela protein, mouse
  • Transcription Factor RelA
  • Transcription Factors

Associated data

  • GEO/GSE46494