Abstract
Expression of Sox2 in mouse embryonic stem cells (mESCs) depends on a distal regulatory cluster of DNase I hypersensitive sites (DHSs), but their individual contributions and degree of independence remain a mystery. Here, we comprehensively analyze the regulatory architecture of Sox2 at its endogenous locus using Big-IN to scarlessly integrate DNA payloads ranging up to 143 kb. We analyzed 83 payloads incorporating deletions, rearrangements, and inversions affecting single or multiple DHSs, as well as surgical alterations to transcription factor (TF) recognition sequences. Multiple mESC clones were derived for each payload, sequence-verified, and analyzed to establish the necessity and sufficiency of genomic features for Sox2 expression. We found that two LCR DHSs comprising a handful of key TF recognition sequences were each sufficient to autonomously sustain significant expression in mESCs. However, three additional LCR DHSs were entirely context-dependent, in that they showed no activity alone but could dramatically augment activity of the core DHSs. Our synthetic regulatory genomics approach demonstrates that composite regulatory elements can be reduced to a tractable set of essential sequence features, and is readily scalable to investigate regulatory architecture at other key loci genome-wide.
Competing Interest Statement
R.B., J.D.B., and M.T.M. are listed as inventors on a patent application describing Big-IN. J.D.B. is a Founder and Director of CDI Labs, Inc., a founder of and consultant to Neochromosome, Inc, a founder, SAB member of and consultant to ReOpen Diagnostics, LLC and serves or served on the Scientific Advisory Board of the following: Sangamo, Inc., Modern Meadow, Inc., Rome Therapeutics, Inc., Sample6, Inc., Tessera Therapeutics, Inc. and the Wyss Institute.
Footnotes
Incorporation of new data and analysis