Deconvolving the recognition of DNA shape from sequence

Cell. 2015 Apr 9;161(2):307-18. doi: 10.1016/j.cell.2015.02.008. Epub 2015 Apr 2.

Abstract

Protein-DNA binding is mediated by the recognition of the chemical signatures of the DNA bases and the 3D shape of the DNA molecule. Because DNA shape is a consequence of sequence, it is difficult to dissociate these modes of recognition. Here, we tease them apart in the context of Hox-DNA binding by mutating residues that, in a co-crystal structure, only recognize DNA shape. Complexes made with these mutants lose the preference to bind sequences with specific DNA shape features. Introducing shape-recognizing residues from one Hox protein to another swapped binding specificities in vitro and gene regulation in vivo. Statistical machine learning revealed that the accuracy of binding specificity predictions improves by adding shape features to a model that only depends on sequence, and feature selection identified shape features important for recognition. Thus, shape readout is a direct and independent component of binding site selection by Hox proteins.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Crystallography, X-Ray
  • DNA / chemistry*
  • DNA / metabolism*
  • Drosophila Proteins / chemistry*
  • Drosophila Proteins / metabolism*
  • Drosophila melanogaster / metabolism*
  • Homeodomain Proteins / chemistry
  • Homeodomain Proteins / metabolism
  • Molecular Sequence Data
  • Nucleic Acid Conformation
  • Protein Binding
  • Sequence Alignment
  • Transcription Factors / chemistry*
  • Transcription Factors / metabolism*

Substances

  • Drosophila Proteins
  • Homeodomain Proteins
  • Scr protein, Drosophila
  • Transcription Factors
  • exd protein, Drosophila
  • DNA