RT Journal Article SR Electronic T1 A domain-resolution map of in vivo DNA binding reveals the regulatory consequences of somatic mutations in zinc finger transcription factors JF bioRxiv FD Cold Spring Harbor Laboratory SP 630756 DO 10.1101/630756 A1 Dogan, Berat A1 Kailasam, Senthilkumar A1 Corchado, Aldo Hernández A1 Nikpoor, Naghmeh A1 Najafabadi, Hamed S. YR 2020 UL http://biorxiv.org/content/early/2020/06/16/630756.abstract AB Multi-zinc finger proteins constitute the largest class of human transcription factors. Their DNA-binding specificity is usually encoded by a subset of their tandem Cys2His2 zinc finger (ZF) domains – the subset that binds to DNA, however, is often unknown. Here, by combining a context-aware machine-learning-based model of DNA recognition with in vivo binding data, we characterize the sequence preferences and the ZF subset that is responsible for DNA binding in 209 human multi-ZF proteins. We show that in vivo DNA binding is primarily driven by ∼50% of the ZFs – these DNA-binding ZFs are under strong selective pressure within and across species, and their mutations affect the expression of hundreds of genes as revealed by pan-cancer trans-eQTL analysis across 18 tissues. Among the genes affected by mutations in multi-ZF proteins, we identify several oncogenic factors regulated by SP1, and show that SP1 up-regulation in cancer promotes the expression of these genes while mutations in SP1 ZFs lead to their repression. Together, these analyses suggest that mutations in DNA-binding ZFs have distinct and widespread regulatory consequences that contribute to transcriptome remodelling in cancer.Competing Interest StatementThe authors have declared no competing interest.