Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Whole Genome Regulatory Variant Evaluation for Transcription Factor Binding

Haoyang Zeng, Tatsunori Hashimoto, Daniel D. Kang, David K. Gifford
doi: https://doi.org/10.1101/017392
Haoyang Zeng
1Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tatsunori Hashimoto
1Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Daniel D. Kang
1Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David K. Gifford
1Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, United States of America
2Department of Stem Cell and Regenerative Biology, Harvard University and Harvard Medical School, Cambridge, MA, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: gifford@mit.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Contemporary approaches to predict single nucleotide polymorphisms (SNPs) that alter transcription factor binding rely upon the sequence affinity of a transcription factor as represented by its canonical motif. WAVE (Whole-genome regulAtory Variants Evaluation) is a novel method for predicting more general regulatory variants that affect transcription factor binding, including those that fall outside of the canonical motif. WAVE learns a k-mer based generative model of transcription factor binding from ChIP-seq data and scores variants using its generative binding model. The k-mers learned by WAVE capture more sequence feature in transcription factor binding than a motif-based approach alone, including both a transcription factor’s canonical motif as well as associated co-factor motifs. WAVE significantly outperforms motif-based methods in predicting SNPs associated with allele-specific binding.

Author Summary Specific variations in our genome sequence can render us more susceptible to a genetic disease. Certain disease risks are caused by genetic variations that alter where transcription factors bind to the genome and regulate cellular function. Previous methods for identifying which genetic changes are significant have assumed that transcription factor binding is dependent on a short single sequence recognized by a transcription factor. Here we consider a more general model where the binding of a factor may be up or down regulated by any number of short DNA sequences that are proximal to a binding site. Our method substantially improves the detection of genomic changes that are important for factor binding.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted April 01, 2015.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Whole Genome Regulatory Variant Evaluation for Transcription Factor Binding
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Whole Genome Regulatory Variant Evaluation for Transcription Factor Binding
Haoyang Zeng, Tatsunori Hashimoto, Daniel D. Kang, David K. Gifford
bioRxiv 017392; doi: https://doi.org/10.1101/017392
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
Whole Genome Regulatory Variant Evaluation for Transcription Factor Binding
Haoyang Zeng, Tatsunori Hashimoto, Daniel D. Kang, David K. Gifford
bioRxiv 017392; doi: https://doi.org/10.1101/017392

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One
Subject Areas
All Articles
  • Animal Behavior and Cognition (2517)
  • Biochemistry (4964)
  • Bioengineering (3469)
  • Bioinformatics (15181)
  • Biophysics (6885)
  • Cancer Biology (5380)
  • Cell Biology (7711)
  • Clinical Trials (138)
  • Developmental Biology (4518)
  • Ecology (7135)
  • Epidemiology (2059)
  • Evolutionary Biology (10210)
  • Genetics (7497)
  • Genomics (9767)
  • Immunology (4822)
  • Microbiology (13179)
  • Molecular Biology (5129)
  • Neuroscience (29367)
  • Paleontology (203)
  • Pathology (835)
  • Pharmacology and Toxicology (1460)
  • Physiology (2129)
  • Plant Biology (4734)
  • Scientific Communication and Education (1008)
  • Synthetic Biology (1337)
  • Systems Biology (4002)
  • Zoology (768)