motifbreakR: an R/Bioconductor package for predicting variant effects at transcription factor binding sites

Bioinformatics. 2015 Dec 1;31(23):3847-9. doi: 10.1093/bioinformatics/btv470. Epub 2015 Aug 12.

Abstract

Functional annotation represents a key step toward the understanding and interpretation of germline and somatic variation as revealed by genome-wide association studies (GWAS) and The Cancer Genome Atlas (TCGA), respectively. GWAS have revealed numerous genetic risk variants residing in non-coding DNA associated with complex diseases. For sequences that lie within enhancers or promoters of transcription, it is not straightforward to assess the effects of variants on likely transcription factor binding sites. Consequently, we introduce motifbreakR, which allows the biologist to judge whether the sequence surrounding a polymorphism or mutation is a good match to a binding motif, and how much information is gained or lost in one allele of the polymorphism or mutation relative to the other. MotifbreakR is flexible, offering a choice of algorithms for interrogating genomes with motifs drawn from many public sources. MotifbreakR can predict effects for novel or previously described variants in public databases, making it suitable for tasks beyond the scope of its original design. Lastly, it can be used to interrogate any genome curated within Bioconductor.
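The idea of comparing how well each allele matches a motif can be illustrated with a minimal sketch. The code below is not motifbreakR's implementation or API (the package itself is written in R); it is a generic log-odds scoring scheme written in Python under stated assumptions. The toy frequency matrix `PFM`, the uniform `BACKGROUND`, and the helper names `log_odds`, `best_score`, and `allele_effect` are all hypothetical and chosen for illustration only.

```python
import math

# Hypothetical 4-position frequency matrix (one dict per motif position).
# Its consensus is "AGCT"; the values are made up for illustration.
PFM = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.1, "T": 0.7},
]
BACKGROUND = 0.25  # assumed uniform base composition


def log_odds(window):
    """Sum of per-position log2(motif frequency / background frequency)."""
    return sum(math.log2(col[base] / BACKGROUND) for col, base in zip(PFM, window))


def best_score(seq, var_pos):
    """Best log-odds score over every motif-width window covering var_pos."""
    width = len(PFM)
    first = max(0, var_pos - width + 1)
    last = min(var_pos, len(seq) - width)
    return max(log_odds(seq[s:s + width]) for s in range(first, last + 1))


def allele_effect(seq, var_pos, alt):
    """Return (reference score, alternate score, alternate minus reference)."""
    alt_seq = seq[:var_pos] + alt + seq[var_pos + 1:]
    ref_score = best_score(seq, var_pos)
    alt_score = best_score(alt_seq, var_pos)
    return ref_score, alt_score, alt_score - ref_score


# Reference sequence contains the consensus "AGCT"; the variant changes G to T.
ref_score, alt_score, delta = allele_effect("TTAGCTTT", var_pos=3, alt="T")
print(f"ref={ref_score:.3f} alt={alt_score:.3f} delta={delta:.3f}")
```

A negative difference means the alternate allele matches the toy motif less well than the reference allele, and a positive difference means it matches better. Only forward-strand windows are scanned here, which is a simplification.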

Availability and implementation: https://github.com/Simon-Coetzee/MotifBreakR, www.bioconductor.org.

Contact: dennis.hazelett@cshs.org.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Animals
  • Binding Sites
  • Genomics
  • Humans
  • Mice
  • Mutation*
  • Polymorphism, Single Nucleotide*
  • Regulatory Elements, Transcriptional*
  • Regulatory Sequences, Nucleic Acid*
  • Sequence Analysis, DNA
  • Software*
  • Transcription Factors / metabolism*

Substances

  • Transcription Factors