Epistatic module detection for case-control studies: a Bayesian model with a Gibbs sampling strategy

PLoS Genet. 2009 May;5(5):e1000464. doi: 10.1371/journal.pgen.1000464. Epub 2009 May 1.

Abstract

Detecting epistatic interactions among multiple genetic variants that affect susceptibility to complex human diseases is a major challenge in genome-wide association studies (GWAS). Although methods have been proposed to identify such interactions, the lack of an explicit definition of epistatic effects, together with computational difficulties, makes new methods necessary. In this paper, we introduce epistatic modules to describe the epistatic interactive effects of multiple loci on diseases. Building on this notion, we put forward a Bayesian marker partition model to explain observed case-control data, and we develop a Gibbs sampling strategy to facilitate the detection of epistatic modules. Comparisons of the proposed approach with three existing methods on seven simulated disease models demonstrate the superior performance of our approach. When applied to a genome-wide case-control data set for age-related macular degeneration (AMD), the proposed approach successfully identifies two known susceptibility loci and suggests that a combination of two other loci, one in the gene SGCD and the other in SCAPER, is associated with the disease. Further functional analysis supports the speculation that the interaction of these two genetic variants may be responsible for susceptibility to AMD. When applied to a genome-wide case-control data set for Parkinson's disease, the proposed method identifies seven candidate loci that may contribute independently to the disease.
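The abstract does not spell out the model, so the following is only a toy sketch of the general idea of a marker partition explored by Gibbs sampling, not the authors' model or code. It assumes a deliberately simplified two-way partition (background markers versus a single candidate module), symmetric Dirichlet priors on genotype frequencies, and a small simulated data set with a hypothetical two-locus risk rule. All function names, the prior inclusion probability `p_module`, and the simulation parameters are illustrative assumptions.

```python
import math
import random
from collections import Counter

def log_dirmult(counts, n_cells, alpha=0.5):
    """Log marginal likelihood of category counts under a symmetric
    Dirichlet(alpha) prior over n_cells cells (empty cells contribute 0)."""
    n = sum(counts)
    out = math.lgamma(n_cells * alpha) - math.lgamma(n_cells * alpha + n)
    return out + sum(math.lgamma(alpha + c) - math.lgamma(alpha) for c in counts)

def cell_counts(rows, markers):
    """Counts of the observed joint genotypes at the given markers."""
    return list(Counter(tuple(r[m] for m in markers) for r in rows).values())

def log_posterior(labels, cases, controls, p_module=0.1):
    """Unnormalized log posterior of a partition (assumed toy model).
    Label 0: cases and controls share one genotype distribution.
    Label 1: the marker belongs to the module; module markers follow one
    joint distribution in cases and independent distributions in controls."""
    module = [m for m, z in enumerate(labels) if z == 1]
    k = len(module)
    lp = k * math.log(p_module) + (len(labels) - k) * math.log(1 - p_module)
    for m, z in enumerate(labels):
        rows = cases + controls if z == 0 else controls
        lp += log_dirmult(cell_counts(rows, [m]), 3)
    if module:
        lp += log_dirmult(cell_counts(cases, module), 3 ** k)
    return lp

def gibbs(cases, controls, n_sweeps=60, burn_in=10, seed=0):
    """Resample each marker's label from its full conditional and return
    the post-burn-in frequency with which each marker is in the module."""
    rng = random.Random(seed)
    n_markers = len(cases[0])
    labels = [0] * n_markers
    hits = [0] * n_markers
    for sweep in range(n_sweeps):
        for m in range(n_markers):
            logp = []
            for z in (0, 1):
                labels[m] = z
                logp.append(log_posterior(labels, cases, controls))
            d = logp[0] - logp[1]
            # P(label = 1 | all other labels), guarded against overflow
            p1 = 0.0 if d > 700 else 1.0 / (1.0 + math.exp(d))
            labels[m] = 1 if rng.random() < p1 else 0
        if sweep >= burn_in:
            for m, z in enumerate(labels):
                hits[m] += z
    return [h / (n_sweeps - burn_in) for h in hits]

def simulate(n, n_markers=6, seed=1):
    """Simulated genotypes (0/1/2 copies of an allele with frequency 0.3).
    Hypothetical risk rule: disease risk is high only when markers 0 and 1
    both carry at least one copy."""
    rng = random.Random(seed)
    cases, controls = [], []
    while len(cases) < n or len(controls) < n:
        g = [sum(rng.random() < 0.3 for _ in range(2)) for _ in range(n_markers)]
        risk = 0.8 if (g[0] > 0 and g[1] > 0) else 0.1
        if rng.random() < risk:
            if len(cases) < n:
                cases.append(g)
        elif len(controls) < n:
            controls.append(g)
    return cases, controls

cases, controls = simulate(150)
freq = gibbs(cases, controls)
print([round(f, 2) for f in freq])  # per-marker module membership frequency
```

In this sketch a marker's membership frequency plays the role of a posterior inclusion probability. Single-marker updates of this kind can mix poorly when an interaction has no marginal effect, which is one reason a real analysis would need a more careful sampler and model than this toy version.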

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Carrier Proteins / genetics
  • Case-Control Studies
  • Epidemiologic Methods*
  • Epistasis, Genetic*
  • Genome-Wide Association Study
  • Humans
  • Macular Degeneration / epidemiology
  • Macular Degeneration / genetics*
  • Models, Genetic
  • Models, Statistical
  • Parkinson Disease / epidemiology
  • Parkinson Disease / genetics*
  • Polymorphism, Single Nucleotide
  • Sampling Studies
  • Sarcoglycans / genetics

Substances

  • Carrier Proteins
  • SCAPER protein, human
  • SGCD protein, human
  • Sarcoglycans