A fast and efficient colocalization algorithm for identifying shared genetic risk factors across multiple traits

Nat Commun. 2021 Feb 3;12(1):764. doi: 10.1038/s41467-020-20885-8.

Abstract

Genome-wide association studies (GWAS) have identified thousands of genomic regions affecting complex diseases. The next challenge is to elucidate the causal genes and mechanisms involved. One approach is to use statistical colocalization to assess shared genetic aetiology across multiple related traits (e.g. molecular traits, metabolic pathways and complex diseases) to identify causal pathways, prioritize causal variants and evaluate pleiotropy. We propose HyPrColoc (Hypothesis Prioritisation for multi-trait Colocalization), an efficient deterministic Bayesian algorithm using GWAS summary statistics that can detect colocalization across vast numbers of traits simultaneously (e.g. 100 traits can be jointly analysed in around 1 s). We perform a genome-wide multi-trait colocalization analysis of coronary heart disease (CHD) and fourteen related traits, identifying 43 regions in which CHD colocalized with ≥1 trait, including 5 previously unknown CHD loci. Across the 43 loci, we further integrate gene and protein expression quantitative trait loci to identify candidate causal genes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Computational Biology / methods*
  • Coronary Disease / diagnosis
  • Coronary Disease / genetics*
  • Genetic Predisposition to Disease / genetics*
  • Genome-Wide Association Study / methods*
  • Genomics / methods
  • Humans
  • Linkage Disequilibrium
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci / genetics*
  • Reproducibility of Results
  • Risk Factors