Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Hierarchical clustering of gene-level association statistics reveals shared and differential genetic architecture among traits in the UK Biobank

Melissa R. McGuirl, Samuel Pattillo Smith, Björn Sandstede, Sohini Ramachandran
doi: https://doi.org/10.1101/565903
Melissa R. McGuirl
*Division of Applied Mathematics, Brown University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Samuel Pattillo Smith
†Center for Computational Molecular Biology, Brown University
‡Department of Ecology and Evolutionary Biology, Brown University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Björn Sandstede
*Division of Applied Mathematics, Brown University
§Data Science Initiative, Brown University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: bjorn_sandstede@brown.edu sramachandran@brown.edu
Sohini Ramachandran
†Center for Computational Molecular Biology, Brown University
‡Department of Ecology and Evolutionary Biology, Brown University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: bjorn_sandstede@brown.edu sramachandran@brown.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Genome-wide association (GWA) studies have generally focused on a single phenotype of interest. Emerging biobanks that pair genotype data from thousands of individuals with phenotype data using medical records or surveys enable testing for genetic associations in each phenotype assayed. However, methods for characterizing shared genetic architecture among multiple traits are lagging behind. Here, we present a new method, Ward clustering to identify Internal Node branch length outliers using Gene Scores (WINGS), for characterizing shared and divergent genetic architecture among multiple phenotypes. The objective of WINGS (freely available at https://github.com/ramachandran-lab/PEGASUS-WINGS) is to identify groups of phenotypes, or “clusters”, that share a core set of genes enriched for mutations in cases. We show in simulations that WINGS can reliably detect phenotype clusters across a range of percent shared architecture and number of phenotypes included. We then use the gene-level association test PEGASUS with WINGS to characterize shared genetic architecture among 87 case-control and seven quantitative phenotypes in 349,468 unrelated European-ancestry individuals from the UK Biobank. We identify 10 significant phenotype clusters that contain two to eight phenotypes. One significant cluster of seven immunological phenotypes is driven by seven genes; these genes have each been associated with two or more of those same phenotypes in past publications. WINGS offers a precise and efficient new application of Ward hierarchical clustering to generate hypotheses regarding shared genetic architecture among phenotypes in the biobank era.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted March 04, 2019.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Hierarchical clustering of gene-level association statistics reveals shared and differential genetic architecture among traits in the UK Biobank
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Hierarchical clustering of gene-level association statistics reveals shared and differential genetic architecture among traits in the UK Biobank
Melissa R. McGuirl, Samuel Pattillo Smith, Björn Sandstede, Sohini Ramachandran
bioRxiv 565903; doi: https://doi.org/10.1101/565903
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Hierarchical clustering of gene-level association statistics reveals shared and differential genetic architecture among traits in the UK Biobank
Melissa R. McGuirl, Samuel Pattillo Smith, Björn Sandstede, Sohini Ramachandran
bioRxiv 565903; doi: https://doi.org/10.1101/565903

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (3501)
  • Biochemistry (7343)
  • Bioengineering (5319)
  • Bioinformatics (20257)
  • Biophysics (10008)
  • Cancer Biology (7735)
  • Cell Biology (11292)
  • Clinical Trials (138)
  • Developmental Biology (6434)
  • Ecology (9947)
  • Epidemiology (2065)
  • Evolutionary Biology (13315)
  • Genetics (9359)
  • Genomics (12578)
  • Immunology (7696)
  • Microbiology (19005)
  • Molecular Biology (7436)
  • Neuroscience (41010)
  • Paleontology (300)
  • Pathology (1228)
  • Pharmacology and Toxicology (2134)
  • Physiology (3155)
  • Plant Biology (6857)
  • Scientific Communication and Education (1272)
  • Synthetic Biology (1895)
  • Systems Biology (5311)
  • Zoology (1087)