Regional heritability advanced complex trait analysis for GPU and traditional parallel architectures

Bioinformatics. 2014 Apr 15;30(8):1177-1179. doi: 10.1093/bioinformatics/btt754. Epub 2014 Jan 7.

Abstract

Motivation: Quantification of the contribution of genetic variation to phenotypic variation for complex traits becomes increasingly computationally demanding with increasing numbers of single-nucleotide polymorphisms and individuals. To meet the challenges in making feasible large-scale studies, we present the REgional heritability advanced complex trait analysis software. Adapted from advanced complex trait analysis (and, in turn, genome-wide complex trait analysis), it is tailored to exploit the parallelism present in modern traditional and graphics processing unit (GPU)-accelerated machines, from workstations to supercomputers.

Results: We adapt the genetic relationship matrix estimation algorithm to remove limitations on memory, allowing the analysis of large datasets. We build on this to develop a version of the code able to efficiently exploit GPU-accelerated systems for both the genetic relationship matrix and REstricted maximum likelihood (REML) parts of the analysis, offering substantial speedup over the traditional central processing unit version. We develop the ability to analyze multiple small regions of the genome across multiple compute nodes in parallel, following the 'regional heritability' approach. We demonstrate the new software using 1024 GPUs in parallel on one of the world's fastest supercomputers.

Availability: The code is freely available at http://www.epcc.ed.ac.uk/software-products CONTACT: a.gray@ed.ac.uk.

MeSH terms

  • Algorithms
  • Computational Biology
  • Likelihood Functions
  • Phenotype*
  • Polymorphism, Single Nucleotide*
  • Software*