Gowinda: unbiased analysis of gene set enrichment for genome-wide association studies

Bioinformatics. 2012 Aug 1;28(15):2084-5. doi: 10.1093/bioinformatics/bts315. Epub 2012 May 26.

Abstract

Summary: An analysis of gene set [e.g. Gene Ontology (GO)] enrichment assumes that all genes are sampled independently from each other with the same probability. These assumptions are violated in genome-wide association (GWA) studies since (i) longer genes typically have more single-nucleotide polymorphisms resulting in a higher probability of being sampled and (ii) overlapping genes are sampled in clusters. Herein, we introduce Gowinda, a software specifically designed to test for enrichment of gene sets in GWA studies. We show that GO tests on GWA data could result in a substantial number of false-positive GO terms. Permutation tests implemented in Gowinda eliminate these biases, but maintain sufficient power to detect enrichment of GO terms. Since sufficient resolution for large datasets requires millions of permutations, we use multi-threading to keep computation times reasonable.

Availability and implementation: Gowinda is implemented in Java (v1.6) and freely available on http://code.google.com/p/gowinda/

Contact: christian.schloetterer@vetmeduni.ac.at

Supplementary information: Manual: http://code.google.com/p/gowinda/wiki/Manual. Test data and tutorial: http://code.google.com/p/gowinda/wiki/Tutorial.

Validation: http://code.google.com/p/gowinda/wiki/VALIDATION.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Drosophila melanogaster / genetics
  • Genome-Wide Association Study / methods*
  • Linkage Disequilibrium
  • Polymorphism, Single Nucleotide*
  • Probability
  • Software*