Prioritizing disease-linked variants, genes, and pathways with an interactive whole-genome analysis pipeline

Hum Mutat. 2014 May;35(5):537-47. doi: 10.1002/humu.22520. Epub 2014 Mar 6.

Abstract

Whole-genome sequencing (WGS) studies are uncovering disease-associated variants in both rare and nonrare diseases. Utilizing the next-generation sequencing for WGS requires a series of computational methods for alignment, variant detection, and annotation, and the accuracy and reproducibility of annotation results are essential for clinical implementation. However, annotating WGS with up to date genomic information is still challenging for biomedical researchers. Here, we present one of the fastest and highly scalable annotation, filtering, and analysis pipeline-gNOME-to prioritize phenotype-associated variants while minimizing false-positive findings. Intuitive graphical user interface of gNOME facilitates the selection of phenotype-associated variants, and the result summaries are provided at variant, gene, and genome levels. Moreover, the enrichment results of specific variants, genes, and gene sets between two groups or compared with population scale WGS datasets that is already integrated in the pipeline can help the interpretation. We found a small number of discordant results between annotation software tools in part due to different reporting strategies for the variants with complex impacts. Using two published whole-exome datasets of uveal melanoma and bladder cancer, we demonstrated gNOME's accuracy of variant annotation and the enrichment of loss-of-function variants in known cancer pathways. gNOME Web server and source codes are freely available to the academic community (http://gnome.tchlab.org).

Keywords: analysis pipeline; disease gene discovery; variant annotation; whole-genome sequences.

MeSH terms

  • Exome
  • Genome, Human*
  • Genomics
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • Internet
  • Molecular Sequence Annotation
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Software*