Abstract
We introduce new statistical methods for analyzing genomic datasets that measure many effects in many conditions (e.g., gene expression changes under many treatments). These new methods improve on existing methods by allowing for arbitrary correlations in effects among conditions. This flexible approach increases power, improves effect-size estimates, and allows for more quantitative assessments of effect-size heterogeneity than simple “shared/condition-specific” assessments. We illustrate these features through a detailed analysis of locally-acting variants associated with gene expression (“cis eQTLs”) in 44 human tissues. Our analysis identifies more eQTLs than existing approaches, consistent with improved power. We show that genetic effects on gene expression are extensively shared among tissues, but that effect sizes can nonetheless vary widely among tissues. Some shared eQTLs show stronger effects in subsets of biologically related tissues (e.g., brain-related tissues), or in only a single tissue (e.g., testis). Our methods are widely applicable, computationally tractable for many conditions, and available online.
Footnotes
↵* e-mail: mstephens{at}uchicago.edu