echolocatoR: an automated end-to-end statistical and functional genomic fine-mapping pipeline

Brian M. Schilder; Jack Humphrey; Towfique Raj

doi:10.1101/2020.10.22.351221

Abstract

Summary: echolocatoR integrates a diverse suite of statistical and functional fine-mapping tools to identify, test enrichment in, and visualize high-confidence causal consensus variants in any phenotype. It requires minimal input from users (a summary statistics file), can be run in a single R function, and provides extensive access to relevant datasets (e.g. reference linkage disequilibrium (LD) panels, quantitative trait loci (QTL) datasets, genome-wide annotations, and cell type-specific epigenomics), thereby enabling rapid, robust and scalable end-to-end fine-mapping investigations.

Availability and implementation: echolocatoR is an open-source R package available through GitHub under the MIT license: https://github.com/RajLabMSSM/echolocatoR

1. Introduction

Genome-wide association studies (GWAS) and quantitative trait loci (QTL) studies across a variety of phenotypes have identified many significant genetic associations. However, widespread non-independence between genomic variants due to linkage disequilibrium (LD) makes it difficult to distinguish causal variants from correlated non-causal variants (Pasaniuc and Price, 2017; Pritchard and Przeworski, 2001; Yang et al., 2011). Fine-mapping aims to identify the causal variant(s) and thus the mechanisms underlying a phenotype (Spain and Barrett, 2015; Trynka et al., 2015). This methodology has been especially important to the study of medical conditions such as diabetes (Gaulton et al., 2015; Mahajan et al., 2018), rheumatoid arthritis (Kichaev and Pasaniuc, 2015; Westra et al., 2018), and obesity (Zhang et al., 2018).

Many fine-mapping tools have been developed over the years (Spain and Barrett, 2015; Trynka et al., 2015), each of which can nominate partially overlapping sets of putative causal variants. It can therefore be useful to compare results from multiple fine-mapping methods with complementary strengths and weaknesses, such as the ability to model multiple causal variants or incorporate functional annotations. However, these powerful methods are underutilized in no small part due to technical reasons (e.g. not available in the same programming language, idiosyncratic file inputs/outputs, gathering and formatting of datasets). We therefore developed echolocatoR, an open-source R package that conducts end-to-end statistical and functional fine-mapping, annotation, enrichment and plotting that only requires GWAS/QTL summary statistics as input (Fig. 1a).

Figure 1. echolocatoR facilitates automated end-to-end fine-mapping.

(a) Workflow of the echolocatoR pipeline: 1) user specifies the path to their full GWAS/QTL summary statistics, 2) locus subsets are queried and saved in a standardized format, 3) LD is extracted, computed from VCF, or supplied by the user, 4) statistical, functional, and/or trans-ethnic/joint fine-mapping are performed, 5) locus-specific fine-mapping results and selected annotations are visualized in tracks, 6) enrichment tests can be performed on different SNP groups using the various available annotations (see Section 2.2), 7) GWAS/QTL data, fine-mapping results and annotations are merged into a file with one row per SNP, 8) narrowed SNP lists can be targeted in validation experiments. (b) Example multi-track plot for the PD locus LRRK2: 1) Manhattan plot of GWAS p-values (gold labels and vertical lines indicate Consensus SNPs), 2-5) tool-specific fine-mapped posterior probability (PP) with 95% credible set (CS_95%) SNPs labeled (green labels), 6) mean per-SNP PP across all fine-mapping tools, 7) gene transcript models, 8) transcription factor binding site (TFBS) annotations from ENCODE, 9) cell-type-specific histone modifications from ENCODE (Broad Institute, Bernstein lab), 10) cell-type-specific chromatin marks from Roadmap. Vertical red lines indicate the location of the lead GWAS SNP.

2. Implementation

The full echolocatoR fine-mapping pipeline can be run using just the finemap_loci() function, which ultimately produces an organized folder structure containing study- and locus-specific multi-tool fine-mapping results tables and annotated multi-track plots. If some stage of the pipeline has been run previously for a given locus, finemap_loci() will automatically detect and reuse the associated files, saving time when testing different parameters. Most echolocatoR functions can run on a standard laptop (tested on a MacBook Pro with a 2.3 GHz Intel Core i5 processor and 8 GB 2133 MHz LPDDR3 memory), or take full advantage of the package's parallelization capabilities on a high performance computing (HPC) cluster.

2.1. Rapid, robust, and scalable fine-mapping

By default, echolocatoR automatically indexes the user’s summary statistics file using Tabix (Li, 2011) for rapid on-the-fly querying. Locus-specific summary statistics are then extracted, standardized, and filtered according to user-controllable parameters such as window size (±1 Mb surrounding the index SNP by default), minor allele frequency (MAF) threshold, LD block, and many other features.
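The locus extraction step described above can be illustrated with a minimal Python sketch. It is not echolocatoR's implementation (which is written in R and queries a Tabix index); the `Snp` record, the `extract_locus` function and the `min_maf` parameter are hypothetical names introduced here only to show a window filter of ±1 Mb around an index SNP combined with a MAF threshold.

```python
from dataclasses import dataclass


@dataclass
class Snp:
    rsid: str
    chrom: str
    pos: int     # base-pair position
    pval: float  # GWAS/QTL p-value
    maf: float   # minor allele frequency


def extract_locus(snps, index_rsid, window_bp=1_000_000, min_maf=0.0):
    """Keep SNPs on the index SNP's chromosome, within +/- window_bp of it,
    and with MAF >= min_maf (min_maf=0.0 is an assumed default: no filtering)."""
    index = next(s for s in snps if s.rsid == index_rsid)
    return [
        s for s in snps
        if s.chrom == index.chrom
        and abs(s.pos - index.pos) <= window_bp
        and s.maf >= min_maf
    ]


# Toy summary statistics (made-up values for illustration only).
snps = [
    Snp("rs1", "12", 40_000_000, 1e-12, 0.30),  # index SNP
    Snp("rs2", "12", 40_900_000, 1e-6, 0.02),   # inside window, but rare
    Snp("rs3", "12", 41_500_000, 1e-4, 0.25),   # outside the 1 Mb window
    Snp("rs4", "4", 40_000_100, 1e-9, 0.40),    # different chromosome
    Snp("rs5", "12", 39_200_000, 1e-5, 0.10),   # inside window
]

locus = extract_locus(snps, "rs1", min_maf=0.05)
print([s.rsid for s in locus])
```

In this toy input only the index SNP and one neighbouring common variant pass both filters; lowering `min_maf` to 0 would also retain the rare variant inside the window.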

echolocatoR integrates a suite of existing fine-mapping tools, which currently includes: ABF (Benner et al., 2016; Wellcome Trust Case Control Consortium et al., 2012; Wakefield, 2007), GCTA-COJO (Yang et al., 2012), FINEMAP (Benner et al., 2016), SuSiE (Wang et al., 2018), PolyFun (Weissbrod et al., 2019), and PAINTOR (Kichaev et al., 2017), the latter of which can be run with (i.e. PAINTOR+) or without (PAINTOR-) functional annotations. Colocalization tests between pairs of GWAS and/or QTL can also be performed using coloc (Giambartolomei et al., 2014) to identify locus-specific phenotype-relevant tissues and cell types and prioritize GWAS/QTL datasets for joint functional fine-mapping.
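To make the simplest of these methods concrete, the following Python sketch computes Wakefield-style approximate Bayes factors (ABF) from per-SNP effect sizes and standard errors, then converts them to posterior probabilities under the assumption of a single causal variant with equal prior probability for every SNP. The function names and the prior variance `prior_var=0.04` are assumptions made for this example and are not taken from echolocatoR or any of the cited tools.

```python
import math


def log_abf(beta, se, prior_var=0.04):
    """Log approximate Bayes factor in favour of association.

    With V = se^2, z = beta / se and prior effect variance W:
        ABF = sqrt(V / (V + W)) * exp(z^2 / 2 * W / (V + W))
    prior_var (W) is an assumed value chosen for illustration.
    """
    v = se ** 2
    z = beta / se
    shrink = prior_var / (v + prior_var)
    return 0.5 * math.log(1.0 - shrink) + 0.5 * z * z * shrink


def posterior_probabilities(log_bfs):
    """Normalise Bayes factors into per-SNP posterior probabilities,
    assuming exactly one causal SNP in the locus and equal priors."""
    m = max(log_bfs)  # subtract the maximum for numerical stability
    weights = [math.exp(x - m) for x in log_bfs]
    total = sum(weights)
    return [w / total for w in weights]


# Toy effect sizes and standard errors (made-up values).
betas = [0.10, 0.25, 0.05]
ses = [0.05, 0.05, 0.05]

pps = posterior_probabilities([log_abf(b, s) for b, s in zip(betas, ses)])
for i, pp in enumerate(pps):
    print(f"SNP {i}: PP = {pp:.4f}")
```

Because this approach uses no LD information and allows only one causal variant, it serves as a useful baseline against which methods that model multiple causal variants or functional priors can be compared.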

Each fine-mapping tool produces its own 95% Credible Set (CS_95%). The precise meaning of this term varies by tool, but it can be understood as the set of SNPs that contains a causal variant for the phenotype of interest with 95% probability. However, inter-tool comparisons have shown substantial heterogeneity in CS_95% (see Weissbrod et al., 2019), raising questions about the validity of any single tool across all situations, since results can be strongly influenced by the degree of LD complexity and the true number of causal SNPs (Pasaniuc and Price, 2017; Pritchard and Przeworski, 2001; Yang et al., 2011). We therefore define Consensus SNPs as those identified in the CS_95% of two or more tools, representing high-confidence putative causal SNPs. Indeed, we have shown that these Consensus SNPs have significantly higher predicted regulatory impact than either index SNPs or individual tool CS_95% SNP sets in Parkinson’s Disease (PD) (Schilder and Raj, 2020). echolocatoR automatically adds columns for Support (the number of tools whose CS_95% contains a given SNP), Consensus SNP status, and the mean posterior probability (PP) across all fine-mapping tools used.
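The Support, Consensus and mean PP columns can be sketched in a few lines of Python. This is an illustration of the definitions given above, not echolocatoR's code: the function names are hypothetical, the credible set is built with one common rule (add SNPs in decreasing PP order until the cumulative PP reaches 0.95), and a SNP missing from a tool's output is assumed to contribute a PP of 0 to the mean.

```python
def credible_set(pp, level=0.95):
    """Smallest set of top-ranked SNPs whose posterior probabilities sum to
    at least `level`. Real tools may define their credible sets differently."""
    ranked = sorted(pp, key=pp.get, reverse=True)
    chosen, total = set(), 0.0
    for snp in ranked:
        if total >= level:
            break
        chosen.add(snp)
        total += pp[snp]
    return chosen


def summarize(pp_by_tool, level=0.95, min_support=2):
    """Per-SNP Support, Consensus status and mean PP across tools."""
    sets = {tool: credible_set(pp, level) for tool, pp in pp_by_tool.items()}
    all_snps = sorted({snp for pp in pp_by_tool.values() for snp in pp})
    rows = {}
    for snp in all_snps:
        support = sum(snp in cs for cs in sets.values())
        # Assumption: a SNP absent from a tool's results counts as PP = 0.
        mean_pp = sum(pp.get(snp, 0.0) for pp in pp_by_tool.values()) / len(pp_by_tool)
        rows[snp] = {
            "Support": support,
            "Consensus": support >= min_support,
            "mean_PP": mean_pp,
        }
    return rows


# Toy posterior probabilities from three hypothetical tools.
pp_by_tool = {
    "tool_A": {"rs1": 0.90, "rs2": 0.06, "rs3": 0.04},
    "tool_B": {"rs1": 0.97, "rs2": 0.02, "rs3": 0.01},
    "tool_C": {"rs1": 0.10, "rs2": 0.02, "rs3": 0.88},
}

rows = summarize(pp_by_tool)
for snp, row in rows.items():
    print(snp, row)
```

In this toy example rs1 falls in the credible set of all three tools and is the only Consensus SNP, while rs3 has a high PP in one tool but a Support of 1, which shows why Support and mean PP are reported side by side.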

2.2. Extensive database access

2.2.1. Linkage disequilibrium

A common barrier to performing accurate fine-mapping is access to the appropriate LD reference panels. Currently, application programming interface (API) access is provided for 1000 Genomes Phases 1 & 3 (with selectable subpopulations) (The 1000 Genomes Project Consortium, 2015) and UK Biobank (Bycroft et al., 2018; Sudlow et al., 2015; Weissbrod et al.), and users can alternatively supply their own VCF files or LD matrices. Unlike existing LD querying tools (Machiela and Chanock, 2015), echolocatoR does not restrict the size of LD matrices, allowing comprehensive fine-mapping of all loci regardless of size or complexity.
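As a rough illustration of what an LD matrix contains, the Python sketch below computes pairwise signed correlations (r) between SNPs from genotype dosages (0, 1 or 2 copies of the alternate allele). This is one common estimator of LD and is only an assumption about the general approach; it does not describe how echolocatoR or the reference panels compute LD, and the function names are hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson correlation between two genotype dosage vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return float("nan")  # monomorphic SNP: correlation is undefined
    return sxy / math.sqrt(sxx * syy)


def ld_matrix(genotypes):
    """Square matrix (dict of dicts) of signed r between every pair of SNPs."""
    ids = list(genotypes)
    return {a: {b: pearson_r(genotypes[a], genotypes[b]) for b in ids} for a in ids}


# Toy dosages for six individuals (made-up values).
genotypes = {
    "rs1": [0, 1, 2, 0, 1, 2],
    "rs2": [2, 1, 0, 2, 1, 0],  # perfectly anti-correlated with rs1
    "rs3": [0, 0, 1, 1, 2, 2],  # partially correlated with rs1
}

ld = ld_matrix(genotypes)
for a in ld:
    print(a, {b: round(r, 3) for b, r in ld[a].items()})
```

The matrix holds the signed r rather than r², so rs1 and rs2 show r = -1 even though their r² is 1. Since the matrix grows quadratically with the number of SNPs in a locus, limits on matrix size in LD querying tools can become a practical constraint for large loci.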

2.2.2. Genome-wide annotations

Genome-wide annotations can be used to compute SNP-wise prior probabilities for functional fine-mapping (e.g. PolyFun, PAINTOR+). API access to a large compendium of genome-wide annotations and epigenomic data is provided, including: tissue and/or cell type/line-specific chromatin marks from Roadmap (Bernstein et al., 2010; Satterlee et al., 2019), ENCODE (Jou et al., 2019), genic annotations through biomaRt (Durinck et al., 2009), HaploReg (Zhbannikov et al., 2017; Ward and Kellis, 2012), cell-type-specific epigenomic datasets (Nott et al., 2019; Corces et al.), and hundreds of additional annotations through the R package XGR (http://xgr.r-forge.r-project.org/) (Fang et al., 2016). catalogueR, another R package developed by our group, provides rapid API access to full summary statistics from 110 uniformly reprocessed QTL datasets (across 20 studies) with parallelized Tabix queries. echolocatoR can utilize all genome-wide annotations and datasets to compare enrichment across different SNP groups (e.g. GWAS lead SNPs vs. CS_95% vs. Consensus SNPs) using XGR, GoShifter, and/or S-LDSC (Gazal et al., 2017; Finucane et al., 2015; Bulik-Sullivan et al., 2015).
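The idea of comparing annotation enrichment across SNP groups can be sketched with a simple one-sided hypergeometric test in Python. This is a deliberately simplified stand-in: XGR, GoShifter and S-LDSC each use their own statistical models (and account for LD in different ways), so the function below, its name, and all counts in the example are assumptions for illustration only.

```python
from math import comb


def hypergeom_enrichment(n_total, n_annotated, n_group, n_overlap):
    """One-sided p-value P(X >= n_overlap), where X is the number of annotated
    SNPs among n_group SNPs drawn without replacement from n_total SNPs,
    of which n_annotated overlap the annotation."""
    upper = min(n_annotated, n_group)
    tail = sum(
        comb(n_annotated, k) * comb(n_total - n_annotated, n_group - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / comb(n_total, n_group)


# Made-up counts: 5,000 SNPs across loci, 500 of them inside an annotation.
n_total, n_annotated = 5000, 500

# (group size, number of group SNPs overlapping the annotation), hypothetical.
groups = {
    "lead_SNPs": (50, 7),
    "CS_95": (200, 35),
    "Consensus": (40, 14),
}

for name, (n_group, n_overlap) in groups.items():
    p = hypergeom_enrichment(n_total, n_annotated, n_group, n_overlap)
    print(f"{name}: {n_overlap}/{n_group} annotated, p = {p:.3g}")
```

A test of this kind treats SNPs as independent draws, which LD between neighbouring variants violates; that limitation is one reason LD-aware methods are offered for the same comparison.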

2.2.3. In silico validation

We also built in API access to in silico validation datasets, including massively parallel reporter assays (MPRA) (van Arensbergen et al., 2019; Tewhey et al., 2018), S-LDSC heritability enrichment, and predictions from multiple machine learning models trained on tissue- and cell-type-specific epigenomic annotations: Basenji (Kelley et al., 2018) and DeepSEA (Zhou and Troyanskaya, 2015), both provided by Dey et al., as well as IMPACT (Amariuta et al., 2019). Lastly, we integrated motifbreakR, which uses a comprehensive set of algorithms and position weight matrices (n = 9,933) to assess whether fine-mapped variants fall within sequence motifs and to what extent they disrupt binding to specific transcription factors (Coetzee et al., 2015).

2.3. Multi-track plotting

High-resolution multi-track plots are automatically generated for each locus (Fig. 1b) and can include any combination of the following tracks: Manhattan plots of GWAS/QTL P-values or tool-specific fine-mapping posterior probabilities (PP) colored by LD with the lead SNP, mean PP, gene body models, and all aforementioned genome-wide annotations. Plots can be further customized as returned ggplot objects.

3. Conclusion

Overall, echolocatoR removes many of the primary barriers to performing a comprehensive fine-mapping investigation while improving the robustness of causal variant prediction through multi-tool consensus and in silico validation using a large compendium of (epi)genome-wide annotations. Thus, we hope that echolocatoR will make fine-mapping a standard practice, thereby helping to uncover human disease etiology and accelerate the development of novel therapeutics.

Supplementary information

Installation instructions (with an optional conda environment to minimize dependency conflicts), vignettes, example data, documentation of all functions and annotation datasets, as well as source code can be found on the echolocatoR website: https://rajlabmssm.github.io/echolocatoR

Funding

This work was supported by grants from the Michael J. Fox Foundation (Grant #14899 and #16743) and US National Institutes of Health (NIH NIA R01-AG054005).

Acknowledgements

We would like to thank Elisa Navarro, Gloriia Novikova, Cecilia Lindgren and Teresa Ferreira for their valuable feedback and suggestions. We would also like to thank Omer Weissbrod, Chris Glass and Alexi Nott for their guidance with data and/or tool integration. This work was supported in part through the computational resources provided by Scientific Computing at the ISMMS.


References

Amariuta, T. et al. (2019) IMPACT: Genomic Annotation of Cell-State-Specific Regulatory Elements Inferred from the Epigenome of Bound Transcription Factors. Am. J. Hum. Genet., 104, 879–895.
van Arensbergen, J. et al. (2019) High-throughput identification of human SNPs affecting regulatory element activity. Nat. Genet., 51, 1160–1169.
Benner, C. et al. (2016) FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics, 32, 1493–1501.
Bernstein, B.E. et al. (2010) The NIH Roadmap Epigenomics Mapping Consortium. Nat. Biotechnol., 28, 1045–1048.
Bulik-Sullivan, B.K. et al. (2015) LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet., 47, 291–295.
Bycroft, C. et al. (2018) The UK Biobank resource with deep phenotyping and genomic data. Nature, 562, 203–209.
Coetzee, S.G. et al. (2015) motifbreakR: an R/Bioconductor package for predicting variant effects at transcription factor binding sites. Bioinformatics, 31, 3847–3849.
The 1000 Genomes Project Consortium (2015) A global reference for human genetic variation. Nature, 526, 68–74.
Corces, M.R. et al. Single-cell epigenomic identification of inherited risk loci in Alzheimer’s and Parkinson’s disease.
Dey, K.K. et al. Evaluating the informativeness of deep learning annotations for human complex diseases.
Durinck, S. et al. (2009) Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc., 4, 1184–1191.
Fang, H. et al. (2016) XGR software for enhanced interpretation of genomic summary data, illustrated by application to immunological traits. Genome Med., 8, 129.
Finucane, H.K. et al. (2015) Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet., 47, 1228–1235.
Gaulton, K.J. et al. (2015) Genetic fine mapping and genomic annotation defines causal mechanisms at type 2 diabetes susceptibility loci. Nat. Genet., 47, 1415–1425.
Gazal, S. et al. (2017) Linkage disequilibrium–dependent architecture of human complex traits shows action of negative selection. Nat. Genet., 49, 1421–1427.
Giambartolomei, C. et al. (2014) Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet., 10, e1004383.
Jou, J. et al. (2019) The ENCODE Portal as an Epigenomics Resource. Curr. Protoc. Bioinformatics, 68, e89.
Kelley, D.R. et al. (2018) Sequential regulatory activity prediction across chromosomes with convolutional neural networks. Genome Res., 28, 739–750.
Kichaev, G. et al. (2017) Improved methods for multi-trait fine mapping of pleiotropic risk loci. Bioinformatics, 33, 248–255.
Kichaev, G. and Pasaniuc, B. (2015) Leveraging Functional-Annotation Data in Trans-ethnic Fine-Mapping Studies. Am. J. Hum. Genet., 97, 260–271.
Li, H. (2011) Tabix: fast retrieval of sequence features from generic TAB-delimited files. Bioinformatics, 27, 718–719.
Machiela, M.J. and Chanock, S.J. (2015) LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics, 31, 3555–3557.
Mahajan, A. et al. (2018) Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat. Genet., 50, 1505–1513.
Nott, A. et al. (2019) Brain cell type-specific enhancer-promoter interactome maps and disease risk association. Science, 366, 1134–1139.
Pasaniuc, B. and Price, A.L. (2017) Dissecting the genetics of complex traits using summary association statistics. Nat. Rev. Genet., 18, 117–127.
Pritchard, J.K. and Przeworski, M. (2001) Linkage Disequilibrium in Humans: Models and Data. Am. J. Hum. Genet., 69, 1–14.
Satterlee, J.S. et al. (2019) The NIH Common Fund/Roadmap Epigenomics Program: Successes of a comprehensive consortium. Sci. Adv., 5, eaaw6507.
Schilder, B.M. and Raj, T. (2020) Statistical and functional fine-mapping of Parkinson’s disease susceptibility loci identifies putative causal variants, mechanisms, and cell-types. bioRxiv.
Spain, S.L. and Barrett, J.C. (2015) Strategies for fine-mapping complex traits. Hum. Mol. Genet., 24, R111–9.
Sudlow, C. et al. (2015) UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med., 12, e1001779.
Tewhey, R. et al. (2018) Direct Identification of Hundreds of Expression-Modulating Variants using a Multiplexed Reporter Assay. Cell, 172, 1132–1134.
Trynka, G. et al. (2015) Disentangling the Effects of Colocalizing Genomic Annotations to Functionally Prioritize Non-coding Variants within Complex-Trait Loci. Am. J. Hum. Genet., 97, 139–152.
Wakefield, J. (2007) A Bayesian measure of the probability of false discovery in genetic epidemiology studies. Am. J. Hum. Genet., 81, 208–227.
Wang, G. et al. (2018) A simple new approach to variable selection in regression, with application to genetic fine-mapping.
Ward, L.D. and Kellis, M. (2012) HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res., 40, D930–D934.
Weissbrod, O. et al. (2019) Functionally-informed fine-mapping and polygenic localization of complex trait heritability. bioRxiv.
Weissbrod, O. et al. Functionally-informed fine-mapping and polygenic localization of complex trait heritability.
Wellcome Trust Case Control Consortium et al. (2012) Bayesian refinement of association signals for 14 loci in 3 common diseases. Nat. Genet., 44, 1294–1301.
Westra, H.-J. et al. (2018) Fine-mapping and functional studies highlight potential causal variants for rheumatoid arthritis and type 1 diabetes. Nat. Genet., 50, 1366–1374.
Yang, J. et al. (2012) Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet., 44, 369–75, S1-3.
Yang, J. et al. (2011) Genomic inflation factors under polygenic inheritance. Eur. J. Hum. Genet., 19, 807–812.
Zhang, X. et al. (2018) A fine-mapping study of central obesity loci incorporating functional annotation and imputation. Eur. J. Hum. Genet., 26, 1369–1377.
Zhbannikov, I.Y. et al. (2017) haploR: an R-package for querying web-based annotation tools. F1000Research, 6, 97.
Zhou, J. and Troyanskaya, O.G. (2015) Predicting effects of noncoding variants with deep learning-based sequence model. Nat. Methods, 12, 931–934.


Posted October 23, 2020.


Subject Area

Bioinformatics
