116 independent genetic variants influence the neuroticism personality trait in over 329,000 UK Biobank individuals

Michelle Luciano; Saskia P Hagenaars; Gail Davies; W David Hill; Toni-Kim Clarke; Masoud Shirali; Riccardo E Marioni; Sarah E Harris; David C Liewald; Chloe Fawns-Ritchie; Mark J Adams; David M Howard; Cathryn M Lewis; Catharine R. Gale; Andrew M McIntosh; Ian J Deary

doi:10.1101/168906

Abstract

Neuroticism is a stable personality trait ¹; twin studies report heritability between 30% and 50% ², and SNP-based heritability is about 15% ³. Higher levels of neuroticism are associated with poorer mental and physical health ^4,5, and the economic burden of neuroticism for societies is high ⁶. To date, genome-wide association (GWA) studies of neuroticism have identified up to 11 genetic loci ^3,7. Here we report 116 significant independent genetic loci from a GWA of neuroticism in 329,821 UK Biobank participants, with replication available in a GWA meta-analysis of neuroticism in 122,867 individuals. Genetic signals for neuroticism were enriched in neuronal genesis and differentiation pathways, and substantial genetic correlations were found between neuroticism and depressive symptoms (r_g = .82, SE=.03), major depressive disorder (r_g = .69, SE=.07) and subjective wellbeing (rg = -.68, SE=.03) alongside other mental health traits. These discoveries significantly advance our understanding of neuroticism and its association with major depressive disorder.

Understanding why people differ in neuroticism will be an important contribution to understanding people’s liability to poor mental health through the life course. The strong genetic correlation between neuroticism and mental health, especially anxiety and major depressive disorder ^8,9, means that exploring the genetic contribution to differences in neuroticism is one way to understand more about these common and burdensome, but aetiologically intractable illnesses. In the largest GWA study of major depression, 44 independent genetic loci have been identified ¹⁰.

The UK Biobank open resource (http://www.ukbiobank.ac.uk) has health and medical information for over 500,000 individuals aged 40-69 years from the United Kingdom, assessed between 2006 and 2010 ¹¹; genetic data are also available ¹². We performed a GWA analysis of trait neuroticism in 329,821 unrelated adults (152,710/46.3% male) of European descent from this resource who had high-quality genotype data (Online Methods). Neuroticism was measured by the total score of the 12-item Eysenck Personality Questionnaire-Revised Short Form (EPQ-R-S) ¹³ (Supplementary Table 1 and Supplementary Fig. 1; Online Methods). The score used in the association analysis was residualized for the effects of age, sex, assessment centre, genotype batch, array, and 40 genetic principal components. Neuroticism scores were tested against 18,485,882 bi-allelic single nucleotide polymorphism (SNP) variants, based on the Haplotype Reference Consortium panel, with a minor allele frequency ≥ 0.0005 and an information/imputation quality score of ≥ 0.1 under an additive model using the BGENIE software ¹². The distribution of obtained versus expected results under the null hypothesis showed some genomic inflation, with a lambda of 1.15 (the quantile-quantile plot is shown in Supplementary Fig. 2). Using univariate linkage disequilibrium score regression (LDSR) ¹⁴, 3.2% of this inflation was shown to be due to the presence of a large polygenic signal, indicated by the intercept being close to 1 (1.02, SE = .01). SNP-based heritability of neuroticism was calculated using LSDR, and was estimated at .108 (SE=.005).

Genome-wide significance (P < 5 × 10⁻⁸) was demonstrated for 10,353 genetic variants with a further 17,668 variants supported at a suggestive level (P < 1 × 10⁻⁵) of significance (Supplementary Table 3). The Manhattan plot is shown in Figure 1 and gene annotation for the significant 1000G SNPs is shown in Supplementary Table 4. Using the PLINK clumping tool ¹⁵, 116 of the significant SNPs were shown to be independent (see Supplementary Table 5 for gene annotation and LD intervals). There was one locus each on chromosomes 20 and 22; two loci each on chromosomes 1, 4, 10 and 19; three loci on chromosome 5; four loci each on chromosomes 12, 13, 14 and 16; six loci each on chromosomes 6, 15, 17 and 18; seven loci each on chromosomes 2 and 3; eight loci on chromosome 7; 12 loci on chromosome 11; and 20 loci on chromosome 8. Five SNPs were exonic, located in MSRA, NOS1, PINX1, ZCCHC14, and C12orf49 genes; a further two were coding SNPs in RPP21 (a missense mutation) and AGBL1 (synonymous). For the 116 independent SNPs, evidence of expression quantitative trait loci (eQTL) was explored using the GTEx database and this confirmed that 44 of these were eQTLs (Supplementary Table 5). A Regulome DB score (http://www.regulomedb.org/) was used to identify SNPs with a likely regulatory function. Thirty-three of the 116 SNPs were included in the Regulome DB database and eight of these had a score < 3, indicating that they are likely to bind to DNA and be involved in gene regulation (Supplementary Table 5).

Figure 1. GWA results for neuroticism in 329,821 UK Biobank individuals.

The two SNPs—rs6981523 and rs9611519—previously identified for neuroticism in 23andMe ¹⁶ were similarly significant in our larger sample, with respective p-values of 4.7 × 10⁻²² and 1.17 × 10⁻¹⁰ and consistent direction of allelic effect. SNP rs35855737, significant in the GPC-2 GWA of neuroticism ¹⁷, was not significant, P = .069. Previous association of SNPs within 8p23.1 linked to an inversion polymorphism (based partly on a subsample of UK Biobank) was even stronger in our study, with the lead SNPs, rs2572431 ⁷ and rs12682352 ³, showing respective p-values of 1.33 × 10⁻¹⁸ and 1.11 × 10⁻²⁴. Our lead SNP in this region was rs2921036 (P = 8.04 × 10⁻²⁶), an intergenic SNP located within a tighter region of LD (8,092,025-8,863,059 base pairs) within the larger ~4 Mb region (regional association plot is shown in Fig. 2). The 8p23.1 locus was previously cited as important in developmental neuropsychiatric disorders ¹⁸ which may be relevant for personality traits, which emerge early in life, and could therefore be influenced by the same developmental processes.

Figure 2. Regional association plot for chromosome 8p suggestive/significant signal.

Replication of the significant association signals in UK Biobank was sought from the results of a GWA meta-analysis of neuroticism which included 23andMe (N = 59,206) ¹⁶ and the Genetics of Personality Consortium (GPC-2; N = 63,661) ¹⁷. Of the genome-significant SNPs in UK Biobank, 10,171 were present in the 23andMe and GPC meta-analysis GWA results. Replication was observed for 984 of these at a conservative Bonferroni-corrected level of P < 4.9 × 10⁻⁶ and all these replicated SNPs increased in significance when meta-analysed with the discovery sample, indicating consistent allelic effect. Supplementary Table 6 shows the effect allele, effect size, and p-value for the discovery, replication, and discovery + replication meta-analysis samples. Of the 116 independent associated SNPs, 111 were present in the replication cohort, with 51 nominally significant (P < .05; see Supplementary Table 5), and 15 at a Bonferroni-corrected level (P < .00045; see Table 1). Figures 3 and 4 show the regional association plot for chromosomes 22 and 11 which, like chromosome 8, showed multiple genes present in the associated LD region. All other replicated independent loci were localised or situated nearby only one or two genes.

View this table:

Table 1. Fifteen independent SNPs associated with neuroticism in UK Biobank most strongly replicated in the meta-analysis of 23andMe and the GPC cohorts.

Figure 3. Regional association plot for chromosome 22 suggestive/significant signal.

Figure 4. Regional association plot for chromosome 11 suggestive/significant signal.

Gene-based analysis of the GWA results was performed using MAGMA ²⁰ ; 249 genes were significantly associated at a Bonferroni-corrected level (α = 0.05 / 18,080; P < 2.77 × 10⁻⁶ ; Supplementary Table 7). These genes corresponded with the genes annotated to the single SNP GWA findings. Three of these were genes (STH, HIST1H3J, HIST1H4L) containing a single SNP. Of the replicated independent GWA SNPs that were in/nearby genes, the following significant genes were corroborated in the gene-based results: CACNA1E, XKR6, MSRA, LINGO2, AGBL2, CELF4, ZC3H7B and BAIAP2. SNP rs6981523, previously identified in 23andMe, was an intergenic SNP near XKR6; this gene was the second most significant gene in our gene-based analysis (P = 6.55 × 10⁻³²). L3MBTL2 and CHADL, wherein 23andMe’s other significant SNP, rs9611519, resided, showed respective gene-based p-values of 2.40 × 10⁻⁶ and 1.15 × 10⁻⁶.

Pathway analysis in MAGMA highlighted five significant gene ontology pathways (family-wise error P < 1.21 × 10⁻⁶): neuron spine (cellular), homophilic cell adhesion via plasma membrane adhesion molecules (biological), neuron differentiation (biological), cell cell adhesion via plasma membrane adhesion molecules (biological), and neurogenesis (biological). See Table 2 for further details.

View this table:

Table 2. Significant gene ontology pathways for neuroticism in UK Biobank

LD score regression ²¹ was used to estimate the genetic correlation between neuroticism and a variety of mental health traits (Supplementary Table 8). The strongest correlation was observed for depressive symptoms (r_g = .82, SE = .03). Major depressive disorder and subjective wellbeing showed moderate-to-strong correlations (~.68), schizophrenia, ADHD and anorexia nervosa showed significant and moderate-to-low correlations (~.20), and Alzheimer’s disease had a low correlation (.10). Genetic correlations with ADHD, bipolar disorder and Alzheimer’s disease, which were not significant in a previous study using ~108,000 UK Biobank participants, ²² were now significant. The genetic correlation of one between Eysenck neuroticism and other neuroticism scales (used by 23andMe and the GPC) confirms that GWA meta-analysis based on different measurement instruments is valid. Polygenic profile analyses indicated that the neuroticism polygenic score explained 2.75% of the variance in neuroticism (β = .19, P = 1.26 × 10⁻⁴) and 0.8% of the variance in depression status (OR = 1.24, P = 2.80 × 10⁻⁸) in Generation Scotland. The results for each threshold can be found in Supplementary Table 9.

Given its strong association with human misery and happiness, its protean predictive power for health—from anxiety and depression to longevity ^5,23—, and the huge personal and societal burdens it brings, understanding the environmental and genetic origins of people’s differences in neuroticism is a health priority ⁶. The combination, in UK Biobank, of a large sample and a well-validated neuroticism scale has afforded the discovery of 116 genetic loci that influence neuroticism levels, most of them novel. These discoveries promise paths to understand the mechanisms whereby some people become depressed, and of broader human differences in happiness, and they are a resource for those seeking novel drug targets for major depression. As proof-of-principle, the CRHR1 gene (highlighted in our SNP and gene-based analysis) has been associated with anxiety, depression and neuroticism ^3,24,25 and is involved in normal hormonal responses to stress; the glucocorticoid pathway is thus a relevant and well-known target. After millennia in which scholars and researchers have sought the sources of individual differences in people’s proneness to dysphoria ²⁶, the present study adds significantly to explaining the (genetic) anatomy of melancholy.

Online Methods

Genome-wide association analysis in UK Biobank

An imputed dataset, including >92 million variants, referenced to the UK10K haplotype, 1000 Genomes Phase 3, and Haplotype Reference Consortium (HRC) panels was available in UK Biobank. The current analysis includes only those SNPs available in the HRC reference panel. Quality control filters were applied (see online Supplementary Methods) which resulted in 18,485,882 imputed SNPs for analysis in 329,821 individuals. The GWA of neuroticism was conducted using BGENIE ¹², a program specifically developed to analyse UK Biobank data in a fast and efficient manner. Further information can be found at the following URL: https://jmarchini.org/bgenie/. A linear SNP association model was tested which accounted for genotype uncertainty. Neuroticism was pre-adjusted for age, sex, genotyping batch, genotyping array, assessment centre, and 40 principal components to speed up analysis.

The number of independent signals from the GWAS was determined using LD-clumping in PLINK v1.90b3i ²⁷ (https://www.cog-genomics.org/plink2). The LD structure was based on SNPs with a p-value < 1 × 10⁻³ that were extracted from the imputed genotypes. Index SNPs were identified (P < 5 × 10⁻⁸) and clumps were formed for SNPs with P < 1 × 10⁻⁵ that were in LD (R² > 0.1) and within 500kb of the index SNP. SNPs were assigned to no more than one clump.

Meta-analysis of GWA Results

Two meta-analyses were performed. To check for replication of the top GWA signals in UK Biobank, results from a meta-analysis of 23andme ¹⁶ and the Genetics of Personality Consortium (GPC) ¹⁷ were used. This meta-analysis was conducted using METAL ²⁸ and due to the lack of phenotype harmonisation across the cohorts, a sample size weighted metaanalysis was preferred. For all significant SNPs in UK Biobank replicated in the metaanalysis of 23andMe and the GPC, an additional meta-analysis including UK Biobank was performed using the same method.

Genome-wide Gene-based Analysis

Gene-based analysis of neuroticism was performed using MAGMA ²⁰, which provides gene-based statistics derived using the results of the GWA analysis. Genetic variants were assigned to genes based on their position according to the NCBI 37.3 build, with no additional boundary placed around the genes. This resulted in a total of 18,080 genes being analysed. The European panel of the 1000 Genomes data (phase 1, release 3) was used as a reference panel to account for linkage disequilibrium. A genome-wide significance threshold for gene-based associations was calculated using the Bonferroni method (α=0.05/18,080; P < 2.77 × 10⁻⁶).

Functional annotation and gene expression

For the 116 independent genome-wide significant SNPs identified by LD clumping, evidence of expression quantitative trait loci (eQTL) and functional annotation were explored using publicly available online resources. The Genotype-Tissue Expression Portal (GTEx) (http://www.gtexportal.org) was used to identify eQTLs associated with the SNPs. Functional annotation was investigated using the Regulome DB database ²⁹ (http://www.regulomedb.org/).

Pathway Analysis

Biological pathway analysis was performed on the gene-based analysis results. This gene-set enrichment analysis was conducted utilising gene-annotation files from the Gene Ontology (GO) Consortium (http://geneontology.org/) ³⁰ taken from the Molecular Signatures Database (MSigDB) v5.2. The GO consortium includes gene-sets for three ontologies; molecular function, cellular components and biological function. This annotation file consisted of 5,917 gene-sets which were corrected for multiple testing correction using the MAGMA default setting correcting for 10,000 permutations.

Linkage Disequilibrium Score Regression

Univariate Linkage disequilibrium Score (LDSC) regression ¹⁴ was used to test for residual stratification in our GWAS summary statistics and to derive a heritability estimate. An LD regression was performed by regressing the GWA test statistics (χ²) on to each SNP’s LD score (the sum of squared correlations between the minor allele frequency count of a SNP with the minor allele frequency count of every other SNP). This regression allows for the estimation of heritability from the slope, and a means to detect residual confounders, the intercept. Bivariate LDSC regression ²¹ was used to derive genetic correlations between neuroticism and the following phenotypes: attention deficit hyperactivity disorder (ADHD), Alzheimer’s disease, schizophrenia, anorexia nervosa, depressive symptoms, major depressive disorder, and subjective wellbeing. For Alzheimer’s disease, a 500-kb region surrounding APOE was excluded and the analysis re-run (Alzheimer’s disease (500kb)). Further details, including source of GWA summary statistics can be found in the online supplementary material.

Polygenic Prediction into Generation Scotland

Polygenic profile analyses were performed to predict neuroticism and depression status in Generation Scotland (GS) ¹⁹. Polygenic profiles were created in PRSice ³¹ using the UK Biobank neuroticism SNP-based association results, for 7,388 unrelated individuals in GS. SNPs with a MAF <0.01 were removed prior to creating the polygenic profiles. Clumping was used to obtain SNPs in linkage disequilibrium with an r² < 0.25 within a 250kb window. Individuals were removed from GS if they had contributed to both UK Biobank and GS (n = 302). Polygenic profile scores were created based on the significance of the association in UK Biobank with the neuroticism phenotype, at p-value thresholds of 0.01, 0.05, 0.1, 0.5 and 1 (all SNPs). Linear regression models were used to examine the associations between the polygenic profile and neuroticism score in GS, adjusting for age at measurement, sex and the first 10 genetic principal components to adjust for population stratification. Logistic regression models were used to examine depression status, adjusting for the same covariates as in the neuroticism models. The false discovery rate (FDR) method was used to correct for multiple testing across the polygenic profiles for neuroticism at all five thresholds ³².

Data Availability

The GWA results generated by this analysis will be made publicly available via the UK Biobank repository.

Author Disclosure

IJD was a participant in UK Biobank. The other authors declare no conflict of interest.

Author Information

Centre for Cognitive Ageing and Cognitive Epidemiology, Department of Psychology, School of Philosophy, Psychology and Language Sciences, The University of Edinburgh, Edinburgh, UK

Michelle Luciano, Saskia P Hagenaars, Gail Davies, W David Hill, Riccardo E Marioni, Sarah E Harris David C Liewald, Chloe Fawns-Ritchie, Toni-Kim Clarke, Masoud Shirali, Mark J Adams, David M Howard, Andrew M McIntosh, Catharine R. Gale & Ian J Deary

Division of Psychiatry, University of Edinburgh, Edinburgh, UK Toni-Kim Clarke, Masoud Shirali, Mark J Adams, David M Howard & Andrew M McIntosh

Centre for Genomic & Experimental Medicine, MRC Institute of Genetics & Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh, UK. Riccardo E Marioni & Sarah E Harris

MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, De Crespigny Park, Denmark Hill, London, UK Saskia P Hagenaars & Cathryn M Lewis

MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton, UK. Catharine R. Gale

Contributions

M.L. drafted the manuscript with contributions from D.W.H. and I.J.D. G.D., D.C.L., R.E.M., M.J.A. and D.M.H. performed quality control of UK Biobank data and/or Generation Scotland. M.L, G.D, S.P.H., and M.S. analysed the data. T-K.C., C.F-R., and S.E.H. performed/assisted with downstream analysis. M.L. and I.J.D. co-ordinated the work. All authors commented on and approved the manuscript.

Supplementary

Online Methods and Results

Online Excel Results Tables (large)

Acknowledgements

This research has been conducted using the UK Biobank Resource (Application Nos. 10279 and 4844). This work was supported by The University of Edinburgh Centre for Cognitive Ageing and Cognitive Epidemiology, part of the cross council Lifelong Health and Wellbeing Initiative (MR/K026992/1); funding from the Biotechnology and Biological Sciences Research Council (BBSRC) and Medical Research Council (MRC) is gratefully acknowledged. This report represents independent research part-funded by the National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King’s College London. W.D.H. is supported by a grant from Age UK (Disconnected Mind Project). A.M.M. and I.J.D. are supported by funding from a Wellcome Trust Strategic Award (104036/Z/14/Z).

References

1.↵
Matthews, G., Deary, I.J. & Whiteman, M.C. Personality Traits, (Cambridge University Press, 2009).
2.↵
Vukasovic, T. & Bratko, D. Heritability of personality: A meta-analysis of behavior genetic studies. Psychological Bulletin. 141, pp (2015).
3.↵
Smith, D.J. et al. Genome-wide analysis of over 106 000 individuals identifies 9 neuroticism-associated loci. Mol Psychiatry 21, 749–57 (2016).
OpenUrl CrossRef PubMed
4.↵
Kubzansky, L.D., Martin, L.T. & Buka, S.L. Early manifestations of personality and adult health: a life course perspective. Health Psychol 28, 364–72 (2009).
OpenUrl CrossRef PubMed Web of Science
5.↵
Strickhouser, J.E., Zell, E. & Krizan, Z. Does Personality Predict Health and Well-Being? A Metasynthesis. Health psychology: official journal of the Division of Health Psychology, American Psychological Association (2017).
6.↵
Cuijpers, P. et al. Economic costs of neuroticism: a population-based study. Archives of General Psychiatry 67, 1086–1093 (2010).
OpenUrl CrossRef PubMed Web of Science
7.↵
Okbay, A. et al. Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses. Nat Genet 48, 624–33 (2016).
OpenUrl CrossRef PubMed
8.↵
Few, L.R. et al. Genetic variation in personality traits explains genetic overlap between borderline personality features and substance use disorders. Addiction 109, 2118–27 (2014).
OpenUrl
9.↵
Kendler, K.S., Gatz, M., Gardner, C.O. & Pedersen, N.L. Personality and major depression: a Swedish longitudinal, population-based twin study. Arch Gen Psychiatry 63, 1113–20 (2006).
OpenUrl CrossRef PubMed Web of Science
10.↵
Wray, N.R. & Sullivan, P.F. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. bioRxiv (2017).
11.↵
Sudlow, C. et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med 12, e1001779 (2015).
OpenUrl CrossRef PubMed
12.↵
Bycroft, C. et al. Genome-wide genetic data on ~500,000 UK Biobank participants. bioRxiv (2017).
13.↵
Eysenck, S.B., Eysenck, H.J. & Barrett, P. A revised version of the psychoticism scale. Personality and individual differences 6, 21–29 (1985).
OpenUrl
14.↵
Bulik-Sullivan, B.K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature genetics 47, 291–295 (2015).
OpenUrl CrossRef PubMed
15.↵
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. The American Journal of Human Genetics 81, 559–575 (2007).
OpenUrl CrossRef PubMed
16.↵
Lo, M.-T. et al. Genome-wide analyses for personality traits identify six genomic loci and show correlations with psychiatric disorders. Nat Genet 49, 152–156 (2017).
OpenUrl
17.↵
Genetics of Personality, C. et al. Meta-analysis of Genome-wide Association Studies for Neuroticism, and the Polygenic Association With Major Depressive Disorder. JAMA Psychiatry 72, 642–50 (2015).
OpenUrl
18.↵
Tabares-Seisdedos, R. & Rubenstein, J.L.R. Chromosome 8p as a potential hub for developmental neuropsychiatric disorders: implications for schizophrenia, autism and cancer. Mol Psychiatry 14, 563–589 (2009).
OpenUrl CrossRef PubMed Web of Science
19.↵
Smith, B.H. et al. Cohort Profile: Generation Scotland: Scottish Family Health Study (GS:SFHS). The study, its participants and their potential for genetic research on health and illness. International Journal of Epidemiology 42, 689–700 (2013).
OpenUrl CrossRef PubMed Web of Science
20.↵
de Leeuw, C.A., Mooij, J.M., Heskes, T. & Posthuma, D. MAGMA: Generalized Gene-Set Analysis of GWAS Data. PLoS Computational Biology 11(2015).
21.↵
Bulik-Sullivan, B.K. et al. An atlas of genetic correlations across human diseases and traits. Nat Genet 47, 1236–41 (2015).
OpenUrl CrossRef PubMed
22.↵
Gale, C. et al. Pleiotropy between neuroticism and physical and mental health: findings from 108 038 men and women in UK Biobank. Translational psychiatry 6, e791 (2016).
OpenUrl
23.↵
Deary, I.J., Weiss, A. & Batty, G.D. Intelligence and personality as predictors of illness and death: How researchers in differential psychology and chronic disease epidemiology are collaborating to understand and address health inequalities. Psychological Science in the Public Interest 11, 53–79 (2010).
OpenUrl CrossRef PubMed
24.↵
DeYoung, C.G., Cicchetti, D. & Rogosch, F.A. Moderation of the association between childhood maltreatment and Neuroticism by the corticotropin-releasing hormone receptor 1 gene. Journal of child psychology and psychiatry, and allied disciplines 52, 898–906 (2011).
OpenUrl CrossRef PubMed Web of Science
25.↵
Binder, E.B. & Nemeroff, C.B. The CRF system, stress, depression and anxiety – insights from human genetic studies. Molecular psychiatry 15, 574–588 (2010).
OpenUrl CrossRef PubMed Web of Science
26.↵
1. Faulkner, T.C.,
2. Kiessling, N.K. &
3. Blair, R.L.
Burton, R. The Anatomy of Melancholy. (eds Faulkner, T.C., Kiessling, N.K. & Blair, R.L.) (Oxford University Press, 1989).
27.↵
Chang, C.C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience 4, 7 (2015).
28.↵
Willer, C.J., Li, Y. & Abecasis, G.R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2 (2010).
29.↵
Boyle, A.P. et al. Annotation of functional variation in personal genomes using RegulomeDB. Genome research 22, 1790–1797 (2012).
OpenUrl Abstract/FREE Full Text
30.↵
Gene Ontology, C. Expansion of the Gene Ontology knowledgebase and resources. Nucleic acids research 45, D331–D338 (2017).
OpenUrl CrossRef PubMed
31.↵
Euesden, J., Lewis, C.M. & O’Reilly, P.F. PRSice: polygenic risk score software. Bioinformatics 31, 1466–1468 (2014).
OpenUrl
32.↵
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the royal statistical society. Series B (Methodological), 289–300 (1995).