Phenome-wide Investigation of Health Outcomes Associated with Genetic Predisposition to Loneliness

Abdel Abdellaoui; Sandra Sanchez-Roige; Julia Sealock; Jorien L. Treur; Jessica Dennis; Pierre Fontanillas; Sarah Elson; The 23andme Research Team; Michel Nivard; Hill Fung Ip; Matthijs van der Zee; Bart Baselmans; Jouke Jan Hottenga; Gonneke Willemsen; Miriam Mosing; Li Yu; Nancy L. Pedersen; Najaf Amin; Cornelia M van Duijn; Ingrid Szilagyi; Henning Tiemeier; Alexander Neumann; Karin Verweij; Stephanie Cacioppo; John T. Cacioppo; Lea K. Davis; Abraham A. Palmer; Dorret I. Boomsma

doi:10.1101/468835

Abstract

Humans are social animals that experience intense suffering when they perceive a lack of social connection. Modern societies are experiencing an epidemic of loneliness. While the experience of loneliness is universally human, some people report experiencing greater loneliness than others. Loneliness is more strongly associated with mortality than obesity, emphasizing the need to understand the nature of the relationship between loneliness and health. While it is intuitive that circumstantial factors such as marital status and age influence loneliness, there is also compelling evidence of a genetic predisposition towards loneliness. To better understand the genetic architecture of loneliness and its relationship with associated outcomes, we conducted a genome-wide association (GWAS) meta-analysis of loneliness (N=475,661), report 12 associated loci (two novel) and significant genetic correlations with 34 other complex traits. The polygenic basis for loneliness was significantly enriched for evolutionary constrained genes and genes expressed in specific brain tissues: (frontal) cortex, cerebellum, anterior cingulate cortex, and substantia nigra. We built polygenic scores based on this GWAS meta-analysis to explore the genetic association between loneliness and health outcomes in an independent sample of 18,498 individuals for whom electronic health records were available. A genetic predisposition towards loneliness predicted cardiovascular, psychiatric, and metabolic disorders, and triglycerides and high-density lipoproteins. Mendelian randomization analyses showed evidence of a causal, increasing, effect of body fat on loneliness, and a similar weaker causal effect of BMI. Our results provide a framework for ongoing studies of the genetic basis of loneliness and its role in mental and physical health.

Introduction

Loneliness is a universal human experience that has been documented across cultures and generations. According to the evolutionary theory of loneliness¹, this familiar painful feeling corresponds to an aversive response to a discrepancy between a people’s desired and perceived level of social connectedness.^2,3 This definition, which emphasizes the desired level of social connection, highlights the difference between loneliness and solitude. Unlike solitude, the signal associated with loneliness has likely evolved to motivate humans and other social animals to seek and improve the salutary social connections needed to help them survive and reproduce.⁴ Loneliness serves as an emotional warning or signal that there is an emotional imbalance in one’s social network, regardless of the size of that network. Feeling lonely is also very common; about 5-30% of adults in Western populations report some degree of loneliness, while the actual prevalence may be higher since loneliness is stigmatized in many cultures.^5-7

Multiple factors influence the individual differences in the experience of chronic loneliness.¹ Most studies have focused on circumstantial factors such as marital status, age, and sex.^8-11 However, there are also innate individual differences in the propensity to feel lonely. Heritability estimates based on twin and family data suggest that ~37% of the variation in loneliness levels is explained by genetic factors¹². Studies using molecular genetic data provided evidence that the aggregate of common genetic variants account for 4-27% in individual differences in loneliness,^13-15 and a recent genome-wide association study (GWAS) of social interaction and isolation in the UK Biobank sample has identified 15 common genetic variants associated with loneliness.¹⁵

Both social isolation and chronic high levels of loneliness are strongly correlated with negative health outcomes; chronic loneliness has a stronger association with early mortality than obesity.¹⁶ A long-running longitudinal study on physical and mental health, the Harvard Study of Adult Development, has concluded that the warmth of one’s relationships has the greatest impact on wellbeing and life satisfaction.¹⁷ Findings like these suggest that loneliness is a public health concern. While these studies demonstrate a clear and strong correlation between loneliness and increased morbidity and mortality, the causality and etiology of the relationship between loneliness and mental and physical health is unclear. For example, one possibility is that loneliness may cause poor health, or, alternatively, poor health may cause loneliness directly or indirectly, for example, by disrupting social networks.

Here, we first conducted a GWAS meta-analysis for loneliness on nearly half-a-million subjects of European descent from various cultural backgrounds, including the UK biobank (UKB), 23andMe (USA), the Health and Retirement Study (USA), the Netherlands Twin Register (NTR), and the Swedish Twin Registry (STR). We performed secondary analyses on the summary statistics¹⁸ using gene-based and LD score regression approaches to further elucidate the biological basis underlying the propensity to feel lonely and the genetic overlap between loneliness and complex human traits related to personality, cognition, reproduction, substance use, social connections, and physical and mental health. Next, we carried out a phenome wide association study (PheWAS). PheWAS have emerged as a method to screen for associations between genetic measures and a range of phenotypes, such as those measured in electronic health records (EHR).^19,20 For certain phenotypes, EHR may provide more objective measures of physical and mental health than self-reported health data, which may not be readily known by patients (e.g., lab values) or can be distorted by mood and recall bias. Since the time of their original publications, the PheWAS approach has expanded beyond analysis of a single SNP to also include analysis of polygenic risk predictors.²¹ In this study, we constructed a polygenic predictor of loneliness using estimated SNP effects from the GWAS meta-analysis and performed a PheWAS on this polygenic score in the Vanderbilt University Medical Center (VUMC) EHR and associated biobank. Subsequent to this analysis, we analyzed a subset of quantitative traits that were significantly associated with the loneliness polygenic score in our PheWAS and known to be biomarkers for diagnoses. The goal of this analysis was to determine whether polygenic scores for loneliness were associated with known causal biomarkers. However, these analyses, like others that rely on genetic correlations, do not distinguish causal effects from pleiotropic effects. Therefore, we further tested for bidirectional causal relationships between loneliness and a selection of genetically correlated phenotypes using Mendelian randomization. We performed a comprehensive characterization of the polygenic contribution to the universal human experience of loneliness and extended this understanding to elucidate the genetic relationships between loneliness and health.

Subjectsand Methods

Subjects & Phenotype

A total of 475,661 adult subjects from 7 different cohorts were included in the GWAS meta-analysis. An overview of subjects and phenotyping across cohorts can be found in Supplementary Table 1. The UK Biobank (UKB) dataset was the largest. UKB was the only cohort with a dichotomous phenotype (N_total = 413,337: 74,142 lonely and 339,195 non-lonely individuals). The other six cohorts had three types of continuous measures for loneliness: the sum of 9 items on a 4-point scale, the sum of 3 items on a 3-point scale, and 1 item on a 4-point scale.

View this table:

Supplementary Table 1:

Subjects and phenotype details per cohort

Genotyping and QC

Information on genotyping, imputation, and QC is given in Supplementary Table 2. In all cohorts, SNP data were imputed to either 1000 Genomes or the Haplotype Reference Consortium (HRC). SNPs remaining after QC ranged from 5.7 million to 14.1 million. Based on ancestry information derived from SNP data, only subjects with European descent were included.

View this table:

Supplementary Table 2:

Genotyping, imputation, QC, and GWAS information per cohort

GWASs & Meta-Analysis

GWASs were performed in all seven cohorts separately, with the variables age, sex, family relationships, and ancestry-informative PCs as fixed effects (see Supplementary Table 2 for details). The UK Biobank dataset was split into three groups of unrelated individuals on which three separate categorical GWASs were run, and there were six continuous GWASs for the other six cohorts. The categorical GWASs (logistic regressions) on UKB were run on the following three groups: 1) the largest group of unrelated individuals with British ancestry (N_total = 332,991: 58,960 cases & 274,031 controls), 2) individuals with British ancestry that consist of family members of the first group (N_total = 57,865: 10,430 cases & 47,435 controls), and 3) individuals of Non-British European descent (N_total = 22,481: 4,752 cases & 17,729 controls). The two groups of GWASs (i.e., three categorical GWASs and six continuous GWASs) were first meta-analyzed separately using the multivariate approach described in Baselmans et al (2017)²². This approach controls for bias due to relatedness or sample overlap between GWASs by incorporating the cross-trait LD-score intercept (a measure for sample overlap) from LD-score regression (LDSC)²³ as weights, which was especially necessary for the UK Biobank datasets, since the second dataset of unrelated individuals consisted of family members of the first dataset of unrelated individuals. The two meta-analyses (categorical and continuous) were then meta-analyzed using sample size-based weights to accounts for the respective differences of heritabilities, genetic correlation, and measurement scales of the categorical and continuous GWASs (see Demontis et al, 2017, for more details).²⁴

Follow-up Analyses

Gene-based tests & gene enrichment tests: GWAS meta-analysis summary statistics were used to compute gene-based p-values in MAGMA²⁵ for 18,125 protein coding genes using FUMA.²⁶ MAGMA in FUMA was further used to test whether the effects of genes on loneliness were correlated with higher or lower gene-expression in a given tissue based on GTEx RNA-seq data.²⁶ This was tested for 30 general tissue types and 53 more specific tissue types.

LD-Score Regression Heritability Partitioning: Stratified LD-score regression was carried out using LDSC in order to partition the heritability signal into specific cell-type groups or genomic annotations.^27,28 This method requires the GWAS meta-analysis summary statistics, and LD information based on an external reference panel, for which we used the European populations from the HapMap 3 reference panel.

S-PrediXcan: S-PrediXcan²⁹ uses reference panels with both measured gene expression and genotype data collected on the same individuals to build predictive models of gene expression in samples in which only genotype information is available. Predicted expression of genes for cases and controls can then be associated with phenotypic differences, yielding a gene-based test of association that incorporates transcriptional information. We used S-PrediXcan²⁹ to predict gene expression levels in 10 brain tissues, and to test whether the predicted gene expression correlates with loneliness. Pre-computed tissue weights were employed from the Genotype-Tissue Expression (GTEx v7) project database (https://www.gtexportal.org/)³⁰ as the reference transcriptome dataset. As input data, we included the loneliness GWAS meta-analysis summary statistics, transcriptome tissue data, and covariance matrices of the SNPs within each gene model (based on HapMap SNP set; available to download at the PredictDB Data Repository) from 10 brain tissues: anterior cingulate cortex, caudate basal ganglia, cerebellar hemisphere, cerebellum, cortex, frontal cortex, hippocampus, hypothalamus, nucleus accumbens basal ganglia, and putamen basal ganglia. We used a transcriptome-wide significant threshold of p < 1.34 × 10^-6, which is the Bonferroni corrected threshold when adjusting for all tissues and genes (37,281 gene-based tests).

Genetic correlations: Genetic correlations between loneliness and 60 other traits were computed in LDSC.³¹ Here, the genetic correlation between traits is based on the estimated slope from the regression of the product of z-scores from two GWASs on the LD score and represents the genetic covariation between the two traits based on all polygenic effects captured by the included SNPs. Summary statistics from well-powered GWASs were available for 60 traits related to personality, cognition, reproduction, social circle, body composition, substance use, and physical and mental health. Multiple testing was corrected for using a Bonferroni corrected significance threshold of 8.5 × 10^-4. LD scores were based on European populations from the HapMap 3 reference panel.^23,31

Polygenic scores: All SNPs from the loneliness meta-analysis were thinned using an association-driven pruning algorithm that clumped SNPs into 250 kb windows and removed SNPs in LD (r2>0.1) with the most associated SNP (i.e., lowest p-value) in that window. LD estimates were directly derived from the BioVU samples (see below). After clumping, a total of 93,501 LD-independent SNPs remained for scoring. Scores were then constructed using PRSice software³² and defined by the sum of the number of risk alleles at each locus, weighted by their estimated effect sizes. The polygenic scores were calculated in an independent sample of 18,498 genotyped individuals of European descent in BioVU. Genotyping and QC of this sample have been described elsewhere.^20,33

PheWAS: A logistic regression model was fitted to each of 897 case/control phenotypes to estimate the odds of each diagnosis given the loneliness polygenic score, after adjustment for sex, median age across the EHR, top 10 principal components of ancestry, and genotyping batch. The 897 disease phenotypes included 32 infectious diseases, 75 neoplasms, 86 endocrine/metabolic diseases, 29 hematopoietic diseases, 36 mental disorders, 44 neurological disorders, 54 sense organs, 126 circulatory system disorders, 59 respiratory diseases, 85 digestive diseases, 77 genitourinary diseases, 3 pregnancy complications, 43 dermatologic disorders, 64 musculoskeletal disorders, 8 congenital anomalies, 24 symptoms, and 52 injuries/poisonings. We required the presence of at least two International Classification of Disease (ICD) codes that mapped to a PheWAS disease category (Phecode Map 1.2 (https://phewascatalog.org/phecodes) to assign “case” status. PheWAS analyses were run using the PheWAS R package.³⁴

Lipid traits in the EHR: We examined the relationship between polygenic risk for loneliness and three quantitative lipid traits. Clinically measured lipid levels included low density lipoprotein (LDL) (N = 6,455 with pre-medication values), high density lipoprotein (HDL) (N = 10,722), and triglycerides (trigs) (N = 11,012; Supplementary Table 5). As most patients had multiple lipid values available in their EHRs, we calculated median LDL, HDL, and triglyceride values for each patient after removing outlier values that were +/- 4 SDs from the sample mean. To adjust for age, we extracted the age at the median lipid value, and used the average age between the two median measurements if the number of lab value measurements was even. We then regressed the median lab value on sex and the cubic spline of median age, and quantile normalized the residuals. For sensitivity analyses, we also calculated the median of pre-medication (Supplementary Table 5) lipid values, using only observations that occurred before the first mention of lipid-lowering medication in the EHR,³⁵ and transformed the age- and sex-adjusted residuals as above. Linear regression models were then fitted to the median LDL, HDL, and trigs values respectively to estimate the effect of the loneliness polygenic score on each lipid trait. As the lipid traits were already sex and age adjusted, we included only the top 10 principal components of ancestry and genotyping batch as covariates.

Mendelian Randomization: We performed two-sample bi-directional Mendelian Randomization (MR)³⁶ analyses to investigate the direction of causality in the relationship between loneliness and cardiovascular risk factors and diseases. Of the eight cardiovascular risk factors and diseases for which we know the genetic correlations from the LDSC analyses (coronary artery disease [CAD], myocardial Infarction, high density lipoprotein (HDL) cholesterol, low density lipoprotein (LDL) cholesterol, total cholesterol, triglycerides, BMI, and body fat), we tested the four traits that showed a significant genetic correlation, namely CAD (r_g = .19), triglycerides (r_g = .14), BMI (r_g = .18), and body fat (r_g = .25). We used genome-wide significant SNPs from the five GWASs (loneliness and the four significant traits) to serve as instrumental variables (gene-exposure association). SNPs were pruned for LD (r²<.001), and the remaining SNPs (or proxy SNPs with r²≥0.8 when the top-SNP was not available in the other GWASs) were then identified in the GWAS summary statistics of the outcome variable (gene-outcome association). When both gene-exposure and gene-outcome associations are significant and in the expected ratio of an indirect causal effect, and the MR assumptions are met,³⁷ this is considered evidence for a causal relationship. We combined estimates from individual SNPs by applying inverse-variance weighted (IVW) linear regression.³⁸ We conducted three sensitivity analyses more robust to horizontal pleiotropy, each relying on distinct assumptions: weighted median regression³⁹, MR-Egger regression⁴⁰ and Generalized Summary-data based Mendelian Randomization (GSMR).⁴¹ Weighted median regression can provide a consistent estimate of a possible causal effect, even when up to 50% of the weight in the genetic instrument comes from invalid instruments. MR-Egger regression uses “Egger’s test” to test for bias from horizontal pleiotropy. MR-Egger will provide a consistent estimate of the causal effect, given that the strength of the genetic instrument (gene-exposure association) does not correlate with the effect that the instrument has on the outcome. This InSIDE assumption (Instrument Strength Independent of Direct Effect) is a much weaker assumption than the assumption that there is no pleiotropy. However, if the NOME (NO Measurement Error) assumption is violated, MR-Egger may be biased. Violation of NOME can be assessed with the I² statistic, which ranges between 0 and 1. When I² is below 0.9, there is a considerable risk of bias. By applying MR-Egger simulation extrapolation (SIMEX),⁴² this bias can be corrected for. When I² is below 0.6 the results of MR-Egger (even with SIMEX correction) are not reliable. For our analyses we report MR-Egger results when I² >0.9, MR-Egger SIMEX results when I² = 0.6-0.9 and we don’t report MR-Egger results when I² <0.6. Lastly, we performed GSMR, a method that takes into account LD between the different genetic variants included in an instrument. Since GSMR accounts for LD, we pruned the genetic variants included in GSMR instruments at a higher threshold of r²<0.05 (as opposed to r²<0.001). Including SNPs in higher LD than 0.05 was shown to provide very limited increase in power. GSMR includes a filtering step which excludes SNPs that are suspected to have pleiotropic effects on both the exposure and the outcome (HEIDI filtering).

RESULTS

GWAS Meta-Analysis

The proportion of phenotypic variance accounted for by all genotyped variants (SNP heritability) of the categorical loneliness measure in UKB and continuous loneliness measure were 13.3% (SE = .7) and 4.9% (SE = .8) respectively (see Supplementary Table 3). The results from the two meta-analyses (categorical and continuous phenotypes) were then meta-analyzed together using sample size-based weights.²⁴ The adjusted effective sample size of the final meta-analysis, accounting for information from related individuals, was 205,708 (see Demontis et al).²⁴ The SNP heritability of the final meta-analysis was 7.9% (SE = .4), which accounts for approximately one quarter of the total heritability as estimated in twin-family studies and is about twice as large as the SNP heritability estimate of a recently published GWAS on loneliness in the UK Biobank sample.¹⁵

View this table:

Supplementary Table 3:

Sample size, lambda, intercept, and h2 estimated from LD score regression

The genomic inflation factor λ was 1.28 for the full meta-analysis (Figure 1) and results from LDSC analysis²³ showed that this inflation was mostly due to true polygenic signal with about 1.8% of the inflation due to residual population stratification (LDSC intercept = 1.005, SE = .01, ratio = .015). We identified 14 independent genome-wide significant variants (r² < .1), which were located in 12 genomic regions (i.e., within 250 kb; Figure 1, Table 1, and Supplementary File 1).

Figure 1:

QQ-plot and Manhattan plot of meta-analysis on loneliness. A: The QQ-plot shows a considerable inflation of association statistics (λ = 1.28), which is mostly due to true polygenic signal rather than population stratification (LD-score regression intercept = 1.005, SE = .01, ratio = .015). B:Manhattan Plot of the Loneliness GWAS meta-analysis showing 14 independent genome-wide significant associations

View this table:

Table 1:

14 Independent genome-wide significant SNPs from 12 loci, with independence based on an r2 threshold of .1, belonging to the same locus if they are within 250 kb (see supplementary file 1 for more details on the significant SNPs).

MAGMA and S-PrediXcangene-basedanalyses

We performed two types of gene-based analyses using MAGMA, which aggregates SNP effects at the gene level using positional annotations, and S-PrediXcan, which uses expression quantitative-trait loci (eQTL) annotations to assign SNPs to genes. The meta-analysis summary statistics formed the basis to compute gene-based p-values in MAGMA²⁵ and S-PrediXcan²⁹ for 17,715 and 13,037 protein coding genes, respectively. In the MAGMA analysis, a total of 38 genes reached genome-wide significance at a Bonferroni corrected significance threshold of 2.82 × 10^-6 (Supplementary Figure 1). Six genes of these genes (ARFGEF2, BPTF, MBD5, PHF2, TAOK3, TCF4; Table 1) included a genome-wide significant SNP from the GWAS meta-analysis. Using S-PrediXcan,²⁹ we identified 10 genes (of which 8 were significant in the MAGMA analysis) that significantly associated with loneliness at a Bonferroni corrected significance threshold of p < 1.34 x 10^-6 across six brain tissues: anterior cingulate cortex, cerebellar hemisphere, cerebellum, prefrontal cortex, cortex, and the caudate basal ganglia (Supplementary Table 4).

View this table:

Supplementary Table 4:

Significant associations of S-PrediXcan gene-based association analyses (Bonferroni corrected significance threshold = 0.05/37281 = 1.34 × 10^-6)

Supplementary Figure 1:

Manhattan plot of the gene-based analysis showing 38 significantly associated genes

GWAS signals are significantly enriched for brain tissues and evolutionary conserved regions

Next, we investigated if genetic effects on loneliness were enriched for loci with specific functional and tissue annotations.

First, we tested whether genome-wide effects on loneliness were consistent with tissue-specific differential gene-expression based on GTEx RNA-sequence data from 53 tissues types using two approaches. For the first approach, we determined whether the distribution of effect sizes of all 17,715 protein coding genes estimated from the gene-based tests showed enrichment of expression across multiple tissues.²⁶ These results indicated that the gene-based association results were significantly enriched (Bonferroni threshold: p < 9.4 × 10^-4) for genes with higher gene-expression levels in five brain tissues: frontal cortex, cortex, cerebellar hemisphere, cerebellum, and anterior cingulate cortex (Figure 3). For the second approach, SNP-heritability of loneliness was partitioned into categories of functional SNP annotations using LDSC.²³ We found that SNPs associated with loneliness were also more likely than expected by chance (Bonferroni threshold: p < 9.4 × 10^-4) to regulate gene expression in four brain tissues including cerebellum, anterior cingulate cortex, substantia nigra, and cortex.

Figure 3:

Enrichment of gene-expression for 53 specific tissue types using MAGMA and LDscore regression.

Second, we used LDSC to test for the enrichment of 24 genomic annotations that are not specific to any cell type, including coding vs non-coding regions, promoter regions, introns, and evolutionary conserved regions (see Finucane et al, 2015²⁸ for additional details). Of these 24 annotations, the genetic signals were significantly enriched for regions that were highly evolutionary conserved in mammals, which contain 2.6% of all SNPs but explain 25% of the loneliness heritability captured by all SNPs (Figure 4).

Figure 4:

Enrichment of 24 annotations not specific to cell-types, ordered by size (proportion of SNPs).

Genetic Correlations

Genetic correlations³¹ were estimated for loneliness and 60 characteristics from 9 domains including anthropomorphic traits, cardiovascular disease risk, cognitive functions, mental health, reproduction, and substance use. After applying a Bonferroni corrected significance threshold of 8.3 × 10^-4, 34 out of 60 traits showed a significant genetic correlation with loneliness (Figure 5 & supplementary_file.csv). A significant signal was observed at least once from each of the 9 domains, with the strongest genetic correlations observed for mental health, especially for depressive symptoms (r_g = .88, p = 2.2 × 10^-101), subjective wellbeing (r_g = −.77, p = 8.0 × 10^-50), and major depressive disorder (r_g = .64, p = 2.8 × 10^-114). In the health domain, tiredness and self-rated health showed the strongest correlations (r_g = .74, p = 3.0 × 10^-59, and r_g = −.56, p = 2.5 × 10^-44, respectively; more loneliness = more tiredness and worse health), while father’s and mother’s age of death showed modest but significant negative genetic correlations with loneliness (r_g = −.32, p = 1.8 × 10^-5, and r_g = −.38, p = 1.8 × 10^-5, respectively). Four out of five personality dimensions showed a significant genetic correlation with loneliness, with neuroticism showing the highest association (r_g = .69, p = 2.4 × 10^-49), a genetic association that has recently been shown to be a major driver for the association between loneliness and personality.¹⁴ SES indicators related to economic success (Townsend index and income; r_g = .43, p = 7.7 × 10^-12, and r_g = −.50, p = 3.5 × 10^-51, respectively) showed a considerably higher genetic correlation with loneliness than indicators of cognition (IQ and educational attainment; r_g = −.19, p = 6.1 × 10^-6, and r_g = −.28, p = 3.7 × 10^-23, respectively). Genetic correlations with traits from the reproduction domain indicate that having more offspring and having offspring at a younger age is genetically associated with higher levels of loneliness, an association that is in the other direction for phenotypic correlations.¹² For substance use, alcohol consumption had a significant genetic correlation with loneliness (r_g = −.16, p = 4.9 × 10^-4), with more alcohol consumption being associated with lower loneliness, while alcohol dependence had a larger genetic correlation in the opposite direction (r_g =.43, p = 9.7 × 10^-7). In the social circle domain, family and friendship satisfaction both showed significantly larger genetic correlations with loneliness (r_g = .56, p = 2.9 × 10^-42, and r_g = .55, p = 4.6 × 10^-31, respectively; more loneliness = less satisfaction) than the frequency of friend and family visits (r_g = .18, p = 9.8 × 10^-6; more loneliness = less visits), suggesting that the subjective experience of social isolation may play a larger role in feeling lonely than objective social isolation.

Figure 5:

Genetic correlations as computed with LD score regression. Red stars are significant after Bonferroni correction.

PheWAS on the polygenic score for loneliness

Five cardiovascular, three neuropsychiatric, and one of the metabolic phenotypes were significantly associated with genetic propensity to loneliness after Bonferroni correction for the 897 phenotypes tested (p < 5.57 × 10^-5) (Figure 6). Mood disorders yielded the most significant association with the loneliness polygenic score (N_cases = 3,299, OR = 1.11, SE = .02, p = 2.8× 10^-7), followed by depression (N_cases = 2,969, OR = 1.11, SE =.02 p = 3.9 × 10^-7), heart failure (N_cases = 818, OR = 1.19, SE = .03, p = 4.9 x 10^-6), ischemic heart disease (N_cases = 5,797 OR = 1.09, SE = .02, p = 5.7 × 10^-7), and tobacco use disorder (N_cases = 1,705, OR = 1.12, SE = .03, p = 1.2 × 10^-5). Complete results may be viewed interactively at https://sealockj.shinyapps.io/loneliness_phewas/loneliness_phewas.Rmd.

Figure 6:

Results of the Phewas on the polygenic score for loneliness, corrected for gender, age, first 10 PCs, and batch.

In our subsequent analysis of quantitative lipid traits, the polygenic score for loneliness modestly but significantly predicted reduced HDL (R² = 0.16%, p = 2.99 x 10^-5) and increased triglycerides (R² = 0.16%, p = 2.40 x 10^-5), but not LDL levels (R² = 0.05%, p = 2.62 x 10^-2). To benchmark these results, we compared them to the proportion of variance explained by a polygenic score for CAD developed using the beta weights from the CARDIOoGRAMplusC4D study (http://www.cardiogramplusc4d.org/data-downloads/).⁴³ The proportion of variance explained by the polygenic score for CAD was similar in magnitude to the variance explained by the loneliness polygenic score for clinically evaluated HDL (R² = 0.34%, p = 6.96 x 10^-10), triglycerides (R² = 0.16%, p = 2.40 x 10^-5), and LDL (R² = 0.03%, p = 6.05 x 10^-2). As a negative control (see Supplementary Figure 3), we also tested whether the loneliness polygenic score predicted median height across the medical record and, as expected, observed no significant prediction of height (R² = 0.02%, p = .09) (Supplementary Figure 3). Details on the best-fit p-value thresholds used in these analyses are provided in Supplementary Table 7 and Supplementary Figure 4.

View this table:

Supplementary Table 5.

Characteristics of genotyped BioVU patients with lipid measurements

View this table:

Supplementary Table 6.

Validation of EHR-derived lipid values. Polygenic scores for HDL, LDL, and TG constructed from SNPs below pT=5×10-8 in the discovery sample were associated with the same trait in BioVU.

View this table:

Supplementary Table 7.

Prediction of lipid levels in BioVU using polygenic scores for CAD and loneliness. We identified the best fit p-value threshold for each trait pair by iterating over thresholds from 5 × 10-8 to 0.5 in increments of 5 × 10-4. R2 is the proportion of variance in the trait explained by the polygenic score, p-value is its strength of association, and N SNPs denotes the number of SNPs included in the polygenic score at a given p-value threshold.

Supplementary Figure 2.

SNP-based heritability (h²_SNP) of lipid values in BioVU and in the discovery dataset (Global Lipid Genetics Consortium). BioVU h²_SNP values were estimated by restricted maximum likelihood models in GCTA while discovery dataset h²_SNP values were estimated by LD score regression and extracted from LD Hub (http://ldsc.broadinstitute.org/lookup/).

Supplementary Figure 3.

Polygenic scores for loneliness are associated with lipid levels in BioVU. The proportion of variability explained (R2) by the loneliness polygenic scores is similar to that of a CAD polygenic scores, while neither polygenic scores are associated with height (negative control).

Supplementary Figure 4.

High-resolution scoring of polygenic scores for CAD and loneliness into HDL (A), pre-medication LDL values (B), and triglycerides (C). We identified the best-fit pT for each trait pair (black dot) by iterating over p-value thresholds from 5 × 10-8 to 0.5 in increments of 5 × 10-4.

Mendelian Randomization

To distinguish genetic correlation from causation, we applied Mendelian Randomization (MR; methodology to examine evidence for causal effects of one phenotype on another) to traits that showed a significant genetic correlation with loneliness and of which the top SNPs were unlikely to share pleiotropic effects with loneliness. We focused on the relationship between loneliness and cardiovascular disease and its associated risk factors including: coronary artery disease (CAD), myocardial infarction, HDL cholesterol, LDL cholesterol, total cholesterol, triglycerides, BMI, and body fat, which we also found to be genetically correlated with loneliness. Of these traits, we selected those with a significant genetic correlation with loneliness, namely CAD (r_g = .19), triglycerides (r_g = .14), BMI (r_g = .18), and body fat (r_g = .25) (see Figure 7). When both gene-exposure and gene-outcome associations are significant and in the expected ratio of a causal effect, and the MR assumptions are met,³⁷ this is considered evidence for a causal relationship.

Figure 7:

Two Sample Mendelian Randomization results for the causal effect of body fat on loneliness.

We found evidence for a causal effect of body fat on loneliness using the IVW, weighted median and GSMR methods, but not the MR-Egger method (higher body fat = higher loneliness; Table 2 and Figure 7). Since the effect size of MR-Egger is of similar magnitude to the other two analyses and the MR Egger intercept is not significantly different from 0 (p = .90, see Supplementary Table 12), indicating that there is no evidence of horizontal pleiotropy for this relationship, it is most likely that this analysis remained underpowered due to weak instrument variables.⁴⁴ There was also evidence for a causal, increasing effect of BMI on loneliness, but only with the GSMR method. We note that sample overlap between GWASs could cause a bias of MR results in the direction of the observational association. However, sample overlap was minimal in the present study (max 3.7%) and so it is unlikely to have affected our findings.

View this table:

Supplementary Table 10:

Cochran’s heterogeneity statistic for Inverse Variance Weighted (IVW) bidirectional two-sample Mendelian randomization analyses

View this table:

Supplementary Table 11:

I-squared statistic

View this table:

Supplementary Table 12:

MR-Egger intercept, indicating horizontal pleiotropy, for bidirectional two-sample Mendelian randomization analyses

View this table:

Table 2.

Two sample, bidirectional Mendelian randomization results

Discussion

Chronic loneliness is strongly associated with physical and mental health, and is a growing concern in many societies. In this study we performed a large GWAS meta-analysis which confirmed the polygenic architecture of loneliness, and performed follow-up analyses to further investigate the genetic architecture of loneliness and its relationship with a wide range of traits. We identified 14 SNPs located in 12 independent loci that were significantly associated with loneliness. The most significant SNP signal came from chromosome 18 within the TCF4 gene, which plays an important role in nervous system development and has recently been associated with major depressive disorder (MDD)⁴⁵. We replicated 10 loci from a previous GWAS on loneliness¹⁵ and report two novel loci on chromosomes 9 and 12 (top SNPs: rs2149351 and rs61943369, respectively).

We identified that the significantly associated variants were disproportionally enriched for regions that were conserved in mammals, confirming the biological importance of processes that underlie individual differences in loneliness. Additionally, we found that associated variants were highly enriched in the brain, in particular in the cerebellum, (frontal) cortex, anterior cingulate cortex, and substantia nigra. The cerebellum is mostly known for its modulating role in motor, cognitive, and affective functions, and has been shown to play a role in social cognition as well, especially for processes that require higher-level abstraction away from the current event (i.e., past, future or hypothetical events).⁴⁶ The prefrontal cortex is posited to be involved in the perception of social isolation (i.e., loneliness).^47-49 The anterior cingulate cortex is functionally connected with the prefrontal cortex in orchestrating emotional and physiological adjustments for potential threats and stressors, and is known to be involved the social (rather than the physical) pain associated with loneliness.⁵⁰ The substantia nigra is most known for its role in reward and learning, which extends to social contexts as well.⁵¹ A recent large GWAS meta-analysis on MDD performed similar tissue enrichment analyses and reached only partly the same conclusions: for MDD all the same cortical regions were significant (frontal cortex, cortex, anterior cingulate cortex, substantia nigra), but none of the cerebellar regions (cerebellar hemisphere and cerebellum).⁴⁵ This suggests that the role of the cerebellar region may be more specific or larger for loneliness. The complex processes underlying loneliness likely involve more brain regions, as all 13 brain regions included in the enrichment analyses depicted in Figure 3 were nominally significant with p < .05.

Over 34 out of the 60 traits in the genetic correlation analyses showed a significant genetic association with loneliness, suggesting widespread shared genetic influences (e.g., pleiotropic effects) or causal relationships between loneliness and several traits. MDD has been strongly associated with loneliness in previous studies^52-54, but evidence from longitudinal studies indicates that loneliness and depression are conceptually and statistically different constructs.^53-55 Our results confirm the strong biological ties between loneliness, major depression, and depressive symptoms in both research ascertained samples and EHR from a hospital population. Our analyses do not provide conclusive findings however regarding the direction of causation in the relationship between loneliness and MDD, due to a lack of instrument variables for loneliness that are strong enough for causal inference.

There are several traits that show a genetic correlation with loneliness that is not in line with the expected direction from phenotypic observational data. In observational data, having more offspring has been shown to be associated with lower levels of loneliness,¹² while the genetic correlations with number of offspring and age at first birth indicate an association with loneliness in the opposite direction (i.e., having more children or children at an earlier age = more lonely). Alcohol use has also been associated with higher levels of loneliness⁵⁶, while we find a genetic correlation with alcohol consumption in the opposite direction. A possible explanation of these contradictory results may be that they are driven by the genetic association between loneliness and socio-economic status, which is also associated with many life outcomes. We observe a significant genetic overlap between loneliness and SES-related traits (e.g., income, educational attainment [EA], social deprivation of the neighborhood), with lower SES indicators showing a genetic association with more loneliness (with a particularly strong genetic correlation for income: r_g = −.50). Number of offspring shows a negative genetic correlation with EA⁵⁷ (and positive with loneliness), age at first birth a positive genetic correlation with EA⁵⁷ (and negative with loneliness), while alcohol consumption shows a positive genetic correlation with EA⁵⁸ (and negative with loneliness), and alcohol dependence a negative genetic correlation with EA⁵⁹ (and positive with loneliness), which is all in line with the negative genetic association between loneliness and EA/SES.

Our phenome-wide analysis in a unique EHR dataset recapitulated the genetic correlation results and found that genetic propensity to loneliness is associated with increased risk for clinical depression, cardiovascular disease, and metabolic diseases such as type-2 diabetes. Furthermore, we found that elevated triglycerides and reduced HDL, two well-known risk factors for heart disease, were also associated with predisposition to loneliness after adjusting for covariates and even after restricting to levels prior to use of antilipemic medications. These findings provide a proof of principle that even in clinical settings, polygenic scores can be used to uncover relationships between difficult to measure behavioral traits (such as loneliness) and health outcomes. Another important advantage of this out-of-sample analysis is that by relying on physician assigned ICD codes instead of retrospective self-report (as in UK Biobank), we avoid potential reporting biases related to loneliness that may subsequently influence correlations between loneliness and health outcomes. One possible limitation of this out-of-sample analysis could be the potential for some overfitting because we included all SNPs at a p-value threshold of 1. However, this is unlikely to be a substantial driver of our results given that an analysis of loneliness polygenic scores using a p-value threshold of 0.05 yielded similar results (Supplementary Figure 5). Another important limitation is that polygenic score analyses cannot distinguish pleiotropy from causal effects.

Supplementary Figure 5:

Results of the Phewas on the polygenic score for loneliness, corrected for gender, age, first 10 PCs, and batch. The loneliness polygenic score used here is constructed using only SNPs that reach p<.05 in the GWAS meta-analysis.

With Mendelian randomization analysis – which can distinguish pleiotropic from causal effects – we found evidence of a causal, increasing, effect of body fat on loneliness, and weaker evidence of a causal, increasing, effect of BMI on loneliness. This concurs with a recent MR study reporting that BMI increases depressive symptoms and decreases subjective well-being.⁶⁰ Our evidence for a causal effect on loneliness was stronger for total body fat than for BMI, which may be due to body fat being a better measure for an unhealthy excess of body weight than BMI. Nonetheless, both findings point to an increased body weight causally leading to poorer mental health. Our MR analysis ruled out the possibility of horizontal pleiotropy among the instrument variables used in this analysis. However, it is important to note that the condition of “no pleiotropy” is only required for the instrument variables themselves and need not apply genome-wide. Indeed, it is possible (and likely) that the relationship between loneliness and health outcomes is influenced by bidirectional causal effects and pleiotropic biological effects. More research is needed to tease apart the complex etiology of these states and traits.

We identified 12 genome-wide significant loci and a total of 40 significantly associated genes. Follow-up analyses identified specific brain tissues in cortical and cerebellar regions involved in loneliness risk. We showed that a wide range of traits are genetically associated with loneliness and extended these findings to an electronic health record system. Limitations of the study include the sample size, which, while large even by modern GWAS standards is still modest for detection of genome-wide significant loci for loneliness given its heritability and heterogeneity. Future work needs to establish the etiology of these associations, and to determine which additional loci explain the rest of common genetic variation underlying loneliness, which together explain ~8% of individual differences.

ACKNOWLEDGEMENTS AND FUNDING

Acknowledgments

We dedicate this paper to the memory of Dr. John T. Cacioppo, who pioneered the scientific study of loneliness

We warmly thank all volunteer participants who contributed data to this project.

Preparation of this manuscript was supported by the National Institutes of Health (NIH, R01AG033590 to JC). NTR: DIB acknowledges the Royal Netherlands Academy of Science Professor Award (PAH/6635). Data collection and genotyping in NTR were supported by the Netherlands Organization for Scientific Research (904‐61‐090, 85‐10‐002,904‐61‐193,480‐04‐004, 400‐05‐717, Spi-56‐464‐14192 and 480-15-001/674); Biobanking and Biomolecular Resources Research Infrastructure (BBMRI –NL, 184.021.007 and 184.033.111); the Avera Institute for Human Genetics, Sioux Falls, South Dakota (USA) and the National Institutes of Health (NIH, R01D0042157‐01A), Grand Opportunity grants 1RC2MH089951‐01 and 1RC2 MH089995‐01 from the NIMH. JLT is supported by a Rubicon grant from the Netherlands Organization for Scientific Research (NWO; grant number 446-16-009). LKD is supported by a grant from the NIMH (5R01MH113362-02) and JMS is supported by an NIH training grant (2T32GM080178).

The dataset(s) used for the PheWAS analyses described were obtained from Vanderbilt University Medical Center’s BioVU which is supported by institutional funding, the 1S10RR025141-01 instrumentation award, and by the CTSA grant UL1TR000445 from NCATS/NIH. Additional funding provided by the NIH through grants P50GM115305 and U19HL065962. The authors wish to acknowledge the expert technical support of the VANTAGE and VANGARD core facilities, supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA068485) and Vanderbilt Vision Center (P30 EY08126).

Conflict of interests: PF, SLE and members of the 23andMe research team are employees of 23andMe Inc.

Data access: The full summary statistics for the 23andMe dataset will be made available to qualified investigators who enter into an agreement with 23andMe that protects participant privacy. Interested investigators should visit research.23andMe/collaborate/#publication to learn more and to apply for access.

Footnotes

↵* These authors jointly directed this work

References

↵
Cacioppo, J. T. & Cacioppo, S. Loneliness in the Modern Age: An Evolutionary Theory of Loneliness (ETL). Advances in Experimental Social Psychology (2018).
↵
Cacioppo, S., Grippo, A. J., London, S., Goossens, L. & Cacioppo, J. T. Loneliness: Clinical import and interventions. Perspectives on Psychological Science 10, 238-249 (2015).
OpenUrl CrossRef PubMed
↵
Cacioppo, J. T., Cacioppo, S., Capitanio, J. P. & Cole, S. W. The neuroendocrinology of social isolation. Annual review of psychology 66, 733-767 (2015).
OpenUrl CrossRef PubMed
↵
Cacioppo, J. T., Cacioppo, S. & Boomsma, D. I. Evolutionary mechanisms for loneliness. Cognition & emotion 28, 3-21 (2014).
OpenUrl CrossRef PubMed Web of Science
↵
Beutel, M. E. et al. Loneliness in the general population: prevalence, determinants and relations to mental health. BMC psychiatry 17, 97 (2017).
OpenUrl
Hakulinen, C. et al. Social isolation and loneliness as risk factors for myocardial infarction, stroke and mortality: UK Biobank cohort study of 479 054 men and women. Heart, heartjnl-2017-312663 (2018).
↵
Victor, C. R. & Yang, K. The prevalence of loneliness among adults: a case study of the United Kingdom. The Journal of psychology 146, 85-104 (2012).
OpenUrl CrossRef PubMed Web of Science
↵
Yang, K. Causal conditions for loneliness: a set-theoretic analysis on an adult sample in the UK. Quality & quantity 52, 685-701 (2018).
OpenUrl
Simon, M. A., Chang, E.-S., Zhang, M., Ruan, J. & Dong, X. The prevalence of loneliness among US Chinese older adults. Journal of aging and health 26, 1172-1188 (2014).
OpenUrl CrossRef PubMed Web of Science
de Jong Gierveld, J., Keating, N. & Fast, J. E. Determinants of loneliness among older adults in Canada. Canadian Journal on Aging/La Revue canadienne du vieillissement 34, 125-136 (2015).
OpenUrl
↵
Honigh-de Vlaming, R., Haveman-Nies, A., Bos-Oude Groeniger, I., de Groot, L. & van’t Veer, P. Determinants of trends in loneliness among Dutch older people over the period 2005-2010. Journal of aging and health 26, 422-440 (2014).
OpenUrl CrossRef PubMed
↵
Distel, M. A. et al. Familial resemblance for loneliness. Behavior genetics 40, 480-494 (2010).
OpenUrl CrossRef PubMed Web of Science
↵
Gao, J. et al. Genome-wide association study of loneliness demonstrates a role for common variation. Neuropsychopharmacology 42, 811 (2017).
OpenUrl CrossRef
↵
Abdellaoui, A. et al. Associations between loneliness and personality are mostly driven by a genetic association with neuroticism. Journal of personality (2018).
↵
Day, F. R., Ong, K. K. & Perry, J. R. Elucidating the genetic basis of social interaction and isolation. Nature communications 9, 2457 (2018).
OpenUrl
↵
Holt-Lunstad, J., Smith, T. B., Baker, M., Harris, T. & Stephenson, D. Loneliness and social isolation as risk factors for mortality: a meta-analytic review. Perspectives on Psychological Science 10, 227-237 (2015).
OpenUrl CrossRef PubMed
↵
Vaillant, G. E. Triumphs of experience. (Harvard University Press, 2012).
↵
Pasaniuc, B. & Price, A. L. Dissecting the genetics of complex traits using summary association statistics. Nature Reviews Genetics 18, 117-127, doi:10.1038/nrg.2016.142 (2017).
OpenUrl CrossRef
↵
Denny, J. C. et al. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nature biotechnology 31, 1102 (2013).
OpenUrl CrossRef PubMed
↵
Denny, J. C. et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations. Bioinformatics 26, 1205-1210 (2010).
OpenUrl CrossRef PubMed Web of Science
↵
Roden, D. M. Phenome‐wide association studies: a new method for functional genomics in humans. The Journal of physiology 595, 4109-4115 (2017).
OpenUrl
↵
Baselmans, B. M. et al. Multivariate Genome-wide and integrated transcriptome and epigenome-wide analyses of the Well-being spectrum. bioRxiv, 115915 (2017).
↵
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature genetics 47, 291-295 (2015).
OpenUrl CrossRef PubMed
↵
Demontis, D. et al. Discovery of the first genome-wide significant risk loci for ADHD. bioRxiv, 145581 (2017).
↵
de Leeuw, C. A., Mooij, J. M., Heskes, T. & Posthuma, D. MAGMA: generalized gene-set analysis of GWAS data. PLoS computational biology 11, e1004219 (2015).
OpenUrl
↵
Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. FUMA: Functional mapping and annotation of genetic associations. bioRxiv, 110023 (2017).
↵
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature genetics 47, 291 (2015).
OpenUrl CrossRef PubMed
↵
Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nature genetics 47, 1228 (2015).
OpenUrl CrossRef PubMed
↵
Barbeira, A. et al. MetaXcan: summary statistics based gene-level association method infers accurate PrediXcan results. bioRxiv, 045260 (2016).
↵
Consortium, G. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648-660 (2015).
OpenUrl Abstract/FREE Full Text
↵
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nature genetics 47, 1236-1241 (2015).
OpenUrl CrossRef PubMed
↵
Euesden, J., Lewis, C. M. & O’Reilly, P. F. PRSice: polygenic risk score software. Bioinformatics 31, 1466-1468 (2014).
OpenUrl
↵
Ruderfer, D. M. et al. Significant shared heritability underlies suicide attempt and clinically predicted probability of attempting suicide. bioRxiv, 266411 (2018).
↵
Carroll, R. J., Bastarache, L. & Denny, J. C. R PheWAS: data analysis and plotting tools for phenome-wide association studies in the R environment. Bioinformatics 30, 2375-2376 (2014).
OpenUrl CrossRef PubMed Web of Science
↵
Xu, H. et al. MedEx: a medication information extraction system for clinical narratives. Journal of the American Medical Informatics Association 17, 19-24 (2010).
OpenUrl CrossRef PubMed
↵
Burgess, S. et al. Using published data in Mendelian randomization: a blueprint for efficient identification of causal risk factors. European journal of epidemiology 30, 543-552 (2015).
OpenUrl CrossRef PubMed
↵
Davey Smith, G. & Hemani, G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Human molecular genetics 23, R89-R98 (2014).
OpenUrl CrossRef PubMed Web of Science
↵
Ehret, G. B. et al. Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk. Nature 478, 103 (2011).
OpenUrl CrossRef PubMed Web of Science
↵
Bowden, J., Davey Smith, G., Haycock, P. C. & Burgess, S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genetic epidemiology 40, 304-314 (2016).
OpenUrl CrossRef PubMed
↵
Bowden, J., Davey Smith, G. & Burgess, S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. International journal of epidemiology 44, 512-525 (2015).
OpenUrl CrossRef PubMed
↵
Zhu, Z. et al. Causal associations between risk factors and common diseases inferred from GWAS summary data. Nature communications 9, 224 (2018).
OpenUrl
↵
Bowden, J. et al. Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the I 2 statistic. International journal of epidemiology 45, 1961-1974 (2016).
OpenUrl
↵
Nikpay, M. et al. A comprehensive 1000 Genomes–based genome-wide association meta-analysis of coronary artery disease. Nature genetics 47, 1121 (2015).
OpenUrl CrossRef PubMed
↵
Burgess, S. & Thompson, S. G. Interpreting findings from Mendelian randomization using the MR-Egger method. European journal of epidemiology 32, 377-389 (2017).
OpenUrl CrossRef PubMed
↵
Wray, N. R. & Sullivan, P. F. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. bioRxiv, 167577 (2017).
↵
Van Overwalle, F., Baetens, K., Mariën, P. & Vandekerckhove, M. Social cognition and the cerebellum: a meta-analysis of over 350 fMRI studies. Neuroimage 86, 554-572 (2014).
OpenUrl
↵
Cacioppo, S., Capitanio, J. P. & Cacioppo, J. T. Toward a neurology of loneliness. Psychological Bulletin 140, 1464 (2014).
OpenUrl CrossRef PubMed
Amodio, D. M. & Frith, C. D. Meeting of minds: the medial frontal cortex and social cognition. Nature Reviews Neuroscience 7, 268 (2006).
OpenUrl CrossRef PubMed Web of Science
↵
Layden, E. A. et al. Perceived social isolation is associated with altered functional connectivity in neural networks associated with tonic alertness and executive control. Neuroimage 145, 58-73 (2017).
OpenUrl
↵
Cacioppo, S. et al. A quantitative meta-analysis of functional imaging studies of social rejection. Scientific reports 3, 2027 (2013).
OpenUrl
↵
Delgado, M. R. Reward‐related responses in the human striatum. Annals of the New York Academy of Sciences 1104, 70-88 (2007).
OpenUrl CrossRef PubMed Web of Science
↵
Abdellaoui, A. et al. Predicting Loneliness with Polygenic Scores of Social, Psychological, and Psychiatric Traits. Genes, Brain, and Behavior (2018).
↵
Cacioppo, J. T., Hawkley, L. C. & Thisted, R. A. Perceived social isolation makes me sad: 5-year cross-lagged analyses of loneliness and depressive symptomatology in the Chicago Health, Aging, and Social Relations Study. Psychology and aging 25, 453 (2010).
OpenUrl CrossRef PubMed Web of Science
↵
Cacioppo, J. T., Hughes, M. E., Waite, L. J., Hawkley, L. C. & Thisted, R. A. Loneliness as a specific risk factor for depressive symptoms: cross-sectional and longitudinal analyses. Psychology and aging 21, 140 (2006).
OpenUrl CrossRef PubMed Web of Science
↵
Weeks, D. G., Michela, J. L., Peplau, L. A. & Bragg, M. E. Relation between loneliness and depression: a structural equation analysis. J. Pers. Soc. Psychol. 39, 1238 (1980).
OpenUrl CrossRef PubMed Web of Science
↵
Åkerlind, I. & Hörnquist, J. O. Loneliness and alcohol abuse: A review of evidences of an interplay. Social science & medicine 34, 405-414 (1992).
OpenUrl CrossRef PubMed
↵
Barban, N. et al. Genome-wide analysis identifies 12 loci influencing human reproductive behavior. Nature genetics 48, 1462 (2016).
OpenUrl CrossRef PubMed
↵
Clarke, T.-K. et al. Genome-wide association study of alcohol consumption and genetic overlap with other health-related traits in UK Biobank (N= 112 117). Molecular psychiatry 22, 1376 (2017).
OpenUrl CrossRef PubMed
↵
Walters, R. K. et al. Trans-ancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. bioRxiv, 257311 (2018).
↵
van den Broek, N. et al. Causal Associations Between Body Mass Index and Mental Health: A Mendelian Randomization Study. bioRxiv, 168690 (2017).
Sudlow, C. et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS medicine 12, e1001779 (2015).
OpenUrl
Sanchez-Roige, S. et al. Genome-wide association study of delay discounting in 23,217 adult research participants of European ancestry. Nature neuroscience 21, 16 (2018).
OpenUrl CrossRef PubMed
Willemsen, G. et al. The Adult Netherlands Twin Register: twenty-five years of survey and biological data collection. Twin Research and Human Genetics 16, 271-281 (2013).
OpenUrl CrossRef Web of Science
Ikram, M. A. et al. The Rotterdam Study: 2018 update on objectives, design and main results. European Journal of Epidemiology 32, 807-850 (2017).
OpenUrl CrossRef
Magnusson, P. K. et al. The Swedish Twin Registry: establishment of a biobank and other recent developments. Twin Research and Human Genetics 16, 317-329 (2013).
OpenUrl CrossRef Web of Science
Durand, E. Y., Do, C. B., Mountain, J. L. & Macpherson, J. M. Ancestry composition: a novel, efficient pipeline for ancestry deconvolution. biorxiv, 010512 (2014).
Ehli, E. A. et al. A method to customize population-specific arrays for genome-wide association testing. European Journal of Human Genetics 25, 267 (2017).
OpenUrl

View the discussion thread.

Posted November 14, 2018.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Genetics

Subject Areas

All Articles

Animal Behavior and Cognition (5200)
Biochemistry (11703)
Bioengineering (8722)
Bioinformatics (29127)
Biophysics (14932)
Cancer Biology (12048)
Cell Biology (17359)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14143)
Epidemiology (2067)
Evolutionary Biology (18268)
Genetics (12220)
Genomics (16766)
Immunology (11841)
Microbiology (28005)
Molecular Biology (11552)
Neuroscience (60808)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4939)
Plant Biology (10384)
Scientific Communication and Education (1679)
Synthetic Biology (2877)
Systems Biology (7333)
Zoology (1642)

[1] ↵
Cacioppo, J. T. & Cacioppo, S. Loneliness in the Modern Age: An Evolutionary Theory of Loneliness (ETL). Advances in Experimental Social Psychology (2018).

[2] ↵
Cacioppo, S., Grippo, A. J., London, S., Goossens, L. & Cacioppo, J. T. Loneliness: Clinical import and interventions. Perspectives on Psychological Science 10, 238-249 (2015).
OpenUrl CrossRef PubMed

[3] ↵
Cacioppo, J. T., Cacioppo, S., Capitanio, J. P. & Cole, S. W. The neuroendocrinology of social isolation. Annual review of psychology 66, 733-767 (2015).
OpenUrl CrossRef PubMed

[4] ↵
Cacioppo, J. T., Cacioppo, S. & Boomsma, D. I. Evolutionary mechanisms for loneliness. Cognition & emotion 28, 3-21 (2014).
OpenUrl CrossRef PubMed Web of Science

[5] ↵
Beutel, M. E. et al. Loneliness in the general population: prevalence, determinants and relations to mental health. BMC psychiatry 17, 97 (2017).
OpenUrl

[6] Hakulinen, C. et al. Social isolation and loneliness as risk factors for myocardial infarction, stroke and mortality: UK Biobank cohort study of 479 054 men and women. Heart, heartjnl-2017-312663 (2018).

[7] ↵
Victor, C. R. & Yang, K. The prevalence of loneliness among adults: a case study of the United Kingdom. The Journal of psychology 146, 85-104 (2012).
OpenUrl CrossRef PubMed Web of Science

[8] ↵
Yang, K. Causal conditions for loneliness: a set-theoretic analysis on an adult sample in the UK. Quality & quantity 52, 685-701 (2018).
OpenUrl

[9] Simon, M. A., Chang, E.-S., Zhang, M., Ruan, J. & Dong, X. The prevalence of loneliness among US Chinese older adults. Journal of aging and health 26, 1172-1188 (2014).
OpenUrl CrossRef PubMed Web of Science

[10] de Jong Gierveld, J., Keating, N. & Fast, J. E. Determinants of loneliness among older adults in Canada. Canadian Journal on Aging/La Revue canadienne du vieillissement 34, 125-136 (2015).
OpenUrl

[11] ↵
Honigh-de Vlaming, R., Haveman-Nies, A., Bos-Oude Groeniger, I., de Groot, L. & van’t Veer, P. Determinants of trends in loneliness among Dutch older people over the period 2005-2010. Journal of aging and health 26, 422-440 (2014).
OpenUrl CrossRef PubMed

[12] ↵
Distel, M. A. et al. Familial resemblance for loneliness. Behavior genetics 40, 480-494 (2010).
OpenUrl CrossRef PubMed Web of Science

[13] ↵
Gao, J. et al. Genome-wide association study of loneliness demonstrates a role for common variation. Neuropsychopharmacology 42, 811 (2017).
OpenUrl CrossRef

[14] ↵
Abdellaoui, A. et al. Associations between loneliness and personality are mostly driven by a genetic association with neuroticism. Journal of personality (2018).

[15] ↵
Day, F. R., Ong, K. K. & Perry, J. R. Elucidating the genetic basis of social interaction and isolation. Nature communications 9, 2457 (2018).
OpenUrl

[16] ↵
Holt-Lunstad, J., Smith, T. B., Baker, M., Harris, T. & Stephenson, D. Loneliness and social isolation as risk factors for mortality: a meta-analytic review. Perspectives on Psychological Science 10, 227-237 (2015).
OpenUrl CrossRef PubMed

[17] ↵
Vaillant, G. E. Triumphs of experience. (Harvard University Press, 2012).

[18] ↵
Pasaniuc, B. & Price, A. L. Dissecting the genetics of complex traits using summary association statistics. Nature Reviews Genetics 18, 117-127, doi:10.1038/nrg.2016.142 (2017).
OpenUrl CrossRef

[19] ↵
Denny, J. C. et al. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nature biotechnology 31, 1102 (2013).
OpenUrl CrossRef PubMed

[20] ↵
Denny, J. C. et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations. Bioinformatics 26, 1205-1210 (2010).
OpenUrl CrossRef PubMed Web of Science

[21] ↵
Roden, D. M. Phenome‐wide association studies: a new method for functional genomics in humans. The Journal of physiology 595, 4109-4115 (2017).
OpenUrl

[22] ↵
Baselmans, B. M. et al. Multivariate Genome-wide and integrated transcriptome and epigenome-wide analyses of the Well-being spectrum. bioRxiv, 115915 (2017).

[23] ↵
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature genetics 47, 291-295 (2015).
OpenUrl CrossRef PubMed

[24] ↵
Demontis, D. et al. Discovery of the first genome-wide significant risk loci for ADHD. bioRxiv, 145581 (2017).

[25] ↵
de Leeuw, C. A., Mooij, J. M., Heskes, T. & Posthuma, D. MAGMA: generalized gene-set analysis of GWAS data. PLoS computational biology 11, e1004219 (2015).
OpenUrl

[26] ↵
Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. FUMA: Functional mapping and annotation of genetic associations. bioRxiv, 110023 (2017).

[27] ↵
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature genetics 47, 291 (2015).
OpenUrl CrossRef PubMed

[28] ↵
Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nature genetics 47, 1228 (2015).
OpenUrl CrossRef PubMed

[29] ↵
Barbeira, A. et al. MetaXcan: summary statistics based gene-level association method infers accurate PrediXcan results. bioRxiv, 045260 (2016).

[30] ↵
Consortium, G. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648-660 (2015).
OpenUrl Abstract/FREE Full Text

[31] ↵
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nature genetics 47, 1236-1241 (2015).
OpenUrl CrossRef PubMed

[32] ↵
Euesden, J., Lewis, C. M. & O’Reilly, P. F. PRSice: polygenic risk score software. Bioinformatics 31, 1466-1468 (2014).
OpenUrl

[33] ↵
Ruderfer, D. M. et al. Significant shared heritability underlies suicide attempt and clinically predicted probability of attempting suicide. bioRxiv, 266411 (2018).

[34] ↵
Carroll, R. J., Bastarache, L. & Denny, J. C. R PheWAS: data analysis and plotting tools for phenome-wide association studies in the R environment. Bioinformatics 30, 2375-2376 (2014).
OpenUrl CrossRef PubMed Web of Science

[35] ↵
Xu, H. et al. MedEx: a medication information extraction system for clinical narratives. Journal of the American Medical Informatics Association 17, 19-24 (2010).
OpenUrl CrossRef PubMed

[36] ↵
Burgess, S. et al. Using published data in Mendelian randomization: a blueprint for efficient identification of causal risk factors. European journal of epidemiology 30, 543-552 (2015).
OpenUrl CrossRef PubMed

[37] ↵
Davey Smith, G. & Hemani, G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Human molecular genetics 23, R89-R98 (2014).
OpenUrl CrossRef PubMed Web of Science

[38] ↵
Ehret, G. B. et al. Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk. Nature 478, 103 (2011).
OpenUrl CrossRef PubMed Web of Science

[39] ↵
Bowden, J., Davey Smith, G., Haycock, P. C. & Burgess, S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genetic epidemiology 40, 304-314 (2016).
OpenUrl CrossRef PubMed

[40] ↵
Bowden, J., Davey Smith, G. & Burgess, S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. International journal of epidemiology 44, 512-525 (2015).
OpenUrl CrossRef PubMed

[41] ↵
Zhu, Z. et al. Causal associations between risk factors and common diseases inferred from GWAS summary data. Nature communications 9, 224 (2018).
OpenUrl

[42] ↵
Bowden, J. et al. Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the I 2 statistic. International journal of epidemiology 45, 1961-1974 (2016).
OpenUrl

[43] ↵
Nikpay, M. et al. A comprehensive 1000 Genomes–based genome-wide association meta-analysis of coronary artery disease. Nature genetics 47, 1121 (2015).
OpenUrl CrossRef PubMed

[44] ↵
Burgess, S. & Thompson, S. G. Interpreting findings from Mendelian randomization using the MR-Egger method. European journal of epidemiology 32, 377-389 (2017).
OpenUrl CrossRef PubMed

[45] ↵
Wray, N. R. & Sullivan, P. F. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. bioRxiv, 167577 (2017).

[46] ↵
Van Overwalle, F., Baetens, K., Mariën, P. & Vandekerckhove, M. Social cognition and the cerebellum: a meta-analysis of over 350 fMRI studies. Neuroimage 86, 554-572 (2014).
OpenUrl

[47] ↵
Cacioppo, S., Capitanio, J. P. & Cacioppo, J. T. Toward a neurology of loneliness. Psychological Bulletin 140, 1464 (2014).
OpenUrl CrossRef PubMed

[48] Amodio, D. M. & Frith, C. D. Meeting of minds: the medial frontal cortex and social cognition. Nature Reviews Neuroscience 7, 268 (2006).
OpenUrl CrossRef PubMed Web of Science

[49] ↵
Layden, E. A. et al. Perceived social isolation is associated with altered functional connectivity in neural networks associated with tonic alertness and executive control. Neuroimage 145, 58-73 (2017).
OpenUrl

[50] ↵
Cacioppo, S. et al. A quantitative meta-analysis of functional imaging studies of social rejection. Scientific reports 3, 2027 (2013).
OpenUrl

[51] ↵
Delgado, M. R. Reward‐related responses in the human striatum. Annals of the New York Academy of Sciences 1104, 70-88 (2007).
OpenUrl CrossRef PubMed Web of Science

[52] ↵
Abdellaoui, A. et al. Predicting Loneliness with Polygenic Scores of Social, Psychological, and Psychiatric Traits. Genes, Brain, and Behavior (2018).

[53] ↵
Cacioppo, J. T., Hawkley, L. C. & Thisted, R. A. Perceived social isolation makes me sad: 5-year cross-lagged analyses of loneliness and depressive symptomatology in the Chicago Health, Aging, and Social Relations Study. Psychology and aging 25, 453 (2010).
OpenUrl CrossRef PubMed Web of Science

[54] ↵
Cacioppo, J. T., Hughes, M. E., Waite, L. J., Hawkley, L. C. & Thisted, R. A. Loneliness as a specific risk factor for depressive symptoms: cross-sectional and longitudinal analyses. Psychology and aging 21, 140 (2006).
OpenUrl CrossRef PubMed Web of Science

[55] ↵
Weeks, D. G., Michela, J. L., Peplau, L. A. & Bragg, M. E. Relation between loneliness and depression: a structural equation analysis. J. Pers. Soc. Psychol. 39, 1238 (1980).
OpenUrl CrossRef PubMed Web of Science

[56] ↵
Åkerlind, I. & Hörnquist, J. O. Loneliness and alcohol abuse: A review of evidences of an interplay. Social science & medicine 34, 405-414 (1992).
OpenUrl CrossRef PubMed

[57] ↵
Barban, N. et al. Genome-wide analysis identifies 12 loci influencing human reproductive behavior. Nature genetics 48, 1462 (2016).
OpenUrl CrossRef PubMed

[58] ↵
Clarke, T.-K. et al. Genome-wide association study of alcohol consumption and genetic overlap with other health-related traits in UK Biobank (N= 112 117). Molecular psychiatry 22, 1376 (2017).
OpenUrl CrossRef PubMed

[59] ↵
Walters, R. K. et al. Trans-ancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. bioRxiv, 257311 (2018).

[60] ↵
van den Broek, N. et al. Causal Associations Between Body Mass Index and Mental Health: A Mendelian Randomization Study. bioRxiv, 168690 (2017).

[61] Sudlow, C. et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS medicine 12, e1001779 (2015).
OpenUrl

[62] Sanchez-Roige, S. et al. Genome-wide association study of delay discounting in 23,217 adult research participants of European ancestry. Nature neuroscience 21, 16 (2018).
OpenUrl CrossRef PubMed

[63] Willemsen, G. et al. The Adult Netherlands Twin Register: twenty-five years of survey and biological data collection. Twin Research and Human Genetics 16, 271-281 (2013).
OpenUrl CrossRef Web of Science

[64] Ikram, M. A. et al. The Rotterdam Study: 2018 update on objectives, design and main results. European Journal of Epidemiology 32, 807-850 (2017).
OpenUrl CrossRef

[65] Magnusson, P. K. et al. The Swedish Twin Registry: establishment of a biobank and other recent developments. Twin Research and Human Genetics 16, 317-329 (2013).
OpenUrl CrossRef Web of Science

[66] Durand, E. Y., Do, C. B., Mountain, J. L. & Macpherson, J. M. Ancestry composition: a novel, efficient pipeline for ancestry deconvolution. biorxiv, 010512 (2014).

[67] Ehli, E. A. et al. A method to customize population-specific arrays for genome-wide association testing. European Journal of Human Genetics 25, 267 (2017).
OpenUrl

Phenome-wide Investigation of Health Outcomes Associated with Genetic Predisposition to Loneliness

Abstract

Introduction

Subjectsand Methods

Subjects & Phenotype

Genotyping and QC

GWASs & Meta-Analysis

Follow-up Analyses

RESULTS

GWAS Meta-Analysis

MAGMA and S-PrediXcangene-basedanalyses

GWAS signals are significantly enriched for brain tissues and evolutionary conserved regions

Genetic Correlations

PheWAS on the polygenic score for loneliness

Mendelian Randomization

Discussion

ACKNOWLEDGEMENTS AND FUNDING

Acknowledgments

Footnotes

References

Citation Manager Formats

Subject Area