Abstract
Educated people are generally healthier, have fewer comorbidities and live longer than people with less education. Previous evidence about the effects of education come from observational studies many of which are affected by residual confounding. Legal changes to the minimum school leave age is a potential natural experiment which provides a potentially more robust source of evidence about the effects of schooling. Previous studies have exploited this natural experiment using population-level administrative data to investigate mortality, and relatively small surveys to investigate the effect on mortality. Here, we add to the evidence using data from a large sample from the UK Biobank. We exploit the raising of the school-leaving age in the UK in September 1972 as a natural experiment and regression discontinuity and instrumental variable estimators to identify the causal effects of staying on in school. Remaining in school was positively associated with 23 of 25 outcomes. After accounting for multiple hypothesis testing, we found evidence of causal effects on twelve outcomes, however, the associations of schooling and intelligence, smoking, and alcohol consumption may be due to genomic and socioeconomic confounding factors. Education affects some, but not all health and socioeconomic outcomes. Differences between educated and less educated people may be partially due to residual genetic and socioeconomic confounding.
Significance Statement On average people who choose to stay in education for longer are healthier, wealthier, and live longer. We investigated the causal effects of education on health, income, and well-being later in life. This is the largest study of its kind to date and it has objective clinic measures of morbidity and aging. We found evidence that people who were forced to remain in school had higher wages and lower mortality. However, there was little evidence of an effect on intelligence later in life. Furthermore, estimates of the effects of education using conventionally adjusted regression analysis are likely to suffer from genomic confounding. In conclusion, education affects some, but not all health outcomes later in life.
Funding The Medical Research Council (MRC) and the University of Bristol fund the MRC Integrative Epidemiology Unit [MC_UU_12013/1, MC_UU_12013/9]. NMD is supported by the Economics and Social Research Council (ESRC) via a Future Research Leaders Fellowship [ES/N000757/1]. The research described in this paper was specifically funded by a grant from the Economics and Social Research Council for Transformative Social Science. No funding body has influenced data collection, analysis or its interpretations. This publication is the work of the authors, who serve as the guarantors for the contents of this paper. This work was carried out using the computational facilities of the Advanced Computing Research Centre -http://www.bris.ac.uk/acrc/ and the Research Data Storage Facility of the University of Bristol — http://www.bris.ac.uk/acrc/storage/. This research was conducted using the UK Biobank Resource.
Data access The statistical code used to produce these results can be accessed here: (https://github.com/nmdavies/UKbiobankROSLA). The final analysis dataset used in this study is archived with UK Biobank, which can be accessed by contacting UK Biobank access{at}biobank.ac.uk.
Introduction
On average educated people are healthier, happier, more intelligent, richer, and live longer than those with less education. We do not know whether this is because education directly causes these outcomes, by affecting behaviors, such as smoking, or if these differences are due to other factors, such as socioeconomic or genomic differences. Whether education causes differences in outcomes later in life has been the subject of significant debate by epidemiologists, economists and other social scientists.(1–14) Economists have argued that a substantial portion of the benefits of education accrue via its potential effects on mortality and morbidity.(14) Epidemiologists have found that people who attended university have higher fluid intelligence in adulthood.(15) These associations are robust to adjustment for parental social class and adolescent cognition, which has been taken by some as proof that education causes later outcomes.(16) Despite this, many epidemiologists and economists are acutely aware that correlation, and multivariable adjusted regression, can be unreliable evidence of causation.(17–19) The ideal study design to definitively prove whether education has causal effects would be to randomize the age at which children leave school. However, this experiment would not be ethical, cost-effective, or timely. A more feasible, but potentially robust, research design is to exploit natural experiments that affected when people left school but are not related to confounding factors.(20, 21) One widely used form of natural experiment is to exploit changes in the legal minimum school leaving age. These changes forced some people to stay in school for longer than they would have otherwise chosen.
In September 1972, the United Kingdom raised the school-leaving age from age 15 to 16. Researchers have previously used this policy change to investigate the effects of forcing students to stay in school longer using administrative data and longitudinal cohort studies.(1, 22–24) However, the cohort studies had relatively small samples and, as a result, had to include people who were born many years before and after the reform. These studies produced relatively imprecise estimates of the effects of education. Previous results from administrative data lacked detailed information on covariates to identify people born in England affected by the reform or measures of many of the outcomes of interest such as cognition.
In the current study, we used the raising of the school-leaving age in 1972 as a natural experiment to estimate the causal effects of schooling. We used a regression discontinuity design and used novel data from the UK Biobank.(25, 26) We add to the literature in two ways. First, this is the largest sample with detailed individual-level information from the school years immediately before and after the reform. Second, we used genome-wide data to prove that the observational associations of education and other outcomes are likely to suffer from genomic confounding.
Results
Of the 502,644 participants in the UK Biobank, who were all aged between 37 and 73 at recruitment in 2008, 423,106 were born in England, see Figure 1 for a flow diagram of exclusion and inclusion of participants in this study. See Table 1 for a description of their characteristics. The youngest participants, those born between 1960 and 1971 obtained more education than those born earlier in the twentieth century (Figure 2). This is consistent with the well-documented secular increase in the length of education over the period.(1) UK Biobank includes 11,240 and 10,898 participants who turned 15 years old in the first year before and the first year after the school-leaving age increased. Before the reform, 85% of participants stayed in school after the age of 15, whereas after the reform almost 100% of participants remained in school after the age of 15. The proportions of men and women who remained in school after age 15 increased over time (Figures 3 and 4). Participants born in July and August could still technically leave school before their 16th birthday, so on average participants born in the summer term report leaving school at a younger age.
Covariate Balance Tests
People who stayed in school after age 15 had higher birth weights; mothers who were less likely to smoke during pregnancy; were taller than average at age 10 relative to their peers; were more likely to have been breastfed; had fewer siblings; and were more likely to have parents who were alive; they were more likely to have SNPs known to be associated with higher educational attainment(27) (Table 2). In comparison, there were few detectable baseline differences between people affected and unaffected by the reform. The only detectable difference was that participants in the first year affected by the reform were 4.3 (95% confidence intervals (95%CI): 2.5 to 6.1) and 3.7 (95%CI: 2.6 to 4.8) percentage points more likely to have an alive father and mother when they attended the assessment center in 2008-2010. There was weak evidence that fewer participants in the younger cohort were breastfed. The participants affected by the reform are, by definition, one year younger than those who were not affected. The raw differences above do not account for this age difference. We investigated whether these associations were simply due to the effects of age by estimating the effects of two negative control “placebo” reforms. For the first placebo reform, we constructed a sample of the two school years born before the reform (negative control sample 1) and for the second, we used the two first two school years after the reform (negative control sample 2). We estimated the difference in outcomes between the school years in exactly the same way as described above for the true reform. As with the true reform, on average there is a year difference in age between the two school years in the negative control samples. Therefore, if the differences between those unaffected by the reform and those affected were solely due to aging, then we should see similar differences between the two school cohorts who turned 15 before the reforms occurred, and the two school cohorts who turned 15 after the reform occurred. We know that all the participants in the two negative control samples experienced the same school leaving age. We found few differences between the two school years in negative control sample 1, or negative control sample 2. The only detectable differences were that the younger cohorts in both cases were more likely to have parents who were alive and were less likely to be breastfed (Table 3). These results suggest that participants unaffected and affected by the reform had similar observed covariates. This suggests that environmental or genomic confounding is unlikely to bias estimates which use the raising of the school-leaving age to identify the effects of education. In contrast socioeconomic and genomic factors confound the associations of school and later outcomes. Participants who chose to stay in school had more advantaged backgrounds and were more likely to have genetic variants associated with educational attainment.
Reduced form
We report two comparisons: first, the differences between participants who chose to stay in school after the age of 15 and those who left, and second, the difference between participants who were not affected by the reform (those born before September 1957) and those who were affected by it (those born in or after September 1957). On average participants who stayed in school after age 15 had better outcomes later in life. Fewer educated participants were diagnosed with high blood pressure; diabetes; a stroke; or a heart attack; or died (left columns in Table 4). There were minor differences in cancer diagnoses by education level. Fewer participants who stayed in school were diagnosed with depression. Educated people had stronger grip strength, lower arterial stiffness; and lower blood pressure. They were also taller, lighter, and achieved higher scores on intelligence tests. Participants who left school at age 15 reported similar levels of happiness as those who stayed on. Educated people drank more alcohol, but were much less likely to smoke. They were more likely to report higher incomes; watched less television, but exercised less. There were fewer differences between the participants affected and unaffected by the reform (right columns in Table 4). Participants affected by the reform were less likely to report having high blood pressure, diabetes, a stroke, a heart attack, or to have died. The results for the clinical measurements were mixed. Participants affected by the reform had higher grip strength, lower systolic blood pressure, but they had similar arterial stiffness, height, diastolic blood pressure, intelligence, and happiness to those unaffected by the reform. There was little evidence the reform affected alcohol or tobacco consumption. Participants affected by the reform spent less time watching television, and more time exercising, however, the associations with exercise were imprecise. Finally, participants affected by the reform more likely to earn over £18,000 and £31,000, but were no more likely to earn over £52,000 or £100,000 than those unaffected by the reform. We found some evidence that education may have a larger effect on men’s chances of earning more than £31,000 (p-value for interaction=0.001), but no evidence of interaction with any other outcomes (Tables S1 and S2). There was some evidence that the reform had larger effects on participants predicted to leave before the age of 16: specifically lowering the risk of diabetes, increased likelihood of earning over £31,000, increasing grip strength and happiness, and the likelihood of being diagnosed with a stroke and drank more alcohol (Table S3).
Negative control samples
As with the covariate balance tests above we investigated whether the differences in outcomes could be solely explained by the aging process using negative control samples. Participants within each negative control sample experienced the same minimum school leaving ages. Therefore, any differences in these negative control samples are likely to be due to the aging process and not an effect of education. Table 5 reports the placebo results for the negative control samples. These cohorts estimate the average change in each outcome that can be attributed to becoming a year older. On average, the younger participants were less likely to report having high blood pressure, having had a heart attack or depression, had lower systolic and diastolic blood pressure, and arterial stiffness as measured in the clinic. They also reported watching less television. These differences are similar in size to differences in these outcomes observed in Table 4. This suggests these differences are due to the aging process and not an effect of the reform. However, this implies that the aging process cannot entirely explain remaining differences described above, and shown in Table 4 (diagnosed with diabetes or stroke, mortality, grip strength, BMI, and having income over £18,000 or £31,000). The results differed by gender (Tables S4 and S5). For example, there was evidence that on average female participants affected by the reform had 0.15 kg/m2 (95%CI: 0.01, 0.30) lower BMI. These gender-specific effects are consistent with the literature.
Instrumental variables
The associations reported in Table 4 are valid tests of the null hypothesis that education does not affect each outcome. Participants affected by the reform were 14.5 (95%CI: 13.7, 15.2) percentage points more likely to remain in school past age 15 than those who were unaffected. This association is stronger than conventional thresholds for instrumental variable analysis (F-statistic=1393). In Table 6 we report instrumental variable estimates of the size of the effects of remaining in school past the age of 15. These results suggest that remaining in school reduced the probability of being diagnosed with diabetes, stroke, or dying by 4.5 (95%CI: 2.7 to 6.3), 1.7 (95%CI: 0.6 to 2.8), and 2.9 (95%CI: 1.3 to 4.5) percentage points. These estimates are larger than the conventional linear regression (Hausman test for differences p=9.7E-7, p=0.002, and p=3.4E-4). Staying in school also increased the participants grip strength by 2.89 (95%CI: 2.38 to 3.39) kg, which again was more than implied by the conventional linear regression (Hausman p-value=3.6E-29). Finally, the instrumental variable results imply that staying in school increases the likelihood of earning more than £18,000 or £31,000 by 5.6 (95%CI: 1.5 to 9.6) and 18.8 (95%CI: 14.9 to 22.6) percentage points. Both of these estimates are smaller than implied by conventional linear regression (Hausman p-values=0.001). These results exceeded the Benjamini and Hochberg (1995) false discovery rate threshold at δ=0.05 across 25 outcomes.(28) Figures 5 and 6 show the differences between the conventional linear regression results and the instrumental variable results. They plot the point estimate and confidence intervals for the conventional linear regression and the instrumental variable results. Tables S6 and S7 report the instrumental variable results stratified by gender.
Discussion
This study provides some of the strongest evidence to date about the causal effects of education. First, our instrumental variable results indicate that socioeconomic and genomic factors can explain some of the observed associations of educational attainment and outcomes later in life. In contrast to the observational associations, we found little evidence that education had causal effects on smoking, alcohol consumption, arterial stiffness, intelligence, or likelihood of having an annual income over £100,000. Second, our results provide robust evidence that education is likely to have a causal effect on other outcomes, including diagnoses of diabetes, stroke, mortality, grip strength and having incomes above £18,000 and £31,000. Finally, we found molecular genetic evidence that traditional approaches to investigate the effects of educational attainment, such as multivariable adjusted regression, are likely to suffer from residual genomic confounding.
Clark and Royer found the participants of the Health Survey for England and the General Household Survey affected by the reform were by 26.1 (95%CI: 23.0 to 29.2) percentage points more likely to stay in school after age 15.(1) We found a smaller difference (14.0, 95%CI: 12.9 to 15.1), this may be because on average the participants of UK Biobank are more educated. Clark and Royer found little evidence that the raising of the school-leaving age reduced mortality (odds-ratio=0.99, 95%CI: 0.94 to 1.05). Whereas we found people affected by the reform had a lower risk of death (odds-ratio=0.64, 95%CI: 0.50 to 0.83). We found little evidence of reductions in mortality in the two negative control samples odds-ratios= 1.22 (95%CI: 0.97 to 1.53) and 1.08 (95%CI: 0.91 to 1.29) for negative control samples 1 and 2, in the two years before and the two years after the reform. Clark and Royer found little evidence of effects of the reform on self-reported health. This may be because they used relatively liberal bandwidths, up to 138 months either side of the reform. This meant they were comparing individuals born up to 11.5 years before the reform, to individuals who were born up to 11.5 years after the reform. For some, but not all, analyses they allowed for a month and year of birth trend, which was allowed to change after the reform. The advantage of using such a liberal bandwidth is that they could include more data, and to obtain sufficiently precise results. However, this is potentially at the expense of introducing bias by comparing individuals who differ because of secular trends. We obtained similar precision to Clark and Royer using a highly conservative 12-month bandwidth. As a sensitivity analysis, we provide results using a comparable specification to Clark and Royer in the supplementary materials, see Tables S8, S9, and S10. Furthermore, the majority of their health outcomes were self-reported in a general health survey, whereas we had precise clinic measures of specific measures of gaining which are known to change with age, such as grip strength. This means our outcomes are likely to suffer from less measurement error. Our data were consistently measured in a single study across a relatively narrow time window, whereas previous results pooled data from many years. Again this means our results are likely to suffer from less measurement error.
Epidemiologists have argued that education has causal effects on intelligence later in life. For example, Richards and Sacker found that educational attainment by age 26 was associated with intelligence at age 53,(29) which they argue was evidence that education had a causal effect on intelligence.(16) However, Deary and Johnson raised doubts about this interpretation and called for greater clarity about the assumptions underlying these analyses.(19) We found little evidence of a causal effect of education on intelligence later in life. Nguyen and colleagues used increases in the legal school-leaving ages in the United States to investigate the effects of education on risk of dementia later in life.(21) They found evidence that education reduced the risk of dementia. We cannot test this hypothesis in the UK Biobank because too few participants have been diagnosed with dementia.
People with more education are much less likely to smoke. However, it is not clear whether this is due to a causal effect of education. Gilman and colleagues found the association between education and smoking status was attenuated in sibling fixed effects designs.(30) We found that participants who remained in school were 20.1 percentage points less likely to have ever smoked. However, we found little evidence that the raising of the school leaving age affected smoking behavior. We also found that educated participants drank more heavily, but there was little evidence that this was caused by education. This is consistent with recent evidence from other UK surveys. Silles (2015) used data from the General Household Survey for Great Britain and found that whilst people who left school at an older age were less likely to smoke, there were few differences in ever-smoking rates among individuals unaffected and affected by the 1947 and 1972 raising of the school leaving age.(31) We also found some evidence that the effects of the reform were greatest in participants who would otherwise have been expected to leave at age 15.
Strengths and limitations
A key strength of our study is that we used a natural experiment to identify the effects of education. The raising of the school-leaving age in 1972 provided exogenous variation in the length of schooling people received. We found few pre-existing differences between participants on either side of the reform, suggesting that it can be used as a potentially valid instrumental variable.(32) A strength of our study is it uses one of the largest samples to date to investigate the effects of education on a wide range of outcomes. Our outcomes were recorded both in interviews and via linked Office of National Statistics registry data. This means our outcomes are likely to suffer from relatively little measurement error. Furthermore, we were able to restrict our sample to people born in England who were affected by the reform. In addition, we used genome-wide data to show that conventional multivariable adjusted regression results are likely to suffer from genomic confounding. A potential limitation of our study is that our treatment group, people affected by the reform, are one year younger than our control group, those unaffected by the reform. Many of the outcomes we investigated increase linearly or log-linearly over time. This means it is difficult to determine if any differences we observed are due to an additional year of aging or the reform. We addressed this by using negative control samples to estimate the average effects of aging. For the results for outcomes such as blood pressure, where we observed similar differences in the negative control samples as the main results, are likely to be due to aging and not an effect of the reform. However, it is likely the reform affected outcomes, such as income, where we observed much larger differences in the main results than in the negative control samples. Further, aging cannot explain why we found little evidence of causal effects of staying in school on intelligence, smoking, and alcohol consumption.
Whilst a representative sample is not a necessary condition for making causal inferences,(33) collider bias could affect our results because Biobank is a volunteer sample, which over-represents highly educated people. People affected by the reform may be more likely to participate in the study.(34) Therefore less educated individuals, who would have stayed in school had they attended school after the reform (the compliers), may be under-represented in the pre-reform sample. If true, this could attenuate our results towards the null, because these marginal students would reduce the average outcome in the “treatment” group, and be missing from the “control” group. This would improve the control group’s outcome relative to the treatment group.
Nevertheless, our results are consistent with previous results which used random samples.(31) So collider bias is unlikely to explain why there was little evidence that the reform affected intelligence, smoking and alcohol consumption. However, this issue warrants further investigation in future research. There was limited time to collect measures during the participants’ assessment center visits, therefore our measure of intelligence is relatively coarse. However, despite this, participants who remained in school had substantially higher intelligence. Our instrumental variable estimates of the effects of schooling on intelligence are sufficiently precise to rule out even relatively small effects on intelligence. Finally, our instrumental variable results are estimates of the local average treatment effect of schooling.(35) Specifically, they are the estimates of the causal effects of being forced to remain in school after the age of 15, on those who would otherwise have left school. These effects may not be externally valid to infer either the effects of compelling students to remain in school for longer or of the effects on other populations.(36, 37) This means these results may not be valid estimates of the effect of education on “always takers”, that is people who would always remain in school regardless of the reform. Nevertheless, our results are likely to be internally valid estimates of the effects of schooling on people affected by the reform.
Conclusions
Does education have causal effects? Yes, whilst education is not the panacea implied by naïve multivariable adjusted regression, in this sample staying in school did result in substantial benefits. We found robust evidence that staying in school is likely to have causal effects on some, but not all health and socioeconomic outcomes later in life. This adds to our understanding of the long-term consequences of educational decisions in childhood and adolescence. We resolve two debates about the causal effects of schooling. First, our results suggest education is unlikely to cause differences in intelligence later in life.(16, 19) Second, previous research using these reforms found little evidence that education affected health, may be due to poor measurement of health outcomes and inadequate statistical power.(1)
Materials and Methods
Data
We used data from 502,624 participants of the UK Biobank project.(25) The participants, aged between 40 and 69, were originally recruited between 2006 and 2010. In our primary analysis, we restricted our sample to participants were born in England in the school cohorts in years immediately before and after the reform took place. We do this because we have a large enough sample born in these years to precisely identify the effects of schooling.
Exposure: left school after age 15
The participants were asked if they had a college or university degree. If they did not have a degree they were asked what age they left full-time education. We coded participants who reported having a degree as leaving full-time education at age 21.
Outcomes
Health outcomes
The participants were asked whether they had ever been diagnosed by a doctor with the following health conditions: high blood pressure, stroke, diabetes, or heart attack. They were asked if they had ever had a whole week where they felt depressed or down. The death of the participants was defined using linked Office of National Statistics mortality data. Follow-up for the linked mortality data started with the first death on 10th May 2006 ended with the last recorded death on 17th February 2014. The diagnoses of cancer and information were taken from the cancer registry. The first recorded cancer diagnosis was on 20th September 1957 and the last on 25th October 2013.
Height, BMI, Blood pressure, atrial stiffness, grip strength, and intelligence
Height and weight were measured during the participants’ visit to a UK Biobank assessment center. Two measures of diastolic and systolic blood pressure were recorded via an electronic blood pressure monitor. The measurements were taken two minutes apart. Atrial stiffness was measured using an electronic measure device. Grip strength was measured in kilos using a hydraulic hand dynamometer. We residualized the measures of grip strength and atrial stiffness to control for potential between device heterogeneity. Fluid intelligence was measured via 13 logic puzzles that the participants had to answer in 2 minutes. Their score is the number of correct answers.
Health behaviors and income
During their assessment center visit, the participants were asked to report their health behaviors. They were asked about how frequently they consumed alcohol. This is coded 6 if they drank every day, 5 for three or four times a week, 4 for once or twice a week, 3 for one to three times a week, 2 for special occasions only, and 1 for never. They were asked if they smoked, or had ever smoked. They were asked how often they vigorously and moderately exercised in a typical week. Finally, they were asked if their pre-tax income was below £18,000; between £18,000 and £30,999; between £31,000 and £50,999; between £52,000 and £100,000; or above £100,000. Participants who did not answer these questions were coded as missing.
Genotype data
The participants provided a blood sample. This sample was used to extract DNA and genotype using the Axiom and BiLEVE genome-wide arrays. These arrays genotyped around 800,000 single nucleotide polymorphism (SNPs) for each participant. The genotyping data was used to impute SNPs which were not directly genotyped using the 1000 genomes and UK10K reference panels. The imputation produced a likelihood of each participant having a specific genotype at each (e.g. AA=0.1, TA=0.9, and TT=0). This resulted in a dataset of around 80,000,000 SNPs. For each participant, we created a genome-wide allele score by summing the number of genetic variants they had that were associated with higher educational attainment. We weighted each variant by its association with education reported in a large genome-wide association study.(27) This study reported the association of 8,259,394 genetic variants and the years of education in a meta-analysis of 64 studies, which did not include the UK Biobank. We normalized the allele score have mean zero and standard deviation one. The allele score represents the effects of the known effects of genetic variants on educational attainment. However, these scores only explain a minority (r2=1.32% in the full Biobank sample) of the variation in educational attainment explained by genome-wide data.(27, 38, 39) This is because of limited statistical power of existing genome-wide association studies of educational attainment. One consequence of this is that the genetic score is too poor a proxy for the total genetic effects on educational attainment to be used as a conventional covariate in a regression. However, this score does provide a valid test of the null-hypothesis that previously unobserved genetic covariates do not confound the association of education and other outcomes.(32)
Statistical methods
We use the changes in the school-leaving age to identify the effects of schooling on a range of outcomes. Our empirical strategy has five steps. First, we estimated the effect of the reforms on the proportion of participants who remained in school after age 15. Second, we investigated the associations of potential confounders with educational attainment and across the cohorts affected by the reform.(32) Third, we estimated the reduced form associations of the reform and the outcomes. Fourth, we used instrumental variable estimators to estimate the effects of the attending school on each of the outcomes. For continuous outcomes, we used conventional two-stage least squares,(40) for binary outcomes we used semi-parametric additive structural mean models.(41) To address concerns about multiple hypothesis testing, we report whether the instrumental variable results for each outcome exceed a Benjamini and Hochberg (1995) false discovery rate threshold at δ=0.05 across 25 outcomes.(28) Fifth, we conducted two negative control analyses.(28) We did this by constructing two cohorts of students, born in the two school years before and the two years after the reform. Analogous with the primary cohort we designated participants born in the more recent of the two school years as being exposed to the “placebo” reform. There are no institutional differences between each year, therefore any differences observed in these negative control cohorts will be due age effects and not an effect of raising the school-leaving age.
Identification
The raising of the school-leaving age will be a valid natural experiment for testing whether remaining in school at age 15 affects later outcomes under the following three assumptions. First, participants who attended school after the leaving age was increased must be more likely to stay in school. Second, there must be no pre-existing differences between the cohort who attended school in the year immediately before and immediately after the reform. Finally, the reform must not have any other direct effects on the outcomes. We can test the first assumption by testing whether participants affected by the reform are more likely to stay in school. We can falsify the second assumption by investigating if there were any pre-existing differences between those affected and unaffected by the reform. The final assumption cannot be empirically tested, and could be invalid if the reform also affected the labor market around the time that the participants entered the workforce. However, claimant count statistics for the UK show that the cohorts entering the labor force immediately before and after the reform faced broadly similar conditions, with increases in unemployment related to the oil crises of the 1970s not being seen until 1975 onwards.(42, 43)
1. The effect of increasing the minimum leaving age on educational attainment
We used a regression discontinuity design to estimate the effects of increasing the school-leaving age from age 15 to 16 on the proportion of students who report leaving school before the age of 15. To investigate the effect of the reform on school attendance we estimated a regression of staying school after age 15 on a dummy variable equal to one if the participant was a member of the cohort affected by the reform, and equal to zero if they were not affected. In this and all subsequent analyses we included covariates for the month of birth, to control for seasonality, and gender. In contrast to Clark and Royer (2013), we do not include a term for birth cohort because we restricted our sample to people born in the single school years immediately before and after the reform. The regression discontinuity design is identified by assuming that the reform is independent of the unobserved confounding factors, and has no other direct effects on the outcome. The effect of the reform on the probability of participants staying in school after the age of 15, our parameter of interest, is the effect of remaining in school on those who were affected by the reform. This is a local average treatment effect.(35) We report this parameter on the risk difference scale. Our regressions allow for general form heteroskedasticity and clustering by year and month of birth.
2. Covariate balance tests
We compared the associations of eleven potential confounders Cict and the exposure, left school after the age of 15, Eict, and the indicator of the reform, Dic. We estimated these associations conditional on the same set of covariates, , as above and the standard errors allow for clustering by year and month of birth.
3. Effects of increasing the school-leaving age on outcomes in later life
A. Reduced form
We estimated the associations of leaving school after age 15 and the outcomes and the association of the reform and each of the outcomes using the following linear regressions:

The first is a linear regression of each of the health outcomes on whether the participant stayed in school after the age of 15. The second regression is the reduced form association of the health outcomes and the reform. As above, each regression includes terms for gender and month of birth to account for the season of birth. The reduced form is a valid test of the null-hypothesis that schooling does not affect the outcomes.
We tested whether the reform had larger effects on people who would otherwise have been expected to leave school at age 15. We estimated the probability that a participant would remain in school after the age of 15 using logistic regression and data from individuals born before 31st August 1956. This model included indicators for the participants’ assessment center, year and month of birth, sex, whether mother smoked during pregnancy, were breastfed, their relative height and body size at age 8, number of brothers and sisters, the normalized genome-wide education score, and their ethnicity. Missing data were replaced at the mean and indicators variables for missing values were included. We estimated the following regression:

Where is probability of remaining in education from the logistic regression. For each outcome we report the coefficients on the reform indicator, and the coefficient on the interaction term and the effect of the reform. The effect of the reform on participants predicted to leave is indicated by φ1, and the effect on those expected to stay is indicated by φ1 + φ3. As with the main results above we adjust for sex and month of birth, and the interaction of these variables with predicted education. (44)
B. Instrumental variables
We estimated the causal effect of schooling using instrumental variables estimators. For the continuous outcomes, we estimated mean differences using Two-Stage Least Squares (2SLS),(40) and for the binary outcomes we estimated risk differences using additive structural mean models.(41) These models can be identified by making one of three assumptions.(41) First, for the continuous outcomes we could assume that staying in school has the same effect on the outcomes for all participants. This identifies the average effects of staying in school but is implausible for binary outcomes.(45) Second, for the binary outcomes, we could assume a monotonic relationship between the reform and the participants’ likelihood of staying in school after the age of 15. In the potential outcomes framework, we assumed that E[Y(1) – Y(0) | E(1) – E(0) > 0]. This requires that there were no participants who were “defiers”, who would have stayed in school if they were not affected by the reform, but would have left school if they were affected by the reform. Under monotonicity, the instrumental variable estimators estimate a local average treatment effect. This is the effects of treatment in the subgroup of participants whose decisions were affected by the reform.(40) That is the people in the year after the reform who would have chosen to leave school at 15 had the reform not been introduced. Finally, we could assume that the effects of education are not affected by the reform (no effect modification). This would identify the effects of education on participants who chose to stay in school. We report the partial F-statistic of the association of remained in school Eict and the reform Dic. We also report the test for endogeneity (using a C-statistic, which is a heteroskedasticity robust Hausman test (46, 47), that E[Eictwict] = 0. This implicitly tests for differences between the linear regression and instrumental variable estimates.(47) All estimates allow for clustered standard errors by year and month of birth and include controls for gender.
C. Regression discontinuity design
Finally, we used a regression discontinuity design with variable bandwidths to investigate the robustness of our findings. This is a fuzzy regression discontinuity design, as the reform only increased the probability of staying in school.(48) In our main analysis above we present reduced forms on the participants born between September 1956 and August 1958. This is a regression discontinuity design estimates mean differences with a bandwidth of one year. We used the rd command in Stata to produce local linear regression discontinuity estimates for larger bandwidths as a sensitivity analysis.(49)
D. Negative control samples
We were concerned that differences between the two school years may occur because of the participants affected by the reform were a year younger on average than participants unaffected by the reform. To investigate this, we investigated the effects of two placebo reforms. The first placebo reform compares the cohort which left school two years before the reform was introduced (i.e. those born between September 1955 and August 1956) to the cohort that left school in the year before the reform was introduced (i.e. those born between September 1956 to August 1957). The second compares participants in the school cohort in the year after the reform, (i.e. those born between September 1957 to August 1958) to those in the cohort two years after the reform (i.e. those born between September 1958 to August 1959). These cohorts have the same average difference in age as the two school years either side of the reform, but they experienced the same school-leaving age. Therefore, any differences between the younger and older school years in these negative control samples are likely to be due to aging and will not be due to the effects of education. We estimated the reduced form associations using these negative control samples. We repeated all of the analyses stratified by sex as a sensitivity analysis and report these results in the online appendix.
All analyses were conducted in StataMP 14.0.(50) Code used to generate these results can be found at (https://github.com/nmdavies/UKbiobankROSLA) and the data used has been archived with UK Biobank (http://www.ukbiobank.ac.uk) and can be accessed by contacting the study (access@ukbiobank.ac.uk). The protocol for this study is available as an online appendix.
Acknowledgements
We would like to thank the Social Science Genetic Association Consortium for providing the coefficients from the Educational attainment GWAS, and Gibran Hemani, Lavinia Paternoster, David Carslake, Jack Bowden, Louisa Zuccolo Evie Stergiakouli and Eleanor Sanderson for helpful comments on an earlier draft. All mistake remain our own.