Abstract
The observation of individuals attaining remarkable ages, and their concentration into geographic sub-regions or ‘blue zones’, has generated considerable scientific interest. Proposed drivers of remarkable longevity include high vegetable intake, strong social connections, and genetic markers. Here, we reveal new predictors of remarkable longevity and ‘supercentenarian’ status. In the United States, supercentenarian status is predicted by the absence of vital registration. The state-specific introduction of birth certificates is associated with a 69-82% fall in the number of supercentenarian records. In Italy, England, and France, which have more uniform vital registration, remarkable longevity is instead predicted by poverty, low per capita incomes, shorter life expectancy, higher crime rates, worse health, higher deprivation, fewer 90+ year olds, and residence in remote, overseas, and colonial territories. In England and France, higher old-age poverty rates alone predict more than half of the regional variation in attaining a remarkable age. Only 18% of ‘exhaustively’ validated supercentenarians have a birth certificate, falling to zero percent in the USA, and supercentenarian birthdates are concentrated on days divisible by five: a pattern indicative of widespread fraud and error. Finally, the designated ‘blue zones’ of Sardinia, Okinawa, and Ikaria corresponded to regions with low incomes, low literacy, high crime rate and short life expectancy relative to their national average. As such, relative poverty and short lifespan constitute unexpected predictors of centenarian and supercentenarian status and support a primary role of fraud and error in generating remarkable human age records.
Introduction
The concentration of remarkable-aged individuals within geographic regions or ‘blue zones’[1] has stimulated diverse efforts to understand factors driving survival patterns in these populations[2,3]. Both the overall population residing within these regions, and the individuals exceeding remarkable age cut-offs, have been subject to extensive analysis of lifestyle patterns[2,4–6], social connections[3,7], biomarkers[8,9], and genomic variants[10], under the assumption that these represent potential drivers behind the attainment of remarkable age.
However, alternative explanations for the distribution of remarkable age records appear to have been overlooked. Previous work has noted the potential of population illiteracy[11] or heterogeneity[12] to explain remarkable age patterns. Other investigations have revealed the potential role of errors[13–16], bad data[17], and operator biases[18] in generating old-age data and survival patterns. In turn, these findings prompted a response with potentially disruptive implications: that, under such models, the majority if not all remarkable age records may be errors[19].
There is a theoretical reason to expect that most or all old-age records are errors[13,15]. Consider, for example, a population of fifty-year-olds into which we introduce a vanishingly low rate of random, undetectable age-coding errors. These rare errors involve taking a forty-year-old, and making their documents state they are now fifty years old.
The 40-year-old ‘young liar’ errors will have over double the annual survival rate of the (error-free) 50-year-old population. They are, after all, biologically 10 years younger. As the ‘young liar’ errors are more likely to survive than the error-free data, age-coding errors constitute an exponentially larger fraction of the population over time. Eventually, this exponential growth overtakes the population and, even from a vanishingly low baseline error rates of (say) 0.001%[15], the entire population becomes ‘young liar’ errors at advanced ages[15]. Of course, the baseline error rate in most populations is not vanishingly low. The USA, which has more supercentenarian records than any other country, exhibits historical error rates of 17%-66%[20–23], even in the general population[20].
Here, we explore the possibility that extreme old-age data are dominated by age-coding errors by linking civil registration rates and indicators of poverty to per-capita estimates of remarkable age attainment, obtained from central population registries and validated supercentenarian databases, across the USA, France, Japan, England, and Italy.
These data reveal that remarkable age attainment is predicted by indicators of error and fraud, including illiteracy, poverty, high crime rates, short average lifespans, and the absence of birth certificates. As a result, these findings raise serious questions about the validity of an extensive body of research based on the remarkable reported ages of populations and individuals.
Such concerns were sufficiently serious that an extensive review was conducted, revealing evidence that brings into question key findings, widespread research practices, and the basic level of integrity in the field (Supplementary Materials).
Methods
The number and birthplace of all validated supercentenarians (individuals attaining 110 years of age) and semisupercentenarians (SSCs; individuals attaining 105 years of age) were downloaded from the Gerontology Research Group or GRG supercentenarian table[24] (updated 2017) and the International Database on Longevity or IDL [25]. These data were aggregated by subnational units for birth locations, which were provided for the IDL data, and obtained through biographical research for the GRG data. Populations were excluded due to incomplete subnational birthplace records (<25% complete) or countries with an insufficient number of provinces to fit spatial regressions (<15 total provinces), leaving population data on SSCs and supercentenarians in the USA, France, and England (Fig 1).
To quantify the distribution of remarkable-aged individuals in Italy, province-specific quinquennial life tables were downloaded from the Italian Istituto Nazionale di Statistica Elders.Stat database [26] to obtain age-specific cohort survivorship data (Fig 1c,f; S1 Code). Using contemporary data across Italian provinces, probabilities of survival (lx) to ages 90-115, and life expectancy at age 100 were fit as dependent variables, and survival rates at age 55 and life expectancy at age 55 as independent variables, using a simple linear mixed model regression (S1 Code).
Japanese centenarian data were downloaded from the Japanese Ministry of Health, Labour, and Welfare[27] through the Statistics Japan portal[28] for all 47 prefectures (Fig S1), alongside data on prefectural income per capita (in 2011 yen), employment rates, age structure, survivorship, and a financial strength index, for 2010: the most complete recent year available for these data (S1 Code; Fig S1). These data were also linked to the most recent available prefecture-specific poverty rates[29].
Supercentenarians recorded in the GRG database and born in the USA were matched to the 1900 census counts for state and territory populations[30], and linked to the National Center for Health Statistics estimates for the timing of complete birth and death certificate coverage in each US state and territory[31]. Both the number of supercentenarian births overall, and estimates of supercentenarians per capita, approximated by dividing supercentenarian number by state population size in the 1900 US census[30], were averaged across the USA and represented as discontinuity time series relative to the onset of complete-area birth registration (S1 Code).
To capture the geographic distribution of French supercentenarians, all 175 supercentenarians in the GRG database who were either born or deceased in France were linked to the smallest discoverable region of birth using biographical searches[24]. In addition, de-identified records in the IDL were already linked to birth locations encoded by the Nomenclature for Territorial Units level 3 codes (NUTS-3), which divide France into 101 regions [25]. These modern regions were linked manually to their corresponding Savoyard-era department to obtain historic region-specific estimates of life expectancy at birth[32] for the birth year and location of all supercentenarians in metropolitan France. For each supercentenarian, life expectancy at birth was then measured relative to the contemporary average life expectancy of metropolitan France (S1 Code).
The number of total supercentenarians and SSCs born into Eurostat NUTS-3 coded regions, either documented for French and English regions in the IDL or estimated for Italian regional cohorts by ISTAT, were linked to modern socioeconomic indicators available at this administrative level: total regional gross domestic product (GDP), GDP per capita, GDP per capita adjusted for purchase power scores (PPS), murder and employment rates per capita, and the number of 90+ year-olds, using the EUROSTAT regional database[33] (S1 Code).
Across England, additional data were obtained for the Index of Multiple Deprivation or IMD: a national metric used to indicate relative levels of deprivation, including income deprivation in people aged 60+, by the UK Office of National Statistics[34]. The IMD data are measured in 317 local authority districts, each of which is a subset of a single Eurostat NUTS-3 encoded region.
To capture the relative degree of deprivation within England, the IMD and its component scores were averaged within each of the 175 NUTS-3 regions (S1 Code).
Similar estimates of deprivation were obtained for French NUTS-3 regions, by downloading the regional poverty rates and poverty rates in the oldest available age group, ages 75 and over from the French National Institute of Statistics and Economic Studies INSEE [35].
To overcome the three orders of magnitude differences in population size across subnational geographic units, the number of centenarians, SSCs and supercentenarians were adjusted to per capita rates. However, the ‘correct’ adjustment for per capita rates of remarkable longevity is dependent on the a priori assumptions of their cause. For example, if the null hypothesis was that all supercentenarians are ‘real’, adjustment for birth cohort size 110+ years previously would be a more correct method for best predicting the population density of supercentenarians. However, if the null hypothesis is that supercentenarians are more frequently modern-era pension frauds or clerical mistakes, per capita correction for a birth cohort 110 years in the past is of uncertain value for predicting modern events. In this latter case, the occurrence of supercentenarians would be better and more accurately predicted by correcting for modern population sizes. This concern is mitigated by the non-migration of most SSCs and supercentenarians with age -- where data is available, two thirds died in their region of birth -- and the spatial stability of poverty patterns over time[36].
Historical rates were used whenever possible, favoring the ‘real supercentenarians’ assumption. Per capita rates of remarkable age attainment, calculated relative to the size of historical birth cohorts, were downloaded from the respective government statistical bureaus of Japan and Italy [26,27]. Due to the absence of birth certificates, USA supercentenarian data from the GRG were corrected to historical per capita rates based on population data in the 1900 US census[30].
However, individuals in France and England were located into geographic units that have only existed since 2003. As a result, there were no data on historical population sizes available for these geographic units. It was therefore necessary to estimate per capita rates using modern population sizes surveyed at the NUTS-3 geographic level within France and England.
To address this unavoidable difference in per capita rate calculations the number of the centenarians, SSCs, and supercentenarians were also corrected relative to the number of old-age residents in each modern geographic unit of Japan, England, and France (S1 Code). This adjustment was less susceptible to migration or large longitudinal shifts in population size, and better reflected the density of older people in modern geographic units after survival and migration processes. However, the insufficient granularity of birth cohorts within England and the considerable rearrangement of geographic units within France remains an important constraint on the upper accuracy of these models.
Collective socioeconomic indicators obtained for each country were used to develop linear mixed models across all regions at least one case, for centenarian in Japan, SSCs in Italy and England and supercentenarians in England and France (S1 Code), to predict the regional per capita and per 90+ year old density of the oldest available populations in each country. Linear mixed models were fit using either the population poverty rate (England, France, and Japan) or estimates of old-age poverty rates (percent in poverty over 75 in France, the IDOP index in England) as the single predictor variable, and the number of centenarians, SSCs and supercentenarians both per capita and per 90+ year old. These models were then extended by fitting, as interactive effects, basic socioeconomic indicators used as global indicators of health and deprivation available at a sufficient geographic level (S2 Code). Such models focused on capturing fundamental indicators, representing crime rates, health, and income, available at the NUTS-3 regional level in the EU and the prefectural level in Japan.
Where available, French supercentenarians were linked to regional estimates of life expectancy at birth, calculated quinquennially for each of the Savoyard-era departmental boundaries of France into which they were born[32]. These local rates were then corrected relative to the contemporary French national average life expectancy at birth to yield the relative life expectancy at birth, in years[32]. For example, Jeanne Calment was born in the Alpes-Maritime department in 1875, when average life expectancy at birth was just 33.4 years and the contemporary national French average life expectancy was 37.8 years: a relative regional life expectancy of −4.4 years. These rates were used to measure if regional life expectancy at birth was significantly higher or lower than the French national average, using a one sample t-test, for each supercentenarian.
To explore the potential for age manufacture amongst remarkable age records, birthdate data were aggregated within the GRG and IDL databases. Enrichment for specific birth days is usually indicative of nonrandom age selection due to fraud, error, and clerical uncertainty. This check, however, is limited in that it cannot detect diverse sources of error, such as identity fraud or failed death registrations, which retain a representative distribution of birth days.
As population representative birthdates were unavailable within the target populations, the distribution of births was tabulated by days of the month to remove the often poorly-categorized or undocumented effects of birth seasonality. This distribution was compared to both modern birthdate distributions from seventy million births in the US, which suffer distortion from the heaping of elective births and caesarean sections on certain dates, and to the distribution of birthdays under a uniform distribution of births.
To facilitate reproduction of these findings, all shareable data and code are available in a single structured file, with instructions and links for the non-shareable data, in S1 Data.
Results
Between the 1880 and 1900 census, a period covering 79% of US supercentenarian births, the US population increased by 150% and average life expectancy by 20%[30,37]. The introduction of complete vital registration in the USA coincided with this rapid increase in lifespan and population size and was expected to result in a large increase in the number of supercentenarian records per capita.
Instead, the introduction of state-wide birth certification coincides with a sharp reduction in the number of supercentenarians. In total, 82% of supercentenarian records from the USA predate state-wide birth certification. Forty-two states achieved complete birth certificate coverage during the survey period. When these states transition to state-wide birth registration, the number of supercentenarians falls by 80% per year (Fig 2a) and 69% per capita (Fig 2b).
The introduction of birth certificates in Italy largely predates the onset of supercentenarian records. Instead, the attainment of remarkable age in Italy is predicted by a short average lifespan. Within Italian cohorts, life expectancy at age 55 is positively correlated with life expectancy at all ages at all ages from 60 to 95. However, better early-and mid-life survival is predictive of worse mortality rates after age 95 (r = 0.15; p = 0.1; Fig S2a). Cohort survival to age 55 is increasingly negatively correlated with survival to ages 100 (r = -0.17; p = 0.06; Fig S2b), 105 (r = -0.38; p = 0.00003; Fig S2c) and 110 years (r = -0.45; p < 0.000001; Fig S2d), and with life expectancy at age 100 (r = -0.4; p=0.00001). Furthermore, individuals across Italy are more likely to attain supercentenarian ages if their province has higher unemployment rates (Fig S3a), fewer people above age 90 (Fig S3b), and a worse economy (Fig S3c).
Findings from the Italian data support the hypothesis that centenarians, SSCs, and supercentenarians largely constitute a collection of errors and pension fraud[19]. For example, of Italy’s 114 provinces Medio Campidano ranked second for supercentenarians, third for SSCs and fourth for centenarians per capita. However, the province also had the second-lowest employment rate of all Italian provinces and the single lowest GDP per capita (both unadjusted and PPS-adjusted; Table S1). Individuals born in Medio Campidano were the second-least likely to survive from birth to age 55, after the Blue Zone province of Ogliastra. By this metric, the province of Olbia-Tempio was the seventh-worst province for survival to age 55, and according to Eurostat had the eighth-fewest individuals alive over the age of 90, yet somehow also ranked as the best province for survival to ages 100,105 and 110 (Table S1).
French supercentenarians are over-represented in the overseas departments, former colonial holdings, and Corsica (which is included in metropolitan France; Table S2): regions that historically constitute some of the most neglected, least well-documented, and shortest-lived administrative regions of France. At the first reliable estimate of population size in 1950 these departments contained around 1.7% of French citizens. However, at least 11% (N=16) of the French supercentenarians in the GRG database originate there: a 6.5-fold over-representation. This number increases when integrating the deidentified IDL data, which only includes regions monitored by Eurostat (Réunion, Guadeloupe, Martinique and Corsica), to establish the minimum numbers of supercentenarians born in each region. Once these data are included, at a minimum the overseas and former colonial regions of France contain twenty-four (15.5%) of a total 155 supercentenarians: Guadeloupe and Martinique each contain eight, French Algeria four, and one each in French Guiana, Saint Barthélemy, Réunion, and New Caledonia.
When IDL and GRG data are combined, the top six (NUTS-2 level) regions of France by supercentenarians per capita are, respectively, Saint Barthélemy, Martinique, Guadeloupe, Corsica, New Caledonia, and French Guiana. These six regions contain eight times as many supercentenarians per capita (1.3 per 100,000), and more supercentenarians overall (N=21), than the region of Île-de-France (0.16 per 100,000; N=19). This is despite Île-de-France receiving more than double the per capita income, being the longest-lived region in mainland France, and containing seven times as many citizens including huge numbers of (non-supercentenarian) Guadeloupe- and Martinique-born internal migrants[38].
Within these regions, Martinique has the second-highest poverty rate both overall (29%) and for people aged 75 and over (31%) in all the 101 NUTS-3 coded provinces. Of the ranked regions only Réunion has higher poverty rates, with 39% below the poverty line. Amongst the unranked regions, French Guyana has a 44% unemployment rate, with 44% of households are listed as ‘poor’, and Guadeloupe a 24% unemployment rate[39]. The two departments of Corsica have the highest and second-murder rate, and the third and fifth highest poverty rates, of the 101 French departments.
When compared across the 101 modern NUTS-3 coded regions of France, Guadeloupe and Martinique are both equal second for total supercentenarians after Paris (Table S2), with at least eight supercentenarians each. However, these two provinces contained only 190,000 French citizens each c.1900: by comparison the Paris department contained 2.7 million people in 1901. The two small, remote departments of Guadeloupe and Martinique produced the same number of supercentenarians as the Nord department, despite the latter containing ten times as many citizens in 1901.
When measuring supercentenarians per capita using modern population sizes, due to incomplete data, Martinique and Guadeloupe ranked second and third for supercentenarians per capita out of the 101 NUTS-3 regions of France. Creuse ranked as having the highest percentage of oldest-old individuals in France. Creuse also ranks as having the fourth-highest poverty rate in older people, the 16th-worst poverty rate overall, and the fourth-lowest GDP per capita (both raw numbers and PPS adjusted; Table S2) of the 101 regions of France. Per capita rates in Creuse may have also been impacted by massive out-migration, with the Creuse population falling by 60% since 1901.
Region-specific estimates of life expectancy were complicated by large-scale historical changes in province number, boundaries, and even nations. From 1900 the number of French regions fell from 88 to eighteen, with just twelve mainland regions. Outside the overseas regions and Corsica, whose boundaries are largely unchanged, few of the GRG supercentenarians could be located within the Savoyard-era department boundaries required to gain an accurate estimate of life expectancy at birth. Most records could only be resolved to the 12 modern administrative regions, which contain up to thirteen Napoleonic/Savoyard era provinces each, resulting in a wide range of possible life expectancies and socioeconomic data for each individual. Combined with the initially small number of supercentenarians in France, this coarse resolution excluded a more detailed analysis of the GRG data.
There were notable disparities in the abundance of the oldest-old across England (Table S3). For example, the region with the most 90+ year-olds per capita[34], West Sussex, had only five SSCs in a population of 460,000. In contrast, in Tower Hamlets, the region with the least 90+ year olds in the country[34], records fifteen SSCs amongst 289,000 residents.
As a result, Tower Hamlets has the most SSCs per capita in England. Tower Hamlets also has the highest poverty rate[40], highest child poverty rate[40], highest income inequality[40], the shortest disability-free life expectancy by ten to 15 years[40], and the worst index of multiple deprivation[34] of the 32 London boroughs. According to the Income Deprivation Affecting Older People Index (or IDOP), which captures the financial stress and deprivation of older people relative to the national average, Tower Hamlets is the single most income-deprived population of older people across all 317 local authority districts[34]. Tower Hamlets has the smallest percentage of people aged 90 and over[34]: a notable discrepancy for a region that, according to the IDL figures, has the largest percentage of SSCs.
Of the 131 regions included in analysis, the top 20 regions for SSCs per capita contained the 1st and 3rd-8th most income-deprived regions for older people in England (Table S3). As such, high rankings for income deprivation in older people predicts higher rates of achieving ages 105 and over. Out of 175 regions in England, for example, Southwark and Lewisham ranked as the eighth most income-deprived district for older people[34], second for SSCs per capita, and first for SSCs overall (Table S3). Central Manchester produced 18 SSCs overall (equal sixth) and ranks 14th for SSCs per capita yet is the third most income-deprived district for older people, and has the highest crime index, third-worst population health index, fourth-worst index of multiple deprivation, and sixth smallest percentage of 90+ year old people of any region (Table S3). Such rankings are not recent, with inner-city Central Manchester the second most persistently deprived of all 317 local-authority districts[34].These specific cases reflect broader patterns across England, where regions with the highest number and per capita rate of SSCs also have lower incomes, worse health, more deprivation, and higher crime rates (Fig 3).
Higher poverty is a particularly good predictor of higher survival to extreme ages. For example, a single-variable linear model using only (general or old-age) poverty as a predictor captures a substantial amount of the variation in centenarian, SSC, and supercentenarian abundance. Higher poverty and deprivation are positively correlated with all age categories of the oldest-old in every country measured (Fig 3; Fig 4a-f).
The IDOP ranking is an accurate indicator of the distribution of income deprivation in older people across England. When aggregated into NUTS-3 regions containing at least one SSC, IDOP scores are positively correlated with SSC abundance per capita (r =0.42; p = 0.0000009; Fig 4a). That is, higher income deprivation for residents over 60 predicts higher numbers of people surviving past age 105 (Fig 3b). This predictive accuracy improves markedly when predicting the number of SSCs per 90+ year old resident, rather than the number of SSCs per capita (r =0.70; p < 1x10-15; Fig 4d).
Like English data, the percentage of French supercentenarians in a population was predicted by higher rates of poverty in older people (r = 0.42; p= 0.0004; Fig 4b). Again, these associations were stronger when predicting the fraction of 90+ year-olds who lived over 110: across the 66 regions of France with sufficient data, the poverty rate over age 75 predicted half of the variation in supercentenarian abundance (r=0.51, p= 0.00001; Fig 4e).
Japan displayed similar anomalies in the regional distribution of extreme-age records. The first and second ranked Japanese prefectures for centenarians per capita, Shimane and Kochi, had the worst and second-worst regional economic rankings (Table S4), while extensive anomalies in third-ranked ‘Blue Zone’ of Okinawa are detailed in the supplementary materials (Supplementary Materials; Table S5). Japanese poverty rates in the general population were again positively correlated with attaining remarkable lifespans (r = 0.36; p = 0.01; Fig 4c), an interaction that strengthens when poverty rates are used to predict the fraction of 90+ year old people living past age 100 (r = 0.62; p = 4e-06; Fig 4f). The number of centenarians in Japan is also negatively correlated with income per capita (r = −0.44, p=0.001), the minimum wage (r = −0.64; p = 1e-09), and the Japanese financial strength index (r = −0.70; p = 3e-08) across all 47 Japanese prefectures. Prefectures that spend more money on old-age welfare per capita, a disincentive for welfare fraud, also produce fewer centenarians per capita (r = −0.49; p= 0.0004). These factors share latent drivers and are highly autocorrelated: prediction models for centenarians per capita based solely on poverty (R2 = 0.37; p= 0.0001; S2 Code) approach the accuracy of linear mixed models containing all available socioeconomic variables (R2 = 0.43; p = 3e-06) yet have a lower Akaike’s information Criteria (S2 Code).
Linking French historical life expectancy at birth[32] to some 143 French supercentenarians with sufficient data revealed that the cohort life expectancy at birth of each supercentenarian was not significantly different to the contemporary national average (Fig S4). That is, mainland French supercentenarians were not, on average, born into regions with either significantly longer or shorter than expected life expectancy. Given it is only known where these individuals where born, and not where they lived most of their lives, it seems notable that the modern economic conditions and poverty rate of a birth location should be predictive of becoming the ‘oldest-old’ (Fig 4) while life expectancy at birth is not (Fig S4).
Enrichment for specific days of the month in birth dates is seen as indicative of nonrandom age selection due to fraud, error, and clerical uncertainty. This check, however, is limited in that it cannot detect diverse sources of error, such as identity fraud or failed death registrations, which retain a representative distribution of birth dates.
Days of the month (e.g. the 1st or 2nd of each month) are nearly uniformly distributed throughout the year. As a result, US births had a near-uniform distribution of births, with minimal deviation from random sampling. Even after the widespread uptake of induced births, which avoid weekends and public holidays, birthdays generally varying by less than two per cent across different days of the month (Figure S5a), such that the distribution of births across days of the month is nearly invariant in the modern USA. In contrast, supercentenarians in the GRG database are 1.4-fold more likely to be born on the first day of the month (Figure S5b) and 1.2- fold as likely to be born on a day that is divisible by five. The number of supercentenarians born on the first day of the month is 150% higher than the previous day. Given the near-complete absence of caesarean sections in these population, this pattern may be explained if a large percentage of people have non-randomly chosen or manufactured birthdates.
These patterns are broadly reflected in the IDL birthdates, with days divisible by five, excepting the 25th, being over-represented amongst supercentenarians and SSCs (Fig S5b). In contrast the first day of the month was not over-represented amongst the IDL data (Fig S5c): this was initially difficult to reconcile, given these databases overlap considerably and largely represent the same individuals.
However, this differential appears to be a result of dates of birth being missing from 48% (N=797) of the IDL supercentenarians: despite extensive records being available, every US supercentenarian in the IDL has their day and month of birth removed. In addition, all Japanese records are absent. In the GRG data, most of the signal for over-enrichment arises in Japanese and US birthdates (Code S1-S2), with Japanese supercentenarians 2.77 times more likely and US supercentenarians 1.57 times more likely to be born on the first day of the month. In the primarily European countries in the remaining GRG database, individuals instead have a 0.67 odds ratio of being born on the first day: oddly, the inverse of the US and Japanese pattern. As a result, the omission of certain dates and records from the IDL data, for reasons that are unclear, masks the age heaping apparent in the USA and Japan.
Discussion
Basic economic and social indicators in the modern economy, such as GDP per capita and poverty rates, predict the distribution of extreme age records. Despite constraints on model construction and accuracy, such as unavoidable differences in per capita adjustments, these basic models approached reasonable accuracy. However, the direction of these interactions is the opposite of rational expectations.
Diverse social and economic indicators that normally predict worse health outcomes, such as income deprivation, poverty, and high unemployment, are all positively associated with a higher probability of reaching an extreme age. These factors are linked to a lower probability of survival and worse health outcomes at every age below 90, for every population included in this study.
However, these ‘anti-health’ factors exhibit a consistent positive association with extreme longevity. In England, which contains the only national data with sufficiently granular regional health measures, even poor health itself is positively associated with attaining a remarkable lifespan (Fig 3c). Across England and Italy, a larger number of people over the age of 90 is a significantly predictor of fewer people over age 105 (Fig 3e; Fig S3g-i): a difficult pattern to explain, given the younger population contains the older, unless errors are a primary factor.
Observed rates of vital registration in extreme-age and supercentenarian data are even more difficult to explain. For example, 96% of all 105+ year olds died since 1990, overwhelmingly in populations with over 95% rates of death certification. During validation, the intensive search for vital registration documents and the removal of unsupported cases should have enriched this death certification rates above the 95% baseline. It is therefore unclear why only 8% of all SSCs and 1.4% of all US supercentenarians recorded in the IDL have a death certificate unless these data are mostly errors. A similar pattern holds for birth certificates: even an intensive search could not exceed 15% coverage[41].
Finally, claims to remarkable regional longevity are often contra-indicated by substantial, but curiously uncited, studies. For example, the Centres for Disease Control generated an independent estimate of average longevity across the USA: they found that Loma Linda, a Blue Zone supposedly characterised by a ‘remarkable’ average lifespan 10 years above the national average, instead has an unremarkable average lifespan29 (27th-75th percentile; Fig S6). European Blue Zones also enjoy independent estimates. According to the collective governments of the European Union[42], the Sardinian and Ikarian Blue Zones occupy regions that are 36-44th and 56-65th in the EU for old-age (85+) longevity.
Viewed in isolation these anomalies may, perhaps, be explained away by reference to unknown lifestyle factors. However, these findings should be considered in the context of other diverse and incongruous patterns observed in extreme old age studies. These concerns are so extensive that discussion of even the broad outlines requires a critical review (Supplementary Materials).
The problems of extreme-age research may, however, be focused into two critical points. The first is observational. Extremely high-frequency errors are not impossible, improbable, or hypothetical[19]: instead, they have been routinely discovered after long evading the notice of demographers, often in the most intensively studied populations in the world. Any statement that such error rates could not happen[19], for example due to the vigilance or rigour of demographic validation, are therefore belied by historical fact.
In the USA, for example, 27% (‘white male’) to 66% (‘non-white female’) of Americans had multiple official ages in 1960, with 8%-30% misreported by more than a decade[20]. At least 54% of US centenarians were revealed as errors in 1979[43], and over half of all decedent African-Americans had multiple official ages in 1985[22]. In 2003, Stone checked 550 US supercentenarians recorded from 1980-1999 and, after an exhaustive search, just 43 of ‘supercentenarian’ records had a birth certificate (14.5%), and only 217 (40%) had a consistently-reported age [41].
Substantial error rates were recently uncovered in every ‘Blue Zone’. In 1997, thirty thousand Italian citizens were discovered to be claiming the pension whilst dead[44]. In 2008, 42% of Costa Rican 99+ year olds were revealed to have ‘mis-stated’ their age in the 2000 census[45] and, after limited error-correction, the Nicoya Blue Zone shrunk by around 90%[46] and old-age life expectancy plummeted from world-leading to ‘near the bottom of the pack’[47]. In 2010, over 230,000 Japanese centenarians were discovered to be missing, imaginary, clerical errors, or dead[48,49] -- an error rate of 82% in data then considered among the best in the world[7,50].
Greece followed in 2012, when at least 72% of Greek centenarians reported in the census were discovered to be dead or, depending on your perspective, committing pension fraud[51]. Finally, at least 17% of centenarians in the USA were discovered to be non-centenarians in 2019, not through intensive validation or qualitative interviews, but by reading two plain-text files and finding the dates did not match[14].
Only two of these cases were discovered, and all seem generally ignored, by professional demographers[14,45]. Some cases were seemingly not even noticed post-hoc: when most Greek centenarians were found to not exist in the aftermath of the global financial crisis, their non-existence was published in newspapers[51], but was not mentioned or cited in the demographic literature.
If equivalent rates of fake data were discovered in any other field – for example, if 82% of people in the UK Biobank or 17% of galaxies detected by the Hubble telescope were revealed to be imaginary – a major scandal would ensue. In demography, however, such revelations seem to barely merit citation.
It seems worth asking, therefore, why the recurrent discovery of high rates of non-random age errors have been ignored by the scientific community, given the fundamental importance of accurate age data to fields like medicine, gerontology, the social sciences, and epidemiology.
A typical response to discovering most data is illusory has not been to stop trusting data built on identical methods and documents, but to instead call for better validation and a closer examination of documents[52]. This raises the second critical point. The examination of documents, the only measure of human age in virtually all demographic research, cannot detect routine errors.
Again, this is not an assertion but a reality. Consider, for example, the 1878 Taché investigation[53,54] that pioneered testing centenarian status using validation ‘proved by authentic documents, examined with a rigorous scrutiny’: the unchanged state-of-the-art method. Of 421 claimed centenarians, 82 passed documentary validation, and only nine passed subsequent investigation, including Pierre Joubert: the world’s oldest man and first validated supercentenarian[53,54]. The case endured over a century of scrutiny, and was continually included in world records, before Joubert was found to have died at age 65. It written on his death certificate the entire time[53,55].
Extreme-age demographers seem to have misunderstood the central lesson of such events: not that errors can be detected though increased scrutiny, but that acceptance of such cases is dependent on the often-arbitrary generation, survival, and detection of documents. Without all three, completely fake cases can remain ‘validated’ indefinitely, regardless of whether they are true.
Demographers have confounded consistency with accuracy by assuming that consistent and valid documents are accurate, or at least strongly indicative of accuracy. This assumption is deeply unsound, especially given the nonlinear accumulation of error rates with age[15] and the rate at which errors have previously passed validation. Showing that documents are consistent and valid does not mean that they are accurate, or that any connection to their supposed owner is real.
To understand this problem, consider a room containing 100 real Italian centenarians, each holding complete and validated documents of their age. One random centenarian is then exchanged for a younger sibling, who is handed their siblings’ real and validated birth documents.
How could an independent observer discriminate this type I substitution from the 99 other real cases, using only documents as evidence? More problematically, how could that observer measure the frequency of such errors, and discriminate whether one, ten, or a hundred ‘Italian Siblings’ are in the room, given they each hold validated and consistent documents?
This simple class of error cannot be excluded based on documentary evidence: every document in the room is consistent, real, and validated. Any such type-I error therefore has the potential to indefinitely escape detection. Scrutinising documents, usually the only method open to demographers, provides no guarantee of accuracy.
Compounding this issue, any younger relative is also likely to have sufficient biographic knowledge to pass an interview (which almost never occur) with an accuracy indistinguishable from a plausibly-forgetful centenarian.
The ‘Italian Sibling’ though experiment illustrates just one reason that type I age-coding errors cannot be ruled out — or even necessarily measured — using documentary evidence alone. Instead, the experiment presents a substantial problem for remarkable-age databases, which can be embodied in a deliberately provocative, if seemingly absurd, hypothesis:
Every ‘supercentenarian’ is an accidental or intentional identity thief, who owns real and validated 110+ year-old documents, and is passably good at deception.
This hypothesis cannot be invalidated by the further scrutiny of documents, or by models calibrated using document-informed ages[56,57]. Rather, invalidating this hypothesis requires a fundamental shift: it requires the measurement of biological ages from fundamental physical properties, such as amino acid chirality[58] or isotopic decay [59].
The long-hidden flaws of document-based age measurement may, however, be starting to unravel. New biometric methods of measuring ages[60,61] are revealing huge discrepancies with document-reported ages. For example, measurements of age based on epigenetic data uniformly predict the oldest-old are 8-20 years younger than their paper records suggest[62–65]. Researchers have dismissed these estimates as the result of slower aging in centenarians, of mysterious aetiology, even when centenarians were measured to have undifferentiated or ‘non- significantly slower’[63] aging rates. It is hoped another explanation — the epigenetic ages are accurate, and the documents are not — may eventually be considered.
Until then, we can only hope that the development of new and better biometrics of age[59,60,66] may end our long reliance on documents, that we may finally stop ignoring the remarkable error rates observed in old-age demography.
Statements and Declarations
Funding
The author declares that no funds, grants, or other support were received during the preparation of this manuscript.
Author Contributions
SJN conceived, designed, analyzed, and wrote the study and supplementary critical review.
Competing Interests
The author has no relevant financial or non-financial interests to disclose.
Ethics approval
This study was performed in line with the principles of the Declaration of Helsinki constituting a reanalysis of published anonymized data from publicly accessible databases. No ethical approval was required.
Supplementary Figures and Tables
Supplementary Materials: Critical review
Issues raised in the analysis of extreme-age records suggest a remarkable situation: either that the anomalous pattern of extreme-age records can somehow be explained, or that extreme-age demography suffers extraordinary and systematic problems. Given the highly disruptive outcomes suggested by this analysis, the latter scenario requires some examination.
Indicators of systemic cultural problems in extreme-age demography
While individual experiences do not necessarily inform an entire research field, the author has independently uncovered multiple, recent, high-profile, and uncorrected issues that suggest deep-seated problems in the research culture of extreme-age demography. It is disturbing and noteworthy that that one individual can uncover broad-ranging problems, largely unaided, over such a short period.
First, a prominent Nature[1] paper on extreme ages rounded most of their old-age data down to zero[2] by accidentally using the wrong column in a life table (lx not qx) and treating zero as equal to one in a log transform. The authors then concluding that the rate of change in this data, which they had accidentally rounded off to 0 then turned into 1, was lower than the rate of change in unaffected data at younger ages[1]. Using the correct column of the same life table generates the opposite result[2] and contradicts the entire study. Despite these issues being revealed through open code and data, the paper remains unretracted and its many criticisms[2–7] ignored.
Barbi et al. then promoted an opposing view in Science[8] after picking the most empirically biased and worst-fit model possible from 861 alternatives[9,10]. Barbi et al. precipitated this remarkable coincidence by selecting ‘middle age’ as the ages 65-85 in their regression[9]. Fitting exactly the same regression to any other age range eliminates the claimed plateau, contradicts every research finding in the paper, and provides a far better fit of their own data[9]. The author highlighted these problems through a peer-reviewed, reproducible, and open analysis[9,10] and confronted the lead authors who explained their choices in a way that indicated research fraud. Again, however, the paper remains unretracted and uncorrected.
Further published explanations of late-life mortality patterns were then uncovered that are similarly problematic – such as work by Alvarez et al. that simply hides old-age data, but not the error bars, above the y-axis (see their figure 2) before eyeballing the remainder and claiming the data ‘levels off’[11]. Other research merely stretches credulity, such as the idea promoted by leading demographers that inbreeding is somehow good for survival[12,13], but some striking and unchallenged explanations push outright scientific racism.
Recently, for example, Vallin[14] claimed that extreme age records on Guadeloupe and Martinique are a positive outcome of the African slave trade’s “selection of the strongest individuals” through “the tremendous health selection effect of slavery”, and that such hypothetical effects are further enriched “due to low immigration and high fertility among black people compared to white”[14]. Vallin then proposes, without providing sources, that gaps in supercentenarian status between these two islands and Réunion exist because mortality rates from slavery was markedly lower in the Indian Ocean: a claim directly contradicted by contemporary and historical mortality estimates[15,16].
Even ignoring the neo-eugenics, such explanations make no historical or biological sense. Over 5.3 million people, 8% of the mainland French population, are descendants of the transatlantic slave trade[17,18]. This population is over ten times larger than Guadeloupe and Martinique combined, and includes hundreds of thousands of Guadeloupe- and Martinique-born internal migrants[19]. However, despite enjoying better healthcare, higher wages, lower poverty, and more complete vital registration records of mainland France, this larger slave-descendant population has apparently not produced a single supercentenarian. Given 16% of birth certificates in supercentenarian families are missing (Vallin Table 6) it seems the claim that slavery enriches for ‘robust’ individuals is just racist window-dressing of poor-quality data[14].
This view is unfortunately not an anomaly, but a continuation of openly propagated racism in extreme-age demography. A general argument that late-life mortality crossovers[20,21] develop when weaker individuals are ‘selected out’ at a young age[22], an argument that seems unlikely given high early-life mortality predicts higher late-life mortality[20], has long been co-opted for racist ideas. In the words of Markides and Machalek “While errors in the data are recognized as a possible explanatory factor, a consensus seems to have developed that the [old-age mortality] crossover is indeed real and results from higher early mortality that removes less hardy blacks”[23]: a casually racist idea that has enjoyed decades of support, often because the primary alternative is acknowledging residual undetected errors in demographic data.
The director of the GRG Robert Young typifies this thinking, with a desire to resurrect scientific racism through his “biological superiority hypothesis” developed at Georgia State University[24]. Young proposes that supercentenarians are more likely to be African-American because the GRG director can (apparently) see that African-Americans have ‘thicker skin than Caucasian-Americans’ which ‘protects their internal organs’ because “‘Black don’t crack’”[24]. His other explanations include that “greater selection pressures, and thus faster evolution” caused supercentenarian status in African-Americans because of the ‘pruning’ benefits of slavery (as with Vallin) and the forced breeding of slaves[24] because “often the most-healthy African American males were selected to be the ‘stud’ that would then be mated with the healthy females; less-healthy slaves would be discouraged from breeding”[24].
Young extends his racist theory to explain why the USA has supercentenarians and Africa does not. He first states that racial ‘pruning’ must be moderate to be effective, as “clearly, the Jewish population in Auschwitz had a very short life expectancy (akin to cutting a tree down, not pruning it)’[24]. He then uses this argument as an explanation for the lack of African-resident supercentenarians[24] by equating living standards in modern Africa to being Jewish in Auschwitz.
It is hard to adequately capture the full scale of reprehensible ideas proposed by this key Supercentenarian researcher, who has been dramatically promoted since publication of these ideas, or by other individuals who use scientific racism as an explanation for longevity patterns. It is equally hard to explain why these ideas have been propagated for over 15 years, loudly and in public, without any comment or push-back from the demographic community.
Young is responsible for compiling one of the two major supercentenarian databases as director of the GRG, contributes to the IDL, and assembles many of the Guinness World Records for longevity using his qualitative assessment of supercentenarian data. That his qualitative assessment includes staring at African-American supercentenarians to see if they have ‘thicker skin’[24] should ring alarm bells. In contrast, Young has been promoted to the top of the field after publishing such views.
Collectively, these issues suggest fundamental problems with the research culture of old-age demography. It is not that racist ideas and largely fake data are present in the literature, although these are extremely serious problems, but that such issues are met by a resounding absence of criticism or action. Excepting Dong et al.[1] none of the above dubious explanations, apparent misconduct, dubious models, or outright racism have been met with any perceptible response from the broader demographic community. That needs to change, rapidly, if we seek to understand the actual patterns of old-age human survival.
Indicators of fraud and error in population-wide data and ‘Blue Zones’
Indications of a problematic research culture and a high error rate in extreme-age demography can be evidenced at all scales: broad population patterns, population studies, individual case evaluations, and even the basic logical evaluation of data and evidence.
Emblematic of these systematic issues are the ‘Blue Zones’. Blue Zones (BZs) are five regions – in Okinawa, Costa Rica, California, Sardinia and Greek Ikaria – with claimed high concentrations of extreme age records[25–27]. The primary claim of BZ research is simple: that these regions have remarkable rates of achieving extreme old-age survival, or remarkable average survival in the case of Loma Linda and Costa Rica. In particular, it is routinely claimed these regions have unexpectedly high rates of reaching centenarian status[27].
The supposed enrichment for extreme-age survival in BZs is then subject to a host of secondary claims, each aimed at explaining the primary pattern of extraordinary longevity. Old-age survival in the BZs is supposed to result from diverse causes such as ‘moderate’ drinking at twice the NHS heavy-drinking guidelines[28], plant-based diets[25–27] and inbreeding[12,13]. Most prominent amongst these secondary claims is the idea that nine specific behaviours, the ‘power nine’[29], underpin the achievement of extreme longevity (see Table S5).
The BZ longevity claims have been made, very publicly, for decades and have seen widespread support in the general demographic community. However, evaluation of the central BZ claims to extreme longevity, and secondary claims of their supposed cause, reveal the general neglect of error processes as a potential generative factor in remarkable age records, and more broadly reveal a lack of critical appraisal of extreme-age claims.
Consider the first BZ, the one most amenable to measurement: extreme longevity in Okinawa. In Okinawa the central claim of remarkable survival to extreme ages was established in 2004, before 82% of Japanese records were debunked. Okinawa then had the highest number of centenarians per 90-99-year-old of any Japanese prefecture and was world-famous for remarkable longevity.
However, according to the statistics bureau of Japan, Okinawa also has the highest murder rate per capita, the worst over-65 dependency ratio, the second-lowest median income, the highest percentage (60%) of older people on welfare[30], and the highest unemployment rate of all 47 Japanese prefectures[31]. Despite prior claims of dietary benefits based on high vegetable[29], oily fish, and sweet potato consumption[26,27], Okinawa has the lowest per capita intake of sweet potato, fruits, vegetables, seafood, taro, shellfish, root vegetables, pickled vegetables, and oily fish such as sardines and yellowtail[32] of any prefecture.
Mortality rates in Okinawa ‘cross over’ after age 50, such that older individuals and cohorts have age-specific mortality rates far below the national average[33]: a pattern indicative of unreliable data and misreported ages[20,34]. Okinawa also has the second-highest per capita intake of beer and the highest per-capita intake of KFC, the highest child poverty rate at 36% (15% higher than the next-highest prefecture)[35], the most ‘flophouses’ and shotgun weddings[36], and according to USDA estimates Okinawan residents each consume an average 14 cans of SPAM per year[37]. Okinawa has the second-lowest minimum wage (by one yen)[31,36], the lowest household savings[31,36], the highest percentage of over-65s on income assistance[31,36], the highest poverty rate[35], and the worst average body mass index (BMI) of all 47 prefectures[36,38].
These rankings do not represent a recent sudden shift away from ‘traditional lifestyles’, and have not changed substantially for extended periods before, during, or after the BZ surveys[35]. For example, the body mass index of the over-75 residents of each prefecture has been continuously assessed since 1975, with the first measurement comparing residents born in 1900 or before to every other 75+ year old in Japan[38]. Every single year, over-75 Okinawans have retained the worst BMI [38]. Similar patterns hold across other nutritional indicators: in no case, at any point, have the massive population-representative surveys of Okinawa supported the claimed vegetarian dietary patterns pushed by BZ proponents[27,29,39].
These results become even more questionable given the capacity to directly assess each of the ‘power nine’ claims for BZ longevity[29] in Okinawa (Table S5). All nine claimed drivers of extreme longevity are assessable through data measured by the government of Japan. The ‘power nine’ claims are directly contradicted in every single case, usually through population-representative surveys of hundreds of thousands of people, with levels of inaccuracy that border on farce (Table S5).
If nothing else, these surveys reveal both astounding attention to detail by the Japanese government, and the scale of data that has been overlooked by BZ researchers. The older residents of Okinawa are not filled with purpose or “Ikigai” at remarkable rates: over-65 Okinawans have the 4th-highest suicide rate in Japan [40]. Older Okinawans do not “grow gardens”[29]: they self-report the lowest rate of gardening in the country, beating only the apartment-dominated Tokyo and Osaka megacities. Okinawans do not eat “Meat … only five times per month”[29] in 3-4oz. servings, which would total 5.1-6.8kg a year: they consume well over 40kg of meat a year[32,41] without including seafood. Nor do Okinawans overwhelmingly “belong to some faith-based community”[29]: they are 93.4% atheist[42], the most irreligious population in Japan, ranking third-last in the country for religious attendance[43].
What is most astounding about such claims is not that they have been used to generate profit by selling the fountain-of-youth repackaged for a modern wellness audience. Rather, it is that the demographic community, after hundreds of papers on the subject, does not seem to have read or cited the world-leading, comprehensive, population-representative surveys[35,38,42–45] that refute the BZ claims.
In Okinawa, the anomalous pattern of centenarians is even compounded by other error-generative processes. Japanese birth and marriage records are not generated by a central bureaucracy, but instead are hand-recorded by family members as ‘Koseki’ documents, which are then filed in local town halls and government offices. This combination of citizen self-reporting and government filing allows the propagation of errors without requiring fraud. On top of this self-report data, the large-scale bombing and invasion of Okinawa involved the destruction of entire cities and towns, obliterating around 90% of Koseki records[33] with almost universal losses outside of Miyako and the Yaeyama archipelago[46]. Post-war Okinawans subsequently requested replacement documents, described from memory[46], through a US-led military government that largely spoke no Japanese and which used a different calendar. As observed by Poulain[33] the number of replacement documents issued, a proxy measure of American bombing and shelling intensity, predicts 79% of the variation in centenarian status across Okinawa.
Despite the enrichment of regular error-generating processes through American firebombing, Okinawa is not an atypical case amongst the BZs. It is, rather, just the best-surveyed BZ region. Every proposed BZ displays patterns that suggest a dominant role of error, fraud, and (to phrase it generously) researcher degrees of freedom in explaining the distribution of extreme-age records.
As detailed in the manuscript, Sardinia, Ikaria, and Okinawa represent deprived regions of a rich high-welfare state. These BZ regions rank amongst the least educated, poorest, highest-crime and least healthy regions of their respective countries (Table S1-S2, S4). Therefore, it may be hypothesized that the relatively low incomes and high poverty rates of these populations generate age-reporting errors and pension fraud, and therefore remarkable age records.
The two remaining blue zones, Loma Linda and the Nicoya Peninsula, are considered exceptional due to their high average longevity rather than the presence of the oldest-old[25,26].
As such, these claims are not relevant to assessments of supercentenarian status, yet their analysis reinforces the evidence for broader problems in extreme longevity research.
Loma Linda is a Californian suburb containing just 23,000 people, designated as a BZ because of a supposed average lifespan of 86 years for females and 83 years for males. Even if taken at face value, this average lifespan is matched or exceeded by the 125 million citizens of Japan, the seven million citizens of Hong Kong, and the seven and a half million citizens of Singapore[47]. However, when assessed independently by the Centers for Disease Control (CDC) the five small-area survey tracts covering Loma Linda instead have an average life expectancy of 76 to 81 years[48]: the 27th to 75th percentiles of US life expectancy (Fig S6). This means, at best, the independent CDC estimates rank Loma Linda as the 16,101st most long-lived neighborhood in the USA (Fig. S6; S1 code). As such, it is again unclear why the lifespan of this community has been considered remarkable.
Like most BZs, Loma Linda is not a standard census tract or statistical area, but a custom-selected region within a larger geographic area: the largest US county of San Bernadino, which has an average lifespan of 78 years[48,49]. The Sardinian BZ was similarly delineated: by drawing circles on a map[12,25] and cut across the two standard, independently surveyed regions in Italy that have the lowest and sixth-lowest probability of survival to age 55[50]. The Nicoya peninsula in Costa Rica, where independent estimates are currently not available, is also a non-standard region cut from several independent census units of moderate life expectancy. Even worse, the goalposts have frequently moved on the BZ locations. For example, the Costa Rican BZ became two regions, and now one of the two Costa Rican BZs has now disappeared, and the other has shrunk by ∼90% in population, while a third another has apparently appeared 300km away near Nicaragua[51].
Given the lower life expectancy of their encompassing regions and the uncertain basis of their ascertainment, it remains somewhat questionable that these custom-selected and frequently redefined regions should be considered valid outliers for human longevity.
Indicators of error and fraud in health studies
Like anomalous population-scale patterns, indicators of poverty and fraud and contra-indications of health are regularly ignored or downplayed in targeted studies of extreme age. For example, smoking rates of e.g. 17-50% and illiteracy rates of 50-80% are often observed in samples of the oldest-old[52,53]. Surveying the BZ of Ikaria, Chrysohoou et al. observed that the Ikarian oldest-old (ages 80+) have a below-median income in over 95-98% of cases, moderate to high alcohol consumption (5.1-8.0 L/ year), a 10% illiteracy rate, an average 7.4 years of education, and a 99% rate of smoking in men[54].
Seemingly more moderate rates of smoking occur in the Tokyo study of exceptional longevity, where “few” centenarians smoked, some 15.4%, of the mostly (78%) female centenarian sample were current smokers[55]. However, according to the national statistics bureau of Japan, only 3.9% of Japanese women and 19.3% of men over the age of 80 are smokers[56]. Tokyo centenarians therefore smoke at around twice the rate that could be expected in a younger, 80+ year old cohort with an identical sex ratio. Likewise, 80% of the ‘exceptional’ health-status centenarian population were daily drinkers, followed by 49% of the ‘normal’ and less than 40% of the ‘frail or fragile’ centenarians, resulting in “a [significant] positive relationship between drinking habits and functional status”[55]. In contrast with these figures, Japanese government surveys estimate only 2.8% of women and 23% of men aged 80+ drink every day[56]. Daily drinking peaks at 36.7% in men aged 60-69, the heaviest-drinking cohort in Japan[56]. As such, Tokyo centenarians drink at higher rates than any other age group, and smoke at rates equal to a 45-year younger population[56].
In the USA only 8.4% of general population over the age of 65 currently smoke[57], and in Europe 4.1% population over the age of 75 currently smoke[58] and 31% formerly smoked: figures that continue to fall with age due to two-fold higher mortality rates in smokers[59,60]. However, in the US and Europe individuals over the age of 100 often smoke and drink at higher rates than any other age group: in one US centenarian study 60% of people over age of 95 were former smokers[61], compared to just 25% of individuals over the age of 65 in the broader USA.
This latter study is notable as it compares lifestyle factors in the oldest-old to earlier surveys of the same cohort. This comparison revealed that centenarians have similar or worse body mass index, rates of physical activity, smoking and alcohol consumption than the younger baseline population from which they were drawn[61]: a striking equivalence given the baseline population was 35 years younger. For these two samples of the same cohort to have a similar BMI, smoking, and drinking rates would require that these factors were somehow not risk factors for mortality across the intervening 35 years. Alternately, it is possible that smoking, obesity, and drinking cause mortality but that the decline in these factors with age is offset by rising vital statistics error rates with age[10].
Suitable alternatives explanations may be that these centenarians had worse health, lower exercise and much higher rates of drinking and smoking than the general population, or that they constitute a population of younger individuals with lifestyle patterns and smoking rates that reflect higher rates of fraud, illiteracy, or chronic government neglect.
While centenarian studies likely contain cleaner samples than supercentenarian populations, they often reveal broad assumptions on sample integrity. For example, the benchmark New England Centenarian study masks a widespread lack of birth certificates. Only 30% of the total sample had an official birth document discovered[62]. Of the 80 ‘centenarians’ who were reported to be alive in 1995, roughly 8% were an entire century younger than reported, thirteen (16%) had reported a false birthday, and eleven refused to enroll but were investigated anyway[63]. Even in this post-screening sample, just 45 of the 67 investigated centenarians had a birth certificate. This rate of possessing documents is, inexplicably, substantially lower than the background population, which achieved complete coverage for birth certificates in all New England states[64] by 1897. For the remaining individuals who lacked birth certificates, age validation was instead carried out using documents including “military certificates, an old passport, school report card, family bible, and baptismal or other church certificate”[63].
Missing birth certificates are also common in other samples of the oldest-old. For example, in the IDL database none of the 797 supercentenarians from the USA is listed as having a birth certificate[65] while 41% of US cases had the possibility of a birth certificate explicitly ruled out. Even an intensive search of 297 US supercentenarians by Stone[66] only managed, as an absolute best, 43 birth certificates (of which three were inaccurate, generating 13% coverage). This absence of birth registration data extends to the larger sample of supercentenarians and SSCs across the IDL data: only 19% of all supercentenarians, and 20% of the ‘exhaustively’ validated supercentenarians, have either an original or copy of a birth certificate[65] (Supplementary Code 1). Across all individuals in the IDL database, only 6.6% have an original birth certificate listed, and 74% of cases have no listed birth documents of any kind. These low rates of birth certification have not addressable in intensive follow-on searches: the most intensive search of vital registration data, conducted in US supercentenarians, discovered 43 birth certificates amongst 550 starting candidates[66].
Death certificates are also largely missing. The first follow-up survey of the population-representative NHANES I cohort, representative of the US population, found 3.8% of decedent men and 5.7% of decedent women did not have death certificates and remained alive on paper while actually dead[67]. However, for supercentenarians listed as dying in the USA under the IDL database, only seven are listed as having death certificates: some 98.7% do not have listed death certificates. In total only 15% of supercentenarians and 8% of SSCs in the total IDL database are listed as having death certificates[65].
Indicators of error and fraud in individual cases
The potential for bias during case validation is of marked importance, especially in the absence of such fundamental evidence as birth certification. However, individual case studies often highlight the potential for personal judgement during age validation. These problems extend beyond cases detailed in the manuscript, such as the century-long acceptance, and then rejection of the Taché investigation.
For example, the world’s oldest man, Jiroemon Kimura, is now widely considered to be a valid supercentenarian case. Evidence for case validation used primary source material collected and vetted by biological relatives of Kimura, who stood to gain from his validation[68]. As such, Jiroemon Kimura has at least three wedding dates to the same wife, was conscripted to the same military three times in four years despite the mandatory conscription period being three years long[69], has three dates of graduation from the same school, and has at least three birthdays[68]. In addition, for the first 20 years of his life all of Kimura’s birthdates and school records are actually those of ‘Kinjiro Miyake’, a name whose connection to Kimura is not attested to by an official document, but by an unofficial hand-written note from a Korean mail and telephone company[68]. Under interview, Kimura explained one of his extra birthdays in a way that was “not feasible”[68], and Gondo et al. concluded the birth date had been deliberately forged[68].
Gondo et al. then concluded the case validation by assuming any conflicting official records were mistakes and, amongst the diverse birth, wedding, conscription, and graduation dates, selected those they felt were accurate. The paper describing the world’s oldest man then lists a new birthday for Kimura as 1987, and his brother’s birthday as 1985: two undetected and uncorrected errors that rather undermine the claimed capacity of demographers to spot erroneous dates[68]. The study concludes that “no critical discordances were discovered” [68] and the case is still considered valid[70,71].
The validity of the Kimura case has been widely accepted under the assumption that age discrepancies can be discarded through the qualitative judgement of demographers. This reliance on qualitative judgement during case validation reflects general practice. For example, concerns surrounding the validation of ages are often met with a response that biographical inconsistencies, detected during interview by a demographer, will result in cases being removed from the record.
However, this sentiment can be difficult to reconcile with observed practice. For example, former smoker and occasional drinker Adele Dunlap, who “ate anything she wanted” and “never went out jogging or anything” was validated by the GRG and IDL as the oldest woman in the USA. This was despite Dunlap consistently maintaining under interview that she was a decade younger: when “asked how it felt to be 113, Dunlap… looked her questioner in the eye and answered: ‘I’m 104’”[72]. Despite consistently maintaining her age was incorrect, for years, Dunlap remains validated as a supercentenarian on the basis of documents contradicted by her own testimony.
Further indications of the role of opinion, especially when ignoring contra-indications of health, are highlighted by other prominent cases. When interviewing the then-oldest man and woman in the world – Christian Mortensen and Jeanne Calment – demographers from the Max Planck Institute for Demographic Research issued the contradictory statement that Jeanne Calment smoked both one and two cigarettes a day for an entire century[73], followed by the justification that Calment “possibly did not inhale at all”[73]. It was then observed that, from age 20 to age 117, Christian Mortensen smoked “mainly a pipe and later on cigars, but almost never cigarettes… he had also chewed tobacco…but never inhaled”[73]. Why two people might choose to smoke for an accumulated 190 years, but never inhale, was not questioned.
Similar opinions aimed at explaining the questionable health habits of the oldest old are widely considered satisfactory. A notable fraction of supercentenarians smoke and drinking, yet these anomalous health patterns are routinely downplayed. Of the five oldest men ever recorded, Kimura and Mortensen are detailed above, Emiliano Del Toro (3rd) smoked for 76 years, Mathew Beard (4th) was busted for drink-driving at age 90, and Walter Breuning (5th) smoked cigars until he was 108. The oldest man in the UK stated his secret to health as “cigarettes, whiskey and wild, wild women”[74], while the former oldest man in the USA smoked 12-18 cigars and drank alcohol every day, which routinely started with “a little bourbon in [his] coffee”[75]. At least three of the ten oldest women drank every day, two smoked every day, and four are of unknown smoking status, while Jeanne Calment smoked daily, drank daily, and ate around a kilogram of chocolate a week.
These instances of poor lifestyle choices are not rare but constitute a substantial fraction of all supercentenarian cases. As summarized by Coles, supercentenarians lifestyle are characterized by “heavy smoking, heavy drinking, or both, failure to exercise on a regular basis, and no conscious effort to eat nutritiously”[76]. Instead of prompting skepticism, under the relatively safe assumption that smoking, drinking, poverty, lack of exercise, poor nutrition, and illiteracy should not enrich for remarkable longevity records, these contra-indications of survival are routinely ignored or downplayed. For example, the study by Chrysohoou et al. concluded that “physical activity, dietary habits, smoking cessation, and midday naps” predict extreme longevity in Ikaria: a conclusion that questionably re-shapes past smoking status as a positive indicator of survival[54].
Finally, a troubling aspect of case validation is not restricted to the questionable, and largely irreproducible qualitative choices made by extreme-age demographers. Nor is it the repeat capacity to overlook extraordinarily high rates of later-invalidated cases, or the overt racism of individuals responsible for case validation[24]. Instead, the rather simpler challenge is that cases are almost never subject to even cursory investigation, let alone intensive interview. For example, the IDL now lists only 365 validations for 9,878 French SSCs, of which none have an original birth certificate [65]. It is unclear how many of these 365 individuals were interviewed, even under the irreproducible and qualitative-driven framework of a field that does not encourage pre-registered findings or the publishing of non-results.
Summary
Individually these issues raised here may be, in some cases, answerable. Yet in aggregate such patterns reveal a devastating reality. In extreme-age demography, not only are critical questions not answered: they are not asked.
The list is extensive. Why have population and lifestyle surveys of Japan, which cover over 94% of households, not been used to evaluate population and lifestyle patterns in Japan after twenty years? Why are a Nature paper that rounded its data off to zero, and a Science paper using the most empirically biased model possible, unretracted? Why do demographers trust a single metric of age – documents – when documents have routinely generated observed error rates above 50%, and exciting alternatives are being developed? How might documentary validation or interview methods ever reproducibly solve the ‘Italian sibling’ problem? Why are leading demographers allowed to state that African-American people live longer because they ‘have evolved 10% more genes’[24], or are more ‘robust’ due to ‘faster breeding’[14], whilst being promoted to senior roles in major institutes, without a solitary word of criticism?
Instead of being asked, such questions are ignored, basic critical analysis is derided, and fundamental problems written in reproducible code have no effect. Such responses may be effective in removing criticism and have resulted in the author leaving the field. But the future of demography is saddening indeed if it cannot bring itself to answer these questions alone.
I may leave demography with a final question. Half of all births in India, for example, have no birth certificate: these people are excluded de facto from age-stratified medical and epidemiological models, because we don’t know their age. How much better could our collective medical, demographic, and epidemiological models perform, if we replaced document-measured ages with accurate biometric measurements of age across every population that, like India, suffers from systematically unreliable or absent documents[77,78]?
If the ‘Italian sibling’ problem were taken seriously, and solved, we may find out. Given the reaction and conduct of the demographic community so far, however, hope for such an answer seems remote.
Acknowledgements
The author would like to acknowledge Dr Elena Racheva for advice and encouragement, Zoe Campbell and Prof Heather Booth for providing much-needed feedback and editing advice, Chris Mulligan for providing inspiration to use US birth data, Sally Morell and Dr Jim Docherty for interesting leads, Dr Gideon Meyerowitz-Katz for the final nudge toward publication, and an extraordinary cross-section of the scientific community for providing commentary and support.
Footnotes
An extensive critical review has been added, minor structural changes made to the layout of the paper, and a minor mistake in a supplementary figure has been corrected. In addition, a clear refutation of the Okinawan blue zone claims has been added.