Polygenic Adaptation has Impacted Multiple Anthropometric Traits

Jeremy J. Berg; Xinjun Zhang; Graham Coop

doi:10.1101/167551

Abstract

Most of our understanding of the genetic basis of human adaptation is biased toward loci of large phenotypic effect. Genome-wide association studies (GWAS) now enable the study of genetic adaptation in highly polygenic phenotypes. Here we test for polygenic adaptation among 187 worldwide human populations using polygenic scores constructed from GWAS of 34 complex traits. By comparing these polygenic scores to a null distribution under genetic drift, we identify strong signals of selection for a suite of anthropometric traits including height, infant head circumference (IHC), hip circumference (HIP) and waist-to-hip ratio (WHR), as well as type 2 diabetes (T2D). In addition to the known north-south gradient of polygenic height scores within Europe, we find that natural selection has contributed to a gradient of decreasing polygenic height scores from West to East across Eurasia, and that this gradient is consistent with selection on height in ancient populations who have contributed ancestry broadly across Eurasia. We find that the signal of selection on HIP can largely be explained as a correlated response to selection on height. However, our signals in IHC and WC/WHR cannot, suggesting a response to selection along multiple axes of body shape variation. Our observation that IHC, WC, and WHR polygenic scores follow a strong latitudinal cline in Western Eurasia supports the role of natural selection in establishing Bergmann’s Rule in humans, and is consistent with thermoregulatory adaptation in response to latitudinal temperature variation.

One Sentence Summary Natural selection has led to divergence in multiple quantitative traits in humans across Eurasian populations.

Main Text

Decades of research in anthropology have identified anthropometric traits that show potential evidence of biological adaptation to climatic conditions as humans spread around the world over the past hundred thousand years.^1,2,3 However, it can be challenging to rule out environmental,^4,5 as opposed to genetic variation, as the primary cause of these phenotypic differences.⁶ Even for phenotypes where there is some confidence that the phenotypic differences among populations are due in part to genetic differences, it is often hard to rule out genetic drift as an alternative explanation to selection.^7,8,9 The development of population-genetic methods and genomic data resources during the last few decades has enabled the interrogation of adaptive hypotheses and has produced an expanding list of examples of plausible human adaptations.^10,11 However, such approaches are often inherently limited to detecting adaptation in genetically simple traits via large allele frequency changes at a small number of loci, whereas many adaptations likely involve highly polygenic traits and so are undetectable by most approaches.^12,13 Genome-wide association studies (GWAS) have now identified thousands of loci underlying the genetic basis of many complex traits.^14,15,16 These studies offer an unprecedented opportunity to identify adaptation in recent human evolution by detecting subtle shifts in allele frequencies compounded over many GWAS loci.^{17,18,19,20,21,22,23}

We conducted a broad screen for evidence of directional selection on variants that contribute to 34 polygenic traits by studying the distribution of their allele frequencies in a dataset of 187 human populations (2158 individuals across 161 populations from the Human Origins Panel²⁴ and 2504 individuals across 26 populations of the 1000 Genomes phase 3 panel²⁵), making use of prior large-scale GWAS for these traits (see Table S1). We divided the genome into 1700 non-overlapping and approximately independent linkage blocks²⁶ and chose the SNP with the highest posterior probability of association within each block.^27,28 For each trait, we calculated a polygenic score for each population as a weighted sum of allele frequencies at each of these 1700 SNPs, with the GWAS effect sizes taken as the weights. Figure 1 shows the distribution of these scores for height across our population samples.

Figure 1: Polygenic height scores for 187 population samples (combined Human Origins Panel and 1000 Genomes datasets), plotted on geographic coordinates.

Blue corresponds to populations with the “tallest” polygenic height scores, and yellow the “shortest”.

These polygenic scores should not be viewed as phenotypic predictions across populations. For example, the Maasai and Biaka pygmy populations have similar polygenic scores despite having dramatic differences in height.²⁹ Discrepancies between polygenic scores and actual phenotypes may be expected to occur because of purely environmental influences on phenotype, as well as gene-by-gene and gene-by-environment interactions. We also expect that the accuracy of these scores when viewed as predictions should decay with genetic distance from Europe (where the GWAS were carried out), due to changes in the structure of linkage disequilibrium (LD) between causal variants and tag SNPs picked up in GWAS, and because GWAS are biased toward discovering intermediate frequency variants, which will explain more variance in the region they are mapped in than outside of it. These caveats notwithstanding, the distribution of polygenic scores across populations can still be informative about the history of natural selection on a given phenotype,¹⁸ and a number of striking patterns are visible in their distribution. For example, there is a strong gradient in polygenic height scores running from east to west across Eurasia (Figure 1).

To explore whether patterns observed in the polygenic scores were caused by natural selection, we tested whether the observed distribution of polygenic scores across populations could plausibly have been generated under a neutral model of genetic drift. To understand this null model, consider that a neutrally evolving allele is expected to be at the same frequency in a set of independently evolving sub-populations. However, due to genetic drift, sub-populations will deviate from this frequency, with the variance of the sub-population frequencies given by F_ST p(1 – p), where p is the ancestral allele frequency and F_ST is Wright’s “fixation index,”³⁰ which can be measured from genome-wide data.^17,31 Our polygenic scores sum the additive contributions of a large number of unlinked loci, which under our null model will experience genetic drift independently. Therefore, under a model of genetic drift, the polygenic score of each of a set of independent sub-populations will be normally distributed, with variance V_A F_ST (here, V_A is the additive genetic variance of polygenic scores in the ancestral population). Our test is based on a generalization of this simple relation in which we account for both variance and covariance among multiple populations that exhibit non-independence due to common descent, migration, and admixture over the history of human evolution. Specifically, we model the joint distribution of polygenic scores as multivariate normal and use a generalized variance statistic (Q_X) to measure the over-dispersion of polygenic scores relative to the neutral prediction, which is taken as evidence in favor of natural selection driving differences among populations in polygenic scores (see Methods and our previous study¹⁸ for details). Our approach is similar to classic tests of adaptation on phenotypes measured in common gardens, which rely on comparisons of the within- and among-population additive genetic variance for phenotypes and neutral markers, i.e. Q_ST/F_ST comparisons.^32,33,34 Importantly, the neutral distribution we derive holds independent of whether the loci truly influence the trait in an additive manner (with respect to each other or the environment), and whether the GWAS loci are truly causal or merely imperfect tags. However, population structure in the original GWAS panels can confound signals of polygenic adaptation.^18,20 Modern methods are generally considered to be effective at controlling for the effects of population structure,³⁵ and we proceed assuming that it has been adequately accounted for in the original GWAS panels.
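The single-population version of this null model can be checked numerically. The sketch below is only an illustration under stated assumptions: drift is approximated as Gaussian noise with variance F_ST p(1 – p) at each locus, the score is an unscaled weighted sum of allele frequencies, and the function names, the number of simulated sub-populations, and the seed are all hypothetical choices rather than anything taken from the study.

```python
import numpy as np

def expected_score_variance(effects, anc_freqs, f_st):
    """Neutral variance of a polygenic score among independent sub-populations.

    Assumes the score is sum_l alpha_l * p_l, so that the ancestral variance
    term is V_A = sum_l alpha_l^2 * p_l * (1 - p_l) (illustrative scaling).
    """
    effects = np.asarray(effects, dtype=float)
    anc_freqs = np.asarray(anc_freqs, dtype=float)
    v_a = np.sum(effects ** 2 * anc_freqs * (1.0 - anc_freqs))
    return f_st * v_a

def simulated_score_variance(effects, anc_freqs, f_st, n_pops=5000, seed=0):
    """Simulate drift at unlinked loci and return the variance of the scores.

    Drift is approximated as Gaussian (frequencies are not clipped to [0, 1]),
    which is an assumption that is reasonable only for small F_ST.
    """
    rng = np.random.default_rng(seed)
    effects = np.asarray(effects, dtype=float)
    anc_freqs = np.asarray(anc_freqs, dtype=float)
    drift_sd = np.sqrt(f_st * anc_freqs * (1.0 - anc_freqs))
    noise = rng.standard_normal((n_pops, anc_freqs.size))
    freqs = anc_freqs + drift_sd * noise      # one row per sub-population
    scores = freqs @ effects                  # weighted sum over loci
    return scores.var(ddof=1)
```

With many simulated sub-populations the two quantities should agree closely, which is the sense in which over-dispersion beyond V_A F_ST is informative.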

We applied our test to each of the 34 traits across all populations, as well as within nine restricted regional groupings (Figure 2 and Table S3). Using our test across all populations as a general test for the impact of selection anywhere in the dataset, we find 5 signals of selection after controlling for multiple testing (p < 0.05/34). The traits involved include height, infant head circumference (IHC), hip circumference (HIP), waist-hip ratio (WHR), and type 2 diabetes (T2D). Although the sixth-strongest signal, waist circumference (WC), failed to meet the multiple-testing correction, we include it in subsequent analyses due to its obvious connection to WHR. We also found signals of selection on polygenic scores constructed for waist and hip circumference and waist-hip ratio when adjusted for BMI (Table S3), but we focus on the unadjusted versions for ease of interpretation. We do not replicate a previously reported signal of selection on BMI within Europe, but also note that the previous study used many more SNPs than we have in constructing polygenic scores, which likely explains the difference.²⁰ In each case of significant over-dispersion, the signal represents a small but systematic shift in allele frequency of a few percent across many loci, which would be undetectable by standard population-genetic tests for selection (see Table S6), such that the majority of the variance in polygenic scores is within populations as opposed to among populations (see Table S4).

Figure 2: A heatmap showing the log10 p-values for the Q_X test statistic for over-dispersion of the polygenic scores for a trait among population samples.

The ‘All’ column gives the p-value in the combined Human Origins and 1000 Genomes dataset. Each subsequent column gives the p-value in each geographic sub-region. See S2 and S1 for the definitions of the regional groupings. MCV: Mean red blood cell volume; MCHC: Mean cell hemoglobin concentration; LSBMD: Lumbar spine bone mineral density; FNBMD: Femoral neck bone mineral density; PCV: Packed red blood cell volume; MPV: Mean platelet volume.

The predominantly European ascertainment of GWAS loci can lead to apparent deviations from neutrality. Therefore all p-values in Figure 2 and throughout the paper are derived from comparing test statistics against frequency-matched empirical controls, unless otherwise stated (see Text S1.3). This empirical matching is an important control. For example, the distribution of polygenic scores for schizophrenia shows a signal of over-dispersion under the naive null hypothesis, but not after controlling for the effects of ascertainment. More generally, the ascertainment and selection against disease phenotypes pose difficulties for the interpretation of tests of differentiation. Thus, although we see a signal of selection for decreased T2D polygenic scores in Europe, the interpretation of this signal likely requires the development of more explicit models of selection on disease traits (section S1.4).

The Geography of Selection on Height

In addition to the known signal of a gradient of increased polygenic height score in northern Europeans relative to southern Europeans (latitude correlation within Europe p = 6.3 × 10⁻⁶, see S2 and Methods for statistical details),^{17,18,19,20,36} we also find evidence that natural selection has impacted polygenic height scores well outside of modern Europe. Polygenic height scores decline sharply from west to east across Eurasia in a way that cannot be predicted by a neutral model (longitude correlation across Eurasia, p = 4.46 × 10⁻¹⁵; Figure 1), and they are overdispersed within each of our four population clusters (north, south/central, east, and west) across Asia, as well as among Native Americans (Figure 2). A natural question is whether this broadly Eurasian signal represents multiple independent episodes of selection on the genetic basis of height, or ancient selection on one or just a few populations, with modern signals across Eurasia reflecting variation in the extent to which modern populations derive ancestry from these ancient populations. For example, the signal of selection on height in East Asia is driven entirely by the Tu population sample (p = 0.4329 after they are removed), who have the highest polygenic height score among East Asian samples. Does this unusually high polygenic score reflect recent selection, or the fact that the Tu derive a proportion of their ancestry from an ~800-year-old admixture event involving a population resembling modern Europeans³⁷?

To test whether the height signal within Asia is due to a selective event shared with Europeans, we predicted the polygenic height scores across Asia given the deviation of European populations from the Asian mean and each Asian sample’s genome-wide relationship to the European samples (see Figure 3, and Methods for details). We find that these predictions conditioned on Europeans are sufficient to explain most of the divergence between the Tu and the other East Asian populations in our dataset (see sky blue dots in Figure 3), and eliminate the signal of selection among East Asian populations (p = 0.099 after conditioning). In fact, all signals of differential selection on height across Asia can be eliminated using these conditional predictions (p = 0.2019 after conditioning). This suggests that most of the selected divergence in our polygenic height scores across Eurasia can be attributed either to events which are predominantly ancestral to modern Europeans (but which have impacted other regions via admixture), or which lie along an early lineage which has contributed ancestry broadly across Eurasia.

Figure 3: Polygenic height scores in Asia are well-predicted by a model conditioned on European height scores, consistent with selection occurring in a shared ancestral population.

An individual population sample’s position along the x axis gives the genetic height score predicted on the basis of scores observed in Europe and their relatedness to the European samples, whereas their position along the y axis gives the true polygenic height score (see Methods for statistical details). The dashed line gives the one-to-one line along which all populations would fall if the predictions were perfectly accurate, whereas the vertical gray lines give population-specific 95% confidence intervals under genetic drift.

To further investigate the history of selection on height, we examined polygenic height scores in a set of ancient DNA samples from Western Eurasia^19,38,39 (for more detail see Text S1.5, Figure S9, and Figure S10). We find that the Eurasian selective gradient in genetic height scores was long established, with many ancient Western Eurasian population samples (particularly those with significant hunter-gatherer or Yamnaya steppe ancestry) having significantly greater polygenic height scores than modern East Asian populations in our dataset (e.g. Yamnaya-Samara vs Han Chinese pairwise p = 0.011). In fact, European hunter-gatherer samples appear to have significantly higher polygenic height scores than any modern European population (e.g. CEU vs Caucasus hunter-gatherers pairwise p = 0.017 and CEU vs Western European hunter-gatherers pairwise p = 0.007, see Text S1.5). Our results do not support Mathieson and colleagues’¹⁹ suggestion of selection for reduced height in Iberian Neolithic samples relative to Anatolian Neolithic (p = 0.90, with the Iberians actually having a higher score than the Anatolians, see also⁴⁰). Our results seem most consistent with the idea that there was selection for increased height in the history of the Yamnaya and hunter-gatherer populations, and that modern signals of divergence result from variation in the proportion of this ancestry that has been inherited across the continent.

Selection on Body Shape Polygenic Scores

As four out of the next five strongest signals beyond height also represent anthropometric traits, we focus the remainder of our efforts on these phenotypes. Due to genetic correlations between traits, it is possible that signals of selection on two (or more) distinct phenotypes actually represent only a single episode of selection, where one trait responds indirectly to selection on the correlated trait. Because the genetic correlation with height varies among these phenotypes (HIP: r = 0.39, IHC: r = 0.268, WC: r = 0.22, and WHR: r = –0.08),^41,42 we expect a priori that signals for more tightly correlated phenotypes are more likely due to a correlated response to selection on height, whereas, for example, the WHR signal is more likely to be independent.

To test whether the new signals we observe represent selective events distinguishable from the height signal, we developed a multi-trait extension to our null model based on the quantitative-genetic multivariate-selection model of Lande and Arnold⁴³ (see Methods and Supplementary Text Section S1.6). We condition on the observed polygenic height scores, and test whether the signal of selection on a second trait is still significant after accounting for a genetic correlation with height (a non-significant p-value is consistent with a correlated response to selection on height). Applying this test to our entire panel of populations, we find that conditioning on height ablates much of the signal for HIP (p = 0.0186) and WC (p = 0.0059), whereas signals in IHC (p = 1.11 × 10⁻⁵) and WHR (p = 3.57 × 10⁻⁸) are less affected. Restricting to European populations only, height is better able to explain HIP (p = 0.1152), WC (p = 0.0104), and IHC (p = 0.0051) signals, while the signal of selection on WHR remains strong even after conditioning on height (p = 1.92 × 10⁻⁸). WHR is genetically correlated with HIP (r = 0.316) and WC (r = 0.729), but not with IHC (r = 0.01).^41,42 Conditioning on WHR is sufficient to explain WC (global p = 0.1523, Europe p = 0.5178), but signals in HIP, IHC, and height are all independent of WHR (Table S4). Together, these results suggest that we can distinguish the action of natural selection along a minimum of two phenotypic dimensions (i.e. height and WHR, or unmeasured phenotypes closely correlated to them). The signal of selection observed for HIP is likely due to selection on height, and the WC signal is probably due to selection on a combination of height and WHR (or closely correlated phenotypes; we provide additional evidence for this claim in supplement section S1.6.2). Whereas IHC shows some evidence of being influenced by selection on height, a correlated response to height seems not to fully explain this signal.

Signals of divergence for both IHC and WHR polygenic scores are confined mostly to Europe and West Asia. For both traits the null model gives a significantly improved fit to the data when conditioned on Europe to explain West Asia, and similarly when conditioned on West Asia to explain Europe (Table S5). This suggests that, as is the case for Eurasian height scores, a substantial fraction of the divergence in IHC and WHR polygenic scores among modern populations across western Eurasia reflects divergence among ancient populations and subsequent mixture rather than recent selection.

Bergmann’s Rule and Thermoregulatory Adaptation

For both IHC and WHR, the selective signal in Western Eurasia can be captured in large part by strong, positive latitudinal clines (p = 3.16 × 10⁻¹⁵ for IHC and p = 3.16 × 10⁻⁷ for WHR; Figure 5). These clines in polygenic scores support independent phenotypic evidence for larger and wider bodies and rounder skulls at high latitudes,^{44,1,45,2,46,47,3} consistent with Bergmann’s Rule,^48,49 and add genetic support for a thermoregulatory hypothesis for morphological adaptation, whereby individuals in colder environments are thought to have adapted to improve heat conservation by decreasing their surface area to volume ratio.

A broad range of selective mechanisms have been proposed to act on height variation.⁵⁰ Because we do not detect any signal of selection on age at menarche, we think it unlikely that the height signal represents a correlated response due to life-history mediated selection on age at reproductive maturity.⁵¹ It has also been suggested that selection on height may be explained as a thermoregulatory adaptation.⁵⁰ However, because the surface area to volume ratio is approximately independent of height,^52,2 the effect of height SNPs on this ratio is mediated almost entirely through their effect on circumference (hip and/or waist; see section S1.8). Because the signal of selection on height cannot be explained by conditioning on hip and waist circumference, it seems that the thermoregulation hypothesis cannot fully explain the signal of selection on height.
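The geometric point about height and surface area to volume ratio can be illustrated with a deliberately crude sketch. Treating the body as a cylinder is an assumption made here purely for illustration (the authors' own treatment is in section S1.8), and the function name and the example dimensions are hypothetical.

```python
import math

def cylinder_sa_to_volume(height, radius, include_ends=False):
    """Surface-area-to-volume ratio of a cylinder (illustrative body model).

    With only the lateral surface counted, the ratio is 2 / radius and so
    does not depend on height at all; the end caps add a small 2 / height term.
    """
    lateral = 2.0 * math.pi * radius * height
    ends = 2.0 * math.pi * radius ** 2 if include_ends else 0.0
    volume = math.pi * radius ** 2 * height
    return (lateral + ends) / volume

# Changing height at fixed radius leaves the lateral ratio unchanged,
# whereas widening the cylinder lowers it.
tall = cylinder_sa_to_volume(1.9, 0.15)
short = cylinder_sa_to_volume(1.6, 0.15)
wide = cylinder_sa_to_volume(1.6, 0.18)
```

Under this simplification a variant can change the ratio only by changing the radius, which corresponds to the circumference measures discussed above.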

A second eco-geographic rule relevant to height is Allen’s rule,⁵³ which predicts relatively shorter limbs in colder environments, again consistent with adaptation on the basis of thermoregulation. In support of this, human populations in colder environments are observed to have proportionally shorter legs compared to those in warmer environments.^45,54 However, we detect no signal of selection on polygenic scores for the ratio of sitting to standing height (SHR), a measure of leg length relative to total body height.⁵⁵ Indeed, by combining our height SNPs with their effect on SHR, we find a strong signal that increases in both leg length and torso length underlie the selective signal on height from North to South within Europe, and from East to West across Eurasia (see S1.9). This again suggests that thermoregulatory concerns are unlikely to fully explain signals of selection for height.

Discussion

The study of polygenic adaptation provides new avenues for the study of human evolution, and promises a synthesis of physical anthropology and human genetics. Here, we provide the first population genetic evidence for selected divergence in height polygenic scores among Asian populations. We also provide evidence of selected divergence in IHC and WHR polygenic scores within Europe and to a lesser extent Asia, and show that both hip and waist circumference have likely been influenced by correlated selection on height and waist-hip ratio. Finally, signals of divergence among Asian populations can be explained in terms of differential relatedness to Europeans, which suggests that much of the divergence we detect predates the major demographic events in the history of modern Eurasian populations, and represents differential inheritance from ancient populations which had already diverged at the time of admixture. Note that because modern non-admixed east Asian populations only show significant evidence of divergence in pairwise comparisons to western populations (ancient or modern) that have been selected up in height (Figure S10), our results do not support a hypothesis of selection for decreased height in east Asia.

However, the fact that we cannot detect departures from neutrality outside of those associated with broad scale variation in European ancestry across Asia should not be taken as evidence that such events have not occurred, merely that if they exist, we cannot currently detect them using GWAS variants mapped in Europe. We should expect to be better-powered to detect selective events in populations more closely related to Europeans for two reasons. First, changes in the structure of linkage disequilibrium (LD) across populations should lead GWAS variants to tag causal variation best in populations genetically close to the European-ancestry GWAS panels.⁵⁶ Second, gene-by-environment and gene-by-gene interactions can lead to changes in the additive effects of individual loci among populations,⁵⁷ and therefore in the way that they respond to selection on the phenotype. We expect that these difficulties can be overcome or mitigated in the future through a combination of well-powered GWAS in multiple populations of non-European ancestry, access to a wider array of ancient DNA samples, and improved frameworks for the interpretation of signals of polygenic adaptation.²³

The existence of latitudinal trends in the polygenic scores for WHR and IHC supports the notion that some of the clinal phenotypic variation in body shape typically thought to represent thermoregulatory adaptation can be attributed to genetic variation driven by selection, while the ability of simple models to unify signals across broad geographic regions again suggests that these patterns could have been generated by a limited number of selective events. Evidence for adaptation on the basis of specific environmental pressures is most convincing when multiple populations independently converge on the same phenotype in the face of the same environmental pressure, a pattern for which we currently lack evidence. Therefore, while our evidence is consistent with adaptation to temperature environments, alternative explanations (e.g. adaptation to diet) are plausible.

1 Methods

1.1 Population Genetics Datasets

We downloaded the 1000 Genomes phase 3 release data from the 1000 Genomes ftp portal.²⁵ We also used data from the Human Origins fully public panel,²⁴ which was imputed using 1000 Genomes phase 3 as the reference on the Michigan imputation server,⁵⁸ restricting to SNPs with an imputation quality score (in terms of predicted r²) of 0.8 or greater (pers. comm. Joe Pickrell). The original genotype data can be downloaded from the Reich lab website. This combined dataset represents samples from 2504 people from 26 populations in the 1000 Genomes dataset and 2158 people across 161 populations from the Human Origins dataset, for a total of 4662 samples from 187 populations (S2). For global analyses we include all 187 populations. In regional analyses we exclude populations with significant recent (i.e. < 500 years) African/non-African admixture to avoid confounding admixture with signals of recent selection within regions (see S2 and S1 for the regions).

1.2 Selection of GWAS SNPs

We took public GWAS results for a set of traits²⁸ and combined them with additional anthropometric traits from the GIANT consortium and a subset of Early Growth phenotypes contributed by the EGG Consortium. Table S1 gives a full list of the traits included in this study and the relevant references. For each trait we selected a set of SNPs with which to construct our polygenic scores as follows. For each SNP, we calculated an approximate Bayes factor summarizing the evidence for association at that SNP via the method of Wakefield,⁵⁹ following Pickrell et al.²⁸ (see their supplementary note section 1.2.1). We then used a published set of 1700 non-overlapping linkage disequilibrium blocks²⁶ to divide the genome, after which we selected the single SNP with the strongest approximate Bayes factor in favor of association within each block to carry forward for analyses.
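The selection step described above can be sketched as follows. This is a minimal illustration, not the pipeline used in the study: it uses the standard closed form of a Wakefield-style approximate Bayes factor with a single normal prior on the effect size, and the prior variance `prior_var`, the `Snp` record, and the function names are assumptions introduced for the example.

```python
import math
from collections import namedtuple

# Hypothetical record for one GWAS summary-statistic row.
Snp = namedtuple("Snp", ["rsid", "block", "beta", "se"])

def log_abf(beta, se, prior_var=0.1):
    """Log approximate Bayes factor in favour of association.

    Uses log ABF = 0.5 * log(1 - r) + 0.5 * r * z^2, with r = W / (W + V),
    V = se^2 and W = prior_var (an assumed prior variance on the effect).
    """
    v = se ** 2
    r = prior_var / (prior_var + v)
    z = beta / se
    return 0.5 * math.log(1.0 - r) + 0.5 * r * z * z

def best_snp_per_block(snps, prior_var=0.1):
    """Keep, for each linkage block, the SNP with the largest log ABF."""
    best = {}
    for snp in snps:
        score = log_abf(snp.beta, snp.se, prior_var)
        if snp.block not in best or score > best[snp.block][0]:
            best[snp.block] = (score, snp)
    return {block: snp for block, (score, snp) in best.items()}
```

Choosing exactly one SNP per block, whether or not it reaches conventional significance, is what makes the retained loci approximately independent under the null.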

1.3 Polygenic Scores and Null Model

Given a set of L SNPs associated with a trait (L ≈ 1700), we construct the vector of polygenic scores across all M = 187 populations by taking the sum of allele frequencies across the L sites (the vector $\vec{p}_\ell$ at site ℓ), weighting each allele’s frequency by its effect on the trait (α_ℓ) to give

$$\vec{Z} = \sum_{\ell=1}^{L} \alpha_\ell \vec{p}_\ell. \tag{1}$$

For each trait, we construct a null model for the joint distribution of polygenic scores across populations, assuming

$$\vec{Z} \sim \mathrm{MVN}\left(\mu \vec{1},\; V_A \mathbf{F}\right), \tag{2}$$

where $\mu = \sum_{\ell} \alpha_\ell \bar{p}_\ell$ and $V_A = \sum_{\ell} \alpha_\ell^2 \bar{p}_\ell (1 - \bar{p}_\ell)$. Here p̅_ℓ is the mean allele frequency across all population samples (weighting all population samples equally), and F is the M × M population-level genetic covariance matrix.¹⁸ All polygenic scores are plotted in centered standardized form, $(\vec{Z} - \mu \vec{1}) / \sqrt{V_A}$.
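As a concrete illustration of this construction, the sketch below computes scores, their neutral mean and variance term, and the centered standardized form from a small frequency matrix. It assumes an unscaled weighted sum of allele frequencies (no ploidy factor), which fixes the scaling of the variance term used here; the function names and the toy inputs are hypothetical.

```python
import numpy as np

def polygenic_scores(freqs, effects):
    """Weighted sum of allele frequencies for each population.

    freqs:   (M populations x L SNPs) array of allele frequencies.
    effects: length-L array of GWAS effect sizes (the weights).
    """
    return np.asarray(freqs, dtype=float) @ np.asarray(effects, dtype=float)

def null_parameters(freqs, effects):
    """Mean and variance term of the scores under the neutral model.

    Populations are weighted equally when averaging allele frequencies.
    """
    freqs = np.asarray(freqs, dtype=float)
    effects = np.asarray(effects, dtype=float)
    p_bar = freqs.mean(axis=0)
    mu = effects @ p_bar
    v_a = np.sum(effects ** 2 * p_bar * (1.0 - p_bar))
    return mu, v_a

def standardize(scores, mu, v_a):
    """Centered standardized scores, as used for plotting."""
    return (np.asarray(scores, dtype=float) - mu) / np.sqrt(v_a)

# Toy example: three populations, two SNPs.
toy_freqs = np.array([[0.5, 0.2], [0.7, 0.4], [0.3, 0.6]])
toy_effects = np.array([1.0, -2.0])
toy_scores = polygenic_scores(toy_freqs, toy_effects)
toy_mu, toy_va = null_parameters(toy_freqs, toy_effects)
toy_standardized = standardize(toy_scores, toy_mu, toy_va)
```

Because the mean frequency weights populations equally, the standardized scores always average to zero across the population samples.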

We use the Mahalanobis distance of $\vec{Z}$ from its expectation under the null,

$$Q_X = \frac{(\vec{Z} - \mu \vec{1})^{T} \mathbf{F}^{-1} (\vec{Z} - \mu \vec{1})}{V_A}, \tag{3}$$

as a natural test statistic to assess the ability of the null model to explain the data (see Berg and Coop (2014)¹⁸ for an extended discussion). This test statistic should follow a χ² distribution with M – 1 degrees of freedom under neutrality. However, in practice we are concerned that the ascertainment of GWAS loci may invalidate our null model, so we compare the test statistic to an empirical null (see Section S1.3).
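A minimal sketch of this test is given below. It assumes that the scores have already been mean-centred and that the supplied relatedness matrix is positive definite (for example, after dropping one population to remove the degree of freedom lost to centering); both are simplifying assumptions. The helper names and the way the empirical null values are obtained are hypothetical.

```python
import numpy as np

def qx_statistic(centered_scores, f_matrix, v_a):
    """Mahalanobis-type distance of centred scores under covariance v_a * F."""
    chol = np.linalg.cholesky(np.asarray(f_matrix, dtype=float))
    # Whitening: solve L x = z, so that x'x = z' F^{-1} z.
    x = np.linalg.solve(chol, np.asarray(centered_scores, dtype=float))
    return float(x @ x) / v_a

def empirical_p_value(observed, null_values):
    """Share of null statistics at least as large as the observed one.

    null_values would come from score sets built from matched control SNPs;
    the +1 terms keep the estimate away from exactly zero.
    """
    null_values = np.asarray(null_values, dtype=float)
    exceed = np.sum(null_values >= observed)
    return float((1 + exceed) / (1 + null_values.size))
```

Comparing against an empirical null rather than the χ² reference keeps the test calibrated even if ascertainment distorts the theoretical distribution.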

1.4 Latitudinal and Longitudinal Correlations

We also test for selection-driven correlations between geographic variables (e.g. latitude) and a subset of our polygenic scores (see Berg and Coop (2014)¹⁸ and Section S1.1 for more details of the test). We take the standardized geographic variable and polygenic scores, and then rotate these vectors by the inverse Cholesky decomposition of the relatedness matrix F. These rotated vectors are in a reference frame where the populations represent independent contrasts under the neutral model. We take as our test statistic the covariance of these rotated vectors. We calculate the significance of the statistic by comparing to a null distribution generated by calculating null sets of polygenic scores assembled from resampled SNPs with derived frequency matched to the CEU population sample so as to mimic the effects of the GWAS ascertainment.
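The rotation step can be sketched as follows, assuming a positive-definite relatedness matrix. Reading "covariance" as the ordinary sample covariance of the rotated vectors, and using a two-sided comparison against the null statistics, are assumptions of this sketch, as are all function names.

```python
import numpy as np

def rotate(vector, f_matrix):
    """Standardize a vector and rotate it by the inverse Cholesky factor of F."""
    chol = np.linalg.cholesky(np.asarray(f_matrix, dtype=float))
    v = np.asarray(vector, dtype=float)
    v = (v - v.mean()) / v.std()
    # Solving L x = v applies L^{-1}; under neutrality the entries of x
    # behave as independent contrasts.
    return np.linalg.solve(chol, v)

def geo_covariance(scores, geo, f_matrix):
    """Sample covariance between rotated polygenic scores and a rotated
    geographic variable such as latitude."""
    rotated_scores = rotate(scores, f_matrix)
    rotated_geo = rotate(geo, f_matrix)
    return float(np.cov(rotated_scores, rotated_geo)[0, 1])

def empirical_two_sided_p(observed, null_stats):
    """Two-sided empirical p-value against statistics computed from
    frequency-matched resampled SNP sets."""
    null_stats = np.asarray(null_stats, dtype=float)
    exceed = np.sum(np.abs(null_stats) >= abs(observed))
    return float((1 + exceed) / (1 + null_stats.size))
```

In use, `geo_covariance` would be evaluated once on the observed scores and once per resampled score set, with the latter values passed as `null_stats`.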

1.5 Two-Trait Conditional Tests

Because some of the traits we examine are genetically correlated with one another, we were concerned that signals of selection observed for one trait might reflect a response to selection on another correlated trait. To determine whether genetic correlations might be responsible for some of our signals, we developed a multitrait extension to our neutral model that accounts for genetic covariance among traits. The extension is based on the framework of Lande and Arnold.⁴³

If $\vec{Z}_1$ and $\vec{Z}_2$ are vectors of polygenic scores for two different traits constructed according to equation (1), and the matrix $\mathbf{Z} = [\vec{Z}_1, \vec{Z}_2]$ contains these vectors as columns, then under neutrality the distribution of Z is approximately matrix normal,

$$\mathbf{Z} \sim \mathrm{MN}_{M \times 2}\left(\boldsymbol{\mu}, \mathbf{F}, \mathbf{G}\right), \tag{4}$$

where the matrix μ contains the trait-specific means, F gives the population covariance structure among rows as in the single trait model, and G is the among-trait additive genetic covariance matrix, the “G matrix” of multivariate quantitative genetics,⁴³ estimated for a population ancestral to all populations in the sample. The diagonal elements of the 2 × 2 G matrix are given by the V_A parameters from above in the single trait model (V_A,1 and V_A,2), and the off-diagonal element (C_A,12) corresponds to the additive genetic covariance between the two traits. Given this null model for the joint distribution of the two traits, we can construct a conditional model for the distribution of polygenic scores for trait 1, given the polygenic scores observed for trait 2, as

$$\vec{Z}_1 \mid \vec{Z}_2 \sim \mathrm{MVN}\left(\vec{\mu}_{1|2},\; V_{A,1|2} \mathbf{F}\right), \tag{5}$$

$$\vec{\mu}_{1|2} = \mu_1 \vec{1} + \frac{C_{A,12}}{V_{A,2}} \left(\vec{Z}_2 - \mu_2 \vec{1}\right), \tag{6}$$

$$V_{A,1|2} = V_{A,1} - \frac{C_{A,12}^2}{V_{A,2}}. \tag{7}$$

Given a value of C_A,12 we can then use these conditional means and variances in equation (3) to form a conditional Q_X statistic and compare it to its null distribution. We take the failure to reject neutrality on the basis of the conditional Q_X statistic as consistent with the hypothesis that any response to selection observed for trait 1 is a result of selection on trait 2. Some of the traits we study have non-linear allometric relationships with each other, but because our polygenic scores are linear by construction our tests are robust to this non-linearity (see S1.7).

We experimented with estimating C_A,12 on the basis of SNPs that overlap between the two traits in each genomic block. However, we were concerned that this approach to estimating genetic correlations would not provide a sufficient joint model for cases in which different SNPs within a block affect the two traits but are in linkage disequilibrium with one another, and therefore do not drift independently. To deal with this issue, we represent the genetic covariance among populations as

$$\mathrm{Cov}\left(\vec{Z}_1, \vec{Z}_2\right) = C_{A,12} \mathbf{F} = \rho \sqrt{V_{A,1} V_{A,2}}\; \mathbf{F}, \tag{8}$$

where ρ represents the genetic correlation between the two sets of polygenic scores. We pursued a conservative strategy, testing a range of values for ρ along a dense grid from -1 to 1 to ask whether any assumed genetic correlation between polygenic scores could plausibly allow one trait to be explained as a correlated response to another. As a further conservative measure, we set the genetic correlation used to calculate the conditional variance (Eq. (7)) to zero, while the ρ used to compute the conditional mean (Eq. (6)) was not. This is a conservative approach, as it fits our conditional prediction to the mean, but allows the variance of the null model to remain as large as in the unconditional model. The conditional two-trait p-values we present in the text, and the CIs shown in the two-trait Figure 4 and in the supplement, use this conservative approach. In practice our values of ρ are consistent with estimates of genetic correlations obtained from the LDscore approach,^41,42 given that our polygenic scores capture only a fraction of the total genetic variance for each trait.
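The conservative grid search described above might be sketched as follows. The sketch assumes mean-centred score vectors for both traits, a positive-definite relatedness matrix, and that the assumed correlation enters only the conditional mean while the variance stays at its unconditional value; reporting the most favourable ρ on the grid is one reading of the procedure, and all names are hypothetical.

```python
import numpy as np

def conditional_qx(z1, z2, f_matrix, v_a1, v_a2, rho):
    """Q_X for trait 1 after removing the part predicted from trait 2.

    z1, z2 are mean-centred score vectors. rho sets the assumed genetic
    correlation in the conditional mean only; the variance is left at v_a1.
    """
    z1 = np.asarray(z1, dtype=float)
    z2 = np.asarray(z2, dtype=float)
    c_a12 = rho * np.sqrt(v_a1 * v_a2)        # implied genetic covariance
    residual = z1 - (c_a12 / v_a2) * z2       # deviation from conditional mean
    chol = np.linalg.cholesky(np.asarray(f_matrix, dtype=float))
    x = np.linalg.solve(chol, residual)
    return float(x @ x) / v_a1

def most_favourable_rho(z1, z2, f_matrix, v_a1, v_a2, grid=None):
    """Scan a dense grid of rho values and return the one that minimizes
    the conditional statistic, together with that minimum."""
    if grid is None:
        grid = np.linspace(-1.0, 1.0, 201)
    values = [conditional_qx(z1, z2, f_matrix, v_a1, v_a2, r) for r in grid]
    best = int(np.argmin(values))
    return float(grid[best]), values[best]
```

If even the most favourable ρ leaves the statistic extreme relative to its null distribution, a correlated response to the conditioning trait is an unlikely explanation.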

Figure 4: The overdispersion of genetic HIP scores among populations can be explained as a correlated response to selection on height, but such an effect cannot explain the signal of selection on the WHR polygenic scores.

A) The observed polygenic HIP score (y axis) plotted against the height polygenic scores (x axis). We show only Western Eurasian population samples (blue dots: Europe; green dots: West Asia), as it is these samples that drive the majority of the signal. The line gives the best prediction for each sample’s polygenic HIP score according to the model of a correlated response to selection on height. Vertical lines give the 95% confidence interval of this prediction for each sample under this model. Most populations’ polygenic HIP scores lie within their confidence intervals, consistent with our failure to reject this conditional null model (main text). B) The same as A but now giving polygenic WHR scores rather than HIP. Note that for many populations the WHR scores lie outside of their 95% CI predictions based on genetic drift and correlated selection on height alone, consistent with the inability of this model to fully capture variation in polygenic WHR scores (main text).

Figure 5: Genetic IHC, WC, and WHR scores plotted against latitude for the Western Eurasian population samples. The points are colored East to West (blue to yellow).

1.6 Single Trait Conditional Null Model

We also developed an extension of the null model for a single trait to test whether two (or more) signals of selection detected in different geographic regions might reflect a single ancestral event that occurred in an ancient population that has contributed ancestry broadly to modern populations.

Assume for example that we have detected a signal of selection among the population samples from region A (e.g. Europe) and among the population samples from region B (e.g. Asia), and we would like to test whether the signal detected in region B is due to a selective event that is also responsible for generating a signal of selection in region A. We first reorganize our samples into two blocks for the two regions,

$$Z = \begin{pmatrix} Z_A \\ Z_B \end{pmatrix}, \qquad F = \begin{pmatrix} F_{A,A} & F_{A,B} \\ F_{B,A} & F_{B,B} \end{pmatrix},$$

and model both blocks under the single trait null with common mean μ_B, where μ_B is the mean polygenic score in the set of populations being tested, the F_•,•s refer to the sub-matrices of the relatedness matrix F, and F itself has been recentered at the mean of the test set (i.e. region B). Then the conditional distribution of polygenic scores in region B given the polygenic scores observed in region A is multivariate normal, with mean and covariance structure

$$\mu_{B|A} = \mu_B \mathbf{1} + F_{B,A} F_{A,A}^{-1} \left( Z_A - \mu_B \mathbf{1} \right), \qquad F_{B|A} = F_{B,B} - F_{B,A} F_{A,A}^{-1} F_{A,B}.$$

The conditional mean, μ_B|A, reflects the best predictions of population means in region B given the values observed in region A, whereas the conditional covariance matrix F_B|A reflects the scale and form of the variance around this expectation that arises from drift that is independent of drift in the ancestry of populations in region A.

We can then test for over-dispersion of polygenic scores in region B given the observed polygenic scores in region A by using μ_B|A and F_B|A in equation (3) to construct a conditional Q_X score. We judge the statistical significance of this conditional Q_X score by comparing it to frequency-matched datasets, as with the standard test. We interpret a non-significant conditional Q_X score for region B as evidence that any selective signal of overdispersion in B is well explained by genome-wide allele-sharing with A. We view this as evidence that the selection signal in B overlaps that in A, due to selection in shared ancestral populations and admixture.

In Figure 3 we plot the observed polygenic scores for Asia against the predicted polygenic scores for Asia (B), conditional on the Europe population sample polygenic scores (A). The error bars are 95% CIs for each population sample, obtained from the variances on the diagonal of V_A F_B|A.
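As a rough sketch of the conditioning step in this section, the conditional mean and covariance structure can be computed from the sub-matrices of F as below. The function names and toy inputs are hypothetical, F_A,A is assumed to be invertible, the intervals assume a normal approximation (1.96 standard deviations), and the scalar multiplying F_B|A in the variance (V_A in the text) is passed in explicitly rather than fixed.

```python
import numpy as np


def condition_on_region(z_a, mu_b, F, idx_a, idx_b):
    """Conditional mean and covariance structure for region B given region A.

    z_a          : polygenic scores observed in region A
    mu_b         : mean polygenic score in the test set (region B)
    F            : relatedness matrix, recentered at the mean of region B
    idx_a, idx_b : row/column indices of the two regions in F
    """
    F_aa = F[np.ix_(idx_a, idx_a)]
    F_ba = F[np.ix_(idx_b, idx_a)]
    F_bb = F[np.ix_(idx_b, idx_b)]
    dev_a = np.asarray(z_a, dtype=float) - mu_b
    mu_cond = mu_b + F_ba @ np.linalg.solve(F_aa, dev_a)
    # F is symmetric, so F_AB is the transpose of F_BA.
    F_cond = F_bb - F_ba @ np.linalg.solve(F_aa, F_ba.T)
    return mu_cond, F_cond


def interval_95(mu_cond, F_cond, scale):
    """Normal-approximation 95% interval from the diagonal of scale * F_cond."""
    half = 1.96 * np.sqrt(scale * np.diag(F_cond))
    return mu_cond - half, mu_cond + half


# Hypothetical toy inputs: one population in region A, two in region B.
F = np.array([[1.0, 0.5, 0.0],
              [0.5, 1.0, 0.0],
              [0.0, 0.0, 1.0]])
mu_cond, F_cond = condition_on_region([2.0], 0.0, F, [0], [1, 2])
lower, upper = interval_95(mu_cond, F_cond, scale=1.0)
```

Here the first region B population shares drift with region A, so its predicted score is pulled toward the region A value and its conditional variance shrinks, while the unrelated population keeps the mean μ_B and its full variance.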

Acknowledgements

We thank the Coop Lab, Doc Edge, Iain Mathieson, Emily Josephs, Joe Pickrell, Molly Przeworski, Jeff Ross-Ibarra, Guy Sella, and Tim Weaver for helpful discussions and feedback on earlier drafts. The work was supported in part by an NSF GRFP (to JJB), the UC Davis Anthropology department (XZ), and the National Institute of General Medical Sciences of the National Institutes of Health under award numbers R01 GM108779 and R01 GM115889.

References

Posted August 02, 2017.

[1] Roberts, D. F. Body weight, race and climate. American Journal of Physical Anthropology 11, 533-558 (1953).

[2] Ruff, C. B. Morphological adaptation to climate in modern and fossil hominids. Am. J. Phys. Anthropol. (1994).

[3] Savell, K. R. R., Auerbach, B. M. & Roseman, C. C. Constraint, natural selection, and the evolution of human body form. Proc. Natl. Acad. Sci. U.S.A. 113, 9492-9497 (2016).

[4] Bogin, B., Smith, P., Orden, A. B., Varela Silva, M. I. & Loucky, J. Rapid change in height and body proportions of Maya American children. Am. J. Hum. Biol. 14, 753-761 (2002).

[5] Serrat, M. A., King, D. & Lovejoy, C. O. Temperature regulates limb length in homeotherms by directly modulating cartilage growth. Proc. Natl. Acad. Sci. U.S.A. 105, 19348-19353 (2008).

[6] Pujol, B., Wilson, A., Ross, R. & Pannell, J. Are Q_ST-F_ST comparisons for natural populations meaningful? Molecular Ecology 17, 4782-4785 (2008).

[7] Rogers, A. R. & Harpending, H. C. Population structure and quantitative characters. Genetics 105, 985-1002 (1983).

[8] Relethford, J. H. Craniometric variation among modern human populations. American Journal of Physical Anthropology 95, 53-62 (1994).

[9] Relethford, J. H. Apportionment of global human genetic diversity based on craniometrics and skin color. American Journal of Physical Anthropology 118, 393-398 (2002).

[10] Tishkoff, S. Strength in small numbers. Science (2015).

[11] Fan, S., Hansen, M. E. B., Lo, Y. & Tishkoff, S. A. Going global by adapting local: A review of recent human adaptation. Science 354, 54-59 (2016).

[12] Pritchard, J. K. & Di Rienzo, A. Adaptation-not by sweeps alone. Nat Rev Genet (2010).

[13] Pritchard, J. K., Pickrell, J. K. & Coop, G. The genetics of human adaptation: hard sweeps, soft sweeps, and polygenic adaptation. Current biology (2010).

[14] Visscher, P. M., Brown, M. A. & McCarthy, M. I. Five years of GWAS discovery. Am. J. Hum. Genet. (2012).

[15] Price, A. L., Spencer, C. C. A. & Donnelly, P. Progress and promise in understanding the genetic basis of common diseases. Proc. R. Soc. B 282, 20151684-10 (2015).

[16] Boyle, E. A., Li, Y. I. & Pritchard, J. K. An expanded view of complex traits: from polygenic to omnigenic. Cell (2017).

[17] Turchin, M. C., Chiang, C. & Palmer, C. D. Evidence of widespread selection on standing variation in Europe at height-associated SNPs. Nature (2012).

[18] Berg, J. J. & Coop, G. A population genetic signal of polygenic adaptation. PLoS Genet (2014).

[19] Mathieson, I. et al. Genome-wide patterns of selection in 230 ancient Eurasians. Nature 528, 499-503 (2015).

[20] Robinson, M. R., Hemani, G. & Medina-Gomez, C. Population genetic differentiation of height and body mass index across Europe. Nature (2015).

[21] Hansen, M. E. B. et al. Shorter telomere length in Europeans than in Africans due to polygenetic adaptation. Hum. Mol. Genet. 25, 2324-2330 (2016).

[22] Field, Y. et al. Detection of human adaptation during the past 2000 years. Science 354, 760-764 (2016).

[23] Racimo, F., Berg, J. J. & Pickrell, J. K. Detecting polygenic adaptation in admixture graphs. bioRxiv 146043 (2017).

[24] Lazaridis, I. et al. Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature 513, 409-413 (2014).

[25] 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68 (2015).

[26] Berisa, T. & Pickrell, J. K. Approximately independent linkage disequilibrium blocks in human populations. Bioinformatics (2016).

[27] Pickrell, J. K. Joint analysis of functional genomic data and genome-wide association studies of 18 human traits. Am. J. Hum. Genet. (2014).

[28] Pickrell, J. K., Berisa, T., Liu, J. Z., Ségurel, L. & Tung, J. Y. Detection and interpretation of shared genetic influences on 42 human traits. Nature (2016).

[29] Martin, A. R. et al. Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations. Am. J. Hum. Genet. 100, 635-649 (2017).

[30] Wright, S. The genetical structure of populations. Ann Eugen 15, 323-354 (1951).

[31] Nicholson, G. et al. Assessing population differentiation and isolation from single-nucleotide polymorphism data. J. R. Stat. Soc. 64, 695-715 (2002).

[32] Prout, T. & Barker, J. S. F statistics in Drosophila buzzatii: selection, population size and inbreeding. Genetics 134, 369-375 (1993).

[33] Spitze, K. Population structure in Daphnia obtusa: quantitative genetic and allozymic variation. Genetics 135, 367-374 (1993).

[34] Ovaskainen, O., Karhunen, M., Zheng, C., Arias, J. M. C. & Merila, J. A New Method to Uncover Signatures of Divergent and Stabilizing Selection in Quantitative Traits. Genetics 189, 621-632 (2011).

[35] Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Gen. 47, 291-295 (2015).

[36] Zoledziewska, M., Sidore, C., Chiang, C. & Sanna, S. Height-reducing variants and selection for short stature in Sardinia. Nature (2015).

[37] Hellenthal, G. et al. A genetic atlas of human admixture history. Science 343, 747-751 (2014).

[38] Fu, Q. et al. The genetic history of Ice Age Europe. Nature (2016).

[39] Lazaridis, I. et al. Genomic insights into the origin of farming in the ancient near east. Nature 536, 419-424 (2016).

[40] Martiniano, R. et al. The population genomics of archaeological transition in west iberia. bioRxiv (2017).

[41] Bulik-Sullivan, B., Finucane, H. K., Anttila, V. & Gusev, A. An atlas of genetic correlations across human diseases and traits. Nature (2015).

[42] Zheng, J. et al. LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis. Bioinformatics 33, 272-279 (2017).

[43] Lande, R. & Arnold, S. J. The measurement of selection on correlated characters. Evolution 37, 1210-1226 (1983).

[44] Schreider, E. Geographical distribution of the body-weight/body-surface ratio. Nature 165, 286 (1950).

[45] Roberts, D. Climate and human variability (Addison-Wesley, 1973).

[46] Ruff, C. Variation in Human Body Size and Shape. Annu. Rev. Anthropol. 31, 211-232 (2002).

[47] Katz, D. C., Grote, M. N. & Weaver, T. D. A mixed model for the relationship between climate and human cranial form. Am. J. Phys. Anthropol. 160, 593-603 (2015).

[48] Bergmann, C. Über die Verhältnisse der Wärmeökonomie der Thiere zu ihrer Grösse. Göttinger Studien 3, 595-708 (1847).

[49] Mayr, E. Geographical character gradients and climatic adaptation. Evolution 10, 105-108 (1956). URL http://www.jstor.org/stable/2406103.

[50] Stulp, G. & Barrett, L. Evolutionary perspectives on human height variation. Biol Rev 91, 206-234 (2014).

[51] Stearns, S. C., Govindaraju, D. R., Ewbank, D. & Byars, S. G. Constraints on the coevolution of contemporary human males and females. Proceedings of the Royal Society of London B: Biological Sciences 279, 4836-4844 (2012). URL http://rspb.royalsocietypublishing.org/content/279/1748/4836.

[52] Ruff, C. B. Climate and body shape in hominid evolution. Journal of Human Evolution 21, 81-105 (1991).

[53] Allen, J. A. The Influence of Physical Conditions in the Genesis of Species. Radical Review 1, 108-140 (1877).

[54] Katzmarzyk, P. T. & Leonard, W. R. Climatic influences on human body size and proportions: ecological adaptations and secular trends. Am. J. Phys. Anthropol. 106, 483-503 (1998).

[55] Chan, Y. et al. Genome-wide Analysis of Body Proportion Classifies Height-Associated Variants by Mechanism of Action and Implicates Genes Important for Skeletal Development. Am. J. Hum. Genet. 96, 695-708 (2015).

[56] Palmer, C. & Pe’er, I. Statistical correction of the winner’s curse explains replication variability in quantitative trait genome-wide association studies. bioRxiv (2017).

[57] Brown, B. C. et al. Transethnic genetic-correlation estimates from summary statistics. The American Journal of Human Genetics 99, 76-88 (2016).

[58] Das, S. et al. Next-generation genotype imputation service and methods. Nature genetics 48, 1284-1287 (2016).

[59] Wakefield, J. Bayes factors for genome-wide association studies: comparison with P-values. Genet. Epidemiol. 33, 79-86 (2009).

[60] Perry, J. R. B. et al. Parent-of-origin-specific allelic associations among 106 genomic loci for age at menarche. Nature 514, 92-97 (2014).
OpenUrl CrossRef PubMed Web of Science

[61] Lambert, J. C., Ibrahim-Verbaas, C. A., Harold, D. & Naj, A. C. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nature (2013).

[62] van der Valk, R. J. P. et al. A novel common variant in DCST2 is associated with length in early life and height in adulthood. Hum. Mol. Genet. 24, 1155-1168 (2014).
OpenUrl PubMed

[63] Horikoshi, M., Yaghootkar, H. & Mook-Kanamori, D. O. New loci associated with birth weight identify genetic links between intrauterine growth and adult height and metabolism. Nature (2013).

[64] Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197-206 (2015).
OpenUrl CrossRef PubMed

[65] Schunkert, H., König, I. R., Kathiresan, S. & Reilly, M. P. Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nature (2011).

[66] Jostins, L. et al. Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature 491, 119-124 (2012).
OpenUrl CrossRef PubMed Web of Science

[67] Manning, A. K., Hivert, M. F., Scott, R. A. & Grimsby, J. L. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance. Nature (2012).

[68] Estrada, K., Styrkarsdottir, U., Evangelou, E. & Hsu, Y. H. Genome-wide meta-analysis identifies 56 bone mineral density loci and reveals 14 loci associated with risk of fracture. Nature (2012).

[69] van der Harst, P. et al. Seventy-five genetic loci influencing the human red blood cell. Nature 492, 369-375 (2012).
OpenUrl CrossRef PubMed Web of Science

[70] Teslovich, T. M. et al. Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466, 707-713 (2010).
OpenUrl CrossRef PubMed Web of Science

[71] Wood, A. R., Esko, T., Yang, J., Vedantam, S. & Pers, T. H. Defining the role of common variation in the genomic and biological architecture of adult human height. Nature (2014).

[72] Cousminer, D. L. et al. Genome-wide association and longitudinal analyses reveal genetic loci linking pubertal height growth, pubertal timing and childhood adiposity. Hum. Mol. Genet. 22, 2735-2747 (2013).
OpenUrl CrossRef PubMed Web of Science

[73] Shungin, D. et al. New genetic loci link adipose and insulin biology to body fat distribution. Nature 518, 187-196 (2015).
OpenUrl CrossRef PubMed Web of Science

[74] Taal, H. R. et al. Common variants at 12q15 and 12q24 are associated with infant head circumference. Nat Genet 44, 532-538 (2012).

[75] Gieger, C. et al. New gene functions in megakaryopoiesis and platelet formation. Nature 480, 201-208 (2011).

[76] Okada, Y. et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506, 376-381 (2014).

[77] Ripke, S. et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421-427 (2014).

[78] Chan, Y., Salem, R. M., Hsu, Y. & McMahon, G. Genome-wide analysis of body proportion classifies height-associated variants by mechanism of action and implicates genes important for skeletal …. Am. J. Hum. Genet. (2015).

[79] Morris, A. P., Voight, B. F., Teslovich, T. M. & Ferreira, T. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nature (2012).

[80] Pickrell, J. K. & Pritchard, J. K. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet (2012).

[81] Zhao, L., Lascoux, M., Overall, A. & Waxman, D. The characteristic trajectory of a fixing allele: A consequence of fictitious selection that arises from conditioning. Genetics (2013).

[82] Kremer, A. & Le Corre, V. Decoupling of differentiation between traits and their underlying genes in response to divergent selection. Heredity (Edinb) 108, 375-385 (2012).

[83] Le Corre, V. & Kremer, A. The genetic differentiation at quantitative trait loci under local adaptation. Mol. Ecol. (2012).

[84] Chan, Y., Lim, E. T., Sandholm, N. & Wang, S. R. An excess of risk-increasing low-frequency variants can be a signal of polygenic inheritance in complex diseases. Am. J. Hum. Genet. (2014).

[85] Huxley, J. Problems of Relative Growth (Methuen, London, 1932).

[86] Huxley, J. S. & Teissier, G. Terminology of relative growth. Nature 137, 780-781 (1936).

[87] Cheverud, J. M. Relationships among ontogenetic, static, and evolutionary allometry. American Journal of Physical Anthropology 59, 139-149 (1982). URL http://dx.doi.org/10.1002/ajpa.1330590204.

[88] Lande, R. Quantitative genetic analysis of multivariate evolution, applied to brain: body size allometry. Evolution 402-416 (1979).

[89] Rice, S. H. The evolution of canalization and the breaking of von Baer’s laws: Modeling the evolution of development with epistasis. Evolution 52, 647-656 (1998). URL http://www.jstor.org/stable/2411260.

[90] Nieuwboer, H. A., Pool, R., Dolan, C. V., Boomsma, D. I. & Nivard, M. G. GWIS: Genome-Wide Inferred Statistics for Functions of Multiple Phenotypes. Am. J. Hum. Genet. 99, 917-927 (2016). URL http://www.sciencedirect.com/science/article/pii/S0002929716303214.