Abstract
Spatial models show that genetic differentiation can be explained by factors ranging from geographic distance to environmental resistance across the landscape. However, genomes exhibit a landscape of differentiation, which could indicate that multiple factors better explain divergence in different portions of the genome. We test whether the best-predictors of intraspecific differentiation vary across the genome in ten bird species that co-occur in Sonoran and Chihuahuan deserts. Using population-level genomic data, we characterized the genomic landscapes across species and modeled five predictors that represented historical and contemporary mechanisms. The extent of genomic landscapes differed across the ten species, influenced by varying levels of population structuring and admixture between deserts. General dissimilarity matrix modeling indicated that the best-fit models differed from the whole genome and partitions along the genome. The most important predictors of genetic distance were environment and contemporary demography, which each explained 25–38% of observed variation, with paleoclimate and the position of the biogeographic barrier explaining 14–16%, and distance only explaining 9%. In particular, the genome was best explained by the biogeographic barrier in regions where the genome showed high fixation between populations. Similar levels of heterogeneity were observed among species and phenotypic divergence within species. These results illustrate that the genomic landscape of differentiation was influenced by alternative spatial factors operating on different portions of the genome.
Introduction
Levels of nucleotide diversity and the degree of differentiation both vary across genomes (e.g., Ellegren et al., 2012; Li and Ralph 2019). These so-called genomic landscapes are produced by a range of variable processes including ones intrinsic to the genome (meiotic recombination, mutation) and those extrinsic (introgression, selection, and drift). Fluctuating levels of genetic diversity across the genome have been shown to be associated with recombination rate indicating that linked selection reduces variation (e.g., Thom G, Moreira LR, Batista R, Gehara M, Aleixo A, Smith BT, unpublished data, https://www.biorxiv.org/content/10.1101/2021.12.01.470789v1). Likewise, speciation genes, mutation rates, and coalescent times are all known to cause variation in differentiation across the genome (Nosil and Schluter 2011,Benzer 1961; Hodgkinson and Eyre-Walker 2011). In contrast to intrinsic processes, extrinsic processes are mediated through interactions with the adaptive and demographic factors operating across space. Despite evidence of the patterns and processes driving a heterogeneous genomic landscape (e.g., Li and Ralph 2019, Wang et al., 2020), studies examining the spatial predictors of genetic differentiation often treat genomic data as homogeneous. Clarifying the relationship between the heterogeneity of the genomic landscape and spatial predictors of differentiation will elucidate how intraspecific variation arises in the complex physical landscape.
The spatial processes attributed to population differentiation operate over historical through contemporary time scales. For example, population history is often linked to Pleistocene glacial cycles that shifted and fragmented distributions over the last 2.6 million years. An association of genome-wide structuring linked to population fragmentation can be tested under isolation-by-history (IBH), where genetic distances are modeled against paleo-climatic suitability (Vasconcellos et al., 2019; Moreira et al., 2020). There are also atemporal manifestations of historical isolation, such as isolation-by-barrier (IBB; sensu Mayr 1942), which posits that population differentiation is best predicted by a landscape feature, for example a mountain range or river. Over shallower evolutionary scales, non-random mating with individuals in closer geographic proximity can cause genetic differentiation. Geographic distances alone may not be the best predictors of differentiation because adaptation to local climatic conditions causes selection to generate intraspecific differentiation across environmental gradients, which is known as isolation-by-environment (IBE; Wang and Bradburd 2014, Myers et al., 2019, Berg et al., 2015; Zamudio et al., 2016). Because local environmental conditions change rapidly, for example due to species turnover or succession (Phillips 1996, Nuvoloni et al., 2016), associations between differentiation and environment are likely more recent phenomena than historical associations. The increased availability of ecological data for many organisms, such as census data, allows for testing even shallower associations with genetic structuring across the landscape. Contemporary demographic data can be used to test isolation-by-abundance (IBA), where genetic differences are associated with abundance troughs that restrict gene flow (Barton and Hewitt 1981, Hewitt 1989, Barrowclough et al., 2005). Local population size is also known to be a strong driver of genetic structure, especially when compounded with environmental change determining local suitability (Weckworth et al., 2013). While the focus of these models is often on genetic variation, they can also be applied to phenotypic variation (e.g., Moreira et al., 2020). Phenotypic variation is often the product of many loci with little effect that are not always distinguishable from the genome itself. As such, looking directly at phenotype can help reveal whether a particular process is associated with trait variance. Examining the genomic landscape in the context of these alternative spatial models will provide evidence for how factors of varying temporal resolutions influence the peaks and valleys of differentiation. To investigate how landscape features impact genotype and phenotype, we use an exemplar community of co-distributed taxa across the Sonoran and Chihuahuan deserts of the southwestern USA and northern Mexico.
Here we characterize the genomic landscapes of birds occurring across the Sonoran and Chihuahuan deserts and test the relative effect of alternative spatial models in predicting patterns of intraspecific differentiation. To do this, we integrate population-level whole-genome resequencing, specimen-based morphometrics, and comparative sampling across ten co-distributed species that occur across the deserts. We hypothesize that the best-predictors of genetic diversity will vary across species and different partitions of the data, reflecting the multiple extrinsic factors that structure variation across the genomic landscape (Supplementary Figure 1). Alternatively, species could show homogeneous patterns either by the same spatial modeling predicting differentiation in windows across the whole genome or by species exhibiting congruent genomic landscapes shaped by the same geographic barrier. We further evaluate whether summary statistics, reflective of alternative evolutionary processes, could explain alternative spatial predictors of genomic landscapes. This comparative framework will provide resolution to the extent at which peaks and valleys of the genomic landscape correspond to historical through contemporary factors.
Results
Genomic results
We sequenced the genomes of 221 individuals across 10 focal species of passerine distributed in the Sonoran and Chihuahuan deserts. Based on the amounts of missing data, we created three datasets: a complete dataset, a dataset where up to 75% missing data was allowed, and a dataset where up to 50% missing data was allowed. We found that the three missing data partitions did not vary substantially with respect to coverage or number of SNPs. As such, here we describe the results for the complete dataset (for the 75% and 50% missing data partitions, see Supplementary Information). We recovered sequences with a mean coverage of 2.9 per individual (range 0.4–8.8), 6–25 million reads per individual, and 5–28 million SNPs per species. Mean coverage within species ranged from 2.1–4.2 Phainopepla nitens the lowest coverage and Melozone fusca having the highest. The average missing data per species ranged from 48–64%. Across individuals, missing data ranged from 13–93% with a mean of 53% (Table 1).
We estimated recombination rates using ReLERNN (Adrion et al., 2020). Mean recombination rates for the entire genome ranged from 8.8–10.6 x 10-10 c/bp (where c is the probability of a crossover) across species. Correlations between species in mean recombination across chromosomes range from −0.57 to 0.61 (mean±SD 0.01±0.29). Correlations in mean recombination at the same genomic positions ranged from −0.31 to 0.36 (mean±SD −0.03±0.17).
Lostruct outliers and FST outliers
We divided the genome into three kinds of partitions. First, we analyzed chromosomes independently. Second, we identified high FST outliers and analyzed those. Finally, we performed a multidimensional scaling (MSDS) analysis the using R package lostruct version 0.0.0.9000 (Li and Ralph 2019), which subdivided genomes into four partitions, three outliers (LS1, LS2, LS3) and one non-outlier partition (Figure 1; Supplementary Figure 2). Note that outlier groupings of the same color are not analogous across taxa. On average across all species 85.3% of labeled values were non-outliers, and ~4.88% each were LS1, LS2, and LS3.
We calculated FST values across the genome using ANGSD’s realSFS function (Meisner and Albrechtsen 2018). FST outlier analysis for our species across the datasets with complete, 75%, and 50% missing data found largely congruent results (see Supplementary Information for 75% and 50% datasets). The number of high FST outliers for the complete dataset ranged from 28–758 across species (with the total number of windows analyzed per species ranging from 100,733–113,555). The outlier lostruct partitions identified above (LS1, LS2, LS3) vary in the proportion of the FST outliers examined (for the complete dataset), ranging from 0.0%–3.4% (mean 0.2%) for peaks. Though not significant, there appears to be a trend where species with generally higher FST have more high FST outliers identified.
Population differentiation
Population differentiation across the Sonoran and Chihuahuan deserts was estimated using PCAngsd in ANGSD (Meisner and Albrechtsen 2018). Species ranged from being highly structured among deserts in four species (T. curvirostre, V. bellii, A. flaviceps, and P. melanura), showing a gradient of structuring with admixture in three (T. crissale, M. fusca, and Cardinalis sinuatus), or unstructured in the remaining taxa (A. bilineata, C. brunneicapillus, P. nitens; Supplementary Figure 3). FST values for the species within these three groups varied accordingly: highly structured=0.03–0.10; gradient=0.03–0.04; and unstructured=0.02–0.03. Population differentiation estimated from the chromosomal partitions were generally concordant with genome-level patterns, but smaller chromosomes and/or those with fewer SNPs showed different patterns (Figure 2, Figure 3, Supplementary Figure 4).
After estimating population differentiation, we calculated clines of population assignment across the range of each species, examining cline width and cline center. For cline-based analyses, mean cline width ranges from 6.94–15.89° longitude, where the total area encompassed by each species was ~18° longitude (with zero on the cline defined as 116.10°W longitude; Supplementary Table 3; Figure 2; Figure 3; Supplementary Figure 1). Cline width increases as chromosome size decreases (p=1.4×10-6, adjusted R2=0.06), though this varies across species (range p 7.7×10-7–0.43, range adjusted R2-0.01–0.51). Mean cline center location ranges from 3.58° along the cline (~112.52°W) to 12.70° along the cline (~103.4°W). We found that there were negative correlations between the degree of population structure (measured by FST; see Supplementary Information) and both mean cline width and the standard deviation of cline center locations, which is expected based on how clines are calculated. Species with higher FST between populations had narrower clines and less variation among partitions in the locations of their clines (Supplementary Figure 5).
Morphological variation across the Cochise Filter Barrier
Across the 10 focal species, we measured 294 individuals, including bill, wing, tail, and leg morphology. We collapsed these metrics into a principal components analysis. There were no clear, desert-specific patterns in variation across the Cochise Filter Barrier (N=234), with morphological changes ranging from subtle to significantly different. In our principal components analysis, the first three principal components (PC1, PC2, PC3) explained 74%, 12%, and 6% of the variation in morphology and corresponded approximately to overall body size, bill size/shape, and wing size/shape, respectively (Supplementary Table 4, Supplementary Table 5; Supplementary Figure 9). We found significant differences across the Cochise Filter Barrier in six species in at least one analysis (see Supplementary Information for more details). Between deserts T. crissale and C. sinuatus differed in body size and bill shape. Vireo bellii and M. fusca differed in bill shape. Polioptila melanura and A. flaviceps differed in body size. No species showed significant differences in wing shape.
Climatic suitability and abundance across the Cochise Filter Barrier
Using MaxEnt (Phillips et al., 2006), WorldClim (Hijmans et al., 2005), and other environmental variables (see Methods), we calculated ecological niche models for the present, the mid-Holocene, and the Last Glacial Maximum. During the Last Glacial Maximum, the most suitable areas for all taxa were projected to be further south than the most suitable areas during the present and mid-Holocene. Regions that are predicted to be suitable through all three periods are often reduced compared to current distributions (Supplementary Figure 8; Supplementary Figure 10). We calculated abundance for each species using the Breeding Bird Survey (Pardieck et al., 2019). Abundance was correlated with predicted climatic suitability across all taxa, with adjusted R2 values of fit lines (log-scaled) ranging from 0.42–0.62 (Figure 3, Supplementary Figure 6, Supplementary Figure 7).
Phenotypic and genotypic datasets are idiosyncratic with respect to landscape features
We chose five metrics of landscape variation (IBA, IBB, IBD, IBE, and IBH) to evaluate against genetic and phenotypic variation within taxa. Differences in variation were attributed to each of these landscape metrics using generalized dissimilarity matrix (GDM) modeling. We evaluated models that were univariate (variation ~ landscape metric), bivariate (variation ~ IBB + landscape metric), and trivariate (variation ~ IBB + IBD + landscape metric); we focus on univariate models. Performance of the GDM models was consistent whether looking at univariate, bivariate, or trivariate data partitions (see Supplementary Information). 2,945/3,030 univariate models converged successfully with an overall 98% convergence. Of the 505 datasets tested, 30.0% selected IBE as the best factor explaining variation, 21.3% selected IBB, 18.2% selected IBA, 14.2% selected IBD, 11.5% selected IBH, and the remainder were ambiguous, with multiple models equally explaining variation. Within the ambiguous models, of which there were 23, 82.6% had IBH as one of the best models, 73.9% had IBE as one of the best models, 56.5% had IBA as one of the best models, 23.1% had IBD as one of the best models, and notably, none of them had IBB as one of the best models.
Across all of the GDMs performed, percent deviance explained by the best model was variable, ranging from 0.1% to 81.9%. The mean±SD percent deviance explained across all datasets was 16.8%±18.2%. Percent deviance explained for the whole genome was lower on average, ranging from 0.5%–6.9% (mean±SD 3.9%±2.2%). FST outliers, both high and low, tended to have higher percent deviances explained, ranging from 0.14%–69.9% (mean±SD 25.9%±22.4%). Lostruct outliers ranged from 1.0%–54.25% (mean±SD 11.0%±12.6%). Percent deviance explained had the most extreme range in morphology, from 0.3% to 81.9% (mean±SD 17.5%±20.8%). The percent deviance explained varied across taxa, with means ranging from 7.5% (M. fusca) to 27.9% (P. nitens) and standard deviations ranging from 12.1%–24.0%.
For the models examining signals across the whole genomes, three species had IBB as the most important predictor, three had IBE, two had IBH, one had IBA, and one had IBD. (Figure 4; Supplementary Figure 11). It is notable that all of the genomes identified as having IBE as the best predictor are taxa that are structured across the Cochise Filter Barrier. Chromosome length does not significantly predict any differences between models (p>0.47, n=347).
For the lostruct partitions, the three outlier partitions (LS1, LS2, LS3) had 5/30 with IBA as the best model, 10/30 IBB, 3/30 IBD, 9/30 IBE, and 3/30 IBH. Most species showed at least some overlap in which model best explained partitions: for example, A. bilineata, T. crissale, and C. sinuatus all have at least two lostruct partitions best explained by IBB. For the non-outlier partitions (LS0, and the “empty” partition for V. bellii and A. flaviceps), these best model chosen is the same as the best model explaining whole-genome variation in four species (V. bellii, P. melanura, C. sinuatus, M. fusca) and that of one of the outlier partitions in all but two species (V. bellii, A. bilineata). Notably, for P. melanura IBE explains all three outlier partitions, the genome, and the non-outlier lostruct partitions. Likewise, for C. sinuatus, all of these are explained by IBB.
All ten species had high-FST partitions identified (see Supplementary Information for 75% and 50%) across the complete, 75%, and 50% datasets. The genome matched at least one of the high or low partitions in four taxa: Vireo bellii, A. bilineata, Toxostoma crissale, and Melozone fusca. With respect to significance, none of the FST outlier partitions were significantly different (but see Supplementary Information).
There was little congruence across the best landscape predictor of morphological data within species. Overall morphological differentiation had the same explanatory variables as PC3 for P. nitens (IBE), and as PC1 for C. sinuatus (IBD). Additionally, some individual PCs did match each other: IBD best explained PC1 and PC3 in C. brunneicapillus, IBA best explained PC2 and PC3 in T. crissale and PC1 and PC2 in A. flaviceps, and for T. curvirostre, PC2 and PC3 both showed ambiguous results. Neither overall morphology nor the PCs were significantly different than expected in the univariate dataset (though some were in the bivariate and trivariate datasets; see Supplementary Information).
Like overall variation, PC1 (body size) showed an even distribution between all models across the 10 species (i.e., 20% each IBA, IBB, IBD, IBE, and IBH). PC2 (bill shape) was best explained in 30% of species by IBA, 20% by IBE, 10% by IBD, and 40% of the species showed ambiguous results. Lastly, PC3 (wing shape) was best explained in 40% of species by IBD, 20% each by IBA and IBE, 10% IBD, and 10% of species had ambiguous results.
Data characteristics of best-fit models
We looked at whether differences in summary statistics could explain our univariate models (IBA, IBB, IBD, IBE, IBH) across taxa (Supplementary Figure 12; Supplementary Figure 13; Supplementary Figure 14). The summary statistics we examined were recombination rate, missing data, FST, DXY calculated using ngsTools (Fumagalli et al., 2014), and the length of the chromosome. The clearest pattern was that datasets with ambiguous results among models had more missing data than all others except IBH models (p<0.0001). IBH results also tended to have more missing data than most other models (p<0.02), but we found that this relationship was not significant when we excluded P. nitens, which had both the largest proportion of models explained by IBH and a high proportion of missing data (p>0.70). FST was significant overall (p<0.04), with IBB models having significantly higher FST than IBH models. This relationship was no longer significant in bivariate or trivariate models because IBB was not present (see Supplementary Information). DXY was also significant overall (p<0.05), but Tukey’s honestly significant difference tests showed that none of the individual comparisons were significant (p>0.06). Recombination rate and chromosome length were not significant for univariate models (p>0.07) though recombination rate was significant for bivariate and trivariate models (see Supplementary Information).
Landscape predictors are not influenced by habitat suitability
From the ENMs, we calculated habitat suitability for each species across the deserts. Species with more variable suitability across the contact zone have a higher proportion of IBH as the best model (adjusted-R2=0.54, n=10, p<0.01). As P. nitens has both the highest proportion of IBH and the highest variance in suitability, we removed this species in case it was acting as an outlier. After removing this species, the relationship was only nearly significant, but strong (adjusted-R2=0.28, n=9, p<0.09). Evaluating this relationship with ANOVA tests finds the same results, where no comparisons are significant without P. nitens.
Significance evaluation of hypotheses of evolution across the Cochise Filter Barrier
Species differ more than expected with respect to what spatial models best explain their genotypes and phenotypes. Best-predictors vary across individual species (χ2=284.0, p~0.0, df=54, simulated p<0.0005), individual partitions of genotype and phenotype differed (χ2=685.6, p~0.0, df=324, simulated p<0.0005), and with respect to phylogeographic structure across the Cochise Filter Barrier (χ2=62.9, p<6.5×10-9, df=12, simulated p<0.0005).
Discussion
We found that the best-fit spatial model differed across partitions at multiple scales. Our taxa, which varied in levels of genomic diversity, showed evidence that different spatial processes (reflecting historical through contemporary phenomena) had distinct impacts on the genome compared to targeted subsets of the genome. Similar patterns of heterogeneity were observed among species and with their phenotypic datasets. The disparity in predictors of intraspecific differentiation among the whole genome versus windows and between windows extends the view that evolutionary inferences are dependent on which portions of the genome are examined in a spatial framework. The heterogeneity in model fit across partitions was consistent with the expectation that various evolutionary processes contribute to the peaks and valleys of the genomic landscape. By applying this framework across an assemblage of birds that evolved across a common, dynamic region we showed that at the community-scale, predictors of genomic structure remain idiosyncratic, which may reflect taxa at different stages of the evolutionary histories and responses to the biogeographic barrier.
Extrinsic drivers of the genomic landscape
Our modeling showed that environmental distance was often a strong predictor of levels of intraspecific differentiation, but this pattern was species- and partition-dependent. Genome-wide patterns of differentiation across the Cochise Filter Barrier are partially shaped by environmental adaptation as observed in non-avian taxa distributed across the barrier (Myers et al., 2019). Environmental adaptation is often recovered in taxa who respond to environmental gradients via altered phenotypes (Branch et al., 2017, Dubec-Messier et al., 2018), genotypes (Berg et al., 2015, Manthey and Moyle 2015), or both (Ribeiro et al., 2019). However, our analyses show there was considerable variation among individual regions in the genome, indicating a more nuanced pattern. The species-specific results we found suggests that individual taxa had unique responses to shared aspects of the landscape. Although the focal taxa are co-distributed, we showed how environmental suitability, their general morphologies, and abundances across space varied among species, which may help explain why best-fit models differed. As such, these species-specific factors may explain isolation-by-environment was the best explanatory variable for many, but not all, of the species we investigated.
Individual partitions of the genome also varied with respect to how much environmental variation played a role. At one extreme, environmental variation appears to have little impact on the sex chromosomes. The Z chromosome often showed the barrier (i.e., IBB) as being the most important factor, even in unstructured species such as Amphispiza bilineata and Campylorhynchus brunneicapillus, perhaps because the locus evolves faster than sites under selection for adaptation to local environmental conditions. Sex chromosomes are known to diverge faster than autosomes due to their differences in effective population size (Mank et al., 2010), importance in sexual selection (Kirkpatrick 2017), and the presence of speciation genes (Sæther et al., 2007). Given the lack of evidence for environmental variation predicting spatial genetic differentiation on the Z chromosome, this would suggest that any speciation genes present in these taxa may not be involved in adaptation to the environment.
Environment was the most important driver for species with genetic structure. The most intuitive explanation for this was that population structuring in these taxa was facilitated by natural selection to different environments. There was some evidence that this could have happened across other taxa that occur across the Cochise Filter Barrier, as IBE was the best predictor of genome-wide divergence in a community of snakes distributed across the barrier (Myers et al., 2019). However, we must stress that while this explanation was the most intuitive and aligns with predictions, there are numerous processes that can produce IBE (Wang and Bradburd 2014), and it is possible that divergence led to adaptation to these environments secondarily, rather than the reverse, or the patterns are being influenced by some unknown factors that we did not quantify. Nevertheless, at present our results are consistent with the importance of ecologically mediated population differentiation, or isolation-by-environment, in structuring communities across the deserts of North America.
Contemporary versus historical predictors of genomic differentiation
Our finding that the best-fit models varied across species was consistent with the expectations that species idiosyncratically respond, over a range of time scales, to the Cochise Filter Barrier. The spatial patterns we examined vary temporally, with Pleistocene environmental changes being a historical process, while geographic distances, abundances, and environmental variation reflecting more contemporary processes. Historical signatures of Pleistocene isolation are commonly recovered patterns for the Cochise Filter Barrier (Provost et al., 2021) and other communities (Shafer et al., 2010; Ralson et al., 2021), but our data showed that isolation in glacial refugia often did not best explain genome-wide differentiation. This could be due to erosion of historical signals as the Cochise Filter Barrier filters taxa and changes contemporary patterns of gene flow. Alternatively, our proxy for IBH (resistance over projected Pleistocene habitat suitability) may be a poor model for actual historical isolation. For example, paleoenvironmental gradients may no longer be as readily detectable. The presence of the barrier alone was a better predictor despite being atemporal.
In contrast, current environments best explain three genomes and the majority of partitions for five species (with abundances and geographic distances playing a lesser role), suggesting that phenomena operating on more recent timescales influenced genetic and morphological variation across the landscape. If some of the taxa herein are going through incipient speciation, then these contemporary factors should be most potent. Our identification of species abundances as a relatively important predictor of genetic divergence aligns well with landscape genetic studies that use proxies for the effects of contemporary phenomenon and ecological factors on genetic variation (Burney and Brumfield 2009, Paz et al 2015). For example, urbanization, which fragments and reduces population sizes, is well known to impact rates of gene flow and drift, acting as a strong barrier of gene flow since the 20th century (Miles et al., 2019). Our use of available abundance data across large spatial scales shows a more direct relationship between varying abundances across the landscape with levels of differentiation. Further, while both historical and contemporary processes are influencing taxa across this biogeographic barrier, contemporary patterns are seemingly more influential.
Relationship between best-models and window summary-stats
In contrast to the extrinsic drivers of the genomic landscape that we have focused on here, there were no clear associations between partition characteristics and support for a particular model. For example, we found no significant differences in any species between recombination rate across chromosomes and which spatial models were most important on that chromosome. At the phylogeographic-scale, low recombination regions of the genome have been shown to be more likely to reflect population structure (Manthey et al., 2021) and the species tree topology (Thom G, Moreira LR, Batista R, Gehara M, Aleixo A, Smith BT, unpublished data, https://www.biorxiv.org/content/10.1101/2021.12.01.470789v1). The avian recombination rate landscape is thought to be conserved across taxa, even though exact genomic locations of divergence across taxa are not (Singhal et al., 2015, Turbek et al., 2021), with our ten focal species ranging in divergence time from ~75 thousand to ~12 million years between taxa (Harris et al., 2018; Kumar et al., 2017; Barker et al., 2015; Mason and Burns 2013; Price et al., 2014; Pasquet et al., 2014; Hooper and Price 2017; Mitchell et al., 2016; Gibb et al., 2015). Correlations in recombination rates at the same genomic position in these species are greater than 0.37 across chromosomes and always positive (Turbek et al., 2021). The ten desert birds we investigated, in contrast, have estimated divergence times ranging from ~10 to ~60 million years between taxa (Kumar et al., 2017; Barker et al., 2015; Mason and Burns 2013), with correlations in recombination rates at the same genomic position that were often smaller in magnitude and negative. This could reflect a real pattern, where the recombination landscapes are only conserved within more closely related species; our closest taxa, the two non-sister Toxostoma, do have the highest correlation in recombination rates across windows and are in the top 25% of the distribution in correlations. However, the differences found could have been caused by coverage depth, differences in the recombination rate estimators used, or missing data allowance. In addition, genetic partitions with higher FST were more likely to show isolation-by-barrier as the best model. These two metrics should be correlated; the former quantifies the degree of differentiation across the Cochise Filter Barrier, and the latter assigns individuals to their respective sides of the Cochise Filter Barrier. In species where there was differentiation, these two measures should describe the same phenomenon. This likely reflects the gradient in differentiation across species in the community. Given the wide variation across taxa, future work must be done to clarify the relationship between genomic architecture and evolutionary signal at multiple phylogenetic scales.
We explored the signal in our data by examining multiple ways of partitioning genomic windows, using different thresholds of missing data, and evaluating how data attributes influenced model support. We found that genetic partitions with more missing data were more likely to have ambiguous results. Genetic summary methods like PCA are impacted by missing data, particularly when they are imputed, which can cause individuals with disproportionately high levels of missing data to appear like they are admixed between populations (Yi and Latch 2021). It is likely that the reverse is true, where individuals with disproportionately low levels of missing data should fall out as their own populations more readily. For example, in some of our species (namely Vireo bellii, Auriparus flaviceps, Polioptila melanura) the individuals with highest missing data clustered as their own population before detecting any other spatial patterning. We ameliorated this by dropping individuals with too much missing data in some of our datasets. Overall, we did not find qualitative differences in population assignments, but it did generally inflate our fixation values and deflate our genetic diversity values. This is sensible, as reducing the number of individuals should both increase the likelihood of fixation due to sampling error as well as decrease the overall amount of nucleotide diversity.
Morphological versus genetic associations
We found that in most taxa, genotypic and phenotypic variation within species, and even different aspects of morphological phenotype within species, were not associated with the same landscape factors. Phenotypes were better explained by abundance, whereas genotypes were better explained by the contemporary environment. Discordance between genetic and phenotypic predictors of spatial variation have been observed in other systems, where phenotypic variation was better explained by the environment (Moreira et al., 2020). This discordance could be due to polygenic traits, where genotype-phenotype associations may be mediated by multiple loci of small effect working in concert, either by changing protein structure or regulation (Yusuf et al., 2020, Knief et al., 2017, Duntsch et al., 2020, Aguillon et al., 2021). However, for some phenotypes like plumage color, single genes of large effect have been implicated which should strengthen correlations between genotype and phenotype, at least for those loci (Sin et al., 2020; Toews et al., 2016). For desert birds in particular, phenotypic variation in metabolism (as well as in microbiomes) has been linked to genes that vary with the environment (Ribero et al., 2019). In our study, as with genetic differentiation, the extent of phenotypic structuring varied across species, with bill and body size being significantly different between deserts in a few taxa, but somewhat surprisingly, environmental variation did not usually explain morphological differences. For example, adaptations in bill morphology are frequently observed, such as in Song Sparrows on the Channel Islands that have higher bill surface area in hotter climates (Gamboa et al., 2021). The lack of a tight correlation between environment and phenotype were likely reflective of the shallowness of the evolutionary divergences and the subtlety of the environmental gradient across deserts. The two Toxostoma species in our study have previously shown contrasting patterns with respect to climate on beak morphology: T. crissale has larger bills in drier habitats, which may aid in cooling while conserving water, while T. curvirostre showed a pattern contrary to thermoregulatory predictions with larger bills in cooler climates (Probst et al., 2021), suggesting even in closely related species climate may not have the same role on morphological variation. Even though phenotypic data partitions often did not have the same explanatory factor with respect to the general dissimilarity modeling, there was a correlation between population structure in the genome (and chromosomes to a lesser extent) and phenotypic variation across these ten birds, in that taxa lacking morphological change also lack genetic variation overall.
Conclusion
By quantifying patterns in genotypic and phenotypic variation in communities distributed across a biogeographic barrier, we found that multiple co-occurring processes occur that impact variation within taxa. Although we found that isolation across an environmental gradient was among the most important associations in predicting genetic and phenotypic variation, the best-fit model varied across species and data partitions to reflect these multiple processes. These findings underscore the importance of accounting for heterogeneity in the genome, phenome, and diversification mechanisms acting across time and space to have the most comprehensive picture of spatial structuring in species. This will allow for an assessment of whether best-fit models that are proxies for neutral and adaptive processes are consistent with partitions that are evolving under the same conditions. Without a holistic understanding at each of these levels of organization, as well as the addition of future work that concurrently estimates selection at the organismal and the nucleotide levels, the actual mechanisms that shape communities will remain obscured. Further, while we did not find consistent predictors of phenotypic divergence, it is still an open question whether other measures of phenotypic variation (e.g., behavioral) may better track divergence, or phenotypic divergence does not follow a deterministic pattern along weak environmental gradients. Overall, this work displays the necessity of integrating spatial predictors of population divergence, differentiation across the genomic landscape, and phenotypic variation in understanding the multiple different mechanisms that have produced the population histories we see across contemporary communities of birds in North America.
Methods and Materials
Study system
The Sonoran and Chihuahuan deserts contain environmental and landscape variation that make them suitable for testing if any of the five discussed spatial models (IBA, IBB, IBD, IBE, and IBH) structure intraspecific variation in taxa. Across the two deserts and the transition zone between them, there is variation in precipitation, elevation, temperature, and vegetation that could result in local adaptation and isolation-by-environment. (Shreve, 1942; Reynolds et al., 2004). Pleistocene glacial cycles repeatedly separated and connected, such that some taxa experienced dramatic range shifts (Zink 2014, Smith et al., 2011), which could have isolated taxa in each desert. Further, there is a well-studied biogeographic barrier separating the deserts, the Cochise Filter Barrier, which is an environmental disjunction that demarcates the transition between the Sonoran and Chihuahuan deserts of southwestern USA and northern Mexico. The barrier is thought to have begun forming during the Oligo-Miocene and completed during the Plio-Pleistocene (Morafka, 1977, Van Devender, 1990; Van Devender et al., 1984, Holmgren et al., 2007, Spencer, 1996) and has formed a community ranging from highly differentiated taxa to unstructured populations (Provost et al., 2021). Demographic troughs caused by spatially varying population abundances could impact the frequency of gene flow across the landscape and the degree of genetic connectivity across the deserts.
Genetic sequencing and genome processing
We performed whole-genome-resequencing across 10 species of birds from the Sonoran and Chihuahuan deserts, obtaining genetic samples from new expeditions and loans from natural history museums (Cardinalis sinuatus; Toxostoma crissale, Toxostoma curvirostre; Amphispiza bilineata, Melozone fusca; Polioptila melanura; Phainopepla nitens; Auriparus flaviceps; Campylorhynchus brunneicapillus; Vireo bellii; Supplementary Table 6; Supplementary Figure 15). These species reflect different songbird morphotypes and ecologies in the deserts (e.g., large- to small-bodied, insectivorous to granivorous, migratory to resident). Three of these species (V. bellii, T. curvirostre, M. fusca) have shown evidence of structure across the Cochise Filter Barrier, while an additional three (P. melanura, A. flaviceps, C. brunneicapillus) have shown evidence of no structure (Zink et al., 2001; Rojas-Soto et al., 2007; Teutimez, 2012; Klicka et al., 2016, Smith et al., 2018).
Using 221 individuals across our 10 focal species, we sequenced 8–14 individuals in both the Sonoran and Chihuahuan deserts per species for a total of 18–25 samples per species. Library preparation and sequencing was performed by RAPiD Genomics (Gainesville, FL). We mapped raw reads of each species to their phylogenetic closest available reference genomes (Supplementary Table 7): notably, A. bilineata and M. fusca were mapped to the same genome, as were C. brunneicapillus, T. crissale, T. curvirostre, P. melanura, and P. nitens (see Supplementary Information). Before mapping, we created pseudo-chromosomal assemblies of these genomes using Satsuma version 3.1.0 (Grabherr et al., 2010) by aligning to the Taeniopygia guttata genome (GCF_000151805.1), retaining pseudo-chromosomes with the prefix “PseudoNC”. Hereafter, pseudo-chromosomes will be referred to as chromosomes.
We filtered our sequences with FastQ Screen version 0.14.0 (Wingett et al., 2018) to remove contamination by filtering out reads that mapped to PhiX and the following genomes: Homo sapiens, Escherichia coli, Enterobacteriophage lambda, and Rhodobacter sphaeroides. From our raw reads, we used a pipeline that produced genotype likelihoods using ANGSD version 0.929 (Korneliussen et al., 2014). We converted cleaned FastQ files to BAM using bwa version 0.7.15 (Li and Durbin 2009, Li and Durbin 2010) and picard version 2.18.7-SNAPSHOT from the GATK pipeline (McKenna et al., 2010, DePristo et al., 2011, Van der Auwera et al., 2013). Next, we prepared the BAM files to be used in the ANGSD pipeline using samtools version 1.9-37 (Li et al., 2009; Li 2011), bamUtil version 1.0.14 (Jun et al., 2015), and GATK version 3.8-1-0 (McKenna et al., 2010). This pipeline creates genotype likelihoods to account for uncertainty for low-coverage sequences.
We investigated the impact of missing data on our analyses using three thresholds for retaining sites: a complete dataset, in which all individuals were retained irrespective of missing data; a 75% dataset, in which individuals were only retained if they had less than 75% missing sites; and a 50% dataset, in which individuals were only retained if they had less than 50% missing sites. These different datasets were used for a suite of downstream analyses to assess the sensitivity of the results to individuals with missing data.
Evaluating population structure across the Cochise Filter Barrier
We characterized the degree of population structure across the whole genome and in individual chromosomes across the Cochise Filter Barrier in our focal species. First, using PCAngsd in ANGSD (Meisner and Albrechtsen 2018), which assigns individuals to K clusters and estimates admixture proportions for each individual. To evaluate whether there was structure across the Cochise Filter Barrier, we selected K=2 (though we visualized K values from two to three). We performed this for the complete, 75%, and 50% missing data datasets, but found that these values were largely congruent across the datasets, and so we only use the complete dataset for describing population structure (Supplementary Figure 16, Supplementary Figure 17, Supplementary Figure 18). Second, we plotted population assignment changes over space using a cline analysis via the hzar version 0.2-5 R package (Derryberry et al., 2014) and custom scripts (modified from Burbrink et al., 2021). Analyses were conducted in R version 3.6.1 (R Core Team 2019). We did this to quantitatively evaluate the differences in population structure across chromosomes and in the genome more broadly. We thus were able to calculate the location and width of clines for the entire genome and each chromosome.
Complementing our genome-wide analyses, we ran a local principal components analysis along the genome on the complete dataset using the R package lostruct version 0.0.0.9000 (Li and Ralph 2019). Different chromosomes showed different relationships between individuals (see Supplementary Information). Because of this, we wanted to cluster regions of the genome together that showed similar relationships between individuals in case specific evolutionary processes were causing this pattern. The lostruct method performs principal component analysis on individual windows of the genome, then uses multidimensional scaling (MSDS) to summarize how similar the windows’ principal component analyses are when dividing the genome. We extracted three subsets of outliers for each species, which we designated LS1, LS2, and LS3, and compared it to the remainder of the genome, representing non-outliers.
Genomic summary statistics
We characterized genetic variation across each species’ genome and partitions of the genome by calculating a suite of summary statistics and metrics. To quantify genetic differentiation within each species, we calculated pairwise genetic distances between individuals from VCF files using the bitwise.dist function in poppr R package version 2.9.2 (Kamvar et al., 2014; Kamvar et al., 2015), which served as the genetic distance matrices for our generalized dissimilarity matrix models (see below). The function bitwise.dist calculates the Hamming distance of the DNA (i.e., number of differences between two strings). We scaled this distance such that missing data was assumed to match sites without missing data, but final distances were scaled such that comparisons with more missing data would have inflated distances. Neighbor-joining trees were calculated from these matrices to contrast genealogies across the genome. Genealogies across the genome were visualized by calculating pairwise and normalized Robinson-Foulds (RF) distances between all pairs of trees per species (Robinson and Foulds 1981). Recombination rates (in crossovers per base pair, c/bp) across the genome were estimated using the program ReLERNN (Adrion et al., 2020). This program combines simulation with a recurrent neural network to estimate the recombination rate on each chromosome in 100,000 bp windows. We also performed a sliding window DXY analysis using the calcDxy R script included with ngsTools version 1.0.2 (Fumagalli et al., 2014), which gives site-wise DXY values, and then averaged across windows. Windows were overlapping with a size of 100,000 base pairs and offset by 10,000 base pairs. Missing data were calculated using vcftools (Danecek et al., 2011). This was calculated per window, per chromosome, per genome, per site, and per individual.
Using ANGSD’s realSFS function, we performed a sliding window FST analysis by converting SAF output from ANGSD to a site frequency spectrum for both desert populations in each species. Detailed settings can be found in the supplementary information. We performed FST outlier analysis for our species using the calculated FST values. Z-scores for FST for each species were calculated using the formula ZFST=(observedFST-meanFST)/SDFST. We split the genome into two different partitions based on these z-scores: FST peaks, for values of FST greater than five standard deviations above the mean (z-score>5) and FST troughs for values of FST greater than five standard deviations below the mean (z-score<-5). We only report the FST peaks in the main manuscript: for FST troughs, see the supplementary information. We performed this outlier detection for the complete, 75%, and 50% missing datasets.
Morphological data
We quantified morphological variation in our 10 focal species to assess which of the spatial models best explain morphological variation across the landscape (see Generalized Dissimilarity Matrix Models). We measured 366 specimens (19–59 per species), excluding known females and known juveniles to account for any variation attributed to sex and age. Of those, 29 were also present in the genomic dataset, with 0–8 individuals per species.
We generated seven raw plus seven compound morphological measurements, which we designated as proxies for thermoregulation and dispersal, respectively (see Supplementary Information). We reduced the dimensionality of the 14 morphological measurements using a principal components analysis (PCA). We then calculated four distance matrices between individuals: one Euclidean distance matrix for all morphological variables, where we calculated the euclidean distance between individuals among all raw and calculated measurements; and three euclidean distance matrices for the first three principal components, PC1, PC2, and PC3. We assessed whether there were differences in morphological PCA space between the Sonoran and Chihuahuan Desert populations in each species using DABEST tests in the dabestr package version 0.3.0 (Figure 5; Supplementary Figure 19; Supplementary Figure 20; Ho et al., 2019). Note that this method does not give explicit significance values, instead it shows whether expected confidence intervals overlap zero (i.e., no difference between deserts) or not.
Isolation across the landscape at different temporal resolutions
We calculated IBD matrices by calculating the euclidean geographic distance between the latitude/longitude pair of each specimen in R. We used the WGS84 projection for all data. These variables were somewhat correlated with one another, though less so after accounting for geographic distance (Supplementary Figure 21).
To produce data for the IBH model, we calculated environmental resistances in the Last Glacial Maximum (LGM; ~21,000 years ago) for each species. To do this, we created ecological niche models (ENMs) using 19 layers representing contemporary climate (WorldClim; Hijmans et al., 2005) at a resolution of 2.5 arcminutes. We used MaxEnt (Phillips et al., 2006), with ENMeval version 0.3.1 as a wrapper function for model selection (Muscarella et al., 2014). ENMeval optimizes MaxEnt models based on different sets of feature classes and regularization values (see Supplementary Information). The contemporary ENMs (see IBE section below) were then backprojected to the LGM using WorldClim paleoclimate data (Hijmans et al., 2005). We also backprojected to the Mid-Holocene, but contemporary and Mid-Holocene ENMs were highly correlated, so we excluded the Mid-Holocene values from downstream analyses. We then scaled the LGM suitability values to range between 0–1 and calculated resistances across the environment using the least cost path distance method in ResistanceGA version 4.0–14 (Peterman et al., 2014, Peterman 2018). Regions of high resistance are predicted to reflect poor habitat and be costly to traverse through. The ENMs were thresholded to equal sensitivity-specificity values for visualization (Supplementary Figure 22).
We approximated IBB by assigning individuals based on their location relative to the Cochise Filter Barrier (see Supplementary Information). For proximity to the Cochise Filter Barrier, we assigned individuals to either Sonoran or Chihuahuan populations either based on the results of the K=2 clustering analysis, if there was structure across longitudes, or according to a cutoff of longitude if there was no structure. We chose 108 °W longitude as our cut off— individuals west of this point were deemed Sonoran, and individuals east of this point were deemed Chihuahuan (but see Provost et al., 2021). In some cases, species with genetic breaks had some uncertainty due to unsampled areas or admixed individuals—we labeled these individuals as being unclear with respect to their desert assignment. Georeferencing on some morphological specimens was poor, but all except two specimens (see Results) were identified at least to county level if not to a specific locality. When localities were given, we georeferenced the specimens to the nearest latitude/longitude. Otherwise, we assigned individuals to the centroid of their state or county.
We independently tested IBE by using two datasets: contemporary environmental distance and resistance. For the environmental distances, we used the 19 WorldClim bioclimatic layers (see IBH section). For the latitude/longitude location of each specimen used in both the morphological and genomic analysis, we extracted the values on those WorldClim layers and then calculated the euclidean distances in environmental space between specimens. This gave us an estimate of how different the environments were at each specimen’s locality. For the environmental resistances, we created ENMs using the WorldClim layers, then added layers for soil properties, distance to water, terrain features, and vegetation, and occurrence data for the focal species (see Supplementary Information). We then calculated resistances and thresholded as described above.
To assess IBA, which had a temporal scale of the last 50 years, we obtained abundance information from the Breeding Bird Survey (Pardieck et al., 2019). This dataset consists of replicated transects where individual birds are counted across the whole of the United States. The methodology for counting is standardized and covers multiple decades of observations, with our dataset comprising data from 1966–2018. We downloaded raw data for all points, then subsetted our data to our ten focal species. We averaged the number of individuals across years (though some points only had a single year). We then interpolated across points using inverse distance weighted interpolation in the spatstat version 2.1-0 package in R (idp=5). The interpolations were converted to rasters with extents and resolutions matching those of the ENMs. We then calculated resistances such that regions of high abundance had low resistance, to generate an abundance distance matrix between individuals.
Generalized dissimilarity matrix models
We assessed the relative effect of alternative spatial models on intraspecific variation in our focal species by building generalized dissimilarity matrix models (GDMs). As spatial layers representing our five models, we calculated geographic distances, abundance resistances, environmental distance and resistance, separation by barrier, and paleoenvironmental resistance between all individuals in each species. The models represent different temporal resolutions, with IBH spanning millions to tens of thousands of years ago, IBD spanning thousands to tens of years, IBE spanning hundreds to tens of years, IBA spanning tens to single years, and IBB describing the present-day configuration of the barrier. These predictors served as the input parameters for our GDMs and will be discussed in detail below. With our numerous response matrices (four morphological matrices, three genome matrices for each missing data cutoff, 35 matrices for chromosomes, five matrices for the lostruct partitions, and six matrices for the FST outliers with missing data cutoffs) and our six predictor matrices (with two for IBE: environmental distance, environmental resistance), we generated generalized dissimilarity matrix models using the gdm package version 1.3.11 in R (Manion et al., 2018). We tested which of IBA, IBB, IBD, IBE, IBH, or a combination best explained the variation in the response matrix (see below). Not all species had all chromosomes sequenced, and not all models converged: we have omitted those data. For each of the 45 response matrices per species, we built a univariate model where the genomic/chromosomal variable was predicted solely by one of the six predictor matrices. We also built models with combinations of two (bivariate) or three variables (trivariate), which we present in the Supplementary Information. Further, we present the GDM results for the chromosomes in the supplementary information. We compared the models based on the highest percent deviance explained.
To identify any overarching patterns with respect to which model of landscape evolution best explained genetic diversity (Supplementary Figure 23), we calculated four summary statistics for each chromosome, each lostruct and FST outlier partition, and the genome as a whole. We tested whether genomic summary statistics on each chromosome (FST, DXY, missing data, recombination rate) were correlated with explained percent deviance with an analysis of variance (ANOVA) test and a Tukey’s honest significant difference test (Chambers et al., 1992, Miller 1981, Yandell 1997) using the stats v. 3.6.1 package in R. We did this for the complete dataset; for 75% and 50% missing data datasets, see Supplementary Information. We also calculated linear models comparing the proportion of each model to species-wide estimates of habitat suitability across the barrier. For all significance tests, we used an alpha value of 0.05 as our significance cutoff.
We evaluated whether the best-predictors of genomic landscapes varied across species and across partitions of the data using Chi-squared tests of significance, via the chisq.test function in the stats package in R. For each, the expected distributions assuming no differences between species, partitions, or structure were calculated and compared to the observed distributions. Chi-squared tests were performed both with and without Monte Carlo simulations (N=2000 simulations each repeated 1000 times).
Data Availability
These custom functions were deposited into a custom R package, subsppLabelR, which is available at github.com/kaiyaprovost/subsppLabelR, and scripts used to perform these analyses are found at github.com/kaiyaprovost/whole_genome_pipeline. All data used to perform analyses will be available on Dryad upon acceptance.
Acknowledgements
This work would not have been possible without generous specimen loans from DMNH (J. Woods), UWBM (R. Faucett, J. Klicka, S. Birks), UMMZ (J. Hinshaw, B. Benz), TCWC (G. Voelker), MSB (M. L. Campbell, M. Andersen, C. Witt, A. Johnson, J. McCullough), LSUMZ (D. Dittmann, F. Sheldon), CUMV (V. Rohwer, C. Dardia), AMNH (P. Sweet, P. Capainolo, B. Bird, T. Trombone). We are grateful to numerous State and Federal Collection Permit officers, and many BLM managers (T. Schnell, S. Cooke, M. McCabe, J. Atkinson, M. Daehler, S. Torrez, D. Tersey). Thanks to staff at Dalquest Desert Research Station (N. Horner) and Indio Mountains Research Station (J. Johnson). We thank M. Ingala for illustrating the birds used in many of our figures and for helpful feedback. Helpful input comes from the Smith Lab, S. Simpson, L. Musher, D. Fletcher, F. Burbrink, L. Alter, D. Kelly, I. Overcast, A. Xue, M. Hickerson, M. Blair, P. Galante, R. Harbert, E. Sterling, A. Xue, and E. Myers., the Underrepresented Genders in Museum Ornithology group, and the B. Carstens lab. This work was funded by the AMNH Frank M. Chapman Fund, American Ornithological Society, Society of Systematic Biologists, RGGS Sydney Anderson Travel Award, AMNH Linda H. Gormezano Fund, and AMNH RGGS Graduate Fellowship. BTS was supported by US NSF award DEB-1655736.