Comprehensive understanding of population structure and adaptation through graphical representation of gene–environment–trait associations

Reiichiro Nakamichi; Shuichi Kitada; Hirohisa Kishino

doi:10.1101/452581

Abstract

A variable environment affects the physiological states of individuals and, in the long run, modifies their shapes. These changes, together with geographic barriers, generate population structure. Here, we propose a graphical representation of significant associations between genes, environments, and traits. A unique feature of the graph is the node of genome F_ST. The subnetwork around this node suggests the cause and the effects of population structure and segregation. A global structure of the graph enables to grasp a comprehensive picture of adaptation to the environment. Focused look at the neighbors of the environmental factors identifies the adaptive traits and the genetic background that supported the adaptation of the traits. Isolated nodes express genetic differentiations that are not explained by the population structure, implying the presence of some unrecognized environmental factor. We show the potential usefulness of our graphical representation by a detailed analysis of public dataset of wild poplar.

Living organisms are adapted to their environment. This environmental adaptation can create significant differences in phenotypes and traits among populations of a species. For example, populations of sockeye salmon exhibit diversity in regards to life history traits such as spawning time and habitat, adult body size and shape, rearing time in freshwater and seawater, and adaptation to local spawning and rearing habitats within complex lake systems (Hilborn et al. 2003). Populations of walking-stick insects have diverged in body size, shape, host preference and behavior in parallel with the divergence of their host-plant species (Nosil et al. 2002). Aridity gradients may be the cause of geographically structured populations of Poaceae characterized by cytotype segregation of diploids and allotetraploids (Manzaneda et al. 2012). When correlated with variation in environmental factors over local populations, such variation in traits and phenotypes can offer an opportunity for understanding natural selection processes (Coop et al. 2010). Adaptation to environmental factors can change traits and phenotypes of a species, thereby creating population structure. Geographical isolation, which can lead to reproductive isolation and consequent differences in allele frequencies, also contributes to population structuring (Wright 1965). Population structure needs to be considered when analyzing correlations among genes, traits and environmental factors across population samples taken from a wide range of geographical regions.

Genome-wide association studies (GWASs) are widely used to identify associations between genes and traits/environments (Visscher et al. 2017). When data are obtained from a metapopulation exhibiting population structure, the effect of genotypes can be inferred by eliminating population structure effects (Devlin and Roeder 1999) to avoid spurious associations (Pritchard and Rosenberg 1999). One representative software program, TASSEL (Yu et al. 2006; Bradbury et al. 2007), performs this type of analysis using a unified mixed model. Alternatively, associations can be tested in a Hardy-Weinberg population that has been decomposed from a structured population (Pritchard et al. 2000). Future challenges for large-scale GWASs from wild populations (wild GWASs) include the development of methods that take population structure into account (Santure and Garant 2018).

So-called “genome scan methods” consider geographically structured populations and detect SNPs related to environmental variables, traits and phenotypes (De Mita et al. 2013; De Villemereuil et al. 2014). For instance, BayeScan (Foll and Gaggiotti 2008) detects SNPs that create major differentiation in terms of global F_ST over a metapopulation. As illustrated in Figure 1A (top), 16 outliers were detected out of 281 SNPs in Atlantic herring in one study (Limborg et al. 2012); these outliers included a SNP in a heat-shock protein (HSP70) whose allele frequency was negatively correlated with mean sea surface salinities in spawning grounds (Figure 1A, bottom). As another example, Bayenv (Coop et al. 2010) and the latent factor mixed model (LFMM (Frichot et al. 2013)) can detect SNPs that are highly correlated with environmental factors and traits.

Figure 1

Conceptual diagram of the detection of genes controlling environmental adaptation. (A) Representative genome scan methods. (B) A network-based method enabling comprehensive understanding of environmental adaptation of traits and genes that leads to population structure. E, environmental factors, such as temperature, daylength, precipitation and salinity. L, location factors, such as longitude, latitude, altitude and geographical distance. T, traits such as height, size, metabolism, disease susceptibility and reproductive season.

To obtain a comprehensive picture of population structure and adaptation using related genes, we propose a novel graphical representation of gene–environment–trait associations. The graph consists of a set of nodes and edges that connect pairs of nodes with significant association. Our graph describes correlations among allele frequencies of SNPs, states of traits, and environmental and location factors. The unique feature of our method is the use of a genome-wide population differentiation node, which enables inference of the determinants of population structure. Environmental factor nodes around this node may be the causal force for the population structure, whereas the among-locality variation of nearby traits may be the result of population differentiation, or vice versa.

In the conceptual figure of Figure 1B, a location factor, L1, is correlated with E1, an environmental factor that is correlated in turn with genome F_ST. Two traits, T1 and T2, are affected by this environmental factor. G4 and G6 are the candidate genes behind the differentiation of T1. Likewise, G9, G10 and G11 are the candidate genes for T2. Population structure (genome F_ST) may have differentiated according to some unknown traits related to G1, G2 and G3, as well as trait T2. By examining the functions of genes G7 and G8, inference of the traits selected by environmental factor E1 may be possible. Of interest, the hidden factor that differentiates gene G5 can be investigated by plotting the allele frequencies of G5 relative to location factor L2. In this way, our method provides a comprehensive perspective for understanding the genetic and ecological mechanisms of environmental adaptation of a species.

Materials and Methods

Significance of gene–environment–trait associations

The node of genome F_ST is in the form of a distance matrix between pairs of local populations. Likewise, all other nodes of SNPs, traits and environmental factors are represented by matrices whose elements are the differences between pairs of local populations (Supplementary Figure S1). Consequently, the correlation between a pair of nodes is the correlation between the between-population distance matrices. The significance of correlation between a pair of nodes is measured by simple linear regression analysis. Here, the dependent variable is a distance matrix of a node, and the explanatory variable is the distance matrix of the other node. To take account of correlations in the error term, we carried out bootstrap resampling of populations and individuals. For each of the bootstrap datasets, we calculated among-population distance matrices for each node pair, and obtained the regression coefficient for each node pair. The z value is the ratio of the original regression coefficient to its bootstrap standard deviation. By applying the Benjamini-Hochberg method (Benjamini and Hochberg 1995) to these p-values, we selected significant correlations with a false discovery rate (FDR) of 0.01. A node pair with a significant correlation was connected by an edge. We note that these edges represent the total associations of direct and indirect effects.

Estimation of pairwise F_ST

A locus pairwise F_ST at a single marker is the normalized difference of the allele frequencies and measures the genetic differentiation between a pair of local populations. To capture the fine-scale population structure even under high gene flow, we adopted an empirical Bayes estimator using EBFST function of the R package FinePop (Kitada et al. 2017). By averaging both numerators and denominators over multiple markers, we obtained genome F_ST. Genome F_ST indicates the magnitude of population differentiation over the genome, while locus F_ST indicates the contribution of each gene to population differentiation.

Application to wild poplar data

We demonstrate how the graphical representation provides a comprehensive picture of population differentiation and environmental adaptation by analyzing a publicly available data. It contains genetic and trait information of 445 individuals of wild poplar (Populus trichocarpa), which were collected from various regions over a range of 2,500 km, near the Canadian-US border at a latitude of 44′ to 59′ N, a longitude of 121′ to 138′ W, and an altitude of 0 to 800 m (McKown et al. 2014). The data included genotypes of 34,131 SNPs (3,516 genes) and values of stomatal anatomy, leaf tannin, ecophysiology, morphology and disease. Here, we focused on the four traits: adaxial stomata density (ADd), abaxial stomata density (ABd), and leaf rust disease morbidity (AUDPC) measured in 2010 and 2011 (DP10 and DPC11, respectively). Each sampling location was described by 11 environmental/geographical variables: latitude (lat), longitude (lon), altitude (alt), longest day length (DAY), frost-free days (FFD), mean annual temperature (MAT), mean warmest month temperature (MWMT), mean annual precipitation (MAP), mean summer precipitation (MSP), annual heat-moisture index (AHM) and summer heat-moisture index (SHM).

We performed a clustering analysis using the geographical distribution and divided the 445 individuals into subpopulations. We applied model-based clustering (Fraley and Raftery 2016) with three types of spatial information—latitude, longitude and altitude—as the explanation variables. Using the Bayesian information criterion based on the mclustBIC function in the R package mclust (Scrucca et al. 2016) under the VEV model (ellipsoidal, equal shape), we obtained 22 subpopulations: 5 in northern British Colombia (NBC), 11 in southern British Colombia (SBC), 3 in inland British Colombia (IBC) and 3 in Oregon (ORE).

Marker screening before analysis of the graphical representation

Because our major concern was identifying correlations between among-population differentiations of genes, traits and environmental factors, we selected the SNP with the highest global F_ST value over 22 populations, designated as the tag SNP, from each of the 3,516 gene regions. Out of the 3,516 tag SNPs, only those that were differentiated among populations were subjected to the graphical representation analysis. We note that the scaled global F_ST values, calculated as: approximately follow a chi-squared distribution with degree of freedom k (= the number of populations – 1) (Weir and Hill 2002). We performed chi-squared tests on the 3,516 genes and identified 507 tag SNPs with significant differentiation among populations (p < 0.05). Therefore, we used a total of 523 variables: 4 traits, 11 location/environmental factors, genome F_ST and 507 genes (Supplementary Table S1).

Results and Discussion

Global structure of the network

Our generated network then identified relationships between genome F_ST, 8 environmental and 2 location factors, 4 traits and 317 genes (Figure 2A, Supplementary Table S2). The network consisted of a large cluster centered around genome F_ST along with several isolated small clusters (Figure 2A, Supplementary Table S2). The location and environmental factors lat, lon, MAT and DAY were directly connected to genome F_ST in the estimated network, whereas alt was not included in the graph. In contrast, four water-related factors, MAP, MSP, AHM and SHM, were several steps away from genome F_ST. Several isolated clusters of genes were present at the boundary of the network. Although these clusters were differentiated between local populations, the absence of a significant correlation with genome F_ST implies that the diversity of these traits was not simply the result of population differentiation, but was instead due to adaptation to the local environment.

Figure 2

Whole network graph of the environmental adaptation of wild poplar. The red circle indicates population differentiation in terms of genome F_ST; green circles are environmental and location factors, blue ones are traits, and gray dots are genes. (A) Global structure of the estimated network. Colored clouds show clusters of genes with similar functions. (B) Relationship between F_ST, abaxial/adaxial stomata density (ABd and ADd), day length (DAY) and morbidity (DP10 and DP11).

Determinants of population structure

The radius-one neighborhood of genome F_ST suggested that temperature and day length were the main environmental factors causing population structure, with the observation that the edges of the graph collected significant correlations (Supplementary Figure S2). Assisted by the Entrez summaries and GO terms of any genes shown in the graph window, we found that many genes related to fertility were affected by the population structure (Supplementary Figure S2, Table S3). An example was SHT (spermidine hydroxycinnamoyl transferase), which is related to pollen development and pollen exine formation (Grienenberger et al. 2009). The scatter plot visualized in the graph window provided information that correlated genome F_ST with the SHT gene. Other fertility-related genes included MYB5 (myb domain protein 5), HAB1 (hyper sensitive to ABA1) and ACT7 (actin 7) functioning in seed germination (Li et al. 2009; Saez et al. 2006; Gilliland et al. 2003), AT3G08640 (alphavirus core family protein, DUF3411) and HOG1 (S-adenosyl-L-homocysteine hydrolase) involved in embryo development (Rocha et al. 2005), LUG (transcriptional corepressor LEUNIG) and VRN1 (AP2/B3-like transcriptional factor family protein) related to flower development (Conner and Liu 2000; Levy et al. 2002) and REV (homeobox-leucine zipper family protein/lipid-binding START domain-containing protein) associated with flower morphogenesis (Talbert et al. 1995).

Daylight, latitude, stomatal density and disease

Consistent with McKown et al. (McKown et al. 2014), our network confirmed a strong connection between ADd and disease progress (DP10 and DP11) (Figure 2B). In contrast, ABd was not directly connected to DPs, but exhibited a strong connection to DAY, as did DPs. All these nodes were directly connected to genome F_ST.

Average ADd was constant in the southern region up to 50° N, but increased with latitude in the northern region (Figure 3A). In contrast, average ABd decreased with latitude in the northern region (Figure 3B). DAY, which occurs in early summer, increased monotonically with latitude (Figure 3C), while MAT decreased on average with latitude (Figure 3D). This result indicates that poplar trees in northern populations experience longer and weaker sunshine in summer but drop their leaves earlier. Interestingly, the pore size of abaxial stomata was larger at lower values of ABd (Figure 4A), which demonstrates that northern populations had larger abaxial stomata, but their density was lower because the leaf area was limited. In contrast, the large variation in the pore size of adaxial stomata displayed no relationship with ADd (Figure 4B). The presence of larger stomata causes leaves to have a lower stomatal density but a greater photosynthetic efficiency (Lawson and Blatt 2014). These results suggest that northern populations must increase photosynthetic efficiency to adapt to an environment with weak sunshine and a shorter period before leaf shed. The increased ADd of northern populations suggests that adaxial stomata compensate for the decrease in abaxial stomata. Stomatal closure is part of the innate immune response to bacterial invasion (Melotto et al. 2006). An increase in abaxial stomata size and adaxial stomata density might increase the risk of disease invasion. Our results suggest that wild poplar can expand its habitat northward by increasing photosynthetic capacity while heightening its risk of disease, although the latter is less significant in northern areas (McKown et al. 2014). This ecological trade-off may be a cause of the population structure of wild poplar.

Figure 3

Geographic distribution of stomatal density of wild poplar, day length and temperature. (A) Latitude (lat) vs. adaxial stomata density (ADd). (B) lat vs. abaxial stomata density (ABd). (C) lat vs. longest day length (DAY). (D) lat vs. mean annual temperature (MAT).

Figure 4

Stomatal density and pore size of wild poplar. (A) Abaxial stomata density (ABd) vs pore length (ABpl). (B) Adaxial stomata density (ADd) vs pore length (ADpl).

Photosynthesis and circadian rhythm in response to day length

Geraldes et al. (Geraldes et al. 2014) have identified a large number of F_ST outliers that are overrepresented in genes involved in circadian rhythm and response to red/far-red light. In our graph, the allele frequencies of genes related to photosynthesis and the circadian cycle were found to be influenced by day length (Figure 5, Supplementary Table S4). For example, ACT7 (actin 7) is related to response to light stimulus (McDowell et al. 1996), and its allele frequencies were negatively correlated with DAY and lon (Figure 5, lower left). Geographical mapping of ATC7 allele frequencies and day length confirmed this correlation (Figure 6A). Other genes included PRR7 (pseudo-response regulator 7) and TOC1 (CCT motif-containing response regulator protein) related to circadian rhythm (Farré et al. 2005; Alabadí et al. 2001), APX2 (ascorbate peroxidase 2) associated with response to high light intensity and response to oxidative stress (Karpinski et al. 1997), EXPA1 (expansin A1) involved in response to red light (Esmon et al. 2006) and SUS4 (sucrose synthase 4) related to the carbon assimilation process (Bieniawska et al. 2007). These results suggest that day length is the most important factor controlling photosynthesis and that latitude causes differentiation of photosynthetic genes. Finally, the allele frequencies of SYP121 (syntaxin of plant 121) related to stomatal movement (Bassham and Blatt 2008) and PIP3 (plasma membrane intrinsic protein 3) participating in response to abscisic acid and water channel activity (Weig et al. 1997) were also significantly correlated with day length (figures not shown).

Figure 5

Radius-one neighborhood of longest day length (DAY) and disease progress (DP10 and DP11). The biological functions (upper left) of genes in the neighborhood and their correlations (lower left) with the center node can be seamlessly examined by entering a gene name in the upper left text box. In this example, ACT7 was examined (left), and genes related to photosynthesis and circadian rhythm are colored in orange (see text).

Figure 6

Values of environmental factors and allele frequencies of differentiated genes superimposed on a geographical map. Pie charts show allele frequencies of genes (black: minor allele; white: major allele). Heat colors are used to illustrate gradients of the environmental factors and traits. (A) Day length (DAY) and a light-response gene (ACT7). (B) Humidity (AHM) and a drought-response gene (CBF4). (C) A gibberellin-response gene (FUS6).

Damage response, the circadian system and stomata related to disease susceptibility

Morbidity due to leaf rust disease (DP10 and DP11) showed a close relationship to adaxial stomatal density (ADd) and day length (DAY) (Figs. 2B and 5, Supplementary Table S5). Genes closely connected to DAY, such as ACT7 (related to response to wounding), PRR7, APX2 and PIP3, were also closely linked to morbidity. Other genes, namely, FHY3 (far-red elongated hypocotyls 3) related to circadian rhythm (Allen et al. 2006) and DRT100 (DNA-damage repair/toleration 100) functioning in DNA repair (Pang et al. 1993), also were involved in this cluster. Because the DAY-related genes control stomatal opening and closing, our subgraph (Figure 5) suggests that fungal invasion into tissues occurs through stomata (Melotto et al. 2006). SHT (spermidine hydroxycinnamoyl transferase) was closely connected to leaf rust disease morbidity. As described above, SHT is related to pollen development and connected to genome F_ST. In addition, spermidine is known as a modulator of the immune process (Theoharides 1980). This result thus implies that the functions of SHT in immune and reproduction play important roles in population differentiation and adaptation through disease resistance. DRT100 allele frequencies were negatively correlated with DP11 (figure not shown), and the geographical gradients of DRT100 allele frequencies and DP11 well explained the correlation (Supplementary Figure S3A). This result suggests that leaf rust disease affects fertility and promotes population differentiation. Principal component analysis using these genes, which were neighbors of ADd, DP10 and DP11, clearly revealed differences in morbidity between locations from north to south (Supplementary Figure S4). This result implies that the phenotypes controlled by circadian and light-responsive genes have adapted to local environments according to latitude and day length and are responsible for the morbidity-related population differentiation.

Body growth affected by temperature

Genes in the subgraph around MAT and FFD were those involved in shoot development (Supplementary Figure S5, Table S6), such as LAS (lateral suppressor, GRAS family transcription factor) related to secondary shoot formation (Greb et al. 2003) and REV (homeobox-leucine zipper family protein/lipid-binding START domain-containing protein) linked to primary shoot apical meristem specification and leaf morphogenesis. LAS allele frequencies were negatively correlated with MAT (Supplementary Figure S5, lower left). The geographical gradients of LAS allele frequencies and MAT supported this correlation (Supplementary Figure S3b). These results imply that temperature strongly supports body growth of poplar.

Drought stress resistance depends on water conditions

The environmental factors MAP, MSP, AHM and SHM exhibited no direct connection to genome F_ST (Figure 2A, Supplementary Figure S6, Table S7). An indirect connection was apparent, however, through genes with functions related to water stress. These genes were XERICO (RING/U-box superfamily protein), HK2 (histidine kinase 2) and ABA1 (ABA deficient 1, zeaxanthin epoxidase) involved in response to osmotic stress and response to salt stress (Ko et al. 2006; Tran et al. 2007; Xiong et al. 2002), CBF4 (C-repeat binding factor 4) related to response to drought (Haake et al. 2002) and AGP14 (arabinogalactan protein 14) participating in root hair elongation (Lin et al. 2011). A close examination of CBF4, directly connected to AHM, revealed that its allele frequencies were negatively correlated with AMH and clustered by geographical groups (Supplementary Figure S6, lower left). CBF4 allele frequencies were particularly differentiated in IBC where AHM was high (Figure 6B). This result suggests that the CBF4 gene has differentiated as an adaptive response to dry weather. The apparent weak relationship between water stress and F_ST may be a consequence of the relatively small differences in water conditions in this dataset.

Vernalization depends on some unknown environmental conditions

Several isolated gene clusters, which were unconnected to genome F_ST, environmental factors or traits, appeared in the global network (Figure 2A). Each cluster contained genes whose functions were strongly related. For example, the largest isolated cluster consisted of vernalization genes (Supplementary Figure S7, Table S8), such as FUS6 related to regulation of flower development and seed germination (Chory et al. 1996), GA3OX1 associated with response to gibberellin and response to red light (McGinnis et al. 2003) and VRN1 linked to vernalization response and regulation of flower development. Although these genes may not be directly responsible for population structure, the appearance of the isolated cluster in the network implies a latent relationship between vernalization and population differentiation. In regards to the geographical distribution of their allele frequencies, FUS6 and GA3OX1 had similar, complicated patterns (Figure 6C, Supplementary Figure S3C). Populations in SBC and eastern IBC had a similar pattern of allele frequencies, while those in northern NBC, southern ORE and western IBC displayed a different pattern. This result may imply an adaptation to a microenvironment not observed in this data. For example, the direction of a mountain slope can create different habitats with different daylight conditions.

Gene ontology enrichment analysis

No significant GOs were predicted by gene ontology enrichment analysis (Subramanian et al. 2005; Alexa et al. 2006) for the union of the set of 317 genes selected for the graph and the sets of neighboring genes mentioned above relative to the other complementary gene sets. To obtain a comprehensive picture based on solid evidence, we focused on geographically differentiating SNPs and selected pairs of nodes by controlling FDR. As a consequence, we may have diminished the ability to identify differences between the two sets of genes. Alternatively, mutations in a few members of the relevant pathways may have enabled adaptation to the variable environments.

Our method identifies genes related to environmental adaptation with a FDR of 1% and visualizes their network, including genome F_ST, environmental and location factors, and traits. Our example using wild poplar has revealed the potential of our graphical model representation to aid comprehensive understanding of ecological and genetic mechanisms underlying environmental adaptation and population structuring. While conventional GWAS and genome scanning effectively search for genes related to some given factors or traits, our method captures the overall picture of the relationship among genes, environmental factors and traits in association with population structure. By following the sub-network of genes around target environmental factors and traits, we can obtain a detailed understanding of the relationship of genes behind environmental adaptation and population differentiation. In particular, detection of collaboratively adapted gene clusters, which are not directly associated with the given environment/trait factors, is an advantage of our graphical representation. Our R software module GET.graph aids this process by displaying subgraphs and scatter plots of allele frequencies of genes vs. environmental factors/traits. GET.graph retains the biological functions of genes retrieved from public databases, such as GO and ENTREZ, and helps us smoothly interpret the graph. Through this process, we can reach comprehensive understanding of population structure and adaptation by characterizing the sub-networks of the graph (Figure 2a).

Our graph collects significant correlations that sum up both direct and indirect relationships, while partial correlations extract direct relationships (Kishino and Waddell 2000; De La Fuente et al. 2004; Liu 2013). Collection of significant partial correlations in this setting is left for future study. Finally, we must be aware of computational feasibility. The calculation load greatly increases depending on the number of variables and is roughly proportional to the square of the number of variables. The analysis for this paper took several hours on an Intel Core i7 (6 core) workstation. The above step of prescreening variables is therefore indispensable. As a final remark, the data, especially genomic data, often include missing values. Our graphical representation method describes relationships between population means of allele frequencies, trait values and environmental/location factors; therefore, like Bayenv (Coop et al. 2010), it analyzes sample means among measured data. As long as the means of measured allele/environmental/trait variables are unbiased estimates of the corresponding sample means, the procedure is also unbiased.

The R software module (GET.graph) that implements the network analysis described in this paper is available in the FinePop package at CRAN (https://CRAN.R-project.org/package=FinePop).

Figure S1

Data matrices. The original dataset is a matrix of environmental factor mean (E_i), trait value mean (T_i) and minor allele frequency of the kth gene in the /th population. Distance data for our graphical representation consist of a matrix of pairwise genome F_ST , the pairwise difference of environmental factors (d(E_i,E_j)), the pairwise difference of trait values (d(T_i, T_j)) and pairwise locus F_ST of the kth gene between ith and jth populations.

Figure S2

Radius-one neighborhood of genome F_ST. Any subgraph can be drawn given any subgraph origins, radii and genes. The Entrez summary, GO term and a scatter plot between any two nodes can be shown in the same graph window. SHT, colored in orange, was examined in this example. The scatter plot shows the correlation between genome F_ST and the SHT gene. Genes related to reproduction are colored in pink (see text).

Figure S3

Values of environmental factors and allele frequencies of differentiated genes superimposed on a geographical map. Pie charts show allele frequencies of genes (black: minor allele; white: major allele). Heat colors are used to display gradients of the environmental factors and traits. (A) Disease progress (DP11) and a DNA repair gene (DRT100). (B) Annual temperature (MAT) and a lateral control gene (LAS). (C) A flower development gene (GA3OX1).

Figure S4

Principal component analysis plot of genes neighboring adaxial stomata density (ADd) and morbidity (DP10, DP11) in 22 populations of wild poplar. NBC, northern British Colombia; SBC, southern British Colombia; IBC, inland British Colombia; ORE, Oregon.

Figure S5

Radius-one neighborhood of mean annual temperature (MAT). The Entrez summary/GO term for LAS and the scatter plot for MAT and allele frequencies of LAS are shown in the graph window. Genes related to body growth are colored in gray (see text).

Figure S6

Neighborhood of precipitation (MAP and MSP) and moisture (AHM and SHM). The Entrez summary/GO term for CBF4, the scatter plot for AHM and allele frequencies of CBF4 are shown. Genes related to drought stress response are colored in brown (see text).

Figure S7

Isolated cluster of vernalization genes from Figure 2A. The Entrez summary/GO term for FUS6 is shown in the graph window. Genes related to vernalization are colored in pink (see text).

Acknowledgements

This study was supported by the Japan Society for the Promotion of Science Grant-in-Aid for Scientific Research 25280006 and 16H02788 to HK and 18K05781 to SK.

Literature Cited

↵
Alabadí D., T. Oyama, M. J. Yanovsky, F. G. Harmon, P. Más et al., 2001 Reciprocal regulation between TOC1 and LHY/CCA1 within the Arabidopsis circadian clock. Science 293: 880–883. https://doi.org/10.1126/science.1061320.
OpenUrl Abstract/FREE Full Text
↵
Alexa A., J. Rahnenführer, and T. Lengauer, 2006 Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics 22: 1600–1607. https://doi.org/10.1093/bioinformatics/btl140.
OpenUrl CrossRef PubMed Web of Science
↵
Allen T., A. Koustenis, G. Theodorou, D. E. Somers, S. A. Kay et al., 2006 Arabidopsis FHY3 specifically gates phytochrome signaling to the circadian clock. Plant Cell 18: 2506–2516. https://doi.org/10.1105/tpc.105.037358.
OpenUrl Abstract/FREE Full Text
↵
Bassham D. C., and M. R. Blatt, 2008 SNAREs: cogs and coordinators in signaling and development. Plant Physiol. 147: 1504–1515. https://doi.org/10.1104/pp.108.121129.
OpenUrl FREE Full Text
↵
Benjamini Y., and Y. Hochberg, 1995 Controling the false discovery rate: a practical and powerful approach to multiple testing. J. Royal Stat. Soc. 57: 289–300. https://doi.org/10.2307/2346101.
OpenUrl
↵
Bieniawska Z., D. H. Paul Barratt, A. P. Garlick, V. Thole, N. J. Kruger et al., 2007 Analysis of the sucrose synthase gene family in Arabidopsis. Plant J. 49: 810–828. https://doi.org/10.1111/j.1365-313X.2006.03011.x.
OpenUrl CrossRef PubMed Web of Science
↵
Bradbury P. J., Z. Zhang, D. E. Kroon, T. M. Casstevens, Y. Ramdoss et al., 2007 TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23: 2633–2635. https://doi.org/10.1093/bioinformatics/btm308.
OpenUrl CrossRef PubMed Web of Science
↵
Chory J., M. Chatterjee, R. K. Cook, T. Elich, C. Fankhauser et al., 1996 From seed germination to flowering, light controls plant development via the pigment phytochrome. Proc. Natl. Acad. Sci. 93: 12066–12071. https://doi.org/10.1073/pnas.93.22.12066.
OpenUrl Abstract/FREE Full Text
↵
Conner J., and Z. Liu, 2000 LEUNIG, a putative transcriptional corepressor that regulates AGAMOUS expression during flower development. Proc. Natl. Acad. Sci. 97: 12902-12907. https://doi.org/10.1073/pnas.230352397.
OpenUrl Abstract/FREE Full Text
↵
Coop G. D., D. Witonsky, A. Di Rienzo, and K. J. Pritchard, 2010 Using environmental correlations to identify loci underlying local adaptation. Genetics 185: 1411–1423. https://doi.org/10.1534/genetics.110.114819.
OpenUrl Abstract/FREE Full Text
↵
De La Fuente, A., N. Bing, I. Hoeschele, and P. Mendes, 2004 Discovery of meaningful associations in genomic data using partial correlation coefficients. Bioinformatics 20: 3565–3574. https://doi.org/10.1093/bioinformatics/bth445.
OpenUrl CrossRef PubMed Web of Science
↵
De Mita S., A. C. Thuillet, L. Gay, N. Ahmadi, S. Manel et al., 2013 Detecting selection along environmental gradients: analysis of eight methods and their effectiveness for outbreeding and selfing populations. Mol. Ecol. 22: 1383–1399. https://doi.org/10.1111/mec.12182.
OpenUrl CrossRef Web of Science
↵
De Villemereuil P., É. Frichot, É. Bazin, O. François, and O. E. Gaggiotti, 2014 Genome scan methods against more complex models: when and how much should we trust them? Mol. Ecol. 23: 2006–2019. https://doi.org/10.1111/mec.12705.
OpenUrl
↵
Devlin B., and K. Roeder, 1999 Genomic control for association studies. Biometrics 55: 997–1004. https://doi.org/10.1111/j.0006-341X.1999.00997.x.
OpenUrl CrossRef PubMed Web of Science
↵
Esmon C. A., A. G. Tinsley, K. Ljung, G. Sandberg, L. B. Hearne et al., 2006 A gradient of auxin and auxin-dependent transcription precedes tropic growth responses. Proc. Natl. Acad. Sci. 103: 236–241. https://doi.org/10.1073/pnas.0507127103.
OpenUrl Abstract/FREE Full Text
↵
Farré E. M. H. S. L, F. Harmon, M. J. Yanovsky, and S. A. Kay, 2005 Overlapping and distinct roles of PRR7 and PRR9 in the Arabidopsis circadian clock. Curr. Biol. 15: 47–54. https://doi.org/10.1016/j.cub.2004.12.067.
OpenUrl CrossRef PubMed Web of Science
↵
Foll M., and O. Gaggiotti, 2008 A genome scan method to identify selected loci appropriate for both dominant and codominant markers: a Bayesian perspective. Genetics 180: 977-993. https://doi.org/10.1534/genetics.108.092221.
OpenUrl Abstract/FREE Full Text
↵
Fraley C., and A. E. Raftery, 2016 Model-based clustering, discriminant analysis and density estimation. BMC Bioinformatics 17: 287. https://doi.org/10.1198/016214502760047131.
OpenUrl
↵
Frichot E., S. D. Schoville, G. Bouchard, and O. François, 2013 Testing for associations between loci and environmental gradients using latent factor mixed models. Mol. Biol. Evol. 30: 1687–1699. https://doi.org/10.1093/molbev/mst063.
OpenUrl CrossRef PubMed Web of Science
↵
Geraldes A., N. Farzaneh, C. J. Grassa, A. D. McKown, R. D. Guy et al., 2014 Landscape genomics of Populus trichocarpa the role of hybridization limited gene flow and natural selection in shaping patterns of population structure. Evolution 68: 3260–3280. https://doi.org/10.1111/evo.12497.
OpenUrl CrossRef
↵
Gilliland L. U., L. C. Pawloski, M. K. Kandasamy, and R. B. Meagher, 2003 Arabidopsis actin gene ACT7 plays an essential role in germination and root growth. Plant J. 33: 319–328. https://doi.org/10.1046/j.1365-313X.2003.01626.x.
OpenUrl CrossRef PubMed Web of Science
↵
Greb T., O. Clarenz, E. Schäfer, D. Müller, R. Herrero et al., 2003 Molecular analysis of the LATERAL SUPPRESSOR gene in Arabidopsis reveals a conserved control mechanism for axillary meristem formation. Genes Dev. 17: 1175–1187. https://doi.org/10.1101/gad.260703.
OpenUrl Abstract/FREE Full Text
↵
Grienenberger E., S. Besseau, P. Geoffroy, D. Debayle, D. Heintz et al., 2009 A BAHD acyltransferase is expressed in the tapetum of Arabidopsis anthers and is involved in the synthesis of hydroxycinnamoyl spermidines. Plant J. 58: 246–259. https://doi.org/10.1111/j.1365-313X.2008.03773.x.
OpenUrl CrossRef PubMed Web of Science
↵
Haake V., D. Cook, J. L. Riechmann, O. Pineda, M. F. Thomashow et al., 2002 Transcription factor CBF4 is a regulator of drought adaptation in Arabidopsis. Plant Physiol. 130: 639–648. https://doi.org/10.1104/pp.006478.
OpenUrl Abstract/FREE Full Text
↵
Hilborn R., T. P. Quinn, D. E. Schindler, and D. E. Rogers, 2003 Biocomplexity and fisheries sustainability. Proc. Natl. Acad. Sci. 100: 6564–6568. https://doi.org/10.1073/pnas.1037274100.
OpenUrl Abstract/FREE Full Text
↵
Karpinski S., C. Escobar, B. Karpinska, G. Creissen, and P. M. Mullineaux, 1997 Photosynthetic electron transport regulates the expression of cytosolic ascorbate peroxidase genes in Arabidopsis during excess light stress. Plant Cell 9: 627–640. https://doi.org/10.1105/tpc.9.4.627.
OpenUrl Abstract/FREE Full Text
↵
Kishino H., and P. J. Waddell, 2000 Correspondence analysis of genes and tissue types and finding genetic links from microarray data. Genome Inform Ser Workshop Genome Inform. 11: 83–95.
OpenUrl PubMed
↵
Kitada S., R. Nakamichi, and H. Kishino, 2017 The empirical Bayes estimators of fine-scale population structure in high gene flow species. Mol. Ecol. Resour. 17: 1210–1222. https://doi.org/10.1111/1755-0998.12663.
OpenUrl
↵
Ko J. H., S. H. Yang, and K. H. Han, 2006 Upregulation of an Arabidopsis RING-H2 gene, XERICO, confers drought tolerance through increased abscisic acid. Plant J. 47: 343–355. https://doi.org/10.1111/j.1365-313X.2006.02782.x.
OpenUrl CrossRef PubMed Web of Science
↵
Lawson T., and M. R. Blatt, 2014 Stomatal size, speed, and responsiveness impact on photosynthesis and water use efficiency. Plant Physiol. 164: 1556–1570. https://doi.org/10.1104/pp.114.237107.
OpenUrl Abstract/FREE Full Text
↵
Levy Y. Y., S. Mesnage, J. S. Mylne, A. R. Gendall, and C. Dean, 2002 Multiple roles of Arabidopsis VRN1 in vernalization and flowering time control. Science 297: 243–246. https://doi.org/10.1126/science.1072147.
OpenUrl Abstract/FREE Full Text
↵
Li S. F., O. N. Milliken, H. Pham, R. Seyit, R. Napoli et al., 2009 The Arabidopsis MYB5 transcription factor regulates mucilage synthesis, seed coat development, and trichome morphogenesis. Plant Cell 21: 72–89. https://doi.org/10.1105/tpc.108.063503.
OpenUrl Abstract/FREE Full Text
↵
Limborg M. T., S. J. Helyar, M. De Bruyn, M. I. Taylor, E. E. Nielsen et al., 2012 Environmental selection on transcriptome‐derived SNPs in a high gene flow marine fish, the Atlantic herring (Clupea harengus). Mol. Ecol. 21: 3686–3703. https://doi.org/10.1111/j.1365-294X.2012.05639.x.
OpenUrl CrossRef PubMed Web of Science
↵
Lin W. D., Y. Y. Liao, T. J. Yang, C. Y. Pan, T. J. Buckhout et al., 2011 Coexpression-based clustering of Arabidopsis root genes predicts functional modules in early phosphate deficiency signaling. Plant Physiol. 155: 1383–1402. https://doi.org/10.1104/pp.110.166520.
OpenUrl Abstract/FREE Full Text
↵
Liu W., 2013 Gaussian graphical model estimation with false discovery rate control. Ann. Stat. 41: 2948–2978. https://doi.org/10.1214/13-AOS1169.
OpenUrl
↵
Manzaneda A. J., P. J. Rey, J. M. Bastida, C. Weiss-Lehman, E. Raskin et al., 2012 Environmental aridity is associated with cytotype segregation and polyploidy occurrence in Brachypodium distachyon (Poaceae). New Phytol. 193: 797–805. https://doi.org/10.1111/j.1469-8137.2011.03988.x.
OpenUrl CrossRef PubMed Web of Science
↵
McDowell L. M., Y. An, S. Huang, E. C. McKinney, and R. B. Meagher, 1996 The Arabidopsis ACT7 actin gene is expressed in rapidly developing tissues and responds to several external stimuli. Plant Physiol. 111: 699–711. https://doi.org/10.1104/pp.111.3.699.
OpenUrl Abstract
↵
McGinnis K. M., S. G. Thomas, J. D. Soule, L. C. Strader, J. M. Zale et al., 2003 The Arabidopsis SLEEPY1 gene encodes a putative F-Box subunit of an SCF E3 ubiquitin ligase. Plant Cell 15: 1120–1130. https://doi.org/10.1105/tpc.010827.
OpenUrl Abstract/FREE Full Text
↵
McKown A. D., R. D. Guy, L. Quamme, J. Klápště, J. La Mantia et al., 2014 Association genetics, geography and ecophysiology link stomatal patterning in Populus trichocarpa with carbon gain and disease resistance trade-offs. Mol. Ecol. 23: 5771–5790. https://doi.org/10.1111/mec.12969.
OpenUrl CrossRef
↵
Melotto M., W. Underwood, J. Koczan, K. Nomura, and S. Y. He, 2006 Plant stomata function in innate immunity against bacterial invasion. Cell 126: 969–98. https://doi.org/10.1016/j.cell.2006.06.054.
OpenUrl CrossRef PubMed Web of Science
↵
Nosil P., B. J. Crespi, and C. P. Sandoval, 2002 Host-plant adaptation drives the parallel evolution of reproductive isolation. Nature 417: 440–443. https://doi.org/10.1038/417440a.
OpenUrl CrossRef PubMed Web of Science
↵
Pang Q., J. B. Hays, I. Rajagopal, and T. S. Schaefer, 1993 Selection of Arabidopsis cDNAs that partially correct phenotypes of Escherichia coli DNA-damage-sensitive mutants and analysis of two plant cDNAs that appear to express UV-specific dark repari activities. Plant Mol. Biol. 22: 411–426. https://doi.org/10.1007/BF00015972.
OpenUrl CrossRef PubMed Web of Science
↵
Pritchard J. K., and N. A. Rosenberg, 1999 Use of unlinked genetic markers to detect population stratification in association studies. Am. J. Hum. Genet. 65: 220–228. https://doi.org/10.1086/302449.
OpenUrl CrossRef PubMed Web of Science
↵
Pritchard J. K., M. Stephens, N. A. Rosenberg, and P. Donnelly, 2000 Association mapping in structured populations. Am. J. Hum. Genet. 67: 170–181. https://doi.org/10.1086/302959.
OpenUrl CrossRef PubMed Web of Science
↵
Rocha P. S., M. Sheikh, R. Melchiorre, M. Fagard, S. Boutet et al., 2005 The Arabidopsis HOMOLOGY-DEPENDENT GENE SILENCING1 gene codes for an S-adenosyl-L-homocysteine hydrolase required for DNA methylation-dependent gene silencing. Plant Cell 17: 404–417. https://doi.org/10.1105/tpc.104.028332.
OpenUrl Abstract/FREE Full Text
↵
Saez A., N. Robert, M. H. Maktabi, J. I. Schroeder, R. Serrano et al., 2006 Enhancement of abscisic acid sensitivity and reduction of water consumption in Arabidopsis by combined inactivation of the protein phosphatases type 2C ABI1 and HAB1. Plant Physiol. 141: 1389–1399. https://doi.org/10.1104/pp.106.081018.
OpenUrl Abstract/FREE Full Text
↵
Santure A. W., and D. Garant, 2018 Wild GWAS‐association mapping in natural populations. Mol. Ecol. Resour. 18: 729–738. https://doi.org/10.1111/1755-0998.12901.
OpenUrl
↵
Scrucca L., M. Fop, T. B. Murphy, and A. E. Raftery, 2016 mclust 5: clustering, classification and density estimation using Gaussian finite mixture models. R J. 8: 205–233.
OpenUrl
↵
Subramanian, A., P. Tamayo, V. K. Mootha, S. Mukherjee, B. L. Ebert et al., 2005 Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. 102: 15545–15550. https://doi.org/10.1073/pnas.0506580102.
OpenUrl Abstract/FREE Full Text
↵
Talbert P. B., H. T. Adler, D. W. Parks, and L. Comai, 1995 The REVOLUTA gene is necessary for apical meristem development and for limiting cell divisions in the leaves and stems of Arabidopsis thaliana. Development 121: 2723–2735.
OpenUrl Abstract
↵
Theoharides T. C., 1980 Polyamines spermidine and spermine as modulators of calcium-dependent immune processes. Life Sci. 27: 703–713. https://doi.org/10.1016/0024-3205(80)90323-9.
OpenUrl CrossRef PubMed Web of Science
↵
Tran L. S., T. Urao, F. Qin, K. Maruyama, T. Kakimoto et al., 2007 Functional analysis of AHK1/ATHK1 and cytokinin receptor histidine kinases in response to abscisic acid, drought, and salt stress in Arabidopsis. Proc. Natl. Acad. Sci. 104: 20623–20628. https://doi.org/10.1073/pnas.0706547105.
OpenUrl Abstract/FREE Full Text
↵
Visscher P. M., N. R. Wray, Q. Zhang, P. Sklar, M. I. McCarthy et al., 2017 10 years of GWAS discovery: biology, function, and translation. Am. J. Hum. Genet. 101: 5–22. https://doi.org/10.1016/j.ajhg.2017.06.005.
OpenUrl CrossRef PubMed
↵
Weig A., C. Deswarte, and M. J. Chrispeels, 1997 The major intrinsic protein family of Arabidopsis has 23 members that form three distinct groups with functional aquaporins in each group. Plant Physiol. 114: 1347–1357. https://doi.org/10.1104/pp.114.4.1347.
OpenUrl Abstract
↵
Weir B. S., and W. G. Hill, 2002 Estimating F-statistics. Annu. Rev. Genet. 36: 721–750. https://doi.org/10.1146/annurev.genet.36.050802.093940.
OpenUrl CrossRef PubMed Web of Science
↵
Wright S., 1965 The interpretation of population structure by F‐statistics with special regard to systems of mating. Evolution 19: 395–420. https://doi.org/10.2307/2406450.
OpenUrl CrossRef Web of Science
↵
Xiong L., H. Lee, M. Ishitani, and J. K. Zhu, 2002 Regulation of osmotic stress-responsive gene expression by the LOS6/ABA1 locus in Arabidopsis. J. Biol. Chem. 277: 8588–8596. https://doi.org/10.1074/jbc.M109275200.
OpenUrl Abstract/FREE Full Text
↵
Yu J., G. Pressoir, W. H. Briggs, I. Vroh Bi, M. Yamasaki et al., 2006 A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat. Genet. 38: 203–208. https://doi.org/10.1038/ng1702.
OpenUrl CrossRef PubMed Web of Science

View the discussion thread.

Posted October 25, 2018.

Download PDF

Citation Tools

Subject Area

Bioinformatics

Subject Areas

All Articles

Animal Behavior and Cognition (5215)
Biochemistry (11752)
Bioengineering (8752)
Bioinformatics (29200)
Biophysics (14974)
Cancer Biology (12096)
Cell Biology (17411)
Clinical Trials (138)
Developmental Biology (9421)
Ecology (14182)
Epidemiology (2067)
Evolutionary Biology (18308)
Genetics (12245)
Genomics (16803)
Immunology (11869)
Microbiology (28097)
Molecular Biology (11594)
Neuroscience (60969)
Paleontology (451)
Pathology (1871)
Pharmacology and Toxicology (3238)
Physiology (4959)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2886)
Systems Biology (7340)
Zoology (1651)

[1] ↵
Alabadí D., T. Oyama, M. J. Yanovsky, F. G. Harmon, P. Más et al., 2001 Reciprocal regulation between TOC1 and LHY/CCA1 within the Arabidopsis circadian clock. Science 293: 880–883. https://doi.org/10.1126/science.1061320.
OpenUrl Abstract/FREE Full Text

[2] ↵
Alexa A., J. Rahnenführer, and T. Lengauer, 2006 Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics 22: 1600–1607. https://doi.org/10.1093/bioinformatics/btl140.
OpenUrl CrossRef PubMed Web of Science

[3] ↵
Allen T., A. Koustenis, G. Theodorou, D. E. Somers, S. A. Kay et al., 2006 Arabidopsis FHY3 specifically gates phytochrome signaling to the circadian clock. Plant Cell 18: 2506–2516. https://doi.org/10.1105/tpc.105.037358.
OpenUrl Abstract/FREE Full Text

[4] ↵
Bassham D. C., and M. R. Blatt, 2008 SNAREs: cogs and coordinators in signaling and development. Plant Physiol. 147: 1504–1515. https://doi.org/10.1104/pp.108.121129.
OpenUrl FREE Full Text

[5] ↵
Benjamini Y., and Y. Hochberg, 1995 Controling the false discovery rate: a practical and powerful approach to multiple testing. J. Royal Stat. Soc. 57: 289–300. https://doi.org/10.2307/2346101.
OpenUrl

[6] ↵
Bieniawska Z., D. H. Paul Barratt, A. P. Garlick, V. Thole, N. J. Kruger et al., 2007 Analysis of the sucrose synthase gene family in Arabidopsis. Plant J. 49: 810–828. https://doi.org/10.1111/j.1365-313X.2006.03011.x.
OpenUrl CrossRef PubMed Web of Science

[7] ↵
Bradbury P. J., Z. Zhang, D. E. Kroon, T. M. Casstevens, Y. Ramdoss et al., 2007 TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23: 2633–2635. https://doi.org/10.1093/bioinformatics/btm308.
OpenUrl CrossRef PubMed Web of Science

[8] ↵
Chory J., M. Chatterjee, R. K. Cook, T. Elich, C. Fankhauser et al., 1996 From seed germination to flowering, light controls plant development via the pigment phytochrome. Proc. Natl. Acad. Sci. 93: 12066–12071. https://doi.org/10.1073/pnas.93.22.12066.
OpenUrl Abstract/FREE Full Text

[9] ↵
Conner J., and Z. Liu, 2000 LEUNIG, a putative transcriptional corepressor that regulates AGAMOUS expression during flower development. Proc. Natl. Acad. Sci. 97: 12902-12907. https://doi.org/10.1073/pnas.230352397.
OpenUrl Abstract/FREE Full Text

[10] ↵
Coop G. D., D. Witonsky, A. Di Rienzo, and K. J. Pritchard, 2010 Using environmental correlations to identify loci underlying local adaptation. Genetics 185: 1411–1423. https://doi.org/10.1534/genetics.110.114819.
OpenUrl Abstract/FREE Full Text

[11] ↵
De La Fuente, A., N. Bing, I. Hoeschele, and P. Mendes, 2004 Discovery of meaningful associations in genomic data using partial correlation coefficients. Bioinformatics 20: 3565–3574. https://doi.org/10.1093/bioinformatics/bth445.
OpenUrl CrossRef PubMed Web of Science

[12] ↵
De Mita S., A. C. Thuillet, L. Gay, N. Ahmadi, S. Manel et al., 2013 Detecting selection along environmental gradients: analysis of eight methods and their effectiveness for outbreeding and selfing populations. Mol. Ecol. 22: 1383–1399. https://doi.org/10.1111/mec.12182.
OpenUrl CrossRef Web of Science

[13] ↵
De Villemereuil P., É. Frichot, É. Bazin, O. François, and O. E. Gaggiotti, 2014 Genome scan methods against more complex models: when and how much should we trust them? Mol. Ecol. 23: 2006–2019. https://doi.org/10.1111/mec.12705.
OpenUrl

[14] ↵
Devlin B., and K. Roeder, 1999 Genomic control for association studies. Biometrics 55: 997–1004. https://doi.org/10.1111/j.0006-341X.1999.00997.x.
OpenUrl CrossRef PubMed Web of Science

[15] ↵
Esmon C. A., A. G. Tinsley, K. Ljung, G. Sandberg, L. B. Hearne et al., 2006 A gradient of auxin and auxin-dependent transcription precedes tropic growth responses. Proc. Natl. Acad. Sci. 103: 236–241. https://doi.org/10.1073/pnas.0507127103.
OpenUrl Abstract/FREE Full Text

[16] ↵
Farré E. M. H. S. L, F. Harmon, M. J. Yanovsky, and S. A. Kay, 2005 Overlapping and distinct roles of PRR7 and PRR9 in the Arabidopsis circadian clock. Curr. Biol. 15: 47–54. https://doi.org/10.1016/j.cub.2004.12.067.
OpenUrl CrossRef PubMed Web of Science

[17] ↵
Foll M., and O. Gaggiotti, 2008 A genome scan method to identify selected loci appropriate for both dominant and codominant markers: a Bayesian perspective. Genetics 180: 977-993. https://doi.org/10.1534/genetics.108.092221.
OpenUrl Abstract/FREE Full Text

[18] ↵
Fraley C., and A. E. Raftery, 2016 Model-based clustering, discriminant analysis and density estimation. BMC Bioinformatics 17: 287. https://doi.org/10.1198/016214502760047131.
OpenUrl

[19] ↵
Frichot E., S. D. Schoville, G. Bouchard, and O. François, 2013 Testing for associations between loci and environmental gradients using latent factor mixed models. Mol. Biol. Evol. 30: 1687–1699. https://doi.org/10.1093/molbev/mst063.
OpenUrl CrossRef PubMed Web of Science

[20] ↵
Geraldes A., N. Farzaneh, C. J. Grassa, A. D. McKown, R. D. Guy et al., 2014 Landscape genomics of Populus trichocarpa the role of hybridization limited gene flow and natural selection in shaping patterns of population structure. Evolution 68: 3260–3280. https://doi.org/10.1111/evo.12497.
OpenUrl CrossRef

[21] ↵
Gilliland L. U., L. C. Pawloski, M. K. Kandasamy, and R. B. Meagher, 2003 Arabidopsis actin gene ACT7 plays an essential role in germination and root growth. Plant J. 33: 319–328. https://doi.org/10.1046/j.1365-313X.2003.01626.x.
OpenUrl CrossRef PubMed Web of Science

[22] ↵
Greb T., O. Clarenz, E. Schäfer, D. Müller, R. Herrero et al., 2003 Molecular analysis of the LATERAL SUPPRESSOR gene in Arabidopsis reveals a conserved control mechanism for axillary meristem formation. Genes Dev. 17: 1175–1187. https://doi.org/10.1101/gad.260703.
OpenUrl Abstract/FREE Full Text

[23] ↵
Grienenberger E., S. Besseau, P. Geoffroy, D. Debayle, D. Heintz et al., 2009 A BAHD acyltransferase is expressed in the tapetum of Arabidopsis anthers and is involved in the synthesis of hydroxycinnamoyl spermidines. Plant J. 58: 246–259. https://doi.org/10.1111/j.1365-313X.2008.03773.x.
OpenUrl CrossRef PubMed Web of Science

[24] ↵
Haake V., D. Cook, J. L. Riechmann, O. Pineda, M. F. Thomashow et al., 2002 Transcription factor CBF4 is a regulator of drought adaptation in Arabidopsis. Plant Physiol. 130: 639–648. https://doi.org/10.1104/pp.006478.
OpenUrl Abstract/FREE Full Text

[25] ↵
Hilborn R., T. P. Quinn, D. E. Schindler, and D. E. Rogers, 2003 Biocomplexity and fisheries sustainability. Proc. Natl. Acad. Sci. 100: 6564–6568. https://doi.org/10.1073/pnas.1037274100.
OpenUrl Abstract/FREE Full Text

[26] ↵
Karpinski S., C. Escobar, B. Karpinska, G. Creissen, and P. M. Mullineaux, 1997 Photosynthetic electron transport regulates the expression of cytosolic ascorbate peroxidase genes in Arabidopsis during excess light stress. Plant Cell 9: 627–640. https://doi.org/10.1105/tpc.9.4.627.
OpenUrl Abstract/FREE Full Text

[27] ↵
Kishino H., and P. J. Waddell, 2000 Correspondence analysis of genes and tissue types and finding genetic links from microarray data. Genome Inform Ser Workshop Genome Inform. 11: 83–95.
OpenUrl PubMed

[28] ↵
Kitada S., R. Nakamichi, and H. Kishino, 2017 The empirical Bayes estimators of fine-scale population structure in high gene flow species. Mol. Ecol. Resour. 17: 1210–1222. https://doi.org/10.1111/1755-0998.12663.
OpenUrl

[29] ↵
Ko J. H., S. H. Yang, and K. H. Han, 2006 Upregulation of an Arabidopsis RING-H2 gene, XERICO, confers drought tolerance through increased abscisic acid. Plant J. 47: 343–355. https://doi.org/10.1111/j.1365-313X.2006.02782.x.
OpenUrl CrossRef PubMed Web of Science

[30] ↵
Lawson T., and M. R. Blatt, 2014 Stomatal size, speed, and responsiveness impact on photosynthesis and water use efficiency. Plant Physiol. 164: 1556–1570. https://doi.org/10.1104/pp.114.237107.
OpenUrl Abstract/FREE Full Text

[31] ↵
Levy Y. Y., S. Mesnage, J. S. Mylne, A. R. Gendall, and C. Dean, 2002 Multiple roles of Arabidopsis VRN1 in vernalization and flowering time control. Science 297: 243–246. https://doi.org/10.1126/science.1072147.
OpenUrl Abstract/FREE Full Text

[32] ↵
Li S. F., O. N. Milliken, H. Pham, R. Seyit, R. Napoli et al., 2009 The Arabidopsis MYB5 transcription factor regulates mucilage synthesis, seed coat development, and trichome morphogenesis. Plant Cell 21: 72–89. https://doi.org/10.1105/tpc.108.063503.
OpenUrl Abstract/FREE Full Text

[33] ↵
Limborg M. T., S. J. Helyar, M. De Bruyn, M. I. Taylor, E. E. Nielsen et al., 2012 Environmental selection on transcriptome‐derived SNPs in a high gene flow marine fish, the Atlantic herring (Clupea harengus). Mol. Ecol. 21: 3686–3703. https://doi.org/10.1111/j.1365-294X.2012.05639.x.
OpenUrl CrossRef PubMed Web of Science

[34] ↵
Lin W. D., Y. Y. Liao, T. J. Yang, C. Y. Pan, T. J. Buckhout et al., 2011 Coexpression-based clustering of Arabidopsis root genes predicts functional modules in early phosphate deficiency signaling. Plant Physiol. 155: 1383–1402. https://doi.org/10.1104/pp.110.166520.
OpenUrl Abstract/FREE Full Text

[35] ↵
Liu W., 2013 Gaussian graphical model estimation with false discovery rate control. Ann. Stat. 41: 2948–2978. https://doi.org/10.1214/13-AOS1169.
OpenUrl

[36] ↵
Manzaneda A. J., P. J. Rey, J. M. Bastida, C. Weiss-Lehman, E. Raskin et al., 2012 Environmental aridity is associated with cytotype segregation and polyploidy occurrence in Brachypodium distachyon (Poaceae). New Phytol. 193: 797–805. https://doi.org/10.1111/j.1469-8137.2011.03988.x.
OpenUrl CrossRef PubMed Web of Science

[37] ↵
McDowell L. M., Y. An, S. Huang, E. C. McKinney, and R. B. Meagher, 1996 The Arabidopsis ACT7 actin gene is expressed in rapidly developing tissues and responds to several external stimuli. Plant Physiol. 111: 699–711. https://doi.org/10.1104/pp.111.3.699.
OpenUrl Abstract

[38] ↵
McGinnis K. M., S. G. Thomas, J. D. Soule, L. C. Strader, J. M. Zale et al., 2003 The Arabidopsis SLEEPY1 gene encodes a putative F-Box subunit of an SCF E3 ubiquitin ligase. Plant Cell 15: 1120–1130. https://doi.org/10.1105/tpc.010827.
OpenUrl Abstract/FREE Full Text

[39] ↵
McKown A. D., R. D. Guy, L. Quamme, J. Klápště, J. La Mantia et al., 2014 Association genetics, geography and ecophysiology link stomatal patterning in Populus trichocarpa with carbon gain and disease resistance trade-offs. Mol. Ecol. 23: 5771–5790. https://doi.org/10.1111/mec.12969.
OpenUrl CrossRef

[40] ↵
Melotto M., W. Underwood, J. Koczan, K. Nomura, and S. Y. He, 2006 Plant stomata function in innate immunity against bacterial invasion. Cell 126: 969–98. https://doi.org/10.1016/j.cell.2006.06.054.
OpenUrl CrossRef PubMed Web of Science

[41] ↵
Nosil P., B. J. Crespi, and C. P. Sandoval, 2002 Host-plant adaptation drives the parallel evolution of reproductive isolation. Nature 417: 440–443. https://doi.org/10.1038/417440a.
OpenUrl CrossRef PubMed Web of Science

[42] ↵
Pang Q., J. B. Hays, I. Rajagopal, and T. S. Schaefer, 1993 Selection of Arabidopsis cDNAs that partially correct phenotypes of Escherichia coli DNA-damage-sensitive mutants and analysis of two plant cDNAs that appear to express UV-specific dark repari activities. Plant Mol. Biol. 22: 411–426. https://doi.org/10.1007/BF00015972.
OpenUrl CrossRef PubMed Web of Science

[43] ↵
Pritchard J. K., and N. A. Rosenberg, 1999 Use of unlinked genetic markers to detect population stratification in association studies. Am. J. Hum. Genet. 65: 220–228. https://doi.org/10.1086/302449.
OpenUrl CrossRef PubMed Web of Science

[44] ↵
Pritchard J. K., M. Stephens, N. A. Rosenberg, and P. Donnelly, 2000 Association mapping in structured populations. Am. J. Hum. Genet. 67: 170–181. https://doi.org/10.1086/302959.
OpenUrl CrossRef PubMed Web of Science

[45] ↵
Rocha P. S., M. Sheikh, R. Melchiorre, M. Fagard, S. Boutet et al., 2005 The Arabidopsis HOMOLOGY-DEPENDENT GENE SILENCING1 gene codes for an S-adenosyl-L-homocysteine hydrolase required for DNA methylation-dependent gene silencing. Plant Cell 17: 404–417. https://doi.org/10.1105/tpc.104.028332.
OpenUrl Abstract/FREE Full Text

[46] ↵
Saez A., N. Robert, M. H. Maktabi, J. I. Schroeder, R. Serrano et al., 2006 Enhancement of abscisic acid sensitivity and reduction of water consumption in Arabidopsis by combined inactivation of the protein phosphatases type 2C ABI1 and HAB1. Plant Physiol. 141: 1389–1399. https://doi.org/10.1104/pp.106.081018.
OpenUrl Abstract/FREE Full Text

[47] ↵
Santure A. W., and D. Garant, 2018 Wild GWAS‐association mapping in natural populations. Mol. Ecol. Resour. 18: 729–738. https://doi.org/10.1111/1755-0998.12901.
OpenUrl

[48] ↵
Scrucca L., M. Fop, T. B. Murphy, and A. E. Raftery, 2016 mclust 5: clustering, classification and density estimation using Gaussian finite mixture models. R J. 8: 205–233.
OpenUrl

[49] ↵
Subramanian, A., P. Tamayo, V. K. Mootha, S. Mukherjee, B. L. Ebert et al., 2005 Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. 102: 15545–15550. https://doi.org/10.1073/pnas.0506580102.
OpenUrl Abstract/FREE Full Text

[50] ↵
Talbert P. B., H. T. Adler, D. W. Parks, and L. Comai, 1995 The REVOLUTA gene is necessary for apical meristem development and for limiting cell divisions in the leaves and stems of Arabidopsis thaliana. Development 121: 2723–2735.
OpenUrl Abstract

[51] ↵
Theoharides T. C., 1980 Polyamines spermidine and spermine as modulators of calcium-dependent immune processes. Life Sci. 27: 703–713. https://doi.org/10.1016/0024-3205(80)90323-9.
OpenUrl CrossRef PubMed Web of Science

[52] ↵
Tran L. S., T. Urao, F. Qin, K. Maruyama, T. Kakimoto et al., 2007 Functional analysis of AHK1/ATHK1 and cytokinin receptor histidine kinases in response to abscisic acid, drought, and salt stress in Arabidopsis. Proc. Natl. Acad. Sci. 104: 20623–20628. https://doi.org/10.1073/pnas.0706547105.
OpenUrl Abstract/FREE Full Text

[53] ↵
Visscher P. M., N. R. Wray, Q. Zhang, P. Sklar, M. I. McCarthy et al., 2017 10 years of GWAS discovery: biology, function, and translation. Am. J. Hum. Genet. 101: 5–22. https://doi.org/10.1016/j.ajhg.2017.06.005.
OpenUrl CrossRef PubMed

[54] ↵
Weig A., C. Deswarte, and M. J. Chrispeels, 1997 The major intrinsic protein family of Arabidopsis has 23 members that form three distinct groups with functional aquaporins in each group. Plant Physiol. 114: 1347–1357. https://doi.org/10.1104/pp.114.4.1347.
OpenUrl Abstract

[55] ↵
Weir B. S., and W. G. Hill, 2002 Estimating F-statistics. Annu. Rev. Genet. 36: 721–750. https://doi.org/10.1146/annurev.genet.36.050802.093940.
OpenUrl CrossRef PubMed Web of Science

[56] ↵
Wright S., 1965 The interpretation of population structure by F‐statistics with special regard to systems of mating. Evolution 19: 395–420. https://doi.org/10.2307/2406450.
OpenUrl CrossRef Web of Science

[57] ↵
Xiong L., H. Lee, M. Ishitani, and J. K. Zhu, 2002 Regulation of osmotic stress-responsive gene expression by the LOS6/ABA1 locus in Arabidopsis. J. Biol. Chem. 277: 8588–8596. https://doi.org/10.1074/jbc.M109275200.
OpenUrl Abstract/FREE Full Text

[58] ↵
Yu J., G. Pressoir, W. H. Briggs, I. Vroh Bi, M. Yamasaki et al., 2006 A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat. Genet. 38: 203–208. https://doi.org/10.1038/ng1702.
OpenUrl CrossRef PubMed Web of Science