Assigning function to natural allelic variation via dynamic modeling of gene network induction

Magali Richard; Florent Chuffart; Hélène Duplus-Bottin; Fanny Pouyet; Martin Spichty; Etienne Fulcrand; Marianne Entrevan; Audrey Barthelaix; Michael Springer; Daniel Jost; Gaёl Yvert

doi:10.1101/140467

ABSTRACT

More and more natural DNA variants are being linked to physiological traits. Yet, understanding what differences they make on molecular regulations remains challenging. Important properties of gene regulatory networks can be captured by computational models. If model parameters can be ‘personalized’ according to the genotype, their variation may then reveal how DNA variants operate in the network. Here, we combined experiments and computations to visualize natural alleles of the yeast GAL3 gene in a space of model parameters describing the galactose response network. Alleles altering the activation of Gal3p by galactose were discriminated from those affecting its activity (production/degradation or efficiency of the activated protein). The approach allowed us to correctly predict that a non-synonymous SNP would change the binding affinity of Gal3p with the Gal80p transcriptional repressor. Our results illustrate how personalizing gene regulatory models can be used for the mechanistic interpretation of genetic variants.

INTRODUCTION

In the past decade, countless DNA variants have been associated to physiological traits. A major challenge now is to understand how they operate at the molecular level. This is a difficult task because the mechanistic consequences resulting from each variant are not easy to identify. Even when the function of a gene is well documented, investigators need to determine the tissues, cells or organelles in which a mutant allele makes a biological difference, the developmental stage at which this may happen, the metabolic or regulatory network that may be involved, as well as possible molecular scenarios. A mutation may alter the regulation of transcription or mRNA splicing; the enzymatic activity of the target protein; its rate of production, maturation, or degradation; its intracellular localisation; its binding affinity to an interacting partner or the specificity of its molecular interactions. In the vast majority of cases, information from the DNA sequence alone is not sufficient to delimit the perimeter of possible implications.

Systems biology has opened new opportunities to better predict the action of DNA variants. First, ‘omics’ data that are gathered at various levels (DNA, transcripts, proteins, metabolites…) establish relations between target sequences and functional pathways. Information about molecular and genetic interactions, expression profiles, chromatin landscapes, post-transcriptional and post-translational regulations can be exploited to derive functional predictions of DNA variants. Various methods have been proposed to do this, such as Bayesian genetic mapping¹, visualization of SNPs on relational protein networks², prioritization based on negative selection³, or inference of miRNA:RNA binding defects⁴. In addition, structural data of biomolecules can also highlight functional perturbations in specific domains such as catalytic sites or interaction surfaces^5,6.

Another alternative is to model the quantitative and dynamic properties of molecular reactions and to explore which feature(s) may be affected by a DNA variant. The functional consequences of mutations can then be inferred by considering their impact on specific parameters of the model. In other words, assigning function to a DNA variant may be straightforward after it is linked to parameters of a model. This perspective may also, on the long term, generate developments in personalized medicine: if a model can be personalized according to the patient’s genotype then it can help predict disease progress or treatment outcome and therefore adapt medical care to the patient’s specificities. For this to become reality, the model must be i) informative on the biological trait of interest and ii) identifiable and sufficiently constrained so that model parameters can be reliably inferred, accounting for the patient’s specificities. These two requirements antagonize each other regarding the complexity of the model to be used. The former asks for completeness: the molecular control of the trait must be correctly covered by the model, describing known reactions as best as possible. The latter asks for simplicity: if too many parameters are allowed to be adjusted to the data, then the validity of the personalized model is questionable and none of the adjustments are informative. It is therefore important to determine if and how personalizing model parameters can be productive.

For a given molecular network, individuals from natural populations have different genotypes at several nodes (genes) of the network, as well as in numerous external factors that can affect the network properties. Such external factors can modify, for example, global translation efficiencies, metabolic states, or pathways that cross-talk with the network of interest. Adapting model parameters to specific individuals is challenging when so many sources of variation exist. A way to circumvent this difficulty is to study the network experimentally in the context of a more reduced and focused variation. If investigators have access to nearly-isogenic individuals that differ only at specific genes of the network, they can then characterize the differences in network behaviour that result from these specific allelic differences. The numerous external factors affecting the network can then be ignored or drastically simplified in the model because they are common to all individuals. This way, the parameter space is constrained and only potentially-informative parameters are allowed to be adjusted to fit individual-specific data.

Some model organisms such as the yeast S. cerevisiae offer this possibility. They can be manipulated to generate single allelic changes, which provides an ideal framework to link DNA variants to model parameters. In particular, the gene regulatory network controlling the yeast response to galactose (GAL network) is well characterized, both in vivo and in silico. This circuit controls galactose utilization by upregulating the expression of regulatory and metabolic genes in response to extracellular galactose⁷. Regulation is based on the transcriptional activator Gal4p, the galactose transporter Gal2p, a signal transducer Gal3p and the transcriptional inhibitor Gal80p. In addition, the galactokinase Gal1p, involved in galactose metabolism is also a coinducer of the response⁸. This system can display either a gradual induction (where the rate of transcription progressively increases in each cell according to the timing and intensity of the stimulus) or a probabilistic induction (where the probability of having high/low rate of transcription in each cell varies). This dual behaviour has received a lot of attention and important molecular features have been elucidated by experimental and theoretical approaches^9–12. In particular, the dynamic response of a population of cells to galactose can be described by two quantities: the inducibility of the network is defined as the proportion of activated cells in the population, and the amplitude of the response refers to the expression level that is reached by induced cells. Regulatory feedback loops of the network are critical to the switch-like behaviour. They were shown to feed back the dynamics of transcription bursts rather than the levels of expression¹³. They regulate the amplitude response by reducing noise in GAL gene expression¹⁴, they control the inducibility by fine-tuning the timing of the switch¹⁴, and they participate to the memory of previous inductions^15,16. As a consequence, bimodal distributions of expression of the GAL genes can be observed in isogenic populations exposed to intermediate concentrations of inducer^17–19, and this population heterogeneity can confer a growth advantage during the transition from glucose to galactose metabolism (diauxic shift)²⁰. Interestingly, wild yeast isolates present diverse types of induction dynamics during the diauxic shift, ranging from strictly unimodal to transient bimodal distribution of expression levels^21,22. This indicates that natural genetic variation can modify the network dynamics.

The GAL3 gene plays a central role in the network. Its protein product Gal3p is activated by binding to galactose and ATP and then binds as a dimer to Gal80p dimers to release the repression on Gal4p at target promoters⁷. The protein is enriched in the cytoplasm prior to stimulation and in the nucleus after the stimulation, although this cyto-nuclear transfer does not account for the dynamics of activation^23,24. Expression of GAL3 is itself under Gal4p/Gal80p control (positive feedback). In addition, the sequence of GAL3 differs between natural isolates of S. cerevisiae and this allelic variation was recently associated to different sensitivities of the network to galactose (Lee et al. PLoS Genetics, in press). There are multiple ways that a GAL3 variant could affect the dynamics of induction: by modifying the production or degradation rates of the Gal3p protein or of its messenger RNA, by changing the affinity of Gal3p to galactose or ATP, by changing the capacity of Gal3p to dimerize, by changing the nucleocytoplasmic ratio of Gal3p molecules, or by changing the affinity of Gal3p to Gal80p. A GAL3 variant may also affect the background expression level of Gal3p prior to stimulation, which is known to be critical for network memory of prior stimulations²⁵. Thus, it is difficult to predict the functional consequence of sequence variation in GAL3.

Using the yeast GAL3 gene as a model framework, we show here that experimental acquisitions combined with network modeling is efficient to predict the effect of sequence variants. The principle of the approach is to link genetic variation to informative changes of parameter values of the model. We show that replacing natural GAL3 alleles can be sufficient to transform a gradual response into a probabilistic activation, and the approach allowed us to distinguish between different types of GAL3 alleles segregating in S. cerevisiae populations: those altering the activation of Gal3p by galactose, and those altering the strength with which activated Gal3p alleviates the transcriptional inhibition operated by Gal80p. In particular, our approach was efficient to associate a non-synonymous SNP with a change of binding affinity for Gal80p.

RESULTS

Natural variation in GAL3 affects the dynamics of network induction

We constructed a panel of yeast strains that were all isogenic to the reference laboratory strain BY, except for GAL3. At this locus, each strain carried an allele that was transferred from a natural strain of the Saccharomyces Genome Resequencing Project²⁶ (Supplementary Fig. 1). All strains of the panel also harboured a P_GAL1-GFP reporter of network activity, where the promoter of the GAL1 gene controlled the expression of a GFP fluorescent protein destabilized by a degradation signal^27,28. GAL1 is a paralogous gene of GAL3²⁹ and transcription at its promoter is commonly used as a proxy of GAL network activity^15,20,22. Using flow cytometry, we monitored the dynamics of network activation in each strain (Fig. 1). This was done by first culturing cells for 3 hours in a medium containing 2% raffinose, a sugar known to be neutral on network activity, adding galactose (0.5% final concentration), and quantifying fluorescence at multiple time points for 4 hours. Significant differences in the dynamics of activation were observed between the strains. Those harbouring the GAL3^NCYC361, GAL3^K11, GAL3^BY, GAL3^DBVPG1788, GAL3^DBVPG1853 and GAL3^JAY291 alleles displayed a gradual response, all cells of the population were induced and responded with similar rate of expression, maintaining population homogeneity (see example shown in Fig. 1a). In contrast, strains harbouring the GAL3^Y12 and GAL3^YJM978 alleles displayed a binary response, with a transient co-existence of induced (ON) and uninduced (OFF) cells in the population (example in Fig. 1b).

Figure 1. Dynamic response to galactose in the context of GAL3 variants.

Acquisitions were made on strains where the GAL3 allele was replaced by the indicated natural alleles. These strains were otherwise isogenic, with a BY background. (a-b) Flow-cytometry data obtained on strains harboring the GAL3^NCYC361 allele (a) or the GAL3^Y12 allele (b). Cells were cultured in raffinose 2% and induced at time 0 by adding galactose at a final concentration of 0.5%. a. u., arbitrary units. Grey dashed line, threshold used to distinguish ON cells from OFF cells. (c) Amplitude of the response (mean expression) as a function of time for each GAL3 replacements strain. Error bars represent standard error of the mean (n=6). (d) Inducibility of the response (fraction of ON cells) as a function of time for each GAL3 replacement strain. Error bars represent standard error of the mean (n=6).

We quantified induction using two metrics: the mean level of reporter expression in activated cells (response amplitude), and the proportion of activated cells in the population (inducibility of the network). We observed that the response amplitude varied little among the strains, all of them approaching steady state with comparable kinetics (Fig. 1c). In contrast, inducibility of the network differed between strains (Fig. 1d). As expected, in strains showing a gradual response, the fraction of ON cells increased significantly during the first two hours of induction, reaching full inducibility (all cells activated) by the end of the experiment. On the opposite, the strains showing a transient binary response displayed reduced inducibility over time. For instance, 21% of GAL3^Y12 cells were still not induced after 250 minutes of stimulation. These results indicate that natural genetic variation in GAL3 is sufficient to modify the inducibility of the network and to convert a gradual response into a binary response, or vice versa.

A quantitative model of inducibility over time

To examine what functional properties of the GAL3 gene could determine a gradual or a binary response, we constructed a dynamic stochastic model of the network (Fig. 2a). We based our quantitative model on the following current molecular knowledge. In absence of galactose, a homodimer of the transcription factor Gal4p is constitutively bound to upstream activation sites (UAS) of promoter regions of GAL genes. However, transcription is inactive because of the homodimeric Gal80p inhibition of Gal4p^30,31. When intracellular galactose binds Gal3p, it changes conformation and associates with Gal80p dimers³², thereby releasing Gal80p from promoters and allowing Gal4p-mediated transcriptional activation. It was initially thought that activated Gal3p sequestered Gal80p in the cytoplasm, preventing it from its inhibitory role in the nucleus³⁰. Later studies revised this view by showing that Gal3p molecules were not exclusively cytoplasmic²³, that forcing Gal3p to be mostly nuclear did not alter the kinetics of induction²³, and that the dynamics of nucleocytoplasmic trafficking were too slow to explain the fast induction of transcription²⁴. This implies a direct role of Gal3p in promoting the dissociation of Gal80p from UAS. In addition, the galactokinase Gal1p (a paralog of Gal3p) can also act as a co-inducer of the regulatory circuit, presumably using similar mechanisms as Gal3p¹⁸.

Figure 2. In-silico model of network induction.

(a) Schematic representation of the model used. Galactose-activated Gal1p and Gal3p proteins become Gal1p* and Gal3p*, respectively. Pointed and blunt arrows represent activation and inhibition, respectively. Positive and negative feedback loops are highlighted by + and - signs. (b) Example of a gradual response predicted by the model ([gal]=0.5%, ρ_Gal3=140 and K_Gal=0.055). Thin violet lines represent stochastic simulations of network activation in individual cells. Dashed red line represents the threshold distinguishing ON from OFF cells. Green thick line indicates the fraction of ON cells as a function time. (c) Example of a binary response predicted by the model ([gal]=0.5%, ρ_Gal3=40 and K_Gal=0.055). Same color code as in (b).

Our model covers the mRNA and protein species of three major players of GAL network induction: GAL1, GAL3 and GAL80, as well as of the reporter gene. We considered that promoters of each GAL gene could switch between an ON state (full transcription) and an OFF state (leaky transcription) at rates that depended on the concentration of Gal80 dimers, activated Gal3p dimers and activated Gal1p dimers. A detailed description of the model is given in Materials and Methods and in Supplementary Text 1. Most of the parameters of the model were fixed at values obtained from previous studies (Table S1).

Stochastic simulations reproduce the two types of induction observed experimentally

We first explored if our model captured the two types of responses of allele-replacement strains (i.e. binary and gradual). We ran stochastic simulations³³ that accounted for intrinsic and extrinsic sources of noise (see Supplementary Text 1). We observed that tuning the parameters related to GAL3, while keeping all other parameters constant, was sufficient to modify inducibility and to obtain either a gradual (Fig. 2b) or a binary (Fig. 2c) response of the network at a given concentration of galactose. In the gradual system, the simulated single-cell trajectories were all similar; in the binary system, the simulated single-cell trajectories bifurcated with a subset of cells having a stochastic lagging time before responding. The single-cell value of this lag time is directly correlated with the number of potential inducer proteins (Gal1p and Galp3p) present in the cell just before induction (Supplementary Fig. 2). This is in very good agreement with recent single-cell experiments on galactose induction²⁵.

We then studied the response predicted by the model when stimulating the network with various concentrations of galactose while keeping model parameters constant (Supplementary Fig. 3). Inducibility increased with the concentration of galactose, with low concentrations causing a probabilistic induction (binary) and high concentrations a deterministic one (gradual).

Two parameters related to GAL3 control network behaviour

A detailed analysis of the model showed that inducibility of the system was mainly controlled by the average values of promoter switching rates k_on and k_off at the time of induction (see Material and Methods, Supplementary Text 1 and Supplementary Figs. 2 and 4). Rates k_off depend only on GAL80 and are therefore invariant to GAL3 allelic variation. Rates k_on depend on GAL3 in two ways: via Gal3p*, the amount of galactose-activated Gal3p, and via K₃, which corresponds to an effective concentration encompassing the dissociation constants of the Gal3p-Gal80p interaction and of Gal3p dimerization (see Supplementary Text 1). Gal3p* is determined by the level of Gal3p and by parameter K_gal, which represents the typical concentration of galactose needed to efficiently activate Gal3p. While K_gal was identifiable, several other GAL3-related parameters, such as those controlling the level of Gal3p, were not and we grouped them in a meta-parameter, ρ_Gal3, which we termed the strength of GAL3. ρ_Gal3 corresponds to the invert ratio between K₃ and the mean concentration of Gal3p at the time of induction, which depends on the leaky transcription rate, the translation rate and the degradation rates of GAL3 mRNA and protein product.

This formalism made the network sensitive to only two identifiable GAL3-related parameters, K_gal and ρ_Gal3. At a fixed concentration of galactose induction, high ρ_Gal3 values correspond to high numbers of Gal3p dimers that can rapidly be activated to release Gal80 repression. The model predicted that high values of ρ^Gal3 would generate a gradual response (Supplementary Fig. 5a) because the number of potential activators was high enough in each cell to rapidly trigger the GAL1/GAL3-mediated positive feedback loop. In contrast, low values of ρ_Gal3 would generate a binary response (Supplementary Fig. 5b) because the number of activators is more stochastic, with many cells having too few initial Gal1p or Gal3p dimers to directly trigger the response. These cells need a lag time before fast activation (Fig.2b,c and Supplementary Fig. 2). The other important parameter, K_gal, corresponded to a threshold of galactose concentration below which induction was limited and favoured a binary response, and above which induction was efficient and favoured a gradual response (Supplementary Fig. 3c). In summary, both ρ_Gal3 and K_gal values can determine whether the network adopts a gradual or a binary response at a given concentration of galactose induction.

Linking GAL3 alleles to specific parameter values

In order to test the predictions of the model, we measured the transcriptional response of the GAL3^BY, GAL3^Y12 and GAL3^YJM978 strains at different galactose concentrations (0.05%, 0.1% and 0.5%). Our experimental observations confirmed that the inducibility increases with galactose concentration (Fig. 3a). We then used this experimental data to infer parameters ρ_Gal3 and K_Gal for each of the three strains. This was done by selecting a set of parameters that minimized a global chi2-score of deviation between the measured and predicted fractions of induced cells at different times after induction and for the different galactose concentrations (for details, see methods and Supplementary Text 1). To evaluate the usefulness of the inferred parameter values, we used the fitted model to predict the behaviour of each strain at a galactose concentration that was not used for model training (0.2%) (Fig. 3b). Finally, to test model predictions, we experimentally monitored GAL3^BY, GAL3^Y12 and GAL3^YJM978 induction at 0.2% galactose. Without any additional fitting procedure, we observed that inducibility (fraction of activated cells over time) differed between strains in a way that was entirely consistent with model predictions. Thus, the differences among parameter values assigned to the different GAL3 alleles are relevant outside the specific experimental conditions used for parameters estimation.

Figure 3. Strain-specific training of the model and validation.

(a) Model fitting. Each panel corresponds to one strain carrying the indicated GAL3 allele. Inducibility was measured by flow cytometry (data points +/− s.e.m.) after stimulating cells with three different concentrations of galactose (points colored according to the concentration). For each strain, this data was used to fit the GAL3-dependent parameters ρ_Gal3 and K_Gal. Inferred parameter values are shown. Lines in plain (resp. dashed and dotted) represent the inducibility predicted by the model at [gal]=0.5% (resp. 0.1% and 0.05%). (b)With the parameters inferred in (a) we use the model to predict the inducibility of each strain at a galactose concentration of 0.2% (lines), and this prediction was compared to experimental measures (dots +/− s.e.m.).

Natural GAL3 alleles map to distinct locations of the parameter space

We sought to classify GAL3 alleles based on the parameter values assigned to them. We made experimental measurements on two additional strains (GAL3^NCYC361 and GAL3^DBVPG1788) and we determined best-fit ρ_Gal3 and K_gal values to them as for the three strains described above. This data and the corresponding fitted models are shown in Supplementary Fig. 6. Fig. 4a,b shows the obtained parameters, ρ_Gal3 and K_Gal, normalized by the corresponding values of our reference strain GAL3^BY. Different data points represent results obtained by applying the inference process to models with different GAL3-independent parameters (see Supplementary Text 1).

The fold change of a parameter between two different strains is indicative of the functional nature of the genetic variations between the two GAL3 alleles. In agreement with the model predictions (Supplementary Fig. 3), we observed that more gradual strains (GAL3^NCYC361 and GAL3^DBWG1788) display a high GAL3 strength ρ_Gal3 and a low ‘typical’ galactose concentration K_Gal. Interestingly, we observed that ρ_Gal3 and K_Gal can be decorrelated. In particular, although both GAL3^YJM978 and GAL3^Y12 strains were binary responders at all galactose concentrations tested, the model attributed this behaviour to different functional effects: a low sensitivity to galactose (high K_gal) for the Gal3 protein originating from YJM978 and a reduced strength of the GAL3 gene originating from Y12. Thus, the induction specificities of the strains can be attributed to distinct GAL3-related parameters.

To address the direct relationship between the network properties (gradual or binary response) and the GAL3-related parameters, we positioned each of the tested strains within a phenotypic landscape according to their relative ρ_Gal3 and K_Gal parameters (Fig 4c). According to our model, ρ_Gal3 and K_Gal parameters are sufficient to predict the behaviour (gradual or binary) associated with a given GAL3 allele at a given concentration of galactose. As an illustration of these predictions, we specifically observed the dynamics of transcriptional activation of the network for the strain GAL3^DBVPG1788 (Fig. 4d). The position of the GAL3^DBVPG1788 allele on the phenotypic landscape corresponded to a transient probabilistic activation at low concentration ([gal]=0.05%) converted into a gradual response at higher concentration ([gal]=0.1% and [gal]=0.5%).

Figure 4. GAL3 alleles map to distinct locations of the parameter space.

(a-b) Parameter values obtained by fitting the model to experimental data collected on five strains at three concentrations of the inducer ([gal] = 0.05, 0.1 and 0.5%). Six independent fits were performed (one per grey line). For each one, different values of GAL3-independent parameters were chosen (see Supplementary Text 1), and parameters ρ_Gal3 (a) and K_gal (b) were estimated for each strain. Dots represent their value for the indicated strain, relative to the value estimated for the GAL3^BY strain. Error bars: uncertainty on parameter estimation for each inference (see Materials and Methods). (c) Phenotypic landscape predicted by the model. At defined concentrations of the inducer ([gal]), the values of ρ_Gal3 and K_gal determine whether the response is gradual (brown) or binary (yellow). The white zone is an intermediate region where the distinction between gradual and binary is unclear. Using parameters inferred in (a) and (b), alleles are mapped to the landscape (colored dots). Error bars: standard deviation of the 6 distinct estimations. (d) Time-course flow cytometry data of the GAL3^DBVPG1788 strain, showing its transient binary response at low concentration of inducer (left) and its gradual response at higher concentration (right).

Variation in induction dynamics is consistent with variation in diauxic shift decision

The physiological relevance of the GAL network regulation is to switch from the consumption of glucose (the preferred carbon source) to the consumption of galactose when glucose supply is running out. This diauxic switch is controlled not only by galactose induction but also by glucose-mediated repression. When both sugars are present, their relative concentration ratio determines whether cells activate the switch or not^20,21,34. At some ratio values, only a fraction of the cells are induced, even at the steady-state. Given this dual regulation, the propensity of a strain to activate GAL metabolism can be quantified by measuring the fraction of induced cells after a prolonged period (8 hours) of simultaneous induction (by galactose) and repression (by glucose). If this measurement is repeated at a given concentration of galactose and various concentrations of glucose, a useful score can be computed (called ‘decision threshold’ hereafter): the concentration of glucose needed to maintain half the population of cells in the repressed (OFF) state (Fig. 5a). A high decision threshold corresponds to an early activation of GAL genes during the diauxic shift.

Figure 5. Relationship between inducibility and diauxic shift decision threshold.

(a) Schematic representation of decision threshold measurement. The decision threshold corresponds to the concentration of glucose at which 50% of the cells are induced in the presence of 0.25% galactose. The blue curve is theoretical and shown to explain how the fraction of ON cells depends on glucose concentration. (b) Decision thresholds for strains GAL3^BY, GAL3^YJM421, GAL3^YJM428 and GAL3^BC187 at [gal] = 0.25%. (c) Schematic representation of GAL3 induction parameters determination. (d) Location of the GAL3 replacement strains in the phenotypic landscape of the model at [gal] = 0.25%. Inset: ρ_Gal3 values as a function of the decision threshold, with dots corresponding to strains.

A previous study identified GAL3 as an important genetic determinant for this decision: the concentration ratio at which cells turn GAL expression ON differs between strains harbouring different natural alleles of GAL3 (Lee et al. PLoS Genetics, in press). We asked if this variation was correlated with the variation observed on the dynamics of network induction. We chose four strains that showed different decision thresholds because of different GAL3 alleles (Fig 5b) (Lee et al. PLoS Genetics, in press) and we monitored their dynamics of induction at three different concentrations of galactose (with no glucose). We then used our model to assign ρ_Gal3 and K_Gal parameter values to each strain. Experimental data and model fitting are shown in Fig. 5b and Supplementary Fig. 7. We used the inferred parameter values to visualize the four strains in the parameter space where binary and gradual responses upon stimulation at [gal] = 0.25% are delimited (Fig. 5d). Remarkably, the properties of induction dynamics in absence of glucose were fully consistent with the decision threshold during diauxic shift from glucose to galactose. Strains having a low decision threshold, such as GAL3^YJM421, displayed a transient binary response, and strain GAL3^BC187 had a high decision threshold and responded gradually. Coordinates of strains in the parameter space indicate that ρ_Gal3 values are highly informative on the decision threshold (Fig. 5d). Thus, mapping allelic variation to dynamic parameters of induction is also useful to understand trade-offs that are observed at steady-state.

A quantitative parameter change predicts a role of H352D SNP on Gal3:Gal80 complex formation

We noticed that, at position 352 of the Gal3p protein, all natural strains harboured an aspartic acid, whereas the reference laboratory strain BY harboured a histidine. This aspartic acid was also conserved in S. mikatae, S. paradoxus and S. uvarum protein sequences³⁵. Given the prevalence of this aspartic acid, we hypothesized that a single H352D amino-acid change could have consequences on Gal3p regulatory function.

To test this, we generated an artificial GAL3^BY-H352D allele by introducing the H352D mutation in the GAL3^BY strain and we monitored the dynamics of induction of the resulting strain. At similar concentrations of galactose, induction was faster for the modified strain than for the original strain (compare Fig. 6a with Fig. 2a). We then used our model to make functional predictions. We fitted our model to experimental data of induction as described above for natural alleles. Induction dynamics of the modified strain were fully explained by preserving parameter K_Gal and increasing ρ_Gal3 (Fig. 6b). This suggested that the H352D mutation did not affect activation of Gal3p by galactose but rather the strength of Gal3p, which summarizes six biochemical features: the basal level of GAL3 transcription prior to induction, its translation and degradation rate, the degradation rate of its coding mRNA, its capacity to homodimerize and the affinity of activated Gal3p for Gal80p.

Figure 6. Functional inference of the H352D variant of GAL3.

(a). Experimental acquisitions (dots) and model fitting (curves) of the induction dynamics of the GAL3^BY-H352D strain. (b) GAL3^BY (blue dot) and GAL3^BY-H352D (grey dot with standard deviation bars) strains localisation in the phenotypic landscape of the model at [gal]=0.5%. Arrows: phenotypic trajectory between the two alleles. (c) Structure of the tetrameric complex [Gal3p*]₂[Gal80p]₂ (PDB entry 3V2U). Residue His352 of one Gal3p unit is in the back side and not visible. The His352 residue of the other Gal3p unit is shown as green beads in the center; it is located at the binding interface of the Gal3p* dimer (white beads) and the Gal80p dimer (colored surface). Gal80p residues are colored according to their electrostatic surface potential from red (≤ -10 kT/e) to blue (≥+10 kT/e). Yellow beads: the acidic activation domain of Gal4p was inserted in the complex by superimposition with crystal structure 3BTS. A similar insertion in the other Gal80p unit is in the back side and not visible. Created with VMD software. (d) Local stabilization of Gal3p-Asp352 by residues Gal3p-Arg362 and Gal80p-Lys287 in the [Gal3]₂[Gal80]₂ complex. Green and white labels refer to residues from Gal3p and Gal80p units, respectively. The figure shows a snapshot from a molecular dynamics simulation of the mutation H352D carried out for a model system of the complex (see Supplementary Text 2). Atoms within 15 Å of residue 352 are shown as thin sticks in white (Gal3p) or dark grey (Gal80p). Remaining atoms are shown as a solid surface. Created with VMD software. (e) Thermodynamic cycle quantifying the energetic impact of the H352D mutation on the substitution of [Gal4p]₂ by [Gal3p]₂ as binding partner of [Gal80p]₂ (ΔG_sub, horizontal arrows). This impact is measured as ΔΔG = ΔG_sub^D352 - ΔG_sub^H352, which equals to ΔG_alchem^tetramer - ΔG_alchem^dimer (vertical arrows) because free enthalpy is a state function. These latter quantities correspond to the free enthalpy change for the alchemical (double) mutation of His>Asp in the Gal3p-Gal80p tetramer and in the Gal3p dimer, respectively, which were computed by alchemical free energy calculations (see Supplementary Text 2).

How the implicated SNP could change either the leaky transcription level prior to induction or the transcription rate during induction is difficult to imagine. In addition, the amino-acid change was not surrounded by any particular peptide motif, nor was it located at the extremity of the protein. This did not support for an effect on translation or degradation rates. Thus, the most plausible interpretation of the parameter change of the model was that the H352D modification would increase either the capacity of Gal3p* to dimerize or the affinity of the Gal3p* dimer for Gal80p.

To explore these possibilities, we analyzed the structure of the heterotetramer [Gal3p*]₂-[Gal80p]₂ that was previously solved³². We made three important observations. First, His352 is located at the binding interface of the Gal3p* dimer with the Gal80p dimer (Fig. 6c), and distant from the pocket containing galactose and ATP. Secondly, it is spatially close to the Gal80p site where the acidic domain of Gal4p is known to bind³⁶. Finally, the Gal80p dimer exhibits a positive electrostatic surface potential in the vicinity of Gal3p-His352, suggesting that the replacement of the neutral His352 by a negatively charged aspartic acid would stabilize the Gal3p*-Gal80p complex. Stabilization refers here to a gain in thermodynamic stability relative to the Gal4p-Gal80p complex, or in other words, to a decrease of Gibbs free energy change (ΔG_sub) for the substitution of the Gal4p dimer by the Gal3p dimer as binding partner of the Gal80p dimer. A molecular dynamics simulation of the Asp352 mutant (in a model system of the Gal3p*-Gal80 complex) indicates that two positively charged amino acids, Gal3p-Arg362 and Gal80p-Lys287, are able to form direct salt bridges with Asp352 (Fig. 6d). These attractive interactions of Asp352 with its environment are, however, expected to be partially cancelled out by repulsive interactions with the less proximate, negatively charged amino acids Gal3p-Glu363 and Gal80p-Glu348 (Fig. 6d). Also, the polar solution (water + counter ions) could partially reduce the stabilization effect of the H352D mutation because residue 352 is better solvated in the Gal3p* dimer than in the Gal3p*-Gal80p tetramer. Thus, to quantify a possible stabilization effect of the H352D mutation, we computed the change in the Gibbs free energy difference, ΔΔG_sub = ΔG_sub^D352 - ΔG_sub^H352, with the aid of the thermodynamic cycle depicted in Fig. 6e. The actual free energy calculations (see Methods) yielded ΔΔG_sub = -2.8 ± 0.9 kcal/mol, which indicates that the H352D mutation indeed increases the thermodynamic stability of the Gal3p*-Gal80p complex with respect to the Gal4p-Gal80p complex. Thus, as predicted by the dynamic model of network induction, the H352D mutation increases the cellular response by facilitating the formation of the complex.

DISCUSSION

We experimentally monitored the induction dynamics of the yeast GAL network in the context of natural genetic variation at the GAL3 gene. We built a stochastic model of the network and used it to link GAL3 alleles to functional network parameters. This approach discriminated alleles that increased the strength of activated Gal3p (e.g. of strains NCYC361 and DBVPG1788) from alleles that desensitized Gal3p to galactose activation (e.g. of strain YJM978). Alleles showing different glucose/galactose trade-offs at equilibrium displayed different dynamics of induction, and they were associated to different strength of activated Gal3p. Our approach also predicted a functional effect of a single non-synonymous SNP that was validated by atomistic simulations of the binding interface between Gal3p and Gal80p dimers. These results provide further details on the yeast GAL system and, perhaps more importantly, they constitute a proof-of-concept of the feasibility and usefulness of linking genetic variants to model parameters.

Genetic variability of the yeast GAL network

Our in vivo and in silico analysis of the induction kinetics of yeast GAL activation reveals properties of this system and how it is sensitive to genetic variation. Previously, several computational models of the network have been proposed, usually in an effort to understand the properties of the system at steady-state^11,15,18. Particularly, they highlighted the important role of Gal3p, Gal1p and Gal80p-mediated feedback loops. Our in silico analysis suggests that the gradual or binary kinetic response is mainly controlled by the initial number of repressors (Gal80p) and inducers (Gal1p and Gal3p), the efficacy of galactose to activate the inducers and the efficiency of the activated inducers to release the effect of repressors. In particular, a low mean number of inducers at the time of induction may lead to high cell-to-cell variability in their actual number. Cells with few inducers (as compared to repressors) display a lag time before responding, leading to a binary response pattern at the population level. This prediction from our model is fully consistent with recent experiments that tracked the induction of the network at the single-cell level and showed that the initial concentrations of Gal1p and Gal3p are predictive of the transient bimodal response²⁵. We also observed that feedback loops were important to control the strength of cell-to-cell variability before induction (Gal80-mediated negative feedback) and the duration of lag times (Gal3/Gal1-mediated positive feedbacks), which agrees with the previous observation that disabling the Gal80p and Gal3p feedback loops can transform a gradual response into a binary one¹⁴. Our results on GAL3 genetic variants also complement previous genetic manipulations of the feedback loops, where their effect on bimodality was tested by modulating promoter activities^14,15,18,19. Here, we showed that a non-synonymous variant affecting Gal3p:Gal80p interaction directly affects the dynamics of transient bimodality. This is a novel experimentally-based observation that is totally coherent with the conclusions of Venturelli et al. who showed computationally that steady-state bimodality of the network could rely on protein-protein binding affinities¹⁸.

We also observed that genetic variation at GAL3 could affect its propensity to be activated by galactose/ATP binding. In particular, the GAL3^YJM978 allele was associated with increased values of the K_Gal parameter (more galactose needed for its activation). This allele harboured 3 non-synonymous SNPs: M179I, R312I and H352D. As shown above, H352D is found in all natural alleles that we tested and it therefore does not explain a change in K_Gal specifically for GAL3^YJM978. According to the structure of the Gal3p:Gal80p tetramer complexed with galactose and ATP³², the other two polymorphic sites do not map close to the pocket containing the ligands. Met179 is located at the surface of the complex, distant from any binding interface and distant from the bound galactose (30 Å) and ATP (25 Å). The mutational effect of the rather conservative amino acid change (methionine to isoleucine) on the K_Gal parameter might therefore be negligible. In contrast, the non-conservative arginine to isoleucine mutation at site 312 could influence K_Gal in several ways: first, the positively charged arginine contributes favorably to the binding of the negatively charged ATP through long-range electrostatic interactions. The charge-neutral Ile312 variant lacks this favorable interaction and may have lower affinity for ATP, thereby penalizing activation by the two ligands. Second, residues Arg312 of the two Gal3p units are in direct contact with each other, and the non-conservative R312I change may affect the dimerization of Gal3p. Lavy et al.³² reported that, in absence of galactose, Gal3p is monomeric in solution and adopts an open conformation that differs from the conformation generating the Gal3p:Gal3p dimeric interface found upon interaction with Gal80p. If the R312I modification alters Gal3p dimerization, this could modify the overall activation by galactose because these processes are coupled.

We observed that genetic variation of the strength of activated Gal3p (ρ_Gal3), estimated from the dynamic properties of network activation, was correlated with variation of the glucose/galactose trade-off at steady state. This implies that the two traits co-evolve in natural populations of S. cerevisiae. Given the relatively short time-scale of network induction, mild differences in the dynamics of activation alone are unlikely to cause fitness differences unless environmental galactose concentrations are highly dynamic. In contrast, variation in the sensitivity of the network to the ratio of external sugars corresponds to the triggering of an adaptive metabolic process, which is highly related to fitness even for slow environmental changes. The induction dynamics that we observed on short time-scales are probably not themselves under selection, but they provide valuable information on the molecular mechanism affecting a fitness-related trait operating on longer time scales.

The H352D variant is interesting in this regard. At this position in Gal3p, a histidine residue was found in all laboratory strains (BY4741, CEN.PK, D273-10B, FL100, FY1679, JK9-3d, SEY6210, W303, X2180-1A, YPH499), while nearly all natural isolates as well as distant species possess an aspartic acid. Our results showed the importance of this aspartic acid for interaction with Gal80p, which suggests that its conservation in wild population results from purifying selection. The presence of slightly-deleterious mutations in laboratory strains is well-known. Examples from the reference strain BY/S288c include mutations in AMN1 ³⁷, BUL2 ³⁸, ERC1 ³⁹, FLO8 ⁴⁰, GPA1 ³⁷ and HAP1 ⁴¹. These mutations likely resulted from a release of purifying selection caused by strong population bottlenecks when propagating yeast on petri dishes. As for the genes listed above, the implication for GAL3 is that most mechanistic studies refer to a “Wild-Type” protein that is in fact a slightly-hypomorphic allele not found in nature.

We also noted cases where the specificities of a GAL3 allele in the context of the BY strain did not reflect the properties of the donor strain. An extreme example of this was the GAL3^DBVPG1853 allele which improved the response of the BY strain (Fig. 1d) while the DBVPG1853 strain itself did not respond at all to galactose (not shown), presumably because of genetic defects in other genes. Background-specific effects are common and should be taken into account when interpreting the functional impact of natural alleles in their original strain context⁴².

Linking DNA variants to model parameters: feasibility and potential

We developed our approach using a model system, the yeast GAL network, which was an ideal context for investigation: molecular players were well known, important network properties had been previously described, genetic engineering could be used to study the effect of a single gene in an otherwise isogenic background, and experimental measurements were relatively cheap. If network modeling had provided no added value in such a context, it would be hard to imagine how it could be useful in more complex frameworks. We report that it did: observing different dynamics experimentally was not sufficient to make functional inferences, but combining data and modeling was. The concept is therefore fruitful and it is interesting now to consider how it can be extended to other biological systems.

First, it is important to realize that inference is based on the wealth of information contained in the dynamics of activation. Evidently, studying the system at equilibrium would not be sufficient. Mapping DNA variants to model parameters is therefore promising for systems where time-course data is available.

Second, even in the simple context of our study, not all parameters of the model were identifiable and it was necessary to aggregate several of them into a meta-parameter (ρ_Gal3). We admit that this constitutes a limit of the approach: when the H352D SNP was linked to this meta-parameter, additional assumptions were needed to infer biochemical effects. Similar difficulties will likely be encountered in other systems and the identifiability and sensitivity analysis of the model is therefore crucial to determine the nature of biological information that can be retrieved by the approach.

Third, our method here was to infer function and then to validate a prediction by exploring the structural data of a protein complex. Depending on the system under consideration and the data available, it may be judicious to reverse the approach: scanning protein structures first in order to identify variants modifying binding affinities and then studying these variants specifically using experimental measurements and model-fitting. This way, a parameter change is first inferred from structural data and a dynamic model of the network then allows one to predict its phenotypic effect. The SAAP database⁶, which registers structurally-relevant variants of human proteins, may constitute a very helpful resource to do this.

Fourth, while we based our approach on cell population distributions, tracking the response dynamics of individual cells over time is also possible²⁵ and can provide more information on the network response. In other contexts, such methods had been very useful to infer parameters associated to individual cells⁴³. A variant may then be associated to one parameter by a whole distribution of values, which likely carries more information than a single scalar value as presented here.

Fifth, additional work is now needed to extend the approach to more than one gene. At the level of an entire network, the overall genotype of the individual is a combination of alleles. The number of such combinatorial genotypes of the network segregating in natural populations can be very large and mapping this diversity to the parameter space would be very interesting. In particular, models accounting for genetic changes might predict and explain genetic interactions (epistasis) within the network. The challenge to achieve this will likely reside in the number of free parameters: if the genotype is allowed to vary at too-many genes, parameters cannot be constrained efficiently. Mapping variants one gene at a time, as we did here, and then in combination would maintain this necessary constraint while evaluating epistasis. A more difficult task would be to infer the contribution of genes that are external to the network while nonetheless affecting its behavior (e.g. by modifying widely transcription rates or the stability of proteins, or cross-talks with other networks). Studying these factors by our approach is only possible after they are identified and connected to the network. Their identification can be obtained by genetic mapping. For example, we recently identified a locus on yeast chromosome V that affects the variability of the GAL response at transient times of activation²⁸. Once identified, these factors must be integrated in the network model, which may be a complex task.

Network modeling is expected to help the development of personalized medicine and the fact that it is possible, in a yeast system, to personalize model parameters according to DNA variants is encouraging. Can the approach described here be applied to human variants? This requires overcoming several difficulties that could be avoided in our framework. First, most regulatory networks of human systems are incompletely known. Second, most of these networks comprise numerous genes, implying many model parameters and, possibly, too-many degrees of freedom for adjustments and identifiability issues. The first task is therefore a careful identifiability and sensitivity analysis of the model and, as much as possible, a reduction of its complexity. The work of Zhao et al.⁴⁴ is encouraging in this regard. The authors studied the mitochondrial outer membrane permeabilization network controlling entry in apoptosis. Their model comprised ~50 parameters and ~20 molecular species, but the network critical behaviour (bifurcation point) was sensitive to less than half of the parameters. The authors then searched for enrichment of cancer mutations in protein domains involved in molecular interactions and they used molecular dynamics simulations to estimate the affinity changes caused by these mutations. Interestingly, most mutations that were predicted to affect sensitive parameters of the model caused a significant change of affinity in the expected direction, illustrating that the model was able to highlight relevant vulnerabilities. Similarly, Nijhout et al.⁴⁵ studied a model of the folate-mediated one carbon metabolism system. They reported that human mutations that strongly perturb enzymatic activities could have little phenotypic effect if they targeted parameters that are poorly sensitive. Another type of difficulties when studying human networks are experimental limitations: manipulating human cells needs more time and funds than manipulating yeast; replacing alleles of specific genes is possible via CRISPR/Cas9 editing but the large physical size of human genes as well as the functional redundancy between paralogues can be problematic; and setting up dynamic experimental acquisitions is often not straightforward. Thus, applying our approach to a minimal network in human cells compatible with genetic editing and time-series acquisitions will probably constitute an important step in the near future.

METHODS

Yeast strains and plasmids

The strains used is this study are listed in Table S3. We used the strain BY4711 (GY145, isogenic to s288c) as BY reference strain. The P_GAL1GFP reporter cassette was obtained from plasmid pGY338 previously described²⁸. pGY338 was linearized by NheI and integrated at the HIS3 locus of BY4711 to create strains GY1648 and GY1649, two independent transformants. To replace endogenous GAL3^BY allele by natural variants in GY1648 strain, we PCR amplified the TRP1-GAL3 locus of natural wild isolates using primers 1D28(5’-AGAGGCGGTGGAGATATTCCTTATG-3’) and 1D56(5’-ACGTCCGCTATACCTTCGTTTTCTC-3’). The endogenous locus was then replaced by in vivo homologous recombination and positive transformants were selected on SD-TRP plates. GAL3^NCYC361, GAL3^K11, GAL3^Y12, GAL3^DBVPG1788, GAL3^DBVPG1853, GAL3^YJM978, GAL3^JAY291 were PCR amplified from NCYC3451, NCYC3452, NCYC3445, NCYC3311, NCYC3313, NCYC3458 (wild isolates from the Saccaromyces Genome Resequensing Project, SGRP^26,46) and JAY291 (Argueso et al.⁴⁷), respectively. The strains used to characterize the effect of natural variants on galactose response were GY1648, GY1689, GY1692, GY1695, GY1698, GY1704, GY1707 and GY1713, all isogenic to S288c except for GAL3^BY, GAL3^NCYC361, GAL3^K11, GAL3^Y12, GAL3^DBVPG1788, GAL3^DBVPG1853, GAL3^YJM978, GAL3^JAY291, respectively. Strains genotype was verified by PCR and either high-resolution melting curves, restriction fragment length polymorphism typing or sequencing. The TRP1-GAL3 locus from BY strain was PCR amplified with primers 1M95 (5’-tctttcattatgtgagagtttaaaaaccagaaactacatcatcgaaaaagggatccAGAGGCGGTGGAGATATTCCTTATG-3’) and 1M96 (5’-cgcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcagctgACGTCCGCTATACCTTCGTTTTCTC-3’) and cloned into HpaI-linearized plasmid pALREP³⁹ by homologous recombination in yeast, generating plasmid pGY409. The mutated GAL3^BY-H352D allele was synthesized by GeneScript and subcloned into pGY409 using MscI-BstEII restriction sites, generating plasmid pGY418. The TRP1-GAL3^BY-H352D locus was PCR-amplified from pGY418 using primers 1D28 and 1D56 and transformed into GY1649 to create strain GY2009. Genotype was validated by PCR and sequencing. Strains of figure 5 were MPJ125-E06 (GAL3^BY), MPJ143-H01 (GAL3^YJM428), MPJ143-F01 (GAL3^YJM421) and MPJ125-A07 (GAL3^BC187) which were described in another study (Lee et al. PLoS Genetics, in press); they all derived from a S288c hoΔ::GAL1pr-YFP-mTagBFP2-kanMX4; gal3Δ::hphNT1 parental strain.

Galactose response measurements

Liquid cultures in synthetic medium with 2% raffinose (Yeast Nitrogen Base w/o amino acids 6.7g/L, Raffinose 2%, Dropout Mix 2g/L, adjusted to pH=5.8) were inoculated with a single colony and incubated overnight, then diluted to OD600 = 0.1 (synthetic medium, 2% raffinose) and grown for 3 to 6 hours. The galactose induction experiments were carried out in 96-well sterile microplates using a Freedom EVO200 liquid handler (Tecan) equipped with a 96-channel pipetting head (MCA), a high precision 8-channel pipetting arm (LiHa), a robotic manipulator arm (RoMa) and a MOI-6 incubator (Tecan). All robotic steps were programmed in Evoware v2.5.4.0 (Tecan). Cells were resuspended in synthetic medium with 2% raffinose and the appropriate galactose concentration (0.01, 0.1, 0.2 and 0.5%) and grown for the desired time (from 0 to 250 minutes). Cells were then washed with PBS1X, incubated for 8 min in 2% paraformaldehyde (PFA) at room temperature, followed by 12 min of incubation in PBS supplemented with Glycine 0.1M at room temperature and finally resuspended in PBS. They were then analyzed on a FACSCalibur (BD Biosciences) flow cytometer to record 10,000 cells per sample. Each set of data is representative of the results of two independent experiments (each comprising 3 technical replicates).

Flow cytometry data was analysed using the flowCore package from Bioconductor⁴⁸. Cells of homogeneous size were dynamically gated as follows: (i) removal of events with saturated signals (FSC, SSC or FL1 = 1023 or = 0), (ii) correction by subtracting the mean(FL1) at t=0 to each FL1 values, (iii) computation of a density kernel of FSC, SSC values to define a perimeter of peak density containing 60% of events, (iv) cell gating using this perimeter, (v) removal of samples containing less than 3,000 cells at the end of the procedure and (vi) correction of the data according to an eventual experimental bias during cytometer acquisitions. For the twelve time-points (0, 10, 20, 30, 40, 60, 80, 100, 130, 160, 205 and 250 minutes) experimental design, the time course for a given strain was acquired on different plates on the flow cytometer. In order to correct an eventual plate effect, we systematically included 24 replicates on each plate acquired on flow cytometer. We then tested the fixed effect of plates using an ANOVA. The FL1 values of each cell were subsequently corrected according to the plate offset of the ANOVA. For the six time-points (0, 30, 60, 80, 130 and 210 minutes) experimental design, all the timepoints being acquired on the same experimental plate, we did not apply the normalization filter. The GFP expression values presented here in arbitrary units were the FL1 signal of the retained cells (normalized for the plate effect, if required).

Analysis of flow cytometry distributions. All statistical analysis were done using R (version 3.2.4).

Calculation of the response amplitude. The response amplitude A was defined as the mean of P_GAL1GFP expression in activated cells. First, for each strain, at each time point, we determined by eye if the P_GaL1GFP distribution was unimodal (f_{(X_ALL)} = N(μ_ALL, σ_ALL)) or bimodal (f_{(X_ALL)} = f_{(X_OFF)} + f_{(X_ON)}). If the distribution was unimodal, we calculated: A = μ_ALL. Otherwise, bimodal distributions were considered as mixtures of two normal distributions, such as: f_{(X_ALL)} = ρ_OFF N(μ_OFF,σ_OFF) + ρ_ON N(μ_ON,,σ_ON), with A = μ_ON. We used the function mixtools::normalmixEM() to calculate A for mixture distributions. Calculation of inducibility. Inducibility was defined as the proportion of ON cells in the population. The threshold t between OFF and ON cells was calculated as follows: i) a subset of OFF cells (all cells acquired at t=0min) and ON cells (activated cells belonging to unimodal distributions, acquired at the latest time point of the experiments) was defined for each experiments, ii) the mean and standard deviation were extracted from each OFF and ON normal distributions using the function mixtools::normalmixEM(), iii) these parameters were used to determine t such as , with X_ON the observed fluorescence FL1 in ON_cells and X_OFF the observed fluorescence FL1 in OFF_cells, iv) we finally calculated for each time point, for each strain.

Stochastic modeling

We model the stochastic gene expression of GAL1, GAL3, GAL80 and of the reporter gene (under a GAL1 promoter). For each gene we account for the status of the promoter (ON/OFF) and for the production and degradation of mRNAs and proteins. In addition, for the reporter gene, we account for the maturation of the fluorescent protein. The promoter switching rate from ON to OFF for gene i is driven by GAL80p: with n_i the number of strong GAL4p binding sites in the promoter. We assume that GAL80p represses transcription via its dimerized form (with K₈₀ encompassing the dimer dissociation constant). The promoter switching rate from OFF to ON is driven by GAL3p and Gal1p: with the number of activated proteins at a given galactose concentration [gal] (K_gal being the galactose dissociation constant). Here also, we assume that activated Gal3p and Gal1p are mainly found as dimers. K₁ and K₃ encompass the dimer dissociation constants as well as the affinity of activated Gal3p* for Gal80p. For a detailed description of the model see Supplementary Text 1. Most of the parameters of the model (except K₁, K₃, K₈₀ and K_gal) were fixed based on the literature (see Table S1 in Supplementary Text 1). The model had 7 GAL3-dependent parameters: α₃ (leaky transcription rate), γ₃ (translation rate), β₃ (mRNA degradation rate), μ₃ (protein degradation/dilution rate), Δα₃ (full transcription rate), K₃ and K_gal. The phenotypic response of a strain (gradual vs binary) at a given galactose concentration mainly depends on K_gal and on the strength of GAL3 defined by ρ_Gal3=α₃γ₃/(β₃μ₃K₃) (see main text and Supplementary Text 1). For a given set of parameters, the stochastic dynamics of galactose induction was simulated using the stochastic simulation algorithm from Gillespie³³. The system was first allowed to reach steady-state at [gal]=0. At t=0, galactose is introduced and the parallel - independent - evolution of 5,000 cells is monitored during 250 minutes of real time.

Parameter inference

For a fixed set of GAL3-independent parameters, predictions for various values of GAL3-dependent parameters ρ_Gal3 and K_gal were performed at 3 different galactose concentrations (0.05, 0.1 and 0.5%). Parameters were sampled from a 2D logarithmic-grid encompassing the region of interest. Then, for each strain, a global chi2-score between the experimental data and the corresponding model predictions integrating the 3 concentrations were minimized to infer ρ_Gal3 and K_gal. Uncertainties on the parameters reflect the size of the sampling parameter grid. Parameter inference was repeated 6 times for different values of GAL3-independent parameters (see Supplementary Text 1).

Molecular dynamics simulations for free energy calculations were carried out as described in Supplementary Text 2 and Supplementary Fig. 8.

Data availability. All flow cytometry raw data files can be downloaded from http://flowrepository.org under accession number FR-FCM-ZY6Y.

AUTHORS CONTRIBUTION

Performed experiments: M.R., H.D-B., E.F., M.E., A.B.

Contributed analysis tools: M.R., F.C., D.J., and G.Y.

Contributed reagents: M.Spr.

Developed and evaluated pilot versions of the model: F.C. and F.P.

Conceived, implemented and used the model: D.J.

Performed molecular dynamics simulations: M.Spi.

Interpreted results: M.R., M.Spr., M.Spi., D.J. and G.Y.

Conceived and designed the study: G.Y.

Wrote the paper: M.R., D.J. and G.Y.

ACKNOWLEDGEMENTS

We thank Orsolya Symmons for discussions, Olivier Gandrillon for critical reading of the manuscript, Nicolas Rochette for sequence analysis, Audrey Barthelaix for initial tests on the robotic platform, Sandrine Mouradian and SFR Biosciences Gerland-Lyon Sud (UMS344/US8) for access to flow cytometers and technical assistance, the Pôle Scientifique de Modélisation Numérique and the CIMENT infrastructure (supported by the Rhône-Alpes region, Grant CPER07_13 CIRA) for computing resources, BioSyL Federation and Ecofect Labex for inspiring scientific events, developers of R, bioconductor and Ubuntu for their software. This work was supported by the European Research Council under the European Union’s Seventh Framework Programme FP7/2007-2013 Grant Agreement n°281359. DJ acknowledges program AGIR of University Grenoble-Alpes and the Institut Rhône-Alpin des Systèmes Complexes for funding. M. Spr. was supported by grants NSF 1349248 and RO1 GM120122-01 from the National Science Foundation (USA).

REFERENCES

1.↵
Gaffney, D. J. et al. Dissecting the regulatory architecture of gene expression QTLs. Genome Biol. 13, R7 (2012).
OpenUrl CrossRef PubMed
2.↵
Bauer-Mehren, A., Furlong, L. I., Rautschka, M. & Sanz, F. From SNPs to pathways: integration of functional effect of sequence variations on models of cell signalling pathways. BMC Bioinformatics 10, S6 (2009).
OpenUrl
3.↵
Levenstien, M. A. & Klein, R. J. Predicting functionally important SNP classes based on negative selection. BMC Bioinformatics 12, 26 (2011).
OpenUrl CrossRef PubMed
4.↵
Coronnello, C. et al. Novel Modeling of Combinatorial miRNA Targeting Identifies SNP with Potential Role in Bone Density. PLOS Comput. Biol. 8, e1002830 (2012).
OpenUrl CrossRef PubMed
5.↵
Barenboim, M., Masso, M., Vaisman, I. I. & Jamison, D. C. Statistical geometry based prediction of nonsynonymous SNP functional effects using random forest and neuro-fuzzy classifiers. Proteins Struct. Funct. Bioinforma. 71, 1930–1939 (2008).
OpenUrl
6.↵
Al-Numair, N. S. & Martin, A. C. The SAAP pipeline and database: tools to analyze the impact and predict the pathogenicity of mutations. BMC Genomics 14, S4 (2013).
OpenUrl CrossRef PubMed
7.↵
Sellick, C. A., Campbell, R. N. & Reece, R. J. Galactose metabolism in yeast-structure and regulation of the leloir pathway enzymes and the genes encoding them. Int. Rev. Cell Mol. Biol. 269, 111–150 (2008).
OpenUrl CrossRef PubMed
8.↵
Bhat, P. J. & Hopper, J. E. Overproduction of the GAL1 or GAL3 protein causes galactose-independent activation of the GAL4 protein: evidence for a new model of induction for the yeast GAL/MEL regulon. Mol. Cell. Biol. 12, 2701–2707 (1992).
OpenUrl Abstract/FREE Full Text
9.↵
Hawkins, K. M. & Smolke, C. D. The Regulatory Roles of the Galactose Permease and Kinase in the Induction Response of the GAL Network in Saccharomyces cerevisiae. J. Biol. Chem. 281, 13485–13492 (2006).
OpenUrl Abstract/FREE Full Text
10.
Biggar, S. R. & Crabtree, G. R. Cell signaling can direct either binary or graded transcriptional responses. EMBO J. 20, 3167–3176 (2001).
OpenUrl Abstract
11.↵
Apostu, R. & Mackey, M. C. Mathematical model of GAL regulon dynamics in Saccharomyces cerevisiae. J Theor Biol 293, 219–35 (2012).
OpenUrl PubMed
12.↵
Song, C. et al. Estimating the stochastic bifurcation structure of cellular networks. PLoS Comput Biol 6, e1000699 (2010).
OpenUrl CrossRef PubMed
13.↵
Hsu, C. et al. Stochastic signalling rewires the interaction map of a multiple feedback network during yeast evolution. Nat. Commun. 3, 682 (2012).
OpenUrl CrossRef PubMed
14.↵
Ramsey, S. A. et al. Dual feedback loops in the GAL regulon suppress cellular heterogeneity in yeast. Nat Genet 38, 1082–7 (2006).
OpenUrl CrossRef PubMed Web of Science
15.↵
Acar, M., Becskei, A. & van Oudenaarden, A. Enhancement of cellular memory by reducing stochastic transitions. Nature 435, 228–32 (2005).
OpenUrl CrossRef PubMed Web of Science
16.↵
Kundu, S. & Peterson, C. L. Dominant Role for Signal Transduction in the Transcriptional Memory of Yeast GAL Genes. Mol. Cell. Biol. 30, 2330–2340 (2010).
OpenUrl Abstract/FREE Full Text
17.↵
Becskei, A., Seraphin, B. & Serrano, L. Positive feedback in eukaryotic gene networks: cell differentiation by graded to binary response conversion. Embo J 20, 2528–35 (2001).
OpenUrl Abstract
18.↵
Venturelli, O. S., El-Samad, H. & Murray, R. M. Synergistic dual positive feedback loops established by molecular sequestration generate robust bimodal response. Proc. Natl. Acad. Sci. 109, E3324–E3333 (2012).
OpenUrl Abstract/FREE Full Text
19.↵
Peng, W., Liu, P., Xue, Y. & Acar, M. Evolution of gene network activity by tuning the strength of negative-feedback regulation. Nat. Commun. 6, 6226 (2015).
OpenUrl
20.↵
Venturelli, O. S., Zuleta, I., Murray, R. M. & El-Samad, H. Population Diversification in a Yeast Metabolic Program Promotes Anticipation of Environmental Shifts. PLoS Biol 13, e1002042 (2015).
OpenUrl CrossRef PubMed
21.↵
New, A. M. et al. Different Levels of Catabolite Repression Optimize Growth in Stable and Variable Environments. PLoS Biol 12, e1001764 (2014).
OpenUrl CrossRef PubMed
22.↵
Wang, J. et al. Natural Variation in Preparation for Nutrient Depletion Reveals a Cost–Benefit Tradeoff. PLoS Biol 13, e1002041 (2015).
OpenUrl CrossRef PubMed
23.↵
Jiang, F., Frey, B. R., Evans, M. L., Friel, J. C. & Hopper, J. E. Gene Activation by Dissociation of an Inhibitor from a Transcriptional Activation Domain. Mol. Cell. Biol. 29, 5604–5610 (2009).
OpenUrl Abstract/FREE Full Text
24.↵
Egriboz, O., Jiang, F. & Hopper, J. E. Rapid GAL Gene Switch of Saccharomyces cerevisiae Depends on Nuclear Gal3, Not Nucleocytoplasmic Trafficking of Gal3 and Gal80. Genetics 189, 825–836 (2011).
OpenUrl Abstract/FREE Full Text
25.↵
Stockwell, S. R. & Rifkin, S. A. A living vector field reveals constraints on galactose network induction in yeast. Mol. Syst. Biol. 13, 908 (2017).
OpenUrl Abstract/FREE Full Text
26.↵
Liti, G. et al. Population genomics of domestic and wild yeasts. Nature 458, 337–41 (2009).
OpenUrl CrossRef PubMed Web of Science
27.↵
Mateus, C. & Avery, S. V. Destabilized green fluorescent protein for monitoring dynamic changes in yeast gene expression with flow cytometry. Yeast 16, 1313–23 (2000).
OpenUrl CrossRef PubMed Web of Science
28.↵
Chuffart, F. et al. Exploiting Single-Cell Quantitative Data to Map Genetic Variants Having Probabilistic Effects. PLoS Genet. 12, e1006213 (2016).
OpenUrl CrossRef PubMed
29.↵
Hittinger, C. T. & Carroll, S. B. Gene duplication and the adaptive evolution of a classic genetic switch. Nature 449, 677–681 (2007).
OpenUrl CrossRef PubMed Web of Science
30.↵
Peng, G. & Hopper, J. E. Gene activation by interaction of an inhibitor with a cytoplasmic signaling protein. Proc. Natl. Acad. Sci. 99, 8548–8553 (2002).
OpenUrl Abstract/FREE Full Text
31.↵
Pilauri, V., Bewley, M., Diep, C. & Hopper, J. Gal80 Dimerization and the Yeast GAL Gene Switch. Genetics 169, 1903–1914 (2005).
OpenUrl Abstract/FREE Full Text
32.↵
Lavy, T., Kumar, P. R., He, H. & Joshua-Tor, L. The Gal3p transducer of the GAL regulon interacts with the Gal80p repressor in its ligand-induced closed conformation. Genes Dev. 26, 294–303 (2012).
OpenUrl Abstract/FREE Full Text
33.↵
Gillespie, D. T. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81, 2340–2361 (1977).
OpenUrl CrossRef Web of Science
34.↵
Escalante-Chong, R. et al. Galactose metabolic genes in yeast respond to a ratio of galactose and glucose. Proc. Natl. Acad. Sci. 112, 1636–1641 (2015).
OpenUrl Abstract/FREE Full Text
35.↵
Cherry, J. M. et al. Saccharomyces Genome Database: the genomics resource of budding yeast. Nucleic Acids Res. 40, D700–D705 (2012).
OpenUrl CrossRef PubMed Web of Science
36.↵
Thoden, J. B., Ryan, L. A., Reece, R. J. & Holden, H. M. The Interaction between an Acidic Transcriptional Activator and Its Inhibitor THE MOLECULAR BASIS OF Gal4p RECOGNITION BY Gal80p. J. Biol. Chem. 283, 30266–30272 (2008).
OpenUrl Abstract/FREE Full Text
37.↵
Yvert, G. et al. Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nat Genet 35, 57–64 (2003).
OpenUrl CrossRef PubMed Web of Science
38.↵
Kwan, E. X., Foss, E., Kruglyak, L. & Bedalov, A. Natural Polymorphism in BUL2 Links Cellular Amino Acid Availability with Chronological Aging and Telomere Maintenance in Yeast. PLoS Genet 7, e1002250 (2011).
OpenUrl CrossRef PubMed
39.↵
Fehrmann, S. et al. Natural sequence variants of yeast environmental sensors confer cell-to-cell expression variability. Mol. Syst. Biol. 9, 695 (2013).
OpenUrl Abstract/FREE Full Text
40.↵
Kron, S. J. Filamentous growth in budding yeast. Trends Microbiol. 5, 450–454 (1997).
OpenUrl CrossRef PubMed Web of Science
41.↵
Gaisne, M., Bécam, A. M., Verdière, J. & Herbert, C. J. A ‘natural’ mutation in Saccharomyces cerevisiae strains derived from S288c affects the complex regulatory gene HAP1 (CYP1). Curr. Genet. 36, 195–200 (1999).
OpenUrl CrossRef PubMed Web of Science
42.↵
Gerke, J., Lorenz, K., Ramnarine, S. & Cohen, B. Gene-environment interactions at nucleotide resolution. PLoS Genet 6, (2010).
43.↵
Llamosi, A. et al. What Population Reveals about Individual Cell Identity: Single-Cell Parameter Estimation of Models of Gene Expression in Yeast. PLOS Comput Biol 12, e1004706 (2016).
OpenUrl CrossRef
44.↵
Zhao, L., Sun, T., Pei, J. & Ouyang, Q. Mutation-induced protein interaction kinetics changes affect apoptotic network dynamic properties and facilitate oncogenesis. Proc. Natl. Acad. Sci. 112, E4046–E4054 (2015).
OpenUrl Abstract/FREE Full Text
45.↵
Nijhout, H. F., Best, J. A. & Reed, M. C. Using mathematical models to understand metabolism, genes, and disease. BMC Biol. 13, 79 (2015).
OpenUrl
46.↵
Louis, E. & Durbin, R. The Saccharomyces Genome Resequencing Project; http://www.sanger.ac.uk/Teams/Team71/durbin/sgrp/. (2007).
47.↵
Argueso, J. L. et al. Genome structure of a Saccharomyces cerevisiae strain widely used in bioethanol production. Genome Res 19, 2258–70 (2009).
OpenUrl Abstract/FREE Full Text
48.↵
Hahne, F. et al. flowCore: a Bioconductor package for high throughput flow cytometry. BMC Bioinformatics 10, 106 (2009).
OpenUrl CrossRef PubMed

View the discussion thread.

Posted May 20, 2017.

Download PDF

Citation Tools

Subject Area

Systems Biology

Subject Areas

All Articles

Animal Behavior and Cognition (5215)
Biochemistry (11752)
Bioengineering (8752)
Bioinformatics (29200)
Biophysics (14974)
Cancer Biology (12096)
Cell Biology (17411)
Clinical Trials (138)
Developmental Biology (9421)
Ecology (14182)
Epidemiology (2067)
Evolutionary Biology (18308)
Genetics (12245)
Genomics (16803)
Immunology (11869)
Microbiology (28097)
Molecular Biology (11594)
Neuroscience (60969)
Paleontology (451)
Pathology (1871)
Pharmacology and Toxicology (3238)
Physiology (4959)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2886)
Systems Biology (7340)
Zoology (1651)

[1] 1.↵
Gaffney, D. J. et al. Dissecting the regulatory architecture of gene expression QTLs. Genome Biol. 13, R7 (2012).
OpenUrl CrossRef PubMed

[2] 2.↵
Bauer-Mehren, A., Furlong, L. I., Rautschka, M. & Sanz, F. From SNPs to pathways: integration of functional effect of sequence variations on models of cell signalling pathways. BMC Bioinformatics 10, S6 (2009).
OpenUrl

[3] 3.↵
Levenstien, M. A. & Klein, R. J. Predicting functionally important SNP classes based on negative selection. BMC Bioinformatics 12, 26 (2011).
OpenUrl CrossRef PubMed

[4] 4.↵
Coronnello, C. et al. Novel Modeling of Combinatorial miRNA Targeting Identifies SNP with Potential Role in Bone Density. PLOS Comput. Biol. 8, e1002830 (2012).
OpenUrl CrossRef PubMed

[5] 5.↵
Barenboim, M., Masso, M., Vaisman, I. I. & Jamison, D. C. Statistical geometry based prediction of nonsynonymous SNP functional effects using random forest and neuro-fuzzy classifiers. Proteins Struct. Funct. Bioinforma. 71, 1930–1939 (2008).
OpenUrl

[6] 6.↵
Al-Numair, N. S. & Martin, A. C. The SAAP pipeline and database: tools to analyze the impact and predict the pathogenicity of mutations. BMC Genomics 14, S4 (2013).
OpenUrl CrossRef PubMed

[7] 7.↵
Sellick, C. A., Campbell, R. N. & Reece, R. J. Galactose metabolism in yeast-structure and regulation of the leloir pathway enzymes and the genes encoding them. Int. Rev. Cell Mol. Biol. 269, 111–150 (2008).
OpenUrl CrossRef PubMed

[8] 8.↵
Bhat, P. J. & Hopper, J. E. Overproduction of the GAL1 or GAL3 protein causes galactose-independent activation of the GAL4 protein: evidence for a new model of induction for the yeast GAL/MEL regulon. Mol. Cell. Biol. 12, 2701–2707 (1992).
OpenUrl Abstract/FREE Full Text

[9] 9.↵
Hawkins, K. M. & Smolke, C. D. The Regulatory Roles of the Galactose Permease and Kinase in the Induction Response of the GAL Network in Saccharomyces cerevisiae. J. Biol. Chem. 281, 13485–13492 (2006).
OpenUrl Abstract/FREE Full Text

[10] 10.
Biggar, S. R. & Crabtree, G. R. Cell signaling can direct either binary or graded transcriptional responses. EMBO J. 20, 3167–3176 (2001).
OpenUrl Abstract

[11] 11.↵
Apostu, R. & Mackey, M. C. Mathematical model of GAL regulon dynamics in Saccharomyces cerevisiae. J Theor Biol 293, 219–35 (2012).
OpenUrl PubMed

[12] 12.↵
Song, C. et al. Estimating the stochastic bifurcation structure of cellular networks. PLoS Comput Biol 6, e1000699 (2010).
OpenUrl CrossRef PubMed

[13] 13.↵
Hsu, C. et al. Stochastic signalling rewires the interaction map of a multiple feedback network during yeast evolution. Nat. Commun. 3, 682 (2012).
OpenUrl CrossRef PubMed

[14] 14.↵
Ramsey, S. A. et al. Dual feedback loops in the GAL regulon suppress cellular heterogeneity in yeast. Nat Genet 38, 1082–7 (2006).
OpenUrl CrossRef PubMed Web of Science

[15] 15.↵
Acar, M., Becskei, A. & van Oudenaarden, A. Enhancement of cellular memory by reducing stochastic transitions. Nature 435, 228–32 (2005).
OpenUrl CrossRef PubMed Web of Science

[16] 16.↵
Kundu, S. & Peterson, C. L. Dominant Role for Signal Transduction in the Transcriptional Memory of Yeast GAL Genes. Mol. Cell. Biol. 30, 2330–2340 (2010).
OpenUrl Abstract/FREE Full Text

[17] 17.↵
Becskei, A., Seraphin, B. & Serrano, L. Positive feedback in eukaryotic gene networks: cell differentiation by graded to binary response conversion. Embo J 20, 2528–35 (2001).
OpenUrl Abstract

[18] 18.↵
Venturelli, O. S., El-Samad, H. & Murray, R. M. Synergistic dual positive feedback loops established by molecular sequestration generate robust bimodal response. Proc. Natl. Acad. Sci. 109, E3324–E3333 (2012).
OpenUrl Abstract/FREE Full Text

[19] 19.↵
Peng, W., Liu, P., Xue, Y. & Acar, M. Evolution of gene network activity by tuning the strength of negative-feedback regulation. Nat. Commun. 6, 6226 (2015).
OpenUrl

[20] 20.↵
Venturelli, O. S., Zuleta, I., Murray, R. M. & El-Samad, H. Population Diversification in a Yeast Metabolic Program Promotes Anticipation of Environmental Shifts. PLoS Biol 13, e1002042 (2015).
OpenUrl CrossRef PubMed

[21] 21.↵
New, A. M. et al. Different Levels of Catabolite Repression Optimize Growth in Stable and Variable Environments. PLoS Biol 12, e1001764 (2014).
OpenUrl CrossRef PubMed

[22] 22.↵
Wang, J. et al. Natural Variation in Preparation for Nutrient Depletion Reveals a Cost–Benefit Tradeoff. PLoS Biol 13, e1002041 (2015).
OpenUrl CrossRef PubMed

[23] 23.↵
Jiang, F., Frey, B. R., Evans, M. L., Friel, J. C. & Hopper, J. E. Gene Activation by Dissociation of an Inhibitor from a Transcriptional Activation Domain. Mol. Cell. Biol. 29, 5604–5610 (2009).
OpenUrl Abstract/FREE Full Text

[24] 24.↵
Egriboz, O., Jiang, F. & Hopper, J. E. Rapid GAL Gene Switch of Saccharomyces cerevisiae Depends on Nuclear Gal3, Not Nucleocytoplasmic Trafficking of Gal3 and Gal80. Genetics 189, 825–836 (2011).
OpenUrl Abstract/FREE Full Text

[25] 25.↵
Stockwell, S. R. & Rifkin, S. A. A living vector field reveals constraints on galactose network induction in yeast. Mol. Syst. Biol. 13, 908 (2017).
OpenUrl Abstract/FREE Full Text

[26] 26.↵
Liti, G. et al. Population genomics of domestic and wild yeasts. Nature 458, 337–41 (2009).
OpenUrl CrossRef PubMed Web of Science

[27] 27.↵
Mateus, C. & Avery, S. V. Destabilized green fluorescent protein for monitoring dynamic changes in yeast gene expression with flow cytometry. Yeast 16, 1313–23 (2000).
OpenUrl CrossRef PubMed Web of Science

[28] 28.↵
Chuffart, F. et al. Exploiting Single-Cell Quantitative Data to Map Genetic Variants Having Probabilistic Effects. PLoS Genet. 12, e1006213 (2016).
OpenUrl CrossRef PubMed

[29] 29.↵
Hittinger, C. T. & Carroll, S. B. Gene duplication and the adaptive evolution of a classic genetic switch. Nature 449, 677–681 (2007).
OpenUrl CrossRef PubMed Web of Science

[30] 30.↵
Peng, G. & Hopper, J. E. Gene activation by interaction of an inhibitor with a cytoplasmic signaling protein. Proc. Natl. Acad. Sci. 99, 8548–8553 (2002).
OpenUrl Abstract/FREE Full Text

[31] 31.↵
Pilauri, V., Bewley, M., Diep, C. & Hopper, J. Gal80 Dimerization and the Yeast GAL Gene Switch. Genetics 169, 1903–1914 (2005).
OpenUrl Abstract/FREE Full Text

[32] 32.↵
Lavy, T., Kumar, P. R., He, H. & Joshua-Tor, L. The Gal3p transducer of the GAL regulon interacts with the Gal80p repressor in its ligand-induced closed conformation. Genes Dev. 26, 294–303 (2012).
OpenUrl Abstract/FREE Full Text

[33] 33.↵
Gillespie, D. T. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81, 2340–2361 (1977).
OpenUrl CrossRef Web of Science

[34] 34.↵
Escalante-Chong, R. et al. Galactose metabolic genes in yeast respond to a ratio of galactose and glucose. Proc. Natl. Acad. Sci. 112, 1636–1641 (2015).
OpenUrl Abstract/FREE Full Text

[35] 35.↵
Cherry, J. M. et al. Saccharomyces Genome Database: the genomics resource of budding yeast. Nucleic Acids Res. 40, D700–D705 (2012).
OpenUrl CrossRef PubMed Web of Science

[36] 36.↵
Thoden, J. B., Ryan, L. A., Reece, R. J. & Holden, H. M. The Interaction between an Acidic Transcriptional Activator and Its Inhibitor THE MOLECULAR BASIS OF Gal4p RECOGNITION BY Gal80p. J. Biol. Chem. 283, 30266–30272 (2008).
OpenUrl Abstract/FREE Full Text

[37] 37.↵
Yvert, G. et al. Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nat Genet 35, 57–64 (2003).
OpenUrl CrossRef PubMed Web of Science

[38] 38.↵
Kwan, E. X., Foss, E., Kruglyak, L. & Bedalov, A. Natural Polymorphism in BUL2 Links Cellular Amino Acid Availability with Chronological Aging and Telomere Maintenance in Yeast. PLoS Genet 7, e1002250 (2011).
OpenUrl CrossRef PubMed

[39] 39.↵
Fehrmann, S. et al. Natural sequence variants of yeast environmental sensors confer cell-to-cell expression variability. Mol. Syst. Biol. 9, 695 (2013).
OpenUrl Abstract/FREE Full Text

[40] 40.↵
Kron, S. J. Filamentous growth in budding yeast. Trends Microbiol. 5, 450–454 (1997).
OpenUrl CrossRef PubMed Web of Science

[41] 41.↵
Gaisne, M., Bécam, A. M., Verdière, J. & Herbert, C. J. A ‘natural’ mutation in Saccharomyces cerevisiae strains derived from S288c affects the complex regulatory gene HAP1 (CYP1). Curr. Genet. 36, 195–200 (1999).
OpenUrl CrossRef PubMed Web of Science

[42] 42.↵
Gerke, J., Lorenz, K., Ramnarine, S. & Cohen, B. Gene-environment interactions at nucleotide resolution. PLoS Genet 6, (2010).

[43] 43.↵
Llamosi, A. et al. What Population Reveals about Individual Cell Identity: Single-Cell Parameter Estimation of Models of Gene Expression in Yeast. PLOS Comput Biol 12, e1004706 (2016).
OpenUrl CrossRef

[44] 44.↵
Zhao, L., Sun, T., Pei, J. & Ouyang, Q. Mutation-induced protein interaction kinetics changes affect apoptotic network dynamic properties and facilitate oncogenesis. Proc. Natl. Acad. Sci. 112, E4046–E4054 (2015).
OpenUrl Abstract/FREE Full Text

[45] 45.↵
Nijhout, H. F., Best, J. A. & Reed, M. C. Using mathematical models to understand metabolism, genes, and disease. BMC Biol. 13, 79 (2015).
OpenUrl

[46] 46.↵
Louis, E. & Durbin, R. The Saccharomyces Genome Resequencing Project; http://www.sanger.ac.uk/Teams/Team71/durbin/sgrp/. (2007).

[47] 47.↵
Argueso, J. L. et al. Genome structure of a Saccharomyces cerevisiae strain widely used in bioethanol production. Genome Res 19, 2258–70 (2009).
OpenUrl Abstract/FREE Full Text

[48] 48.↵
Hahne, F. et al. flowCore: a Bioconductor package for high throughput flow cytometry. BMC Bioinformatics 10, 106 (2009).
OpenUrl CrossRef PubMed