bioRxiv
New Results

Predicting genetic interactions from Boolean models of biological networks

Laurence Calzone, Emmanuel Barillot, Andrei Zinovyev
doi: https://doi.org/10.1101/018507
Laurence Calzone (a, b, c), Emmanuel Barillot (a, b, c), Andrei Zinovyev (a, b, c)
a Institut Curie, 26 rue d’Ulm, Paris, France
b INSERM U900, Paris, France
c Mines ParisTech, Fontainebleau, France

Abstract

Genetic interaction can be defined as a deviation of the phenotypic quantitative effect of a double gene mutation from the effect predicted from single mutations using a simple (e.g., multiplicative or linear additive) statistical model. Experimentally characterized genetic interaction networks in model organisms provide important insights into relationships between different biological functions. We describe a computational methodology that systematically and quantitatively characterizes a Boolean mathematical model of a biological network in terms of genetic interactions between all loss of function and gain of function mutations with respect to all model phenotypes or outputs. We use the probabilistic framework defined in the MaBoSS software, based on continuous time Markov chains and stochastic simulations. In addition, we suggest several computational tools for studying the distribution of double mutants in the space of model phenotype probabilities. We demonstrate this methodology on three published models, for each of which we derive the genetic interaction networks and analyze their properties. We classify the obtained interactions according to their class of epistasis and their dependence on the chosen initial conditions and phenotype. The use of this methodology for validating mathematical models from experimental data and designing new experiments is discussed.

1 Introduction

Genetic interaction is defined as a phenomenon by which the effect of a double gene mutation cannot be predicted from the effect of single mutations using a simple (such as additive or multiplicative) statistical model15,25,32. The strength of the interaction can be characterized by an epistatic score, which is, in the case of purely deleterious mutations, negative for synergistic interactions (when the phenotype of a double mutation is significantly stronger than the expected combined effect of two independent single mutations), and positive for alleviating interactions (when the combined effect is weaker). Examples of synergistic interactions are synthetic lethality and synthetic sickness (in the case of survival-related phenotype) or synthetic enhancement of a phenotype4,21. An example of strong alleviating interaction is the suppression of an effect of one mutation by a second mutation (in classical genetics, such interactions were historically defined as “epistatic”). Genetic interactions in the general case of both beneficial and deleterious mutations can be classified into 9 groups according to various inequality relations between the effects of single and double mutants12.

Genetic interaction networks provide important insights into relations between different biological functions36. Knowledge of genetic interactions with respect to a disease phenotype can provide important hints on personalized treatment strategy, in particular, in cancer1,23,28. This knowledge is currently obtained by costly high-throughput screening techniques based on knocking-out or knocking-down genes (using siRNA or shRNA) in model organisms, such as yeast11,38, worm6, mouse13 or human cells27. Experimentally, one can measure both synthetic and synthetic dosage interactions31. Establishing single genetic interactions can be a result of long and tedious work, in the case of phenotypes that are complex and difficult to observe such as metastasis8.

Computational approaches have been used to derive genetic interactions from dynamical mathematical models or by machine learning. One of the earliest attempts to characterize the genetic networks of the genes involved in metabolism used the flux balance analysis framework applied to a genome-wide reconstruction of the yeast metabolic network32. In this work, a quantitative epistatic measure was introduced to characterize genetic interactions as the difference between the observed effect of a double mutant and the multiplicative model prediction from the two single mutation effects. It was noted that the distribution of the epistatic measure is tri-modal and that the interactions between functional modules have a tendency for monochromaticity, i.e., having the same dominant sign for between-module interactions. In a recent paper, a similar approach was applied to characterize genetic interactions with respect to multiple metabolism-related phenotypes33.

There have been many attempts to apply machine learning approaches for predicting genetic interactions from a subset of known interactions5,35. For instance, in yeast, the structure of physical interaction networks was combined with coexpression networks; data on protein classification was used for predicting genetic interactions40. In worm, identical anatomical expression and microarray co-expression, phenotype proximity, Gene Ontology annotation and presence of interlogs were the parameters used for fitting a logistic regression in order to score genetic interactions42. Decision tree-based approaches trained on the structure of protein-protein interaction and co-expression networks in both yeast and worm were also used9. Short polypeptide cluster detection was utilized to predict synthetic lethal interactions between genes in yeast41. Still in yeast, evolutionary approaches and the notion of functional asymmetry allowed prediction of negative genetic interactions between protein complex components24. There are very few examples of computational predictions of genetic interactions in human; one of them used gene expression analysis to predict synthetic lethal partners of the TP53 gene39. The main problem of most machine learning approaches is the absence of a bona fide set of negative examples (absence of interaction) for training, which is usually needed for a successful application of automated classification methods5. Nevertheless, it was shown that machine learning methods are able to predict genetic interactions significantly better than a random choice of gene pairs.

The knowledge about molecular mechanisms involved in a biological phenomenon that one wishes to study can be represented as a network of interacting entities30. Depending on the network type, the translation into a mathematical model can be done using an appropriate formalism (ordinary or partial differential equations, logical, rule-based modeling, etc.). These mathematical models can predict the effect of a perturbation, intrinsic or extrinsic, and anticipate the response to a drug, for instance. Boolean (or, more generally, logical) modeling focuses on how the influences of regulatory molecules combine to control the expression or activity of each molecular entity or process composing the regulatory network. In a purely Boolean framework, each variable of the model can only take two values: 0 or 1 (absent/inactive or present/active). In our studies, we found that the Boolean formalism represents a convenient means of abstraction for modeling cellular biochemistry dynamics and verifying that the topology of the networks representing the studied phenomena fits the experimentally observed effects of loss or gain of function mutations on a phenotype. So far, there has been no attempt to systematically predict genetic interactions using Boolean models of biological mechanisms.

An important remark should be made with respect to any attempt to predict genetic interactions computationally. Genetic interactions, being functional rather than physical, can strongly depend on the choice of both the phenotype (or model read-out) and the set of initial conditions used for model simulations. Therefore, genetic interactions can be classified as occurring with respect to single versus multiple phenotypes, and as dependent on versus independent of initial conditions. With the mathematical model of metabolism in yeast, it was shown that genetic interactions synergistic with respect to one phenotype can become alleviating with respect to another one33. Similarly, depending on the set of initial conditions (accounting for homeostatic, physiological, nutrient-deprived, etc. conditions), some phenotypes represented in the model can never be reached, or the simulations can lead to a different output with the same set of inputs. For example, in a model of the cell fate decision process in response to a TNF (or Fas) ligand activation signal, the cell response was shown to be either survival or cell death (nonapoptotic or apoptotic, though with a higher probability for the necrotic phenotype) depending on the activity of some nodes of the model7. In a model describing the kinetics of the restriction point, if the G1 cell cycle phase cyclin, Cyclin D1 (CycD in the model), is initially active (corresponding to the presence of growth factors), the cell enters the cycle; otherwise, it stays stuck in G1 arrest14,29.

In this manuscript, we suggest a quantitative methodology to convert a logical model of a regulatory network into a genetic interaction network, defined with respect to a chosen model phenotype (which can be any phenotype and not only survival-related, as is often the case). The methodology is based on the formalism of continuous time Markov chains implemented in the MaBoSS software37. Using published models, we applied our method to derive several genetic interaction networks for the genes that compose these models. We analyze genetic network properties and show that they possess many features of experimentally measured genetic networks. The derived genetic interactions reflect the functional properties of the mathematical models studied, so we briefly compare these predicted functional relations using available databases.

2 Methods and data

2.1 Models used in this study

Three published models were selected for testing the method. The models correspond to signalling pathways involved in cancer with the focus on: the MAPK pathway20 describing the crosstalk between the three mitogen-activated protein kinases: ERK, p38 and JNK, and their role in apoptosis and proliferation balance; the cell cycle with the focus on the biochemical processes regulating the restriction point14,29; and cell fate decision between survival and death in response to extrinsic signals such as death receptor activation7,44.

For each model, we provide files in both GINsim26 and MaBoSS37 formats. Several genetic interaction networks (GINs) per model can be constructed, corresponding to different initial conditions and to the chosen phenotype. They can be found as separate Cytoscape sessions in Supplementary materials (Supp Mat GINs).

2.2 Computing phenotype probabilities

For each model, we computed the probability of reaching model phenotypes for all possible single and double mutants (resulting either from gain of function, modelled as fixing the corresponding node value to 1 and referred to as “overexpression” or “oe”, or from loss of function, modelled as fixing the node value to 0 and referred to as “deletion” or “ko”). For these computations, we used both the MaBoSS software and a set of scripts for processing the MaBoSS configuration and output files, implemented in the BiNoM Cytoscape plugin2,3,43. MaBoSS is C++ software designed for simulating continuous/discrete time Markov processes, defined on the state transition graph representing the dynamics of a Boolean network. MaBoSS allows the modeller to associate different rates up and rates down to each variable of the model when the dynamics is known, which makes it possible to account for different time scales of the processes described by the model. Given some initial conditions, MaBoSS computes time trajectories by applying a kinetic Monte Carlo algorithm (details and examples can be found at: http://maboss.curie.fr). More precisely, the probability of reaching a phenotype is computed as the probability for the variable associated with the phenotype to have the value 1, by simulating random walks on the probabilistic state transition graph. The parameters for the stochastic simulations (number of runs, initial conditions, maximum time, etc.) are configured for each simulation. The read-out can be a variable representing the phenotype, a variable representing a protein or gene, or a combination of them. The probabilities for the selected outputs are reported for each mutant based on predefined initial conditions (which can be all random). Since a state in the state transition graph can combine activation of several phenotype variables, some phenotype probabilities appear to be “mixed” or coupled. This is particularly the case for cyclic attractors.
For the cell fate model, we investigated the effect of the choice of the initial conditions (“random” versus “physiological”) on the final phenotype probability distribution. The result of the simulations is stored in a simple table, containing the complete set of mutants characterized by probabilities of all pure and mixed model phenotypes (in Supp–Mat–Models and Supp–Mat–GINs).
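
The simulation principle described in this section can be illustrated with a minimal sketch. It is not the MaBoSS implementation and does not use its configuration format: it is a plain Gillespie-style random walk on an asynchronous Boolean network, written under the assumption that every node shares the same `rate_up` and `rate_down`. The function name `phenotype_probability`, its parameters and the three-node toy network are hypothetical and serve only as an illustration.

```python
import random

def phenotype_probability(rules, initial_state, phenotype, fixed=None,
                          rate_up=1.0, rate_down=1.0,
                          n_runs=200, max_time=50.0, seed=0):
    """Estimate P(phenotype variable = 1 at max_time) from random walks.

    rules: dict node -> function(state) returning the node's target value
    fixed: dict node -> 0 or 1 for mutated nodes (0 = "ko", 1 = "oe")
    """
    rng = random.Random(seed)
    fixed = fixed or {}
    hits = 0
    for _ in range(n_runs):
        state = dict(initial_state)
        state.update(fixed)                 # mutated nodes start at their fixed value
        t = 0.0
        while True:
            # A node can flip when its rule disagrees with its current value.
            flips = []
            for node, rule in rules.items():
                if node in fixed:
                    continue                # mutated nodes never change
                target = int(bool(rule(state)))
                if target != state[node]:
                    flips.append((node, rate_up if target == 1 else rate_down))
            if not flips:
                break                       # stable state reached
            total = sum(rate for _, rate in flips)
            t += rng.expovariate(total)     # exponential waiting time
            if t > max_time:
                break
            # Choose the flipping node with probability proportional to its rate.
            pick = rng.uniform(0.0, total)
            chosen = flips[-1][0]
            for node, rate in flips:
                pick -= rate
                if pick <= 0.0:
                    chosen = node
                    break
            state[chosen] = 1 - state[chosen]
        hits += state[phenotype]
    return hits / n_runs

# Hypothetical three-node toy network, not one of the models in this study.
toy_rules = {
    "Signal": lambda s: s["Signal"],          # input, keeps its value
    "Inhibitor": lambda s: s["Inhibitor"],    # input, keeps its value
    "Death": lambda s: s["Signal"] and not s["Inhibitor"],
}
start = {"Signal": 1, "Inhibitor": 0, "Death": 0}

p_wt = phenotype_probability(toy_rules, start, "Death")
p_oe = phenotype_probability(toy_rules, start, "Death", fixed={"Inhibitor": 1})
```

In this toy example, fixing the hypothetical `Inhibitor` node to 1 plays the role of an “oe” mutant. The sketch reads the phenotype only at the end of each run, which is a simplification of the time-dependent probabilities described above.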

2.3 Quantifying epistasis in double mutants

2.3.1 Definition of epistasis measures.

The results of double mutant simulations were used to quantify the level of epistasis between two model gene defects A and B with respect to a particular phenotype ϕ. We define the normalized “fitness” of a mutation (or combination of mutations) X with respect to a phenotype ϕ as the ratio between the probability of the phenotype in the mutant X and in the wild-type model:

$$f_X^{\phi} = \frac{p_X^{\phi}}{p_{WT}^{\phi}} \qquad (1)$$

where $p_X^{\phi}$ denotes the probability of phenotype ϕ in model X. To fully characterize a genetic interaction, one should be able to characterize its strength and type. We defined the strength of the interaction as a deviation of the fitness of the double mutant from one of the four simplest statistical models frequently used in this context: additive, logarithmic, multiplicative and min, i.e.,

$$\varepsilon^{\phi}(A, B) = f_{AB}^{\phi} - \Psi(f_A^{\phi}, f_B^{\phi}) \qquad (2)$$

where $f_A^{\phi}$ and $f_B^{\phi}$ are phenotype ϕ fitness values of single gene defects, $f_{AB}^{\phi}$ is the phenotype ϕ fitness of the double mutant, and Ψ(x, y) is one of the four functions (additive, logarithmic, multiplicative, min): [Embedded Image]

To choose the best definition of Ψ(x, y), the Pearson correlation coefficient was computed between the fitness values observed in all double mutants and estimated by the null model. The null model with maximal linear correlation was chosen:

$$\Psi = \arg\max_{\Psi'} \, \mathrm{corr}\left(f_{AB}^{\phi}, \Psi'(f_A^{\phi}, f_B^{\phi})\right)$$

Note that the best definition of Ψ can vary from model to model, from phenotype to phenotype, and even for different choices of initial conditions. Our simulations show that ΨLOG performs uniformly optimal or close to optimal in most of the simulations, and it also has the advantage of not producing biased distributions of ε (see next section).
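
The selection of a null model by maximal Pearson correlation can be sketched as follows. The functional forms of the four null models used here are an assumption: they are forms commonly found in the genetic interaction literature for fitness normalized to a wild-type value of 1, and they may differ from the exact definitions used in this study. The names `NULL_MODELS`, `best_null_model` and `epsilon`, as well as the fitness triples, are hypothetical.

```python
import math

# Assumed forms of the four null models (common in the literature; the
# exact definitions used in the study may differ).
NULL_MODELS = {
    "ADD": lambda x, y: x + y - 1.0,
    "LOG": lambda x, y: math.log2((2.0 ** x - 1.0) * (2.0 ** y - 1.0) + 1.0),
    "MLT": lambda x, y: x * y,
    "MIN": lambda x, y: min(x, y),
}

def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)

def best_null_model(doubles):
    """doubles: list of (f_a, f_b, f_ab) fitness triples, one per double mutant."""
    observed = [f_ab for _, _, f_ab in doubles]
    scores = {name: pearson([psi(f_a, f_b) for f_a, f_b, _ in doubles], observed)
              for name, psi in NULL_MODELS.items()}
    return max(scores, key=scores.get), scores

def epsilon(f_a, f_b, f_ab, psi):
    """Epistasis measure: observed double-mutant fitness minus null prediction."""
    return f_ab - psi(f_a, f_b)

# Hypothetical fitness triples (f_A, f_B, f_AB), for illustration only.
doubles = [(0.5, 0.8, 0.4), (0.9, 0.5, 0.45), (1.2, 0.7, 0.84),
           (0.3, 1.1, 0.33), (1.0, 1.0, 1.0)]
best_name, scores = best_null_model(doubles)
eps = [epsilon(f_a, f_b, f_ab, NULL_MODELS[best_name]) for f_a, f_b, f_ab in doubles]
```

The hypothetical triples were built so that each double-mutant fitness is the product of the single-mutant values, which is why the multiplicative model is selected in this toy case.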

2.3.2 Removing bias in the distribution of epistatic measure values.

After computing the distribution of epistatic measures, it can be observed that the peak of the distribution is shifted towards non-zero epistasis. This can be considered as a bias in estimating the null multiplicative model for quantifying the epistasis measure (2). In our experiments, it was corrected by linear fitting of the observed value y = fAB to the null model x = Ψ(fA, fB) (see Figure 1B). Then the epistatic measure is defined as:

$$\varepsilon^{(\mathrm{corrected})} = f_{AB} - \alpha \, \Psi(f_A, f_B)$$

where α is the slope coefficient in the best linear fit of y = fAB against x = Ψ(fA, fB). Further in the text, we refer to ε(corrected) as ε unless explicitly specified.
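
A minimal sketch of this correction is given here, assuming that the linear fit is a least-squares line through the origin; the text does not state whether an intercept is fitted, so this choice is an assumption. The function names and the numerical values are hypothetical.

```python
def fit_slope(null_values, observed_values):
    """Least-squares slope alpha of observed = alpha * null.

    Assumption: the line passes through the origin (no intercept).
    """
    num = sum(x * y for x, y in zip(null_values, observed_values))
    den = sum(x * x for x in null_values)
    return num / den

def corrected_epsilons(null_values, observed_values):
    """Return alpha and the corrected epistasis value for each double mutant."""
    alpha = fit_slope(null_values, observed_values)
    return alpha, [y - alpha * x for x, y in zip(null_values, observed_values)]

# Hypothetical values: null-model predictions x and observed fitness y.
x = [1.0, 2.0, 4.0]
y = [0.5, 1.0, 2.0]
alpha, eps_corrected = corrected_epsilons(x, y)
```

With these toy values every observation lies on the fitted line, so all corrected values vanish; with simulated data, the correction recentres the bulk of the distribution around zero.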

Fig. 1. Illustrating epistasis measures for the cell fate decision model. A) Distribution of ε values for three phenotypes. B) Additive model of epistasis: the solid line shows the uncorrected additive null model and the dashed line shows the corrected model; an arrow shows a particular double mutant BAX+/BCL2+, for which the combined effect is stronger than expected by the null model (example of single-nonmonotonic genetic interaction, A < WT < B < AB); the length of the arrow equals ε(BAX+/BCL2+) in this case. C) Comparison between ε values for the case of random initial conditions and the physiological initial condition. D) Comparison between ε values for two different cell death phenotypes.

2.3.3 Choosing the threshold for defining the set of genetic interactions.

The distribution of the epistasis measures ε is asymmetric in many examples. Therefore, we set a threshold separately for the positive and negative parts of the epistasis measure distribution (Figure 1A) as a multiple of the one-tailed standard deviation. Those genetic interactions whose strength is above k one-tailed standard deviations are selected, where k is a real number parameter (typically, k = 2 as a moderately stringent selection criterion).
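
The thresholding step can be sketched as follows. The one-tailed standard deviation is taken here as the root-mean-square distance from zero of the values in one tail; this definition is an assumption, since the text does not spell it out. The function names, mutant labels and ε values are hypothetical.

```python
import math

def one_tailed_sd(values):
    """Root-mean-square distance from zero of one tail (assumed definition)."""
    if not values:
        return 0.0
    return math.sqrt(sum(v * v for v in values) / len(values))

def select_interactions(eps_by_pair, k=2.0):
    """Keep pairs whose epsilon exceeds k one-tailed SDs on its own side of zero."""
    positive = [e for e in eps_by_pair.values() if e > 0]
    negative = [e for e in eps_by_pair.values() if e < 0]
    upper = k * one_tailed_sd(positive)
    lower = -k * one_tailed_sd(negative)
    return {pair: e for pair, e in eps_by_pair.items()
            if (e > 0 and e > upper) or (e < 0 and e < lower)}

# Hypothetical epsilon values for a handful of double mutants.
eps_by_pair = {("n%d_ko" % i, "m_oe"): 0.1 for i in range(9)}
eps_by_pair[("geneX_oe", "geneY_oe")] = 1.0
eps_by_pair.update({("n%d_oe" % i, "m_ko"): -0.1 for i in range(4)})

selected = select_interactions(eps_by_pair, k=2.0)
```

Treating the two tails separately means that a long positive tail does not inflate the threshold applied to negative interactions, and vice versa.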

2.3.4 Defining the type of genetic interaction.

Since model simulations can produce deleterious (fX < 1), neutral (fX ≈ 1) and beneficial (fX > 1) mutations X with respect to a phenotype ϕ, multiple possibilities arise for the relations between the four numbers fA, fB, fAB and fWT = 1, which cannot simply be grouped into alleviating and aggravating, as in the simplest case of purely deleterious mutations. We classified gene interactions using the existing approach12, according to 75 possible inequalities between these four numbers, which are further grouped into 9 genetic interaction classes: “suppressive”, “epistatic”, “conditional”, “single-nonmonotonic”, “additive”, “double-nonmonotonic”, “non-interactive”, “synthetic”, “asynthetic”. The first 4 classes in this list can be characterized by a direction of the interaction, i.e., mutation A is epistatic to B means that the effect of A completely cancels the effect of B (and both effects are different from the wild-type), and not the opposite (A → B). Note that the directed genetic interaction maps the causal effects in the opposite direction (e.g., mutations in downstream effectors of a phenotype can mask more upstream mutations).

To define inequalities, we introduced a threshold for distinguishing different values of fitness f, i.e., we consider two values of fitness fA and fB equal, if | fA – fB |< δ, where we typically choose δ = 0.2.

For example, one of the most prevalent interactions in our simulations is the “epistatic” interaction (in the sense of the classical definition of “epistasis”), which corresponds to the inequalities B < WT < A = AB (denoting fB < fWT = 1 < fA = fAB) or A = AB < WT < B, meaning that the effects of the single mutants are opposite with respect to the wild-type (one is deleterious and the other is beneficial) and the effect of the double mutant is equal to that of one of the single mutants (one single mutant “wins”). Another interesting example is the “synthetic” interaction type, which can correspond to the inequality AB < WT = A = B (classical “synthetic sickness”) or to WT = A = B < AB (“synthetic enhancement”).

Some interaction types are counter-intuitive, such as “single-nonmonotonic”, which can correspond to the inequality A < WT < B < AB, when a combination of deleterious and beneficial mutations leads to an enhancement of the phenotype stronger than that of the beneficial mutation alone. It was shown that these interactions are observed in real data12, and they are also observed in some of our simulations (see Figure 2).
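
The derivation of an inequality pattern from four fitness values can be sketched as follows. The sketch sorts the values and merges neighbours that differ by less than δ, which is one possible way of applying the threshold and is an assumption. It covers only the five patterns quoted in the text; the complete table of 75 inequalities and 9 classes is not reproduced. The names `inequality_pattern` and `EXAMPLE_CLASSES` are hypothetical.

```python
def inequality_pattern(f_a, f_b, f_ab, delta=0.2, f_wt=1.0):
    """Return a string such as 'A < WT < B < AB' for four fitness values.

    Assumption: after sorting, neighbouring values closer than delta are
    treated as equal and merged into one group.
    """
    label_order = {"WT": 0, "A": 1, "B": 2, "AB": 3}
    items = sorted([("WT", f_wt), ("A", f_a), ("B", f_b), ("AB", f_ab)],
                   key=lambda item: item[1])
    groups = [[items[0]]]
    for name, value in items[1:]:
        if value - groups[-1][-1][1] < delta:
            groups[-1].append((name, value))    # equal within tolerance
        else:
            groups.append([(name, value)])      # strictly larger
    return " < ".join(
        " = ".join(sorted((name for name, _ in group), key=label_order.get))
        for group in groups)

# Only the patterns explicitly quoted in the text are listed here.
EXAMPLE_CLASSES = {
    "B < WT < A = AB": "epistatic",
    "A = AB < WT < B": "epistatic",
    "AB < WT = A = B": "synthetic",
    "WT = A = B < AB": "synthetic",
    "A < WT < B < AB": "single-nonmonotonic",
}

# Hypothetical fitness values (f_A, f_B, f_AB) for three double mutants.
pattern_1 = inequality_pattern(0.5, 1.5, 2.0)
pattern_2 = inequality_pattern(1.0, 1.05, 0.3)
pattern_3 = inequality_pattern(1.8, 0.4, 1.75)
```

A pattern that is not in `EXAMPLE_CLASSES` is simply left unclassified in this sketch, whereas the approach referenced in the text assigns every inequality to one of the 9 classes.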

Fig. 2. Genetic interaction networks computed for the cell fate decision model, with random and physiological initial conditions and for the three considered phenotypes: apoptosis, necrosis and survival.

2.3.5 Visualizing genetic interaction network using Cytoscape.

The selected genetic interactions are visualized in Cytoscape10 (see the examples in Figure 2 and Figure 5). The chosen visual mapping distinguishes, by colour and shape, loss of function and gain of function single mutants. The size of a node reflects the effect of the single mutant on the phenotype, and the width of an edge reflects the strength of the epistatic effect of the corresponding double mutant. Edge colour denotes the interaction type, using the colour scheme suggested before12 (see Figure 3 for the definition of the visualization style).

Fig. 3. Colour code for the genetic interaction networks. The name of the interaction and the colour code is in accordance with12. Only the rules found in our analyses of the three models are indicated for each interaction.

2.3.6 Using non-linear principal component analysis for mapping double mutant distribution in the space of phenotype probabilities

The non-linear principal manifolds were constructed for the distribution of all single and double mutants of the model in the space of computed model phenotype probabilities, using the elastic maps method16–18 and the ViDaExpert software19. For computation, only the mixed phenotypes whose expected probability over the whole set of double mutants exceeds 1% were selected. This results in sets of double mutants in multi-dimensional space for which principal manifolds were computed (see Figure 4).
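
The selection of phenotypes that precedes the manifold construction can be sketched as a simple filter on the mean probability. The elastic map itself is not reproduced here. The function name, the mutant labels, the phenotype labels and the probabilities are hypothetical.

```python
def frequent_phenotypes(double_mutant_probs, min_mean=0.01):
    """Return phenotypes whose mean probability over all double mutants exceeds min_mean.

    double_mutant_probs: dict mutant -> dict phenotype -> probability;
    a phenotype missing for a mutant counts as probability 0.
    """
    totals = {}
    for probs in double_mutant_probs.values():
        for phenotype, p in probs.items():
            totals[phenotype] = totals.get(phenotype, 0.0) + p
    n = len(double_mutant_probs)
    return sorted(ph for ph, total in totals.items() if total / n > min_mean)

# Hypothetical table of pure and mixed phenotype probabilities.
table = {
    "geneX_ko/geneY_oe": {"Apoptosis": 0.6, "Survival": 0.395,
                          "Apoptosis+Survival": 0.005},
    "geneX_oe/geneZ_ko": {"Apoptosis": 0.1, "Survival": 0.9},
}
kept = frequent_phenotypes(table)
```

The retained phenotypes define the coordinates of the space in which each mutant becomes one data point.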

Fig. 4. Application of non-linear principal manifold analysis for visualizing the distribution of double mutants in the space of phenotype probabilities. The figure shows the projection of phenotype probabilities from multi-dimensional space onto the 2D space of internal coordinates of the non-linear principal manifold. Each point corresponds to a mutant. A big violet pentagon corresponds to the wild-type model, triangles to single-element mutant models and circles to double mutants. Gradients of increase of the model phenotype probabilities are shown by curved arrows. The gray color in the background visualizes the local density of the projections onto the map, which allows cluster analysis to be performed visually.

3 Results and discussion

The three Boolean models were downloaded either from The Cell Collective database22 or from the GINsim database26. The stable state analysis was done in the GINsim software. The models were then exported to MaBoSS for simulations. Finally, we used scripts embedded in the BiNoM Cytoscape plugin to automatically compute probabilities for all single and double mutants (including both gain of function and loss of function mutants for all components of each model) and visualize the results of paired interactions as genetic interaction networks. A thorough description of each model is given in supplementary materials (SuppMat description models) along with the Cytoscape sessions for each model, each phenotype and different initial conditions for one of the models (SuppMat–GINs).
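
The set of mutants covered by such a screen can be enumerated with a short sketch. It is not the BiNoM script; the function name is hypothetical and the three node names are taken from the cell fate model only as an example. For n nodes there are 2n single mutants and 4·C(n, 2) = 2n(n − 1) double mutants, since each node of a pair can be either deleted (“ko”) or overexpressed (“oe”).

```python
from itertools import combinations, product

def enumerate_mutants(nodes):
    """List all single and double mutants as tuples of (node, 'ko' | 'oe')."""
    kinds = ("ko", "oe")
    singles = [((node, kind),) for node in nodes for kind in kinds]
    doubles = [((a, kind_a), (b, kind_b))
               for a, b in combinations(nodes, 2)       # unordered pairs of distinct nodes
               for kind_a, kind_b in product(kinds, repeat=2)]
    return singles, doubles

singles, doubles = enumerate_mutants(["TNF", "BAX", "BCL2"])
```

Each tuple can then be turned into the set of fixed node values (0 for “ko”, 1 for “oe”) passed to the simulation of the corresponding mutant model.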

3.1 Cell fate decision model

Figure 2 shows the genetic interaction networks computed with respect to three different phenotypes (survival, apoptosis, nonapoptotic cell death referred to as necrosis for short)7.

The general shape of the epistatic measure distribution exhibits tri-modality (Figure 1A), as was previously observed in another modeling framework32. The cell fate decision GINs and the distributions of ε show that the networks computed for different phenotypes are less similar than the networks computed for the same phenotype but with different initial conditions (Figures 2 and 1C,D, with the legend for GINs given in Figure 3). Similar conclusions were made for most of the constructed GINs in this study.

For the physiological initial conditions with TNF=1, some gene alterations (and, by extension, some pathways) appear to be more important than when all initial conditions are considered. Indeed, some of these interactions are lost among the numerous genetic interactions obtained when considering all initial conditions. This is particularly evident for the survival phenotype. Overexpressing any gene from the survival pathway, which is described in a linear manner in this model, is enough to favour or even force the survival phenotype. When taking into account all possible inputs, other pathways can help reach the survival phenotype: the additive effect of both RIP1 and cIAP gain of function would be equivalent to forcing RIP1ub. Single-nonmonotonic interactions are numerous in the apoptotic and necrotic genetic networks. Unexpectedly, the gain of function of BCL2, which leads to a null probability of reaching apoptosis, together with the gain of function of BAX increases the apoptotic probability relative to BAX gain of function alone. In fact, BCL2 gain of function is able to block very efficiently both apoptosis and necrosis. If BAX gain of function promotes apoptosis as observed experimentally, deleting any signal from the necrotic (or necroptotic) pathway seems to increase apoptosis even more. This observation confirms the mutually exclusive nature of the two phenotypes. In accordance with Drees et al.12, this type of single-nonmonotonic interaction occurs with a high frequency in our networks but also in experimental data, even though such interactions are not “recognized by common genetic nomenclature”.

The distribution of all single and double mutant models forms a set of points in the multi-dimensional space of model phenotype probabilities. We found it very informative to visualize this set with a projection from multi-dimensional to two-dimensional space, using advanced methods of non-linear data visualization such as the projection onto the principal manifolds constructed by the elastic maps method (Figure 4). In these visualizations, one can see that single and double mutants form clusters characterized by some typical phenotype probability values. The cluster around the wild-type model collects those mutants whose effect can be considered as neutral. Some clusters represent the mutants with an extreme effect of induction of some of the phenotypes. The probability of different phenotypes changes along the non-linear directions (gradients) of increasing phenotype probability. Some clusters, labelled here by “BCL2 oe” and “ROS ko” single mutants, correspond to some particular states of the model (“naive survival” for “BCL2 oe” and a complex state combining apoptosis and “naive survival” for “ROS ko”; note that the last state can be artificial due to some unrealistic assumptions such as non-production of ROS, over-abundance of ATP, or impossibility of MPT).

3.2 MAPK model

The MAPK pathway controls several cellular processes such as cell cycle activation, apoptosis, survival or differentiation. The model of Grieco et al.20 details the crosstalk between the pathways of the three mitogen-activated protein kinases: ERK, JNK and p38. In response to four stimuli (EGFR, FGFR3, TGFbeta, and DNA damage), the model produces in silico the cell response in terms of proliferation, growth arrest and apoptosis in diverse conditions, and simulates different sets of mutations often found in cancer. Even though the model is generic, its analysis is applied to studying bladder carcinogenesis.

Three GINs are generated using stringent conditions (interactions are selected above k = 3 standard deviations) for filtering the edges for the three phenotypes: apoptosis, growth arrest and proliferation. The networks are characterized by modular structure, in particular, for the apoptotic phenotype (Figure 5, panel 1). Interestingly, interactions within some modules or between modules are monochromatic with respect to the type of the genetic interactions. For example, a module connecting several transcription factors (JUN, AP1, ATF2) with phosphatase PPP2CA negatively controlling cell growth appears in the GINs for both the apoptosis and growth arrest phenotypes. All interactions inside this module are of “synthetic” type (i.e., synergistic). Monochromatic structure of interactions between modules can be seen in Figure 5, panel 3, where the network can be decomposed into several modules (e.g., PTEN/p21/AKT versus p70/ERK/MEK1 2) based on the same type of interactions in between them.

Fig. 5. Genetic interaction networks computed for the MAPK model, with random initial conditions and for the three phenotypes: apoptosis, growth arrest and proliferation.

Genes of the apoptotic pathway such as ATM and MAX appear to be hubs in the network. ATM is the most prominent: both its loss of function and its gain of function interact with partners that contribute to increasing (or compensating for the loss of) apoptosis (Figure 5, panel 1). The combination of p53 gain of function and ERK gain of function seems to be a good combination to improve the growth arrest phenotype (Figure 5, panel 2), whereas loss of function of PTEN reduces the arrest caused by gain of function of BCL2. In the GIN for the proliferation phenotype (Figure 5, panel 3), the gain of function of either MEK1 2 or ERK seems to be crucial in promoting proliferation, particularly in combination with gain of function of AKT or loss of function of p53 or p38, for instance. They form a hub in the network and seem to be very similar (symmetric) in terms of the genetic interactions they share with the rest of the proteins of the network.

The MAPK model is the biggest network we study here. We anticipate that in even bigger regulatory network models, the corresponding genetic interaction networks should be modular and provide informative hints on pathways that are activated with respect to a particular phenotype. Predictions about the co-occurrence or the mutual exclusivity between gene alterations could be also derived from these networks.

3.3 Mammalian restriction point model

This Boolean model14 was adapted from a mathematical model based on ordinary differential equations developed by Novak and Tyson29. The model was built to illustrate the behaviour of cells exposed to cycloheximide treatments at different times of the cell cycle. The model describes the dynamics of the restriction point situated in late G1 after which the cell commits to division even if treated by the drug.

For this small model, the GINs are easier to interpret biologically (Figure 6). The model is built such that if the cell does not receive any external growth signals, of which CycD is the sensor, it remains stuck in the G1 cell cycle phase. Therefore, neither CycD nor Rb is included in these networks, as their gain or loss of function would automatically lead to forcing or deleting the phenotypes. The gain of function of the cell cycle inhibitor p27 is counteracted by the gain of function of downstream cyclins such as CycA and CycE. Similarly, deleting both inhibitors of the G2 and M cyclins, Cdc20 and cdh1, is equivalent to overexpressing the cyclins, and the cells can no longer arrest. A similar effect is achieved by overexpressing E2F and deleting cdh1. The role of cdh1 seems to be more prevalent in degrading the cyclins. Note that cdh1 and Cdc20 are in both genetic interaction networks for growth arrest and proliferation because the two read-outs are symmetric. The loss of function of both Cdc20 and cdh1 leads to a very low probability of arresting the cycle, and a very high probability of proliferating. The two phenotypes are mutually exclusive.

Fig. 6. Genetic interaction networks computed for the restriction point model, with random initial conditions and for the two phenotypes: growth arrest and proliferation.

3.4 Comparison with experimentally derived genetic interactions

We performed two types of comparisons: first, we compared the genetic interactions from our method to available experimental results, and second, we compared the genetic interactions between models.

We compared the results from each of the three examples with the genetic interactions listed in the BioGRID database34. In the database, we queried the genes that participated in a significant manner in pairs of genetic interactions in our three models. In the MAPK model, the TP53 and MDM2 interaction came out in both BioGRID and our study: TP53 and MDM2 were identified in a phenotypic suppression type of genetic interaction in BioGRID, and we showed that overexpression of both TP53 and MDM2 led to a suppressive genetic interaction with respect to the apoptosis phenotype. The pair ATM and TP53 seems to be involved in a phenotypic enhancement in BioGRID, but was not found in our study. In the cell fate model, BioGRID lists three phenotypic suppressions: between XIAP and CASP8, between IKK1 and TNF, and between BCL2 and CASP8. The first two were confirmed in our analysis: overexpression of XIAP and of CASP8 led to an epistatic interaction with respect to apoptosis in the TNF-activated signal, and deletion of IKK1 and deletion of TNF led to an epistatic interaction with respect to the necrosis (NonACD) phenotype in the TNF-activated signal. In addition, overexpression of IKK1 and deletion of TNF led to an epistatic interaction with respect to the survival phenotype in the TNF-activated signal. The last interaction, between BCL2 and CASP8, was not identified with our method. In the mammalian restriction point model, only one interaction appeared in BioGRID, a phenotypic enhancement between p21 and p27, which was not found in our analysis. More details can be found in the supplementary materials, SuppMat Analysis BioGRID. In conclusion, the comparison showed that some interactions predicted by our method were indeed confirmed in the BioGRID database. This type of comparison can serve to validate Boolean models developed for various molecular mechanisms against known genetic interactions, and can provide additional constraints on the choice of model network topology, logical rules and rate parameters. Of course, such an analysis should take into account the incompleteness of our knowledge of genetic interactions.
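The bookkeeping behind such a comparison can be sketched in a few lines. The snippet below is a hedged illustration, not the script used for the supplementary analysis: the helper `unordered` is hypothetical, the BioGRID set contains only the pairs named in the text, and the predicted set is restricted to the pairs the text reports as confirmed (the full set of predicted interactions is much larger).

```python
def unordered(pairs):
    """Normalise gene pairs so that (A, B) and (B, A) compare equal."""
    return {frozenset(pair) for pair in pairs}


# Pairs named in the text as listed in BioGRID for genes of the three models.
biogrid = unordered([
    ("TP53", "MDM2"), ("ATM", "TP53"),                      # MAPK model
    ("XIAP", "CASP8"), ("IKK1", "TNF"), ("BCL2", "CASP8"),  # cell fate model
    ("p21", "p27"),                                         # restriction point
])

# Pairs the text reports as also predicted (illustrative subset only).
predicted = unordered([("MDM2", "TP53"), ("CASP8", "XIAP"), ("TNF", "IKK1")])

confirmed = biogrid & predicted   # found by both
missed = biogrid - predicted      # in BioGRID but not predicted
recall = len(confirmed) / len(biogrid)

for pair in sorted(tuple(sorted(p)) for p in missed):
    print("not recovered:", pair)
print("fraction of BioGRID pairs recovered:", recall)
```

Storing each pair as a `frozenset` makes the comparison independent of gene order. A fuller comparison would also match the interaction type and the phenotype, which this sketch ignores.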

We also compared the genetic interactions obtained for the three examples in more detail. Unfortunately, there was no overlap between the three models, since the only common gene was BCL2, shared by the cell fate and the MAPK models. We then looked more carefully at how the genetic interactions differ between phenotypes within each model individually. With this comparison, we identified the complementary role of some genes in the networks and confirmed findings from the initial publications. The results can be found in the supplementary materials, SuppMat comparison phenotypes.

4 Conclusions

In this manuscript, we suggest a methodology for converting a logical mathematical model with a set of initial conditions into the corresponding genetic interaction network, which characterizes the behaviour of all single and double mutants in terms of phenotype probabilities. The advantage of the methodology is that it allows:

  1. estimating and classifying possible functional interactions between the different elements composing the model;

  2. distinguishing extreme cases of mutations amplifying or masking each other and, based on this, suggesting intervention points in order to achieve a desired phenotype (such as in8);

  3. suggesting experimental designs from the logical models;

  4. detecting controversial (non-intuitive) properties of mutants with respect to expected phenotypes such as nonmonotonic genetic interactions;

  5. quickly comparing similar logical network models in terms of their functional properties;

  6. validating the model and comparing different models using available screenings for genetic interactions (such as synthetic lethality screens).

The last point deserves further development. We aim to extend our methodology using existing databases of genetic interactions (similar to what we did with BioGRID34) to match the model predictions with genetic interactions or single-mutation phenotypes known from the literature or from screenings. Moreover, similar to parameter fitting in the construction of chemical kinetic models, one can fit the kinetic rates defined in our continuous-time discrete approach37 in order to optimize the set of model predictions. Another set of experimental data that could be used with this approach is high-throughput cancer data, such as large-scale mutation landscapes collected for series of tumours. Patterns of co-occurrence or mutual exclusivity of mutations can reflect the action of genetic interactions in cancer cells. For example, synthetically lethal interactions can lead to a pattern of mutual exclusivity, since cancer cells carrying both synthetically lethal mutations will be eliminated from the cell population. Using these data for the interpretation and validation of model-based predictions requires the development of a statistical methodology for detecting such patterns in high-throughput data.
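As a starting point for such a statistical methodology, mutual exclusivity between two genes can be screened with a one-sided Fisher exact test on the 2×2 table of tumour counts. This is a standard test chosen here for illustration, not a method developed in this work; the function name and the counts in the usage line are hypothetical, and the test ignores differences in mutation rate between tumours.

```python
from math import comb


def mutual_exclusivity_pvalue(both, only_a, only_b, neither):
    """One-sided Fisher exact test: P(co-occurrence count <= observed).

    With the numbers of tumours mutated in gene A and in gene B held
    fixed, the number of tumours carrying both mutations follows a
    hypergeometric distribution if the two mutations are independent.
    """
    n = both + only_a + only_b + neither
    n_a = both + only_a               # tumours mutated in gene A
    n_b = both + only_b               # tumours mutated in gene B
    lowest = max(0, n_a + n_b - n)    # smallest feasible overlap
    total = comb(n, n_b)
    favourable = sum(
        comb(n_a, k) * comb(n - n_a, n_b - k)
        for k in range(lowest, both + 1)
    )
    return favourable / total


# Hypothetical counts for 100 tumours (illustrative only).
p = mutual_exclusivity_pvalue(both=1, only_a=19, only_b=24, neither=56)
print(f"p-value for mutual exclusivity: {p:.4f}")
```

A small p-value indicates fewer co-mutated tumours than expected by chance, which is compatible with, but does not prove, a synthetically lethal interaction. When many gene pairs are screened, the p-values would need correction for multiple testing.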

Genetic interaction networks reconstructed from logical mathematical models possess many properties of experimentally measured networks. They are characterized by a variety of types of genetic interactions (with a predominance of masking, e.g., epistatic, interactions) and, for sufficiently big discrete models, by a modular structure (Figure 5), with some modules characterized by monochromaticity of within-module as well as between-module interactions. Sets of genetic interactions are highly dependent on the phenotype with respect to which they are defined and, to a lesser extent, sensitive to the initial conditions (in other words, to the molecular context) chosen for performing the simulations. These properties make the obtained genetic interaction networks a good model for the experimentally measured ones.

Therefore, we believe that the suggested methodology will contribute to the toolbox of computational approaches in systems biology, connected to mathematical modeling of cellular mechanisms.

5 Acknowledgements

This work was supported by the internal project of Institut Curie “PIC Computational Systems Biology of Cancer”.

Footnotes

  • † Electronic Supplementary Information (ESI) available: http://maboss.curie.fr/gins. See DOI: 10.1039/b000000x/

  • ↵d E-mail: Laurence.Calzone{at}curie.fr

  • ↵e E-mail: Emmanuel.Barillot{at}curie.fr

  • ↵f E-mail: Andrei.Zinovyev{at}curie.fr

References

  1. Barillot, E., Calzone, L., Hupe, P., Vert, J.-P., and Zinovyev, A. (2012). Computational Systems Biology of Cancer. Chapman & Hall, CRC Mathematical and Computational Biology.
  2. Bonnet, E., Calzone, L., Rovera, D., Stoll, G., Barillot, E., and Zinovyev, A. (2013a). BiNoM 2.0, a Cytoscape plugin for accessing and analyzing pathways using standard systems biology formats. BMC Syst Biol, 7, 18.
  3. Bonnet, E., Calzone, L., Rovera, D., Stoll, G., Barillot, E., and Zinovyev, A. (2013b). Practical use of BiNoM: a biological network manager software. Methods Mol Biol, 1021, 127–146.
  4. Boone, C., Bussey, H., and Andrews, B. J. (2007). Exploring genetic interactions and networks with yeast. Nat Rev Genet, 8(6), 437–449.
  5. Boucher, B. and Jenna, S. (2013). Genetic interaction networks: better understand to better predict. Front Genet, 4, 290.
  6. Bussey, H., Andrews, B., and Boone, C. (2006). From worm genetic networks to complex human diseases. Nat Genet, 38(8), 862–863.
  7. Calzone, L., Tournier, L., Fourquet, S., Thieffry, D., Zhivotovsky, B., Barillot, E., and Zinovyev, A. (2010). Mathematical modelling of cell-fate decision in response to death receptor engagement. PLoS Comput Biol, 6(3), e1000702.
  8. Chanrion, M., Kuperstein, I., Barrière, C., El Marjou, F., Cohen, D., Vignjevic, D., Stimmer, L., Paul-Gilloteaux, P., Bièche, I., Tavares, S. D. R., Boccia, G.-F., Cacheux, W., Meseure, D., Fre, S., Martignetti, L., Legoix-Né, P., Girard, E., Fetler, L., Barillot, E., Louvard, D., Zinovyev, A., and Robine, S. (2014). Concomitant Notch activation and p53 deletion trigger epithelial-to-mesenchymal transition and metastasis in mouse gut. Nat Commun, 5, 5005.
  9. Chipman, K. C. and Singh, A. K. (2009). Predicting genetic interactions with random walks on biological networks. BMC Bioinformatics, 10, 17.
  10. Cline, M. S., Smoot, M., Cerami, E., Kuchinsky, A., Landys, N., Workman, C., Christmas, R., Avila-Campilo, I., Creech, M., Gross, B., Hanspers, K., Isserlin, R., Kelley, R., Killcoyne, S., Lotia, S., Maere, S., Morris, J., Ono, K., Pavlovic, V., Pico, A. R., Vailaya, A., Wang, P.-L., Adler, A., Conklin, B. R., Hood, L., Kuiper, M., Sander, C., Schmulevich, I., Schwikowski, B., Warner, G. J., Ideker, T., and Bader, G. D. (2007). Integration of biological networks and gene expression data using Cytoscape. Nat Protoc, 2(10), 2366–2382.
  11. Costanzo, M., Baryshnikova, A., Bellay, J., Kim, Y., Spear, E. D., Sevier, C. S., Ding, H., Koh, J. L. Y., Toufighi, K., Mostafavi, S., Prinz, J., St Onge, R. P., VanderSluis, B., Makhnevych, T., Vizeacoumar, F. J., Alizadeh, S., Bahr, S., Brost, R. L., Chen, Y., Cokol, M., Deshpande, R., Li, Z., Lin, Z.-Y., Liang, W., Marback, M., Paw, J., San Luis, B.-J., Shuteriqi, E., Tong, A. H. Y., van Dyk, N., Wallace, I. M., Whitney, J. A., Weirauch, M. T., Zhong, G., Zhu, H., Houry, W. A., Brudno, M., Ragibizadeh, S., Papp, B., Pál, C., Roth, F. P., Giaever, G., Nislow, C., Troyanskaya, O. G., Bussey, H., Bader, G. D., Gingras, A.-C., Morris, Q. D., Kim, P. M., Kaiser, C. A., Myers, C. L., Andrews, B. J., and Boone, C. (2010). The genetic landscape of a cell. Science, 327(5964), 425–431.
  12. Drees, B. L., Thorsson, V., Carter, G. W., Rives, A. W., Raymond, M. Z., Avila-Campillo, I., Shannon, P., and Galitski, T. (2005). Derivation of genetic interaction networks from quantitative phenotype data. Genome Biol, 6(4), R38.
  13. Einav, Y., Agami, R., and Canaani, D. (2005). shRNA-mediated RNA interference as a tool for genetic synthetic lethality screening in mouse embryo fibroblasts. FEBS Lett, 579(1), 199–202.
  14. Fauré, A., Naldi, A., Chaouiya, C., and Thieffry, D. (2006). Dynamical analysis of a generic Boolean model for the control of the mammalian cell cycle. Bioinformatics, 22(14), 124–131.
  15. Fisher, R. (1918). The correlations between relatives on the supposition of Mendelian inheritance. Trans. Roy. Soc. Edinb., 52, 399–433.
  16. Gorban, A. and Zinovyev, A. (2001). Method of elastic maps and its applications in data visualization and data modeling. International Journal of Computing Anticipatory Systems, pages 353–369.
  17. Gorban, A., Kegl, B., Wunsch, D., and Zinovyev, A., editors (2008). Principal Manifolds for Data Visualisation and Dimension Reduction, LNCSE 58. Springer.
  18. Gorban, A. N. and Zinovyev, A. (2010). Principal manifolds and graphs in practice: from molecular biology to dynamical systems. Int J Neural Syst, 20(3), 219–232.
  19. Gorban, A. N., A., P., and Zinovyev, A. (2014). ViDaExpert: user-friendly tool for nonlinear visualization and analysis of multidimensional vectorial data. arXiv preprint, http://arxiv.org/abs/1406.5550.
  20. Grieco, L., Calzone, L., Bernard-Pierrot, I., Radvanyi, F., Kahn-Perlès, B., and Thieffry, D. (2013). Integrative modelling of the influence of MAPK network on cancer cell fate decision. PLoS Computational Biology, 9(10), e1003286.
  21. Guarente, L. (1993). Synthetic enhancement in gene interaction: a genetic tool come of age. Trends in Genetics, 9(10), 362–366.
  22. Helikar, T., Kowal, B., McClenathan, S., Bruckner, M., Rowley, T., Madrahimov, A., Wicks, B., Shrestha, M., Limbu, K., and Rogers, J. A. (2012). The Cell Collective: toward an open and collaborative approach to systems biology. BMC Syst Biol, 6, 96.
  23. Kaelin, Jr, W. G. (2005). The concept of synthetic lethality in the context of anticancer therapy. Nat Rev Cancer, 5(9), 689–698.
  24. Lu, X., Kensche, P. R., Huynen, M. A., and Notebaart, R. A. (2013). Genome evolution predicts genetic interactions in protein complexes and reveals cancer drug targets. Nat Commun, 4, 2124.
  25. Mani, R., St Onge, R. P., Hartman, 4th, J. L., Giaever, G., and Roth, F. P. (2008). Defining genetic interaction. Proc Natl Acad Sci U S A, 105(9), 3461–3466.
  26. Naldi, A., Berenguier, D., Fauré, A., Lopez, F., Thieffry, D., and Chaouiya, C. (2009). Logical modelling of regulatory networks with GINsim 2.3. Biosystems, 97(2), 134–139.
  27. Nijman, S. M. B. (2011). Synthetic lethality: general principles, utility and detection using genetic screens in human cells. FEBS Lett, 585(1), 1–6.
  28. Nijman, S. M. B. and Friend, S. H. (2013). Cancer. Potential of the synthetic lethality principle. Science, 342(6160), 809–811.
  29. Novák, B. and Tyson, J. J. (2004). A model for restriction point control of the mammalian cell cycle. Journal of Theoretical Biology, 230(4), 563–579.
  30. Novere, N. L., Hucka, M., Mi, H., Moodie, S., Schreiber, F., Sorokin, A., Demir, E., Wegner, K., Aladjem, M. I., Wimalaratne, S. M., Bergman, F. T., Gauges, R., Ghazal, P., Kawaji, H., Li, L., Matsuoka, Y., Villeger, A., Boyd, S. E., Calzone, L., Courtot, M., Dogrusoz, U., Freeman, T. C., Funahashi, A., Ghosh, S., Jouraku, A., Kim, S., Kolpakov, F., Luna, A., Sahle, S., Schmidt, E., Watterson, S., Wu, G., Goryanin, I., Kell, D. B., Sander, C., Sauro, H., Snoep, J. L., Kohn, K., and Kitano, H. (2009). The systems biology graphical notation. Nat Biotech, 27(8), 735–741.
  31. Paul, J. M., Templeton, S. D., Baharani, A., Freywald, A., and Vizeacoumar, F. J. (2014). Building high-resolution synthetic lethal networks: a ‘Google map’ of the cancer cell. Trends in Molecular Medicine, 20(12), 704–715.
  32. Segrè, D., Deluna, A., Church, G. M., and Kishony, R. (2005). Modular epistasis in yeast metabolism. Nat Genet, 37(1), 77–83.
  33. Snitkin, E. S. and Segrè, D. (2011). Epistatic interaction maps relative to multiple metabolic phenotypes. PLoS Genet, 7(2), e1001294.
  34. Stark, C., Breitkreutz, B.-J., Reguly, T., Boucher, L., Breitkreutz, A., and Tyers, M. (2006). BioGRID: a general repository for interaction datasets. Nucleic Acids Res, 34(Database issue), D535–D539.
  35. Steen, K. V. (2012). Travelling the world of gene-gene interactions. Brief Bioinform, 13(1), 1–19.
  36. Stern, D. L. and Orgogozo, V. (2009). Is genetic evolution predictable? Science, 323(5915), 746–751.
  37. Stoll, G., Viara, E., Barillot, E., and Calzone, L. (2012). Continuous time Boolean modeling for biological signaling: application of Gillespie algorithm. BMC Syst Biol, 6, 116.
  38. Tong, A. H. Y., Lesage, G., Bader, G. D., Ding, H., Xu, H., Xin, X., Young, J., Berriz, G. F., Brost, R. L., Chang, M., Chen, Y., Cheng, X., Chua, G., Friesen, H., Goldberg, D. S., Haynes, J., Humphries, C., He, G., Hussein, S., Ke, L., Krogan, N., Li, Z., Levinson, J. N., Lu, H., Ménard, P., Munyana, C., Parsons, A. B., Ryan, O., Tonikian, R., Roberts, T., Sdicu, A.-M., Shapiro, J., Sheikh, B., Suter, B., Wong, S. L., Zhang, L. V., Zhu, H., Burd, C. G., Munro, S., Sander, C., Rine, J., Greenblatt, J., Peter, M., Bretscher, A., Bell, G., Roth, F. P., Brown, G. W., Andrews, B., Bussey, H., and Boone, C. (2004). Global mapping of the yeast genetic interaction network. Science, 303(5659), 808–813.
  39. Wang, X. and Simon, R. (2013). Identification of potential synthetic lethal genes to p53 using a computational biology approach. BMC Med Genomics, 6, 30.
  40. Wong, S. L., Zhang, L. V., Tong, A. H. Y., Li, Z., Goldberg, D. S., King, O. D., Lesage, G., Vidal, M., Andrews, B., Bussey, H., Boone, C., and Roth, F. P. (2004). Combining biological networks to predict genetic interactions. Proc Natl Acad Sci U S A, 101(44), 15682–15687.
  41. Zhang, Y., Li, B., Srimani, P. K., Chen, X., and Luo, F. (2012). Predicting synthetic lethal genetic interactions in Saccharomyces cerevisiae using short polypeptide clusters. Proteome Sci, 10 Suppl 1, S4.
  42. Zhong, W. and Sternberg, P. W. (2006). Genome-wide prediction of C. elegans genetic interactions. Science, 311(5766), 1481–1484.
  43. Zinovyev, A., Viara, E., Calzone, L., and Barillot, E. (2008). BiNoM: a Cytoscape plugin for manipulating and analyzing biological networks. Bioinformatics, 24(6), 876–877.
  44. Zinovyev, A., Fourquet, S., Tournier, L., Calzone, L., and Barillot, E. (2012). Cell death and life in cancer: mathematical modeling of cell fate decisions. Adv Exp Med Biol, 736, 261–274.
Posted April 24, 2015.