Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Benchmark Problems for Dynamic Modeling of Intracellular Processes

View ORCID ProfileHelge Hass, View ORCID ProfileCarolin Loos, Elba Raimundez Alvarez, Jens Timmer, View ORCID ProfileJan Hasenauer, Clemens Kreutz
doi: https://doi.org/10.1101/404590
Helge Hass
1Center for Systems Biology (ZBSA), University of Freiburg, 79104 Freiburg, Germany
2Center for Data Analysis and Modelling (FDM), University of Freiburg, 79104 Freiburg, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Helge Hass
Carolin Loos
3Helmholtz Zentrum München-German Research Center for Environmental Health, Institute of Computational Biology, 85764 Neuherberg, Germany
4Technische Universität München, Center for Mathematics, Chair of Mathematical Modeling of Biological Systems, 85748 Garching, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Carolin Loos
Elba Raimundez Alvarez
3Helmholtz Zentrum München-German Research Center for Environmental Health, Institute of Computational Biology, 85764 Neuherberg, Germany
4Technische Universität München, Center for Mathematics, Chair of Mathematical Modeling of Biological Systems, 85748 Garching, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jens Timmer
1Center for Systems Biology (ZBSA), University of Freiburg, 79104 Freiburg, Germany
2Center for Data Analysis and Modelling (FDM), University of Freiburg, 79104 Freiburg, Germany
5BIOSS Centre for Biological Signalling Studies, University of Freiburg, 79104 Freiburg, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jan Hasenauer
3Helmholtz Zentrum München-German Research Center for Environmental Health, Institute of Computational Biology, 85764 Neuherberg, Germany
4Technische Universität München, Center for Mathematics, Chair of Mathematical Modeling of Biological Systems, 85748 Garching, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jan Hasenauer
Clemens Kreutz
1Center for Systems Biology (ZBSA), University of Freiburg, 79104 Freiburg, Germany
2Center for Data Analysis and Modelling (FDM), University of Freiburg, 79104 Freiburg, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Motivation Dynamic models are used in systems biology to study and understand cellular processes like gene regulation or signal transduction. Frequently, ordinary differential equation (ODE) models are used to model the time and dose dependency of the abundances of molecular compounds as well as interactions and translocations. A multitude of computational approaches have been developed within recent years. However, many of these approaches lack proper testing in application settings because a comprehensive set of benchmark problems is yet missing.

Results We present a collection of 20 ODE models developed given experimental data as benchmark problems in order to evaluate new and existing methodologies, e.g. for parameter estimation or uncertainty analysis. In addition to the equations of the dynamical system, the benchmark collection provides experimental measurements as well as observation functions and assumptions about measurement noise distributions and parameters. The presented benchmark models comprise problems of different size, complexity and numerical demands. Important characteristics of the models and methodological requirements are summarized, estimated parameters are provided, and some example studies were performed for illustrating the capabilities of the presented benchmark collection.

Availability The models are provided in several standardized formats, including an easy-to-use human readable form and machine-readable SBML files. The data is provided as Excel sheets. All files are available at https://github.com/Benchmarking-Initiative/Benchmark-Models, with MATLAB code to process and simulate the models.

Contact jan.hasenauer{at}helmholtz-muenchen.de, ckreutz{at}fdm.uni-freiburg.de

1 Introduction

Dynamic models based on ordinary differential equations (ODEs) have become a widely used tool in systems biology to quantitatively describe regulatory processes in living cells. Within this approach, known biochemical interactions of important compounds can be translated into rate equations describing the temporal evolution of the state of biological processes. Experimental data is then used to estimate parameters like rate constants or initial concentrations and to validate or improve the model structure.

The dimensionality and nonlinearity of these models constitute a challenge for numerical and statistical methods regarding parameter estimation and identification of the most plausible model structure. For that reason, a multitude of new modeling techniques have been developed within recent years. However, they are often not well-tested especially in realistic application settings and therefore performance benefits or limitations are unknown (Vyshemirsky and Girolami, 2008; Lillacci and Khammash, 2010; Raue et al., 2013; Hug et al., 2013; Degasperi et al., 2017; Maier et al., 2016; Stapor et al., 2018). Since the performance of computational approaches depends on model characteristics such as nonlinearity, number of parameters or amount of experimental data, it is essential to have a reasonably large set of benchmark problems. These need to cover a broad range of application settings in order to generalize results obtained in performance studies to new modeling projects.

One frequent limitation is that realistic measurements are typically not available for evaluations. Simulated data, as an example, is often much more informative in terms of number of data points (Tönsing et al., 2014) and does not have a complex noise structure (Villaverde et al., 2015) like measurements from living cells. Moreover, in most cases experimental measurements require augmenting the equations of the dynamic model with so-called observation functions containing scalingsand/or offset parameters, together with transformations of the data such as a log-transformation.

In many scientific fields benchmark collections are available, however, only a limited number of benchmark problems are currently available for modeling intracellular processes and they cover only a small set of application setups: (i) Six benchmark models have been published by Villaverde et al. (2015), however for most of them, only simulated data are provided. For the models with experimental data, one has less data points than parameters, and the other provides its equations only in a compiled version, which limits their use for model evaluation. (ii) Additional benchmark problems were defined within the DREAM6 (Dialogue on Reverse-Engineering Assessment and Methods) and DREAM7 challenges. However, both challenges only had simulated data available because the models do not represent real biological networks occurring in specific living cells. In addition, abundances of the molecular compounds were assumed as known initial values and the dynamic variables were assumed as directly measured without observation functions which renders these problems as rather unrealistic. (iii) Public repositories, e.g., the Biomodels database (Le Novere et al., 2006) provide a large number of realistic / published models. Unfortunately, for most models the measured data used for calibration is not or only partly provided. Moreover, if the data is published, the description of the link between model and data is often not sufficient, i.e., the noise model and observation functions are not comprehensively defined as required for a non-ambiguous benchmark problem. One major reason for this might be that current standards for defining models like the Systems Biology Markup Language (SBML) (Hucka et al., 2003) only comprise the biological part of the model but do not contain equations for observations and noise models used to estimate parameters. Standards for the encoding of experimental descriptions, such as the Simulation Experiment Description Markup Language (SED-ML) (Waltemath et al., 2011), are unfortunately not yet used widely and only supported by a fraction of the available tools.

In this manuscript, 20 models of biochemical reaction networks are presented which should serve as a comprehensive set of benchmark problems enabling testing of a multitude of data-based modeling approaches. The models have different complexity ranging from 9 to 269 parameters. All models comprise measured data (21 to 27132 data points per model). We also provide measurement errors either determined experimentally or from an underlying error model.

2 Methodology

2.1 Pathway models

Biochemical reaction networks can be modeled using reaction rate equations, Embedded Image which describe the dynamics of compound concentrations Embedded Image as a function of parameters θ (Section 2.3) and inputs Embedded Image (Section 2.4).

The initial values x(0) of Eq. (1) might be known. However, in most applications some elements of x(0) are unknown and defined as parameters, i.e., Embedded Image, or functions of parameters, i.e., x(0) = x0(θ). Mathematically, we distinguish between three classes:

  1. The initial conditions might be known / given, e.g., zero before treatment.

  2. The initial conditions might be analytical functions of the parameters, e.g., analytical solutions to a steady-state constraint (Rosenblatt et al., 2016).

  3. The initial conditions might be non-analytical expressions of the parameters, e.g., the result of a pre-simulation Embedded Image of an experimental condition (Rosenblatt et al., 2016; Fiedler et al., 2016).

For a detailed discussion we refer to Rosenblatt et al. (2016) and Fiedler et al. (2016).

2.2 Measurement errors

The state variables of reaction rate equations are linked to measurements via observation functions gi(x, θ), i = 1,…, Nobs, which describe the properties of the experimental device / technique used to acquire measurement data. The observation functions might be nonlinear functions of the state variables, e.g., if the readout saturates, for considering detection limits, and comprise scalings (Loos et al., 2018). For all presented benchmark models, independent normally distributed, additive errors are assumed for the measurements Embedded Image

Note that in the chosen notation, index i enumerates each observation/data point yi at a specific time point and each corresponding standard deviation σi of the measurement error individually.

We consider two broad classes of error models:

  1. The standard deviation σi of measurement errors might be determined as part of the experiment and processing of raw data, e.g., by computing standard errors across replicates. In this case, each data point yi has a given, fixed value σi specifying the accuracy of the measurement.

  2. Standard deviations might be unknown and therefore described as error models with error parameters which might be jointly estimated with other model parameters. The function can depend on parameters, state variables or both.

While class 1 yields a parameters estimation problem with fewer parameters, class 2 does not require the calculation of σi from a potentially small number of replicates and the statistical model accounts for imperfect knowledge of σi (Raue et al., 2013).

An error model E describes the dependence of the standard deviation of an observation on the error parameters θerr and the state variables x, σi = fnc(gi(x, θ), θerr). The most basic parameterdependent error models are unknown standard deviations for the individual observations, ∀i: σi = θabserr,i, or sets of observations Is, s = 1,…, ns, i.e., Embedded Image

Parameter- and state-dependent error models are for instance Embedded Image and Embedded Image with two parameters for absolute or relative noise levels. E(2) is obtained if relative and absolute errors are assumed as two independent sources of variability. E(3) is a phenomenological model which often realistically describes absolute and relative components of observed measurement errors.

2.3 Parameters

Dynamic models in systems biology comprise up to three classes of parameters:

  • Dynamic parameters θdyn that determine the initial states x(0) and the dynamics of the process, see Eq. (1). These parameters are rate constants such as association/dissociation rates or -constants, translocation rates between intraor extracellular compartments, or parameters like Michaelis-Mentenand Hill-coefficients, efficiencies of genetic perturbations or parameters of input functions. We note that the dynamic parameters θdyn do not change over time, although the name might suggest otherwise.

  • Observation parameters θobs that describe the relationship between concentrations of intracellular compounds with outputs, e.g., intensities in an assay. These parameters are for example scaling factors or offsets (Weber et al., 2011).

  • Error parameters θerr that describe the unknown noise levels (see Section 2.2).

Since dynamic parameters depend on the biological context and observationand error parameters are determined by the experimental setup, there is often only a limited amount of prior knowledge about parameters available. For the benchmark models, upperand lower bounds are defined for all parameters. In most cases, these bounds cover eight orders of magnitude or even more. In some cases, additional prior knowledge in terms of prior distributions or penalties is available for specific parameters.

The parameters of the biological process are often transformed to improve the convergence of optimization (Raue et al., 2013) and to eliminate structural non-identifiabilities (Maiwald et al., 2016). A common practice is the transformation of the parameters from linear to logarithmic scale. However, there are also problem-specific transformations as described in the Supplement for the Bachmann or Becker models.

2.4 Inputs

Inputs u describe the dependence of the biochemical reaction network on external factors as well as perturbations. Examples are externally controlled concentration of ligands or nutrients, or genetic perturbations like knockouts or overexpression. Time dependent inputs are often parameterized functions such as polynomials, splines (Schelker et al., 2012) or control vectors (Banga et al., 2005). Time-dependent inputs u = u(t) might depend on parameters which is denoted by u(t, θ) in the following.

3 Model and data formats

For a thorough evaluation of computational methods, we provide a set of 20 published models and corresponding data sets. The models have been extracted from the literature and have been developed by more than 10 different research groups. The information is stored in an easily accessible and standardized format, including an Excel file with general specifications of the model and its fit results. Measurements and model equations are stored as separate Excel files and for each experiment individually. In the data files, measurements with uncertainties and results from the corresponding model simulations are stored. The model files contain finalized ODEs including experiment-specific parameter assignments and observation functions, and are provided as userreadable Excel file and in the standardized, machine-readable SBML standard (Hucka et al., 2003). For a detailed description of the provided files, we refer to Supplementary Section 1.

4 Results

4.1 Benchmark collection

The main focus of this paper is to introduce a comprehensive collection of benchmark problems and their formulation in a standardized format. A comprehensive overview of the benchmark problems is provided in Table 1 on page 12.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1:

Table summarizing the 20 benchmark models and their properties. The models are abbreviated with the last name of the first author. Many models are based on Western blot data. The number of experimental conditions is specified as the number of different simulation conditions. The feature abbreviations denote the following: C = several compartments, E(1) = constant error parameters, Eq. (3), E(2) = error model of Eq. (4), E(3) = error model of Eq. (5) of main manuscript, Ex = known measurement errors, ev = events, NI = non-identifiable parameters, u(t) = time dependent input function, u(t, θ) = input function with unknown parameter(s). Initial values are specified according to the following order: Embedded Image = known initial values Embedded Image = initial condition given by unknown parameters, x (θ) = parameter dependent functions, and Embedded Image = pre-equilibration for initial steady state conditions. The models are described in more detail in Supplementary Section 3.

The benchmark problems cover a wide range of model and data set sizes (Fig. 1A). A local identifiability analysis using the identifiability test by radial penalization (ITRP) (Kreutz, 2018) revealed that most benchmark models possess non-identifiable parameters. Furthermore, we found that initial conditions are specified in multiple ways, e.g., as equilibrium points of an unperturbed condition, and that different types of noise models and input functions are used (Fig. 1B). This results in a large number of combinations which have to be covered by computational modeling tools.

Figure 1:
  • Download figure
  • Open in new tab
Figure 1: Property distribution in the presented benchmark collection.

(A) Histograms for numerical model properties: number of observables, conditions, data points and parameters. Properties of individual models are indicated with an overlayed parallel coordinate plot. (B) Mosaic plot for the categoric model properties: initial conditions (columns), error models (color) and inputs (saturation). The areas encode the percentage of models with a particular combination of properties. Combinations of model properties which are not observed are crossed out in the legend. Nonanalytical parameter-dependent initial conditions can not be solved analytically and are obtained by simulating the system to steady state.

Although our collection is not unbiased, the spectrum of properties in the published models reveals requirements to be covered by modeling and parameter estimation tools. In the following, we will use the benchmark collection to assess a few common questions and statements.

4.2 Log-transformation of model parameters

A variety of studies in the systems biology field advocate the use of log-transformed parameters, ξ = log10(θ), for optimization: “For parameters that are by definition non-negative a log-scale should be used in the parameter estimation.” (Raue et al., 2013) and recent evaluations verified that this can improve computational efficiency (Kreutz, 2016; Villaverde et al., 2018). A comprehensive evaluation on application problems is however missing and the precise reason for the improvement is still unclear. Here, we used the compiled benchmark collection to confirm the finding for multi-start local optimization (Fig. 2A) and to assess whether changes in the objective function landscape might be a potential reason. The performance metric is the average number of converged starts per minute (see Villaverde et al. (2018)). Starts are considered to be converged if the objective function value differs at most by 10−1 from the best objective function value found across all runs for the given benchmark problem, whereby we only included the models for which the best value was found more than once. We also found a strong dependence of the results on this threshold (see Supplementary Information, Fig. S9).

Figure 2:
  • Download figure
  • Open in new tab
Figure 2: Linear vs. logarithmic scale.

(A) Performance of the multi-start local optimization scheme using the MATLAB optimizer lsqnonlin for: (x-axis) sampling of initial values in log scale and optimization in linear scale; and (y-axis) sampling and optimization in log scale. Performance is measured as average number of converged starts per minute. (B) Level-sets of the objective function for a synthesis-degradation process (see Supplementary Information, Section 4) in linear parameters and log-transformed parameters. (C) Convexity properties of the benchmark problems in linear parameters and log-transformed parameters. It is indicated whether the two parameters are sampled in linear or log space and whether the connection between the two parameters is checked in linear or log space. Statistically significant differences are shown (p-value for rank sum test, * = < 0.05, ** = < 0.01).

Log-transformation leaves the optima unchanged but changes the shape of the level-sets of the objective function. We found several examples for which the level-sets are non-convex in the parameter θ, but convex in log-transformed parameters ξ (see, e.g., Fig. 2B). As local optimizers are well suited for convex problems, the change in the level set structure could be a reason for the improvement. To assess whether log-transformation improved the convexity of the objective function, we drew a random parameter vector θ(1) ∈ Ω and a second random vector θ(2) ∈ Ω with ‖θ(2) − θ(1)‖ = 1 and a random location on the connecting line, α ∼𝒰 (0, 1). For convex problems, the objective function J satisfies ∀ θ(1), θ(2) and α: Embedded Image

Accordingly, the fraction of triples (θ(1), θ(2), α) for which (6) holds provides a measure of convexity. We evaluated this measure for different combination of sampling strategies for θ(1) and θ(2) (lin or log scale, indicated in the x-axis of Fig. 2C), and checking the connecting the two parameters in lin or log scale (see Supplementary Information, Section 5). For each combination, we sampled 1000 triples. Our comparison revealed that for most application problems, log-transformation increases the considered measure of convexity (Fig. 2C). Indeed, some problems appear to be completely convex when using log-transformed parameters. This provides a mechanistic explanation for the observed improvement in optimizer convergence.

4.3 Performance of local optimization methods

The no free lunch theorem for optimization states that

“[…] what an algorithm gains in performance on one class of problems is necessarily offset by its performance on the remaining problems.” (Wolpert and Macready, 1997)

This implies that effective optimization relies on a fortuitous matching between an optimization method and an optimization problem. Here, we used the benchmark collection to assess the performance of the trust-region-reflective and the interior-point algorithm in the MATLAB function fmincon (The MathWorks, 2016) to parameter optimization problems encountered in systems biology. These local optimizers are widely used, indeed, there are studies using both optimizers to exploit there individual benefits and performance differences (Stapor et al., 2018). The choice of the optimizer has direct implication for multi-start local optimization methods (Raue et al., 2013) and meta-heuristics (Villaverde et al., 2018), but also for uncertainty analysis using profile likelihoods (Raue et al., 2009).

For fmincon, mainly the default settings provided by MATLAB were chosen, which can be obtained by optimoptions(’fmincon’). Therein, the algorithm was chosen as trust-region-reflective or interior-point, respectively. Additional changes to the default settings comprise:

  • A user-defined gradient and Hessian for Gauss-Newton optimization.

  • The tolerance on first-order optimality was set to 0.

  • Termination tolerance on the parameters was set to 10−6.

  • As subproblem-algorithm, cg was always chosen.

  • The maximum number of iterations was set to 10000.

The trust-region-reflective algorithm is tailored to optimization problems with linear constraints. The trail step of the optimizer is obtained by minimizing a quadratic approximation of the objective function within the trust region (which is chosen adaptively). Parameter bounds are handled in the step construction by scaling and reflection. The interior-point algorithm is a general purpose method (and the MATLAB default) for optimization problems with linear and nonlinear constraints. It solves a sequence of approximate optimization problems with barrier functions. In each iteration a direct step obtained by solving the so-called Karush-Kuhn-Tucker condition or conjugate gradient step using a trust region is performed. For details we refer to the MATLAB documentation (The MathWorks, 2016).

We performed multi-start local optimization with 1000 fits for all benchmark models. Our results revealed that for the considered benchmark problems the trust-region-reflective algorithm tends to outperform the interior-point algorithm (Fig. 3 and Supplementary Information, Figs. S10S30). Indeed, the trust-region-reflective algorithm achieved a higher number of converged starts per computation time for 18 of the 20 benchmark problems and is for 9 benchmark problems the only algorithm finding the optimal solution. However, the optimal solutions for 2 benchmark problems were only obtained using the interior-point algorithm. Accordingly, although the trustregion-reflective algorithm (which is not the MATLAB default) achieves the higher reliability and performance, it can be beneficial to test alternative local optimizers. Additional information of the multi-start fits and the computation time for each model, as well as a comparison of the trustregion-reflective and the interior point method with the least-squares solver implemented in the MATLAB function lsqnonlin can be found in Supplementary Information, Sections 2, 6 and 7.

Figure 3:
  • Download figure
  • Open in new tab
Figure 3: Comparison of optimizer performance.

Scatter plot of the average number of converged starts per minute for the interior-point algorithm vs. trust-region-reflective algorithm.

4.4 Number of steps for local optimizers

Common questions in practical applications are (i) for how many steps (or iterations) a local optimizer should be run, and (ii) how the number of steps depends on the number of the parameters. For many local optimization algorithms, such bounds and results for scaling properties are available. For interior-point it has for instance been reported that for rather general classes of convex problems

“[…] the number of Newton steps hardly grows at all with m [the number of constraints author’s note] (or any other parameter, in fact).” (Boyd and Van-denberghe, 2004, Section 11.5.6)

Similar findings are reported for other methods (see, e.g., Nesterov (2013)). As the independence of the number of optimization steps from the number of parameters might be surprising, we set out to assess the properties on the benchmark collection. For each problem, the trust-region-reflective algorithm implemented in the MATLAB function lsqnonlin was run, without constraints on the maximum number of function evaluation.

Our assessment of the average number of optimizer steps (Fig. 4) revealed that on average 391±19 iterations were taken. There is – as predicted by theory for convex problems – no significant dependence on the number of parameters (ρ = 0.02, p-value = 0.93). Accordingly, our analysis on the benchmark collection validated for the first time that the theoretical results also hold for application problems in systems biology (which are in general not convex).

Figure 4:
  • Download figure
  • Open in new tab
Figure 4: Influence of problem size.

(A) Average number of optimizer iterations and (B) average computation time vs. the number of parameters. For optimization the trust-region-reflective algorithm implemented in the MATLAB function lsqnonlin was used and the averages across 1000 runs with different starting points were computed. The influence of the number of parameters was analyzed using correlation analysis and linear regression.

In contrast to the number of iterations, the computation time of local optimization depended on the number of parameters (ρ = 0.75, p-value = 1.5 · 10−4). For the trust-region-reflective algorithm using forward sensitivities for gradient calculation, we observed a roughly quadratic dependence Embedded Image

5 Discussion

Mechanistic dynamical models are used to describe and analyze biochemical reaction networks, to determine unknown parameters, gain biological insights and perform in-silico experiments. Novel methods to address these challenging tasks are proposed on a regular basis, however, a thorough assessment is often problematic. To address this problem, we compiled a collection of 20 benchmark problems. Reusability was ensured by providing the models in the machine-readable SBML format and the experimental data in structured Excel files. In addition, all aforementioned models are included in the open-source MATLAB toolbox Data2Dynamics (Raue et al., 2015) and the analysis scripts are provided as Supplementary Material.

To ensure that the benchmark problems are realistic and practically relevant, we exclusively included published models and measured experimental data. This is a key difference to existing benchmark collections which mostly considered models with simulated data (Villaverde et al., 2015; Ballnus et al., 2017). The benchmark models possess a broad spectrum of properties (e.g., different types of initial conditions, noise models and inputs), as well as challenges (e.g., structural and practical non-identifiabilities, and objective functions with multiple minima and valleys). The size of the benchmark problems ranges from roughly 20 data points, 10 parameters to be optimized and a single experimental condition to large models with more than 1000 data points, over 200 parameters and up to 110 distinct experimental conditions. This facilitates the assessment of the scaling behavior of novel algorithms.

We illustrated the value of the benchmark collection by performing three different analyses: (i) Our study of parameter transformations confirmed that optimization benefits from log-transformed parameter space. Furthermore, it suggested that the reason could be a significant increase of convexity of most problems, which provides a more benign setting for local optimizers. The observed change in the convexity appears to be the first mechanistic explanation for the observed improvement in optimizer performance. (ii) Our comparison of trust-region-reflective and interior-point algorithms revealed that the former is better suited for most parameter estimation problems encountered in systems biology. (iii) Our analysis of the scaling behavior confirmed theoretical results showing that the number of optimizer steps does not depend on the number of model parameters. The results of analyses (i)-(iii) could not have been obtained without the benchmark collection, which provided the means for a fair comparison. Indeed, the reliability of the findings depends directly on the size and the representativeness of the benchmark collection. Amongst others, previous studies were not able to provide an assessment of the scaling properties (Raue et al., 2013; Villaverde et al., 2015; Ballnus et al., 2017).

In conclusion, we think that the compiled benchmark collection will be an important resource for the systems biology community. It will facilitate the thorough evaluation of novel computational methods and support an unbiased assessment. In the future, the list of benchmark problems should be extended to enable a more fine-grained analysis and it should be integrated with public resources such as the BioModels database (Le Novere et al., 2006). Therefore, we encourage researchers to provide further models and data sets, e.g., by uploading them to our GitHub repository to obtain an even more powerful collection of benchmark models.

Funding

This work was supported by the German Ministry of Education and Research by the grant EA:Sys (FKZ 031L0080), the grant SYS-Stomach (01ZX1310B) and the grant INCOME (FKZ 01ZX1705A), as well as by the German Research Foundation (DFG) via grant TRR179. The authors also acknowledge support for high-performance computing by the state of Baden-Württemberg through the bwHPC initiative, which is supported by DFG grant INST 35/1134-1 FUGG.

References

  1. Bachmann, J., Raue, A., Schilling, M., Böhm, M. E., Kreutz, C., Kaschek, D., Busch, H., Gretz, N., Lehmann, W. D., Timmer, J., and Klingmüller, U. (2011). Division of labor by dual feedback regulators controls JAK2/STAT5 signaling over broad ligand range. Mol. Syst. Biol., 7, 516.
    OpenUrlAbstract/FREE Full Text
  2. ↵
    Ballnus, B., Hug, S., Hatz, K., Görlitz, L., Hasenauer, J., and Theis, F. J. (2017). Comprehensive benchmarking of Markov chain Monte Carlo methods for dynamical systems. BMC Syst. Biol, 11(63).
  3. ↵
    Banga, J. R., Balsa-Canto, E., Moles, C. G., and Alonso, A. A. (2005). Dynamic optimization of bioprocesses: efficient and robust numerical strategies. J. Biotechnol., 117, 407–419.
    OpenUrlCrossRefPubMed
  4. Becker, V., Schilling, M., Bachmann, J., Baumann, U., Raue, A., Maiwald, T., Timmer, J., and Klingmüller, U. (2010). Covering a broad dynamic range: information processing at the erythropoietin receptor. Science, 328(5984), 1404–1408.
    OpenUrlAbstract/FREE Full Text
  5. Beer, R., Herbst, K., Ignatiadis, N., Kats, I., Adlung, L., Meyer, H., Niopek, D., Christiansen, T., Georgi, F., Kurzawa, N., Meichsner, J., Rabe, S., Riedel, A., Sachs, J., Schessner, J., Schmidt, F., Walch, P., Niopek, K., Heinemann, T., Eils, R., and Di Ventura, B. (2014). Creating functional engineered variants of the single-module non-ribosomal peptide synthetase IndC by T domain exchange. Mol. Biosyst, 10(7), 1709–1718.
    OpenUrlCrossRefPubMed
  6. Boehm, M. E., Adlung, L., Schilling, M., Roth, S., Klingmüller, U., and Lehmann, W. D. (2014). Identification of isoform-specific dynamics in phosphorylation-dependent STAT5 dimerization by quantitative mass spectrometry and mathematical modeling. J. Proteome Res., 13(12), 5685–5694.
    OpenUrl
  7. ↵
    Boyd, S. and Vandenberghe, L. (2004). Convex Optimisation.Cambridge University Press, UK.
  8. Bránnmark, C., Palmer, R., Glad, S. T., Cedersund, G., and Strålfors, P. (2010). Mass and information feedbacks through receptor endocytosis govern insulin signaling as revealed using a parameter-free modeling framework. J. Biol. Chem., 285(26), 20171–20179.
    OpenUrlAbstract/FREE Full Text
  9. Bruno, M., Koschmieder, J., Wuest, F., Schaub, P., Fehling-Kaschek, M., Timmer, J., Beyer, P., and Al-Babili, S. (2016). Enzymatic study on atccd 4 and atccd 7 and their potential to form acyclic regulatory metabolites. Journal of Experimental Botany, 67(21), 5993–6005.
    OpenUrlCrossRefPubMed
  10. Chen, W. W., Schoeberl, B., Jasper, P. J., Niepel, M., Nielsen, U. B., Lauffenburger, D. A., and Sorger, P. K. (2009). Input–output behavior of ErbB signaling pathways as revealed by a mass action model trained against dynamic data. Mol. Syst. Biol., 5(1), 239.
    OpenUrlAbstract/FREE Full Text
  11. Crauste, F., Mafille, J., Boucinha, L., Djebali, S., Gandrillon, O., Marvel, J., and Arpin, C. (2017). Identification of nascent memory CD8 T cells and modeling of their ontogeny. Cell Systems, 4(3), 306–317.
    OpenUrl
  12. ↵
    Degasperi, A., Fey, D., and Kholodenko, B. N. (2017). Performance of objective functions and optimisation procedures for parameter estimation in system biology models. NPJ. Systems Biology and Applications, 3(1), 20.
    OpenUrl
  13. ↵
    Fiedler, A., Raeth, S., Theis, F. J., Hausser, A., and Hasenauer, J. (2016). Tailored parameter optimization methods for ordinary differential equation models with steady-state constraints. BMC Syst. Biol., 10(80).
  14. Fujita, K. A., Toyoshima, Y., Uda, S., Ozaki, Y.-i., Kubota, H., and Kuroda, S. (2010). Decoupling of receptor and downstream signals in the Akt pathway by its low-pass filter characteristics. Sci. Signal., 3(132), ra56–ra56.
    OpenUrlAbstract/FREE Full Text
  15. Hass, H., Kipkeew, F., Gauhar, A., Bouch, E., May, P., Timmer, J., and Bock, H. H. (2017). Mathematical model of early Reelin-induced Src family kinase-mediated signaling. PLoS ONE, 12(10), 1–16.
    OpenUrlCrossRefPubMed
  16. ↵
    Hucka, M., Finney, A., Sauro, H. M., Bolouri, H., Doyle, J. C., Kitano, H., Arkin, A. P., Bornstein, B. J., Bray, D., Cornish-Bowden, A., Cuellar, A. A., Dronov, S., Gilles, E. D., Ginkel, M., Gor, V., Goryanin, I. I., Hedley, W. J., Hodgman, T. C., Hofmeyr, J.-H., Hunter, P. J., Juty, N. S., Kasberger, J. L., Kremling, A., Kummer, U., Le Novere, N., Loew, L. M., Lucio, D., Mendes, P., Minch, E., Mjolsness, E. D., Nakayama, Y., Nelson, M. R., Nielsen, P. F., Sakurada, T., Schaff, J. C., Shapiro, B. E., Shimizu, T. S., Spence, H. D., Stelling, J., Takahashi, K., Tomita, M., Wagner, J., Wang, J., and S. B. M. L Forum (2003). The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics, 19(4), 524–531.
    OpenUrlCrossRefPubMedWeb of Science
  17. ↵
    Hug, S., Raue, A., Hasenauer, J., Bachmann, J., Klingmüller, U., Timmer, J., and Theis, F. (2013). High-dimensional bayesian parameter estimation: case study for a model of JAK2/STAT 5 signaling. Math. Biosci., 246(2), 293–304.
    OpenUrlCrossRefPubMed
  18. Isensee, J., Kaufholz, M., Knape, M. J., Hasenauer, J., Hammerich, H., Gonczarowska-Jorge, H., Zahedi, R. P., Schwede, F., Herberg, F. W., and Hucho, T. (2018). PKA-RII subunit phosphorylation precedes activation by camp and regulates activity termination. J. Cell Biol.
  19. ↵
    Kreutz, C. (2016). New concepts for evaluating the performance of computational methods. IFAC-PapersOnLine, 49(26), 63–70.
    OpenUrl
  20. ↵
    Kreutz, C. (2018). An easy and efficient approach for testing identifiability. Bioinformatics, 34(11), 1913–1921.
    OpenUrl
  21. ↵
    Le Novere, N., Bornstein, B., Broicher, A., Courtot, M., Donizelli, M., Dharuri, H., Li, L., Sauro, H., Schilstra, M., Shapiro, B., Snoep, J. L., and Hucka, M. (2006). Biomodels database: a free, centralized database of curated, published, quantitative kinetic models of biochemical and cellular systems. Nucleic Acids Res., 34, D689–D691.
    OpenUrlCrossRefPubMedWeb of Science
  22. ↵
    Lillacci, G. and Khammash, M. (2010). Parameter estimation and model selection in computational biology. PLoS Comput. Biol., 6(3), e1000696.
    OpenUrlCrossRefPubMed
  23. ↵
    Loos, C., Krause, S., and Hasenauer, J. (2018). Hierarchical optimization for the efficient parametrization of ODE models. Bioinformatics, bty514.
  24. Lucarelli, P., Schilling, M., Kreutz, C., Vlasov, A., Boehm, M. E., Iwamoto, N., Steiert, B., Lattermann, S., Wäsch, M., Stepath, M., Matter, M. S., Heikenwalder, M., Hoffmann, K., Deharde, D., Damm, G., Seehofer, D., Muciek, M., Gretz, N., Lehmann, W. D., Timmer, J., and Klingmüller, U. (2018). Resolving the combinatorial complexity of smad protein complex formation and its link to gene expression. Cell Systems, 6(1), 75–89.e11.
    OpenUrl
  25. ↵
    Maier, C., Loos, C., and Hasenauer, J. (2016). Robust parameter estimation for dynamical systems from outlier-corrupted data. Bioinformatics, 33(5), 718–725.
    OpenUrl
  26. ↵
    Maiwald, T., Hass, H., Steiert, B., Vanlier, J., Engesser, R., Raue, A., Kipkeew, F., Bock, H. H., Kaschek, D., Kreutz, C., and Timmer, J. (2016). Driving the model to its limit: Profile likelihood based model reduction. PLoS One, 11(9), e0162366.
    OpenUrl
  27. Merkle, R., Steiert, B., Salopiata, F., Depner, S., Raue, A., Iwamoto, N., Schelker, M., Hass, H., Wäsch, M., Boehm, M. E., Mücke, O., Lipka, D. B., Plass, C., Lehmann, W. D., Kreutz, C., Timmer, J., Schilling, M., and Klingmueller, U. (2016). Identification of cell type-specific differences in erythropoietin receptor signaling in primary erythroid and lung cancer cells. PLoS Comput. Biol., 12(8), e1005049.
    OpenUrl
  28. ↵
    Nesterov, Y. (2013). Introductory Lectures on Convex Optimization: A Basic Course, volume 87. Springer Science & Business Media.
  29. Raia, V., Schilling, M., Bohm, M., Hahn, B., Kowarsch, A., Raue, A., Sticht, C., Bohl, S., Saile, M., Möller, P., Gretz, N., Timmer, J., Theis, F., Lehmann, W.-D., Lichter, P., and Klingmüller, U. (2011). Dynamic mathematical modeling of IL13-induced signaling in hodgkin and primary mediastinal B-cell lymphoma allows prediction of therapeutic targets. Cancer Res., 71(3), 693–704.
    OpenUrlAbstract/FREE Full Text
  30. ↵
    Raue, A., Kreutz, C., Maiwald, T., Bachmann, J., Schilling, M., Klingmüller, U., and Timmer, J. (2009). Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood. Bioinformatics, 25(25), 1923–1929.
    OpenUrlCrossRefPubMedWeb of Science
  31. ↵
    Raue, A., Schilling, M., Bachmann, J., Matteson, A., Schelker, M., Kaschek, D., Hug, S., Kreutz, C., Harms, B. D., Theis, F. J., et al. (2013). Lessons learned from quantitative dynamical modeling in systems biology. PLoS ONE, 8(9), e74335.
    OpenUrlCrossRefPubMed
  32. ↵
    Raue, A., Steiert, B., Schelker, M., Kreutz, C., Maiwald, T., Hass, H., Vanlier, J., Tönsing, C., Adlung, L., Engesser, R., Mader, W., Heinemann, T., Hasenauer, J., Schilling, M., Höfer, T., Klipp, E., Theis, F. J., Klingmüller, U., Schöberl, B., and J. Timmer (2015). Data2Dynamics: a modeling environment tailored to parameter estimation in dynamical systems. Bioinformatics, 31(21), 3558–3560.
    OpenUrlCrossRefPubMed
  33. ↵
    Rosenblatt, M., Timmer, J., and Kaschek, D. (2016). Customized steady-state constraints for parameter estimation in non-linear ordinary differential equation models. Front Cell Dev. Biol., 4(41).
  34. ↵
    Schelker, M., Raue, A., Timmer, J., and Kreutz, C. (2012). Comprehensive estimation of input signals and dynamics in biochemical reaction networks. Bioinformatics, 28(18), i529–i534.
    OpenUrlCrossRefPubMed
  35. Schwen, L., Schenk, A., Kreutz, C., Timmer, J., Rodriguez, M. B., Kuepfer, L., and Preusser, T. (2015). Representative sinusoids for hepatic four-scale pharmacokinetics simulations. Plos One, 10, e0133653.
    OpenUrlCrossRefPubMed
  36. Sobotta, S., Raue, A., Huang, X., Vanlier, J., Jünger, A., Bohl, S., Albrecht, U., Hahnel, M. J., Wolf, S., Mueller, N. S., et al. (2017). Model based targeting of IL-6-induced inflammatory responses in cultured primary hepatocytes to improve application of the JAK inhibitor ruxolitinib. Front Physiol., 8, 775.
    OpenUrl
  37. ↵
    Stapor, P., Fröhlich, F., and Hasenauer, J. (2018). Optimization and profile calculation of ODE models using second order adjoint sensitivity analysis. Bioinformatics, 34(13), i151–i159.
    OpenUrl
  38. Swameye, I. M üller, T., Timmer, J., Sandra, O., and Klingmüller, U. (2003). Identification of nucleocytoplasmic cycling as a remote sensor in cellular signaling by data-based modeling. Proc. Natl. Acad. Sci., 100(3), 1028–1033.
    OpenUrlAbstract/FREE Full Text
  39. ↵
    The MathWorks (2016). Matlab optimization toolbox. Natick, MA, USA.
  40. ↵
    Tönsing, C., Timmer, J., and Kreutz, C. (2014). Cause and cure of sloppiness in ordinary differential equation models. Front Cell Dev. Biol., 90(023303).
  41. ↵
    Villaverde, A. F., Henriques, D., Smallbone, K., Bongard, S., Schmid, J., Cicin-Sain, D., Crombach, A., Saez-Rodriguez, J., Mauch, K., Balsa-Canto, E., Mendes, P., Jaeger, J., and Banga, J. R. (2015). BioPreDyn-bench: a suite of benchmark problems for dynamic modelling in systems biology. BMC Syst. Biol., 9, 8.
    OpenUrl
  42. ↵
    Villaverde, A. F., Frohlich, F., Weindl, D., Hasenauer, J., and Banga, J. R. (2018). Benchmarking optimization methods for parameter estimation in large kinetic models. Bioinformatics, bty736.
  43. ↵
    Vyshemirsky, V. and Girolami, M. (2008). BioBayes: a software package for Bayesian inference in systems biology. Bioinformatics, 24(17), 1933–1934.
    OpenUrlCrossRefPubMedWeb of Science
  44. ↵
    Waltemath, D., Adams, R., Bergmann, F. T., Hucka, M., Miller, F. K. A. K., Moraru, I. I., Nickerson, D., Snoep, J. L., and LēNovére, N. (2011). Reproducible computational biology experiments with SED-ML – The Simulation Experiment Description Markup Language. BMC Syst. Biol., 5(1), 198.
    OpenUrlCrossRefPubMed
  45. ↵
    1. S. Bittanti,
    2. A. Cenedese, and
    3. S. Zampieri
    Weber, P., Hasenauer, J., Allgower, F., and Radde, N. (2011). Parameter estimation and identifiability of biological networks using relative data. In S. Bittanti, A. Cenedese, and S. Zampieri, editors, Proc. of the 18th IFAC World Congress, volume 18, pages 11648–11653, Milano, Italy.
    OpenUrl
  46. Weber, P., Hornjik, M., Olayioye, M. A., Hausser, A., and Radde, N. E. (2015). A computational model of pkd and cert interactions at the trans-Golgi network of mammalian cells. BMC Syst. Biol., 9(1), 9.
    OpenUrlCrossRefPubMed
  47. ↵
    Wolpert, D. H. and Macready, W. G. (1997). No free lunch theorems for optimization. IEEE T. Evol. Comput., 1(1), 67–82.
    OpenUrl
  48. Zheng, Y., Sweet, S. M. M., Popovic, R., Martinez-Garcia, E., Tipton, J. D., Thomas, P. M., Licht, J. D., and Kelleher, N. L. (2012). Total kinetic analysis reveals how combinatorial methylation patterns are established on lysines 27 and 36 of histone H3. Proc. Natl. Acad. Sci. U S A, 109(34), 13549–13554.
    OpenUrlAbstract/FREE Full Text
Back to top
PreviousNext
Posted August 30, 2018.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Benchmark Problems for Dynamic Modeling of Intracellular Processes
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Benchmark Problems for Dynamic Modeling of Intracellular Processes
Helge Hass, Carolin Loos, Elba Raimundez Alvarez, Jens Timmer, Jan Hasenauer, Clemens Kreutz
bioRxiv 404590; doi: https://doi.org/10.1101/404590
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Benchmark Problems for Dynamic Modeling of Intracellular Processes
Helge Hass, Carolin Loos, Elba Raimundez Alvarez, Jens Timmer, Jan Hasenauer, Clemens Kreutz
bioRxiv 404590; doi: https://doi.org/10.1101/404590

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Systems Biology
Subject Areas
All Articles
  • Animal Behavior and Cognition (4119)
  • Biochemistry (8828)
  • Bioengineering (6532)
  • Bioinformatics (23485)
  • Biophysics (11805)
  • Cancer Biology (9223)
  • Cell Biology (13336)
  • Clinical Trials (138)
  • Developmental Biology (7444)
  • Ecology (11425)
  • Epidemiology (2066)
  • Evolutionary Biology (15173)
  • Genetics (10453)
  • Genomics (14056)
  • Immunology (9188)
  • Microbiology (22199)
  • Molecular Biology (8823)
  • Neuroscience (47626)
  • Paleontology (351)
  • Pathology (1431)
  • Pharmacology and Toxicology (2493)
  • Physiology (3736)
  • Plant Biology (8090)
  • Scientific Communication and Education (1438)
  • Synthetic Biology (2225)
  • Systems Biology (6042)
  • Zoology (1254)