Abstract
Pairwise models are commonly used to describe many-species communities. In these models, a focal species receives additive fitness effects from pairwise interactions with other species in the community (“pairwise additivity assumption”), and all pairwise interactions are represented by a single canonical equation form (“universality assumption”). Here, we analyze the validity of pairwise modeling. We build mechanistic reference models for chemical-mediated interactions in microbial communities, and attempt to derive corresponding pairwise models. Even when one species affects another via a single chemical mediator, different forms of pairwise models are appropriate for consumable versus reusable mediators, with the wrong model producing qualitatively wrong predictions. For multi-mediator interactions, a canonical model becomes even less tenable. These results, combined with potential violation of the pairwise additivity assumption in communities of more than two species, suggest that although pairwise modeling can be useful, we should examine its validity before employing it.
Introduction
Multispecies microbial communities are ubiquitous. Microbial communities are important for industrial applications such as cheese and wine fermentation (van Hijum, Vaughan, and Vogel 2013) and municipal waste treatment (Seghezzo et al. 1998). Microbial communities are also important for human health: they can modulate immune responses and food digestion (Round and Mazmanian 2009; Kau et al. 2011) or cause diseases (Kelly 1980).
Community-level properties (e.g. species composition and biochemical activities) cannot be achieved, or achieved to the same extent, by summing the contributions of individual member species. Community-level properties are influenced by interactions wherein individuals alter the physiology of other individuals. To understand and predict properties of communities, choosing the appropriate mathematical model to describe species interactions is critical.
Two commonly-used modeling approaches are mechanistic modeling and pairwise modeling, each with its pros and cons. In mechanistic modeling, interaction mechanisms are explicitly modeled (Fig 1A and B, left panels). Thus, a mechanistic model requires discovering and quantifying interaction mechanisms (Fig 1 Table, “parameter” rows under “Mech.” column).
Such a mechanistic model can in principle quantitatively predict community dynamics when species evolution is negligible. However, the complexity of microbial interactions and the difficulty in identifying and quantifying interactions have made it challenging to construct mechanistic models.
In contrast to mechanistic modeling, pairwise modeling considers only the fitness effects of pairwise species interactions (Figs 1A and B, right panels). Pairwise models have two central assumptions. First, the “universality” assumption: Regardless of interaction mechanisms, how one species affects another can be abstracted into a single canonical equation form so that only parameters can vary among interactions. Second, the “pairwise additivity” assumption: a focal species receives additive fitness effects from pairwise interactions with other species in the community. Even though pairwise models do not capture the dynamics of chemical mediators, predicting species dynamics is still highly desirable in, for example, forecasting species diversity and compositional stability.
Pairwise models are easy to construct because they do not require knowledge of interaction mechanisms and need fewer parameters than mechanistic models (Fig 1 table). Parameters are relatively easy to estimate using community dynamics (Stein et al. 2013), or more systematically, using dynamics of monocultures and pairwise cocultures (Fig 2).
Not surprisingly, pairwise modeling has been commonly applied to communities (Wootton and Emmerson 2005). Pairwise models are often justified by their success in predicting ecological dynamics of two-species communities of prey-predation (Fig 1-FS1) (Volterra 1926; Wangersky 1978; “BiologyEOC - PopulationChanges” 2016) and competition (Gause 1934a; Gause 1934b). Pairwise modeling has been extended to model communities of more than two species (defined as “multispecies communities”),with empirical support from, for example, an artificial community of four competing protozoa species (Vandermeer 1969). Multispecies pairwise models have been extensively used to predict how perturbations to steady-state species composition exacerbate or decline over time (May 1972; Cohen and Newman 1984; Pimm 1982; Thébault and Fontaine 2010; Mougi and Kondoh 2012; Allesina and Tang 2012; Suweis et al. 2013; Coyte, Schluter, and Foster 2015).
However, pairwise modeling has known limitations. For instance, in a multispecies community, an interaction between two species can be altered by a third species (Levine 1976; Tilman 1987; Wootton 2002; Werner and Peacor 2003; Stanton 2003). Indirect interactions via a third species fall under two categories (Wootton 1993), which can be illustrated using the example of carnivore, herbivore, and plant. In an “interaction chain” (also known as “density-mediated indirect interactions”), a carnivore affects the density of an herbivore which in turn affects the density of plants. In “interaction modification” (also known as “trait-mediated indirect interactions” or “higher order interactions” (Vandermeer 1969; Wootton 1994; Billick and Case 1994; Wootton 2002)), a carnivore affects how often an herbivore forages plants. Interaction modification (but not interaction chain) violates the pairwise additivity assumption (Methods-Section 1). Interaction modification is thought to be common in ecological communities (Werner and Peacor 2003; Schmitz, Krivan, and Ovadia 2004). Limitation of pairwise modeling has also been studied experimentally (Dormann and Roxburgh 2005). However, empirically-observed failure of multispecies pairwise models could be due to limitations in data collection and analysis (Case and Bender 1981; Billick and Case 1994).
Given the benefits, limitations, and intellectual influence of pairwise modeling, we examine conditions under which pairwise models produce realistic predictions. Instead of investigating natural communities where interaction mechanisms can be difficult to identify, we start with in silico communities where species engage in predefined chemical interactions of the types commonly encountered in microbial communities. Based on these interactions, we construct mechanistic models, and attempt to derive from them pairwise models. A mechanistic reference model offers several advantages: community dynamics is deterministically known; deriving a pairwise model is not limited by inaccuracy of experimental tools; and the flexibility in creating different reference models allows us to explore a variety of conditions. This has allowed us to examine the domain of validity for pairwise modeling.
Results
Establishing a mechanistic reference model
In our mechanistic models (Fig 1A, left), we focus on chemical interactions which are widespread in microbial communities (Fig1-FS2) (Stams 1994; Czárán, Hoekstra, and Pagie 2002; Duan et al. 2009). A mechanistic model includes a set of species as well as chemicals that mediate interactions among species. Species Si could release or consume chemical Cj, and chemical Cj could increase or decrease the growth rate of species Sk.
We assume that fitness effects from different chemical mediators on a focal species are additive. Not making this assumption will likely violate the additivity assumption essential to pairwise modeling. Additive fitness effects have been observed for certain “homologous” metabolites. For example, in multi-substrate carbon-limited chemostats of E. coli, the fitness effects from glucose and galactose were additive (Lendenmann and Egli 1998). “Heterologous” metabolites (e.g. carbon and nitrogen sources) likely affect cell fitness in a multiplicative fashion. However, if released mediators cause small changes to the concentrations already in the environment, then additivity approximation may still be acceptable. For example, suppose that the fitness influences of released carbon and nitrogen with respect to those already in the environment are wc and wn, respectively. If wc, wn<<1, the additional relative fitness influence will be (1+wc)(1+wn)-1 ≈wc+wn. However, even among homologous metabolites, fitness effects may not be additive (Hermsen et al. 2015). “Sequential” metabolites (e.g. diauxic shift) provide such examples.
We also assume that resources not involved in interactions are never limiting. We thus simulate continuous community growth similar to that in a turbidostat, diluting the total population to a low density once it has reached a high-density threshold. Within a dilution cycle, a mechanistic model can be represented by a set of first-order differential equations, as
Si and Ci are state variables representing the concentrations of species Si and chemical Ci, respectively. rio is the basal fitness of an individual of species Si (the net growth rate of a single individual in the absence of any intra-species or inter-species interactions). rSiCj reflects the maximal influence of chemical Cj on the growth rate of Si, while KSiCi is the concentration of Cj achieving half maximal influence on the growth rate of Si. βCiSi. and αCiSj are respectively the release rate and the maximum consumption rate of Ci by species Sj. KCiSj is the Ci at which half maximal rate of consumption by Sj is achieved. All parameter definitions are summarized in Fig 1 table.
Deriving a pairwise model
Ideally we would want a canonical pairwise model to represent the fitness effect of one species on another regardless of interaction mechanisms. Specifically, an N-species pairwise model is:
Here, ri0 is the basal fitness of an individual of the focal species Si fij(Sj) describes how Sj, the density of species Sj, positively or negatively affects the fitness of Si. When j=i, fii(Si) represents intra-population density-dependent fitness effect on Si (e.g. inhibition or stimulation of growth at high cell densities). The pairwise additivity assumption means that fij(Sj) is a linear or nonlinear function of only Sj and not of a third species.
fij(Sj) can have several variations (Wangersky 1978): basic Lotka-Volterra, where the fitness influence of Sj on Si linearly increases with the abundance of Sj (Solé and Bascompte 2006); logistic Lotka-Volterra, which considers resource limitation by specifying a carrying capacity for each species (Thébault and Fontaine 2010; Mougi and Kondoh 2012); Lotka-Volterra with delayed influence, where the fitness influence of one species on another may lag in time (Gopalsamy 1992), and saturable Lotka-Volterra, where the fitness effect of Sj on Si saturates at high density of Sj (Thébault and Fontaine 2010). Since we model continuous growth which does not impose carrying capacity and since chemical influence from one species to another is likely saturable, we have adopted the saturable Lotka-Volterra as our canonical pairwise model:
Here, rij is the maximal positive or negative fitness effect of Sj on Si, and Kij is the Sj exerting half maximal fitness influence on Si (parameter definition in Fig 1 table). When j=i, nonzero rii and Kii reflect density-dependent growth effect in Si.
From a mechanistic model, we derive a pairwise model either analytically or numerically (Fig 2A). In the latter case (Fig 2B-C, Methods-Section 2), we should already have a pre-specified pairwise model (e.g. the canonical pairwise model) in mind. We then use the dynamics of monocultures and pairwise cocultures obtained from the mechanistic model to find parameters that minimize the difference between the two models within a training time window T. Specifically, we define a distance measure as the fold-difference between the dynamics from the two models, averaged over any time interval τ and species number N(Fig 2C):
Here Si,pair and Si,mech are Si calculated using pairwise and mechanistic models, respectively. Since species with densities below a set extinction limit, Sext, are assumed to have gone extinct in the model, we set all densities below the extinction limit to Sext in calculating to avoid singularities. Within the training window T, minimizing using a nonlinear least square routine yields parameters of the best-matching pairwise model. We then use outside the training window to quantify how well the best-matching pairwise model predicts the mechanistic model.
Reusable and consumable mediators are best represented by different forms of pairwise models
To build a pairwise model, we must accurately represent the fitness influence of one species on another (rij and Kij). Even though this basic process seems straightforward as outlined in Fig 2, in practice, challenges may arise. For example, identifying the set of best-matching parameters for nonlinear functions may not be straightforward, and measurement errors further hamper parameter estimation. Partly due to these challenges, studies on deriving pairwise model parameters for a given community are scarce (Pascual and Kareiva 1996; Stein et al. 2013), despite the popularity of pairwise models. In this section, we analytically derive pairwise models from mechanistic models of two-species communities where one species affects the other species through a single mediator. The mediator is either reusable such as signaling molecules in quorum sensing (Duan et al. 2009; N.S. Jakubovics 2010) or consumable such as metabolites (Stams 1994; Freilich et al. 2011). To facilitate mathematical analysis, we consider community dynamics within a dilution cycle. We show that a single canonical pairwise model may not encompass these different interaction mechanisms.
Consider a commensal community where species S1 stimulates the growth of species S2 by producing a reusable (Fig 3A) or a consumable chemical C1 (Fig 3B). When C1 is reusable, the mechanistic model can be transformed into a pairwise model (Fig 3A), provided that the concentration of the mediator (which is initially zero) has acclimated to be proportional to the producer population size (Fig 3A; Fig 3-FS1). This pairwise model takes the canonical form (compare with Fig 1B right). Thus, the canonical pairwise model is appropriate, regardless of whether the producer coexists with the consumer, outcompetes the consumer, or is outcompeted by the consumer.
If C1 is consumable, different scenarios are possible within a dilution cycle (Methods-Section 3).
Case I: When supplier S1 always grows faster than consumer S2 (r10 > r20 + rS2C1), C1 will accumulate within each dilution cycle (proportional to S1) without plateauing to a steady state (Fig 3-FS2 left panel, similar to a reusable mediator in Fig 3-FS1A). In this case, C1 may be approximated as a reusable mediator and can be predicted by the canonical pairwise model (Fig 3-FS2 right panel, compare dotted and solid lines).
Case II: When rS2C1 > r10 − r20 > 0, a steady state solution for C1 and RS = S2 / S1 exists (Eq.S3-4). We can rewrite
The above equations can be used to visualize the dynamics trajectory of the community in a phase plane (C1, RS) (Fig3-FS4, A-D). Qualitatively, starting from (0, RS(t=0)), the trajectory moves toward the curve (the “f-zero-isocline”, blue line in Fig 3-FS4, A-D) with time scale tf (Fig3-FS8B third column for Case II). Afterwards the trajectory moves closely along the f-zero-isocline until it reaches the steady state. During this time, we can assume f ≈ 0 to eliminate C1 and derive a pairwise model. The resultant equation below differs from the canonical pairwise model and takes the form where ω and ψ are constants (Fig 3B-ii). As expected, parameter estimation for the alternative pairwise model is more accurate after the initial period of time (Fig 3-FS6 bottom panels).
Case III: When supplier S1 always grows slower than consumer S2 (r10 < r20), Eqs. S3-2 and S3-5 above are still valid. Thus, consumable C1 declines to zero concentration as it is consumed by S2 whose relative abundance over S1 eventually exponentially increases at a rate of r20 − r10. Similar to Case II, after time scale tf when community dynamics reaches the f-zero-isocline (Fig 3-FS4 E-H; Fig3-FS8B third column for Case III), the alternative pairwise model (Eq. 6) can be derived.
The alternative pairwise model always converges to the mechanistic model if ω = 1 − KS2 / KC1S2 ≥ 0 or KC1S2 ≥ KS2C1 (Methods-Section 3, “Conditions for the alternative pairwise model to approximate the mechanistic model”; Fig 3-FS5A and C). Indeed, even if the initial species composition is not at steady state, the alternative model approaches the mechanistic model over time (compare dashed and solid lines in Fig 3-FS3 where ω = 0). When ω < 0, RS(t=0)> −ω/ψ is required for the alternative pairwise model to work (Fig3-FS8A). Otherwise, we can get qualitatively wrong results (Fig 3-FS5B and D).
If any of the requirements on initial conditions or time scale (Methods-Section 3; Fig3-FigS8) is violated, the alternative pairwise model may fail to represent the mechanistic model. For example, if dilution cycles are such that the community can never approach the f-zero-isocline, then C1 could accumulate proportionally to S1 even if rS2C1 > r10 − r20 > 0. In this case, the canonical but not the alternative pairwise model is appropriate (Fig 3-FS7). Similarly, a gentler dilution scheme, which allows the community to remain near the f-zero-isocline, leads to better parameter estimation for a pairwise model (Fig 3-FS6).
The alternative model can be further simplified to if additionally, the half-saturation constant K for C1 consumption (KC1S2) is identical to that for C1’s influence on the growth of consumer (KS2C1) (the “K assumption”), and if S2 has not gone extinct. This equation form has precedence in the literature (e.g. (Mougi and Kondoh 2012)), where the interaction strength r21 reflects the fact that the consumable mediator from S1 is divided among consumer S2.
The canonical and the alternative models are not interchangeable. The alternative pairwise model is not predictive of community dynamics where C1 accumulates without reaching a steady state (Fig 3-FS2, compare dashed and solid lines). Similarly, an interaction mediated by a consumable mediator that eventually reaches steady state can often be described by the alternative pairwise model (Fig 3-FS3, dashed lines). However, if we were to use a canonical model, model accuracy becomes uncertain: Parameters estimated at the steady state can predict community dynamics at steady state (Fig 3-FS3A), but not community dynamics when initial species ratios differ from the steady state ratio (Fig 3-FS3B and C, compare dotted with solid lines).
We have shown here that even when one species affects another species via a single mediator, either a canonical or an alternative form of pairwise model may be more appropriate in different situations. Choosing the appropriate model depends on whether the mediator is reusable or consumable and if consumable, how the fitness of two species compare. Additionally, a pairwise model may approximate the mechanistic model properly only under appropriate initial conditions and after the estimated time scale (Fig 3-FS5, B and D; Fig3-FS8). Considering that reusable and consumable mediators are both common, our results call for revisiting the universality assumption of pairwise modeling.
Multi-mediator interactions require pairwise models different from single-mediator interactions
A species often affects another species via multiple mediators (Kato et al. 2008; Traxler et al. 2013; Kim, Lee, and Ryu 2013). For example, a subpopulation from one species might die and release numerous chemicals that can affect another species in different ways. Here we examine cases where S1 releases two chemicals C1 and C2 which additively affect the growth of S2 (Fig 4). We ask when two mediators can mathematically be regarded as one mediator (to facilitate further abstraction into a pairwise model), and how multi-mediator interactions affect pairwise modeling.
When both mediators are reusable (Methods-Section 4), their combined effect generally cannot be modeled as a single mediator except under special conditions (Fig 4). These special conditions (Methods-Section 4) include: (1) mediators share similar “potency” (Fig 4C, diagonal), or (2) one mediator has much stronger “potency” than the other (i.e. one mediator dominates the interaction; note the log scale in Fig 4C).
When both mediators are consumable as in Cases II and III, the interaction term becomes . Except under special conditions (e.g. when both mediators satisfy the K assumption (Eq. 7), or when ωC1/ωC2 = ΨC1/ΨC2, or when one mediator dominates the interaction), the two mediators may not be regarded as one. Similarly, when one mediator is a steady-state consumable and the other is reusable, they generally may not be regarded as a single mediator and would require yet a different pairwise model with more degrees of freedom (with the interaction term ). All these forms deviate from the canonical form.
In summary, when S1 influences S2 through multiple mediators, rarely can we approximate them as a single mediator. Multiple mediators generally make equations of pairwise modeling more complex than single mediators, casting further doubt on the usefulness of a universal form for all community interactions.
A multispecies pairwise model can work for interaction chains but generally not for interaction modifications
For a community with more than two species, can we construct a multispecies pairwise model from two-species pairwise models? The answer is yes for an interaction chain mediated by chemicals (Fig 5A), so long as mediators between different species pairs are independent and each species pair can be represented by a pairwise model. The equation form of the multispecies pairwise model can vary (Methods-Section 5), as discussed in previous sections.
Consistent with previous work (Methods-Section 1), interaction modification can cause a multispecies pairwise model to fail. For example, S1 releases C1 which stimulates S2 growth; C1 is consumed by S3 and stimulates S3 growth (Fig 5C). Here, the presence of S3 changes the strength of interaction between S1 and S2, an example of interaction modification. Viewing this differently, S1 changes the nature of interactions between S2 and S3: S2 and S3 do not interact in the absence of S1, but S3 inhibits S2 in the presence of S1. This causes the three-species pairwise model to make qualitatively wrong conclusions about species persistence even though each species pair can be described by a pairwise model (Fig 5D). As expected, if S3 does not remove C1, the three-species pairwise model works (Fig 5-FS1, A-B).
Interaction modification can occur even in communities where no species changes “the nature of interactions” between any other two species (Fig 5E). Here, both S1 and S3 contribute reusable C1 to stimulate S2. S1 promotes S2 regardless of S3; S3 promotes S2 regardless of S1; S1 and S3 do not interact regardless of S2. However, a multispecies pairwise model assumes that the fitness effects from the two producers on S2 will be additive, whereas in reality, the fitness effect on S2 saturates at high C1. As a result, even though the dynamics of each species pair can be represented by a pairwise model (Fig 5F right, purple), the three-species pairwise model fails to capture community dynamics (Fig 5F). Thus, the nonlinearity in how a mediator affects a species can also violate the additivity assumption of a pairwise model. As expected, if C1 affects S2 in a linear fashion, the community dynamics is accurately captured in the multispecies pairwise model (Fig 5-FS1, C-D).
In summary, for chemical-mediated indirect interactions, a multispecies pairwise model can work for the interaction chain category but generally not the interaction modification category.
Discussion
Multispecies pairwise models are widely used in theoretical research due to their simplicity. In two-species interactions such as prey-predation based on contact-dependent inhibition (instead of diffusible chemical mediators), Lotka-Volterra pairwise models can in fact be the mechanistic representation of interactions and thus predictive of community dynamics (Fig 1-FS1). The inadequacy of multispecies pairwise models has been discussed theoretically (Wootton 2002; Wootton and Emmerson 2005) and empirically (Case and Bender 1981; Dormann and Roxburgh 2005; Aschehoug and Callaway 2015), although the reasons for model failures in explaining particular experimental results are often unclear (Billick and Case 1994).
Here, we have considered the validity of pairwise models in well-mixed two- and three-species communities where all species interactions are known and thus community dynamics can be described by a mechanistic reference model. We have focused on chemical-mediated interactions commonly encountered in microbial communities (Fig 1-FS2) (Kato et al. 2005; Gause 1934a; Ghuysen 1991; Nicholas S Jakubovics et al. 2008; Chen et al. 2004; D’Onofrio et al. 2010; Johnson et al. 1982; Hamilton and Ng 1983). To favor the odds of successful pairwise modeling, we have also assumed that different chemical mediators exert additive fitness effects on a target species.
What are the conditions under which the influence of one species on another can be represented by a canonical two-species pairwise model (the universality assumption)? When an interaction employs a single mediator, then a canonical saturable pairwise model (Fig 1B) will work for a reusable mediator (Fig 3A). For a consumable mediator, depending on the fitness relationship between the two species, either a canonical (Fig 3A) or an alternative (Fig 3B) pairwise model is appropriate (Methods-Section 3, Fig3-FS8). Canonical and alternative models are not interchangeable (Fig 3-FS2; Fig 3-FS3; Fig 3-FS7). For the alternative pairwise model, depending on the relative strength of KC1S1 and KS2C1, additional requirement on initial S2/S1 may need to be satisfied for the pairwise model to converge to mechanistic model (Fig3-FS8A). Otherwise, qualitatively wrong predictions can ensue (Fig 3-FS5, B and D). In all cases, acclimation time is required for a pairwise model to converge to the mechanistic model (Fig3-FS8B; Fig3A). If one species influences another through multiple mediators, then in general, these mediators may not be regarded as a single mediator (Fig 4 and Methods-Section 4). Thus, conditions for a working canonical pairwise model become even more restrictive.
In communities of more than two species, indirect interactions via a third species can occur. For indirect interactions in the form of interaction chains, as long as each two-species segment of the chain engages in independent interactions and can be represented by a pairwise model, then multispecies pairwise models will generally work (Figs 5A-B, Methods-Section 5). However, depending on whether each mediator is reusable or not, equation forms will vary. For indirect interactions in the form of interaction modification (higher-order interactions), even if each species pair can be accurately represented by a pairwise model, a multispecies pairwise model may fail (Fig 5, C-F). Interaction modification includes trait modification (Wootton 2002; Werner and Peacor 2003; Schmitz, Krivan, and Ovadia 2004), or, in our cases, mediator modification. Mediator modification is very common in microbial communities. For example, antibiotic released by one species to inhibit another species may be inactivated by a third species, and this type of indirect interactions can stabilize microbial communities (Kelsic et al. 2015; Bairey, Kelsic, and Kishony 2016). Moreover, interaction mediators are often shared among multiple species. For example in oral biofilms, organic acids such as lactic acid are generated from carbohydrate fermentation by many species (Bradshaw et al. 1994; Marsh and Bradshaw 1997; Kuramitsu et al. 2007). Such by-products are also consumed by multiple species (Kolenbrander 2000).
Pairwise modeling (or variations of it) still has its uses when simulating a particular community phenomenologically. One can even imagine that an extended pairwise model (e.g. ) embodying both the canonical form and the alternative form can serve as a general-purpose model for pairwise interactions via a single mediator. Even the effects of indirect interactions may be quantified and included in the model by incorporating higher-order interaction terms (Case and Bender 1981; Worthen and Moore 1991), although many challenges will need to be overcome (Wootton 2002). In the end, although these strategies may lead to a sufficiently accurate phenomenological model for specific cases, “one-form-fitting-all” may generate erroneous predictions when modeling different communities.
How much information about interaction mechanisms do we need to construct a mechanistic model? That is, what is the proper level of abstraction which captures the phenomena of interest, yet avoids unnecessary details (Levins 1966; Durrett and Levin 1994; Damore and Gore 2012)? Tilman argued that if a small number of mechanisms (e.g. the “axes of trade-offs” in species’ physiological, morphological, or behavioral traits) could explain much of the observed pattern (e.g. species coexistence), then this abstraction would be highly revealing (Tilman 1987). However, the choice of abstraction is often not obvious. Consider for example a commensal community where S1 grows exponentially (not explicitly depicted in equations in Fig 6) and the growth rate of S2, which is normally zero, is promoted by mediator C from S1 in a linear fashion (Fig 6). If we do not know how S1 stimulates S2, we can still construct a pairwise model (Fig 6A). If we know the identity of mediator C and realize that C is consumable, then we can instead construct a mechanistic model incorporating C (Fig 6B). However, if C is produced from a precursor via an enzyme E released by S1, then we get a different form of mechanistic model (Fig 6C). If, on the other hand, E is anchored on the membrane of S1 and each cell expresses a similar amount of E, then equations in Fig 6D are mathematically equivalent to Fig 6B. This simple example, inspired by extracellular breakdown of cellulose into a consumable sugar C (Bayer and Lamed 1986; Felix and Ljungdahl 1993; Schwarz 2001)), illustrates how knowledge of mechanisms may eventually help us determine the right level of abstraction.
In summary, under certain circumstances, we may already know that interaction mechanisms fall within the domain of validity for a pairwise model. In these cases, pairwise modeling provides the appropriate level of abstraction, and constructing a pairwise model can be far easier than measuring the many parameters required by a mechanistic model. However, if we do not know whether pairwise modeling is valid, we will need to be cautious about indiscriminative use of pairwise models since they can fail to even qualitatively capture community dynamics (e.g. in Fig 3-FS2, Fig 3-FS3, Fig 3-FS5, and Fig 5C-F). We will need to be equally careful in extrapolating and generalizing conclusions obtained from pairwise models. Considering recent advances in identifying and quantifying interactions, we advocate a transition to models that incorporate interaction mechanisms at the appropriate level of abstraction.
Methods
Section 1. Interaction modification but not interaction chain violates the pairwise additivity assumption
In a pairwise model, the fitness of a focal species Si is the sum of its “basal fitness” (ri0, the net growth rate of a single individual in the absence of any intra-species or inter-species interactions) and the additive fitness effects exerted by pairwise interactions with other members of the community. Mathematically, an N-species pairwise model is often formulated as
Here, fij(Sj) describes how Sj, the density of species Sj, positively or negatively affects the fitness of Si, and is a linear or nonlinear function of only Sj and not of a third species.
Indirect interactions via a third species fall under two categories (Wootton 1993). The first type is known as “interaction chain” or “density-mediated indirect interactions”. For example, the consumption of plant S1 by herbivore S2 is reduced when the density of herbivore is reduced by carnivore S3. In this case, the three-species pairwise model does not violate the pairwise additivity assumption (compare with Eq. S1-1) (Case and Bender 1981; Wootton 1994).
The second type of indirect interactions is known as “interaction modification” or “trait-mediated indirect interactions” or “higher order interactions” (Vandermeer 1969; Wootton 1994; Billick and Case 1994; Wootton 2002), where a third species modifies the “nature of interaction” from one species to another (Wootton 2002; Werner and Peacor 2003; Schmitz, Krivan, and Ovadia 2004). For example, when carnivore is present, herbivore will spend less time foraging and consequently plant density increases. In this case, f12 in Eq S1-2 is a function of both S2 and S3, violating the pairwise additivity assumption.
Section 2. Summary of simulation files
Simulations are based on Matlab® and executed on an ordinary PC. Steps are:
Step 1: Identify monoculture parameters r0, rii, and Kii (Fig 2C, Row 1 and Row 2).
Step 2: Identify interaction parameters rij, rji, Kij, and Kji where i≠j (Fig 2C, Row 3).
Step 3: Calculate distance between population dynamics of the reference mechanistic model and the approximate pairwise model over several generations outside of the training window to assess if the pairwise model is predictive.
Fitting is performed using nonlinear least square (lsqnonlin routine) with default optimization parameters. The following list describes the m-files used for different steps of the analysis:
Section 3. Deriving a pairwise model for interactions mediated by a single consumable mediator
To facilitate mathematical analysis, we assume that requirements calculated below are eventually satisfied within each dilution cycle (see Fig3-FS7 for an example where dilution cycles violate requirements for deriving a pairwise model). We further assume r10 > 0 and r20 > 0 so that species cannot go extinct in the absence of dilution. See Fig3-FS8 for a summary of this section.
When S1 releases a consumable mediator which stimulates the growth of S2, the mechanistic model as per Fig 3B, is
Let S1(t = 0) = S10; S2(t = 0) = S20; C1(t = 0) = C10 = 0. Note that the initial condition C10 = 0 can be easily imposed experimentally by prewashing cells. Under which conditions can we eliminate C1 so that we can obtain a pairwise model of S1 and S2?
Define RS = S2/S1 as the ratio of the two populations.
Case I: r10 – r20 > rS2C1
Since producer S1 always grows faster than consumer S2, RS → 0 as t → ∞. Define (“~” indicating scaling against a function).
Since RS declines exponentially with a rate faster than . In the right hand side of Eq. S3-3, we can ignore the third term if it is much smaller than the first term. That is,
Thus for . When initial , this equation can be solved to yield: . After time of the order of 1/r10, the second term can be neglected. Thus, after time of the order of . Then C1 can be replaced by (βC1S1/r10)S1, and a canonical pairwise model can be derived.
Case II: rS2C1 > r10 − r20 > 0
For Eq. S3-1, we find that a steady state solution for C1 and RS, denoted respectively as C1 * and RS*, exist. They can be easily found by setting the growth rates of S1 and S2 to be equal, and dC1/dt to zero.
However, if C1 has not yet reached steady state, imposing steady state assumption would falsely predict Rs at steady state and thus remaining at its initial value (Fig 3-FS3, dotted lines). Since dC1/dt in Eq. S3-1 is the difference between two exponentially growing terms, we factor out the exponential term S1 to obtain where . When f ≈ 0, we can eliminate C1 and obtain an alternative pairwise model
Or where ω and ψ are constants (Fig 3B, ii).
For certain conditions (which will be discussed at the end of this section, Fig3-FS8A), this alternative model can make reasonable predictions of community dynamics even before the community reaches the steady state (Fig 3-FS3, compare dashed and solid lines). Below we discuss the general properties of community dynamics and show that there exists a time scale tf after which it is reasonable to assume f ≈ 0 and the alternative model can be derived. We also estimate tf for several scenarios.
We first make C1 and RS dimensionless by defining and (“^” indicating scaling against steady state values). Eq. S3-2 can then be rewritten as where .
From Eqs. S3-1 and S3-4, we obtain or where.
Using these scaled variables, f can be rewritten as and
Equations S3-7 and S3-10 allow us to construct a phase portrait where the x axis is and the y axis is (Fig 3-FS4A-D). Note that at steady state, . Setting Eq. S3-9 to zero: defines the f-zero-isocline on the phase plane (i.e. values of at which , Fig 3-FS4A-D blue lines). The phase portrait dictates the direction of the community dynamics trajectory as shown in grey arrows in Fig 3-FS4A. Starting from , the trajectory (brown circles and orange lines) moves downward right (Fig3-FS4 A-D) until it hits . Then, it moves upward right and eventually hits the f-zero-isocline. Afterward, the trajectory moves toward the steady state (green circles) very closely along (and not superimposing) the f-zero-isocline during which the alternative pairwise model can be derived (Fig 3-FS4A-D).
It is difficult to solve Eq. S3-7 and S3-8 analytically because the detailed community dynamics depends on the parameters and the initial species composition in a complicated way. However, under certain initial conditions, we can estimate tf, the time scale for the community to approach the f-zero-isocline. Note that tf is not a precise value. Instead it estimates the acclimation time scale after which a pairwise model can be derived.
One assumption used when estimating all tf is that S10 is sufficiently high (Fig3-FS8B) to avoid the long lag phase that is otherwise required for the mediator to accumulate to a high enough concentration.
From Eq. S3-11, the asymptotic value for the tf-zero-isocline is
This is plotted as a black dotted line in Fig 3-FS4, A-D. Below we consider three different initial conditions :
Case II-1.
If we do not scale, . From Eq. S3-4, this becomes .
A typical trajectory of the system is shown in Fig 3-FS4B: at time t = 0, using Eqs. S3-7 and S3-8, the community dynamics trajectory (orange solid line in Fig 3-FS4B inset) has a slope of
From Eqs. S3-11, the slope of the f-zero-isocline (blue line in Fig 3-FS4B inset) at is
The approximation in the last step is due to the very definition of Case II-1: . The initial steepness of the community dynamics trajectory (Eq. S3-13) will be much smaller than that of the f-zero-isocline (Eq. S3-14) if
If we do not scale, together with Eq. S3-4, this becomes:
In this case, the community dynamics trajectory before getting close to the f-zero-isocline can be approximated as a straight line (the orange dotted line) and the change in can be approximated by the green segment in the inset of Fig 3-FS4B. Since the green segment, the orange dotted line and the red dashed line form a right angle triangle, the length of green segment can be calculated once we find the length of the red dashed line , which is the horizontal distance between and the f-zero-isocline and can be calculated from Eq. S3-9:
The approximation holds because under this condition, is close to zero. Thus,
The green segment is then the length of red dashed line (, Eq. S3-16) multiplied with (Eq. S3-13), or
Note that if Eq. S3-15 is satisfied, . What is the time scale tf for the community to traverse the orange dotted line to be close to the f-zero-isocline? Since from Eq. S3-7 due to initial ,
Since here and ln(1 + x) ~ x for small x, we have
If unscaled, using Eq. S3-4, this becomes
Case II-2. is comparable to 1
That is, . If ¾ is low, a typical example is shown in Fig 3-FS4D. Here because it takes a while for C1 to accumulate, during this lagging phase and there is a sharp plunge in before the trajectory levels off and climbs up. Although the trajectory eventually hits the f-zero-isocline where the alternative pairwise model can be derived, estimating tf is more complicated. Here we consider a simpler case where S10 is large enough so that the trajectory levels off immediately after t = 0 and before the trajectory hits the f-zero-isocline (Fig3-FS4 A). Since decreases until and from Eq. S3-13, and similar to the reasoning in CaseII-1, if or if a typical trajectory moves toward the f-zero-isocline almost horizontally (Fig 3-FS4A). The unscaled form of Eq. S3-19 is
To calculate the time it takes for the trajectory to reach the f-zero-isocline, let and at any time point t to respectively represent deviation of away from their steady state values of (1, 1). We can thus linearize Eqs. S3-7 and S3-8 around the steady state. Note that at the steady state f = 0, Δsf = f.
Rewrite Eq. S3-7 as to linearize it around the steady state
At steady state, . Thus, .
Thus,
Recall Eqs. S3-8 and S3-10 as
Linearize around the steady state (note f (1,1)=0):
Thus,
Similar to the above calculation, we expand f around steady state 0,
Utilizing Eq. S3-20, Eq. S3-21, and Eq. S3-22,
Taking the derivative of both sides, and using Eq. S3-21 and Eq. S3-22, we have
The solution to the above equation is: where and are two positive constants. D1 and D2 are two constants that can be determined from the initial conditions of and . M(κ, μ, z) and W(κ, μ, z) are Whittaker functions with argument z. As Z → ∞ (http://dlmf.nist.gov/13.14.E20 and http://dlmf.nist.gov/13.14.E21)
Thus when er10tb/r10» 1,
The second term approaches zero much faster compared to the first term due to the negative exponent with an exponential term. Thus,
Thus, Δsf = f approaches zero at a rate of . Therefore, for the community is sufficiently close to f-zero-isocline.
In unscaled form, this becomes:
Case II-3. or RS (0) ≪ βC1S1/αC1S2
Similar to Case II-2, if Eq. S3-19 is satisfied, a typical trajectory is illustrated in Fig 3-FS4C where the trajectory is almost horizontal and increases to much greater than 1 before the system reaches the f-zero-isocline. tf can then be estimated from how long it takes to increase to . Using Eq. S3-7 and since is very large,
This yields
In the unscaled form, this becomes:
Case III: r20 < r20
In this case, supplier S1 always grows slower than S2. As t → ∞, RS = S2/S1 → ∞ and C1 → 0. The phase portrait is separated into two parts by the f-zero-isocline (Fig 3-FS4E), where, as in Eq. S3-5,
Note that the asymptotic value of RS (black dotted line, Fig 3-FS4, E-H) is
From Eq. S3-2, dRS/dt > 0. From Eq. S3-1, below the f-zero-isocline, dC1/dt > 0 and above the f-zero-isocline, dC1/dt > 0. Thus if the system starts from (0, Rs (0)), the phase portrait dictates that it moves with a positive slope until a time of a scale tf when it hits the f-zero-isocline, after which it moves upward to the left closely along the f-zero-isocline (Fig3-FS4E). After tf, the alternative pairwise model can be derived. Although tf is difficult to estimate in general, it is possible for the following cases.
Case III-1. RS(0)≫βC1S1/αC1S2
Similar to the derivation in Case II-2, if S10 is small, there is a lagging phase during which the trajectory rises steeply before leveling off (Fig 3-FS4H). Although the alternative pairwise model can be derived once the trajectory hits the f-zero-isocline, tf takes a complicated form. Here we consider two cases where S10 is large enough so that we can approximate the trajectory as a straight line going through (0, RS(t=0)) (Fig3-FS4, F, G). Graphically, £10 is large enough so that the green segment in Fig3-FS4F, whose length is ΔRS, is much smaller than RS(0). In other words,
From Eqs. S3-1 and S3-2
And Δ(C1/KC1S2), the red segment in Fig3-FS4F, is the horizontal distance between (0, RS (0)) and the f-zero-isocline and
Thus if or from Eq. S3-2, tf can be calculated as
Case III-2. RS(0)≫βC1S1/αC1S2
If S10 is large enough so that Eq. S3-28 is satisfied, a typical example is displayed in Fig3-FS4G. The trajectory moves with a small positive slope so that the intersection of the community dynamics trajectory with the f-zero-isocline is near the black dotted line βC1S1/αC1S2 (Eq. S3-27) where C1/KC1 is large. ty can thus be estimated from Eq. S3-2: which yields
Conditions for the alternative pairwise model to approximate the mechanistic model
Cases II and III showed that population dynamics of the mechanistic model could be described by the alternative pairwise model. However, since the initial condition for C1 cannot be specified in pairwise model, problems could occur. To illustrate, we examine the phase portrait of the pairwise equation where . From Eqs. S3-6 and S3-1,
Below, we plot Eq. S3-31 under different parameters (Fig 3-FS5) to reveal conditions for convergence between mechanistic and pairwise models.
Case II (rS2C1 >r10 − r20 > 0): steady state RS* exists for mechanistic model.
If ω = 1− KS2C1/KC1S2 ≥ 0(Fig 3-FS5A): When RS <RS*, dRS/dt is positive. When Rs > RS*, dRS/dt is negative. Thus, wherever the initial RS, it will always converge toward the only steady state RS* of the mechanistic model.
If ω < 0 (Fig 3-FS5B): ω + ψRS = 0 or RS = −ω/ψ creates singularity. Pairwise model RS will only converge toward the mechanistic model steady state if
Case III (r10 < r20): RS increases exponentially in mechanistic model (Eq. S3-2). Thus, consumable C1 will decline toward zero as C1 is consumed by S2 whose relative abundance over S1 exponentially increases. Thus, from mechanistic model (Eq. S3-2), RS eventually increases exponentially at a rate of r20 − r10.
If ω ≥ 0 (Fig 3-FS5C): Eq. S3-31 . Thus, Eq. S3-31, which is based on alternative pairwise model, also predicts that RS will eventually increase exponentially at a rate of r20 − r10, similar to the mechanistic model.
If ω < 0 (Fig 3-FS5D): RS (0) > −ω/ψ (Eq. S3-32) is required for unbounded increase in RS(similar to the mechanistic model). Otherwise, RS converges to an erroneous value instead.
Section 4. Conditions under which a pairwise model can represent one species influencing another via two reusable mediators
Here, we examine a simple case where S1 releases reusable C1 and C2, and C1 and C2 additively affect the growth of S2 (see example in Fig 4).
The mechanistic model is:
Now the question is whether the canonical pairwise model can be a good approximation.
For simplicity, let’s define and . Small KCi means large potency (e.g. small KS2C2 which means low C2 required to achieve half maximal effect on S2, and/or large synthesis rate βC2S1). Since S1 from pairwise and mechanistic models are identical, we have can be close to zero when (i) or (ii) and (effects of C1 and C2 on S2) differ dramatically in magnitude. For (ii), without loss of generality, suppose that the effect of C2 on S2 can be neglected. This can be achieved if (iia) rS2C2 is much smaller than rS2C2, or (iib) KC2 is large compared to S1.
Section 5. Multi-species pairwise model for an interaction chain
Without loss of generality, we consider an example where each step of an interaction chain is best represented by a different form of pairwise model. Suppose that S1 releases a reusable mediator C1 that promotes S2 and that S2 releases a consumable mediator C2 that promotes S3. The interaction mediated by C2 can be described by the alterative pairwise model. The corresponding community-pairwise model then will be:
To see this, note that after a transient period of time, the mechanistic model of the three-species community is: