Abstract
In mathematical epidemiology, a well-known formula describes the impact of heterogeneity on the basic reproductive number for situations in which transmission is separable and for which there is one source of variation in susceptibility and one source of variation in infectiousness. This formula is written in terms of the magnitudes of the heterogeneities, as quantified by their coefficients of variation, and the correlation between them. A natural question to ask is whether analogous results apply when there are multiple sources of variation in susceptibility and/or infectiousness. In this paper we demonstrate that under three or more coupled heterogeneities, the basic reproductive number depends on details of the distribution of the heterogeneities in a way that is not seen in the well-known simpler situation. We provide explicit results for the cases of multivariate normal and multivariate log-normal distributions, showing that the basic reproductive number can again be expressed in terms of the magnitudes of the heterogeneities and the pairwise correlations between them. The results, however, differ between the two multivariate distributions, demonstrating that no formula of this type applies generally when there are three or more coupled heterogeneities. We see that the results are approximately equal when heterogeneities are relatively small and show that an earlier result in the literature (Koella, 1991) should be viewed in this light. We provide numerical illustrations of our results.
1. Introduction
The basic reproductive number, R0, plays a crucial role in determining both whether a pathogen is able to spread and the strength of control measures needed to halt its spread. The simplest descriptions of R0 assume simple transmission scenarios, such as perfect mixing of a population and homogeneity of the individuals in the population, e.g. in terms of their susceptibility and infectiousness. The inadequacies of such descriptions have long been realized and much attention has been directed towards understanding the impact of heterogeneities in transmission on the basic reproductive number. Early efforts included accounting for differing activity levels amongst the population and various mixing patterns of the population (e.g. proportionate/random mixing, assortative and disassortative mixing). Much of this work was prompted by the heterogeneities known to exist for the spread of sexually transmitted infections, notably gonorrhea and HIV (Nold (1980); Hethcote and Yorke (1984); Anderson et al. (1986); May and Anderson (1987); Jacquez et al. (1988); Gupta et al. (1989)). In the context of vector-borne diseases, it has long been realized that vectors’ bites are not distributed uniformly across hosts; instead, there is a heterogeneity in hosts’ attractiveness to vectors, with some individuals being disproportionately favored to receive bites (Carnevale et al. (1978); Dye and Hasibeder (1986); De Benedicitis et al. (2003); Liebman et al. (2014)).
A now standard argument shows that the basic reproductive number for a multi-type transmission system can be calculated as the dominant eigenvalue of the next generation matrix (Diekmann and Heesterbeek (2000)). For an n-type setting, the next generation matrix is an n by n non-negative matrix whose (i; j)’th entry gives the average number of secondary infections of type i caused by a type j individual in an otherwise entirely susceptible population. Consequently, much attention has been directed towards those special cases of heterogeneous transmission that lead to next generation matrices whose dominant eigenvalue is analytically tractable and hence for which the basic reproductive number can be calculated explicitly. In the context of spatial heterogeneity, these include symmetric spatial configurations such as equally-sized patches with all-to-all or nearest neighbor contacts (see, for example Lloyd and May (1996)).
More generally, a commonly-studied situation involves separable transmission (Diekmann and Heesterbeek (2000)), where each group has a susceptibility, ai and an infectiousness, bi. In this case, the next generation matrix is of rank one, and, taking groups to be of equal sizes, has entries aibj/n, and dominant eigenvalue

Using the result
for the expectation of a product of random variables, eqn (1) can be rearranged into the following well-known formula (Dietz (1980); Dye and Hasibeder (1986)) that sheds insight into the impact of heterogeneity on R0 in this separable setting:

Here, and
denote the average values of ai and bi, rab denotes the Pearson product-moment correlation coefficient between the ai and bi, CVa and CVb denote the coefficients of variation (i.e. standard deviation divided by the mean) of ai and bi and
denotes the value of R0 that would be predicted if the heterogeneity was ignored, i.e. the average values of ai and bi were used. We emphasize that these results are exact, holding for arbitrary distributions of the ai and bi.
Heterogeneity can inflate or deflate the value of R0, depending on whether there is positive or negative correlation between susceptibility and infectivity across the groups (Dietz, 1980). In the special case where susceptibility and infectivity are proportional, e.g. for a situation such as differing activity levels or mosquito biting preferences where the heterogeneity impacts both susceptibility and infectiousness in the same way, the formula reduces to

This formula has appeared in the literature numerous times in a number of different settings and guises (Dietz, 1980; Dye and Hasibeder, 1986; May and Anderson, 1987).
Particularly with the increasing realization that many systems are subject to multiple, often coupled, heterogeneities (Paull et al., 2012; Vazquez-Prokopec et al., 2016), an important question is whether results such as eqns. (3) and (4) generalize to situations in which there are more than two heterogeneities. In this paper, we show that the answer to this question is no: the effect of multiple interacting heterogeneities on the basic reproductive number depends on the details of the distributions of the heterogeneities, in contrast to what occurs in the two-heterogeneity setting. We provide results for both multivariate normal and multivariate log-normal distributions of heterogeneities and demonstrate that the two settings can give markedly different results.
2. Results
We assume that there are n types of individuals, resulting from N different heterogeneities, N1 of which impact susceptibility and N2 of which impact infectiousness. We further assume that the susceptibility of a type i individual can be written as the product , taken over the heterogeneities that impact susceptibility, and that the infectivity of a type j individual can similarly be written as
. Taking group sizes to be equal and assuming separable transmission, the entries of the next generation matrix will have the form
, where N1 + N2 = N. This matrix is of rank one and has dominant eigenvalue given by

As explained above, the well-known result arises from the ability to express the expectation of the product of a pair of random variables in terms of their two expectations and their covariance. Extension of the result requires corresponding manipulations of expectations of products of three or more random variables—the so-called product moments of the joint distribution.
2.1. Analytic Results
In the case of a set of random variables whose joint distribution is multivari-ate normal, numerous authors have obtained results for product moments (see, for example, Isserlis (1918), Bendat and Piersol (1966), Bar and Dittrich (1971) and Song and Lee (2015)). For instance, in the four dimensional case we have (Bendat and Piersol (1966) and Bar and Dittrich (1971))

The expectations of pairwise products can be rewritten in the way described above to give

We remark that the case of three random variables can be obtained by setting X4 = 1.
For a set of N multivariate lognormally distributed random variables, product moments are given by the formula (Kotz et al. (2000))
where ξ and V are the mean and variance of the corresponding multivariate normal distribution. Some simple manipulation leads to

Given that eqns (7) and (9) differ, and that their reduced forms when X4 = 1 also differ, we have shown that there is no general formula of this type for the basic reproductive number when there are three or more coupled heterogeneities. We do notice, however, that the two formulae give approximately equal results in the limit of small coefficients of variation, i.e. when one can ignore products involving two or more pairs of coefficients of variation.
The majority of papers in the literature that provide analytic results for the basic reproductive number under heterogeneity focus on at most two coupled heterogeneities. One notable exception is the work of Koella (1991), which provides—without proof or qualification for its applicability—the following formula for a vector-borne pathogen subject to heterogeneities in mosquito biting rate, a, human susceptibility, b, and duration of human infection, ρ

Note that the single biological heterogeneity in biting rate impacts both in-fectiousness and susceptibility, resulting in it being treated as two perfectly correlated heterogeneities.
We note that eqution (10) has no terms that involve products of pairs of covariances (or, in the language of the earlier formulae, correlation coefficients). As in the remark above comparing results between multivariate normal and lognormal distributions, this formula should, in general, be seen as an approximation that is likely most accurate when coefficients of variation are small (i.e. the heterogeneities are relatively minor). As a comment that is germane to a numerical example shown below, we remark that the Koella formula does agree with the result for the multivariate normal distribution, eqn (7), if the coefficient of variation describing either human susceptibility or the duration of human infection is equal to zero.
2.2. Numerical Results
We illustrate the above results using numerical simulation, allowing us to explore the differences between predictions made using the formulae for the two distributions and also using the formula in the small coefficient of variation limit. For concreteness, we place these simulations within the vector-host setting described by Koella (1991), but for simplicity we hold one of the factors constant. Specifically, hosts differ in their attractiveness to mosquitoes, impacting their susceptibility and infectiousness (thus treated as two perfectly correlated heterogeneities, X1 and X2, within our framework), and also in their durations of infection, X3. Setting X4 = 1 and taking X2 = X1, we obtain the following two formulae:
for bivariate normally distributed heterogeneities, and
for bivariate lognormally distributed heterogeneities. We notice that in this reduced setting of X4 = 1, the first of these formulae coincides with the small coefficient of variation limit of the two general formulae, and, as discussed above, also agrees with the Koella formula. Furthermore, we see that when the correlation coefficient, rx1,x3, between the two heterogeneities is zero, the two formulae are identical.
2.2.1. Bivariate Normal Distribution
Figure (1) illustrates the performance of eqn (11) when the group attributes X1 and X3 are sampled from a bivariate normal distribution. Parameter values (given in the figure caption) were chosen for illustrative purposes and are not intended to represent a specific real-world infection. For each of a thousand replicates, either ten (panel a) or a thousand (panel b) pairs of values of biting rate and duration of human infection were sampled from a bivariate normal distribution. The actual R0 value for each replicate, calculated from eqn (5), is plotted against the value of R0 predicted for that replicate by the MVN formula, eqn (11). In addition, we show (using a red square) the R0 value that would be predicted if there was no heterogeneity (i.e. X1 and X3 are set equal to their respective average values) and the value predicted by the MVN formula for the underlying MVN distribution (green star).
Comparison of R0 values calculated numerically (using eqn 5) with those predicted by the MVN formula (eqn 11) for heterogeneities distributed according to an MVN distribution. Each blue circle represents the values of R0 obtained for (panel a) a ten group model or (panel b) a thousand group model with pairs of values of biting rate, X1, and average duration of human infection X3 drawn from a bivariate normal distribution with means 1 and 5, respectively, variances Var(X1) = 0:2 and Var(X3) = 4, and correlation rx1;x2 = 0:9. The value of the susceptibility parameter was fixed at 1 for each group. The red square denotes the value of R0 if there was no heterogeneity, i.e. obtained using the average values, , while the green star denotes the value of R0 calculated using the MVN formula using the means, variances and covariances of the underlying bivariate normal distribution. As described in the text, the predictive ability of the MVN formula is measured by the coefficient of determination, R2, and equals: (panel a) 0.991 and (panel b) 0.988. Note the different scales on the axes between the two panels.
To aid comparison, the 45° diagonal line is shown on both plots: deviations from these lines represent deviations from the values predicted by the MVN formula. We quantitate these deviations by calculating the coefficient of determination, R2, in a way that is familiar from regression theory. In both cases, the R2 value is high, and approximately equal to 0.99. The numerically calculated values of R0 fall in a cloud centered on the value predicted by the MVN formula, with the size of the cloud being smaller for the panel resulting from the larger number of groups. The deviations here result from sampling error, with samples drawn from the bivariate normal not being perfectly representative of the entire distribution. This effect is more pointed when there are fewer samples (i.e. fewer groups), with the cloud of points shrinking as the number of groups increases. In fact, since the basic reproductive number, eqn (5), is calculated as the average of a sequence of independent, identically distributed quantities, the central limit theorem can be used to quantify the variation seen about the central predicted value. We remark that a corresponding figure generated in the case of two coupled heterogeneities would exhibit no deviation from the diagonal line as eqn (3) is exact: it does not rely on any distributional assumption of heterogeneities across groups.
2.2.2. Bivariate Lognormal Distribution
Use of a multivariate lognormal distribution allows us to explore settings in which the components of transmission exhibit more severe heterogeneity and to assess the extent to which the impact of such heterogeneities are misrepresented by either the small coefficient of variation formula or by the formula that pertains in the multivariate normal case.
In figure (2), the biting rates and durations of infectiousness are drawn from independent lognormal distributions, with means 1 and 3, and variances 1 and 8, respectively. We compare the performance of the MVN formula (panel a) and MVLN formula (panel b) for a 1000-group setting (i.e. 1000 samples are drawn from the distributions). In this case, because there is zero correlation between the two heterogeneities, the population-level predictions of the MVN and MVLN formulae (i.e. the values obtained using the moments of the underlying distribution) are identical (shown by a yellow diamond). We see that the MVLN formula provides a better description (R2 = 0.748 using the MVN formula, while R2 = 0.852 using the MVLN formula), which is to be expected given that samples were drawn from an MVLN distribution.
Comparison of R0 values calculated numerically (using eqn 5) with those predicted by the formula derived from the MVN distribution (eqn 11) (panel a) and those predicted by the formula derived from the MVLN distribution (eqn 12) (panel b). Each blue circle represents the values of R0 obtained for a thousand group model with values of biting rate, X1, and average duration of human infection, X3, drawn from independent lognormal distributions (means 1 and 3, variances 1 and 8, respectively). The value of the susceptibility parameter was fixed at 1 for each group. The red square denotes the value of R0 obtained using the average values, , while the yellow diamond denotes the value of R0 calculated using the MVLN formula using the means, variances and covariances of the bivariate lognormal distribution. Because the distributions of X1 and X3 are assumed to be independent in this figure, the populationlevel predictions of the MVLN and MVN formulae are identical. The predictive ability of the MVLN formula (panel b, R2 = 0:852) is greater than that of the MVN formula (panel a, R2 = 0:748), as should be expected given that draws were made from an MVLN distribution.
Figure (3) explores a situation in which there is a positive correlation (r = 0.6) between the two components, i.e. the biting rate and duration of infectiousness, of the bivariate lognormal distribution. We see that the MVLN formula performs well (panel b, R2 = 0.852). The positive correlation leads to the MVN formula underestimating R0 compared to the MVLN formula, for example as witnessed by the central estimates of R0 (yellow diamond: MVLN, green star: MVN). The MVN formula consistently provides a large underestimate of the true value of R0 (panel a), and its predictive ability is poor (the negative value of R2, -0.596, indicates that the formula performs worse on these points than a constant predictor).
Comparison of R0 values calculated numerically (using eqn 5) with those predicted by the formula derived from the MVN distribution (eqn 11) (panel a) and those predicted by the formula derived from the MVLN distribution (eqn 12) (panel b). Details are as in Figure (2), except that here there is a positive correlation, r = 0:6, between the two components of the bivariate lognormal distribution. In this case, the population-level predictions of the MVN and MVLN formulae (green star and yellow diamond, respectively) differ. The predictive ability of the MVLN formula (panel b, R2 = 0:828) is greater than that of the MVN formula (panel a, R2 = -0.596). The MVN formula performs worse than a constant predictor and consistently provides a large underestimate of the true value of R0.
Finally, we consider a setting in which the two heterogeneities are negatively correlated, with r = –0.2. Figure (4) shows that in this case, the MVLN formula correctly predicts lower values of R0 than does the MVN formula (e.g. compare the locations of the yellow diamond and the green star, obtained from MVLN and MVN formulae, respectively, using the moments of the underlying distribution). The MVLN formula provides reasonable predictions (R2 = 0.735), whereas the MVN formula (R2 = –4.34) consistently overestimates the value of R0, and often by a considerable amount.
Comparison of R0 values calculated numerically (using eqn 5) with those predicted by the formula derived from the MVN distribution (eqn 11) (panel a) and those predicted by the formula derived from the MVLN distribution (eqn 12) (panel b). Details are as in Figures (2) and (3), except that here there is a negative correlation, r = -0.2, between the two components of the bivariate lognormal distribution. Again, the predictive ability of the MVLN formula (panel b, R2 = 0.721) is greater than that of the MVN formula (panel a, R2 = -4.91), with the latter performing worse than a constant predictor and consistently providing a large overestimate of the true value of R0.
3. Discussion
In this paper we have shown that the well-known result for the impact of two coupled heterogeneities on the basic reproductive number of an epidemiological system under separable transmission does not have a general counterpart when there are three or more coupled heterogeneities. In the more general setting, the formula for the basic reproductive number depends on details of the joint distribution of the heterogeneities in a way that is quite different than in the setting with two heterogeneities. We were able to derive formulae that related the basic reproductive number to the magnitudes of the heterogeneities and their pairwise correlations for the special cases of multivariate normal and mul-tivariate lognormal distributions of heterogeneities. Under particular limiting cases (typically in the limit of low levels of heterogeneity), the two formulae give similar predictions. We showed that an earlier result in the literature (Koella, 1991) should be viewed as an approximate result, although we noted that in appropriate limiting cases, the result agrees with our formula for the multivariate normal distribution.
Given the reliance of the R0 formulae obtained here on the joint distribution of the heterogeneities, our results are only exact as the number of groups in the multi-type model approaches infinity. For a finite number of groups, where the levels of the heterogeneities across groups are sampled from the underlying distribution, the predictions made by these formulae are not perfect. This again is in contrast to the two heterogeneity setting, in which the well-known result is exact.
Although theoretical attention has typically focused on the two heterogeneity case, and this has provided much insight, heterogeneous transmission in the real world typically involves more than two factors (Paull et al. (2012); Vazquez-Prokopec et al. (2016)). As such, it is important to gain understanding of how multiple coupled heterogeneities impact transmission and the limitations of general results that can be obtained in such more realistic settings. This paper provides a theoretical step in that direction and aims to guide more detailed studies that involve numerical exploration of specific situations, yielding further insights into the epidemiological role of individual variability.
Acknowledgments
This work was supported by grants from the National Institutes of Health (P01-AI098670; all authors, and R01-AI091980; ALL) and the National Science Foundation (RTG/DMS1246991; ALL).