Coefficient of determination R² and intra-class correlation coefficient ICC from generalized linear mixed-effects models revisited and expanded

Shinichi Nakagawa; Holger Schielzeth

doi:10.1101/095851

Abstract

The coefficient of determination R² quantifies the proportion of variance explained by a statistical model and is an important summary statistic of biological interest. However, estimating R² for (generalized) linear mixed models (GLMMs) remains challenging. We have previously introduced a version of R² that we called R²_GLMM for Poisson and binomial GLMMs, but not for other distributional families. Similarly, we earlier discussed how to estimate intra-class correlation coefficients ICC using Poisson and binomial GLMMs, but not for other distributional families. In this article, we expand our methods to all the other non-Gaussian distributions such as negative binomial and gamma GLMMs. While expanding our approach, we highlight two useful concepts, Jensen’s inequality and the delta method, both of which help us in understanding the properties of GLMMs. We illustrate the implementation of our extension by worked examples in the R environment although our method can be used regardless of statistical environments.

Introduction

One of the main purposes of linear modelling is to understand the sources of variation in biological data. In this context, it is not surprising that the coefficient of determination R² is a commonly reported statistic because it represents the proportion of variance explained by a linear model. The intra-class correlation coefficient ICC is a related statistic that quantifies the proportion of variance explained by a grouping (random) factor in multilevel/hierarchical data. In the field of ecology and evolution, a type of ICC is often referred to as repeatability R, where the grouping factor is often individuals that have been phenotyped repeatedly (Lessells and Boag 1987, Nakagawa and Schielzeth 2010). We have reviewed methods for estimating R² and ICC in the past (Nakagawa & Schielzeth 2010, 2013), with a particular focus on non-Gaussian response variables, featuring generalized linear mixed-effects models (GLMMs) as the most versatile engine for estimating R² and ICC (specifically R²_GLMM and ICC_GLMM). Our descriptions were limited to random-intercept GLMMs, but Johnson (2014) has recently extended the methods to random-slope GLMMs, widening the applicability of these statistics (see also, LaHuis et al. 2014; Jaeger et al. 2016).

However, at least one important issue seems to remain. Currently these two statistics are only described for binomial and Poisson GLMMs. Although these two types of GLMMs are arguably the most popular (Bolker et al. 2009), there are other commonly used families for GLMMs, such as negative binomial and gamma distributions (Ver Hoef & Boveng 2007; Bolker 2008). In this article, we revisit and extend R²_GLMM and ICC_GLMM to more distributional families, in particular to negative binomial and gamma distributions. In this context, we discuss Jensen’s inequality and two variants of the delta method, which are useful not only for extending our method, but also for interpreting the results of GLMMs in general. Furthermore, we refer to some special considerations when obtaining R²_GLMM and ICC_GLMM from binomial GLMMs for binary and proportion data, which we did not discuss in the past (Nakagawa & Schielzeth 2010, 2013). We provide worked examples focusing on implementation in the R environment (R Core Team 2016) and finish by referring to two alternative approaches for obtaining R² and ICC from GLMMs along with a cautionary note.

Definitions of R²_GLMM, ICC_GLMM and overdispersion

To start with, we present R²_GLMM and ICC_GLMM for a simple case of Gaussian error distributions based on a linear mixed-effects model (LMM, hence also referred to as R²_LMM and ICC_LMM). Imagine a two-level dataset where the first level corresponds to observations and the second level to some grouping factor (e.g. individuals) with k fixed effect covariates. The model can be written as (model 1):

y_ij = β₀ + Σ_h β_h x_hij + α_i + ε_ij (Equation 1)
α_i ~ Gaussian(0, σ_α²) (Equation 2)
ε_ij ~ Gaussian(0, σ_ε²) (Equation 3)

where y_ij is the jth observation of the ith individual, x_hij is the jth value of the ith individual for the hth of k fixed effects predictors, β₀ is the (grand) intercept, β_h is the regression coefficient for the hth predictor, α_i is an individual-specific effect, assumed to be normally distributed in the population with mean and variance of 0 and σ_α², respectively, and ε_ij is an observation-specific residual, assumed to be normally distributed in the population with mean and variance of 0 and σ_ε², respectively. For this model, we can define two types of R² as:

R²_LMM(m) = σ_f² / (σ_f² + σ_α² + σ_ε²) (Equation 4)
R²_LMM(c) = (σ_f² + σ_α²) / (σ_f² + σ_α² + σ_ε²) (Equation 5)
σ_f² = var(Σ_h β_h x_hij) (Equation 6)

where R²_LMM(m) represents the marginal R², which is the variance accounted for by the fixed effects, R²_LMM(c) represents the conditional R², which is the variance explained by both fixed and random effects, and σ_f² is the variance explained by fixed effects (Snijders & Bosker 1999, 2011). Since marginal and conditional R² differ only in whether the random effect variance is included in the numerator, we avoid redundancy and present equations only for marginal R² in the following. Similarly, there are two types of ICC:

ICC_LMM(adj) = σ_α² / (σ_α² + σ_ε²) (Equation 7)
ICC_LMM = σ_α² / (σ_f² + σ_α² + σ_ε²) (Equation 8)

If no fixed effects are included, the two versions are identical and represent unadjusted ICC, but if fixed effects are fitted, ICC_LMM(adj) represents adjusted ICC, while ICC_LMM represents unadjusted ICC (sensu Nakagawa and Schielzeth 2010). Since the two versions of ICC differ only in whether the fixed effect variance (calculated as in Equation 6) is included in the denominator, we avoid redundancy and present equations only for adjusted ICC in the following.
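The definitions above reduce to simple ratios of variance components. The following Python snippet is a minimal sketch of that arithmetic, not part of the original analysis; the function name and the numerical values are hypothetical and chosen only for illustration.

```python
def r2_and_icc(var_fixed, var_random, var_resid):
    """Return marginal/conditional R2 and adjusted/unadjusted ICC for model 1."""
    total = var_fixed + var_random + var_resid
    return {
        # fixed-effect variance over total variance
        "r2_marginal": var_fixed / total,
        # fixed plus random-effect variance over total variance
        "r2_conditional": (var_fixed + var_random) / total,
        # fixed-effect variance excluded from the denominator
        "icc_adjusted": var_random / (var_random + var_resid),
        # fixed-effect variance included in the denominator
        "icc_unadjusted": var_random / total,
    }


# Hypothetical variance components (sigma_f^2, sigma_alpha^2, sigma_epsilon^2).
result = r2_and_icc(var_fixed=1.0, var_random=2.0, var_resid=1.0)
print(result)
```

With no fixed effects (var_fixed = 0) the two ICC values coincide, as stated in the text.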

One of the main difficulties in extending R² from LMMs to GLMMs is defining the residual variance σ_ε². For binomial and Poisson GLMMs with an additive dispersion term, we have previously stated that σ_ε² is equivalent to σ_e² + σ_d² where σ_e² is the variance for the additive overdispersion term, and σ_d² is the distribution-specific variance (Nakagawa & Schielzeth 2010, 2013). Here overdispersion represents the excess variation relative to what is expected from a certain distribution and can be estimated by fitting an observation-level random effect (OLRE; see Harrison 2014, 2015). Alternatively, overdispersion in GLMMs can be implemented using a multiplicative overdispersion term (Browne et al. 2005). In such an implementation, we stated that σ_ε² is equivalent to ω·σ_d² where ω is a multiplicative dispersion parameter estimated from the model (Nakagawa & Schielzeth 2010). But obtaining σ_d² for specific distributions is not always possible, because in many families of GLMMs the parameters are less clearly separated into a parameter for the expectation of the mean and a parameter for the (over)dispersion. It turns out that binomial and Poisson distributions are special cases where σ_d² can be usefully calculated, because either all overdispersion is modelled by an OLRE (additive overdispersion) or by a single multiplicative overdispersion parameter (multiplicative overdispersion). However, as we will show below, we can always obtain the GLMM version of σ_ε² (on the latent scale) directly. We refer to this generalised version of σ_ε² as ‘the observation-level variance’ here rather than the residual variance (but we keep the notation σ_ε²).

Extension of R²_GLMM and ICC_GLMM

We now define R²_GLMM and ICC_GLMM for an overdispersed Poisson (also known as quasi-Poisson) GLMM, because the overdispersed Poisson distribution is similar to the negative binomial distribution at least in their uses (Gelman & Hill 2007; Ver Hoef & Boveng 2007). Imagine count data repeatedly measured from a number of individuals with associated data on k covariates. We fit an overdispersed Poisson (OP) GLMM with the log link function (model 2):

y_ij ~ OP(λ_ij, ω) (Equation 9)
ln(λ_ij) = β₀ + Σ_h β_h x_hij + α_i (Equation 10)
α_i ~ Gaussian(0, σ_α²) (Equation 11)

where y_ij is the jth observation of the ith individual and y_ij follows an overdispersed Poisson distribution with two parameters, λ_ij and ω, ln(λ_ij) is the latent value for the jth observation of the ith individual, ω is the overdispersion parameter (when the multiplicative dispersion parameter ω is 1, the model becomes a standard Poisson GLMM), α_i is an individual-specific effect, assumed to be normally distributed in the population with mean and variance of 0 and σ_α², respectively (as in model 1), and the other symbols are the same as above. For such a model, we can define R²_GLMM(m) and (adjusted) ICC_GLMM as:

R²_OP-ln(m) = σ_f² / (σ_f² + σ_α² + ln(1 + ω/λ)) (Equation 12)
ICC_OP-ln = σ_α² / (σ_α² + ln(1 + ω/λ)) (Equation 13)

where the subscripts of R² and ICC denote the distributional family, here OP-ln for overdispersed Poisson distribution with log link, the term ln(1 + ω/λ) corresponds to the observation-level variance σ_ε² (Table 1, for derivation see Appendix S1), ω is the overdispersion parameter, and λ is the mean value of λ_ij. We discuss how to obtain λ below.

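The calculation is very similar for a negative binomial (NB) GLMM with the log link (model 3):

y_ij ~ NB(λ_ij, θ) (Equation 14)
ln(λ_ij) = β₀ + Σ_h β_h x_hij + α_i (Equation 15)
α_i ~ Gaussian(0, σ_α²) (Equation 16)

where y_ij is the jth observation of the ith individual and y_ij follows a negative binomial distribution with two parameters, λ_ij and θ, where θ is the shape parameter of the negative binomial distribution (often reported by software as the dispersion parameter), and the other symbols are the same as above. R²_GLMM(m) and (adjusted) ICC_GLMM for this model can be calculated as:

R²_NB-ln(m) = σ_f² / (σ_f² + σ_α² + ln(1 + 1/λ + 1/θ)) (Equation 17)
ICC_NB-ln = σ_α² / (σ_α² + ln(1 + 1/λ + 1/θ)) (Equation 18)

Finally, for a gamma GLMM with the log link (model 4):

y_ij ~ gamma(λ_ij, ν) (Equation 19)
ln(λ_ij) = β₀ + Σ_h β_h x_hij + α_i (Equation 20)
α_i ~ Gaussian(0, σ_α²) (Equation 21)

where y_ij is the jth observation of the ith individual and y_ij follows a gamma distribution with two parameters, λ_ij and ν, where ν is the shape parameter of the gamma distribution (sometimes statistical programs report 1/ν instead of ν; also note that the gamma distribution can be parameterized in alternative ways, Table 1), R²_GLMM(m) and (adjusted) ICC_GLMM can be calculated as:

R²_gamma-ln(m) = σ_f² / (σ_f² + σ_α² + ln(1 + 1/ν)) (Equation 22)
ICC_gamma-ln = σ_α² / (σ_α² + ln(1 + 1/ν)) (Equation 23)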

Table 1.

The observation-level variance σ_ε² for the three distributional families: quasi-Poisson (overdispersed Poisson), negative binomial and gamma with the three different methods for deriving σ_ε²: the delta method, log-normal approximation and the trigamma function, ψ₁.
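As an illustration of the log-normal approximation for the three log-link families, the Python sketch below computes σ_ε² as ln(1 + CV²), where CV² is the squared coefficient of variation on the observed scale. The quasi-Poisson term ln(1 + ω/λ) is given in the text; the negative binomial and gamma terms are derived here from the standard variance functions (λ + λ²/θ and λ²/ν) and should be read as assumptions of this sketch. All function names and numerical values are hypothetical.

```python
import math


def observation_variance(family, lam=None, omega=None, theta=None, nu=None):
    """Log-normal approximation ln(1 + CV^2) of the observation-level variance."""
    if family == "quasipoisson":      # var[y] = omega * lam
        cv2 = omega / lam
    elif family == "negbinomial":     # var[y] = lam + lam^2 / theta (assumed)
        cv2 = 1.0 / lam + 1.0 / theta
    elif family == "gamma":           # var[y] = lam^2 / nu (assumed)
        cv2 = 1.0 / nu
    else:
        raise ValueError("unknown family: " + family)
    return math.log1p(cv2)


def marginal_r2(var_fixed, var_random, var_obs):
    """Marginal R2 with the observation-level variance in the denominator."""
    return var_fixed / (var_fixed + var_random + var_obs)


def adjusted_icc(var_random, var_obs):
    """Adjusted ICC (fixed-effect variance excluded from the denominator)."""
    return var_random / (var_random + var_obs)


# Hypothetical estimates, for illustration only.
var_fixed, var_random = 0.3, 0.5
var_obs = observation_variance("negbinomial", lam=4.0, theta=2.0)
print(marginal_r2(var_fixed, var_random, var_obs))
print(adjusted_icc(var_random, var_obs))
```

Setting ω = 1 in the quasi-Poisson case gives ln(1 + 1/λ), the value used for a standard Poisson GLMM.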

Obtaining the observation-level variance by the ‘first’ delta method

For overdispersed Poisson, negative binomial and gamma GLMMs with log link, the observation-level variance σ_ε² can be obtained via the variance of the log-normal distribution, as described above (see Appendix S1). There are two more alternative methods to obtain the same target: the delta method and the trigamma function. The two alternatives have different advantages and will be discussed in some detail below.

The delta method for variance approximation uses a first order Taylor series expansion, which is often employed to approximate the standard error (error variance) for transformations (or functions) of a variable x when the (error) variance of x itself is known (see Ver Hoef 2012; for an accessible reference for biologists, Powell 2007). A simple case of the delta method for variance approximation can be written as:

var[f(x)] ≈ var[x] (d f(x)/dx)² (Equation 24)

where x is a random variable (typically represented by observations), f represents a function (e.g. log or square-root), var denotes variance, and d/dx is a (first) derivative with respect to variable x. Taking derivatives of any function can be easily done using the R environment (examples can be found in the Appendices). It is the delta method that Foulley et al. (1987) used to derive the distribution-specific variance σ_d² for Poisson GLMMs as 1/λ. Given that var[λ_ij] = λ in Poisson distributions and d ln(λ)/dλ = 1/λ, it follows that var[ln(λ_ij)] ≈ λ(1/λ)² = 1/λ (note that for Poisson distributions without overdispersion, σ_d² is equal to σ_ε² because σ_e² = 0). One clear advantage of the delta method is its flexibility, and we can easily obtain the observation-level variance σ_ε² for all kinds of distributions/link functions. For example, by using the delta method, it is straightforward to obtain σ_ε² for the Tweedie (compound Poisson-gamma) distribution, which has been used to model non-negative real numbers in ecology (e.g., Foster & Bravington 2013; Zhang 2013). For the Tweedie distribution, the variance on the observed scale has the relationship var[y] = φμ^p where μ is the mean on the observed scale and φ is the dispersion parameter (comparable to λ and ω in Equation 9), and p is a positive constant called an index parameter. Therefore, when used with the log-link function, an approximated σ_ε² value can be obtained by φμ^(p-2) according to Equation 24. The log-normal approximation ln(1 + φμ^(p-2)) is also possible (see Appendix S1; cf. Table 1).
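The variance form of the delta method can be checked numerically. The Python sketch below is an illustration only: it approximates the derivative by a central difference (an implementation choice made here, not something the text prescribes) and reproduces the two log-link results mentioned above, 1/λ for the Poisson case and φμ^(p-2) for the Tweedie case. The parameter values are hypothetical.

```python
import math


def delta_variance(f, mean, variance, h=1e-5):
    """First-order delta method: var[f(x)] ~ var[x] * (df/dx at the mean)^2."""
    slope = (f(mean + h) - f(mean - h)) / (2.0 * h)  # central-difference derivative
    return variance * slope ** 2


# Poisson with log link: var[y] = lam, so the latent-scale variance is about 1/lam.
lam = 5.0
poisson_latent = delta_variance(math.log, lam, lam)

# Tweedie with log link: var[y] = phi * mu^p, so the result is about phi * mu^(p - 2).
phi, mu, p = 2.0, 3.0, 1.5
tweedie_latent = delta_variance(math.log, mu, phi * mu ** p)

print(poisson_latent, 1.0 / lam)
print(tweedie_latent, phi * mu ** (p - 2.0))
```

Passing a different function f (for example math.sqrt) gives the corresponding approximation for another link or transformation.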

The use of the trigamma function ψ₁ is limited to distributions with log link, but it should provide the most accurate estimate of the observation-level variance σ_ε². This is because the variance of a gamma-distributed variable on the log scale is equal to ψ₁(ν) where ν is the shape parameter of the gamma distribution (Tempelman & Gianola 1999) and hence σ_ε² is ψ₁(ν). At the level of the statistical parameters (Table 1; on the ‘expected data’ scale; sensu de Villemereuil et al. 2016; see their Figure 1), Poisson and negative binomial distributions can both be seen as special cases of gamma distributions, and σ_ε² can be obtained using the trigamma function (Table 1). For example, σ_ε² for the Poisson distribution is ψ₁(λ), with the special feature that in the case of Poisson distributions σ_ε² = σ_d². As we show in Appendix S2, ln(1+1/λ) (log-normal approximation), 1/λ (delta method approximation) and ψ₁(λ) (trigamma function) are similar if λ is greater than 2. Nonetheless, our recommendation is to use the trigamma function for obtaining σ_ε² whenever this is possible.
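To make the comparison among the three estimates concrete for a Poisson model, the Python sketch below evaluates ln(1 + 1/λ), 1/λ and ψ₁(λ) for a few hypothetical values of λ. The trigamma function is implemented here with a standard recurrence and asymptotic series so that the example needs only the standard library; this helper is an assumption of the sketch and is not taken from the original scripts.

```python
import math


def trigamma(x):
    """Trigamma function for x > 0 via recurrence plus an asymptotic series."""
    if x <= 0:
        raise ValueError("x must be positive")
    total = 0.0
    while x < 6.0:                      # psi1(x) = psi1(x + 1) + 1 / x^2
        total += 1.0 / (x * x)
        x += 1.0
    inv = 1.0 / x
    inv2 = inv * inv
    # 1/x + 1/(2x^2) + 1/(6x^3) - 1/(30x^5) + 1/(42x^7) - 1/(30x^9)
    series = inv + 0.5 * inv2 + inv * inv2 * (
        1.0 / 6.0 - inv2 * (1.0 / 30.0 - inv2 * (1.0 / 42.0 - inv2 / 30.0)))
    return total + series


def poisson_observation_variances(lam):
    """Three estimates of the observation-level variance for a log-link Poisson model."""
    return {
        "lognormal": math.log1p(1.0 / lam),
        "delta": 1.0 / lam,
        "trigamma": trigamma(lam),
    }


# Hypothetical values of lambda.
for lam in (0.5, 2.0, 10.0):
    print(lam, poisson_observation_variances(lam))
```

The three values move closer together as λ increases, and the log-normal value is always the smallest of the three.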

Figure 1.

A schematic of how hypothetical datasets are obtained (see the main text for details).

We note that in calculations of heritability (which can be seen as a type of ICC although in a strict sense, it is not; see de Villemereuil et al. 2016) using negative binomial GLMMs, the trigamma function has been previously used to obtain observation-level variance (Matos et al. 1997; Tempelman & Gianola 1999; cf. de Villemereuil et al. 2016). Table 1 summarises observation-level variance σ_ε² for overdispersed Poisson, negative binomial and gamma distributions for commonly used link functions.

How to estimate λ from data

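Imagine a Poisson GLMM with log link and additive overdispersion fitted as an observation-level random effect (model 5):

y_ij ~ Poisson(λ_ij) (Equation 25)
ln(λ_ij) = β₀ + Σ_h β_h x_hij + α_i + e_ij (Equation 26)
α_i ~ Gaussian(0, σ_α²) (Equation 27)
e_ij ~ Gaussian(0, σ_e²) (Equation 28)

where y_ij is the jth observation of the ith individual, and follows a Poisson distribution with the parameter λ_ij, e_ij is an additive overdispersion term for the jth observation of the ith individual, and the other symbols are the same as above. Using the log-normal approximation, R²_GLMM(m) and (adjusted) ICC_GLMM can be calculated as:

R²_P-ln(m) = σ_f² / (σ_f² + σ_α² + σ_e² + ln(1 + 1/λ)) (Equation 29)
ICC_P-ln = σ_α² / (σ_α² + σ_e² + ln(1 + 1/λ)) (Equation 30)

where, as mentioned above, the term ln(1+1/λ) is σ_ε² (or σ_d²) for Poisson distributions with the log link (Table 1).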

In our earlier papers, we proposed to use the exponential of the intercept (from the intercept-only model or models with centred fixed factors) exp(β₀) as an estimator of λ (Nakagawa & Schielzeth 2010, 2013). We also suggested that it is possible to use the mean of observed values y_ij. Unfortunately, these two recommendations are often inconsistent with each other. This is because, given model 5 (and all the models in the previous section), the following relationships hold:

exp(β₀) ≤ E[y_ij] (Equation 31)
E[y_ij] = exp(β₀ + 0.5σ_τ²) (Equation 32)

where E represents the expected value (i.e., mean) on the observed scale, β₀ is the mean value on the latent scale (i.e. β₀ from the intercept-only model), and σ_τ² is the total variance on the latent scale (e.g., σ_α²+σ_e² in models 1 and 5, and σ_α² in models 2-4; Nakagawa & Schielzeth 2010; see also Carrasco 2010). In fact, exp(β₀) gives the median value of y_ij rather than the mean of y_ij, assuming a Poisson distribution. Thus, the use of exp(β₀) will often overestimate σ_d², providing conservative (smaller) estimates of R² and ICC, compared to when using averaged y_ij, which is a better estimate of E[y_ij]. Quantitative differences between the two approaches may often be negligible, but when λ is small, the difference can be substantial, so the choice of method needs to be reported for reproducibility (Appendix S2). Our new recommendation is to obtain λ via Equation 32. When sampling is balanced (i.e. observations are equally distributed across individuals and covariates), Equation 32 and the mean of the observed values will give similar values, but when sampling is unbalanced, Equation 32 is preferable. This recommendation for obtaining λ also applies to negative binomial GLMMs (see Table 1).
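The consequence of the choice of λ can be sketched numerically. The Python snippet below assumes that Equation 32 is the usual log-normal mean, exp(β₀ + 0.5σ_τ²), and compares it with exp(β₀); the intercept and variance components are hypothetical values chosen so that λ is small, which is where the text says the difference matters most.

```python
import math


def lambda_from_latent(beta0, var_total):
    """Observed-scale mean for a log-link model, assumed to be exp(beta0 + 0.5 * var)."""
    return math.exp(beta0 + 0.5 * var_total)


# Hypothetical estimates from an intercept-only Poisson GLMM with an OLRE.
beta0, var_alpha, var_e = 0.5, 0.4, 0.2

lam_median = math.exp(beta0)                              # earlier recommendation
lam_mean = lambda_from_latent(beta0, var_alpha + var_e)   # recommendation in this section

# Distribution-specific variance under the log-normal approximation.
sigma_d_median = math.log1p(1.0 / lam_median)
sigma_d_mean = math.log1p(1.0 / lam_mean)

# Adjusted ICC under both choices of lambda.
icc_median = var_alpha / (var_alpha + var_e + sigma_d_median)
icc_mean = var_alpha / (var_alpha + var_e + sigma_d_mean)
print(lam_median, lam_mean)
print(icc_median, icc_mean)
```

Using exp(β₀) gives the smaller λ, the larger σ_d² and therefore the smaller (more conservative) ICC.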

Jensen’s inequality and the ‘second’ delta method

A general form of Equation 31 is known as Jensen’s inequality:

g(E[x]) ≤ E[g(x)] (Equation 33)

where g is a convex function. Hence, the transformation of the mean value is equal to or smaller than the mean of transformed values (the opposite is true for a concave function; that is, g(E[x]) ≥ E[g(x)]; Rao 2002). In fact, whenever the function is not strictly linear, simple application of the inverse link function (or back-transformation) cannot be used to translate the mean on the latent scale into the mean value on the observed scale. This inequality has important implications for the interpretation of results from GLMMs, and also from generalized linear models (GLMs) and linear models with transformed response variables.

Although log-link GLMMs (e.g., model 5) have an analytical formula (Equation 32), this is not usually the case. Therefore, converting the latent scale values into observation-scale values requires simulation using the inverse link function. However, the delta method for bias correction can be used as a general approximation to account for Jensen’s inequality when using link functions or transformations. This application of the delta method uses a second order Taylor series expansion (Oehlert 1992; Ver Hoef 2012). A simple case of the delta method for bias correction can be written as:

E[f(x)] ≈ f(E[x]) + 0.5 var[x] (d²f(x)/dx²) (Equation 34)

where d²/dx² is a second derivative with respect to the variable x (evaluated at the mean of x) and the other symbols are as in Equations 24 and 32. By employing this bias correction delta method (with d² exp(x)/dx² = exp(x)), we can approximate Equation 32 using the same symbols as in Equations 31-33:

E[y_ij] ≈ exp(β₀) + 0.5σ_τ² exp(β₀) (Equation 35)
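A small numerical comparison shows how the second-order approximation behaves for the log link. The Python sketch below takes the exact value to be exp(β₀ + 0.5σ_τ²), the log-normal mean, and the approximation to be exp(β₀)(1 + 0.5σ_τ²); the helper name and the parameter values are hypothetical.

```python
import math


def delta_mean(f, second_derivative, mean, variance):
    """Second-order delta method: E[f(x)] ~ f(mean) + 0.5 * var[x] * f''(mean)."""
    return f(mean) + 0.5 * variance * second_derivative(mean)


beta0 = 1.0  # hypothetical latent-scale intercept
comparison = []
for var_total in (0.1, 0.5, 1.0):
    exact = math.exp(beta0 + 0.5 * var_total)
    # For the exponential function, both f and its second derivative are exp.
    approx = delta_mean(math.exp, math.exp, beta0, var_total)
    comparison.append((var_total, exact, approx))
    print(var_total, exact, approx)
```

The approximation stays below the exact value and degrades as the latent-scale variance grows, which is one reason to prefer the exact formula for log-link models.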

The comparison between Equation 32 (exact) and Equation 35 (approximate) is shown in Appendix S3. The approximation is most useful when the exact formula is not available, as in the case of a binomial GLMM with logit link (model 6):

y_ij ~ binomial(n_ij, p_ij) (Equation 36)
logit(p_ij) = β₀ + Σ_h β_h x_hij + α_i + e_ij (Equation 37)
α_i ~ Gaussian(0, σ_α²) (Equation 38)
e_ij ~ Gaussian(0, σ_e²) (Equation 39)

where y_ij is the number of ‘successes’ in n_ij trials by the ith individual at the jth occasion (for binary data, n_ij is always 1), p_ij is the underlying probability of success, and the other symbols are the same as above.

To obtain corresponding values between the latent scale and data (observation) scale, we need to account for Jensen’s inequality (note that the logit function combines concave and convex sections). For example, the overall intercept β₀ on the latent scale can be transformed not only with the inverse (anti) logit function (logit⁻¹(x) = exp(x)/(1 + exp(x))) but also with the bias-corrected approximation. For the case of the binomial GLMM, we can use the approximation below, given that d²logit⁻¹(x)/dx² = exp(x)(1 – exp(x))/(1 + exp(x))³:

E[p_ij] ≈ logit⁻¹(β₀) + 0.5σ_τ² exp(β₀)(1 – exp(β₀))/(1 + exp(β₀))³ (Equation 40)

We can replace β₀ with any value obtained from the fixed part of the model (i.e. β₀ + Σ_h β_h x_hij). Another approximation, proposed by Zeger et al. (1988), produces estimates that are similar to (but slightly better than) those from Equation 40. Using our notation, this approximation can be written as:

E[p_ij] ≈ logit⁻¹(β₀ / √(1 + (16√3/(15π))² σ_τ²)) (Equation 41)

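A comparison between Equations 40 and 41 is also shown in Appendix S3. This approximation uses the exact solution for the inverse probit function, which can be written for a model like model 6 but using the probit link (i.e., probit(p_ij) in place of logit(p_ij) in Equation 37):

E[p_ij] = Φ(β₀ / √(1 + σ_τ²)) (Equation 42)

where Φ is the standard normal cumulative distribution function (the inverse probit function).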
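The logit-link conversions discussed above can be compared against a numerical reference. The Python sketch below evaluates the second-order delta approximation, the approximation of Zeger et al. (1988) with the constant 16√3/(15π), and a simple numerical integration over a normal latent distribution, which stands in for the simulation mentioned in the text. The formulas are written from their standard forms and, like the function names, grid settings and parameter values, are assumptions of this sketch.

```python
import math


def inv_logit(x):
    return 1.0 / (1.0 + math.exp(-x))


def mean_prob_delta(beta0, var_total):
    """Second-order delta (bias-corrected) approximation for the logit link."""
    e = math.exp(beta0)
    return inv_logit(beta0) + 0.5 * var_total * e * (1.0 - e) / (1.0 + e) ** 3


def mean_prob_zeger(beta0, var_total):
    """Approximation of Zeger et al. (1988) with c = 16 * sqrt(3) / (15 * pi)."""
    c = 16.0 * math.sqrt(3.0) / (15.0 * math.pi)
    return inv_logit(beta0 / math.sqrt(1.0 + c * c * var_total))


def mean_prob_probit(beta0, var_total):
    """Exact solution for the probit link: Phi(beta0 / sqrt(1 + var))."""
    z = beta0 / math.sqrt(1.0 + var_total)
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def mean_prob_numeric(beta0, var_total, n=4001, width=8.0):
    """Numerical reference: integrate inv_logit over a normal latent distribution."""
    sd = math.sqrt(var_total)
    step = 2.0 * width / (n - 1)
    total = 0.0
    for i in range(n):
        z = -width + i * step
        density = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)
        total += inv_logit(beta0 + sd * z) * density * step
    return total


beta0, var_total = 1.0, 1.0  # hypothetical latent-scale intercept and total variance
numeric = mean_prob_numeric(beta0, var_total)
delta = mean_prob_delta(beta0, var_total)
zeger = mean_prob_zeger(beta0, var_total)
print(inv_logit(beta0), delta, zeger, numeric)
```

All three corrected values fall below the naive back-transformation logit⁻¹(β₀), which is the attenuation towards 0.5 that Jensen’s inequality implies for a positive intercept.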

Simulation will give the most accurate conversions when no exact solutions are available. The use of the delta method for bias correction accounting for Jensen’s inequality is a very general and versatile approach that is applicable for any distribution with any link function (see Appendix S3) and can save computation time. We note that the accuracy of the delta method (both variance approximation and bias correction) depends on the form of the function f; the conditions for and limitations of the delta method are described in Oehlert (1992).

Special considerations for binomial GLMMs

The observation-level variance σ_ε² can be thought of as being added to the latent scale on which other variance components are also estimated in a GLMM (Equations 10, 15, 20, 26, 37 for models 2-6). Since the proposed R²_GLMM and ICC_GLMM are ratios between variance components and their additive combinations, we can show using the delta method that R²_GLMM and ICC_GLMM calculated via σ_ε² approximate those of R² and ICC on the observation (original) scale (shown in Appendix S4). In some cases, there exist specific formulas for ICC on the observation scale (Nakagawa & Schielzeth 2010). In the past, we distinguished between ICC on the latent scale and on the observation scale (Nakagawa & Schielzeth 2010). Such a distinction turns out to be strictly appropriate only for binomial distributions but not for Poisson distributions (and probably also not for other non-Gaussian distributions). This is because the property of what we have called the distribution-specific variance σ_d² for binomial distributions (e.g. π²/3 for binomial error distribution with the logit link function) is quite different from what we have discussed as the observation-level variance σ_ε², although these two types of variance are related conceptually (i.e., both represent variance due to non-Gaussian distributions with specific link functions). Let us explain this further.

A binomial distribution with a mean of p (the proportion of successes) has a variance of p(1–p) and we find that the observation-level variance is 1/(p(1–p)) using the delta method on the logit-link function (see Table 2). This observation-level variance 1/(p(1–p)) is clearly different from the distribution-specific variance π²/3. As with the observation-level variance for the log-Poisson model (which is 1/λ and changes with λ; note that we would have called 1/λ the distribution-specific variance; Nakagawa & Schielzeth 2010, 2013), the observation-level variance of the binomial distribution changes as p changes (see Appendix S5), suggesting these two observation-level variances (1/λ and 1/(p(1–p))) are analogous while the distribution-specific variance π²/3 is not. Further, the minimum value of 1/(p(1–p)) is 4, which is larger than π²/3 ≈ 3.29. Because this variance enters the denominator, the use of 1/(p(1–p)) in R² and ICC will always produce smaller values than those using π²/3.

Table 2.

The distribution-specific variance σ_d² and observation-level variance σ_ε² for binomial (and Bernoulli) distributions; note that only one of them should be used for obtaining R² and ICC.

Consequently, Browne et al. (2005) showed that ICC values (or variance partition coefficients, VPCs) estimated using π²/3 were higher than corresponding ICC values on the observation (original) scale using logistic-binomial GLMMs (see also Goldstein et al. 2002; Nakagawa & Schielzeth 2010). Then, what is π²/3?

Three common link functions in binomial GLMMs (logit, probit and complementary log-log) all have corresponding distributions on the latent scale: the logistic distribution, standard normal distribution and Gumbel distribution, respectively. Each of these distributions has a theoretical variance, namely, π²/3, 1 and π²/6, respectively (Table 2). As far as we are aware, these theoretical variances only exist for binomial distributions. It is important to notice that, for example, the meaning of 1/(p(1–p)), which is the variance on the latent scale that approximates the variance due to binomial distributions on the observation scale, is distinct from the meaning of π²/3, which is the variance of the latent distribution (i.e., the logistic distribution) according to which the original data are theoretically distributed on the logit scale. We need to distinguish these theoretical (distribution-specific) variances from the observation-level variance. Put another way, R² and ICC values using the theoretical distribution-specific variance can rightly be said to be on the latent (link) scale (sensu Nakagawa & Schielzeth 2010) while, as mentioned above, R² and ICC values using the observation-level variance estimate the counterparts on the observation (original) scale (cf. de Villemereuil et al. 2016). The use of the theoretical distribution-specific variance will almost always provide different values of R²_GLMM and ICC_GLMM from those using the observation-level variance obtained via the delta method (see Appendix S5). In any case, we should be aware that binomial GLMMs are special cases when obtaining R²_GLMM and ICC_GLMM.
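The contrast between the two kinds of variance for a logit-link binomial model can be illustrated with a few lines of Python. The sketch below computes an adjusted ICC once with the distribution-specific variance π²/3 and once with the observation-level variance 1/(p(1–p)) for several values of p; the group-level variance and the values of p are hypothetical.

```python
import math


def icc(var_group, var_obs):
    """Adjusted ICC: group-level variance over group-level plus observation variance."""
    return var_group / (var_group + var_obs)


def observation_variance_logit(p):
    """Delta-method observation-level variance for a logit-link binomial model."""
    return 1.0 / (p * (1.0 - p))


var_alpha = 1.0  # hypothetical group-level variance on the latent scale
icc_latent = icc(var_alpha, math.pi ** 2 / 3.0)  # uses the distribution-specific variance

icc_observation = {}
for p in (0.1, 0.3, 0.5):
    icc_observation[p] = icc(var_alpha, observation_variance_logit(p))
    print(p, observation_variance_logit(p), icc_observation[p], icc_latent)
```

Because 1/(p(1–p)) never falls below 4, the ICC based on the observation-level variance stays below the latent-scale ICC for every p, and it shrinks further as p moves away from 0.5.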

Worked examples: revisiting the beetles

In the following, we present a worked example by expanding the beetle dataset that was generated for Nakagawa & Schielzeth (2013). In brief, the dataset represents a hypothetical species of beetle that has the following life cycle: larvae hatch and grow in the soil until they pupate, and then adult beetles feed and mate on plants. Larvae are sampled from 12 different populations (‘Population’; see Fig. 1). Within each population, larvae are collected at two different microhabitats (‘Habitat’): dry and wet areas as determined by soil moisture. Larvae are exposed to two different dietary treatments (‘Treatment’): nutrient rich and control. The species is sexually dimorphic and can be easily sexed at the pupa stage (‘Sex’). Male beetles have two different color morphs: one dark and the other reddish brown (‘Morph’, labeled as A and B in Fig. 1). Sexed pupae are housed in standard containers until they mature (‘Container’). Each container holds eight same-sex animals from a single population, but with a mix of individuals from the two habitats (N_[container] = 120; N_[animal] = 960).

We have data on five phenotypes, two of them sex-limited: (i) the number of eggs laid by each female after random mating, which we had generated previously using Poisson distributions (with additive dispersion) and which we revisit here for analysis with quasi-Poisson models (i.e. multiplicative dispersion), (ii) the incidence of endo-parasitic infections, which we generated as being negative binomial distributed, (iii) body length of adult beetles, which we had generated previously using Gaussian distributions and which we revisit here for analysis with gamma distributions, (iv) time to visit five predefined sectors of an arena (employed as a measure of exploratory tendencies), which we generated as being gamma distributed, and (v) the two male morphs, which were again generated with binomial distributions. We will use this simulated dataset to estimate R²_GLMM and ICC_GLMM.

All data generation and analyses were conducted in R 3.3.1 (R Core Team 2016). We used functions to fit GLMMs from three R packages: 1) the glmmadmb function from glmmADMB (Fournier et al. 2012), 2) the glmmPQL function from MASS (Venables & Ripley 2002) and 3) the glmer and glmer.nb functions from lme4 (Bates et al. 2015). In Table 3, we only report results from glmmADMB because this is the only function that can fit models with all relevant distributional families. All scripts and results are provided as an electronic supplement (Appendix S6). In addition, Appendix S6 includes an example of a model using the Tweedie distribution, which was fitted by the cpglmm function from the cplm package (Zhang 2013). Notably, our approach for R²_GLMM is kindly being implemented in the rsquared function in the R package piecewiseSEM (Lefcheck 2016). Another important note is that we often find less congruence in GLMM results from the different packages than in those of linear mixed-effects models (LMMs). Thus, it is recommended to run GLMMs in more than one package to check robustness of the results, although this may not always be possible.

In all the models, estimated regression coefficients and variance components are very much in agreement with what is expected from our parameter settings (Table 3 and Appendix S6). When comparing the null and full models, which had ‘sex’ as a predictor, the magnitudes of the variance component for the container effect always decrease in the full models. This is because the variance due to sex is confounded with the container variance in the null model. As expected, (unadjusted) ICC values from the null models are usually smaller than adjusted ICC values from the full models because the observation-level variance (analogous to the residual variance) was smaller in the full models (implying that the denominator of Equation 10 shrinks). However, the numerator also becomes smaller for ICC values for the container effect from the parasite, size and exploration models so that adjusted ICC values are not necessarily larger than unadjusted ICC values. Accordingly, adjusted ICC_[container] is smaller in the parasite and size models but not in the exploration model. The last thing to note is that for the morph models (binomial mixed models), both R² and ICC values are larger when using the distribution-specific variance rather than the observation-level variance, as discussed above (Table 3; also see Appendix S4).

Table 3.

Mixed-effects model analysis of a simulated dataset estimating variance components and regression slopes for nutrient manipulations on fecundity, endoparasite loads, body length, exploration levels and male morph types; N_[population]=12, N_[container]=120 and N_[animal]=960.

Alternatives and a cautionary note

Here we extended our simple methods for obtaining R²_GLMM and ICC_GLMM for Poisson and binomial GLMMs to other types of GLMMs such as negative binomial and gamma. We have described three different ways of obtaining the observation-level variance and how to obtain the key rate parameter λ for Poisson and negative binomial distributions. We discussed important considerations which arise for estimating R²_GLMM and ICC_GLMM with binomial GLMMs. As we have shown, the merit of our approach is not only its ease of implementation but also that our approach encourages researchers to pay more attention to variance components at different levels. Research papers in the field of ecology and evolution often report only regression coefficients but not variance components of GLMMs (Schielzeth & Nakagawa 2013).

We would like to highlight two recent studies that provide alternatives to our approach. First, Jaeger et al. (2016) have proposed R² for fixed effects in GLMMs, which they referred to as R²_β* (an extension of an R² for fixed effects in linear mixed models, or R²_β, by Edwards et al. 2008). They show that R²_β* is a general form of our marginal R²_GLMM; in theory, R²_β* can be used for any distribution (error structure) with any link function. Jaeger et al. (2016) highlight that in the framework of R²_β*, they can easily obtain semi-partial R², which quantifies the relative importance of each predictor (fixed effect). As they demonstrate by simulation, their method potentially gives a very reliable tool for model selection. One current issue for this approach is that implementation does not seem as simple as our approach. We note that our R²_GLMM framework could also provide semi-partial R² via commonality analysis (see Ray-Mukherjee et al. 2014; note that unique variance for each predictor in commonality analysis corresponds to semi-partial R²; Nimon & Oswald 2013).

Second, de Villemereuil et al. (2016) provided a framework with which one can estimate exact heritability using GLMMs at different scales (e.g. data and latent scales). Their method can be extended to obtain exact ICC values on the data (observation) scale, which is analogous to, but not the same as, our ICC_GLMM using the observation-level variance σ_ε² described above. Further, this method can, in theory, be extended to estimate R²_GLMM on the data (observation) scale. One potential difficulty is that, although the method of de Villemereuil et al. is exact, it relies on a numerical method to solve the relevant equations, so a software package is required (e.g., the QGglmm package; de Villemereuil et al. 2016).

Finally, we finish by repeating what we said at the end of our original R² paper (Nakagawa & Schielzeth 2013). Both R² and ICC are indices that are likely to reflect only one or a few aspects of a model fit to the data and should not be used for gauging the quality of a model. We encourage biologists to use R² and ICC in conjunction with other indices like information criteria (e.g. AIC, BIC and DIC) and, more importantly, with model diagnostics such as checking for model assumptions, heteroscedasticity and sensitivity to outliers.

Acknowledgements

We thank Losia Lagisz for help in making Figure 1. This work has benefited from discussions with Jarrod Hadfield, Pierre de Villemereuil, Alistair Senior, Joel Pick and Dan Noble. SN was supported by an Australian Research Council Future Fellowship (FT130100268). HS was supported by an Emmy Noether fellowship from the German Research Foundation (DFG; SCHI 1188/1-2).

Footnotes

Authorship Statement: SN conceived the ideas and conducted the analysis. Both authors developed the ideas further and contributed to the writing and editing of the manuscript.

References

1. Bates, D., Machler, M., Bolker, B.M. & Walker, S.C. (2015). Fitting Linear Mixed-Effects Models Using lme4. J Stat Softw, 67, 1–48.
2. Bolker, B.M. (2008). Ecological models and data in R. Princeton University Press, Princeton, NJ.
3. Bolker, B.M., Brooks, M.E., Clark, C.J., Geange, S.W., Poulsen, J.R., Stevens, M.H.H. et al. (2009). Generalized linear mixed models: a practical guide for ecology and evolution. Trends Ecol Evol, 24, 127–135.
4. Browne, W.J., Subramanian, S.V., Jones, K. & Goldstein, H. (2005). Variance partitioning in multilevel logistic models that exhibit overdispersion. J R Stat Soc a Stat, 168, 599–613.
5. Carrasco, J.L. (2010). A generalized concordance correlation coefficient based on the variance components generalized linear mixed models for overdispersed count data. Biometrics, 66, 897–904.
6. de Villemereuil, P., Schielzeth, H., Nakagawa, S. & Morrissey, M. (in press). General methods for evolutionary quantitative genetic inference from generalised mixed models. Genetics.
7. Foster, S.D. & Bravington, M.V. (2013). A Poisson-Gamma model for analysis of ecological non-negative continuous data. Environ Ecol Stat, 20, 533–552.
8. Foulley, J.L., Gianola, D. & Im, S. (1987). Genetic Evaluation of Traits Distributed as Poisson-Binomial with Reference to Reproductive Characters. Theor. Appl. Genet., 73, 870–877.
9. Fournier, D.A., Skaug, H.J., Ancheta, J., Ianelli, J., Magnusson, A., Maunder, M.N. et al. (2012). AD Model Builder: using automatic differentiation for statistical inference of highly parameterized complex nonlinear models. Optim Method Softw, 27, 233–249.
10. Gelman, A. & Hill, J. (2006). Data analysis using regression and multilevel/hierarchical models. Cambridge University Press, Cambridge.
11. Goldstein, H., Browne, W. & Rasbash, J. (2002). Partitioning variation in multilevel models. Understanding Statistics, 1, 223–231.
12. Hoef, J.M.V. (2012). Who Invented the Delta Method? Am Stat, 66, 124–127.
13. Hox, J. (2010). Multilevel analysis. Routledge, New York.
14. Jaeger, B.C., Edwards, L.J., Das, K. & Sen, P.K. (2016). An R2 statistic for fixed effects in the generalized linear mixed model. Journal of Applied Statistics, doi:10.1080/02664763.2016.1193725.
15. Johnson, P.C.D. (2014). Extension of Nakagawa & Schielzeth’s R-GLMM(2) to random slopes models. Methods Ecol Evol, 5, 944–946.
16. LaHuis, D.M., Hartman, M.J., Hakoyama, S. & Clark, P.C. (2014). Explained Variance Measures for Multilevel Models. Organ Res Methods, 17, 433–451.
17. Lefcheck, J.S. (2016). PIECEWISESEM: Piecewise structural equation modelling in R for ecology, evolution, and systematics. Methods Ecol Evol, 7, 573–579.
18. Lessells, C.M. & Boag, P.T. (1987). Unrepeatable repeatabilities - a common mistake. Auk, 104, 116–121.
19. Matos, C.A.P., Thomas, D.L., Gianola, D., Tempelman, R.J. & Young, L.D. (1997). Genetic analysis of discrete reproductive traits in sheep using linear and nonlinear models .1. Estimation of genetic parameters. J. Anim. Sci., 75, 76–87.
20. Morrissey, M.B., de Villemereuil, P., Doligez, B. & Gimenez, O. (2014). Bayesian approaches to the quantitative genetic analysis of natural populations. In: Quantitative genetics in the wild (eds. Charmantier, A, Garant, D & Kruuk, LEB). Oxford University Press, Oxford, pp. 228–253.
21. Nakagawa, S. & Schielzeth, H. (2010). Repeatability for Gaussian and non-Gaussian data: a practical guide for biologists. Biol Rev, 85, 935–956.
22. Nakagawa, S. & Schielzeth, H. (2013). A general and simple method for obtaining R2 from generalized linear mixed-effects models. Methods Ecol Evol, 4, 133–142.
23. Nimon, K.F. & Oswald, F.L. (2013). Understanding the Results of Multiple Linear Regression: Beyond Standardized Regression Coefficients. Organ Res Methods, 16, 650–674.
24. Oehlert, G.W. (1992). A note on the delta method. Am Stat, 46, 27–29.
25. Powell, L.A. (2007). Approximating variance of demographic parameters using the delta method: A reference for avian biologists. Condor, 109, 949–954.
26. R Development Core Team (2016). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.
27. Rao, C.R. (2002). Linear statistical inference and its applications. 2nd edn. John Wiley & Sons, New York.
28. Ray-Mukherjee, J., Nimon, K., Mukherjee, S., Morris, D.W., Slotow, R. & Hamer, M. (2014). Using commonality analysis in multiple regressions: a tool to decompose regression effects in the face of multicollinearity. Methods Ecol Evol, 5, 320–328.
29. Schielzeth, H. & Nakagawa, S. (2013). Nested by design: model fitting and interpretation in a mixed model era. Methods Ecol Evol, 4, 14–24.
30. Snijders, T. & Bosker, R. (1999). Multilevel Analysis: an Introduction to basic and advanced multilevel modeling. Sage, London.
31. Snijders, T. & Bosker, R. (2011). Multilevel Analysis: an Introduction to basic and advanced multilevel modeling. 2nd edn. Sage, London.
32. Tempelman, R.J. & Gianola, D. (1999). Genetic analysis of fertility in dairy cattle using negative binomial mixed models. J. Dairy Sci., 82, 1834–1847.
33. Venables, W.N. & Ripley, B.D. (2002). Modern applied statistics with S. 4th edn. Springer, New York.
34. Ver Hoef, J.M. & Boveng, P.L. (2007). Quasi-Poisson vs. negative binomial regression: how should we model overdispersed count data? Ecology, 88, 2766–2772.
35. Zhang, Y.W. (2013). Likelihood-based and Bayesian methods for Tweedie compound Poisson linear mixed models. Stat Comput, 23, 743–757.

Posted March 06, 2017.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Ecology

Subject Areas

All Articles

Animal Behavior and Cognition (5210)
Biochemistry (11736)
Bioengineering (8749)
Bioinformatics (29186)
Biophysics (14964)
Cancer Biology (12086)
Cell Biology (17403)
Clinical Trials (138)
Developmental Biology (9418)
Ecology (14176)
Epidemiology (2067)
Evolutionary Biology (18299)
Genetics (12235)
Genomics (16795)
Immunology (11863)
Microbiology (28066)
Molecular Biology (11582)
Neuroscience (60936)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4956)
Plant Biology (10423)
Scientific Communication and Education (1683)
Synthetic Biology (2883)
Systems Biology (7338)
Zoology (1650)

[1] 1.↵
Bates, D., Machler, M., Bolker, B.M. & Walker, S.C. (2015). Fitting Linear Mixed-Effects Models Using lme4. J Stat Softw, 67, 1–48.
OpenUrl CrossRef

[2] 2.↵
Bolker, B.M. (2008). Ecological models and data in R. Princeton University Presss, Princeton, NJ.

[3] 3.↵
Bolker, B.M., Brooks, M.E., Clark, C.J., Geange, S.W., Poulsen, J.R., Stevens, M.H.H. et al. (2009). Generalized linear mixed models: a practical guide for ecology and evolution. Trends Ecol Evol, 24, 127–135.
OpenUrl CrossRef PubMed Web of Science

[4] 4.↵
Browne, W.J., Subramanian, S.V., Jones, K. & Goldstein, H. (2005). Variance partitioning in multilevel logistic models that exhibit overdispersion. J R Stat Soc a Stat, 168, 599–613.
OpenUrl

[5] 5.↵
Carrasco, J.L. (2010). A generalized concordance correlation coefficient based on the variance components generalized linear mixed models for overdispersed count data. Biometrics, 66, 897–904.
OpenUrl CrossRef PubMed Web of Science

[6] 6.↵
de Villemereuil, P., Schielzeth, H., Nakagawa, S. & Morrissey, M. (in press). General methods for evolutionary quantitative genetic inference from generalised mixed models. Genetics.

[7] 7.↵
Foster, S.D. & Bravington, M.V. (2013). A Poisson-Gamma model for analysis of ecological non-negative continuous data. Environ Ecol Stat, 20, 533–552.
OpenUrl CrossRef

[8] 8.↵
Foulley, J.L., Gianola, D. & Im, S. (1987). Genetic Evaluation of Traits Distributed as Poisson-Binomial with Reference to Reproductive Characters. Theor. Appl. Genet., 73, 870–877.
OpenUrl CrossRef PubMed

[9] 9.↵
Fournier, D.A., Skaug, H.J., Ancheta, J., Ianelli, J., Magnusson, A., Maunder, M.N. et al. (2012). AD Model Builder: using automatic differentiation for statistical inference of highly parameterized complex nonlinear models. Optim Method Softw, 27, 233–249.
OpenUrl CrossRef Web of Science

[10] 10.↵
Gelman, A. & Hill, J. (2006). Data analysis using regression and multilevel/hierarchical models Cambridge University Press, Cambridge.

[11] 11.↵
Goldstein, H., Browne, W. & Rasbash, J. (2002). Partitioning variation in multilevel models. Understanding Statistics, 1, 223–231.
OpenUrl CrossRef

[12] 12.↵
Hoef, J.M.V. (2012). Who Invented the Delta Method? Am Stat, 66, 124–127.
OpenUrl CrossRef Web of Science

[13] 13.
Hox, J. (2010). Multilevel analysis. Routledg, New York.

[14] 14.↵
Jaeger, B.C., Edwards, L.J., Das, K. & Sen, P.K. (2016). An R2 statistic for fixed effects in the generalized linear mixed model. Journal of Applied Statistics, 10.1080/02664763.02662016.01193725.

[15] 15.↵
Johnson, P.C.D. (2014). Extension of Nakagawa & Schielzeth’s R-GLMM(2) to random slopes models. Methods Ecol Evol, 5, 944–946.
OpenUrl CrossRef PubMed

[16] 16.↵
LaHuis, D.M., Hartman, M.J., Hakoyama, S. & Clark, P.C. (2014). Explained Variance Measures for Multilevel Models. Organ Res Methods, 17, 433–451.
OpenUrl

[17] 17.↵
Lefcheck, J.S. (2016). PIECEWISESEM: Piecewise structural equation modelling in R for ecology, evolution, and systematics. Methods Ecol Evol, 7, 573–579.
OpenUrl CrossRef

[18] 18.↵
Lessells, C.M. & Boag, P.T. (1987). Unrepeatable repeatabilities - a common mistake. Auk, 104, 116–121.
OpenUrl CrossRef Web of Science

[19] 19.↵
Matos, C.A.P., Thomas, D.L., Gianola, D., Tempelman, R.J. & Young, L.D. (1997). Genetic analysis of discrete reproductive traits in sheep using linear and nonlinear models .1. Estimation of genetic parameters. J. Anim. Sci., 75, 76–87.
OpenUrl PubMed Web of Science

[20] 20.
Morrissey, M.B., de Villemereuil, P., Doligez, B. & Gimenez, O. (2014). Bayesian approaches to the quantitative genetic analysis of natural populations. In: Quantitative genetics in the wild (eds. Charmantier, A, Garant, D & Kruuk, LEB). Oxford University Press Oxford, pp. 228–253.

[21] 21.↵
Nakagawa, S. & Schielzeth, H. (2010). Repeatability for Gaussian and non-Gaussian data: a practical guide for biologists. Biol Rev, 85, 935–956.
OpenUrl CrossRef PubMed

[22] 22.↵
Nakagawa, S. & Schielzeth, H. (2013). A general and simple method for obtaining R2 from generalized linear mixed-effects models. Methods Ecol Evol, 4, 133–142.
OpenUrl CrossRef

[23] 23.↵
Nimon, K.F. & Oswald, F.L. (2013). Understanding the Results of Multiple Linear Regression: Beyond Standardized Regression Coefficients. Organ Res Methods, 16, 650–674.
OpenUrl

[24] 24.↵
Oehlert, G.W. (1992). A note on the delta method. Am Stat, 46, 27–29.
OpenUrl CrossRef Web of Science

[25] 25.↵
Powell, L.A. (2007). Approximating variance of demographic parameters using the delta method: A reference for avian biologists. Condor, 109, 949–954.
OpenUrl CrossRef

[26] 26.↵
R Development Core Team (2016). R: A language and environment for statistical computing. R Foundation for Statistical Computing Vienna, Austria.

[27] 27.↵
Rao, C.R. (2002). Linear statistical inference and its applications. 2nd ed. edn. John Wiley & Sons, New York.

[28] 28.↵
Ray-Mukherjee, J., Nimon, K., Mukherjee, S., Morris, D.W., Slotow, R. & Hamer, M. (2014). Using commonality analysis in multiple regressions: a tool to decompose regression effects in the face of multicollinearity. Methods Ecol Evol, 5, 320–328.
OpenUrl CrossRef

[29] 29.↵
Schielzeth, H. & Nakagawa, S. (2013). Nested by design: model fitting and interpretation in a mixed model era. Methods Ecol Evol, 4, 14–24.
OpenUrl CrossRef

[30] 30.↵
Snijders, T. & Bosker, R. (1999). Multilevel Analysis: an Introduction to basic and advanced multilevel modeling. Sage, London.

[31] 31.↵
Snijders, T. & Bosker, R. (2011). Multilevel Analysis: an Introduction to basic and advanced multilevel modeling. 2^nd edn. Sage, London.

[32] 32.↵
Tempelman, R.J. & Gianola, D. (1999). Genetic analysis of fertility in dairy cattle using negative binomial mixed models. J. Dairy Sci., 82, 1834–1847.
OpenUrl PubMed

[33] 33.↵
Venables, W.N. & Ripley, B.D. (2002). Modern applied statistics with S. 4 edn. Springer, New York.

[34] 34.↵
Ver Hoef, J.M. & Boveng, P.L. (2007). Quasi-Poisson vs. negative binomial regression: how should we model overdispersed count data? Ecology, 88, 2766–2772.
OpenUrl CrossRef PubMed Web of Science

[35] 35.↵
Zhang, Y.W. (2013). Likelihood-based and Bayesian methods for Tweedie compound Poisson linear mixed models. Stat Comput, 23, 743–757.
OpenUrl CrossRef

Coefficient of determination R² and intra-class correlation coefficient ICC from generalized linear mixed-effects models revisited and expanded

Abstract

Introduction

Definitions of R²_GLMM, ICC_GLMM and overdispersion

Extension of R²_GLMM snd ICC_GLMM

Obtaining the observation-level variance by the ‘first’ delta method

How to estimate λ from data

Jensen’s inequality and the ‘second’ delta method

Special considerations for binomial GLMMs

Worked examples: revisting the beetles

Alternatives and a cautonary note

Acknowledgements

Footnotes

References

Citation Manager Formats

Subject Area

Coefficient of determination R2 and intra-class correlation coefficient ICC from generalized linear mixed-effects models revisited and expanded

Abstract

Introduction

Definitions of R2GLMM, ICCGLMM and overdispersion

Extension of R2GLMM snd ICCGLMM

Obtaining the observation-level variance by the ‘first’ delta method

How to estimate λ from data

Jensen’s inequality and the ‘second’ delta method

Special considerations for binomial GLMMs

Worked examples: revisting the beetles

Alternatives and a cautonary note

Acknowledgements

Footnotes

References

Citation Manager Formats

Subject Area

Coefficient of determination R² and intra-class correlation coefficient ICC from generalized linear mixed-effects models revisited and expanded

Definitions of R²_GLMM, ICC_GLMM and overdispersion

Extension of R²_GLMM snd ICC_GLMM