A sparse observation model to quantify species interactions in time and space

Sadoune Ait Kaci Azzou; Liam Singer; Thierry Aebischer; Beat Wolf; Daniel Wegmann

doi:10.1101/815027

Summary

Camera traps and acoustic recording devices are essential tools to quantify the distribution, abundance and behavior of mobile species. Varying detection probabilities among device locations must be accounted for when analyzing such data, which is generally done using occupancy models.
We introduce a Bayesian Time-dependent Observation Model for Camera Trap data (Tomcat), suited to estimate relative event densities in space and time. Tomcat allows to learn about the environmental requirements and daily activity patterns of species while accounting for imperfect detection. It further implements a sparse model that deals well will a large number of potentially highly correlated environmental variables.
By integrating both spatial and temporal information, we extend the notation of overlap coefficient between species to time and space to study niche partitioning.
Synthesis and applications. We illustrate the power of Tomcat through an application to camera trap data of eight sympatrically occurring duiker Cephalophinae species in the the savanna - rainforest ecotone in the Central African Republic and show that most species pairs show little overlap. Exceptions are those for which one species is very rare, likely as a result of direct competition.

1. Introduction

Camera traps, acoustic recorders and other devices that allow for continuous recording of animal observations have become an essential part of many wildlife monitoring efforts that aim at quantifying the distribution, abundance and behavior of mobile species. However, the inference of these biological characteristics is not trivial due to the confounding factor of detection, which may vary greatly among recording locations. Hence, variation in the rates at which a species is recorded (e.g. the photographic rate) may indeed reflect differences in local abundance, but might just as well reflect differences in the probabilities with which individuals are detected, or more likely a combination of both (see Burton et al., 2015; Sollmann, 2018, for two excellent reviews).

Since local detection rates are generally not known, both processes have to be inferred jointly. The most often used methods are variants of occupancy models that treat the detection probability explicitly (MacKenzie et al., 2002). The basic quantity of interest in these models is whether or not a particular site is occupied by the focal species. While the detection of a species implies that it is present, the absence of a record does not necessarily imply the species is absent. Since the probabilities of detection and occupation are confounded, they can not be inferred for each site individually. It is therefore common to use hierarchical models that express detection probabilities as a function of environmental variables (MacKenzie et al., 2002).

Here we introduce Tomcat, a Time-dependent Observation Model for Camera Trap data, that extends currently used occupancy models in three important ways:

First, we propose to quantify a measure of relative species density, namely the rate at which animals pass through a specific location, rather than occupancy. Quantifying occupancy assumes there exists a well defined patch or site that is either occupied by a species or not (closure assumption). The notation of a discrete patch is, however, often difficult when analyzing camera trap or similar data of mobile species, which complicates interpretation (Efford and Dawson, 2012; Steenweg et al., 2018). In addition, summarizing such data by a simple presence-absence matrix ignores the information about differences in population densities at occupied sites. Occupancy is therefore not necessarily a good surrogate for abundance (Efford and Dawson, 2012; Steenweg et al., 2018; MacKenzie and Royle, 2005), even though it has been advocated for birds (MacKenzie and Nichols, 2004).

By quantifying relative densities as an alternative measure to occupancy, this limitation can be overcome as we do not need to make strict assumptions about the independence of recording locations. Furthermore, Tomcat quantifies measures of relative species densities while accounting for imperfect detection of unmarked species, which has previously been identified as a key challenge in wildlife surveys (Burton et al., 2015). However, we note that relative densities do not easily allow for an absolute quantification of density because it is not possible to distinguish mobility from abundance. On the other hand, they readily allow for the comparison of densities in space and time and hence to identify habitat important for a particular species. It therefore appears more useful than occupancy to monitor changes in species abundances over time, as for many species, changes in population size will be reflected in the rate at which a species is detected prior to local extinction.

Second, we explicitly model daily activity patterns. Several models have been proposed to estimate such patterns from continuous recording data (Frey et al., 2017), including testing for non-random distributions of observation events in predefined time-bins (Bu et al., 2016) and circular kernel density functions (Oliveira-santos et al., 2013; Rowcliffe et al., 2014), with latter allowing for the quantification of activity overlap between species (Ridout and Linkie, 2009). Jointly inferring activity patterns with relative densities allows us not only account for imperfect detection, but also extent the idea of overlap to space, shedding additional light on species interactions.

Third, we explicitly account for the sparsity among environmental coefficients. This is relevant since many environmental variables are generally available and it is usually not known which ones best explain the variation in abundance of a species. Enforcing sparsity on the vector of coefficients avoids the problem of over-fitting in case the number of recording locations is smaller or on the same order as the number of environmental coefficients.

In this article, we begin by describing the proposed model in great details. We then verify its performance using extensive simulations and finally illustrate this idea by inferring spatio-temporal overlap of eight duiker species Cephalophinae within the forest-savanna ecotone of Central Africa.

2. Material and Methods

2.1. A Time-dependent Observation Model

We present a Bayesian Time-dependent Observation Model for Camera Trap data (Tomcat), suited to estimate relative event densities in space and time. Let us denote by Λ_j(𝒯) the rate at which a device at location j = 1, …, J takes observations of a particular species (or guild) at the time of the day τ ∈ [0, T], T = 24h. We assume that this rate is affected by three processes: 1) the average rate at which individuals pass through location j, 2) the daily activity patterns 𝒯 (τ), and 3) the probability p_j with which an individual passing through location j is detected by the device and the downstream processing of the records: The number of records W_j (d, τ₁, τ₂) taken by a device at location j within the interval [τ₁, τ₂) on day d is then given by the non-homogeneous Poisson process with intensity function As in all models dealing with imperfect detection, the parameters and p_j related to species densities and detection, respectively, are confounded and can not be estimated individually for each location without extra information. However, it is possible to estimate relative densities between locations using a hierarchical model. Following others (e.g. Tobler et al., 2015), we assume that both parameters are functions of covariates (e.g. the environment), and hence only attempt to learn these hierarchical parameters. Here, we use where X_j and Y_j are known (environmental) covariates at location j and a, A and B are species specific coefficients. Note that to avoid non-identifiability issues, we did not include an intercept for p_j, and hence we set the average detection probability across locations (assuming X and Y have mean zero). Also, X_j and Y_j should not contain strongly correlated covariates.

2.1.1. Non-independent events

Another issue specific to data from continuous recordings is that not every record is necessarily reflective of an independent observation as the same individual might trigger multiple observations while passing (or feeding, resting, …) in front of a recording device. It is often difficult and certainly laborious to identify such recurrent events. We thus propose to account for non-independent events by dividing the day into n_o intervals of equal length (o for observation), and then only consider whether or not at least one recording was taken within each interval [c_m−1, c_m), m = 1, …, n_o, where c₀ = c_M = T. Specifically, for an interval m, where Λ_j(c_m−1, c_m) is given by (1).

2.1.2. Daily activity patterns

Here we assume that 𝒯 (τ) is a piece-wise constant function with n_a activity intervals of equal length (a for activity). While activity patterns are unlikely strictly piece-wise constant, we chose this function over a combination of periodic functions (e.g. Oliveira-santos et al., 2013) as they fit to complicated, multi-peaked distributions with fewer parameters.

Since the best tiling of the day is unknown, we allow for a species-specific shift δ such that the first interval is [δ, h_a + δ) and the last overlaps midnight and becomes [T − h_a + δ, δ) (Figure 1). We therefore have where the indicator function is 1 if t ∈ [τ₀, τ₁) and zero otherwise and k_i reflects the relative activity of the focal species (or guild) in interval i = 1, …, n_h with k_i = 0 implying no activity and k_i = 1 implying average activity. Note that and hence

Figure 1:

The solid line plot represents the piecewise-constant function 𝒯 with h_a = 3 hours, and the dashed line plot represents 𝒯 shifted with δ = 1 hour.

2.1.3. Bayesian inference

We conduct Bayesian inference on the parameter vector θ = {a, A, B, δ, k}, where , by numerically evaluating the posterior distribution ℙ(θ|W) ∝ ℙ(W |θ)ℙ(θ), where W = {W₁, …, W_J} denotes the full data from all locations j = 1, …, J.

The likelihood ℙ(W |θ) is calculated as where ℙ(W_j(d, c_m−1, c_m)|θ) is given by equation (4) and D_j1 and D_j2 denote the first and last day of recording at location j.

Since it is usually not known which covariates X_j and Y_j are informative, nor at which spatial scale they should be evaluated, the potential number of covariates to be considered may be large. To render inference feasible, we enforce sparsity on the vectors of coefficients A and B. Specifically, we assume that ℙ (A_i ≠ 0) = π_λ and, correspondingly, ℙ (B_i 0) ≠ π_p.

We chose uniform priors on all other parameters, namely ℙ(a) ∝ 1, P(k) ∝ 1 for all vectors of k that satisfy , and ℙ (δ) ∝ 1 for all 0 ≤ δ < h_a. For simplicity, we only consider cases in which h_a, the length of the activity intervals, is a multiple of h_o, the length of the observation intervals, and allow only for discrete δ ∈ {0, …, h_a/h_o}. Finally, we set π_p = π_λ = 0.1.

We use a reversible-jump MCMC algorithm (Green, 1995) to generate samples from the posterior distribution ℙ(θ|W). The update k → k′ is noteworthy. We begin by picking a random activity interval i and proposing a move according to a symmetric transition kernel. We then scale all other entries of k′ to satisfy the constraint on the sum. Specifically, we set for all j ≠ i, where

2.1.4. Prediction

Using a set of S posterior samples θ₁, …, θ_S ∼ ℙ (θ|W), we project event densities to a not-surveyed location ι with covariates X_ι by calculating the mean of the posterior as where a⁽ⁱ⁾ and A⁽ⁱ⁾ denote the i-th posterior sample of these parameters.

2.1.5. Species overlap in space and time

An important interest in ecology is to compare activity patterns among species and to see how overlapping patterns may relate to competition or predation (e.g. Ridout and Linkie, 2009; Rowcliffe et al., 2014).

We can quantify overlapping patterns of animal activity by estimating the coefficient of overlap Δ (Ridout and Linkie, 2009). This quantitative measure ranges from 0 (no overlap) to 1 (identical activity patterns) and is the area lying under two activity density curves (see Figure 4). For two known density functions f (x) and g(x), Δ is given by:

Figure 2:

Distribution of bias in the estimated overlap coefficients and for different sample sizes.

Figure 3:

Habitat preference of the three duiker species C. dorsalis, C. weynsi and S. grimmia. Left: Distribution of closed canopy forest (CCF, top, green) and open savanna woodland (OSW, bottom, yellow) across the study region with the ACC borders and camera trap locations (black dots). Top right: Relative densities d_sj of the three duikers predicted at 2,639 grid points. For each species the colors indicates d_sj = log₁₀ , where median is the median value over all the grid points j. Red shades indicate d_sj > 0, blue shades d_sj < 0. Bottom right: Posterior inclusion probabilities for the CCF (green) and OSW (yellow) habitat variables for each buffer. Values above the dashed line indicate the posterior probability that the habitat correlates positively with the relative species density, values below the dashed line imply a negative correlation.

Figure 4:

Interaction in space and time between the duiker species C. dorsalis, C. weynsi, and S. grimma. Top row: interactions in space quantified as log₁₀ between species 1 and 2. Bottom: posterior mean (solid line) and 90% credible intervals (shades) of temporal activity patterns. The area shaded in gray represents the overlap coefficient Δ_T.

The overlap measure Δ(f, g) can be related to the well known measure of distance between two densities L₁ as which justifies the visualization of overlap coefficients between k species in a n-dimensional space using a Multidimensional Scaling (MDS) by considering as a measure of dissimilarity.

In practice, the true density functions f (x) and g(x) are usually not known. Here we obtain an estimate of Δ numerically from posterior samples. We distinguish three types of overlap coefficients, Δ_T for overlap in time, Δ_S for overlap in space and Δ_ST for overlap in time and space.

Overlap coefficient Δ_T. For a large number n_T of equally spaced time values , we sample from the posterior distribution ℙ (Δ_T |W ⁽¹⁾, W ⁽²⁾) where denotes the full data for a species l = 1, 2. where is computed according to equation (5) with species specific parameters and sampled from ℙ (θ_l|W ^(l)).

Overlap coefficient Δ_S. For a given number n_S of sites reflecting the habitat in a region, we sample from the posterior distribution ℙ (Δ_S|W ⁽¹⁾, W ⁽²⁾) as where is computed according to equation (2) and normalized such as with species specific parameters and sampled from ℙ (θ_l|W ^(l)).

Overlap coefficient Δ_ST. For n_T time values and n_S number of sites, we sample from the posterior distribution P(Δ_ST |W ⁽¹⁾, W ⁽²⁾) as where for species l = 1, 2 we calculate according to equation (5) and according to equation (2) with species specific parameters , and sampled from ℙ (θ_l|W ^(l)), but normalized such that

2.1.6. Implementation

All methods were implemented in the C++ program Tomcat, available through a git repository at https://bitbucket.org/WegmannLab/tomcat/.

2.2. Simulations

We assessed the performance of our algorithm using 100 replicates for each combination of J = 20 or 100 device locations and D = 1, 2, 5, 10, 20, 50 or 100 days at which data was collected. All simulations were conduced with one-dimensional X_j ∼ N (0, 1) and Y_j ∼ N (0, 1) and parameter choices such that on average one observation per species per location was expected, and hence the expected number of observations per species was JD.

To evaluate the accuracy of our estimates we then estimated the overlap between two species, since errors in parameter estimates directly translate into biases in overlap coefficients. We thus simulated data for two species with little overlap (Δ_T = 0.2), moderate overlap (Δ_T = 0.5) or large overlap in time (Δ_T = 0.8) as described in Table 1 as well as for two species with varying overlap in space (Δ_S = 0.2, 0.5, and 0.8) and with varying overlap in space and time (Δ_ST = 0.2, 0.5, and 0.8) as described in the Supporting Information.

View this table:

Table 1:

Values of k used for the simulation of the daily activity patterns 𝒯 (τ).

2.3. Application to Central African duikers

We applied Tomcat to camera trapping data obtained during the dry seasons from 2012 to 2018 from a region in the eastern Central African Republic (CAR), a wilderness exceeding 100,000 km² without permanent settlements, agriculture or commercial logging (Aebischer et al., 2017). The available data was from 1,059 camera traps set at 532 distinct locations that cover the Aire de Conservation de Chinko (ACC), a protected area of about 20,000 km². For more information about camera deployment and sampling design, see Aebischer et al. (2017). Here, we focus on duikers Cephalophinae, which are a diverse mammalian group common in the data set and for which near-perfect manual annotation was available.

To infer habitat preferences for these species, we benefited from an existing land cover classification at a 30m resolution that represents the five major habitat types of the Chinko region: moist closed canopy forest (CCF), open savanna woodland (OSW), dry lakéré grassland (DLG), wet marshy grassland (WMG) and surface water (SWA) (Aebischer et al., 2017). Around every camera trap location and at 10,200 regular grid points spaced 2.5 km apart and spanning the entire ACC, we calculated the percentage of each of these habitats in 11 buffers of sizes 30; 65; 125; 180; 400; 565; 1260; 1785; 3,990; 5,640 and 17,840 meters. We complemented this information with the average value within every buffer for each of 15 additional environmental and bioclimatic variables from the WorldClim database version 2 (Table S.1 Fick and Hijmans, 2017) that we obtained at a resolution of 30 seconds, which translates into a spatial resolution of roughly 1km² per grid cell. To aid in the interpretation, we then processed our environmental data by 1) keeping only the additional effect of each variables after regressing out the habitat variables CCF and OSW at the same buffer, and by 2) keeping only the additional effect of every variable after regressing out the information contained in the same variable but at smaller buffers (see Supporting Information for details).

To avoid extrapolation, we restricted our analyses to 2,639 grid locations that exhibited similar environments to those at which camera traps were placed as measured by the Mahanalobis distance between each grid point and the average across all camera trap locations (see Supporting Information for details). For each location, we further used the binary classification of the four most common habitat types (CCF, OSW, MWG, DLG) and determined the presence or absence of six additional habitat characteristics: animal path, road, salt lick, mud hole, riverine zone and bonanza.

3. Results

3.1. Performance against simulations

We first assessed the performance of our algorithm by inferring overlap coefficients between two species from simulated data. We chose to focus on overlap coefficients as they are directly affected by any error or bias in parameter estimates. As shown in Figure 2, the posterior means and were unbiased and highly accurate for all overlap coefficients regardless of the true values, if sufficient data is provided, i.e. if at least several hundred pictures were available (J × D ≥ 500). If less data was available, estimates were biased towards the prior expectations, which are 0.5 and 1.0 for Δ_T and Δ_S, respectively.

4. Application to Central African duikers

We used Tomcat to study the spatio-temporal distribution and overlap of duikers Cephalophinae in the eastern Central African Republic (CAR). We benefited from existing camera trapping data (Aebischer et al., 2017), collected between 2012 and 2018 at 532 distinct locations in the Aire de Conservation de Chinko (ACC), a protected area of about 20,000 km² that was established in 2014 by the government of the CAR in former hunting zones.

The ACC consists of an ecotone of tropical moist closed canopy forests of the northeastern Congolian lowland rain forest biome and a Sudanian-Guinean woodland savanna that is interspersed with small patches of edaphic grasslands on rocky ground or swampy areas (Boulvert, 1985; Olson and Dinerstein, 1998).

Duikers are common in the data set and often observed in sympatry, i.e. several species were captured by the same camera trap within a few hours. We detected a total of eight species in the data set (Table 2): Eastern Bay Duiker Cephalophus dorsalis castaneus, Uele White Bellied Duiker Cephalophus leucogaster arrhenii, Black Fronted Duiker Cephalophus nigrifrons, Red Flanked Duiker Cephalophus rufilatus, Western Yellow Backed Duiker Cephalophus silvicultor castaneus, Weyns Duiker Cephalophus weynsi, Eastern Blue Duiker Philantomba monticola aequatorialis and Bush Duiker Sylvicapra grimmia.

View this table:

Table 2:

Available data on the eight detected species of duikers.

The eight duiker species varied greatly in their habitat preferences (Figures 3, S.2) as inferred by Tomcat. As shown in Figure 3, C. dorsalis and C. weynsi both have a strong preference for CCF over OSW habitat at the smallest buffers, in contrast to S. grimmia that shows a strong preference for OSW. At higher buffers the signal is less clear, probably owing to the heterogeneous nature of the habitat in which both CCF and OSW correlated negatively with WMG and DLG, habitats not well suited for any of these species. Interestingly, P. monticola and C. silvicultor seem to be true ecotone species preferring a mixture of the canonical habitats CCF and OSW (Figure S.2). Our analysis further suggests that the two rarely studied species C. weynsi and C. leucogaster not only occur in large blocks of CCF as suggested in the literature (Kingdon et al., 2013), but also in narrow gallery forests within the forest-savanna ecotone several kilometers away from the next extensive forest block (Figures 3 and S.2).

As shown in Figure 4 and S.1, the species also varied greatly in their daily activity patterns, with some being almost exclusively nocturnal (C. dorsalis and C. silvicultor), some almost exclusively diurnal (C. leucogaster, P. monticola, C. nigrifons, C. rufilatus, C. weynsi) and one crepuscular (S. grimmia).

To better understand how these closely related duiker species of similar size and nutritional needs can occur sympatrically, we estimated pairwise overlap coefficients in space and time (Figure 4, Table S.2). Not surprisingly, most species pairs differed substantially either in their habitat preferences or daily activity patterns. Of the two forest dwellers C. dorsalis and C. weynsi for instance, one is almost exclusively nocturnal and the other almost exclusively diurnal , resulting in a small overlap in space and time . Similarly, the nocturnal C. dorsalis and the crepuscular S. grimmia that share a lot of temporal overlap use highly dissimilar habitats , resulting in a very small overlap in time and space .

A visualization using Multidimensional Scaling (MDS) of the pair-wise overlap coefficients of all six species with observations from at least 50 independent camera trap locations is shown in Figure 5. For these species, 84.6% of variation in the temporal overlap can be explained by a single axis separating nocturnal from diurnal species. In contrast, only 44.5% of the variation in the spatial overlap is explained by the first axis distinguishing forest dwellers from savanna species. When using both temporal and spatial information, the two first axis explain 32.3% and 25.5%, respectively, suggesting that a single axis is not sufficient to explain both temporal and spatial differences between species.

Figure 5:

Illustration of the overlap coefficients in time and space between six duiker species visualized in two dimensions using the multidimensional scaling.

A striking observation is that frequently observed and therefore evidently abundant species within a certain community tend to differ in their habitat preference and/or daily activity. In contrast, infrequently observed and therefore putative rare taxa seem to have large overlap with co-occurring species. C. leucogaster, for instance, which is rather rare and was only observed at 11 distinct locations (Table 2), has similar habitat preferences and is active at the same time (Table S.2) as C. weynsi, which is among the most common forest duikers within the ACC. In contrast, C. dorsalis, which is strictly nocturnal, seems to co-exist with the C. weynsi at higher densities (Figure 4, Table S.2).

Conclusion

Despite world-wide efforts, there are still major areas for which almost no information on biodiversity is available (e.g. Hickisch et al., 2019). But technological advances, in particular camera traps and acoustic recorders, make it possible to obtain a first glimpse on the presence and distribution of larger or highly vocal animals like mammals and birds in relatively short time and with a reasonable budget. Thanks to increased battery life, larger media to store data and other technical advances, such data sets can now be produced with comparatively little manual labor, even under the demanding conditions of large and remote areas. In addition, the annotation of these data sets on the species level is now aided by machine learning algorithms that automatize the detection of common species and recordings without observations (e.g. Norouzzadeh et al., 2018). Thanks to these developments, existing knowledge gaps may now be increasingly addressed, allowing for a re-evaluation of conservation strategies and optimization of conservation management.

Here we introduce Tomcat, an model that infers habitat preferences and daily activities from spatio-temporal observations and hence allows to learn about the ecological requirements of animals, including rare, elusive and unmarked species. Unlike many previous methods that estimate the presence or absence of a species, Tomcat estimates relative species densities, from which overlap coefficients between species can be estimated in space and time while accounting for variation in detection probabilities between locations. While estimates of overlap coefficients require larger data sets than the inference of pure occupancy, we believe they constitute a major step forward in understanding the complex species interactions in an area, which is particularly relevant for conservation planning in heterogeneous or fragmented habitats.

Author contributions

TA and DW conceived the idea; SA, BW and DW developed and implemented the method; TA collected the data; SA, LS, TA analyzed the data; SA, LS and DW led the writing of the manuscript. All authors contributed critically to the drafts and gave final approval for publication.

Supporting Information

S1. Simulating data with specific overlap coefficients

Simulating data with specific Δ_S

For the simulation of scenarios with a given Δ_S, we have simulated for each site j, an environmental variable X_j from N (0, 1). Δ_S will be estimated with , where where in presence of n sites. For a species i and site j, is given by equation (2), with the constraint : We have for species 1 : Equivalently, we have for species 2, . Therefore, We have We have The integral in equation (S.15) is given by: where Φ represents the cumulative distribution function (CDF) of the standard normal distribution.

Finally, we have It is possible using equation (S.16) to simulate a model for which Δ_S = a.

For example, for Δ_S = δ_s, and for A₁ = a₁, it is possible to get the value of A₂ from (S.16), which gives After computing A₂, it is easy to deduce the value of µ₂ using equation (S.13).

Simulating data with specific Δ_ST

To simulate scenarios for a given Δ_ST, we have simulated for each site j an environmental variable X_j from N (0, 1), and T ∼ U_[0,24]. Δ_ST is estimated using equation (11).

For a species i and site j, we have the constraint where 𝒯_i(τ) is the daily activity for a species i, i = 1, 2 observed at time t.

We have and which gives We can therefore simulate the scenario Δ_ST = δ_ST with known activity patterns 𝒯₁, 𝒯₂ respectively for species 1 and 2 as follows:

Simulate n_s X_j ∼ N (0, 1), j = 1, 2, … n_s.
Propose a value of A₁. (Should not be very large. Namely between -2 and 2).
Compute the value of µ₁ using equations (S.18).
Compute numerically the value of A₂ by solving the equation :
Using Tomcat for the two species, and by fixing the value A_i, i = 1, 2, we can sample from the posterior distribution of Δ_ST and compute the posterior mean of the values Δ_ST using equation (11).

S2. Decorrelation environmental variables

While Tomcat readily handles correlated environmental variables, we chose to decorrelate specific variables to aid in interpretation. Specifically, we processed our environmental data as follow (two steps):

Step 1: A major interest in our application was to study the impact of the prevalence of closed canopy forest (f) and savanna (s) habitat on species densities. For each scale (buffer) b, we therefore regress each environmental variable V_ib, i ≠f,s where V_fb and V_sb represents, respectively, the forest and the Savannah habitat for a buffer b = 1, …, B, i = 1, …, n_env, and ϵ_ib is the error from the linear model described by equation (S.20), which captures the information of the variable V_ib independent of V_fb and V_sb at buffer b. We therefore replace the V_ib variables by in our model, but kept and .

Step 2: To evaluate relevant spatial scale of environmental variables, we also regressed out the larger buffers from the smaller one as: In the second step, and for a given environmental variable at the scale, we give priority to the smaller scales by keeping only the additional explanation by the studied variable to what we already know by replacing by .

S3. Restricting analysis to environmentally homogeneous regions

Let be a matrix where the rows represent the observed camera traps, and the columns the environmental variables, and let a matrix containing the environmental variables for the locations for which we want to predict , s = 1, 2, …, m.

We define the Mahanalobis distance D_O of given by where , and S is the variance covariance matrix.

We define a second Mahalanobis distance D_P which measures the distance of from the the mean of the camera traps environmental variables. D_P is given by After computing D_O and D_P, we decided to remove the grid points for which we have D_P > 1.2 × max(D_O).

S4. Supplementary figures

Figure S.1:

Estimation of the daily activity of the eight observed duiker species. The solid line represents the posterior mean and the shades, 90% credible intervals of the temporal activity patterns.

Figure S.2:

Relative densities d_sj of the eight duiker species predicted at 2,639 grid points. For each species the colors indicates d_sj = log₁₀ , where median is the median value over all the grid points j. Red shades indicate d_sj > 0, blue shades d_sj < 0. Posterior inclusion probabilities for the CCF (green) and OSW (yellow) habitat variables for each buffer. Values above the dashed line indicate the posterior probability that the habitat correlates positively with the relative species density, values below the dashed line imply a negative correlation.

S5. Supplementary tables

View this table:

Table S.1:

Environmental and bioclimatic variables obtained from the WorldClim database version 2.

View this table:

Table S.2:

Summary of the overlap coefficients between the six duiker species.

Acknowledgments

This work was supported by a Swiss National Science Foundation grant (31003A 173062) to DW.

References

↵
Aebischer, T., Siguindo, G., Rochat, E., 2017. First quantitative survey delineates the distribution of chimpanzees in the eastern central african republic. Biological Conservation 213, 84–94.
OpenUrl
↵
Boulvert, Y., 1985. Carte phytogéographique de la république centrafricaine. ORSTOM (Office de la recherche scientifique et technique Outre-Mer).
↵
Bu, H., Wang, F., Mcshea, W.J., Lu, Z., Wang, D., Li, S., 2016. Spatial Co-Occurrence and Activity Patterns of Mesocarnivores in the Temperate Forests of Southwest China. PLOS ONE, 1–15 doi: 10.1371/journal.pone.0164271.
OpenUrl CrossRef
↵
Burton, A.C., Neilson, E., Moreira, D., Ladle, A., Steenweg, R., Fisher, J.T., Bayne, E., Boutin, S., 2015. Review: Wildlife camera trapping: a review and recommendations for linking surveys to ecological processes. Journal of Applied Ecology 52, 675–685. doi: 10.1111/1365-2664.12432.
OpenUrl CrossRef
↵
Efford, M.G., Dawson, D.K., 2012. Occupancy in continuous habitat.Ecosphere 3, art32. doi: 10.1890/ES11-00308.1.
OpenUrl CrossRef
↵
Fick, S.E., Hijmans, R.J., 2017. Worldclim 2: new 1-km spatial resolution climate surfaces for global land areas. International Journal of Climatology 37, 4302–4315. doi: 10.1002/joc.5086.
OpenUrl CrossRef PubMed
↵
Frey, S., Fisher, J.T., Burton, A.C., Volpe, J.P., 2017. Investigating animal activity patterns and temporal niche partitioning using camera-trap data: challenges and opportunities. Remote Sensing in Ecology and Conservation 3, 123–132. doi: 10.1002/rse2.60.
OpenUrl CrossRef
↵
Green, P., 1995. Reversible jump markov chain monte carlo computation and bayesian model determination. Biometrika 82, 711–732.
OpenUrl CrossRef Web of Science
↵
Hickisch, R., Hodgetts, T., Johnson, P.J., Sillero-Zubiri, C., Tockner, K., Macdonald, D.W., 2019. Effects of publication bias on conservation planning. Conservation Biology 33, 1151–1163. doi: 10.1111/cobi.13326.
OpenUrl CrossRef
↵
Kingdon, J., Happold, D., Butynski, T., Hoffmann, M., Happold, M., Kalina, J., 2013. Mammals of Africa. v. 1–6, Bloomsbury Publishing.
↵
MacKenzie, D.I., Nichols, J.D., 2004. Occupancy as a surrogate for abundance estimation. Animal biodiversity and conservation 27, 461–467.
OpenUrl
↵
MacKenzie, D.I., Nichols, J.D., Lachman, G.B., Droege, S., Andrew Royle, J., Langtimm, C.A., 2002. Estimating site occupancy rates when detection probabilities are less than one. Ecology 83, 2248–2255. doi: 10.1890/0012-9658(2002)083[2248:ESORWD]2.0.CO;2.
OpenUrl CrossRef Web of Science
↵
MacKenzie, D.I., Royle, J.A., 2005. Designing occupancy studies: general advice and allocating survey effort. Journal of Applied Ecology 42, 1105–1114. doi: 10.1111/j.1365-2664.2005.01098.x.
OpenUrl CrossRef Web of Science
↵
Norouzzadeh, M.S., Nguyen, A., Kosmala, M., Swanson, A., Palmer, M.S., Packer, C., Clune, J., 2018. Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning. Proceedings of the National Academy of Sciences 115, E5716–E5725. doi: 10.1073/pnas.1719367115.
OpenUrl Abstract/FREE Full Text
↵
Oliveira-santos, L.G.R., Zucco, C.A., Agostinelli, C., 2013. Using conditional circular kernel density functions to test hypotheses on animal circadian activity. Animal Behaviour 85, 269–280. URL:http://dx.doi.org/10.1016/j.anbehav.2012.09.033, doi: 10.1016/j.anbehav.2012.09.033.
OpenUrl CrossRef
↵
Olson, D.M., Dinerstein, E., 1998. The global 200: A representation approach to conserving the earth’s most biologically valuable ecoregions. Conserv. Biol. 12, 502–515.
OpenUrl CrossRef Web of Science
↵
Ridout, M.S., Linkie, M., 2009. Estimating overlap of daily activity patterns from camera trap data. Journal of Agricultural, Biological, and Environmental Statistics 14, 322–337. URL:https://doi.org/10.1198/jabes.2009.08038, doi: 10.1198/jabes.2009.08038.
OpenUrl CrossRef
↵
Rowcliffe, J.M., Kays, R., Kranstauber, B., Carbone, C., Jansen, P.A., 2014. Quantifying levels of animal activity using camera trap data. Methods in Ecology and Evolution 5, 1170–1179.
OpenUrl
↵
Sollmann, R., 2018. A gentle introduction to camera-trap data analysis. African Journal of Ecology 56, 740–749. doi: 10.1111/aje.12557.
OpenUrl CrossRef
↵
Steenweg, R., Hebblewhite, M., Whittington, J., Lukacs, P., McKelvey, K., 2018. Sampling scales define occupancy and underlying occupancy–abundance relationships in animals. Ecology 99, 172–183. doi: 10.1002/ecy.2054.
OpenUrl CrossRef
↵
Tobler, M.W., Hartley, A., Carrillo-Percastegui, S.E., Powell, G.V.N., 2015. Spatiotemporal hierarchical modelling of species richness and occupancy using camera trap data. Journal of Applied Ecology 43, 413–421.
OpenUrl

View the discussion thread.

Posted July 21, 2020.

Download PDF

Citation Tools

Subject Area

Ecology

Subject Areas

All Articles

Animal Behavior and Cognition (5210)
Biochemistry (11739)
Bioengineering (8750)
Bioinformatics (29189)
Biophysics (14967)
Cancer Biology (12093)
Cell Biology (17409)
Clinical Trials (138)
Developmental Biology (9419)
Ecology (14178)
Epidemiology (2067)
Evolutionary Biology (18301)
Genetics (12238)
Genomics (16797)
Immunology (11865)
Microbiology (28068)
Molecular Biology (11583)
Neuroscience (60953)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4957)
Plant Biology (10425)
Scientific Communication and Education (1683)
Synthetic Biology (2884)
Systems Biology (7338)
Zoology (1651)

[1] ↵
Aebischer, T., Siguindo, G., Rochat, E., 2017. First quantitative survey delineates the distribution of chimpanzees in the eastern central african republic. Biological Conservation 213, 84–94.
OpenUrl

[2] ↵
Boulvert, Y., 1985. Carte phytogéographique de la république centrafricaine. ORSTOM (Office de la recherche scientifique et technique Outre-Mer).

[3] ↵
Bu, H., Wang, F., Mcshea, W.J., Lu, Z., Wang, D., Li, S., 2016. Spatial Co-Occurrence and Activity Patterns of Mesocarnivores in the Temperate Forests of Southwest China. PLOS ONE, 1–15 doi: 10.1371/journal.pone.0164271.
OpenUrl CrossRef

[4] ↵
Burton, A.C., Neilson, E., Moreira, D., Ladle, A., Steenweg, R., Fisher, J.T., Bayne, E., Boutin, S., 2015. Review: Wildlife camera trapping: a review and recommendations for linking surveys to ecological processes. Journal of Applied Ecology 52, 675–685. doi: 10.1111/1365-2664.12432.
OpenUrl CrossRef

[5] ↵
Efford, M.G., Dawson, D.K., 2012. Occupancy in continuous habitat.Ecosphere 3, art32. doi: 10.1890/ES11-00308.1.
OpenUrl CrossRef

[6] ↵
Fick, S.E., Hijmans, R.J., 2017. Worldclim 2: new 1-km spatial resolution climate surfaces for global land areas. International Journal of Climatology 37, 4302–4315. doi: 10.1002/joc.5086.
OpenUrl CrossRef PubMed

[7] ↵
Frey, S., Fisher, J.T., Burton, A.C., Volpe, J.P., 2017. Investigating animal activity patterns and temporal niche partitioning using camera-trap data: challenges and opportunities. Remote Sensing in Ecology and Conservation 3, 123–132. doi: 10.1002/rse2.60.
OpenUrl CrossRef

[8] ↵
Green, P., 1995. Reversible jump markov chain monte carlo computation and bayesian model determination. Biometrika 82, 711–732.
OpenUrl CrossRef Web of Science

[9] ↵
Hickisch, R., Hodgetts, T., Johnson, P.J., Sillero-Zubiri, C., Tockner, K., Macdonald, D.W., 2019. Effects of publication bias on conservation planning. Conservation Biology 33, 1151–1163. doi: 10.1111/cobi.13326.
OpenUrl CrossRef

[10] ↵
Kingdon, J., Happold, D., Butynski, T., Hoffmann, M., Happold, M., Kalina, J., 2013. Mammals of Africa. v. 1–6, Bloomsbury Publishing.

[11] ↵
MacKenzie, D.I., Nichols, J.D., 2004. Occupancy as a surrogate for abundance estimation. Animal biodiversity and conservation 27, 461–467.
OpenUrl

[12] ↵
MacKenzie, D.I., Nichols, J.D., Lachman, G.B., Droege, S., Andrew Royle, J., Langtimm, C.A., 2002. Estimating site occupancy rates when detection probabilities are less than one. Ecology 83, 2248–2255. doi: 10.1890/0012-9658(2002)083[2248:ESORWD]2.0.CO;2.
OpenUrl CrossRef Web of Science

[13] ↵
MacKenzie, D.I., Royle, J.A., 2005. Designing occupancy studies: general advice and allocating survey effort. Journal of Applied Ecology 42, 1105–1114. doi: 10.1111/j.1365-2664.2005.01098.x.
OpenUrl CrossRef Web of Science

[14] ↵
Norouzzadeh, M.S., Nguyen, A., Kosmala, M., Swanson, A., Palmer, M.S., Packer, C., Clune, J., 2018. Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning. Proceedings of the National Academy of Sciences 115, E5716–E5725. doi: 10.1073/pnas.1719367115.
OpenUrl Abstract/FREE Full Text

[15] ↵
Oliveira-santos, L.G.R., Zucco, C.A., Agostinelli, C., 2013. Using conditional circular kernel density functions to test hypotheses on animal circadian activity. Animal Behaviour 85, 269–280. URL:http://dx.doi.org/10.1016/j.anbehav.2012.09.033, doi: 10.1016/j.anbehav.2012.09.033.
OpenUrl CrossRef

[16] ↵
Olson, D.M., Dinerstein, E., 1998. The global 200: A representation approach to conserving the earth’s most biologically valuable ecoregions. Conserv. Biol. 12, 502–515.
OpenUrl CrossRef Web of Science

[17] ↵
Ridout, M.S., Linkie, M., 2009. Estimating overlap of daily activity patterns from camera trap data. Journal of Agricultural, Biological, and Environmental Statistics 14, 322–337. URL:https://doi.org/10.1198/jabes.2009.08038, doi: 10.1198/jabes.2009.08038.
OpenUrl CrossRef

[18] ↵
Rowcliffe, J.M., Kays, R., Kranstauber, B., Carbone, C., Jansen, P.A., 2014. Quantifying levels of animal activity using camera trap data. Methods in Ecology and Evolution 5, 1170–1179.
OpenUrl

[19] ↵
Sollmann, R., 2018. A gentle introduction to camera-trap data analysis. African Journal of Ecology 56, 740–749. doi: 10.1111/aje.12557.
OpenUrl CrossRef

[20] ↵
Steenweg, R., Hebblewhite, M., Whittington, J., Lukacs, P., McKelvey, K., 2018. Sampling scales define occupancy and underlying occupancy–abundance relationships in animals. Ecology 99, 172–183. doi: 10.1002/ecy.2054.
OpenUrl CrossRef

[21] ↵
Tobler, M.W., Hartley, A., Carrillo-Percastegui, S.E., Powell, G.V.N., 2015. Spatiotemporal hierarchical modelling of species richness and occupancy using camera trap data. Journal of Applied Ecology 43, 413–421.
OpenUrl

A sparse observation model to quantify species interactions in time and space

Summary

1. Introduction

2. Material and Methods

2.1. A Time-dependent Observation Model

2.1.1. Non-independent events

2.1.2. Daily activity patterns

2.1.3. Bayesian inference

2.1.4. Prediction

2.1.5. Species overlap in space and time

2.1.6. Implementation

2.2. Simulations

2.3. Application to Central African duikers

3. Results

3.1. Performance against simulations

4. Application to Central African duikers

Conclusion

Author contributions

Supporting Information

S1. Simulating data with specific overlap coefficients

Simulating data with specific ΔS

Simulating data with specific ΔST

S2. Decorrelation environmental variables

S3. Restricting analysis to environmentally homogeneous regions

S4. Supplementary figures

S5. Supplementary tables

Acknowledgments

References

Citation Manager Formats

Subject Area

Simulating data with specific Δ_S

Simulating data with specific Δ_ST