Abstract
Between October 2013 and April 2014, more than 30,000 cases of Zika virus (ZIKV) disease were reported in French Polynesia. ZIKV has also been reported in Africa and Asia, and in 2015 the virus spread to South America and the Caribbean. Infection with ZIKV has been associated with neurological complications including Guillain-Barré Syndrome (GBS) and microcephaly, which led the World Health Organization declare a Public Health Emergency of International Concern in February 2015. To better understand the transmission dynamics of ZIKV, we used a mathematical model to examine the 2013–14 outbreak on the six major archipelagos of French Polynesia. Our median estimates for the basic reproduction number ranged from 1.9–3.1, with an estimated 11.2% (95% CI: 10.1-12.9%) of total infections reported. As a result, we estimated that 86% (95% CI: 75-93%) of the total population of the six archipelagos were infected during the outbreak. There were 42 GBS cases reported during the ZIKV outbreak in French Polynesia, but the presence of a large number of unreported ZIKV infections could have implications for the design of case-control studies to further investigate a possible association between the two. Based on the demography of French Polynesia, our results also imply that if ZIKV infection provides complete protection against future infection, it would take 15–20 years before there are a sufficient number of susceptible individuals for ZIKV to re-emerge, which is on the same timescale as the circulation of dengue virus serotypes in the region. Our analysis suggests that ZIKV may exhibit similar dynamics to dengue virus in island populations, with transmission characterised by large, sporadic outbreaks with a high proportion of asymptomatic or unreported cases.
Introduction
Originally identified in Africa [1], the first large reported outbreak of Zika virus (ZIKV) disease occurred in Yap during April-July 2007 [2], followed by an outbreak in French Polynesia between October 2013 and April 2014 [3], and cases in other Pacific countries [4, 5]. During 2015, local transmission was also reported in South American countries, including Brazil [6, 7] and Colombia [8].
Transmission of ZIKV is predominately vector-borne, but can also occur via sexual contact and blood transfusions [9]. The virus is spread by the Aedes species of mosquito [10], which is also the vector for dengue virus (DENV), ZIKV is therefore likely to be capable of sustained transmission in other tropical areas [11]. As well as causing symptoms such as fever and rash, ZIKV infection has also been linked to increased incidence of neurological sequelae, including Guillain-Barré Syndrome (GBS) [12, 13] and microcephaly in infants born to mothers who were infected with ZIKV during pregnancy [14]. On 1st February 2015, the World Health Organization declared a Public Health Emergency of International Concern in response to the clusters of microcephaly and other neurological disorders reported in Brazil, possibly linked to the recent rise in ZIKV incidence. The same phenomena were observed in French Polynesia, with 42 GBS cases [13] reported during the outbreak. In addition to the GBS cluster, there were 18 fetal or newborn cases with unusual and severe neurological features reported between March 2014 and May 2015 in French Polynesia, including 10 cases with microcephaly and severe brain lesions, and 8 normocephalic cases with severe anatomical or functional neurological abnormalities [15].
Given the potential for ZIKV to spread globally, it is crucial to characterise the transmission dynamics of the infection. This includes estimates of key epidemiological parameters, such as the basic reproduction number, R0 (defined as the average number of secondary cases generated by a typical infectious individual in a fully susceptible population), and of how many individuals (including both symptomatic and asymptomatic) are typically infected during an outbreak. Such estimates could help assist with outbreak planning, assessment of potential countermeasures, and the design of studies to investigate putative associations between ZIKV infection and other conditions.
Islands can be useful case studies for outbreak analysis. Small, centralised populations are less likely to sustain endemic transmission than a large, heterogeneous population [16], which means outbreaks are typically self-limiting after introduction from external sources [17]. Further, if individuals are immunologically naive to a particular pathogen, it is not necessary to consider potential effect of pre-existing immunity on transmission dynamics [18]. Using a mathematical model of vector-borne infection, we examined the transmission dynamics of ZIKV on six archipelagos in French Polynesia during the 2013-14 outbreak. We inferred the basic reproduction number, and the overall size of the outbreak, and hence how many individuals would still be susceptible to infection in coming years.
Methods
Data
We used weekly reported numbers of suspected ZIKV infections from the six main regions of French Polynesia between 11th October 2013 and 28th March 2014 (Table 1), taken from Centre d’hygiène et de salubrité publique situation reports [19, 20]. Confirmed and suspected cases were reported from sentinel surveillance sites across the country; the number of sentinel sites varied in number from 27-55 during the outbreak. In total, 8,744 suspected cases were reported from the sentinel sites. As there were 162 healthcare sites across all six regions, it has been estimated that around 30,000 suspected cases attended health facilities in total [19]. For each region, we calculated the proportion of total sites that acted as sentinels, to allow us to adjust for variation in reporting over time in the analysis. Population size data were taken from the 2012 French Polynesia Census [21]. In our analysis, the first week with at least one reported case was used as the first observation date.
Mathematical model
We used a compartmental mathematical model to simulate vector-borne transmission [22]. Both people and mosquitoes were modelled using a susceptible-exposed-infectious-removed (SEIR) framework. This model incorporated delays as a result of the intrinsic (human) and extrinsic (vector) latent periods (Figure 1). Since there is evidence that asymptomatic DENV-infected individuals are capable of transmitting DENV to mosquitoes [23], we assumed the same for ZIKV: all people in the model transmitted the same, regardless of whether they displayed symptoms or were reported as cases.
SH represents the number of susceptible people, EH the number of latent people, IH the number of infectious people, RH the number recovered people that have recovered. Similarly, SV represents the proportion of mosquitoes currently susceptible, EV the proportion in their latent period, and IV the proportion of mosquitoes infectious. (βV is the transmission rate from humans to mosquitoes; 1/αH and 1/αV are the mean latent periods for humans and mosquitoes respectively; 1/γ is the mean infectious period for humans; 1/μ is the mean lifespan of mosquitoes; and N is the human the population size.
The main vectors for ZIKV in French Polynesia are thought to be Ae. aegypti and Ae. polynesiensis[12]. In the southern islands, the extrinsic incubation period for Ae. polynesiensis is longer during the cooler period from May to September [24], which may act to reduce transmission. However, climate data from French Polynesia [25] indicated that the ZIKV outbreaks on the six archipelagos ended before a decline in mean temperature or rainfall occurred (Figure S1). Hence it is likely that transmission ceased as a result of depletion of susceptible humans rather than seasonal changes in vector transmission. Therefore we did not include seasonal effects in our analysis.
(A) Mean monthly temperature and rainfall in French Polynesia from 1990-2012. (B) Suspected ZIKV cases in 2013-14.
In the model, SH represents the number of susceptible people, EH is the number of people currently in their latent period, IH is the number of infectious people, RH is the number of people that have recovered, C denotes the cumulative number of people infected (used to fit the model), and N is the population size. Similarly, SV represents the proportion of mosquitoes currently susceptible, EV the proportion in their latent period, and IV the proportion of mosquitoes currently infectious. As the mean human lifespan is much longer than the outbreak duration, we omitted human births and deaths. The full model is as follows:

Parameter definitions and values are given in Table 2. We used informative prior distributions for the human latent period, αH, because the incubation period in humans is typically between 2-7 days [26, 27]; the infectious period, 1/γ, lasted for 4-7 days in clinical descriptions of 297 PCR-confirmed cases in French Polynesia [20]; and the the extrinsic latent period, which has been estimated at 1/αv = 10 days [1]. We assumed uniform prior distributions for all other parameters.
Prior distributions are given for all parameters, along with source(s) if the prior incorporates a specific mean value. All rates are given in units of days−1.
Serological analysis of samples from blood donors between July 2011 and October 2013 suggests that only 0.8% of the population of French Polynesia were seropositive to ZIKV [28]; we therefore assumed that the population was fully susceptible initially. We also assumed that the initial number of latent and infectious people were equal (i.e. ), and the same for mosquitoes
. The basic reproduction number is equal to the product of the average number of mosquitoes infected by the typical infectious human, and vice versa:

Statistical inference
Model fitting was performed using Markov chain Monte Carlo (MCMC). Incidence in week t, denoted ct, was defined as the difference in the cumulative proportion of cases over the previous week i.e. ct = C(t) − C(t − 1). We defined kt as the proportion of total sites that reported cases as sentinels in week t. To account for potential variability in reporting, we assumed the number of confirmed and suspected cases in week t followed a negative binomial distribution with mean rktct and dispersion parameter ϕ [29]. The joint posterior distribution of the parameters was obtained from sampling 30,000 MCMC iterations, after a burn-in period of 10,000 iterations (Figures S2–S7). The model was implemented in R version 3.2.3 [30].
Plot shows marginal posterior estimates for: the mean vector latent period and lifespan; the mean human latent and infectious periods; the basic reproduction number, R0; the proportion of cases reported, r; the magnitude of environmental noise, ϕ; the number of initially infectious humans, and the proportion of the mosquito population initially infectious. Red lines show the assumed prior distributions for the latent periods in humans and mosquitoes, infectious period in humans, and mosquito lifespan.
Plot shows marginal posterior estimates for: the mean vector latent period and lifespan; the mean human latent and infectious periods; the basic reproduction number, R0; the proportion of cases reported, r; the magnitude of environmental noise, ϕ; the number of initially infectious humans, and the proportion of the mosquito population initially infectious. Red lines show the assumed prior distributions for the latent periods in humans and mosquitoes, infectious period in humans, and mosquito lifespan.
Plot shows marginal posterior estimates for: the mean vector latent period and lifespan; the mean human latent and infectious periods; the basic reproduction number, R0; the proportion of cases reported, r; the magnitude of environmental noise, ϕ the number of initially infectious humans, and the proportion of the mosquito population initially infectious. Red lines show the assumed prior distributions for the latent periods in humans and mosquitoes, infectious period in humans, and mosquito lifespan.
Plot shows marginal posterior estimates for: the mean vector latent period and lifespan; the mean human latent and infectious periods; the basic reproduction number, R0; the proportion of cases reported, r; the magnitude of environmental noise, ϕ; the number of initially infectious humans, and the proportion of the mosquito population initially infectious. Red lines show the assumed prior distributions for the latent periods in humans and mosquitoes, infectious period in humans, and mosquito lifespan.
Plot shows marginal posterior estimates for: the mean vector latent period and lifespan; the mean human latent and infectious periods; the basic reproduction number, R0; the proportion of cases reported, r; the magnitude of environmental noise, ϕ; the number of initially infectious humans, and the proportion of the mosquito population initially infectious. Red lines show the assumed prior distributions for the latent periods in humans and mosquitoes, infectious period in humans, and mosquito lifespan.
Plot shows marginal posterior estimates for: the mean vector latent period and lifespan; the mean human latent and infectious periods; the basic reproduction number, R0; the proportion of cases reported, r; the magnitude of environmental noise, ϕ; the number of initially infectious humans, and the proportion of the mosquito population initially infectious. Red lines show the assumed prior distributions for the latent periods in humans and mosquitoes, infectious period in humans, and mosquito lifespan.
Demographic model
We implemented simple a demographic model to examine the repletion of the number of susceptible individuals over time. In 2014, French Polynesia had a birth rate of b = 15.47births/1,000 population, a death rate of d = 4.93deaths/1,000 population, and net migration rate of m = −0.87 migrants/1,000 [31]. The number of susceptible individuals in year t, S(t), and total population size, N(t), was therefore expressed as the following discrete process:

We set S(2014) as the fraction of the population remaining in the S compartment at the end of the 2013-14 ZIKV outbreak, and propagated the model forward to estimate susceptibility in future years. The effective reproduction number, Reff(t), in year t was the product of the estimated basic reproduction number, and the proportion of the population susceptible: Reff(t) = R0S(t). We sampled 5,000 values from the estimated joint posterior distributions of S(2014) and R0 to obtain the curves shown in Figure 3.
Results
Across the six regions, the median estimates for the basic reproduction number, R0, ranged from 1.9 (95% CI: 1.4-3.7) in Marquises to 3.1 (95% CI: 2.1-4.8) in Moorea (Table 3). The credible interval was broadest for the smallest region, Australes, for which we estimated R0 2.9 (1.7-79). Our results suggest that only a small proportion of ZIKV infections were reported as suspected cases: our median estimates implied that between 8-22% of infections were reported. Estimated dispersion in reporting was greatest for Marquises and Australes (Figures S2–S7), reflecting the variability in the observed data (Figure 2), even after adjusting for variation in the number of sentinel sites.
The estimated proportion of the population that were infected during the outbreak (including both reported and unreported cases) was above 75% for all six regions (Table 3), and we estimated that 86% (95% CI: 75-93%) of the total population were infected during the outbreak. Serological analysis of the ZIKV outbreak on Yap also found a high level of infection, with 73% (95% CI 66-77%) of individuals aged 3 and over infected during the outbreak [2]. Our posterior estimates for the human latent and infectious periods were consistent with the assumed prior distributions (Figures S2–S7), suggesting either that there was no strong evidence that these parameters had a different distribution, or that the model had limited ability to identify these parameters from the available data.
During the 2013-14 outbreak in French Polynesia, there were 42 reported cases of GBS [13]. This corresponds to an incidence rate of 15.3 (95% binomial CI: 11.0-20.7) cases per 100,000 population, whereas the established annual rate for GBS is 1-2 cases per 100,000 [10]. In total, there were 8,744 confirmed and suspected ZIKV cases reported at sentinel sites in French Polynesia, which gives an incidence rate of 480 (95% CI: 346-648) GBS cases per 100,000 suspected Zika cases reported at these sites.
Black dots show weekly reported confirmed and suspected ZIKV cases from sentinel sites. Blue line shows median of 2,000 simulated trajectories from the fitted model, adjusted for variation in reporting over time; shaded region shows 95% credible interval.
Estimates for the basic reproduction number, R0; the proportion of infected individuals that were reported as suspected cases at sentinel sites; and the total proportion of the population infected (including both symptomatic and asymptomatic cases). Median estimates are given, with 95% credible intervals in parentheses. The full posterior distributions are shown in Figures S2–S7.
However, if we calculate the GBS incidence rate per estimated total ZIKV cases, we obtain a rate of 18.3 (12.9-24.5). These credible intervals overlap substantially with the above incidence rate calculated with population size as the denominator, indicating that the two rates are not significantly different.
Using a demographic model we also estimated the potential for ZIKV to cause a future outbreak in French Polynesia. We combined our estimate of the proportion of the population that remained susceptible after the 2013-14 outbreak and R0 with a birth-death-migration model to estimate the effective reproduction number, Reff, of ZIKV in future years. If Reff is greater than one, an epidemic would be possible in that location. Assuming that ZIKV infection confers lifelong immunity against infection with ZIKV, our results suggest that it would likely take 15-20 years for the susceptible pool in French Polynesia to be sufficiently replenished for another outbreak to occur (Figure 3). This is remarkably similar to the characteristic dynamics of DENV in the Pacific island countries and territories, with each of the four DENV serotypes re-emerging in sequence every 15-20 years, likely as a result of the birth of a new generation of susceptible individuals [17, 32].
(A) Tahiti, (B) Sous-le-vent, (C) Moorea, (D) Tuamotu, (E) Marquises, (F) Australes. Line shows median from 1,000 samples of the posterior distribution, shaded region shows 95% credible interval.
Discussion
Using a mathematical model of ZIKV transmission, we analysed the dynamics of the infection during the 2013-14 outbreak in French Polynesia. In particular, we estimated key epidemiological parameters, such as the basic reproduction number, R0, and the proportion of infections that were reported. Across the six regions, our median estimates suggest that between 8-22% of infections were reported as suspected cases. This does not necessarily mean these cases were asymptomatic; individuals may have had mild symptoms and hence did not enter the healthcare system. For example, although the attack rate for suspected ZIKV disease cases was 2.5% in the 2007 Yap ZIKV outbreak, a household study following the outbreak found that around 19% of individuals who were seropositive to ZIKV had experienced ZIKV disease-like symptoms during the outbreak period [2].
Our median estimates for R0 ranged from 1.9-3.1 across the six main archipelagos of French Polynesia, and as a result the median estimates of the proportion of the populations became infected in our model spanned 76-95%. The ZIKV outbreak in French Polynesia coincided with a significant increase in Guillain-Barre syndrome (GBS) incidence [13]. We found that although there was a raw incidence rate of 480 (95% CI: 346-648) GBS cases per 100,000 suspected ZIKV cases reported, the majority of the population was likely to have been infected during the outbreak, and therefore the rate per infected person was similar to the overall rate per capita. This could have implications for the design of case-control studies to test for an association between ZIKV infection and neurological complications.
If infection with ZIKV confers lifelong immunity, we found it would take at least a decade before re-invasion were possible. In the Pacific island countries and territories, replacement of DENV serotypes occurs every 4-5 years [17, 32], and therefore each specific serotype re-emerges in a 15-20 year cycle. The similarity of this timescale to our results suggest that ZIKV may exhibit very similar dynamics to DENV in island populations, causing infrequent, explosive outbreaks with a high proportion of the population becoming infected. However, it remains unclear whether ZIKV could become established as an endemic disease in larger populations, as dengue has.
For immunising infectious diseases, there is typically a ‘critical community size’, below which random effects frequently lead to disease extinction, and endemic transmission cannot be sustained [16]. Analysis of dengue outbreaks in Peru from 1994-2006 found that in populations of more than 500,000 people, dengue was reported in at least 70% of weekly records [33]. Large cities could have the potential to sustain other arboviruses too, and understanding which factors-from population to climate-influence whether ZIKV transmission can become endemic will be an important topic for future research. We did not consider seasonal variation in transmission as a result of climate factors in our analysis, because all six outbreaks ended before there was a substantial seasonal change in rainfall or temperature. If the outbreaks had ended as a result of seasonality, rather than depletion of susceptibles, it would reduce the estimated proportion of the population infected, and shorten the time interval before ZIKV would be expected to re-emerge.
There are some additional limitations to our analysis. We used a homogeneous mixing model, in which all individuals had equal chance of contact. In reality, there may be spatial heterogeneity in transmission, leading to a depletion of the susceptible human pool in some areas but not in others. As we were only fitting to a single time series for each region, we also assumed prior distributions for the latent and infectious periods in humans. If seroprevalence data become available in the future, they could give an indication of how many people were actually infected, which would make it possible to constrain more of the model parameters. However, such studies may require careful interpretation, as antibodies may cross-react between different flaviviruses [12].
Our results suggest that ZIKV transmission in island populations may follow similar patterns to dengue fever, generating large, sporadic outbreaks with a high degree of under-reporting. If a substantial proportion of such populations become infected during an outbreak, it may take several years for the infection to re-emerge in the same location. A high level of infection, combined with rare outbreaks, could also make it more challenging to establish a causal link between infection and concurrent neurological complications.
Footnotes
↵* E-mail: adam.kucharski{at}lshtm.ac.uk