Socio-environmental and measurement factors drive spatial variation in influenza-like illness

Elizabeth C. Lee; Ali Arab; Sandra Goldlust; Cécile Viboud; Shweta Bansal

doi:10.1101/112680

Abstract

The mechanisms hypothesized to drive spatial heterogeneity in reported influenza activity include: environmental factors, contact patterns, population age structure, and socioeconomic factors linked to healthcare access and quality of life. Harnessing the large volume and high specificity of diagnosis codes in medical claims data for influenza seasons from 2002-2009, we estimate the importance of socio-environmental determinants and measurement-related factors on observed variation in influenza-like illness (ILI) across United States counties. We found that South Atlantic states tended to have higher ILI seasonal intensity, and a combination of transmission, environmental, influenza subtype, socioeconomic and measurement factors explained the variation in seasonal intensity across our study period. Moreover, our models suggest that sentinel surveillance systems should have fixed report locations across years for the most robust inference and prediction, and high volumes of data can offset measurement biases in opportunistic data samples.

Introduction

Seasonal influenza represents an important public health burden worldwide, and even within a single year, there is substantial variation in disease burden across populations (Moorthy et al., 2012; Lee et al., 2015). Many studies have examined the drivers and patterns influenza seasonality (Lofgren et al., 2007; Tamerius et al., 2011), while others have focused on the large-scale spatial patterns in influenza epidemic timing, suggesting for instance, spread from West to East across North America due to a combination of local contact patterns and global travel patterns (Wenger and Naumova, 2010; Schanzer et al., 2011b;Grais et al., 2003; Brownstein et al., 2006). While there are numerous studies explaining spatial variation in seasonal influenza transmission and disease burden, most studies focus on very aggregated or very local study areas (e.g., country-level or one school district, respectively) compare only one or two hypotheses in isolation.

Among these, humidity and temperature have each been associated with seasonal flu onset, seasonal fluctuations, and heightened morbidity and mortality in epidemiological contexts (Shaman et al., 2010;Yu et al., 2013;Barreca and Shimshack, 2012;Deyle et al., 2016), and lower humidity and colder temperatures may increase influenza virus transmission and survival (Lowen et al., 2007; Shaman and Kohn, 2009) Chronic illnesses such as asthma, exacerbated by air pollution, elevated the risk for severe symptoms of pandemic H1N1 (Van Kerkhove et al., 2011) Empirical evidence supports the occurrence of both aerosol and droplet transmission of influenza virus (Killingley and Nguyen-Van-Tam, 2013) and these transmission modes suggest that influenza seasons may follow both density-dependent and frequency-dependent disease dynamics (per capita contact rates between susceptible and infectious individuals do and do not change with population density, respectively). The high connectivity of school-aged children in contact surveys (Mossong et al., 2008; Kucharski et al., 2014) has led to hypotheses that children drive local transmission and adults seed new infections across longer distances (Viboud et al., 2006; Apolloni et al., 2013) which may manifest in shifted epidemic timings across age groups (Lemaitre and Carrat, 2010;Peters et al., 2014;Schanzer et al., 2010;Wallinga et al., 2006; Timpka et al., 2012) Immune landscapes vary across locations; epidemic outcomes in one season may trickle down to subsequent years through differences in cross-protective immunity, and high flu vaccination coverage may reduce morbidity and incidence of severe clinical outcomes (Kostova et al., 2013) Finally, flu type and subtype circulation may also drive spatial heterogeneity; A/H3-dominant flu seasons are associated with greater morbidity and mortality and an older patient age distribution than A/H1 season (Frank et al., 1985;Simonsen et al., 1997;Khiabanian et al., 2009; Peters et al., 2014) while influenza B is thought to circulate predominantly and earlier among children (Peters et al., 2014;Hayward et al., 2014; Beauté et al., 2015)

Beyond socio-environmental mechanisms, we must consider the possibility that the measurement of influenza disease burden plays a significant role in driving the observed spatial heterogeneity. While poverty and other social determinants are thought to increase risk for influenza morbidity, hospitalization, and mortality (Lowcock et al., 2012;Kumar et al., 2015;Hadler et al., 2016;Charland et al., 2011; Grantz et al., 2016) these observations are often confounded by care-seeking behavior, the likelihood that sick individuals will seek treatment from a health care provider. Roughly 43% of adults and 60% of elderly seek care for influenza-like illness (ILI) in the United States, as many cases are too mild to warrant a visit to the doctor (Biggerstaff et al., 2014b). In addition to differences in personal choice, limited access to health care and health insurance also delay or reduce care-seeking behavior, further generating biases in reported case severity or patient numbers among physician-based surveillance systems (Biggerstaff et al., 2014b)

In this study, we examine the transmission, environmental, influenza-specific, and socioeconomic mechanisms and measurement processes underlying the spatial variation in reported influenza-like illness across counties in the United States. Leveraging highly resolved medical claims data, we identified important drivers of spatial heterogeneity in the magnitude and duration of flu seasons from 2002 to 2009 in a large-scale ecological analysis. We then used our Bayesian modeling framework in new applications to probe the robustness of this ecological inference with limited data availability and to assess the predictive ability of our model in a more recent flu season. Our results highlight the relative contributions of surveillance data collection and socio-environmental processes to disease reporting, and highlight the importance of considering measurement biases when using surveillance data for epidemiological inference and prediction.

Results

We examined the socio-environmental and measurement-related drivers of spatial heterogeneity in influenza disease burden across U.S. counties for flu seasons from 2002-2003 through 2008-2009 using a hierarchical Bayesian modeling approach. Using medical claims data representing 2.5 billion visits from upwards of 120,000 health care providers each year, our study considered six disease burden response variables: two measures of influenza disease burden (relative risk of seasonal intensity, which is a proxy for attack rate, and epidemic duration in number of weeks) in three populations (total population, children 5-19 years old, and adults 20-69 years old) with multi-season and single season model structures. There were 13 county-level, 2 state-level and 4 HHS region-level predictors in the final model Table 1; all predictors were the same across response variables except care-seeking behavior, which was specific to the age group in the response. The seasonal intensity model fit the data well and the Pearson’s cross-correlation coefficient between the log seasonal intensity and log prediction was R = 0.87 (Figure 1). Results reported in the following sections are from the multi-season total population seasonal intensity model unless otherwise noted.

View this table:

Table 1. Final model predictors and hypotheses.

Figure 1.

Continental U.S. county map for fitted and observed relative risk of seasonal intensity for an example flu season (2006-2007).

Figure 1-Figure supplement 1. Continental U.S. county maps for fitted (left) and observed (right) relative risk of seasonal intensity for remaining influenza seasons.

Temporal and spatial patterns of influenza-like illness

Group (random) effects were used to identify consistent spatial or temporal patterns across locations and study years. We found that the 2004-2005 flu season had greater seasonal intensity, while 2008-2009 had relatively low seasonal intensity (Figure 2). For the seasonal intensity model, no single region had a significant group effect, although several South Atlantic states like Georgia, Maryland, North Carolina, South Carolina, and Virginia had relatively greater risk than other states across the study period, while several Plains and Rocky Mountain states like Kansas, Minnesota, Missouri, Montana, and Utah had relatively lower risk.

Figure 2.

Temporal and spatial group effects for total population seasonal intensity. A) 95% credible intervals for group (random) effects by influenza season. B) Continental U.S. maps highlighting states with significantly greater or lower seasonal intensity across the study period.

Drivers of seasonal intensity

Several socio-environmental drivers of seasonal intensity risk were identified in the multi-season model (Figure 3). Total seasonal intensity had positive associations with the adult-flu H3 and child-flu B interaction terms, estimated average household size, and a proxy for prior immunity. There were negative associations with adult and child population proportions, average flu season specific humidity, proportion of the population in poverty, proportion of single person households, and infant vaccination coverage.

Figure 3.

For the total population multi-season seasonal intensity models, these are the 95% credible intervals for the posterior distributions of the A) socio-environmental coefficients and B) measurement-related coefficients. Distributions indicated in green were statistically significant.

Figure 3-Figure supplement 1. For the total population single-season seasonal intensity models, these are the 95% credible intervals for the posterior distributions of the socio-environmental coefficients.

Figure 3-Figure supplement 2. For the total population single-season seasonal intensity models, these are the 95% credible intervals for the posterior distributions of the measurement coefficients.

We found that careseeking behavior and claims database coverage had strong positive associations with seasonal intensity (Figure 3). In considering the single-season models, the positive effect of claims database coverage on seasonal intensity appeared to decline in magnitude over time (Figure 3 supplement). This corresponded with an increase in claims database coverage over time (Appendix 5).

Drivers of age-specific seasonal intensity

Children and adults comprise the largest components of the U.S. population, and many studies have considered shifts in epidemic timing and immunity due to differences in contact patterns, shifting risk between children and adults over time, interactions between influenza types/subtypes by age, and differences in vaccine effectiveness by age group (Bansal et al., 2010;Lee et al., 2015;Ewing et al., 2016; Schanzer et al., 2011a;Gostic et al., 2016; Khiabanian et al., 2009). Considering the potential to elucidate age-specific transmission mechanisms and improve targeting of public health interventions, we used the multi-season model to examine drivers of seasonal intensity in the child and adult populations. Full model results are reported in Appendix 2, and for both age groups, predicted value means appeared to be systematically over-estimated relative to the observed relative risk of seasonal intensity. The Pearson’s cross-correlation coefficient between the log observation and log predicted mean was R = 0.89 and R = 0.90 for the child and adult seasonal intensity models, respectively.

Children had greater intensity in the 2003-2004 flu season and lower intensity in the 2002-2003 and 2008-2009 flu seasons. Adults had greater intensity in the 2004-2005 flu season and lower intensity in the 2008-2009 flu seasons. Similar to results for the total population, several South Atlantic states had greater risk while Plains states had lower risk of seasonal intensity for both children and adults.

Across the three age group responses (i.e., total, children, adults), child seasonal intensity had a unique positive association with influenza B circulation and adult seasonal intensity had a unique positive association with H3 circulation among influenza A and proportion of the population in poverty. Also notable, both child and adult seasonal intensity had a negative association with estimated average household size, while the total seasonal intensity model had a positive effect.

Drivers of epidemic duration

We also considered the mechanisms associated with epidemic duration, a measure of influenza disease burden that captures the number of weeks with heightened ILI activity. Better understanding of factors associated with longer epidemics might improve hospital preparedness in surge capacity and staffing needs and aid local public health departments in planning their influenza information or vaccination campaigns. Full results for a multi-season model of epidemic duration for the total population are reported in Appendix 3, but predicted value means appeared to be systematically under-estimated relative to the observed epidemic durations and the Pearson’s cross-correlation coefficient between the observed and predicted mean number of epidemic weeks was R = 0.71.

The 2004-2005 and 2007-2008 flu seasons had longer epidemics while the 2002-2003 and 2008-2009 seasons tended to have shorter epidemics. The Southeastern U.S. region (HHS region 4) had longer epidemics than other regions, while only five states with no geographic identity had significant group effects for the epidemic duration model. Epidemic duration had positive associations with the interaction between adult population and influenza H3 circulation, influenza B circulation, estimated average household size, population density, a proxy for prior immunity, and elderly vaccination coverage. There were negative associations with H3 circulation among influenza A, average flu season specific humidity, and proportion of the population in poverty. With regard to measurement factors, careseeking behavior and claims database coverage had strong positive associations with epidemic duration.

Applications to surveillance

Considering the large volume and spatial resolution of our data, we sought to explore the robustness of our inference and model predictions under more realistic circumstances. Two sequences of models were designed to mimic different types of real-world sentinel flu surveillance systems —fixed-location sentinels, where the same sentinel locations reported data every year, and moving-location sentinels, where new sentinel locations are recruited each year. A third model sequence considered the specificity of inference and model predictions to certain inclusion of historical data, thus providing insight into the generalizability of our model to epidemic forecasting. We examine these applications for the total population seasonal intensity model, and these may also serve as a sensitivity analysis to missing observations. Ten replicates were performed for each model with missingness to generalize findings beyond that of random chance.

Sentinels in fixed locations

In this sequence of four models, 20, 40, 60, and 80% of randomly chosen county observations were removed across all years. The effect sizes of drivers were pulled towards zero as fewer sentinel counties reported ILI seasonal intensity, but the primary conclusions remained robust. We noted that the positive effect of care-seeking increased across most model replicates and insurance coverage shifted from no effect to a slightly positive effect as sentinel reporting declined (Figure 4A). Model predictions (county-season fitted values) remained quite robust relative to the complete model, even when 80% of counties were excluded (Figure 4B).

Figure 4.

A) Diagram indicating changes to model inference as fewer fixed-location sentinels reported data. Color indicates directionality of the significant effect (blue is positive, red is negative) while greater transparency indicates a lower percentage of replicates with a significant effect (for models with missingness); dot size represents the magnitude of the posterior mean (or average of the posterior mean across replicates). Predictors with no significant effect across the sequence of models were removed for viewing ease, and absence of a dot means the effect was not significant across any replicates. B) Map of model prediction match between the complete model and the 40% and 20% reporting levels for fixed-location sentinels. Match between the complete and sentinel models were aggregated across 70 season-replicate combinations (7 seasons * 10 replicates). Color indicates match between posterior predictions in the missing and complete models (purple represents a failure to match in at least half of season-replicate combinations).

Figure 4-Figure supplement 1. Diagram indicating changes to model inference as fewer moving-location sentinels reported data.

Figure 4-Figure supplement 2. Map of model prediction match between the complete model and the 60% and 80% missing levels for moving-location sentinels.

Figure 4-Figure supplement 3. Diagram indicating changes to model inference as historical seasons were randomly removed from the model.

Figure 4-Figure supplement 4. Map of model prediction match between the complete model and models missing one, three, or five historical flu seasons.

Sentinels in moving locations

In this sequence of four models, 20, 40, 60, and 80% of randomly chosen seasonally-stratified observations were removed. Similar to the fixed-location sequence, drivers were pulled towards zero as fewer sentinel counties reported ILI, the drivers with the smallest means were pulled towards zero and predictors with no effect in the complete model were found to be significant (Figure 4 supplement). Model predictions had good agreement with the complete model up to a threshold between 60 and 80% missingness, where many county-season fits suddenly became poor.

Inclusion of historical data

In this sequence of models, one, three, and five out of seven flu seasons in the study period were completely removed. As hinted by the inconsistency of inference across seasons in the single season model results (Figure 3), important drivers changed substantially when more than one season was removed, particularly when they had small effect sizes in the complete model (Figure 4 supplement). Notably, medical claims coverage and care-seeking were two of three predictors that remained consistent in the magnitude and direction of inference across all model replicates. Model predictions were robust relative to the complete model only when one season was removed. Beyond that, many seasonal fitted values were poor, particularly for some seasons where data had been removed.

Discussion

Using hierarchical modeling approaches, we explored the contributions of 19 potential predictors towards county-level variation in influenza disease burden across the United States during flu seasons from 2002-2003 to 2008-2009. To our knowledge, this is the first large-scale study to compare the relative importance of environmental, demographic, and socioeconomic hypotheses about influenza disease burden in addition to data reporting biases. The fine spatial resolution and high coverage of our medical claims data (estimated to represent 20% of all health care visits across the United States in our study period) enabled the comparison of multiple hypotheses, and the inclusion of several flu seasons and sensitivity analyses enhance confidence in the robustness of our findings.

Our model results suggest that South Atlantic states may experience flu seasons most acutely because they have higher seasonal intensities relative to their baselines, and greater examination of flu season surveillance and surge capacity in these areas may be warranted. We also found that a mixture of factors explained the variation in our model and that these factors changed across different cross-sections of time, thus highlighting the necessity of cross-disciplinary approaches (e.g., from sociology to epidemiology to immunology) in future pursuits of this question. Moreover, the declining importance of claims database coverage (i.e., population representativeness of the data) as coverage increased underscores the relevance of collecting and using metadata when making epidemiological inference from opportunistic sources or undesigned observational samples. The ability for our model to project relatively accurate fitted values across increasingly missing data suggests that routine sentinel surveillance in fixed locations may be more accurate for interpolating ILI disease burden among uncovered areas than surveillance across changing locations, even when fewer locations may be surveyed.

Prior studies have reported relationships between low absolute humidity and greater influenza transmission and survival in experimental settings, and that fluctuations in absolute humidity may explain the seasonality of influenza across large geographic scales (Tamerius et al., 2011; Lowen and Steel, 2014). Our study adds to this literature in finding strong negative associations between absolute humidity and both seasonal intensity and epidemic duration. In addition, our results elucidate the debate about whether influenza transmits primarily through frequency-or density-dependent contact. Greater seasonal intensity was associated with populations with larger household sizes (a proxy for infection risk from frequent contacts), while longer epidemics were associated with larger household sizes and greater population density. We suspect that density-dependent transmission explained differences in epidemic duration but not seasonal intensity because the calculation for seasonal intensity accounted for population size; population density did not explain variation in the risk of seasonal intensity after adjusting for greater transmission among larger populations.

Household studies of influenza transmission often examine age-specific risks of household influenza introduction (Cauchemez et al., 2004; Lau et al., 2015), and differences in contact and travel patterns between children and adults have led to the hypothesis that children drive local transmission while adults drive global influenza spread (Apolloni et al., 2013; Viboud et al., 2006) Contrary to these hypotheses, larger child and adult population proportions were both associated with lower seasonal intensity. Rather than serving as proxies for local and global transmission, the complement of these predictors together may in fact capture the “high-risk” population proportion in a given location—infants, toddlers, and the elderly—which typically experience greater clinical severity (Thompson et al., 2006) and have higher rates of care-seeking (Biggerstaff et al., 2012). In examining seasonal intensity models for the child and adult populations specifically, we were surprised to find negative associations with population density and average household size, when there was no effect or a positive effect in the total population model (Appendix 2). While it may be that children and adults in less connected areas have greater seasonal intensity relative to their ILI baselines, these patterns may also be an artifact of smaller volumes of data among age groups.

The positive association between influenza A/H3 and adult intensity and influenza B and child intensity corroborate the results of previous epidemiological studies (Hayward et al., 2014; Beauté et al., 2015) and agree with the positive effect of the interaction terms between children and influenza B and adults and influenza A/H3 from our total seasonal intensity models (Appendix 2). Despite a positive linear correlation between the seasonal intensity and epidemic duration measures (Appendix 4), influenza B circulation uniquely indicated longer epidemics, in line with hypotheses that flu seasons are elongated when influenza B resurges among children after a first wave of influenza A (Hayward et al., 2014; Beauté et al., 2015). We acknowledge that our findings may be specific to our study period; recent research highlights the importance of childhood hemagglutinin imprinting on immune responses to subsequent influenza infections (Gostic et al., 2016).

We were surprised to observe that higher estimated prior immunity was associated with greater seasonal intensity and longer epidemic durations for the multi-season models and most seasons in the single-season models (some years experienced no effect). One possible interpretation is that some locations always tend to have high disease burden relative to their epidemic baselines. Prior work suggests that larger epidemics induce more antigenic drift in subsequent seasons (Boni et al., 2004); building off this finding, we suggest that influenza drift renews population susceptibility every flu season, even on small spatial scales. We also acknowledge limitations underlying the calculation of this predictor; in using the seasonal intensity measure to represent the previous flu season’s attack rate, we ignore asymptomatic infection, vaccination rates, and the reporting biases found to be an important component to data observation. Additionally, membership in the same antigenic cluster is a simplification of the immunity conferred by infection with a given strain. Beyond “pre-existing immunity”, we report mixed findings on the effect of flu vaccination. While higher vaccination coverage among toddlers was associated with lower seasonal intensity, we note that higher vaccination coverage among elderly was associated with longer epidemics. We posit that vaccination campaigns among elderly populations may increase in anticipation of large or severe flu seasons, due to their risk of severe complications from flu and clustered living in nursing homes.

Our study found that locations with greater poverty had lower influenza disease burden, in contrast with ample evidence that there are heightened rates of influenza-related hospitalizations, influenza-like illness, respiratory illness, neglected chronic diseases, and other measures of poor health among populations with greater material deprivation (Hadler et al., 2016; Monto and Ullman, 1974; Tam et al., 2014; Biggerstaff et al., 2014b,a; Charland et al., 2011;Hotez, 2008;Adler and Newman, 2002; Steptoe and Feldman, 2001). Several possible non-exclusive explanations for this discrepancy exist. Differences in socio-economic background may change recognition and therefore reporting of disease symptoms (Monto and Ullman, 1974). Material deprivation and lack of social cohesion have also been implicated in lower rates of health care utilization for ILI, which would reduce the observation of influenza disease burden in our medical claims data among the poorest populations (Charland et al., 2011; Biggerstaff et al., 2014a). Indeed, higher rates of health care-seeking were associated with greater disease burden, while hospitals per capita had no effect among our results, which further suggests that patient-side needs and concerns captured ILI variation better than deficits in health resource availability. Future studies focused on estimation and surveillance of influenza disease burden should consider collecting and incorporating data on health care utilization in their populations of interest in order to account for reporting biases and limited forecasting ability in poorer neighborhoods (Scarpino et al., 2016).

Building off mechanistic explanations for measurement biases, we noted that the positive explanatory effect of claims database coverage declined as coverage itself increased throughout our study period (Appendix 5). Conversely, when we artificially removed counties from our model (fixed-location sentinels) or subset our data into age groups, health care-seeking behavior more strongly explained the variation in seasonal intensity among the remaining observations. These two results together suggest that statistical inference from opportunistic data samples may avoid some types of reporting biases when the coverage or volume of data achieves a minimum threshold, in response to concerns posed in Lee et al. (2016). In our specific case, increases to claims database coverage or care-seeking behavior might reduce reporting biases by increasing the representativeness of a given location’s sample. Additionally, we present the concept of a network of sentinel locations, in contrast to sentinel physicians or hospitals, which may be composed of administrative units (e.g., counties) that were chosen for either their representativeness of the larger population or their status as an outlier (e.g., match or failure to match locations in Figure 4, respectively). Given the growing availability of health-associated big data in infectious disease surveillance (Bansal et al., 2016; Simonsen et al., 2016), we project the possibility that sentinel locations may report high volume digital health data from disparate sources to a central public health organization and that the informed choice of sentinels may improve the robustness of sentinel surveillance systems.

We urge caution in the interpretation of our results because they are correlative and prone to invoking the ecological fallacy, where statistical inference about a group (in our case, county populations) is falsely assumed to apply at the individual level (Morgenstern, 1982; Robinson, 2009). Future research should build off our study to design experiments that may provide causal or individual-level evidence that supports or rejects these hypotheses. We also acknowledge the limitations of the spatial and temporal resolutions of the data used in our analysis. Previous work suggests that statistically-identified drivers of disease distributions depend on the spatial scale of analysis (Cohen et al., 2016), and our results may be biased by the county unit observations of our disease data. In addition, we incorporated multiple scales of predictors (county, state, and HHS region) according to the best available data, thus potentially altering our statistical inference, although we did attempt to account for differences in variation across these different predictors with the inclusion of group effects. In addition, we note that the nature of our disease burden estimation procedure means that a given county’s seasonal intensity is relative to its own baseline across years. It may not be appropriate to use our model predictions to inform national-level decision makers about absolute intensity of the flu season in a given location, although local public health departments could use our procedure to assess intensity in a given year relative to that of previous flu seasons.

Methods

Medical claims data

Weekly visits for influenza-like illness (ILI) and any diagnosis from October 2002 to May 2009 were obtained from a records-level database of CMS-1500 US medical claims managed by IMS Health and aggregated to three-digit patient US zipcode prefixes (zip3s), where ILI was defined with International Classification of Diseases, Ninth Revision (ICD-9) codes for: direct mention of influenza, fever combined with respiratory symptoms or febrile viral illness, or prescription of oseltamivir. Medical claims have been demonstrated to capture respiratory infections accurately and in near real-time (Cadieux and Tamblyn, 2008; Santillana et al., 2016), and our specific dataset was validated to independent ILI surveillance data at multiple spatial scales and age groups and captures spatial dynamics of influenza spread in seasonal and pandemic scenarios (Viboud et al., 2014; Gog et al., 2014; Charu et al., 2017).

We also obtained database metadata from IMS Health on the percentage of reporting physicians and the estimated effective physician coverage by visit volume; these data were used to generate “measurement” predictors (Table 1). ILI reports and measurement factors at the zip3-level were redistributed to the county-level according to population weights derived from the 2010 US Census ZIP Code Tabulation Area (ZCTA) to county relationship file, assuming that ZCTAs that shared the first three digits belonged to the same zip3.

Defining influenza disease burden

We performed the following data processing steps for each county-level time series of ILI per population: i) Fit a LOESS curve to non-flu period weeks (flu period defined as November through March each year) to capture moderate-scale time trends (span = 0.4, degree = 2); ii) Subtract LOESS predictions from original data to detrend the entire time series; iii) Fit a linear regression model with annual harmonic terms and a time trend to non-flu period weeks (Yu et al., 2013); iv) Counties “had epidemics” in a given flu season if at least two consecutive weeks of detrended ILI observations exceeded the ILI epidemic threshold during the flu period (i.e., epidemic period) (Denoeud et al., 2007). The epidemic threshold was the upper bound of the 95% confidence interval for the linear model prediction. Counties with a greater number of consecutive weeks above the epidemic threshold during the non-flu period than during the flu period were removed from the analysis.; v) Disease burden metrics were calculated for counties with epidemics.

Multiple measures of influenza disease burden were defined for each county. For a given season: seasonal intensity was the one plus the sum of detrended ILI observations during the epidemic period (shifted by one to accomodate the likelihood distribution); epidemic duration was the number of weeks in the epidemic period and counties without epidemics were assigned the value zero.

Predictor data collection and variable selection

Quantifiable proxies were identified for each hypothesis found in the literature, and these mechanistic predictors were collected from probability-sampled or gridded, publicly available sources and collected or aggregated to the smallest available spatial unit among US counties, states, and Department of Health and Human Services (HHS) regions for each year or flu season in the study period, as appropriate (Table 1, Appendix 5).

We selected one predictor to represent each hypothesis according to the following criteria, in order: i) Select for the finest spatial resolution; ii) Select for the greatest temporal coverage for years in the study period; iii) Select for limited multicollinearity with predictors representing the other hypotheses, as indicated by the magnitude of Spearman rank cross-correlation coefficients between predictor pairs. We also compared the results of single predictor models and our final multivariate models as another check of multicollinearity (Appendix 5). For the modeling analysis, if a predictor had missing data at all locations for an entire year, data from the subsequent or closest other survey year were replicated to fill in that year. If a predictor data source was available only at the state or region-level, all inclusive counties were assigned the corresponding state or region-level predictor value (e.g., assign estimated percentage of flu vaccination coverage for state of California to all counties in California). Predictors were centered and standardized prior to all exploratory analyses and modeling, as appropriate. Interaction terms comprised the product of their component centered and standardized predictors. Data cleaning and exploratory data analysis were conducted primarily in R (R Core Team, 2015). Final model predictors are described below, and our hypotheses for each predictor are described in Table 1.

Environmental data

Daily specific humidity data on a 2m grid were collected from the National Oceanic and Atmospheric Administration (NOAA) North American Regional Reanalysis (NARR), provided by the NOAA/OAR/ESRL PSD, Boulder, Colorado, USA, from their website at http://www.esrl.noaa.gov/psd/. Values were assigned to the grid point nearest to the county centroid.

Readings of fine particulate matter, defined as pollutants with aerodynamic diameter less than 2.5 micrometers, were collected from the CDC WONDER database at the county and daily scales from their website at https://wonder.cdc.gov/.

Social contact and population data

Annual total and age-specific population data were taken from the intercensal population estimates and land area and number of housing units were reported during the 2000 and 2010 Census; both datasets were available at the county scale from the U.S. Census Bureau. These data were used to calculate proportion of total population that are children (5-19 years old) and adults (20-69 years old), population density by land area, and estimated average household size.

Flu-specific data

Annual flu vaccination rates for toddlers (19-35 months old) and the elderly (≥ 65 years old) were estimated at the state-level from the Centers for Disease Control and Prevention (CDC) National Immunization Survey and Behavioral Risk Factor Surveillance System, respectively. Annual proportion of A-typed flu samples subtyped as H3 and annual proportion of confirmed flu samples typed as B across U.S. Department of Health and Human Services (HHS) regions were collected by WHO/NREVSS Collaborating Labs and available at the CDC FluView website at http://www.cdc.gov/flu/weekly/fluviewinteractive.htm.

Prior immunity

For a given county, a proxy for prior immunity was derived from the following data: 1) the previous flu season’s total population seasonal intensity; the proportion of positive flu strains identified as A/H3, A/H1, and B in the broader HHS region during 2) the previous flu season and 3) the current flu season; 4) the most prominently circulating flu strain for each category (A/H3, A/H1, or B) for each flu season; 5) antigenic clusters for A/H3 and A/H1 strains as identified in Du et al. (2012); Liu et al. (2015); and 6) Victoria-or Yamagata-like lineages for B strains as noted in Bedford et al. (2014). Data for items 1-3 are described above in “Defining influenza disease burden” and “Flu-specific data.” We obtained the antigenic characterizations for circulating strains (item 4) from CDC influenza season summaries, which are available at https://www.cdc.gov/flu/weekly/pastreports.htm.

Using these data, we calculated a proxy of prior immunity that captures “the proportion of individuals infected in the previous flu season that would have protection during the current flu season, accounting for the distribution of circulating flu strains.” For each flu category among A/H3, A/H1, and B, we calculated the product of the previous and current year’s proportion of total circulation and a binary value to indicate if previous and current strains were from the same antigenic cluster or lineage (1 = same cluster/lineage, 0 = different cluster/lineage). For a given county, these products were summed across A/H3, A/H1, and B, and multiplied by the previous year’s seasonal intensity.

Socioeconomic and access to care data

Annual data on number of hospitals were obtained at the county-level from the Health Resources and Services Administration (HRSA) Area Health Resources Files (AHRF). County-level data on proportion of households with a single person were obtained from five-year averages of American Community Survey (ACS) estimates, which were available starting in 2005. Annual estimates on proportion of the population in poverty was obtained at the county-level from the model-based Small Area Income and Poverty Estimates (SAIPE). Annual estimates on proportion of the population with health insurance was obtained at the county-level from the model-based Small Area Health Insurance Estimates (SAHIE). SAIPE and SAHIE are both products of the U.S. Census Bureau that were derived from the Current Population Survey or ACS.

Medical claims measurement factors

IMS Health provided us with weekly aggregated data on visits for any diagnosis by age group and location. Care-seeking behavior was defined as the total visits per population size from November through April of a given flu season. Claims database coverage was the estimated physician coverage among all physicians registered by the American Medical Association in the IMS Health medical claims database.

Model structure

We present the most common version of our model structure here. The generic model for county-year observations (for i counties and t years) of influenza disease burden y_it is: where y = (y₁,…, y_n)’ denotes the vector of all observations (Equation 1). We modeled the mean of the observed disease burden magnitude (μ_i), where f(y|μ, τ) is the distribution of the likelihood of the disease burden data, parameterized with mean μ = (μ₁,…, μ_n)′ and precision τ, as appropriate to the likelihood distribution (N.B., for the Poisson likelihood, μ = 1/τ)

The mechanisms driving disease burden were modeled: where g(.) is the link function, α is the intercept, there are m socio-environmental and measurement predictors (X_i’s), and E_i is an offset of the expected disease burden, such that Equation 2 models the relative risk of disease (μ_i/Ε_i) in county i, common in disease mapping (Lawson, 2013;Banerjee et al., 2015; Waller and Carlin, 2010). Group terms at the county, state, region, and season levels (γ_i,ζ_j_[i],η_k[i],ν_t, respectively) and the error term (ϵ_it) are independent and identically distributed (iid)

Geographical proximity appears to increase the synchrony of flu epidemic timing (Schanzer et al., 2011b; Stark et al., 2012), while connectivity between cities has been linked with spatial spread in the context of commuting and longer distance travel (Charaudeau et al., 2014;Brownstein et al., 2006; Crépey and Barthélemy, 2007; Lemey et al., 2014) We modeled county spatial dependence φ_t with an intrinsic conditional autoregressive (ICAR) model, which smooths model predictions by borrowing information from neighbors (Besag et al., 1991): where ξ_i represents the number of neighbors for node i. ϕ_j is a vector indicating the neighborhood relationship between node i and all nodes j (i ~ j) and τ_ϕ is the precision parameter (Equation 3).

Model fit, sensitivity, and validation

To assess model fit, we examined scatterplots and Pearson’s cross-correlation coefficients between observed and fitted values for the relative risk of total population seasonal intensity and for epidemic duration. We also examined scatterplots of standardized residuals and fitted values; standardized residuals were defined as (y – μ_ŷ)Ισ _ŷ, where μ _ŷ is the fitted value posterior mean and σ _ŷ is the fitted value standard deviation. Model sensitivity was assessed by comparing model fits and inference robustness when observations were randomly removed from the model, as described below under “Applications to missing data & inference robustness.”

For each disease burden measure, we compared models with no spatial dependence, county-level dependence only, state-level dependence only, and both county and state-level dependence. The goal of the county-level dependence was to capture local population flows, while state-level dependence attempted to capture state-level flight passenger flows (details in Appendix 1). We determined that models with only county-level spatial neighborhood structure best fit the data after examining the Deviance Information Criteria (DIC) values and spatial dependence coefficients of the four model structures. County-level spatial structure was subsequently used in all final model combinations. We report results from models with county-level dependence only.

For model validation, we compared model fitted values for seasonal intensity with CDC ILI and laboratory surveillance data (details in Appendix 1).

Statistical analysis

The goals of our modeling approach were to i) estimate the contribution of each predictor to influenza disease burden, ii) predict disease burden in locations with missing data, and iii) improve mapping of influenza disease burden. We performed approximate Bayesian inference using Integrated Nested Laplace Approximations (INLA) with the R-INLA package (www.r-inla.org) (Rue et al., 2009; Martins et al., 2013). INLA has demonstrated computational efficiency for latent Gaussian models and produced similar estimates for fixed parameters as established implementations of Markov Chain Monte Carlo (MCMC) methods for Bayesian inference (Carroll et al., 2015). Extensions to INLA have enabled its application to spatial, spatio-temporal, and zero-inflated models (Lindgren et al., 2011; Arab, 2015), which is implicated in INLA’s growing use in the disease mapping and spatial ecology communities (Schrödle and Held, 2011; Blangiardo et al., 2013)

Seasonal intensity was modeled with a lognormal distribution, and epidemic duration was modeled with a Poisson distribution and log link and excluded the offset term in Equation 2. Consequently, we note that all seasonal intensity models examine the relative risk of seasonal intensity, while epidemic duration models directly examine the duration in weeks. Multi-season models included all terms in Equation 2, while single-season models included all terms in Equation 2 except the season grouping (v_t). Model coefficients were interpreted as statistically significant if the 95% credible interval for a parameter’s posterior distribution failed to include zero.

Applications to missing data & inference robustness

We considered the robustness of our total population model results by refitting models where 20%, 40%, 60% and 80% of all county observations were replaced with NAs (sentinels in fixed locations), and where 20%, 40%, 60% and 80% of model observations were stratified by season and randomly replaced with NAs (sentinels in moving locations). We also refit three models where one, three, and five of seven flu seasons were randomly chosen and completely replaced with NAs (inclusion of historical data). To account for variability due to random chance, models were replicated ten times each with different random seeds. For each sequence of missingness, we compared the magnitude and significance of socio-environmental and measurement drivers, and the posterior distributions of county-season fitted values. Fitted value distributions were noted as significantly different if the interquartile ranges for two fitted values failed to overlap with each other.

Funding

This work was supported by the Jayne Koskinas Ted Giovanis Foundation for Health and Policy (JKTG) [dissertation support grant to ECL]; and the RAPIDD Program of the Science & Technology Directorate, Department of Homeland Security and the Fogarty International Center, National Institutes of Health. The content is solely the responsibility of the authors and does not necessarily reflect the official views of JKTG, the National Institutes of Health, or IMS Health.

Figure 1-Figure supplement 1.

Continental U.S. county maps for fitted (left) and observed (right) relative risk of seasonal intensity for remaining influenza seasons.

Figure 3-Figure supplement 1.

For the total population single-season seasonal intensity models, these are the 95% credible intervals for the posterior distributions of the socioenvironmental coefficients.

Figure 3-Figure supplement 1.

For the total population single-season seasonal intensity models, these are the 95% credible intervals for the posterior distributions of the measurement coefficients.

Figure 4-Figure supplement 1.

Diagram indicating changes to model inference as fewer moving-location sentinels reported data.

Figure 4-Figure supplement 2.

Map of model prediction match between the complete model and the 60% and 80% missing levels for moving-location sentinels.

Figure 4-Figure supplement 3.

Diagram indicating changes to model inference as historical seasons were randomly removed from the model.

Figure 4-Figure supplement 4.

Map of model prediction match between the complete model and models missing one, three, or five historical flu seasons.

Acknowledgments

The authors thank IMS Health for kindly sharing the medical claims data.

Appendix 1

Seasonal intensity model fit and validation Model fit

Appendix 1 Figure 1.

Observed vs. fitted values for the relative risk of total population seasonal intensity.

Appendix 1 Figure 2.

Residuals vs. fitted values for the total population log seasonal intensity.

Selection for spatial dependence terms

To determine county-level spatial neighbors, we started with the 2010 U.S. Census Bureau 500k resolution county shapefile, and connected abutting counties that were separated by bodies of water. We then used the clean shapefile to identify neighbors as counties that shared borders.

To define state-level spatial neighbors, monthly air travel passenger flows were collected from the Bureau of Transportation Statistics T-100 Domestic Market (U.S. Carriers) table from their website at http://www.transtats.bts.gov/. Airport flows were aggregated to the state-level and states were neighbors if passengers traveled between them from November 2007 through April 2008.

View this table:

Appendix 1 Table 1.

Comparison of total seasonal intensity models with different spatial dependence structures according to Deviance Information Criterion (DIC).

Appendix 1 Figure 3.

95% credible intervals for the state-level spatially structured coefficients when modeling seasonal intensity with state-level spatial dependence (ϕ_i). None of the spatially structured state coefficient distribution were significant.

Validation to CDC surveillance data

We collected a) the percentage of ILI out of all patient visits among the total population, and child and adult populations as reported by CDC’s ILINet, and b) the percentage of positive influenza laboratory confirmations as reported by CDC laboratory surveillance. = We note that child and adult ILI percentage was calculated with a denominator of patient visits across all age groups due to limited data availability. Both CDC surveillance systems were reported at the HHS region level and aggregated cumulatively for each flu season in our study period. We then examined scatterplots and Pearson cross-correlation coefficients (double-sided test where H₀ = no difference) between the mean model fits (where we took the mean across all counties in a given HHS region) and each CDC surveillance dataset.

Appendix 1 Figure 4.

Mean model fit averaged across counties in a given HHS region vs. percentage of positive influenza laboratory confirmations in a given HHS region and flu season. The Pearson cross-correlation coefficient was 0.35 with a p-value of 0.003 for a double-sided hypothesis test.

Appendix 1 Figure 5.

Mean model fit averaged across counties in a given HHS region vs. cumulative percentage of ILI visits in a given HHS region for all age groups. The Pearson cross-correlation coefficient was 0.38 with a p-value of 0.001 for a double-sided hypothesis test.

Appendix 1 Figure 6.

Mean model fit averaged across counties in a given HHS region vs. cumulative percentage of ILI visits in a given HHS region for children. The Pearson cross-correlation coefficient was 0.42 with a p-value of 0.0002 for a double-sided hypothesis test.

Appendix 1 Figure 7.

Mean model fit averaged across counties in a given HHS region vs. cumulative percentage of ILI visits in a given HHS region for adults. The Pearson cross-correlation coefficient was 0.42 with a p-value of 0.0003 for a double-sided hypothesis test.

Appendix 2

Age-specific drivers of seasonal intensity Model Fit

Appendix 2 Figure 1.

Comparison of observed and predicted relative risk of seasonal intensity across flu seasons from 2002-2003 through 2008-2009 for children and adults.

Spatial and temporal patterns

Appendix 2 Figure 2.

Temporal group effects for seasonal intensity among children. 95% credible interval for flu season coefficients in child population seasonal intensity.

Appendix 2 Figure 3.

Spatial group effects for seasonal intensity among children. Continental U.S. maps highlighting states with significantly greater or lower child seasonal intensity.

Appendix 2 Figure 4.

Temporal group effects for seasonal intensity among adults. 95% credible interval for flu season coefficients in adult population seasonal intensity.

Appendix 2 Figure 5.

Spatial group effects for seasonal intensity among adults. Continental U.S. maps highlighting states with significantly greater or lower adult seasonal intensity.

Socio-environmental and measurement drivers

In reference to the total seasonal intensity results, the child and adult models shared the same significant positive associations for the interaction term between child population and influenza B circulation and a proxy for prior immunity, and the same significant negative associations for adult and child population sizes, average flu season specific humidity, proportion of single person households, and infant vaccination coverage. The child and adult models shared a positive association with hospitals per capita where the total population model had no effect, and a negative association with estimated average household size where the total population model had a positive effect.

Child population seasonal intensity had a unique positive association with influenza B circulation and a unique negative association with elderly vaccination coverage. Adult population seasonal intensity had unique positive associations with H3 circulation among influenza A, proportion of the population in poverty, and elderly vaccination coverage, and a unique negative association with the interaction between adult and influenza H3.

Similar to the total population models, the child and adult seasonal intensity models had significant positive associations with careseeking behavior and claims database coverage. However, both the child and adult seasonal intensity models had significant negative associations with proportion of the population with health insurance, where the total population model demonstrated no effect.

Appendix 2 Figure 6.

Diagram comparing model inference between total, child, and adult seasonal intensity.

Appendix 3

Drivers of epidemic duration Model fit

Appendix 3 Figure 1.

Observed versus fitted values for epidemic duration.

Appendix 3 Figure 2.

Residuals versus fitted values for epidemic duration.

Appendix 3 Figure 3.

Continental U.S. county maps for fitted (left) and observed (right) epidemic duration in weeks from 2002-03 through 2008-09.

Spatial and temporal patterns

Appendix 3 Figure 4.

Temporal group effects for influenza-like illness. 95% credible interval for flu season coefficients in epidemic duration.

Appendix 3 Figure 5.

Spatial group effects for influenza-like illness. Continental U.S. map highlighting states with significantly longer or shorter epidemic durations.

Socio-environmental and measurement drivers

Appendix 3 Figure 6.

For the total population multi-season epidemic duration models, these are the 95% credible intervals for the posterior distributions of the socio-environmental coefficients and B) measurement-related coefficients.

Appendix 3 Figure 7.

For the total population multi-season epidemic duration models, these are the 95% credible intervals for the posterior distributions of measurement-related coefficients.

Appendix 4

Comparison of disease burden metrics

Appendix 4 Figure 1.

Comparison of epidemic duration and relative risk for seasonal intensity among fitted (left) and observed (right) values.

Appendix 5

Model predictors Checks for multicollinearity

We checked for multicollinearity among predictors by examining Spearman rank crosscorrelation coefficients between all pairs of final model predictors (excluding interaction terms). No single pair had a linear correlation coefficient that exceeded a magnitude of 0.6.

Appendix 5 Figure 1.

Spearman rank cross-correlation matrix for all pairs of final model predictors.

Additionally, we ran our multi-season seasonal intensity model with each coefficient individually. Multicollinearity between predictors may sometimes be detected when a predictor significantly deviates from zero in the single predictor model, but does not appear to have an effect in a multivariate context. Some predictors (pollution, popDensity, fluB) that were significant in the single predictor context no longer had an effect in our complete model (and vice versa for householdSize and child). Nevertheless, all of these predictors had small effect sizes in both single and multivariate models, and the other predictors that were significant in both models retained effect sizes with the same order of magnitude and directionality.

Appendix 5 Figure 2.

These are the 95% credible intervals among multi-season models with a single predictor for seasonal intensity.

Medical claims coverage

Medical claims database coverage increased over time across each state.

Appendix 5 Figure 3.

Medical claims database coverage by year and state. Colors represent states that belong to the same HHS region. The black horizontal line at 20% effective physician coverage is a visual guide to ease the comparison of data across panels.

References

↵
Adler NE, Newman K. Socioeconomic disparities in health: Pathways and policies. Health Aff. 2002; 21(2):60–76. doi: 10.1377/hlthaff.21.2.60.
OpenUrl Abstract/FREE Full Text
↵
Apolloni A, Poletto C, Colizza V. Age-specific contacts and travel patterns in the spatial spread of 2009 H1N1 influenza pandemic. BMC Infect Dis. 2013 jan; 13:176. doi: 10.1186/1471-2334-13-176.
OpenUrl CrossRef PubMed
↵
Arab A. Spatial and Spatio-Temporal Models for Modeling Epidemiological Data with Excess Zeros. Int J Environ Res Public Health. 2015; 12(9):10536–10548. doi: 10.3390/ijerph120910536.
OpenUrl CrossRef
↵
Banerjee S, Carlin BP, Gelfand AE. Hierarchical Modeling and Analysis for Spatial Data. Second ed. Boca Raton (FL): CRC Press; 2015.
↵
Bansal S, Chowell G, Simonsen L, Vespignani A, Viboud C. Big Data for Infectious Disease Surveillance and Modeling. J Infect Dis. 2016; 214(suppl 4):S375–S379. doi: 10.1093/infdis/jiw400.
OpenUrl CrossRef
↵
Bansal S, Pourbohloul B, Hupert N, Grenfell B, Meyers LA. The shifting demographic landscape of pandemic influenza. PLoS One. 2010 jan; 5(2):e9360. doi: 10.1371/journal.pone.0009360.
OpenUrl CrossRef PubMed
↵
Barreca AI, Shimshack JP. Absolute humidity, temperature, and influenza mortality: 30 years of county-level evidence from the United States. Am J Epidemiol. 2012 oct; 176 Suppl(7):S114–22. doi: 10.1093/aje/kws259.
OpenUrl CrossRef PubMed Web of Science
↵
Beauté J, Zucs P, Korsun N, Bragstad K, Enouf V, Kossyvakis A, et al. Age-specific differences in influenza virus type and subtype distribution in the 2012/2013 season in 12 European countries. Epidemiol Infect. 2015 oct; 143(14):2950–2958. doi: 10.1017/S0950268814003422.
OpenUrl CrossRef
↵
Bedford T, Suchard Ma, Lemey P, Dudas G, Gregory V, Hay AJ, et al. Data from: Integrating influenza antigenic dynamics with molecular evolution. Dryad Digit Repos. 2014; doi: http://dx.doi.org/10.5061/dryad.rc515.
↵
Besag J, York J, Mollié A. Bayesian image restoration, with two applications in spatial statistics. Ann Inst Stat Math. 1991; 43(1):1–20. doi: 10.1007/BF00116466.
OpenUrl CrossRef PubMed
↵
Biggerstaff M, Jhung Ma, Reed C, Garg S, Balluz L, Fry aM, et al. Impact of medical and behavioural factors on influenza-like illness, healthcare-seeking, and antiviral treatment during the 2009 H1N1 pandemic: USA, 2009-2010. Epidemiol Infect. 2014 jan; 142(1):114–25. doi: 10.1017/S0950268813000654.
OpenUrl CrossRef Web of Science
↵
Biggerstaff M, Jhung M, Kamimoto L, Balluz L, Finelli L. Self-reported influenza-like illness and receipt of influenza antiviral drugs during the 2009 pandemic, United States, 2009-2010. Am J Public Health. 2012 oct;102(10):e21–26. doi: 10.2105/AJPH.2012.300651.
OpenUrl CrossRef PubMed
↵
Biggerstaff M, Jhung MA, Reed C, Fry AM, Balluz L, Finelli L. Influenza-like illness, the time to seek healthcare, and influenza antiviral receipt during the 2010-11 influenza season - United States. J Infect Dis. 2014; 210(4):535–44.
OpenUrl CrossRef PubMed
↵
Blangiardo M, Cameletti M, Baio G, Rue H. Spatial and spatio-temporal models with R-INLA. Spat Spatiotemporal Epidemiol. 2013; 4:33–49.
OpenUrl CrossRef PubMed
↵
Boni MF, Gog JR, Andreasen V, Christiansen FB. Influenza drift and epidemic size: the race between generating and escaping immunity. Theor Popul Biol. 2004 mar; 65(2):179–91. doi: 10.1016/j.tpb.2003.10.002.
OpenUrl CrossRef PubMed Web of Science
↵
Brownstein JS, Wolfe CJ, Mandl KD. Empirical evidence for the effect of airline travel on inter-regional influenza spread in the United States. PLoS Med. 2006 sep; 3(10):e401. doi: 10.1371/journal.pmed.0030401.
OpenUrl CrossRef PubMed
↵
Cadieux G, Tamblyn R. Accuracy of physician billing claims for identifying acute respiratory infections in primary care. Health Serv Res. 2008; 43(6):2223–2238. doi: 10.1111/j.1475-6773.2008.00873.x.
OpenUrl CrossRef PubMed Web of Science
↵
Carroll R, Lawson AB, Faes C, Kirby RS, Aregay M, Watjou K. Comparing INLA and OpenBUGS for hierarchical Poisson modeling in disease mapping. Spat Spatiotemporal Epidemiol. 2015; 14-15:45–54. doi: 10.1016/j.sste.2015.08.001.
OpenUrl CrossRef
↵
Cauchemez S, Carrat F, Viboud C, Valleron aJ, Boëlle PY. A Bayesian MCMC approach to study transmission of influenza: application to household longitudinal data. Stat Med. 2004 nov; 23(22):3469–87. doi: 10.1002/sim.1912.
OpenUrl CrossRef PubMed Web of Science
↵
Charaudeau S, Pakdaman K, Boëlle PY. Commuter mobility and the spread of infectious diseases: application to influenza in France. PLoS One. 2014 jan; 9(1):e83002. doi: 10.1371/journal.pone.0083002.
OpenUrl CrossRef PubMed
↵
Charland KM, Brownstein JS, Verma A, Brien S, Buckeridge DL. Socio-economic disparities in the burden of seasonal influenza: The effect of social and material deprivation on rates of influenza infection. PLoS One. 2011; 6(2):1–5. doi: 10.1371/journal.pone.0017207.
OpenUrl CrossRef PubMed
↵
Charu V, Zeger S, Gog J, Bjørnstad ON, Kissler S, Simonsen L, et al. Human mobility and the spatial transmission of influenza in the United States. PLOS Comput Biol. 2017; 13(2):e1005382. doi: 10.1371/journal.pcbi.1005382.
OpenUrl CrossRef PubMed
↵
Cohen JM, Civitello DJ, Brace AJ, Feichtinger EM, Ortega CN, Richardson JC, et al. Spatial scale modulates the strength of ecological processes driving disease distributions. Proc Natl Acad Sci. 2016; p. 201521657. doi: 10.1073/pnas.1521657113.
OpenUrl Abstract/FREE Full Text
↵
Crépey P, Barthélemy M. Detecting robust patterns in the spread of epidemics: a case study of influenza in the United States and France. Am J Epidemiol. 2007 dec; 166(11):1244–51. doi: 10.1093/aje/kwm266.
OpenUrl CrossRef PubMed Web of Science
↵
Denoeud L, Turbelin C, Ansart S, Valleron AJ, Flahault A, Carrat F. Predicting pneumonia and influenza mortality from morbidity data. PLoS One. 2007 jan; 2(5):e464. doi: 10.1371/journal.pone.0000464.
OpenUrl CrossRef PubMed
↵
Deyle ER, Maher MC, Hernandez RD, Basu S, Sugihara G. Global environmental drivers of influenza. Proc Natl Acad Sci. 2016; doi: 10.1073/pnas.1607747113.
OpenUrl Abstract/FREE Full Text
↵
Du X, Dong L, Lan Y, Peng Y, Wu A, Zhang Y, et al. Mapping of H3N2 influenza antigenic evolution in China reveals a strategy for vaccine strain recommendation. Nat Commun. 2012; 3:709. doi: 10.1038/ncomms1710.
OpenUrl CrossRef PubMed
↵
Ewing A, Lee EC, Viboud C, Bansal S. Contact, travel, and transmission: The impact of winter holidays on influenza dynamics in the United States. J Infect Dis. 2016; doi: https://doi.org/10.1093/infdis/jiw642.
↵
Frank AL, Taber LH, Wells JM. Comparison of Infection Rates and Severity of Illness for Influenza A Sub-types H1N1 and H3N2. J Infect Dis. 1985; 151(1):73–80.
OpenUrl CrossRef PubMed Web of Science
↵
Gog JR, Ballesteros S, Viboud C, Simonsen L, Bjornstad ON, Shaman J, et al. Spatial Transmission of 2009 Pandemic Influenza in the US. PLoS Comput Biol. 2014 jun; 10(6):e1003635. doi: 10.1371/journal.pcbi.1003635.
OpenUrl CrossRef PubMed
↵
Gostic KM, Ambrose M, Worobey M, Lloyd-Smith JO. Potent protection against H5N1 and H7N9 influenza via childhood hemagglutinin imprinting. Science (80-). 2016; 354(6313):722–726. doi: 10.1126/science.aag1322.
OpenUrl Abstract/FREE Full Text
↵
Grais RF, Ellis JH, Glass GE. Assessing the impact of airline travel on the geographic spread of pandemic influenza. Eur J Epidemiol. 2003; 18(11):1065–1072. doi: 10.1023/A:1026140019146.
OpenUrl CrossRef PubMed Web of Science
↵
Grantz KH, Rane MS, Salje H, Glass GE, Schachterle SE, Cummings DAT. Disparities in influenza mortality and transmission related to sociodemographic factors within Chicago in the pandemic of 1918. Proc Natl Acad Sci. 2016; 113(48):13839–13844. doi: 10.1073/pnas.1612838113.
OpenUrl Abstract/FREE Full Text
↵
Hadler JL, Yousey-Hindes K, Pérez A, Anderson EJ, Bargsten M, Bohm SR, et al. Influenza-Related Hospitalizations and Poverty Levels — United States, 2010-2012. Morb Mortal Wkly Rep. 2016; 65(05):101–105. doi: 10.15585/mmwr.mm6505a1.
OpenUrl CrossRef PubMed
↵
Hayward AC, Fragaszy EB, Bermingham A, Wang L, Copas A, Edmunds WJ, et al. Comparative community burden and severity of seasonal and pandemic influenza: Results of the Flu Watch cohort study. Lancet Respir Med. 2014; 2(6):445–454. doi: 10.1016/S2213-2600(14)70034-7.
OpenUrl CrossRef
↵
Hotez PJ. Neglected Infections of Poverty in the United States of America. PLoS Negl Trop Dis. 2008; 2(6):e256. doi: 10.1371/journal.pntd.0000256.
OpenUrl CrossRef PubMed
↵
Khiabanian H, Farrell GM, St George K, Rabadan R. Differences in patient age distribution between influenza A subtypes. PLoS One. 2009 jan; 4(8):e6832. doi: 10.1371/journal.pone.0006832.
OpenUrl CrossRef PubMed
↵
Killingley B, Nguyen-Van-Tam J. Routes of influenza transmission. Influenza Other Respi Viruses. 2013; 7(SUPPL.2):42–51. doi: 10.1111/irv.12080.
OpenUrl CrossRef
↵
Kostova D, Reed C, Finelli L, Cheng PY, Gargiullo PM, Shay DK, et al. Influenza Illness and Hospitalizations Averted by Influenza Vaccination in the United States, 2005-2011. PLoS One. 2013 jan; 8(6):e66312. doi: 10.1371/journal.pone.0066312.
OpenUrl CrossRef PubMed
↵
Kucharski AJ, Kwok KO, Wei VWI, Cowling BJ, Read JM, Lessler J, et al. The Contribution of Social Behaviour to the Transmission of Influenza A in a Human Population. PLoS Pathog. 2014 jun; 10(6):e1004206. doi: 10.1371/journal.ppat.1004206.
OpenUrl CrossRef PubMed
↵
Kumar S, Piper K, Galloway DD, Hadler JL, Grefenstette JJ. Is population structure sufficient to generate area-level inequalities in influenza rates? An examination using agent-based models. BMC Public Health. 2015; 15(1):947. doi: 10.1186/s12889-015-2284-2.
OpenUrl CrossRef
↵
Lau MSY, Cowling BJ, Cook AR, Riley S. Inferring influenza dynamics and control in households. Proc Natl Acad Sci. 2015; p. 201423339. doi: 10.1073/pnas.1423339112.
OpenUrl Abstract/FREE Full Text
↵
Lawson AB. Bayesian Disease Mapping: hierarchical modeling in spatial epidemiology. 2 ed. New York: CRC Press; 2013.
↵
Lee EC, Asher JM, Goldlust S, Kraemer JD, Lawson AB, Bansal S. Mind the Scales: Harnessing Spatial Big Data for Infectious Disease Surveillance and Inference. J Infect Dis. 2016; 214(Suppl 4):S409–S413. doi: 10.1093/infdis/jiw344.
OpenUrl CrossRef
↵
Lee EC, Viboud C, Simonsen L, Khan F, Bansal S. Detecting Signals of Seasonal Influenza Severity through Age Dynamics. BMC Infect Dis. 2015; 15(587). doi: 10.1186/s12879-015-1318-9.
OpenUrl CrossRef
↵
Lemaitre M, Carrat F. Comparative age distribution of influenza morbidity and mortality during seasonal influenza epidemics and the 2009 H1N1 pandemic. BMC Infect Dis. 2010 jan; 10(April 2009):162. doi: 10.1186/1471-2334-10-162.
OpenUrl CrossRef PubMed
↵
Lemey P, Rambaut A, Bedford T, Faria N, Bielejec F, Baele G, et al. Unifying Viral Genetics and Human Transportation Data to Predict the Global Transmission Dynamics of Human Influenza H3N2. PLoS Pathog. 2014; 10(2). doi: 10.1371/journal.ppat.1003932.
OpenUrl CrossRef PubMed
↵
Lindgren F, Rue H, Lindström J. An explicit link between Gaussian fields and Gaussian Markov random field: The stochastic partial differential equations approach. J R Stat Soc Ser B Stat Methodol. 2011; 73:423–498. doi: 10.1111/j.1467-9868.2011.00777.x.
OpenUrl CrossRef
↵
Liu M, Zhao X, Hua S, Du X, Peng Y, Li X, et al. Antigenic Patterns and Evolution of the Human Influenza A (H1N1) Virus. Sci Rep. 2015; 5:14171. doi: 10.1038/srep14171.
OpenUrl CrossRef
↵
Lofgren E, Fefferman NH, Naumov YN, Gorski J, Naumova EN. Influenza seasonality: underlying causes and modeling theories. J Virol. 2007 jun; 81(11):5429–36. doi: 10.1128/JVI.01680-06.
OpenUrl FREE Full Text
Longini IM, Koopman JS, Monto aS, Fox JP. Estimating household and community transmission parameters for influenza. Am J Epidemiol. 1982 may; 115(5):736–51.
OpenUrl PubMed Web of Science
↵
Lowcock EC, Rosella LC, Foisy J, McGeer A, Crowcroft N. The social determinants of health and pandemic H1N1 2009 influenza severity. Am J Public Health. 2012; 102(8):51–58. doi: 10.2105/AJPH.2012.300814.
OpenUrl CrossRef
↵
Lowen AC, Mubareka S, Steel J, Palese P. Influenza virus transmission is dependent on relative humidity and temperature. PLoS Pathog. 2007 oct; 3(10):1470–6. doi: 10.1371/journal.ppat.0030151.
OpenUrl CrossRef PubMed Web of Science
↵
Lowen AC, Steel J. Roles of humidity and temperature in shaping influenza seasonality. J Virol. 2014; 88(14):7692–5. doi: 10.1128/JVI.03544-13.
OpenUrl Abstract/FREE Full Text
↵
Martins TG, Simpson D, Lindgren F, Rue H. Bayesian computing with INLA: New features. Comput Stat Data Anal. 2013; 67:68–83. doi: 10.1016/j.csda.2013.04.014.
OpenUrl CrossRef Web of Science
↵
Monto AS, Ullman BM. Acute Respiratory Illness in an American Community: The Tecumseh Rspiratory. JAMA. 1974; 227(2):164–169.
OpenUrl CrossRef PubMed Web of Science
↵
Moorthy M, Castronovo D, Abraham A, Bhattacharyya S, Gradus S, Gorski J, et al. Deviations in influenza seasonality: odd coincidence or obscure consequence? Clin Microbiol Infect. 2012; 18(10):955–962.
OpenUrl PubMed
↵
Morgenstern H. Uses of ecologic analysis in epidemiologic research. Am J Public Health. 1982; 72(12):1336–1344. doi: 10.2105/AJPH.72.12.1336.
OpenUrl CrossRef PubMed Web of Science
↵
Mossong J, Hens N, Jit M, Beutels P, Auranen K, Mikolajczyk R, et al. Social contacts and mixing patterns relevant to the spread of infectious diseases. PLoS Med. 2008 mar; 5(3):e74. doi: 10.1371/journal.pmed.0050074.
OpenUrl CrossRef PubMed
↵
Peters TR, Snively BM,Suerken CK, Blakeney E, Vannoy L, Poehling KA. Relative timing of influenza disease by age group. Vaccine. 2014; 32(48):6451–6456. doi: 10.1016/j.vaccine.2014.09.047.
OpenUrl CrossRef
↵
R Core Team, R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2015.
↵
Robinson WS. Ecological correlations and the behavior of individuals. Int J Epidemiol. 2009; 40(4):1134. doi: 10.1093/ije/dyr082.
OpenUrl CrossRef
↵
Rue H, Martino S, Chopin N. Approximate Bayesian Inference for Latent Gaussian Models Using Integrated Nested Laplace Approximations. J R Stat Soc Ser B. 2009; 71(2):319–392.
OpenUrl CrossRef
↵
Santillana M, Nguyen AT, Louie T, Zink A, Gray J, Sung I, et al. Cloud-based Electronic Health Records for Real-time, Region-specific Influenza Surveillance. Sci Rep. 2016; 6(April):25732. doi: 10.1038/srep25732.
OpenUrl CrossRef
Scarpino SV, Dimitrov NB, Meyers LA. Optimizing provider recruitment for influenza surveillance networks. PLoS Comput Biol. 2012; 8(4). doi: 10.1371/journal.pcbi.1002472.
OpenUrl CrossRef PubMed
↵
Scarpino SV, Scott JG, Eggo R, Dimitrov NB, Meyers LA. Data Blindspots: High-Tech Disease Surveillance Misses the Poor. Online J Public Health Inform. 2016; 8(1):2579. doi: 10.5210/OJPHI.V8I1.6451.
OpenUrl CrossRef
↵
Schanzer D, Vachon J, Pelletier L. Age-specific differences in influenza A epidemic curves: do children drive the spread of influenza epidemics? Am J Epidemiol. 2011 jul; 174(1):109–17. doi: 10.1093/aje/kwr037.
OpenUrl CrossRef PubMed Web of Science
↵
Schanzer DL, Langley JM, Dummer T, Aziz S. The geographic synchrony of seasonal influenza: a waves across Canada and the United States. PLoS One. 2011 jan; 6(6):e21471. doi: 10.1371/journal.pone.0021471.
OpenUrl CrossRef PubMed
↵
Schanzer DL, Langley JM, Dummer T, Viboud C, Tam TWS. A composite epidemic curve for seasonal influenza in Canada with an international comparison. Influenza Other Respi Viruses. 2010 sep; 4(5):295–306. doi: 10.1111/j.1750-2659.2010.00154x
OpenUrl CrossRef PubMed Web of Science
↵
Schrödle B, Held L. Spatio-temporal disease mapping using INLA. Environmetrics. 2011; 22(6):725–734. doi: 10.1002/env.1065.
OpenUrl CrossRef Web of Science
↵
Shaman J, Kohn M. Absolute humidity modulates influenza survival, transmission, and seasonality. Proc Natl Acad Sci U S A. 2009 mar; 106(9):3243–8. doi: 10.1073/pnas.0806852106.
OpenUrl Abstract/FREE Full Text
↵
Shaman J, Pitzer VE, Viboud C, Grenfell BT, Lipsitch M. Absolute humidity and the seasonal onset of influenza in the continental United States. PLoS Biol. 2010 feb; 8(2):e1000316. doi: 10.1371/journal.pbio.1000316.
OpenUrl CrossRef PubMed
↵
Simonsen L, Clarke MJ, Williamson GD, Stroup DF, Arden NH, Schonberger LB. The impact of influenza epidemics on mortality: introducing a severity index. Am J Public Health. 1997 dec; 87(12):1944–50.
OpenUrl CrossRef PubMed Web of Science
↵
Simonsen L, Gog JR, Olson D, Viboud C. Infectious Disease Surveillance in the Big Data Era: Towards Faster and Locally Relevant Systems. J Infect Dis. 2016; 214(Suppl 4):S380–S3385. doi: 10.1093/infdis/jiw376.
OpenUrl CrossRef
↵
Stark JH, Cummings DaT, Ermentrout B, Ostroff S, Sharma R, Stebbins S, et al. Local variations in spatial synchrony of influenza epidemics. PLoS One. 2012 jan; 7(8):e43528. doi: 10.1371/journal.pone.0043528.
OpenUrl CrossRef PubMed
↵
Steptoe A, Feldman PJ. Neighborhood Problems as Sources of Chronic Stress: Development of a Measure of Neighborhood Problems, and Associations With Socioeconomic Status and Health. Ann Behav Med. 2001; 23(3):177–185.
OpenUrl CrossRef PubMed Web of Science
↵
Tam K, Yousey-Hindes K, Hadler JL. Influenza-related hospitalization of adults associated with low census tract socioeconomic status and female sex in New Haven County, Connecticut, 2007-2011. Influenza Other Respi Viruses. 2014 may; 8(3):274–81. doi: 10.1111/irv.12231.
OpenUrl CrossRef Web of Science
↵
Tamerius J, Nelson MI, Zhou SZ, Viboud C, Miller Ma, Alonso WJ. Global influenza seasonality: Reconciling patterns across temperate and tropical regions. Environ Health Perspect. 2011; 119(4):439–445. doi: 10.1289/ehp.1002383.
OpenUrl CrossRef PubMed Web of Science
↵
Thompson WW, Comanor L, Shay DK. Epidemiology of Seasonal Influenza: Use of Surveillance Data and Statistical Models to Estimate the Burden of Disease. J Infect Dis. 2006; 194(Suppl 2):S82–S91.
OpenUrl CrossRef PubMed Web of Science
↵
Timpka T, Eriksson O, Spreco A, Gursky Ea, Strömgren M, Holm E,et al. Age as a determinant for dissemination of seasonal and pandemic influenza: An open cohort study of influenza outbreaks in Östergötland county, Sweden. PLoS One. 2012; 7(2). doi: 10.1371/journal.pone.0031746.
OpenUrl CrossRef PubMed
↵
Van Kerkhove MD, Vandemaele KAH, Shinde V, Jaramillo-gutierrez G, Koukounari A, Donnelly CA, et al. Risk Factors for Severe Outcomes following 2009 Influenza A (H1N1) Infection: A Global Pooled Analysis. PLOS Med. 2011; 8(7):e1001053. doi: 10.1371/journal.pmed.1001053.
OpenUrl CrossRef PubMed
↵
Viboud C, Bjørnstad ON, Smith DL, Simonsen L, Miller MA, Grenfell BT. Synchrony, Waves, and Spatial Hierarchies in the Spread of Influenza. Science (80-). 2006 apr; 312(April):447–451. doi: 10.1126/science.1125237.
OpenUrl Abstract/FREE Full Text
↵
Viboud C, Charu V, Olson D, Ballesteros S, Gog J, Khan F, et al. Demonstrating the use of high-volume electronic medical claims data to monitor local and regional influenza activity in the US. PLoS One. 2014 jan; 9(7):e102429. doi: 10.1371/journal.pone.0102429.
OpenUrl CrossRef PubMed
↵
1. Gelfand AE,
2. Diggle P,
3. Guttorp P,
4. Fuentes M
Waller LA, Carlin BP. Disease Mapping. In: Gelfand AE, Diggle P, Guttorp P, Fuentes M, editors. Handbook of Spatial Statistics Boca Raton (FL): CRC Press; 2010.p. 217–243. https://www.crcpress.com/Handbook-of-Spatial-Statistics/Gelfand-Diggle-Guttorp-Fuentes/9781420072877, doi: 10.1201/9781420072884-c14.Disease.
OpenUrl CrossRef
↵
Wallinga J, Teunis P, Kretzschmar M. Using data on social contacts to estimate age-specific transmission parameters for respiratory-spread infectious agents. Am J Epidemiol. 2006 nov; 164(10):936–44. doi: 10.1093/aje/kwj317.
OpenUrl CrossRef PubMed Web of Science
↵
Wenger JB, Naumova EN. Seasonal synchronization of influenza in the United States older adult population. PLoS One. 2010 jan; 5(4):e10187. doi: 10.1371/journal.pone.0010187.
OpenUrl CrossRef PubMed
↵
Yu H, Alonso WJ, Feng L, Tan Y, Shu Y, Yang W, et al. Characterization of regional influenza seasonality patterns in china and implications for vaccination strategies: spatio-temporal modeling of surveillance data. PLoS Med. 2013 dec; 10(11):e1001552. doi: 10.1371/journal.pmed.1001552.
OpenUrl CrossRef PubMed

View the discussion thread.

Posted March 03, 2017.

Download PDF

Citation Tools

Subject Area

Epidemiology

Subject Areas

All Articles

Animal Behavior and Cognition (5210)
Biochemistry (11740)
Bioengineering (8750)
Bioinformatics (29189)
Biophysics (14967)
Cancer Biology (12093)
Cell Biology (17410)
Clinical Trials (138)
Developmental Biology (9420)
Ecology (14178)
Epidemiology (2067)
Evolutionary Biology (18301)
Genetics (12239)
Genomics (16797)
Immunology (11865)
Microbiology (28070)
Molecular Biology (11583)
Neuroscience (60953)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4957)
Plant Biology (10425)
Scientific Communication and Education (1683)
Synthetic Biology (2884)
Systems Biology (7338)
Zoology (1651)

[1] ↵
Adler NE, Newman K. Socioeconomic disparities in health: Pathways and policies. Health Aff. 2002; 21(2):60–76. doi: 10.1377/hlthaff.21.2.60.
OpenUrl Abstract/FREE Full Text

[2] ↵
Apolloni A, Poletto C, Colizza V. Age-specific contacts and travel patterns in the spatial spread of 2009 H1N1 influenza pandemic. BMC Infect Dis. 2013 jan; 13:176. doi: 10.1186/1471-2334-13-176.
OpenUrl CrossRef PubMed

[3] ↵
Arab A. Spatial and Spatio-Temporal Models for Modeling Epidemiological Data with Excess Zeros. Int J Environ Res Public Health. 2015; 12(9):10536–10548. doi: 10.3390/ijerph120910536.
OpenUrl CrossRef

[4] ↵
Banerjee S, Carlin BP, Gelfand AE. Hierarchical Modeling and Analysis for Spatial Data. Second ed. Boca Raton (FL): CRC Press; 2015.

[5] ↵
Bansal S, Chowell G, Simonsen L, Vespignani A, Viboud C. Big Data for Infectious Disease Surveillance and Modeling. J Infect Dis. 2016; 214(suppl 4):S375–S379. doi: 10.1093/infdis/jiw400.
OpenUrl CrossRef

[6] ↵
Bansal S, Pourbohloul B, Hupert N, Grenfell B, Meyers LA. The shifting demographic landscape of pandemic influenza. PLoS One. 2010 jan; 5(2):e9360. doi: 10.1371/journal.pone.0009360.
OpenUrl CrossRef PubMed

[7] ↵
Barreca AI, Shimshack JP. Absolute humidity, temperature, and influenza mortality: 30 years of county-level evidence from the United States. Am J Epidemiol. 2012 oct; 176 Suppl(7):S114–22. doi: 10.1093/aje/kws259.
OpenUrl CrossRef PubMed Web of Science

[8] ↵
Beauté J, Zucs P, Korsun N, Bragstad K, Enouf V, Kossyvakis A, et al. Age-specific differences in influenza virus type and subtype distribution in the 2012/2013 season in 12 European countries. Epidemiol Infect. 2015 oct; 143(14):2950–2958. doi: 10.1017/S0950268814003422.
OpenUrl CrossRef

[9] ↵
Bedford T, Suchard Ma, Lemey P, Dudas G, Gregory V, Hay AJ, et al. Data from: Integrating influenza antigenic dynamics with molecular evolution. Dryad Digit Repos. 2014; doi: http://dx.doi.org/10.5061/dryad.rc515.

[10] ↵
Besag J, York J, Mollié A. Bayesian image restoration, with two applications in spatial statistics. Ann Inst Stat Math. 1991; 43(1):1–20. doi: 10.1007/BF00116466.
OpenUrl CrossRef PubMed

[11] ↵
Biggerstaff M, Jhung Ma, Reed C, Garg S, Balluz L, Fry aM, et al. Impact of medical and behavioural factors on influenza-like illness, healthcare-seeking, and antiviral treatment during the 2009 H1N1 pandemic: USA, 2009-2010. Epidemiol Infect. 2014 jan; 142(1):114–25. doi: 10.1017/S0950268813000654.
OpenUrl CrossRef Web of Science

[12] ↵
Biggerstaff M, Jhung M, Kamimoto L, Balluz L, Finelli L. Self-reported influenza-like illness and receipt of influenza antiviral drugs during the 2009 pandemic, United States, 2009-2010. Am J Public Health. 2012 oct;102(10):e21–26. doi: 10.2105/AJPH.2012.300651.
OpenUrl CrossRef PubMed

[13] ↵
Biggerstaff M, Jhung MA, Reed C, Fry AM, Balluz L, Finelli L. Influenza-like illness, the time to seek healthcare, and influenza antiviral receipt during the 2010-11 influenza season - United States. J Infect Dis. 2014; 210(4):535–44.
OpenUrl CrossRef PubMed

[14] ↵
Blangiardo M, Cameletti M, Baio G, Rue H. Spatial and spatio-temporal models with R-INLA. Spat Spatiotemporal Epidemiol. 2013; 4:33–49.
OpenUrl CrossRef PubMed

[15] ↵
Boni MF, Gog JR, Andreasen V, Christiansen FB. Influenza drift and epidemic size: the race between generating and escaping immunity. Theor Popul Biol. 2004 mar; 65(2):179–91. doi: 10.1016/j.tpb.2003.10.002.
OpenUrl CrossRef PubMed Web of Science

[16] ↵
Brownstein JS, Wolfe CJ, Mandl KD. Empirical evidence for the effect of airline travel on inter-regional influenza spread in the United States. PLoS Med. 2006 sep; 3(10):e401. doi: 10.1371/journal.pmed.0030401.
OpenUrl CrossRef PubMed

[17] ↵
Cadieux G, Tamblyn R. Accuracy of physician billing claims for identifying acute respiratory infections in primary care. Health Serv Res. 2008; 43(6):2223–2238. doi: 10.1111/j.1475-6773.2008.00873.x.
OpenUrl CrossRef PubMed Web of Science

[18] ↵
Carroll R, Lawson AB, Faes C, Kirby RS, Aregay M, Watjou K. Comparing INLA and OpenBUGS for hierarchical Poisson modeling in disease mapping. Spat Spatiotemporal Epidemiol. 2015; 14-15:45–54. doi: 10.1016/j.sste.2015.08.001.
OpenUrl CrossRef

[19] ↵
Cauchemez S, Carrat F, Viboud C, Valleron aJ, Boëlle PY. A Bayesian MCMC approach to study transmission of influenza: application to household longitudinal data. Stat Med. 2004 nov; 23(22):3469–87. doi: 10.1002/sim.1912.
OpenUrl CrossRef PubMed Web of Science

[20] ↵
Charaudeau S, Pakdaman K, Boëlle PY. Commuter mobility and the spread of infectious diseases: application to influenza in France. PLoS One. 2014 jan; 9(1):e83002. doi: 10.1371/journal.pone.0083002.
OpenUrl CrossRef PubMed

[21] ↵
Charland KM, Brownstein JS, Verma A, Brien S, Buckeridge DL. Socio-economic disparities in the burden of seasonal influenza: The effect of social and material deprivation on rates of influenza infection. PLoS One. 2011; 6(2):1–5. doi: 10.1371/journal.pone.0017207.
OpenUrl CrossRef PubMed

[22] ↵
Charu V, Zeger S, Gog J, Bjørnstad ON, Kissler S, Simonsen L, et al. Human mobility and the spatial transmission of influenza in the United States. PLOS Comput Biol. 2017; 13(2):e1005382. doi: 10.1371/journal.pcbi.1005382.
OpenUrl CrossRef PubMed

[23] ↵
Cohen JM, Civitello DJ, Brace AJ, Feichtinger EM, Ortega CN, Richardson JC, et al. Spatial scale modulates the strength of ecological processes driving disease distributions. Proc Natl Acad Sci. 2016; p. 201521657. doi: 10.1073/pnas.1521657113.
OpenUrl Abstract/FREE Full Text

[24] ↵
Crépey P, Barthélemy M. Detecting robust patterns in the spread of epidemics: a case study of influenza in the United States and France. Am J Epidemiol. 2007 dec; 166(11):1244–51. doi: 10.1093/aje/kwm266.
OpenUrl CrossRef PubMed Web of Science

[25] ↵
Denoeud L, Turbelin C, Ansart S, Valleron AJ, Flahault A, Carrat F. Predicting pneumonia and influenza mortality from morbidity data. PLoS One. 2007 jan; 2(5):e464. doi: 10.1371/journal.pone.0000464.
OpenUrl CrossRef PubMed

[26] ↵
Deyle ER, Maher MC, Hernandez RD, Basu S, Sugihara G. Global environmental drivers of influenza. Proc Natl Acad Sci. 2016; doi: 10.1073/pnas.1607747113.
OpenUrl Abstract/FREE Full Text

[27] ↵
Du X, Dong L, Lan Y, Peng Y, Wu A, Zhang Y, et al. Mapping of H3N2 influenza antigenic evolution in China reveals a strategy for vaccine strain recommendation. Nat Commun. 2012; 3:709. doi: 10.1038/ncomms1710.
OpenUrl CrossRef PubMed

[28] ↵
Ewing A, Lee EC, Viboud C, Bansal S. Contact, travel, and transmission: The impact of winter holidays on influenza dynamics in the United States. J Infect Dis. 2016; doi: https://doi.org/10.1093/infdis/jiw642.

[29] ↵
Frank AL, Taber LH, Wells JM. Comparison of Infection Rates and Severity of Illness for Influenza A Sub-types H1N1 and H3N2. J Infect Dis. 1985; 151(1):73–80.
OpenUrl CrossRef PubMed Web of Science

[30] ↵
Gog JR, Ballesteros S, Viboud C, Simonsen L, Bjornstad ON, Shaman J, et al. Spatial Transmission of 2009 Pandemic Influenza in the US. PLoS Comput Biol. 2014 jun; 10(6):e1003635. doi: 10.1371/journal.pcbi.1003635.
OpenUrl CrossRef PubMed

[31] ↵
Gostic KM, Ambrose M, Worobey M, Lloyd-Smith JO. Potent protection against H5N1 and H7N9 influenza via childhood hemagglutinin imprinting. Science (80-). 2016; 354(6313):722–726. doi: 10.1126/science.aag1322.
OpenUrl Abstract/FREE Full Text

[32] ↵
Grais RF, Ellis JH, Glass GE. Assessing the impact of airline travel on the geographic spread of pandemic influenza. Eur J Epidemiol. 2003; 18(11):1065–1072. doi: 10.1023/A:1026140019146.
OpenUrl CrossRef PubMed Web of Science

[33] ↵
Grantz KH, Rane MS, Salje H, Glass GE, Schachterle SE, Cummings DAT. Disparities in influenza mortality and transmission related to sociodemographic factors within Chicago in the pandemic of 1918. Proc Natl Acad Sci. 2016; 113(48):13839–13844. doi: 10.1073/pnas.1612838113.
OpenUrl Abstract/FREE Full Text

[34] ↵
Hadler JL, Yousey-Hindes K, Pérez A, Anderson EJ, Bargsten M, Bohm SR, et al. Influenza-Related Hospitalizations and Poverty Levels — United States, 2010-2012. Morb Mortal Wkly Rep. 2016; 65(05):101–105. doi: 10.15585/mmwr.mm6505a1.
OpenUrl CrossRef PubMed

[35] ↵
Hayward AC, Fragaszy EB, Bermingham A, Wang L, Copas A, Edmunds WJ, et al. Comparative community burden and severity of seasonal and pandemic influenza: Results of the Flu Watch cohort study. Lancet Respir Med. 2014; 2(6):445–454. doi: 10.1016/S2213-2600(14)70034-7.
OpenUrl CrossRef

[36] ↵
Hotez PJ. Neglected Infections of Poverty in the United States of America. PLoS Negl Trop Dis. 2008; 2(6):e256. doi: 10.1371/journal.pntd.0000256.
OpenUrl CrossRef PubMed

[37] ↵
Khiabanian H, Farrell GM, St George K, Rabadan R. Differences in patient age distribution between influenza A subtypes. PLoS One. 2009 jan; 4(8):e6832. doi: 10.1371/journal.pone.0006832.
OpenUrl CrossRef PubMed

[38] ↵
Killingley B, Nguyen-Van-Tam J. Routes of influenza transmission. Influenza Other Respi Viruses. 2013; 7(SUPPL.2):42–51. doi: 10.1111/irv.12080.
OpenUrl CrossRef

[39] ↵
Kostova D, Reed C, Finelli L, Cheng PY, Gargiullo PM, Shay DK, et al. Influenza Illness and Hospitalizations Averted by Influenza Vaccination in the United States, 2005-2011. PLoS One. 2013 jan; 8(6):e66312. doi: 10.1371/journal.pone.0066312.
OpenUrl CrossRef PubMed

[40] ↵
Kucharski AJ, Kwok KO, Wei VWI, Cowling BJ, Read JM, Lessler J, et al. The Contribution of Social Behaviour to the Transmission of Influenza A in a Human Population. PLoS Pathog. 2014 jun; 10(6):e1004206. doi: 10.1371/journal.ppat.1004206.
OpenUrl CrossRef PubMed

[41] ↵
Kumar S, Piper K, Galloway DD, Hadler JL, Grefenstette JJ. Is population structure sufficient to generate area-level inequalities in influenza rates? An examination using agent-based models. BMC Public Health. 2015; 15(1):947. doi: 10.1186/s12889-015-2284-2.
OpenUrl CrossRef

[42] ↵
Lau MSY, Cowling BJ, Cook AR, Riley S. Inferring influenza dynamics and control in households. Proc Natl Acad Sci. 2015; p. 201423339. doi: 10.1073/pnas.1423339112.
OpenUrl Abstract/FREE Full Text

[43] ↵
Lawson AB. Bayesian Disease Mapping: hierarchical modeling in spatial epidemiology. 2 ed. New York: CRC Press; 2013.

[44] ↵
Lee EC, Asher JM, Goldlust S, Kraemer JD, Lawson AB, Bansal S. Mind the Scales: Harnessing Spatial Big Data for Infectious Disease Surveillance and Inference. J Infect Dis. 2016; 214(Suppl 4):S409–S413. doi: 10.1093/infdis/jiw344.
OpenUrl CrossRef

[45] ↵
Lee EC, Viboud C, Simonsen L, Khan F, Bansal S. Detecting Signals of Seasonal Influenza Severity through Age Dynamics. BMC Infect Dis. 2015; 15(587). doi: 10.1186/s12879-015-1318-9.
OpenUrl CrossRef

[46] ↵
Lemaitre M, Carrat F. Comparative age distribution of influenza morbidity and mortality during seasonal influenza epidemics and the 2009 H1N1 pandemic. BMC Infect Dis. 2010 jan; 10(April 2009):162. doi: 10.1186/1471-2334-10-162.
OpenUrl CrossRef PubMed

[47] ↵
Lemey P, Rambaut A, Bedford T, Faria N, Bielejec F, Baele G, et al. Unifying Viral Genetics and Human Transportation Data to Predict the Global Transmission Dynamics of Human Influenza H3N2. PLoS Pathog. 2014; 10(2). doi: 10.1371/journal.ppat.1003932.
OpenUrl CrossRef PubMed

[48] ↵
Lindgren F, Rue H, Lindström J. An explicit link between Gaussian fields and Gaussian Markov random field: The stochastic partial differential equations approach. J R Stat Soc Ser B Stat Methodol. 2011; 73:423–498. doi: 10.1111/j.1467-9868.2011.00777.x.
OpenUrl CrossRef

[49] ↵
Liu M, Zhao X, Hua S, Du X, Peng Y, Li X, et al. Antigenic Patterns and Evolution of the Human Influenza A (H1N1) Virus. Sci Rep. 2015; 5:14171. doi: 10.1038/srep14171.
OpenUrl CrossRef

[50] ↵
Lofgren E, Fefferman NH, Naumov YN, Gorski J, Naumova EN. Influenza seasonality: underlying causes and modeling theories. J Virol. 2007 jun; 81(11):5429–36. doi: 10.1128/JVI.01680-06.
OpenUrl FREE Full Text

[51] Longini IM, Koopman JS, Monto aS, Fox JP. Estimating household and community transmission parameters for influenza. Am J Epidemiol. 1982 may; 115(5):736–51.
OpenUrl PubMed Web of Science

[52] ↵
Lowcock EC, Rosella LC, Foisy J, McGeer A, Crowcroft N. The social determinants of health and pandemic H1N1 2009 influenza severity. Am J Public Health. 2012; 102(8):51–58. doi: 10.2105/AJPH.2012.300814.
OpenUrl CrossRef

[53] ↵
Lowen AC, Mubareka S, Steel J, Palese P. Influenza virus transmission is dependent on relative humidity and temperature. PLoS Pathog. 2007 oct; 3(10):1470–6. doi: 10.1371/journal.ppat.0030151.
OpenUrl CrossRef PubMed Web of Science

[54] ↵
Lowen AC, Steel J. Roles of humidity and temperature in shaping influenza seasonality. J Virol. 2014; 88(14):7692–5. doi: 10.1128/JVI.03544-13.
OpenUrl Abstract/FREE Full Text

[55] ↵
Martins TG, Simpson D, Lindgren F, Rue H. Bayesian computing with INLA: New features. Comput Stat Data Anal. 2013; 67:68–83. doi: 10.1016/j.csda.2013.04.014.
OpenUrl CrossRef Web of Science

[56] ↵
Monto AS, Ullman BM. Acute Respiratory Illness in an American Community: The Tecumseh Rspiratory. JAMA. 1974; 227(2):164–169.
OpenUrl CrossRef PubMed Web of Science

[57] ↵
Moorthy M, Castronovo D, Abraham A, Bhattacharyya S, Gradus S, Gorski J, et al. Deviations in influenza seasonality: odd coincidence or obscure consequence? Clin Microbiol Infect. 2012; 18(10):955–962.
OpenUrl PubMed

[58] ↵
Morgenstern H. Uses of ecologic analysis in epidemiologic research. Am J Public Health. 1982; 72(12):1336–1344. doi: 10.2105/AJPH.72.12.1336.
OpenUrl CrossRef PubMed Web of Science

[59] ↵
Mossong J, Hens N, Jit M, Beutels P, Auranen K, Mikolajczyk R, et al. Social contacts and mixing patterns relevant to the spread of infectious diseases. PLoS Med. 2008 mar; 5(3):e74. doi: 10.1371/journal.pmed.0050074.
OpenUrl CrossRef PubMed

[60] ↵
Peters TR, Snively BM,Suerken CK, Blakeney E, Vannoy L, Poehling KA. Relative timing of influenza disease by age group. Vaccine. 2014; 32(48):6451–6456. doi: 10.1016/j.vaccine.2014.09.047.
OpenUrl CrossRef

[61] ↵
R Core Team, R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2015.

[62] ↵
Robinson WS. Ecological correlations and the behavior of individuals. Int J Epidemiol. 2009; 40(4):1134. doi: 10.1093/ije/dyr082.
OpenUrl CrossRef

[63] ↵
Rue H, Martino S, Chopin N. Approximate Bayesian Inference for Latent Gaussian Models Using Integrated Nested Laplace Approximations. J R Stat Soc Ser B. 2009; 71(2):319–392.
OpenUrl CrossRef

[64] ↵
Santillana M, Nguyen AT, Louie T, Zink A, Gray J, Sung I, et al. Cloud-based Electronic Health Records for Real-time, Region-specific Influenza Surveillance. Sci Rep. 2016; 6(April):25732. doi: 10.1038/srep25732.
OpenUrl CrossRef

[65] Scarpino SV, Dimitrov NB, Meyers LA. Optimizing provider recruitment for influenza surveillance networks. PLoS Comput Biol. 2012; 8(4). doi: 10.1371/journal.pcbi.1002472.
OpenUrl CrossRef PubMed

[66] ↵
Scarpino SV, Scott JG, Eggo R, Dimitrov NB, Meyers LA. Data Blindspots: High-Tech Disease Surveillance Misses the Poor. Online J Public Health Inform. 2016; 8(1):2579. doi: 10.5210/OJPHI.V8I1.6451.
OpenUrl CrossRef

[67] ↵
Schanzer D, Vachon J, Pelletier L. Age-specific differences in influenza A epidemic curves: do children drive the spread of influenza epidemics? Am J Epidemiol. 2011 jul; 174(1):109–17. doi: 10.1093/aje/kwr037.
OpenUrl CrossRef PubMed Web of Science

[68] ↵
Schanzer DL, Langley JM, Dummer T, Aziz S. The geographic synchrony of seasonal influenza: a waves across Canada and the United States. PLoS One. 2011 jan; 6(6):e21471. doi: 10.1371/journal.pone.0021471.
OpenUrl CrossRef PubMed

[69] ↵
Schanzer DL, Langley JM, Dummer T, Viboud C, Tam TWS. A composite epidemic curve for seasonal influenza in Canada with an international comparison. Influenza Other Respi Viruses. 2010 sep; 4(5):295–306. doi: 10.1111/j.1750-2659.2010.00154x
OpenUrl CrossRef PubMed Web of Science

[70] ↵
Schrödle B, Held L. Spatio-temporal disease mapping using INLA. Environmetrics. 2011; 22(6):725–734. doi: 10.1002/env.1065.
OpenUrl CrossRef Web of Science

[71] ↵
Shaman J, Kohn M. Absolute humidity modulates influenza survival, transmission, and seasonality. Proc Natl Acad Sci U S A. 2009 mar; 106(9):3243–8. doi: 10.1073/pnas.0806852106.
OpenUrl Abstract/FREE Full Text

[72] ↵
Shaman J, Pitzer VE, Viboud C, Grenfell BT, Lipsitch M. Absolute humidity and the seasonal onset of influenza in the continental United States. PLoS Biol. 2010 feb; 8(2):e1000316. doi: 10.1371/journal.pbio.1000316.
OpenUrl CrossRef PubMed

[73] ↵
Simonsen L, Clarke MJ, Williamson GD, Stroup DF, Arden NH, Schonberger LB. The impact of influenza epidemics on mortality: introducing a severity index. Am J Public Health. 1997 dec; 87(12):1944–50.
OpenUrl CrossRef PubMed Web of Science

[74] ↵
Simonsen L, Gog JR, Olson D, Viboud C. Infectious Disease Surveillance in the Big Data Era: Towards Faster and Locally Relevant Systems. J Infect Dis. 2016; 214(Suppl 4):S380–S3385. doi: 10.1093/infdis/jiw376.
OpenUrl CrossRef

[75] ↵
Stark JH, Cummings DaT, Ermentrout B, Ostroff S, Sharma R, Stebbins S, et al. Local variations in spatial synchrony of influenza epidemics. PLoS One. 2012 jan; 7(8):e43528. doi: 10.1371/journal.pone.0043528.
OpenUrl CrossRef PubMed

[76] ↵
Steptoe A, Feldman PJ. Neighborhood Problems as Sources of Chronic Stress: Development of a Measure of Neighborhood Problems, and Associations With Socioeconomic Status and Health. Ann Behav Med. 2001; 23(3):177–185.
OpenUrl CrossRef PubMed Web of Science

[77] ↵
Tam K, Yousey-Hindes K, Hadler JL. Influenza-related hospitalization of adults associated with low census tract socioeconomic status and female sex in New Haven County, Connecticut, 2007-2011. Influenza Other Respi Viruses. 2014 may; 8(3):274–81. doi: 10.1111/irv.12231.
OpenUrl CrossRef Web of Science

[78] ↵
Tamerius J, Nelson MI, Zhou SZ, Viboud C, Miller Ma, Alonso WJ. Global influenza seasonality: Reconciling patterns across temperate and tropical regions. Environ Health Perspect. 2011; 119(4):439–445. doi: 10.1289/ehp.1002383.
OpenUrl CrossRef PubMed Web of Science

[79] ↵
Thompson WW, Comanor L, Shay DK. Epidemiology of Seasonal Influenza: Use of Surveillance Data and Statistical Models to Estimate the Burden of Disease. J Infect Dis. 2006; 194(Suppl 2):S82–S91.
OpenUrl CrossRef PubMed Web of Science

[80] ↵
Timpka T, Eriksson O, Spreco A, Gursky Ea, Strömgren M, Holm E,et al. Age as a determinant for dissemination of seasonal and pandemic influenza: An open cohort study of influenza outbreaks in Östergötland county, Sweden. PLoS One. 2012; 7(2). doi: 10.1371/journal.pone.0031746.
OpenUrl CrossRef PubMed

[81] ↵
Van Kerkhove MD, Vandemaele KAH, Shinde V, Jaramillo-gutierrez G, Koukounari A, Donnelly CA, et al. Risk Factors for Severe Outcomes following 2009 Influenza A (H1N1) Infection: A Global Pooled Analysis. PLOS Med. 2011; 8(7):e1001053. doi: 10.1371/journal.pmed.1001053.
OpenUrl CrossRef PubMed

[82] ↵
Viboud C, Bjørnstad ON, Smith DL, Simonsen L, Miller MA, Grenfell BT. Synchrony, Waves, and Spatial Hierarchies in the Spread of Influenza. Science (80-). 2006 apr; 312(April):447–451. doi: 10.1126/science.1125237.
OpenUrl Abstract/FREE Full Text

[83] ↵
Viboud C, Charu V, Olson D, Ballesteros S, Gog J, Khan F, et al. Demonstrating the use of high-volume electronic medical claims data to monitor local and regional influenza activity in the US. PLoS One. 2014 jan; 9(7):e102429. doi: 10.1371/journal.pone.0102429.
OpenUrl CrossRef PubMed

[84] ↵
Gelfand AE,
Diggle P,
Guttorp P,
Fuentes M
Waller LA, Carlin BP. Disease Mapping. In: Gelfand AE, Diggle P, Guttorp P, Fuentes M, editors. Handbook of Spatial Statistics Boca Raton (FL): CRC Press; 2010.p. 217–243. https://www.crcpress.com/Handbook-of-Spatial-Statistics/Gelfand-Diggle-Guttorp-Fuentes/9781420072877, doi: 10.1201/9781420072884-c14.Disease.
OpenUrl CrossRef

[85] Gelfand AE,

[86] Diggle P,

[87] Guttorp P,

[88] Fuentes M

[89] ↵
Wallinga J, Teunis P, Kretzschmar M. Using data on social contacts to estimate age-specific transmission parameters for respiratory-spread infectious agents. Am J Epidemiol. 2006 nov; 164(10):936–44. doi: 10.1093/aje/kwj317.
OpenUrl CrossRef PubMed Web of Science

[90] ↵
Wenger JB, Naumova EN. Seasonal synchronization of influenza in the United States older adult population. PLoS One. 2010 jan; 5(4):e10187. doi: 10.1371/journal.pone.0010187.
OpenUrl CrossRef PubMed

[91] ↵
Yu H, Alonso WJ, Feng L, Tan Y, Shu Y, Yang W, et al. Characterization of regional influenza seasonality patterns in china and implications for vaccination strategies: spatio-temporal modeling of surveillance data. PLoS Med. 2013 dec; 10(11):e1001552. doi: 10.1371/journal.pmed.1001552.
OpenUrl CrossRef PubMed

Socio-environmental and measurement factors drive spatial variation in influenza-like illness

Abstract

Introduction

Results

Temporal and spatial patterns of influenza-like illness

Drivers of seasonal intensity

Drivers of age-specific seasonal intensity

Drivers of epidemic duration

Applications to surveillance

Sentinels in fixed locations

Sentinels in moving locations

Inclusion of historical data

Discussion

Methods

Medical claims data

Defining influenza disease burden

Predictor data collection and variable selection

Environmental data

Social contact and population data

Flu-specific data

Prior immunity

Socioeconomic and access to care data

Medical claims measurement factors

Model structure

Model fit, sensitivity, and validation

Statistical analysis

Applications to missing data & inference robustness

Funding

Acknowledgments

Appendix 1

Seasonal intensity model fit and validation Model fit

Selection for spatial dependence terms

Validation to CDC surveillance data

Appendix 2

Age-specific drivers of seasonal intensity Model Fit

Spatial and temporal patterns

Socio-environmental and measurement drivers

Appendix 3

Drivers of epidemic duration Model fit

Spatial and temporal patterns

Socio-environmental and measurement drivers

Appendix 4

Comparison of disease burden metrics

Appendix 5

Model predictors Checks for multicollinearity

Medical claims coverage

References

Citation Manager Formats

Subject Area