Abstract
An animal’s movements and internal state transitions generate an “internal backdrop” of activity that is dynamically modulated. During behavior, this internal backdrop interacts with signals arising from incoming sensory stimuli and may have a substantial impact on task-related computations, like those underlying decision-making. To understand the joint effects of internal backdrop and task-imposed variables, we measured neural activity across the entire dorsal cortex of task-performing mice. We characterized internal backdrop using multiple measures of self-generated parameters, e.g., pupil diameter, whisking and body motion. Surprisingly, internal backdrop dominated neural activity across the entire cortex, dwarfing task-related variables and even sensory stimuli. Single neurons in frontal cortex were likewise dominated by internal backdrop. A linear model allowed us to account for multiple dimensions of internal backdrop and uncover hidden signatures of task-related activity. The internal backdrop therefore captures a fundamental dimension of complex behavior that must be accounted for when studying decision-making.
Highlights
We imaged cortex-wide neural activity during auditory and visual decisions in mice.
Cortical activity was surprisingly similar during sensory-guided versus random decisions.
Movement and state variables vastly outperformed task variables in predicting neural activity.
A linear model revealed hidden task-related activity in single neurons.
Introduction
Complex behaviors are accompanied by dynamic responses across cortical circuits. During decision-making, cortical activity reflects multiple processes including sensory inputs (Freedman and Assad, 2006), selection and integration of behaviorally-relevant information (Roitman and Shadlen, 2002), estimation and anticipation of reward (Bouret and Sara, 2004; Pratt and Mizumori, 2001), choice confidence (Kepecs et al., 2008) and recent trial history (Abrahamyan et al., 2016; Bichot and Schall, 1999; Manoach et al., 2007; Morcos and Harvey, 2016).
Many decision-making studies have acknowledged the potential impact of decision-related movements on neural activity. Because neural activity in many decision-making structures is known to reflect movements, it is essential to separate the impact of movements from that of decision formation. Movements that are associated with decision reporting, such as head orientation (Erlich et al., 2011), eye movements (Roitman and Shadlen, 2002) or licking (Allen et al., 2017), are therefore often taken into account to ensure that decision-related activity cannot be fully explained by the movements themselves.
Beyond decision-reporting, other movements are known to strongly modulate neural activity. For instance, whisking is critical for texture discrimination and object localization in mice (Chen et al., 2013; O’Connor et al., 2013). Running modulates the gain of visual inputs (Niell and Stryker, 2010) and is critical for integration of visual motion (Ayaz et al., 2013; Saleem et al., 2013) and predictive coding (Keller et al., 2012). Some movements are also known to modulate neural activity in multiple cortical areas (Allen et al., 2017; Ferezou et al., 2007; Shimaoka et al., 2018). A potential explanation for these widespread effects is that certain movements reflect changes in the animal’s internal state, like increased arousal during running (Niell and Stryker, 2010). Indeed, internal state can account for changes in neural activity of different sensory areas that are as strong as responses to sensory stimuli (Crochet and Petersen, 2006; Okun et al., 2015; Pachitariu et al., 2015). Internal state is also reflected in pupil dilation, which is associated with increased excitability and desynchronization of cortical neurons (Reimer et al., 2014). Importantly, movements and pupil dilation have different effects on cortical activity (Vinck et al., 2015), suggesting that internal state is multidimensional and driven by a variety of internal sources (Harris and Thiele, 2011). The combined effects of movements and internal state transitions can therefore be thought of as an ‘internal backdrop’ that is critical to consider when analyzing neural responses.
Broad measures of the internal backdrop are rarely incorporated into analyses of decision-making activity. This is in part because most studies of cortical modulation due to internal state have focused on sensory areas (Niell and Stryker, 2010; Okun et al., 2015; Pachitariu et al., 2015; Reimer et al., 2014; Vinck et al., 2015); the impact of the internal backdrop on decision-making areas is therefore poorly understood. Since most studies also use only narrow measures of internal state, like pupil dilation or running speed, the combined impact of multiple movements on neural activity is also unclear. Broadening this scope has been challenging because it requires measuring many different movements together with cortex-wide neural activity in task-performing animals.
To assess the impact of internal backdrop on decision-making, we used widefield imaging to measure neural activity across the entire dorsal cortex of mice performing auditory or visual decisions, while tracking a wide array of movements and pupil diameter. To evaluate how cortical activity was affected by task-related or self-generated variables, we built a linear encoding model. Surprisingly, animal movements captured the majority of signal variability across the cortex, outpacing other variables such as sensory stimuli, choice and reward. Moreover, task-aligned movements had a significant impact on trial-averaged data and accounted for features commonly attributed to cognitive task demands, like evidence accumulation, urgency, or motor planning. These observations argue that the internal backdrop has a much larger impact on neural activity during decision-making than previously appreciated.
Results
To measure cortex-wide neural dynamics during perceptual decisions, we trained mice to report the spatial position of an auditory or visual stimulus. Animals interacted with handles to initiate trials and lick spouts to report choices. Handles and spouts were controlled by servo motors to limit their accessibility to appropriate epochs in the task (Batista-Brito et al., 2017; Goard et al., 2016) (Fig. 1).
Stimuli were presented 0.875-1.125 s after handle touch and consisted of auditory or visual stimulus sequences. Each sequence consisted of two 0.6-s long presentations separated by a 0.5 s gap. After a 1 s delay, animals could report a decision and received a water reward when licking the spout that corresponded to the stimulus presentation side (Fig. 1B). Two distinct cohorts of animals were trained on either auditory or visual stimuli (but not both) and consequently achieved expert performance in the trained modality (Fig. 1C). Expert mice generalized the task timing, but not contingencies, to the untrained modality. This enabled us to measure cortical activity during either sensory-guided decisions or random guesses in the same animals (e.g., vision experts in blue were ~80% correct in visual trials but remained at novice level in auditory trials).
To study neural activity during decision making, we used a custom-built widefield macroscope (Ratzlaff and Grinvald, 1991) with a large 12.5 x 10.5 mm field of view (Fig. 1D). Mice were transgenic (Ai93; Emx-Cre; LSL-tTA; CaMKII-tTA), expressing the Ca2+-indicator GCaMP6f in excitatory neurons. Fluorescence was measured through the cleared skull (Guo et al., 2014). To avoid contamination from intrinsic signals (e.g., hemodynamic responses), we used excitation light at 473 nm to record Ca2+-dependent fluorescence and excitation light at 405 nm to record Ca2+-independent fluorescence (Lerner et al., 2015) on alternating frames. By rescaling and subtracting Ca2+-independent fluorescence we were then able to isolate a purely Ca2+-dependent signal (Allen et al., 2017; Wekselblatt et al., 2016). Using a combination of four brain landmarks, we aligned all data to the Allen Institute Common Coordinate Framework v3 (CCF, Figure S1). To confirm accurate CCF alignment, we performed retinotopic visual mapping (Marshel et al., 2011) in each animal and found high correspondence between functionally identified visual areas and the CCF (Fig. 1E).
Baseline-corrected fluorescence (ΔF/F) revealed significant modulation of neural activity across dorsal cortex during different episodes of the task (Fig. 1F, average response to visual trials, 22 sessions from 11 mice). While holding the handles, cortical activity was strongest in the somato-motor areas for hind- and forepaw (‘Hold’). The first visual stimulus caused robust activation of visual areas in posterior cortex and weaker responses in secondary motor cortex (M2) (‘Stim 1’). Activity in anterior cortex increased during stimulus presentation (‘Stim 2’) and the delay period (‘Delay’). When animals were allowed to respond, neural activity strongly increased across the entire dorsal cortex (‘Response’). A comparison of neural activity across conditions confirmed that neural activity was modulated by whether the stimulus was auditory vs. visual (Fig. 1G) and whether it was presented on the left vs. right (Fig. 1H). In both cases, differences across conditions were mainly restricted to primary and secondary visual areas. Activity in more anterior structures was nearly identical across conditions. This similarity may be because areas for motor planning are less lateralized (Li et al., 2015) and exhibit mixed tuning for both decision sides and modalities. Surprisingly, a comparison of neural activity in novice vs. expert decisions revealed almost no difference between the two trial categories (Fig. 1I). This similarity across the entire dorsal cortex was evident despite markedly different behavioral performance (Fig. 1C), suggesting that large parts of cortical activity did not distinguish informed decisions vs. guesses.
To better understand how behavior related to neural activity, we built a linear model. The model was designed to account for the fluorescence of each pixel via any time-varying combination of 23 possible behavioral variables, while at the same time preventing overfitting of the dataset. The predictor matrix (i.e., the design matrix) was constructed from sets of regressors, where each set was locked to a different sensory or motor event (Fig 2A, Steps 1-2). The regressors in each set formed a temporal sequence of pulses to allow the linear reconstruction of neural activity over time, relative to event onset. For sensory events, each regressor set contained regressors locked to each frame from stimulus onset until the end of the trial (‘Post-event’, blue). For motor events, regressors spanned a fixed duration of 0.5 s before until 1 s after event onset (‘Peri-event’, green). To account for cognitive task variables with no defined event onset, such as animal success in a given trial, we used regressor sets that spanned the entire trial (‘Whole trial’, black). We also included non-binary regressors, such as data from a piezo sensor underneath the animal to track hindpaw movements (‘Analog’, orange). Each behavioral variable was thus represented by a set of specific regressors. The model was fit to the data using ridge regression. Each regressor was assigned a β-weight, indicating how strongly that single regressor was linearly related to the neural activity in a given pixel (Fig. 2A, Step 3). To reduce computational cost, we used singular value decomposition (SVD) on the imaging data and predicted changes in data dimensions instead of individual pixels. Multiplying the full design matrix with the corresponding β-weights results in a model reconstruction of the imaging data (Fig. 2A, Step 4).
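As a minimal illustration of Steps 1-4, the sketch below builds one peri-event regressor set from time-shifted pulse copies, fits it with closed-form ridge regression, and reconstructs the data. All names and values are hypothetical, and a single fixed penalty stands in for the cross-validated, per-dimension regularization described in the Methods:

```python
import numpy as np

def event_regressor_set(event_frames, n_frames, pre, post):
    """Steps 1-2: time-shifted copies of a binary event vector, one column
    per lag. Peri-event sets use pre > 0; post-event sets use pre = 0."""
    pulse = np.zeros(n_frames)
    pulse[event_frames] = 1.0
    lags = np.arange(-pre, post)
    X = np.zeros((n_frames, lags.size))
    for j, lag in enumerate(lags):
        if lag >= 0:
            X[lag:, j] = pulse[:n_frames - lag]
        else:
            X[:lag, j] = pulse[-lag:]
    return X

def ridge_fit(X, Y, lam=10.0):
    """Step 3: closed-form ridge regression, one beta column per data dim."""
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ Y)

# Toy usage at 30 fps: whisking as a peri-event set (-0.5 s to +1 s).
rng = np.random.default_rng(0)
n_frames = 3000
whisks = np.sort(rng.choice(n_frames - 60, size=40, replace=False))
X = event_regressor_set(whisks, n_frames, pre=15, post=30)
Y = rng.standard_normal((n_frames, 200))   # e.g., 200 widefield SVD dims
beta = ridge_fit(X, Y)
Y_hat = X @ beta                           # Step 4: model reconstruction
```

In the full model, one such regressor set per behavioral variable is concatenated column-wise into the design matrix before fitting.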
In addition to traditional behavioral measurements (such as lick times), we leveraged video data from two cameras, observing the animal’s face and body. These data were used in two ways: first, we used video data to estimate variables known to modulate neural activity, such as whisking and pupil size (Fig. 2B). Second, we used SVD to extract the 200 highest-variance video dimensions and used them as analog regressors to provide additional information on animal movements that we could not track otherwise or had not previously considered (Powell et al., 2015; Stringer et al., 2018). To ensure that video regressors did not overlap with other model regressors, we used a QR decomposition to orthogonalize video regressors from already-described model variables.
Cortical maps of β-weights confirmed expected features of the data, matching known functions of visual and motor cortices. For example, pixel weights located in left V1 were highly positive in response to a rightward visual stimulus (Fig. 2C, left); pixels located in left somatosensory and primary motor forelimb area were highly positive when the right handle was grabbed (Fig. 2C, right). To evaluate how well the model captured neural activity at different cortical locations, we computed the 10-fold cross-validated R2 for the full model at different epochs during the trial (Fig. 2D). While some areas were particularly well predicted in specific trial epochs (e.g. V1 during stimulus presentation), there was high predictive power throughout the cortex during all epochs of the trial. For all data (‘Whole trial’), the model predicted 37.8 ± 1.2% of all variance across cortex.
We next sought to address which particular model variables were most critical for its success. The simplest way to do this is to fit a model consisting of a single variable, and ask how well it predicts the data. We therefore computed cross-validated R2 values, over all data, for each single-variable model separately. As shown in the light green bars in Fig. 2E, many variables could individually predict a large amount of variance in the imaging data. However, model variables that were associated with animal movement or internal state (‘Movement’) contained particularly high predictive power compared to task-related variables (‘Task’). This suggests that these movement and state variables, which reflect the internal backdrop, are particularly important for predicting cortical activity. Interestingly, video was the most predictive model variable, explaining ~20% of all variance. By projecting β-weights of the video dimension regressors back into video pixel space, we found that specific areas in the animal’s face, especially the jaw, were particularly important for predicting multiple dimensions of cortical activity (Figure S2).
While many model variables contained high predictive power, it is critical to quantify the amount of unique, non-redundant information contained in each variable. For instance, while licking had high predictive power, it could also be strongly correlated to other task variables such as choice, since licking occurs at roughly the same time in each trial. It might therefore contain little unique information that is not present in other model variables. If true, then removing lick regressors from the model should not affect the model’s overall predictive power since other variables could predict the cortical data equally well.
To isolate the predictive power that is unique to each variable, we created reduced models in which we temporally shuffled the regressor set of a given variable, and compared these reduced models to the full model. The resulting loss of predictive power (ΔR2) with shuffling provides a conservative estimate of the amount of unique information contained in that variable. Pixel-wise ΔR2 maps showed that unique information was highly spatially localized (Fig. 2F, see Figure S3 for other model variables) and matched the cortical areas where β-weights were highest (Fig. 2C).
This analysis revealed considerable variability in how essential each variable was to the model (Fig. 2E, dark green bars). A good example is the ‘time’ variable, a regressor set designed to capture signal deviations that always occur at the same time in each trial (similar to an average over all trials). Although the time-only model captured considerable variance (light green bar), eliminating it had a negligible effect on the model’s predictive power (dark green bar). This is because other task variables, such as choice or stimulus regressors, could capture time-varying modulation equally well. In contrast, movement variables contained large amounts of unique information. Notably, the video regressors contained a high degree of both overall and unique information, substantially outperforming all task-related model variables (Fig. 2E, both dark and light green bars corresponding to ‘Video’ are large).
To directly compare the impact of movement and internal state vs. task variables, we assigned each variable into either a ‘movement’ or ‘task’ category (Fig. 2G). The resulting movement model contained a very high amount of unique information, more than 5-fold as much as the task model (ΔR2Motor = 19.54 ± 0.8% vs. ΔR2Task = 3.43 ± 0.2%; dark green bars). This stark difference was even more pronounced in cortical maps of unique explained variance. These maps revealed that the movement model was far more predictive than the task model throughout the entire cortex (Fig. 2H). The same result was also clearly visible when comparing the accuracy of single-trial reconstructions in different cortical areas, including V1 (Fig. 2I). These results strongly argue that cortical activity is much better explained by the internal backdrop than by cognitive or sensory task variables.
Importantly, the large fraction of variance that is uniquely explained by the movement model is, by definition, orthogonal to the temporal structure of the task. This activity therefore cannot be captured when averaging over trials. However, there was also a significant amount of explained variance that was shared between the movement and task model (R2Shared = 14.86 ± 0.9%; Fig. 2G, light green bars same for task and movement), indicating that many features that are visible in a trial average may be either due to task variables or to certain movements that are task-aligned (e.g., licking at a specific time in every trial). To assess which movement variables were task-aligned, for each movement variable we computed how much explained variance influenced the trial average (‘task shared’ variance) and how much was trial-by-trial variability that averaged out across trials (‘task independent’ variance). Surprisingly, almost all movement regressors contained a large amount of explanatory power that was shared with task variables (Fig. 3A, light blue bars), indicating that each may have a considerable impact on the trial average.
To better understand how movement and task variables influenced the trial average, we used the full model to reconstruct the imaging data and computed trial averages for different cortical areas (Fig. 3B, top). As expected, the model closely reconstructed the imaging data. We then split the model prediction into two parts, based on movement and task variables, without re-fitting. This provides the best available estimate of the relative contribution of all movement variables (blue traces) and task variables (green traces) on the trial average. In V1 (left), baseline activity was mainly reconstructed with movement variables whereas activity after visual stimulation was well explained by task variables. In M2 (right), baseline activity was also mostly explained by movement whereas later activity was explained by a combination of both groups. Separating trial averages into task and movement components therefore allowed us to assess which features of trial-averaged activity are likely to be truly task-related when taking animal movements and state into account.
When we reconstructed trial-averaged activity across cortex based on task variables alone, we found several areas that were substantially task-modulated. Shortly after stimulus onset, task modulation was highest in the visual areas (Fig. 3C, ‘Stim1’). During subsequent visual stimulation and the delay (‘Stim2’ & ‘Delay’), additional modulation developed along the midline, especially in retrosplenial cortex but also parts of M2 and facial somatosensory cortex. To summarize these effects, we summed absolute task modulation over the whole trial duration (Fig. 3D, left). We then computed a task modulation index (TI) to identify areas that were most strongly affected by task vs. movement variables (Fig. 3E). The TI was defined as the difference between absolute task and movement modulation (Fig. 3D, left minus right) divided by their sum, rescaled between 0 and 1. High TI values indicate stronger trial-average modulation due to task variables, while low values indicate a strong movement contribution. The TI revealed multiple cortical areas with considerable relative task modulation. These areas are potential candidates for involvement in decision-making, and included primary and secondary visual cortex, facial somatosensory cortex and specific subareas within medial and anterior M2.
One of these identified areas was the anterior lateral motor cortex (ALM; circled in Fig. 3E). This area was of particular interest because recent work has identified ALM as causally involved in comparable decision-making tasks (Chen et al., 2017; Li et al., 2015). We therefore used two-photon (2p) imaging to investigate ALM more closely and determine whether activity of individual ALM neurons is strongly task-modulated (Fig. 4A). This was also particularly important because widefield imaging mainly reflects average activity across many neural structures in superficial layers (Allen et al., 2017). It was therefore not clear whether the importance of animal movement and state would be equally strong on a single-cell level.
In agreement with earlier reports (Li et al., 2015), many individual ALM neurons were highly active during licks to the contralateral spout (Fig. 4B, top). Other neurons exhibited modulation that was aligned to other task events, such as grabbing the handles, or showed mixed tuning (middle). Some neurons exhibited no modulation in their trial averages (‘untuned’, bottom).
We then applied the exact same linear model as above to the single-cell 2p data. In the single-cell data, as in the widefield data, individual movement variables strongly outperformed task variables (Fig. 4C, light green bars). Given the known causal role of ALM for licking (Li et al., 2015), one might expect that licking would be a particularly important variable to predict ALM activity. Instead, in agreement with our widefield results, we found that almost all movement variables contained considerable information and video-based regressors were far more powerful than any other model variable.
Many movement variables also contained a large amount of unique information (ΔR2, dark green bars). In contrast, task variables explained much less of the overall variance across neurons and contained very little unique explanatory power. Again, this strong difference between movement and task variables became clearer still when comparing the variables by group (Fig. 4D). The full model’s predicted variance was almost entirely matched by the movement model (R2full = 28.85 ± 0.7%; R2Motor = 28.13 ± 0.7%; both light + dark green bars), whereas the task model accounted for much less variance and contained very little unique information (R2Task = 8.74 ± 0.6%, both bars; ΔR2Task = 0.7 ± 0.003%, dark green bar). These effects were not driven by outliers but found in almost every recorded neuron. Across all neurons, a movement-only model performed almost identically to the full model in predicting single-cell variance (light blue trace overlies red trace). For all cells, a large portion of variance was also uniquely explained by the movement model (dark blue trace). Conversely, the task model predicted less variance in most neurons (light green trace) and accounted for any significant variance at all in only about half of all cells. Very few cells contained variance that was uniquely explained by the task model (dark green trace). These results demonstrate that the internal backdrop is of key importance for predicting activity of individual neurons, just as for widefield population data. Moreover, many neurons that would usually be considered untuned due to their lack of modulation in the trial average could still be explained and rendered interpretable by movement variables.
However, the dominance of the backdrop in single-cell activity is also worrying, as it implies that many neural response features that appear to be task-related might in fact be due to movements or state transitions that are temporally aligned with the task. It is important to note that this concern is limited to variance that is shared between movement and task variables (light green bars). The majority of movement-explained variance is unique to the movement model, and therefore orthogonal to the task. That is, the majority of the internal backdrop accounts for ‘spontaneous’ trial-by-trial variability that is removed when averaging over trials.
To determine whether features in the trial average were best explained by task or movement variables, we repeated the analysis from Fig. 3 and reconstructed trial-averaged data for each neuron based on the full model. We then computed the absolute sum of all deviations in the trial average that were either due to movement or due to task variables. As shown in Fig. 4F, the trial average of many neurons was still appreciably modulated by task variables. Using the TI described above, we could then isolate neurons that were strongly modulated by either movement or task variables. For neurons with a low TI, the trial average was almost exclusively modulated by movement variables, including average features that could easily be confused with stimulus-evoked responses or evidence integration signals (Fig. 4G, blue box). Conversely, neurons with a high TI were strongly modulated by task variables, thus identifying individual neurons whose trial average was strongly affected by the behavioral task instead of animal movement or state (green box).
Importantly, this distinction would not have been visible by examination of the trial average alone. The movement-driven example cell exhibited many average features that might have appeared to be responses to the stimuli, and a late rise in firing is reminiscent of decision formation. The model argues that these explanations are inaccurate. On the other hand, in the task-driven example cell, the rising activity might have appeared closely linked to licking, but was found to be mainly driven by task variables. Our model-driven approach therefore provided much more detailed insight into each neuron’s tuning preference and enabled us to isolate single neurons that were truly task-modulated when taking internal backdrop into account.
Discussion
Our results demonstrate that activity across dorsal cortex is dominated by the internal backdrop. By including a wide array of self-generated movements and pupil dilation into our linear model, we were able to take these variables into account and predict neural activity with high accuracy. The dominance of the internal backdrop was observed in both cortex-wide population activity and single neuron data. By quantifying the modulation of trial-averaged data through movement and task variables, we could also identify cortical areas or individual neurons that were most affected by task variables and thus reveal the spatiotemporal dynamics of truly task-related activity.
Cortical activity is widely invariant to animal expertise
By training animals on either visual or auditory stimuli but testing them with both modalities, we could compare neural activity during sensory-guided decisions (expert) versus random guesses (novice) in the same animal. This allowed us to separate neural activity that was due to stimulus presentation or movement from informed utilization of sensory inputs. Surprisingly, though animals understood one contingency and were at chance for the other, cortical responses were highly similar for expert and novice decisions across the many activated areas in dorsal cortex. This suggests that most trial-averaged activity we observed across cortex does not reflect the transformation of sensory evidence to guide animals’ choices, but instead reflects responses closely related to sensory input, movements and state changes. This might also explain the discrepancy between studies that have shown widespread task-related activity in many different brain areas (Allen et al., 2017; Goard et al., 2016; Merre et al., 2017), and studies in which systematic inactivation of many cortical areas found no behavioral effects outside of primary sensory and secondary motor cortex (Allen et al., 2017; Guo et al., 2014).
More subtle decision-related activity might be overshadowed by such cortex-wide modulations. But when we separated movement- from task-related activity, cortical responses for expert and novice decisions remained similar (Figure S4). There are at least two potential reasons for this. Sensory-guided decisions may be encoded by specific sub-populations of cortical neurons that are intermixed within more diverse local networks (Li et al., 2015); or, they may exhibit extensive mixed selectivity (Park et al., 2014; Raposo et al., 2014). Either scenario would obscure the impact of relevant neurons on the population average that is reflected in widefield signals. While this issue is best addressed by measuring individual neurons locally, cell-type-specific widefield imaging could also be used to measure activity of neuronal subtypes across the cortex (Allen et al., 2017; Chan et al., 2017). By measuring from layer- or projection-specific subpopulations instead of all excitatory neurons, this approach may provide a more detailed view of large-scale cortical information processing. It may also help to alleviate an important caveat of widefield imaging: its bias towards superficial layers (Allen et al., 2017), which may obscure more task-related neural activity in deeper layers. While our 2p imaging results revealed individual neurons with interesting task modulation, recordings in deeper layers might be even more informative to find decision-related activity that was not seen with widefield imaging.
Another explanation for the lack of cortical modulation during informed decisions could be the behavioral task design. Our task allowed for fast training (2-4 weeks), robust behavioral performance and comparison of expert vs. novice decisions. However, some cortical areas may be more important in a different setting, like learning a new behavior (Chen et al., 2013; Kawai et al., 2015; Merre et al., 2017), during tasks that require temporal accumulation of noisy sensory evidence (Erlich et al., 2011; Licata et al., 2017) or during spatial navigation (Harvey et al., 2012; Pinto et al., 2018). If true, the methods and analyses that we describe here might be critical to detect additional cortical involvement in other behavioral paradigms.
One of the non-sensory areas that we identified as task-modulated was ALM, which has been shown to be involved in planning and execution of motor output in tasks comparable to ours (Guo et al., 2014; Li et al., 2015). However, it remains unclear whether ALM is involved in evidence integration, or equally driven by sensory-guided versus random decisions. Our recordings show that many ALM neurons were mostly driven by internal backdrop whereas unique task-modulation was present but sparse. Furthermore, neural activity in about half of all recorded ALM neurons was modulated by spontaneous movements but completely orthogonal to the task. The core decision circuitry in our task may therefore lie mostly in subcortical targets like the dorsal striatum (Wang et al., 2018b), hippocampus (Aronov et al., 2017; Merre et al., 2017) or thalamus (Schmitt et al., 2017) and subsequently be relayed to ALM to create or sustain a motor plan. To address these questions, future studies should therefore combine more complex paradigms or subcortical recordings with close monitoring of animal movements and behavioral controls to disentangle differences between sensory-guided versus random decisions.
Cortical activity is dominated by the internal backdrop
Earlier studies that reported a large impact of the internal backdrop on cortical activity mostly focused on spontaneous behaviors like running on a wheel, where internal states are highly variable (Niell and Stryker, 2010; Vinck et al., 2015). One might assume that the internal state of task-performing animals is more constrained: animals are well-trained to the timing and contingencies of the task and perform the same behavior consistently over long periods of time, which might keep them in a less variable, attentive state (Harris and Thiele, 2011). This view is also supported by a reduction of trial-to-trial variance of cortical responses over the course of learning as behavioral performance increases (Ni et al., 2018). Our task design aimed to promote such a stable internal state by allowing mice to self-initiate trials, thereby ensuring that they were aware of an upcoming trial and were willing to perform the task. Despite this, we found that the large majority of cortical activity was dominated by animal movements and internal state changes instead of the behavioral task.
The profound impact of the internal backdrop has important implications when analyzing neural dynamics during decision-making. Although task variables alone explained a considerable amount of variance in cortical data, only ~3% was uniquely explained by the task. Most neural dynamics that might have been considered task-related were therefore ambiguous and equally well explained by internal dynamics or movements. The prevalence of movement modulation across cortex may explain why task-related activity has been observed in a variety of cortical areas (Allen et al., 2017; Goard et al., 2016; Merre et al., 2017) and highlights the importance of additional controls like neural inactivation to test the relevance of a given area for decision-making.
Even in ALM, which had been identified as causal for behavior (Chen et al., 2017; Li et al., 2015), much of the observed single-cell dynamics may be due to ongoing movements. Many of our ALM neurons were strongly modulated in their trial average and exhibited dynamics that seemed reminiscent of evidence accumulation or urgency signals; nonetheless, their activity was often fully explained by movement variables (Fig. 4G). This argues that even when focusing on areas that have been identified with neural inactivation, much of the observed single-cell dynamics may be due to internal backdrop. To address this issue, our linear model could be leveraged to isolate neurons that are best explained by task variables, when taking movements into account. Careful quantification of animal behavior can therefore be utilized to uncover previously obscured task-related neural dynamics.
The large and widespread impact of movements may appear to be in contrast with earlier decision-making studies that mostly found a weak relation between neural activity and movements (Allen et al., 2017; Erlich et al., 2011). The main difference between these earlier findings and our current study is most likely the number of parameters used to describe animal behavior. Our model included a wide variety of different movements and we found that most of them contributed a substantial amount of unique predictive power (Fig. 2E). This means that each variable had a distinct impact on cortical activity that cannot be inferred from other movements. While individual movement variables were indeed less informative than the task model, combining all variables into a larger model led to a pronounced increase in predictive power (Fig. 2G). This highlights the importance of tracking different sources for the internal backdrop when assessing their cumulative impact on cortical activity. Notably, our results are still a lower bound for how well neural activity can be predicted from observing animal behavior. Using more sophisticated machine vision analysis (Mathis et al., 2018) or additional sensors (Bollu et al., 2018) could result in far more detailed information on animal movement or state changes. Such information may enable dissociating effects of state change from specific motor activity, and a deeper understanding of the physiological mechanisms through which different components of the internal backdrop modulate cortical activity.
Notably, using video data alone captured a significant amount of neural variance. This is in agreement with recent work that used PCA to extract facial features from video data, explaining large amounts of variance in dense recordings of many individual neurons in V1 and multiple other brain regions (Stringer et al., 2018). It is therefore possible to extract a surprisingly large amount of information on the animal’s state by recording video data and using well-established linear analysis. Given the feasibility of this approach, we believe it should become standard practice to acquire video data during behavioral experiments.
Finally, the prominence of the internal backdrop raises the question of its role in cortical information processing. Historically, non-task-related activity has often been described as random internal noise that is reduced when performing a behavioral task. Yet, this view seems largely incompatible with the tight coupling of ‘spontaneous’ activity to the animal movements and internal state that we describe here. Some earlier work in sensory areas has hypothesized that integration of specific motor feedback is advantageous for sensory processing, like the integration of running signals in visual areas for motion perception or predictive coding (Ayaz et al., 2013; Keller et al., 2012; Saleem et al., 2013). However, just as auditory and somatosensory cortices were also found to be modulated by running (Ayaz et al., 2018; Schneider et al., 2014; Shimaoka et al., 2018), our results may indicate that this concept is not specific to sensory processing but holds true on a much larger scale. It is not yet clear what purpose this large and widespread modulation serves. As previously speculated, it may relate to cancelling or tracking self-motion (Sommer and Wurtz, 2008), gating of inputs (Schmitt et al., 2017), biasing circuits toward receptive ‘ON’ states (Engel et al., 2016), or permitting distributed associational learning (Engel et al., 2015; Wang et al., 2018a). Every cortical area, regardless of its specific computation, potentially plays an important role when feedback is unexpected. Global transmission of the internal backdrop might therefore be a key component to broadcast behavioral context and flexibly adapt information processing in local cortical networks.
Methods
Animal Subjects
The Cold Spring Harbor Laboratory Animal Care and Use Committee approved all animal procedures and experiments. Experiments were conducted with male mice from the ages of 6-25 weeks. All mouse strains were of C57BL/6J background and purchased from Jackson Laboratory. Four transgenic strains were crossed to create the transgenic mice used for imaging: Emx-Cre (JAX 005628), LSL-tTA (JAX 008600), CaMK2α-tTA (JAX 003010) and Ai93 (JAX 024103). All trained mice were housed in groups of two or more under an inverted 12:12-h light-dark regime and trained during their active dark cycle.
Surgical procedures
All surgeries were performed under 1-2% isoflurane in oxygen anesthesia. After induction of anesthesia, 1.2 mg/kg of Meloxicam was injected subcutaneously and Lidocaine ointment was topically applied to the skin. After making a medial incision, the skin was pushed to the side and fixed in position with tissue adhesive (Vetbond, 3M). We then created an outer wall using dental cement (Ortho-Jet, Lang Dental) while leaving as much of the skull exposed as possible. A circular headbar was attached to the dental cement. For widefield imaging, after carefully cleaning the exposed skull we applied a layer of cyanoacrylate (Zap-A-Gap CA+, Pacer technology) to clear the bone. After the cyanoacrylate was cured, cortical blood vessels were clearly visible.
For two-photon imaging, instead of clearing the skull, we performed a circular craniotomy using a biopsy punch (diameter: 3 mm), centered 1.5 mm mediolateral and 1.5 mm anterior to bregma. We then positioned a circular window over the cortex and sealed the remaining gap between the bone and glass with tissue glue. The window was then secured to the skull using C&B Metabond (Parkell) and the remaining exposed skull was sealed using dental cement. After surgery, animals were kept on a heating mat for recovery and a daily dose of analgesia (1.2 mg/kg Meloxicam) and antibiotics (2.3 mg/kg Enrofloxacin) was administered subcutaneously for at least 3 days.
Behavior
The behavioral setup was based on an Arduino-controlled finite state machine (Bpod r0.5, Sanworks) and custom Matlab code (2015b, MathWorks) running on a Linux PC. Servo motors and visual stimuli were controlled by microcontrollers (Teensy 3.2, PJRC) running custom code. Eleven mice were trained on a delayed 2-alternative forced choice (2AFC) spatial discrimination task. Mice initiated trials by touching either of two handles with their forepaws. Handles were mounted on servo motors and were moved out of reach between trials. After one second of holding a handle, sensory stimuli were presented. Sensory stimuli consisted of either a sequence of auditory clicks, or repeated presentation of a visual moving bar (3 repetitions, 200 ms each). Auditory stimuli were presented from either a left or right speaker, and visual stimuli were presented on one of two small LED displays on the left or right side. The sensory stimulus was presented for 600 ms, there was a 500 ms pause with no stimulus, and then the stimulus was repeated for another 600 ms. The 500 ms inter-stimulus period was added to allow probing neural dynamics during potential decision formation in the absence of sensory stimuli. After the second stimulus, a 1000 ms delay was imposed, after which servo motors moved two lick spouts into close proximity to the animal’s mouth. If the animal licked the spout on the same side as the stimulus, it was rewarded with a drop of water. After one spout was contacted, the other spout was moved out of reach to force the animal to commit to its initial decision.
Animals were trained over the course of approximately 30 days. After 2-3 days of restricted water access, animals were head-fixed and received water in the setup. Water was given by presenting a sensory stimulus, subsequently moving the correct spout close to the animal and dispensing water automatically. After several habituation sessions, animals had to touch the handles to trigger the stimulus presentation. Once animals reliably reached for the handles, the required touch duration was gradually increased up to 1 second. Lastly, the probability of fully self-performed trials, in which both spouts were moved towards the animal after stimulus presentation, was gradually increased until animals reached stable detection performance levels of 80% or higher.
Each animal was trained exclusively on a single modality (6 visual animals, 5 auditory). Only during imaging sessions were trials of the untrained modality presented as well. This allowed us to compare neural activity on trials where animals performed sensory-guided decision-making versus trials where animal decisions were random. To ensure that detection performance was not overly affected by presentation of the untrained modality, the trained modality was presented in 75% and the untrained modality in 25% of all trials.
Behavioral sensors
We used information from several sensors in the behavioral setup to measure different aspects of animal movement. The handles detected contact with the animal’s forepaws, and the lick spouts detected contact with the tongue. An additional piezo sensor below the animal’s trunk was used to detect hindpaw and whole-body movements.
Video monitoring
Two webcams (C920 and B920, Logitech) were used to monitor animal movements. Cameras were positioned to capture the animal’s face (side view) and the body (bottom view). To target particular behavioral variables of interest, we defined subregions of the video which were then examined in more detail. These included a region surrounding the eye, the whisker pad and the nose. From the eye region we extracted changes in pupil diameter using custom Matlab code. To analyze whisker movements, we computed the absolute temporal derivative averaged over the entire whisker pad. The resulting 1-D trace was then normalized and thresholded at 2 standard deviations to extract whisking events. Based on whisking events we created a binary peri-event design matrix that was included in the linear model (see below). The same approach was used for the nose.
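A rough sketch of the whisking-event extraction follows; the collapsing of threshold crossings into single event onsets is an assumption beyond the stated normalization and 2-standard-deviation threshold, and all names are hypothetical:

```python
import numpy as np

def whisking_events(pad_roi, threshold_sd=2.0):
    """Extract binary whisking events from the whisker-pad video subregion.

    pad_roi: array (n_frames, height, width). The motion trace is the
    absolute temporal derivative averaged over the ROI; frames where the
    normalized trace crosses `threshold_sd` are marked as events.
    """
    motion = np.abs(np.diff(pad_roi.astype(float), axis=0)).mean(axis=(1, 2))
    motion = (motion - motion.mean()) / motion.std()   # z-score normalization
    above = motion > threshold_sd
    # keep threshold crossings only, so each bout yields one event onset
    onsets = np.flatnonzero(np.diff(above.astype(int)) == 1) + 1
    events = np.zeros(pad_roi.shape[0], dtype=bool)
    events[onsets] = True
    return events
```

The resulting binary vector feeds into a peri-event regressor set exactly as for other motor events.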
Widefield imaging
Widefield imaging was done using an inverted tandem-lens macroscope (Ratzlaff and Grinvald, 1991) in combination with an sCMOS camera (Edge 5.5, PCO) running at 60 fps. The top lens had a focal length of 105 mm (DC-Nikkor, Nikon) and the bottom lens 85 mm (85M-S, Rokinon), resulting in a magnification of 1.24x. The total field of view was 12.5 x 10.5 mm and the image resolution was 640 x 540 pixels after 4x spatial binning (spatial resolution: ~20 μm/pixel). To capture GCaMP fluorescence, a 500 nm long-pass filter (ET500lp, Chroma) was placed in front of the camera. Excitation light was projected on the cortical surface using a 495 nm long-pass dichroic mirror (T495lpxr, Chroma) placed between the two macro lenses. The excitation light was generated by a collimated blue LED (470 nm, M470L3, Thorlabs) and a collimated violet LED (405 nm, M405L3, Thorlabs) that were coupled into the same excitation path using a dichroic mirror (#87-063, Edmund Optics). We alternated illumination between the two LEDs from frame to frame, resulting in one set of frames with blue and the other with violet excitation at 30 fps each. Excitation of GCaMP at 405 nm results in non-calcium dependent fluorescence (Lerner et al., 2015), allowing us to isolate the true calcium-dependent signal by rescaling and subtracting frames with violet illumination from the preceding frames with blue illumination (Allen et al., 2017). All subsequent analysis was based on this differential signal at 30 fps.
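The rescale-and-subtract step might look as follows. The exact rescaling procedure is not specified here, so the per-pixel least-squares fit below is an assumption (one common implementation), and the two channels are assumed to be already de-interleaved into paired frame arrays:

```python
import numpy as np

def correct_hemodynamics(blue, violet):
    """Isolate the Ca2+-dependent signal from dual-wavelength frames.

    blue, violet: (n_frames, n_pixels) fluorescence under 473 nm
    (Ca2+-dependent) and 405 nm (Ca2+-independent) excitation, paired
    frame-by-frame. The violet channel is rescaled per pixel and
    subtracted from the blue channel.
    """
    # per-pixel scaling factor: regress blue on violet with no intercept
    scale = (blue * violet).sum(axis=0) / (violet ** 2).sum(axis=0)
    return blue - scale[None, :] * violet
```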
Two-photon imaging
Two-photon imaging was performed in 2 mice (visual experts) with a resonant-scanning two-photon microscope (Sutter Instruments, Movable Objective Microscope, configured with the “Janelia” option for collection optics), a Ti:Sapphire femtosecond pulsed laser (Ultra II, Coherent Inc.), and a 16X 0.8 NA objective (Nikon Instruments). Images were acquired at 30.9 Hz with an excitation wavelength of 930 nm. All focal planes were between 140-150 μm below the pial surface. The objective height was manually adjusted during recording in 1-2 μm increments as often as necessary to maintain the same focal plane.
Images were processed using Suite2P (Pachitariu et al., 2016) with model-based background subtraction. Sessions yielded 63-126 neurons each, for 271-529 behavioral trials.
Preprocessing of neural data
To analyze widefield data, we used SVD to compute the 200 highest-variance dimensions. These dimensions accounted for at least 88% of the total variance in the data. Using 500 dimensions accounted for little additional variance (~0.15%), indicating that additional dimensions were mostly capturing recording noise. SVD returns ‘spatial components’ U (of size pixels x components), ‘temporal components’ VT (of size components x frames) and singular values S (of size components x components) to scale components to match the original data. To reduce computational cost, all subsequent analysis was performed on the product SVT. Results of analyses on SVT were later multiplied with U, to recover results for the original pixel space. All widefield data were rigidly aligned to the Allen Common Coordinate Framework v3, using four anatomical landmarks: the left, center, and right points where anterior cortex meets the olfactory bulbs and the medial point at the base of retrosplenial cortex.
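A minimal sketch of this compression, assuming the movie fits in memory as a pixels x frames array (for real data sizes, a truncated or randomized SVD solver would be preferable):

```python
import numpy as np

def compress_widefield(movie, n_dims=200):
    """Truncated SVD compression of a widefield movie.

    movie: (n_pixels, n_frames) array. Returns the spatial components U
    (pixels x dims) and the product S @ Vt (dims x frames). Analyses run
    on S @ Vt; a result is mapped back to pixel space as U @ result.
    """
    U, s, Vt = np.linalg.svd(movie, full_matrices=False)
    return U[:, :n_dims], s[:n_dims, None] * Vt[:n_dims]
```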
To analyze 2p data, Suite2P was used to perform rigid motion correction on the image stack, identify neurons, extract their fluorescence, and correct for neuropil contamination (Pachitariu et al., 2016). ΔF/F traces were produced using the method of Jia et al. (Jia et al., 2011), skipping the final filtering step. Using these traces, we produced a matrix of size neurons x time, and treated this similarly to SVT above. Finally, we confirmed imaging stability by examining the average firing rate of neurons over trials. If this varied substantially at the beginning or end of a session, the unstable portion was discarded.
To compute trial-averages, imaging data were double-aligned to the time when animals initiated a trial and to the stimulus onset. After alignment, single trials consisted of 1.8 s of baseline, 0.83 s of handle touch and 3.3 s following stimulus onset. The randomized additional interval between initiation and stimulus onset (0–0.25 s) was discarded in each trial and the resulting trials of equal length were averaged together.
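A sketch of this double alignment for a single trial; the frame counts follow from the stated epoch durations at 30 fps, and the exact indexing is an assumption:

```python
import numpy as np

def align_trial(svt, init_frame, stim_frame, fps=30):
    """Double-align one trial: baseline and handle epochs locked to trial
    initiation, post-stimulus epoch locked to stimulus onset. The jittered
    interval between the two events is dropped, so all trials match in length.

    svt: (n_dims, n_frames) compressed imaging data for one session.
    """
    pre = svt[:, init_frame - int(1.8 * fps): init_frame + int(0.83 * fps)]
    post = svt[:, stim_frame: stim_frame + int(3.3 * fps)]
    return np.concatenate([pre, post], axis=1)
```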
Linear model
The linear model was constructed by combining multiple sets of regressors into a design matrix, to capture signal modulation by different task or motor events (Fig. 2A). Each regressor set was based on a single binary vector that contained a pulse at the time of the relevant event. To produce the regressor set, we repeated this vector with each copy being shifted in time by one frame relative to the original. For sensory stimuli, we created post-event regressor sets spanning all frames from stimulus onset until the end of the trial. For motor events like licking or whisking, we created peri-event regressor sets that spanned the frames from 0.5 s before until 1 s after each event. Lastly, we created whole-trial regressors, covering each frame in a given trial. Whole-trial regressors were aligned to stimulus onset and contained information about decision variables, such as animal choice or whether a given trial was rewarded. The model also contained several analog (non-binary) regressors, such as 1-D regressors for pupil diameter. To capture animal movements, we used SVD to compute the 200 highest-variance dimensions of video information from both cameras. SVD was performed either on the raw video data (‘video’) or the absolute temporal derivative (‘motion’). SVD analysis of behavioral video was the same as for the widefield data, and we used the product SVT of temporal components and singular values as analog regressors in the linear model. We did not use lagged versions of the analog regressors, including the video regressors.
To use video data regressors, it was important to ensure that they would not contain explanatory power from other model variables like licking and whisking that can also be inferred from video data. To accomplish this, we first created a reduced design matrix Xr, containing all movement regressors as well as times when spouts or handles were moving. Xr was ordered so that the motion and video columns were at the end. We then performed a QR decomposition of Xr (Mumford et al., 2015). The QR decomposition of a matrix A is A = QR, where Q is an orthonormal matrix and R is upper triangular. Columns 1 to j of Q therefore span the same space as columns 1 to j of A, but all the columns are orthogonal to one another. Finally, we replaced the motion and video columns of the full design matrix X with the corresponding columns of Q. This allowed the model to improve the fit to the data using any unique contributions of the motion and video regressors, while ensuring that the weights given to other regressors were not altered.
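A minimal sketch of this orthogonalization step, assuming the motion and video regressors occupy the trailing columns of the reduced design matrix:

```python
import numpy as np

def orthogonalize_trailing(Xr, first_video_col):
    """Orthogonalize video/motion regressors against all preceding columns.

    Xr: reduced design matrix with the motion and video columns last.
    Columns 1..j of Q span the same space as columns 1..j of Xr, so the
    trailing columns of Q carry only variance not already explained by
    the preceding regressors (cf. Mumford et al., 2015).
    """
    Q, _ = np.linalg.qr(Xr)          # reduced QR: Q is n_frames x n_cols
    X_out = Xr.copy()
    X_out[:, first_video_col:] = Q[:, first_video_col:]
    return X_out
```

In the full pipeline, the orthogonalized motion and video columns then replace the corresponding columns of the full design matrix X before fitting.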
When a design matrix has columns that are close to linearly dependent (multicollinear), model fits are not reliable. To test for this, we devised a novel method we call “cumulative subspace angles.” The idea is that for each column of the design matrix, we wish to know how far it lies from the space spanned by the previous columns (note that pairwise angles do not suffice to determine multicollinearity). Our method works as follows: (1) the columns of the matrix were normalized to unit magnitude, (2) a QR decomposition of X was performed, (3) the absolute value of the elements along the diagonal of R were examined. Each of these values is the absolute dot product of the original vector with the same vector orthogonalized relative to all previous vectors. The values range from zero to one, where zero indicates complete degeneracy and one indicates no multicollinearity at all. Over all experiments, the most collinear regressor received a value of 0.26, indicating that it was 15° from the space of all other regressors. The average value was 0.84, corresponding to a mean angle of 57°.
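The procedure is compact enough to state directly in code; the sketch below implements steps (1)-(3) and converts the diagonal values to angles:

```python
import numpy as np

def cumulative_subspace_angles(X):
    """For each design-matrix column, how far it lies from the span of all
    previous columns: (1) normalize columns to unit magnitude, (2) QR-
    decompose, (3) take |diag(R)|. A value of 0 indicates complete
    degeneracy, 1 indicates no multicollinearity; the angle to the
    preceding subspace is the arcsine of the value (e.g., 0.26 -> ~15 deg).
    """
    Xn = X / np.linalg.norm(X, axis=0, keepdims=True)
    _, R = np.linalg.qr(Xn)
    vals = np.abs(np.diag(R))
    return vals, np.degrees(np.arcsin(np.clip(vals, 0.0, 1.0)))
```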
To avoid overfitting, the model was fit using ridge regression. The regularization penalty was estimated separately for each column of the widefield data using marginal maximum likelihood estimation (Karabatsos, 2017) with minor modifications that reduced numerical instability for large regularization parameters.
Variance analysis
Explained variance (R2) was obtained using 10-fold cross-validation. To compute the total explained variance of each individual model variable, we created reduced models in which all regressors that did not correspond to the given variable were shuffled in time. The explained variance of each reduced model revealed the maximum potential predictive power of the corresponding model variable.
To assess unique explained variance by individual variables, we created reduced models for each variable where only the corresponding regressor set was shuffled in time. The difference in explained variance between the full and the reduced model yielded the unique contribution ΔR2 of that model variable. The same approach was used to compute unique contributions for groups of variables, i.e., ‘movement’ or ‘task’. Here, all variables that corresponded to a given group were shuffled together.
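A simplified sketch of this shuffle-based estimate; function names are hypothetical, and in-sample R2 with a fixed ridge penalty stands in for the 10-fold cross-validated, per-dimension-regularized fits used in the actual analysis:

```python
import numpy as np

def ridge_beta(X, Y, lam=10.0):
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ Y)

def r_squared(X, Y, beta):
    return 1.0 - np.var(Y - X @ beta) / np.var(Y)

def unique_r2(X, Y, cols, lam=10.0, seed=0):
    """Unique contribution (delta R2) of one regressor set: R2 of the full
    model minus R2 of a model in which only those columns were shuffled
    in time (destroying their relation to the data while preserving their
    overall statistics)."""
    rng = np.random.default_rng(seed)
    full = r_squared(X, Y, ridge_beta(X, Y, lam))
    Xs = X.copy()
    for c in np.atleast_1d(cols):
        Xs[:, c] = rng.permutation(Xs[:, c])
    reduced = r_squared(Xs, Y, ridge_beta(Xs, Y, lam))
    return full - reduced
```

Group-level contributions (e.g., ‘movement’ or ‘task’) follow by passing all columns of the group at once.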
To compute the ‘task-shared’ or ‘task-independent’ explained variance for each movement variable, we created reduced models where all movement variables were shuffled in time. This task-only model was then compared to other reduced models where all movement variables but one were shuffled. The difference between the task-only model and this model yielded the task-independent contribution of that movement variable. The task-shared contribution was the difference between the total variance explained by a given variable and its task-independent contribution.
Model-based reconstruction of trial-averages
Reconstructed trial averages (Fig. 3 & 4) were produced by fitting the full model and averaging the reconstructed data over all trials. To split the model into the respective contributions of movement and task variables, we reconstructed the data based on either the movement or task variables alone (using the same weights as in the full model) and averaged over all trials. To evaluate the relative impact of task variables on the trial average, we computed a task modulation index (TI), defined as

TI = ΔTask / (ΔTask + ΔMovement)

where ΔTask and ΔMovement denote the mean absolute deviation of the reconstructed trial average based on either task or movement variables alone. This is equivalent to the difference between task and movement modulation, divided by their sum and rescaled between 0 and 1 (cf. Fig. 3E). The TI ranges from 0 (fully movement-related) to 1 (fully task-related). Intermediate values denote a mixed contribution of task and movement regressors to the trial-average.
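A minimal sketch of the TI computation under these definitions. The split reconstructions are obtained from the full-model weights without re-fitting (e.g., Y_task = X[:, task_cols] @ beta[task_cols]); taking deviations about each average’s own mean is an assumption, since the reference baseline is not stated explicitly:

```python
import numpy as np

def task_modulation_index(avg_task, avg_move):
    """TI from split trial-average reconstructions.

    avg_task, avg_move: trial averages reconstructed from the task or
    movement regressors alone, using the full-model beta weights.
    TI = 0: fully movement-related; TI = 1: fully task-related.
    """
    d_task = np.mean(np.abs(avg_task - avg_task.mean()))
    d_move = np.mean(np.abs(avg_move - avg_move.mean()))
    return d_task / (d_task + d_move)
```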
Model-based video reconstruction
To better understand how the video related to the neural data, we analyzed the portion of the β-weight matrix that corresponded to the video regressors. This portion of the matrix was projected back up into the original video space. The result was of size p x d, where p is the number of video pixels (153,600) and d is the number of dimensions of the widefield data (200). We performed PCA on this matrix, reducing the number of rows. The top few ‘scores’ (projections onto the principal components) are low-dimensional representations of the widefield maps that were most strongly influenced by the video. To choose the dimensionality, we used the number of dimensions required to account for >90% of the variance (Fig. S2A). To obtain the widefield maps showing how the video was related to neural activity (Fig. S2B), we projected the scores back into widefield data pixel space and sparsened them using the varimax rotation. To determine the influence of each video pixel on the widefield (Fig. S2C), we projected the low-dimensional β-weights into video pixel space, took the magnitude of the β-weights for each pixel, and multiplied by the standard deviation for that pixel.
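A sketch of the final step (Fig. S2C), with hypothetical argument names, following the projection-magnitude-scaling recipe described above:

```python
import numpy as np

def video_pixel_influence(beta_video, U_video, pixel_std):
    """Influence of each video pixel on the widefield data.

    beta_video: (n_video_dims, n_widefield_dims) weights of the video
    regressors; U_video: (n_pixels, n_video_dims) spatial components from
    the behavioral-video SVD; pixel_std: (n_pixels,) standard deviation
    of each video pixel over time.
    """
    B = U_video @ beta_video               # weights in video pixel space
    return np.linalg.norm(B, axis=1) * pixel_std
```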
Acknowledgements
We thank Onyekachi Odoemene, Sashank Pisupati and Hien Nguyen for technical assistance and scientific discussions. Financial support was received from the Swiss National Science Foundation (SM), the Pew Charitable Trusts (AKC) and the Simons Collaboration on the Global Brain (AKC, MTK).