ABSTRACT
Flexible responses to sensory stimuli based on changing rules are critical for adapting to a dynamic environment. However, it remains unclear how the brain encodes rule information and uses this information to guide behavioral responses to sensory stimuli. Here, we made single-unit recordings while head-fixed mice performed a cross-modal sensory selection task in which they switched between two rules in different blocks of trials: licking in response to tactile stimuli applied to a whisker while rejecting visual stimuli, or licking to visual stimuli while rejecting the tactile stimuli. Along a cortical sensorimotor processing stream including the primary (S1) and secondary (S2) somatosensory areas, and the medial (MM) and anterolateral (ALM) motor areas, the single-trial activity of individual neurons distinguished between the two rules both prior to and in response to the tactile stimulus. Variable rule-dependent responses to identical stimuli could in principle occur via appropriate configuration of pre-stimulus preparatory states of a neural population, which would shape the subsequent response. We hypothesized that neural populations in S1, S2, MM and ALM would show preparatory activity states that were set in a rule-dependent manner to cause processing of sensory information according to the current rule. This hypothesis was supported for the motor cortical areas by findings that (1) the current task rule could be decoded from pre-stimulus population activity in ALM and MM; (2) neural subspaces containing the population activity differed between the two rules; and (3) optogenetic disruption of pre-stimulus states within ALM and MM impaired task performance. Our findings indicate that flexible selection of an appropriate action in response to a sensory input can occur via configuration of preparatory states in the motor cortex.
HIGHLIGHTS
Task rules are reflected in preparatory activity in sensory and motor cortices.
Neural subspaces for processing tactile signals depend on the current task rule.
Motor cortical activity tracks rule switches and is required for flexible rule-guided behavior.
INTRODUCTION
Abstract relationships between objects, events, and actions can be established by rules. How we receive, process, and respond to sensory signals is guided by our understanding of rules that apply to the current context. For example, a student picks up a vibrating phone while waiting for a phone interview, but not during a lecture. This rule-guided flexibility is essential to adapt in a dynamic environment1 and is at the core of many advanced cognitive functions such as economic decision making and social interaction2,3. Several lines of studies have shown that impairments in rule-based decision making are linked to neurodevelopmental conditions including autism spectrum disorder4,5 and schizophrenia6,7.
Flexible rule-guided behaviors require at least two processes from the brain. First, (#1) to maintain and update rule representations; and (#2) to apply the appropriate rule to correctly transform sensory signals into motor outputs. Higher-order brain areas in frontal and parietal cortices are thought to play an important role in process #1, that is in encoding abstract rules and guiding the sensorimotor transformation1,8–12. It remains less clear how process #2 occurs, i.e. how rules are applied to sensorimotor pathways to govern the mapping between stimulus and response13.
Rodent orofacial circuits provide well-defined sensorimotor pathways to study rule implementation. Recent studies have uncovered important cortical areas linking whisker input to licking output in goal-directed behavior14–17. For example, findings in a delayed tactile detection task have shown that the whisker region of primary somatosensory cortex is required for sensory detection and the anterior lateral motor cortex (ALM) is critical for motor planning and execution of licking15,18. Additionally, the medial motor cortex (MM), which contains the whisker primary and secondary motor cortices19,20, demonstrates early selectivity of whisker-based tactile signals compared with ALM21.
Activity in sensorimotor pathways can reflect both sensory input and the rules dictating the appropriate use of that input11,13,22–24. Analysis of neural populations may provide insight into how these sensory and contextual signals are integrated to govern behavior. Recent progress in the study of neural population dynamics has uncovered population-level computations for motor control25–28, timing29–31 and decision-making32–35. The evolution of the neural population state is controlled by initial conditions, internal dynamics, and external inputs, and can allow computations to be carried out36. For example, preparatory activity in non-human primate motor and premotor areas has been proposed to initialize dynamical systems to generate appropriate arm movements37,38. In addition, thalamic input has been shown to drive cortical dynamics in the mouse motor cortex during reaching28. In the primate dorsomedial frontal cortex, the speed of neural trajectories encoding the passage of time is adjusted by both initial conditions and thalamic inputs30,31.
Here we investigate how task rules affect activity in a sensorimotor pathway to govern the mapping between stimulus and response. We trained mice in a cross-modal sensory selection task that we recently developed39 and then recorded and analyzed population single-unit activity from a set of cortical areas along the pathway that transforms whisker sensory inputs into licking motor outputs15,18,21. We found that single-neuron and population activity before stimulus delivery reflected task rules in the whisker regions of primary (S1) and secondary (S2) somatosensory cortical areas, and in the medial (MM) and anterolateral (ALM) motor cortical areas. Across these cortical areas, neural subspaces containing the trial activity differed between the two rules. Pre-stimulus population states in the motor cortical areas shifted in a manner that tracked the rule switch. Optogenetic inhibition designed to disrupt pre-stimulus states in the motor cortical areas impaired rule-dependent tactile detection. Together, our results show that the application of task rules—to link a stimulus to the correct response— involves rule-dependent configuration of pre-stimulus preparatory states within the sensorimotor cortex.
RESULTS
Task rules modulate touch-evoked activity of individual neurons in the tactile processing stream
We trained head-fixed mice to perform a cross-modal sensory selection task39 in which they switched between “respond-to-touch” and “respond-to-light” rules in different blocks of trials (Fig. 1a,b). During the respond-to-touch rule, mice responded to a tactile stimulus by licking to the right reward port, and to withhold licking following a visual stimulus. During the respond-to-light rule, mice responded to a visual stimulus by licking to the left reward port, and withheld licking following a tactile stimulus. For each trial, the stimulus duration was 0.15 s and an answer period extended from 0.1 to 2 s from stimulus onset. Blocks of trials under the two rules alternated multiple times in a session (Fig. 1b; 4-6 blocks per session, ∼60 trials per block). No cue was immediately provided after rule switching, so mice detected the rule change through reward availability over the first few trials. On the 9th trial, a drop of water from the correct reward port was released following the stimulus to ensure switching (Fig. 1b, black dots).
The two stimulus-response rules determined the behavioral relevance of sensory stimuli. For example, tactile stimuli were behaviorally relevant in respond-to-touch blocks but irrelevant in respond-to-light blocks. Four trial outcomes were defined based on the behavioral relevance of a sensory stimulus and on the response of the mouse. Correct licking responses following a relevant stimulus were “hits”, and correct withholding of responses following an irrelevant stimulus were “correct rejections”. Failed responses to a relevant stimulus were “misses”, and incorrect licking responses (following an irrelevant stimulus and/or at the incorrect port) were “false alarms”. Two sensory modalities and four trial outcomes composed eight trial types in the cross-modal sensory selection task (Fig. 1c). Only hit trials were rewarded with a drop of water, and the other trial types were neither rewarded nor punished. We used stimulus strengths (tactile stimulus: single whisker, 20 Hz, 150 ms, ∼800° s−1; visual stimulus: 470 nm LED, 150 ms, ∼3 μW) that yielded performance of approximately 75% correct for both respond-to-touch and respond-to-light blocks (Fig. 1d; touch: 74 ± 1%, light: 75 ± 1% (mean ± sem), p = 0.56, paired sample t-test, n = 12 mice). Thus, mice flexibly responded to tactile and visual stimuli in a rule-dependent manner.
To examine how the task rules influenced the sensorimotor transformation occurring in the tactile processing stream, we performed single-unit recordings from sensory and motor cortical areas including S1, S2, MM and ALM (Fig. 1e-g, Fig. S1a-h, and Fig. S2a; S1: 6 mice, 10 sessions, 177 neurons, S2: 5 mice, 8 sessions, 162 neurons, MM: 7 mice, 9 sessions, 140 neurons, ALM: 8 mice, 13 sessions, 256 neurons). We recorded from one of these cortical areas per session, using 64-channel silicon probes. Across neurons, activity varied by orders of magnitude (Fig. S2b) and showed diverse selectivity. In tactile stimulus trials, some neurons in sensory and motor cortical areas showed prominent touch-evoked responses in both block types (e.g. Fig. S1c). Most of these touch-evoked responses appeared enhanced and sustained at timepoints prior to licking in the respond-to-touch (tactile hit, “tHit”) blocks when compared with similar timepoints in the respond-to-light (tactile correct rejection, “tCR”) blocks (Fig.1f and Fig. S1a,e). In contrast, some neurons only showed increased activity when mice made lick responses, regardless of the block type (Fig. S1d,f,h). Neurons could also show a mixture of these two response types (Fig. S1b,g).
We first determined to what extent the rules modulated touch-evoked activity by comparing tactile correct trials between respond-to-touch and respond-to-light blocks (tHit vs tCR). We restricted analysis to the 150 ms period of stimulus delivery, to focus on rule-dependent processing that was not influenced by overt movements (97% of lick onsets occurred >150 ms after stimulus onset). We used ideal observer analysis to determine how well trial-by-trial activity of a single neuron could discriminate between the task rules (Methods). We found that 18-33% of neurons in these cortical areas had area under the receiver-operating curve (AUC) values significantly different from 0.5, and therefore discriminated between tHit and tCR trials (Fig. 1h; S1: 28.8%, 177 neurons; S2: 17.9%, 162 neurons; MM: 32.9%, 140 neurons; ALM: 23.4%, 256 neurons; criterion to be considered significant: Bonferroni corrected 95% CI on AUC did not include 0.5 for at least 3 consecutive 10-ms time bins). Moreover, the distribution of AUC values for discriminative neurons showed that the majority had enhanced responses (AUC > 0.5) to tactile stimuli that were behaviorally relevant. Therefore, touch-evoked responses were overall enhanced by the relevance of tactile stimuli according to the current stimulus-response rule.
Pre-stimulus activity of single neurons signals task rules
In our cross-modal selection task, rules were block-based and there were no cues to indicate the current rule (except for the rule transition cue on the 9th trial of each block), so the mice were required to maintain rule information during the inter-trial interval (ITI). To test if ITI activity of neurons in the somatosensory and motor cortical areas reflected task rules, we first analyzed neural activity during the one second window preceding stimulus delivery. Trials with licking in this time window were removed to minimize possible movement effects on pre-stimulus activity (Methods). We found that some cortical neurons showed obvious changes in their pre-stimulus activity across blocks (Fig. 2a). A preference for the respond-to-touch rule (that is, with activity higher in respond-to-touch blocks compared with respond-to-light blocks) and a preference for the respond-to-light rule were both observed (Fig. 2a,b). We next calculated discriminability between block types for each neuron to see how well an ideal observer could categorize the current trial’s task rule on the basis of pre-stimulus activity (−100 to 0 ms from stimulus onset). To ensure mice were in the correct state to treat the following stimuli according to the rule, only correct tactile and visual trials were included. Less than 5% of neurons in S1 and S2 showed significant rule discriminability, while MM and ALM had around 10-20% significant neurons (Fig. 2c; S1: 4.5%, 177 neurons; S2: 2.5%, 162 neurons; MM: 21.4%, 140 neurons; ALM: 10.2%, 256 neurons; Bonferroni corrected 95% CI on AUC did not include 0.5).
We also calculated the ability of each neuron to discriminate between tactile vs visual stimuli in windows before (−100 to 0 ms) or after (0 to 100 ms) stimulus onset, to serve as negative and positive controls, respectively, for our use of ideal observer analysis. As expected, single-unit activity before stimulus onset did not discriminate between tactile and visual trials (Fig. 2d; S1: 0%, 177 neurons; S2: 0%, 162 neurons; MM: 0%, 140 neurons; ALM: 0.8%, 256 neurons). After stimulus onset, more than 35% of neurons in the sensory cortical areas and approximately 15% of neurons in the motor cortical areas showed significant stimulus discriminability (Fig. 2e; S1: 37.3%, 177 neurons; S2: 35.2%, 162 neurons; MM: 15%, 140 neurons; ALM: 14.1%, 256 neurons).
The rule dependence of pre-stimulus activity indicates that neurons were in different states immediately prior to stimulus onset, suggesting that their responses to the subsequent sensory input might also differ. We therefore next investigated early touch-evoked responses (0 to 50 ms from stimulus onset) in those neurons that showed significant rule discriminability prior to stimulus delivery. We found that neurons with stronger preference for the respond-to-touch rule before stimulus onset showed larger touch-evoked activity in the respond-to-touch blocks, whereas neurons with a preference for the respond-to-light rule showed larger touch-evoked activity in the respond-to-light blocks (Fig. 2f; Pearson correlation; S1: r = 0.69, p = 0.056, 8 neurons; S2: r = 0.91, p = 0.093, 4 neurons; MM: r = 0.93, p < 0.001, 30 neurons; ALM: r = 0.83, p < 0.001, 26 neurons). This result suggests that configuration of pre-stimulus states could play a role in achieving rule dependence of sensory processing.
Together, these results demonstrate that the pre-stimulus activity of single units in the sensory and motor cortical areas reflected task rules, with better discrimination of rules by units in the motor areas.
Pre-stimulus states of neuronal populations reflect task rules
We next investigated the function of rule-dependent pre-stimulus activity from the perspective of neural population dynamics. In a dynamical system, state variables change based on their current values. This implies that the evolution of neural population activity during a trial will depend in part on the activity state of the population at the beginning of the trial36,40. We hypothesized that pre-stimulus activity—i.e. the state of the population at the start of the trial— would be set in such a way as to enable processing of the upcoming sensory signals according to the appropriate rule. Predictions from this hypothesis are that: (1) the pre-stimulus state of a neural population could be used to decode the current task rule; (2) the rule-dependent separation of neural subspaces before and after the tactile stimulus onset should be correlated; (3) pre-stimulus states would shift when the mice switched between rules; and (4) perturbations of the pre-stimulus state should disrupt task performance. We tested each prediction via the following analyses and experiments.
We first determined whether the pre-stimulus population state could be used to decode the current task rule. For each session, we used linear discriminant analysis (LDA) to obtain a classification accuracy for block type (respond-to-touch vs respond-to-light) based on the pre-stimulus activity (−100 to 0 ms relative to stimulus onset) of simultaneously recorded neurons on correct trials (Fig. 3a). Pre-stimulus states in S1, S2, MM and ALM could each be used to decode the block type (Fig. 3b; medians of classification accuracy [true vs shuffled]: S1 [0.61 vs 0.5], 10 sessions; S2 [0.62 vs 0.53], 8 sessions; MM [0.7 vs 0.52], 9 sessions; ALM [0.68 vs 0.55], 13 sessions). Support vector machine (SVM) and Random Forest classifiers showed similar decoding abilities (Fig. S3a,b; medians of classification accuracy [true vs shuffled]; SVM: S1 [0.6 vs 0.53], 10 sessions, S2 [0.61 vs 0.51], 8 sessions, MM [0.71 vs 0.51], 9 sessions, ALM [0.65 vs 0.52], 13 sessions; Random Forests: S1 [0.59 vs 0.52], 10 sessions, S2 [0.6 vs 0.52], 8 sessions, MM [0.65 vs 0.49], 9 sessions, ALM [0.7 vs 0.5], 13 sessions).
Interestingly, activity in the motor cortical areas allowed robust block-type classification (no overlap between bootstrap 95% CIs for the true and shuffled data, 95% CIs for true vs shuffled data: MM [0.60,0.73] vs [0.48, 0.53]; ALM [0.56,0.66] vs [0.49, 0.54]). In contrast, the sensory cortical areas showed limited block-type discriminability (Fig. 3c; bootstrap 95% CIs for the true data were above 0.5 but overlapped with the bootstrap 95% CIs for the shuffled data, 95% CIs for true vs shuffled data: S1 [0.52,0.61] vs [0.49,0.55]; S2 [0.53,0.63] vs [0.49, 0.54]). In positive and negative control analyses, we found that neural population activity could be used to discriminate between stimulus types (tactile vs visual) after stimulus onset (Fig. 3e and Fig. S3d; 0 to 100 ms relative to stimulus onset) but not before stimulus onset (Fig. 3d and Fig. S3c; from −100 to 0 ms).
Together these results show that each of the two task rules was associated with a different pattern of pre-stimulus population activity in sensory and motor cortical areas, with a larger difference in the motor areas.
Separation of pre-stimulus states predicts subsequent divergent processing
We found pre-stimulus population activity was rule-dependent across sensory and motor cortical areas. We next asked if these rule-dependent pre-stimulus states affect post-stimulus neural activity. We investigated the relationship between the difference in pre-stimulus states between tHits and tCRs and the divergence of their subsequent neural trajectories. To assess this for the four cortical areas, we quantified how the tHit and tCR trajectories diverged from each other by calculating the Euclidean distance between matching time points for all possible pairs of tHit and tCR trajectories for a given session and then averaging these for the session (Fig. 4a,b; S1: 10 sessions, S2: 8 sessions, MM: 9 sessions, ALM: 13 sessions, individual sessions in gray and averages across sessions in black; window of analysis: −100 to 150 ms relative to stimulus onset; 10 ms bins; using the top 3 PCs; Methods). The resulting time series of distance values from all sessions (n = 40) were then ranked according to their means over the 100 ms period preceding stimulus onset and split into two groups, those above and below the median. The top 50% group showed a larger mean distance between tHit and tCR trajectories after stimulus onset compared with the bottom 50% group41,42 (Fig. 4c; permutation test, p < 0.001, 40 sessions). This result shows that the difference in population responses to the tactile stimuli under the two rules is commensurate with the difference in pre-stimulus states.
Neural subspaces for tactile processing are rule-dependent
Although the full dimensionality of a neural state space is equal to the number of neurons, correlations among neurons typically cause dynamics to occur within a subspace of lower dimensionality40,43. Population activity associated with each task rule might not only follow distinct trajectories, but could also occur within different neural subspaces. To test this, we calculated the overlap between the subspaces associated with tHit and tCR trials35,44,45 (Fig. 4d; see Methods). We expect that, if tHit and tCR trial activity occupies largely overlapping subspaces, then the neural dimensions capturing most of the tHit activity will also explain much of the tCR activity (Fig. 4d, middle). Conversely, if they occupy largely distinct subspaces, then the dimensions capturing most of the tHit activity will explain little of the tCR activity (Fig. 4d, right). For each session, tHit trials were divided randomly into equally-sized “reference” and “control” groups. The reference tHit trials were then used to perform a principal component analysis (PCA; using data 0 to 150 ms from stimulus onset; 10 ms bins). We projected activity from tCR trials and from the control tHit trials into the space of the top-three principal components obtained from the reference group PCA, then calculated and normalized their variance explained (Methods). For S1, S2, MM and ALM, subspace overlaps for tCR trials were significantly lower than the corresponding subspace overlaps for the control tHit trials (Fig. 4e, purple vs gray symbols; tCR – control tHit: S1 [−0.23], 8 sessions, p = 0.0016; S2 [−0.23], 7 sessions, p = 0.0086; MM [−0.36], 5 sessions, p = <0.001; ALM [−0.35], 11 sessions, p < 0.001, paired t-test). This finding suggests that different neural subspaces were used for processing tactile stimuli under each of the two rules, in both sensory and motor cortical areas.
We next asked if the rule-dependent separation of subspaces during stimulus delivery was related to how the subspaces were separated prior to the tactile stimulus. This would provide evidence that differences in tactile stimulus processing follow from differences in the state of the neural population at the time the stimulus is received. We found that, across the cortical areas, the subspace overlaps for tCR trials calculated from periods before and after the tactile stimulus onset were correlated (Fig. 4f; Pearson correlation, r = 0.68, p < 0.001; 31 sessions). This indicates that the shift between neural subspaces associated with each rule occurred prior to stimulus delivery and should thus impact processing of the stimulus.
Together, our results suggest that cortical populations are “pre-configured” according to the current rule, such that an incoming sensory signal leads to distinct processing and ultimately distinct actions.
Choice coding dimensions change with task rule
Gating of sensory information involves changing how sensory information is represented and read out32. This can be achieved by shifting sensory and/or choice coding dimensions in the population activity space34,46. In the previous section, we showed that neural subspaces containing trial dynamics changed between the two task rules. We next asked whether stimulus and choice coding dimensions within these subspaces also shifted with task rules. For each task rule, we estimated coding dimensions (CDs) that maximally discriminated the neural trajectories for different conditions47,48 (Fig. S4a-d; Methods). Since mice rarely licked the wrong port for a given block (Fig. 1b), right-lick and no-lick trials were used to obtain choice coding dimensions for the respond-to-touch blocks, and left-lick and no-lick trials for the respond-to-light blocks. To assess whether stimulus and choice CDs changed with task rule, we calculated for each session the dot product between the CDs obtained from respond-to-touch and respond-to-light blocks. We then used the magnitude of this dot product as an unsigned measure of the relative orientations of the CDs. In ALM, the dot product magnitudes calculated between stimulus CDs for the two block types were not significantly different from those calculated after shuffling trial-type labels (Fig. S4e; significance defined as non-overlap of 95% CIs). This suggests that the stimulus CD in a respond-to-touch block had an orientation unrelated to that in a respond-to-light block. In contrast, we found that S1, S2 and MM had stimulus CDs that were significantly aligned between the two block types (Fig. S4e; magnitude of dot product between the respond-to-touch stimulus CDs and the respond-to-light stimulus CDs, mean ± 95% CI for true vs shuffled data: S1: 0.5 ± [0.34, 0.66] vs 0.21 ± [0.12, 0.34], 10 sessions; S2: 0.62 ± [0.43, 0.78] vs 0.22 ± [0.13, 0.31], 8 sessions; MM: 0.48 ± [0.38, 0.59] vs 0.24 ± [0.16, 0.33], 9 sessions; ALM: 0.33 ± [0.2, 0.47] vs 0.21 ± [0.13, 0.31], 13 sessions). In contrast, the choice CDs for the two block types were not aligned well in S1, S2, MM, or ALM (Fig. S4f; magnitude of dot product between the respond-to-touch choice CD and the respond-to-light choice CD, mean ± 95% CI for true vs shuffled data: S1: 0.28 ± [0.15, 0.43] vs 0.21 ± [0.12, 0.33], 10 sessions; S2: 0.22 ± [0.11, 0.33] vs 0.21 ± [0.13, 0.32], 8 sessions; MM: 0.22 ± [0.13, 0.33] vs 0.22 ± [0.14, 0.3], 9 sessions; ALM: 0.27 ± [0.16, 0.39] vs 0.21 ± [0.13, 0.31], 13 sessions).
Choice coding dimensions were obtained from left-lick and no-lick trials in respond-to-touch blocks and right-lick and no-lick trials in respond-to-light blocks. Because the required lick directions differed between the block types, the difference in choice CDs across task rules (Fig. S4f) could have been affected by the different motor responses. To rule out this possibility, we did a new version of this analysis using right-lick and left-lick trials to calculate the choice coding dimensions for both task rules. We found that the orientation of the choice coding dimension in a respond-to-touch block was still not aligned well with that in a respond-to-light block (Fig. S4h; magnitude of dot product between the respond-to-touch choice CD and the respond-to-light choice CD, mean ± 95% CI for true vs shuffled data: S1: 0.39 ± [0.23, 0.55] vs 0.2 ± [0.1, 0.31], 10 sessions; S2: 0.32 ± [0.18, 0.46] vs 0.2 ± [0.11, 0.3], 8 sessions; MM: 0.35 ± [0.21, 0.48] vs 0.18 ± [0.11, 0.26], 9 sessions; ALM: 0.28 ± [0.17, 0.39] vs 0.21 ± [0.12, 0.31], 13 sessions).
Overall, these results suggest that the different subspaces for tactile processing under the two rules result at least in part from changes in choice coding dimensions.
Pre-stimulus states in motor cortex track behavioral rule switches
Task rules switched multiple (3-5) times in each behavioral session. Mice detected a rule switch either through trial and error during the first few trials after the switch, or when a drop of water from the correct reward port was given on the 9th trial (which served as a cue to ensure that mice switched by this point; Fig. 1b). The probabilities of right-licks and left-licks showed that the mice switched their motor responses during block transitions depending on task rules (Fig. 5a, mean ± 95% CI across 12 mice). We used the first hit trial as the mark of a successful behavioral switch and found that mice switched before or immediately after the cue (Fig. 5b, total number of block switches: respond-to-touch: 88 switches, respond-to-light: 91 switches). To analyze how pre-stimulus states changed over the course of a rule transition, we defined a “transition period” that spanned from the first trial after a block switch until the first hit trial of the new block. In addition, we divided each transition period into “early” and “late” parts based on the occurrence of the first false alarm trial of the new block (Fig. 5c). We considered the first false alarm trial to be the point at which the mouse first received feedback to indicate that a block change had occurred.
We hypothesized that the behavioral change that marked a successful switch in rule application would be accompanied by a neural change. Specifically, that the pre-stimulus population state would progress from that typical of the prior type of block to that typical of the new type of block in parallel with the behavioral shift. To test this, we trained an LDA classifier to discriminate respond-to-touch block and respond-to-light block trials using pre-stimulus neural activity. We used 90% of the correct trials as training data and tested classifier performance on the held-out 10% of correct trials (Fig. 5d,e). Using our “transition period” definition as described above, we tested classifier performance on “early” transition and “late” transition trials (Fig. 5c). For respond-to-touch to respond-to-light block transitions, the fractions of trials classified as respond-to-touch for MM and ALM decreased progressively over the course of the transition (Fig. 5d; rank correlation of the fractions calculated for each of the separate periods spanning the transition, Kendall’s tau, mean ± 95% CI: MM: −0.39 ± [−0.67, −0.11], 9 sessions, ALM: −0.29 ± [−0.54, −0.04], 13 sessions; criterion to be considered significant: 95% CI on Kendall’s tau did not include 0). Similarly, the fractions of trials classified as respond-to-touch increased progressively over the course of transitions from respond-to-light to respond-to-touch blocks (Fig. 5e; Kendall’s tau, mean ± 95% CI: MM: 0.37 ± [0.07, 0.63], 9 sessions, ALM: 0.27 ± [0.03, 0.49], 13 sessions). Accuracies for classification of trials by block type based on S1 and S2 activity were uniformly poor and showed no clear trends across block transitions (Fig. S5; mean ± 95% CI on Kendall’s tau for respond-to-touch → respond-to-light transitions: S1: −0.16 ± [−0.43, 0.1], 10 sessions, S2: −0.15 ± [−0.54, 0.21], 8 sessions; for respond-to-light → respond-to-touch transitions: S1: 0.21 ± [−0.07, 0.5], 10 sessions, S2: 0.25 ± [−0.17, 0.63], 8 sessions). Together, these results indicate that the pre-stimulus states of neural populations in MM and ALM shifted over the course of block transitions in a manner commensurate with the behavioral shift in rule application.
Disruption of pre-stimulus motor cortical state impairs rule-dependent tactile detection
So far, we have shown that pre-stimulus neural population states in the somatosensory and motor cortical areas differed between the two task rules. This suggests that pre-stimulus activity may play a critical role in our task. To test this, we bilaterally inhibited the different cortical areas shortly before stimulus onset via optogenetic activation of parvalbumin-positive (PV) GABAergic neurons18,49 (Fig. 6a, middle panel; −0.8 to 0 seconds from stimulus onset). Additionally, we included two other areas of the dorsal cortex, the anteromedial part of the motor cortex (AMM) and the posterior parietal cortex (PPC). We recorded from AMM in our cross-modal sensory selection task and observed visually-evoked activity (Fig. S1i-k), suggesting that AMM may play an important role in rule-dependent visual processing. PPC contributes to multisensory processing51–53 and sensory-motor integration50,54–58. Therefore, we wanted to test the roles of these areas in our cross-modal sensory selection task. Finally, we also performed negative control (sham) sessions that were identical except that the optogenetic light path was obstructed.
We defined detection sensitivity for tactile stimuli as the difference between tactile hit rate and visual false alarm rate during the response-to-touch blocks. Tactile detection sensitivity was significantly decreased when MM and ALM but not S1 and S2 were inhibited during the prestimulus period (Fig. 6c; criterion to be considered significant: 95% CI on Δ tactile sensitivity did not include 0; S1/S2: [−0.47, 0.08], 4 mice, 10 sessions; MM: [−0.65, −0.12], 4 mice, 11 sessions; ALM: [−0.68, −0.02], 4 mice, 10 sessions). This was primarily due to a reduction in the tactile hit rate (Fig. S6b; 95% CI on Δ tactile hit rate: S1/S2 [−0.31, 0.01], 4 mice, 10 sessions; MM [−0.49, −0.15], 4 mice, 11 sessions; ALM [−0.46, −0.07], 4 mice, 10 sessions). Inhibition of S1, S2, MM, and ALM during a 2-second window starting at the time of the stimulus onset served as a positive control (Fig. 6a, right panel). Consistent with previous studies18,50, inhibition of these cortical areas after stimulus onset reduced detection sensitivity for tactile stimuli (Fig. 6d; 95% CI on Δ tactile sensitivity: S1/S2 [−0.67, −0.08], 4 mice, 10 sessions; MM [−0.6, −0.34], 4 mice, 11 sessions; ALM [−0.55, −0.27], 4 mice, 10 sessions). Neither the negative control (sham) condition, nor inhibition of AMM before stimulus onset, showed an effect on the detection sensitivity for tactile stimuli (Fig. 6c; sham: [−0.12, 0.13], 7 mice, 28 sessions; AMM [−0.19, 0.06], 7 mice, 21 sessions). Together, our results suggest that the pre-stimulus network activity states in MM and ALM play an important role in rule-dependent tactile processing.
We defined detection sensitivity for visual stimuli as the difference between visual hit rate and tactile false alarm rate during the respond-to-light blocks. Visual detection sensitivity was not affected when S1 and S2 were inhibited before (Fig. S7g; 95% CI on Δ visual sensitivity: [−0.27, 0.05]) or after (Fig. S7h; [−0.44, 0.04], 4 mice, 10 sessions) the stimulus onset. For pre-stimulusonset inhibition of the motor cortical areas, visual detection sensitivity was decreased when MM and AMM but not ALM were suppressed (Fig. S7g; MM [−0.6, −0.18], 4 mice, 11 sessions; ALM [−0.5, 0.04], 4 mice, 10 sessions; AMM [−0.35, −0.08], 7 mice, 21 sessions). Inhibition of any of the three motor cortical areas after stimulus onset caused a reduction in visual sensitivity (Fig. S7h; MM [−0.83., −0.52]; ALM [−0.84, −0.39]; AMM [−0.72, −0.36]). Together, these results indicate that pre-stimulus states of neural populations in the medial parts of motor cortex, such as MM and AMM, are critical for processing visual stimuli in a rule-dependent manner.
PPC is involved in multisensory processing51–53 and decision-making54–58. Here we tested for a critical role of PPC in our cross-modal selection task (7 mice, 21 sessions total). Detection sensitivities for both tactile and visual stimuli were decreased when PPC was inhibited before (Fig. 6c; Δ tactile sensitivity: [−0.4, −0.06]; Fig. S7g; Δ visual sensitivity [−0.31, −0.05]) or after the stimulus onset (Fig. 6d; Δ tactile sensitivity: [−0.43, −0.04]; Fig. S7h; Δ visual sensitivity: [−0.54, −0.07]). These reductions in tactile and visual detection sensitivities were caused by a decrease in hit rate and/or an increase in false alarm rate. In general, inhibition of PPC before stimulus onset decreased the hit rate (Fig. S6b; Δ tactile hit [−0.36, −0.05]; Fig. S7b; Δ visual hit [−0.26, 0]), whereas inhibition of PPC after stimulus onset increased the false alarm rate (Fig. S6f; Δ visual false alarm [0.13, 0.38]; Fig. S7f; Δ tactile false alarm [0, 0.25]).
It is possible that disruption of pre-stimulus states may affect aspects of tactile sensory processing and/or lick production that are unrelated to rules. To exclude this possibility, in a new cohort of mice (n = 5), we inhibited each cortical area in either the pre- or the post-stimulus-onset period during performance of a simple tactile detection task (Fig. 6e-g). In this task, mice had only to report with Go/NoGo licking whether a whisker was deflected, without rule-switching components to the task or the need to suppress responses to distracting stimuli. We found that tactile sensitivity was decreased when the sensory and motor cortical areas were inhibited after but not before stimulus onset (Fig. 6f,g; 95% CI on Δ tactile sensitivity for pre-stimulus-onset inhibition: S1/S2 [−0.34, 0.18], 3 mice, 3 sessions; MM [−0.23, 0.06], 3 mice, 5 sessions; ALM [−0.28, 0.07], 3 mice, 4 sessions; Δ tactile sensitivity for post-stimulus-onset inhibition: S1/S2 [−0.59, −0.31]; MM [−0.59, −0.25]; ALM [−0.45, −0.1]). This was mainly caused by the decrease in hit rate in the post-stimulus-onset inhibition (Fig. S6i; 95% CI on Δ tactile hit rate: S1/S2 [−0.61, −0.28]; MM [−0.67, −0.2]; ALM [−0.73, −0.05]). Inhibition of AMM and PPC did not influence tactile sensitivity in either inhibition condition (Fig. 6f,g; Δ tactile sensitivity for pre-stimulus-onset inhibition: AMM [−0.25, 0.16], 3 mice, 4 sessions; PPC [−0.12, 0.1], 3 mice, 4 sessions; Δ tactile sensitivity for post-stimulus-onset inhibition: AMM [−0.17, 0.18]; PPC [−0.14, 0.3]). Together, these results show that inhibition immediately prior to stimulus delivery did not impact the performance of a simple tactile detection task in which the stimulus-response rule remained fixed.
We conducted an additional analysis to rule out the possibility that the behavioral effects of cortical inhibition we observed could be due simply to a deficit in lick production per se. Specifically, we compared the probability of licking in laser-only trials (catch trials where there was no sensory stimulus) with the probability of licking during intertrial intervals, for both the cross-modal selection task and the simple tactile detection task (Fig. 6h). Lick probability was unaffected during S1, S2, MM and ALM experiments for both tasks, indicating that the behavioral effects were not due to an inability to lick (Fig. 6i, j; 95% CI on Δ lick probability for cross-modal selection task: S1/S2 [−0.18, 0.24], 4 mice, 10 sessions; MM [−0.31, 0.03], 4 mice, 11 sessions; ALM [−0.24, 0.16], 4 mice, 10 sessions; Δ lick probability for simple tactile detection task: S1/S2 [−0.13, 0.31], 3 mice, 3 sessions; MM [−0.06, 0.45], 3 mice, 5 sessions; ALM [−0.18, 0.34], 3 mice, 4 sessions).
Taken together, our results suggest that the pre-stimulus states of motor cortical networks play an important role in rule-dependent sensorimotor transformations.
DISCUSSION
Here we investigated how rules modulate the transformation of tactile signals into actions across a set of key sensory-motor cortical areas comprising S1, S2, MM and ALM. We found that neural activity prior to stimulus delivery reflected task rules at both the single-neuron and population levels in each area, but more prominently so in the motor cortical areas MM and ALM. Across the areas examined, each of the two task rules was associated with its own neural subspace for processing tactile signals. In ALM and MM, pre-stimulus population states shifted concomitantly with the behavioral signs of a rule switch. Optogenetic inhibition of motor cortical areas during the pre-stimulus period impaired tactile detection during the cross-modal selection task, but not during a simpler tactile detection task that did not require switching among rules. Together, our results suggest that the neural population states in motor cortical areas ALM and MM play an important role in transforming sensory stimuli into actions in a flexible, rule-dependent manner.
The responses to tactile stimuli were enhanced in S1, S2, MM and ALM when they were behaviorally relevant according to the current rule (Fig. 1h). In our task, relevance relates to reward, movement preparation, and movement, factors known to influence activity across many brain areas21,23,59–62. We chose not to attempt to dissociate “relevance” from these factors in our task design, given that they are linked in many natural scenarios63,64. Below we address potential concerns and confounding effects associated with reward and movement.
First, neural responses on tactile hits and tactile false alarms were similar, despite the fact that hits but not false alarms were rewarded (proportions of neurons showing a significant difference in mean response between tactile hits and tactile false alarms: S1 (0/177); S2 (2/162); MM (0/140); ALM (6/256); permutation tests on PSTHs with Bonferroni correction for multiple comparisons; Methods). Second, we minimized movement effects by limiting the analysis window to a period that preceded 97% of lick onsets (0 to 150 ms from stimulus onset). We also included censor and grace periods (Methods) to reduce the impact of compulsive licking, and excluded trials with licking during the one second window preceding stimuli. We note that motor-related signals need not always occur together with overt movement. For instance subthreshold stimulation of the frontal eye fields in primates can drive V4 activity and mimic the effects of attention without causing eye movements65,66.
Responses to tactile stimuli within a respond-to-light block were significantly reduced but still observable in ALM (Fig. 1g and Fig. S1g). This suggests that gating of tactile information likely occurred in part prior to ALM67–69. In contrast, we did not observe visually-evoked activity in ALM (Fig. 1g and Fig. S1g-h). This modality bias is consistent with the long-range connectivity between sensory and frontal areas. Specifically, the somatosensory cortex is connected to the motor cortex, whereas the visual cortex is connected to the anterior cingulate cortex (ACC) 70. Also consistent with this anatomy is that we observed visually-evoked activity in the anterior part of the ACC (Fig. S1i-k; the anteromedial part of the motor cortex, AMM), and ACC has been shown to modulate V1 activity in rodents24.
In our task, the right or left water port was the rewarded port in a respond-to-touch block or a respond-to-light block, respectively. Although the mice could not anticipate stimulus types and licking responses during the intertrial interval, there might be a subtle bias of posture and movement across blocks given the different positions of the rewarded port. To reduce the effects of movement bias on pre-stimulus activity (−100 to 0 ms), we removed trials with licking during a one second window before stimulus onset. Moreover, in a separate study using the same task (Finkel et al., unpublished), high-speed video analysis demonstrated no significant differences in whisker motion between respond-to-touch and respond-to-light blocks in most (12 of 14) behavioral sessions.
We found that the neural subspaces containing population activity patterns were different during respond-to-touch and respond-to-light rules. Specifically, S1, S2, MM, and ALM showed lower subspace overlaps when calculated between tactile hits and tactile correct rejections than when calculated between tactile hits and control (held-out) tactile hits (Fig. 4e). These subspace differences could result from: (1) involvement of different sets of neurons; (2) differently signed changes in firing patterns (such as some neurons fire more and others fire less); and/or (3) differently scaled changes in firing patterns (such as the firing rates of some neurons do not change and the firing rates of other neurons increased two-fold).
We found that how well neural subspaces were separated during tactile processing was associated with how well they were separated prior to the stimulus (Fig. 4f). This result is consistent with a dynamical systems view of neural population processing, where initial conditions are of critical importance36,38. Bringing the population activity to an optimal initial state could allow the evolution of the population activity to produce the desired movement. Primate studies have shown that motor cortical areas implement this strategy to control movement25,37,71. Here we identified a potentially similar role for motor cortex activity states in the transformation of incoming sensory signals into actions in a rule-dependent manner. In addition, we found that not only motor but also sensory cortical areas had initial states that varied with the current rule. Indeed, pre-stimulus activity has been shown to encode rule information in primate visual cortex 72 and rodent auditory cortex11. Overall, these findings indicate that setting up appropriate initial states could be a general strategy by which cortical networks integrate external inputs to achieve context-specific processing.
No-lick trials included misses, which could be caused by mice not being engaged in the task. While the majority of no-lick trials were correct rejections (respond-to-touch: 75%; respond-to-light: 76%), we treated no-licks as one of the available choices in our task and included them to calculate choice coding dimensions (Fig. S4c,d,f). To ensure stable and balanced task engagement across task rules, we removed the last 20 trials of each session and used stimulus parameters that achieved similar behavioral performance for both task rules (Fig. 1d; ∼75% correct for both rules). However, we also calculated choice coding dimensions using only right- and left-lick trials. In S1, S2, MM and ALM, the choice CDs calculated this way were also not aligned well across task rules (Fig. S4h), consistent with the results calculated from lick and no-lick trials (Fig. S4f). Data were limited for this analysis, however, because mice rarely licked to the unrewarded water port (# of licksunrewarded port / # of lickstotal, respond-to-touch: 0.13, respond-to-light: 0.11). These trials usually came from rule transitions (Fig. 5a) and, in some cases, were potentially caused by exploratory behaviors. These factors could affect choice CDs.
Inhibition of the motor cortical areas prior to stimulus delivery slightly but significantly impaired tactile detection in the respond-to-touch rule and visual detection in the respond-to-light rule (Fig. 6c and Fig. S7g). However, the ability to detect sensory stimuli was not completely abolished. This suggests that circuits for encoding the task rules may be redundant and/or other gating mechanisms may be involved8. Indeed, the pre-stimulus activity in either MM and ALM could be used to decode the task rules, although we inhibited only one area at a time. Additionally, it has been shown that loops between ALM and subcortical regions including thalamus and cerebellum maintain persistent activity during short-term memory33,73. It is possible that recurrent circuits across multiple brain areas contribute to holding rule information during intertrial intervals.
To test for a rule-specific function of pre-stimulus states, we used a simple tactile detection task to assess the potential effects of inhibition on sensory processing and lick production (Fig. 6e-g). We found that inhibition of the pre-stimulus states of MM and ALM only reduced the detection sensitivity for tactile stimuli in the cross-modal selection task but not in the simple tactile detection task (Fig. 6c,f), suggesting a role specific to flexible rule-dependent sensorimotor transformations. We balanced the behavioral performances in these tasks (∼75% correct) via the adjustment of stimulus intensity to make the task difficulties similar. However, the effects of silencing cortex can also depend on factors that we did not probe, such as the time course of an area’s task involvement74. To more precisely dissect the effects of perturbations on different cognitive processes in rule-dependent sensory detection, more complex behavioral tasks and richer behavioral measurements are needed in the future.
Pre-stimulus activity in MM and ALM showed a strong dependence on the current rule (Figs. 2,3,5), correlated with aspects of subsequent tactile processing (Fig. 4), and was required for tactile detection during the cross-modal selection task (Fig. 6c). These motor cortical areas are therefore likely to play an important role in the rule-dependent sensorimotor transformations occurring within cortical networks75. A greater rule-dependence of activity in motor compared with sensory areas is consistent with primate visual and somatosensory studies showing that attention effects become more prominent in higher-order areas76–78. Moreover, a number of studies in primates and rodents have shown that sensory-related responses in sensory cortical areas are modulated by motor cortical areas16,63,65,79,80. It is possible that MM and ALM received rule information from other brain regions such as the medial prefrontal cortex8,73,81,82 and send this information to S1 and S2 in the cross-modal selection task. Future work is needed to identify and characterize the neural circuits responsible for implementation, encoding, and updating of rules3.
AUTHOR CONTRIBUTIONS
Conceptualization, Y.-T.C., E.A.F. and D.H.O.; Investigation, Y.-T.C.; Methodology, Y.-T.C., E.A.F. and D.X.; Formal Analysis, Y.-T.C.; Writing ⎼ Original Draft, Y.-T.C. and D.H.O.; Writing ⎼ Review & Editing, Y.-T.C. and D.H.O.; Funding Acquisition, D.H.O.; Resources, D.H.O.; Supervision, D.H.O.
METHODS
Mice
All procedures were performed in accordance with protocols approved by the Johns Hopkins University Animal Care and Use Committee. Twelve mice (8 male, 4 female) were obtained by crossing PVcre lines83 (Jackson Labs: 008069) with Ai32 lines84 (Jackson Labs: 012569). Seven PVcre; Ai32 mice (5 male, 2 female) were trained to perform the cross-modal selection task and included in behavioral and optogenetic inhibition experiments. Five PVcre; Ai32 mice (3 male, 2 female) were trained to perform the tactile detection task and included in optogenetic inhibition experiments. Four male mice included in behavioral experiments were obtained by crossing Emx1cre mice85 (Jackson Labs: 005628) with Ai32 mice. Two male mice included in behavioral experiments were heterozygous VGATChR2-EYFP (Jackson Labs: 014548)86. Mice ranged in age from 2-5 months at the start of training. Mice were housed in a vivarium with a reverse light-dark cycle (12 h each phase), and were singly housed after surgery and during behavioral experiments. Details of assignment to different experimental conditions are listed in Table 1.
Behavioral task
All behavioral experiments were conducted with head-fixed mice during the dark phase. Behavioral apparatus was controlled by BControl software (C. Brody, Princeton University). Four to 7 days after a headpost implantation and 7-14 days before behavioral training, mice were allowed 1 mL of water daily until reaching ∼80% of their starting body weight. On training days, mice were allowed to perform until sated (∼ 1 hr/ day) and were weighed before and after each session to determine the amount of water consumed. Additional water was given if mice consumed <0.3 mL of water in order to maintain a stable body weight. On days when their behavior was not tested, they received 1 mL of water.
Cross-modal sensory selection task
The cross-modal sensory selection training consists of two stages. Mice were first trained to perform tactile and visual detection separately, then trained on the cross-modal selection task where tactile and visual stimuli were randomly interleaved.
In the first 1-2 sessions, mice were acclimated to head fixation in the behavioral apparatus while being given free access to water via two reward ports located 6-10 mm and ∼35 degrees to the left and right of the mouse midline. In subsequent sessions, mice were randomly assigned to start with tactile or visual detection training. After the hit rate of one modality reached >70% (∼3 days; hit rate = 100*(# hits) / (# hits + # misses)), stimulus detection training of the other modality began. For tactile detection training, a single whisker (always on the right whisker pad) was threaded into a glass pipette attached to a piezo actuator (D220-A4-203YB, Piezo Systems), which was driven by a piezo controller (MDTC93B, Thorlabs). Approximately 1.5 mm of whisker remained exposed at the base. All whiskers except the target whisker were trimmed to near the base. Mice were given a drop of water (∼6 µl) for licking to the right reward port in response to a tactile stimulus (1 s sinusoidal deflections at 40 Hz, ∼1400 deg / s) during an answer period (0.1 to 2 s from stimulus onset). For visual detection training, mice were rewarded for licking to the left water port in response to a visual stimulus. Each visual stimulus comprised 470 nm light (1 s flash at ∼5 mW) generated by an LED (M470F1 LED driven by LEDD1B, Thorlabs) and emitted from the tip of an optic fiber (105 µm diameter, 0.22 NA; M43L01, Thorlabs) positioned 5.5 cm away from the tip of the mouse’s nose along its midline. To reduce compulsive licking, licks that occurred within a “grace period” (0 to 0.1 s from stimulus onset) were not rewarded. Licks occurring in a “censor period” (−0.2 to 0 s from stimulus onset) resulted in the withholding of the stimulus presentation for that trial and no reward or punishment. Trials with licks occurring in the grace and censor periods were omitted from analysis. In all sessions, ambient white noise (cut off at 40 kHz, ∼80 dB SPL) was played to mask any potential sound associated with movement of the piezo stimulator.
After the stimulus-detection training, mice were trained to perform the cross-modal selection task. Tactile and visual stimuli were randomly interleaved (subject to a limit of 4 consecutive trials of the same type) and trials were separated by a random interval (3.5 s fixed interval + random interval drawn from an exponential distribution with mean 4 s). Trials were grouped into either respond-to-touch or respond-to-light blocks (54-66 trials per block from an uniform distribution with mean 60 trials). Each session randomly began with one of two block types, and the block types subsequently alternated multiple times (4-6 blocks per session). In respond-to-touch blocks, mice were rewarded with a drop of water if they licked the right reward port following tactile but not visual stimuli. In respond-to-light blocks, mice were rewarded with a drop of water for licking the left reward port following visual but not tactile stimuli. The answer, grace and censor periods were as described above for the stimulus-detection training.
Four trial outcomes were defined based on block types, sensory stimuli, and responses (Fig. 1c). Trials in which mice licked to the correct reward port following tactile stimuli in respond-to-touch blocks or visual stimuli in respond-to-light blocks were scored as “hit” trials. Failures to lick to the correct port after tactile stimuli in respond-to-touch blocks or visual stimuli in respond-to-light blocks were scored as “miss” trials. Licks to either reward port after tactile stimuli in respond-to-light blocks, visual stimuli in respond-to-touch blocks, or to the incorrect reward port after either stimulus type, resulted in “false alarm” trials. Trials in which mice correctly withheld licks after tactile stimuli in respond-to-light blocks, or after visual stimuli in respond-to-touch blocks, were scored as “correct rejection” trials. Performance was quantified as percent correct: 100*(# hits + # correct rejections) / (# of trials total).
In an initial stage of cross-modal selection training (∼7 sessions), a drop of water from the rewarded port was automatically released following 80% behaviorally relevant stimuli. Subsequently, automatically released water only occurred on the 9th trial after a block switch. Once performance reached >70% of trials correct, task difficulty was gradually increased by reducing stimulus intensity and duration. In a final stage of training, faint stimuli (tactile: 0.15 s sinusoidal deflections at 20 Hz, ∼800 deg/s; visual: 0.15 s flash at ∼3 µW) were used to increase cognitive load and to result in error trials for analysis. Mice were considered trained when performance reached >70% correct for at least three consecutive days. After reaching this performance criterion, mice proceeded with test sessions. Seven PVcre; Ai32 mice performed the cross-modal task during inhibition experiments, and six of them continued for electrophysiology recordings. Other transgenic mice were given test sessions for electrophysiology recordings but not optogenetic experiments.
Behavioral sessions lasted until mice were sated. To ensure stable engagement, the last 20 trials of each session were removed from further analysis. In addition, sessions were omitted from analysis if overall performance was < 60% correct, at least one of block performances (respond-to-touch or respond-to-light blocks) was < 55% correct, or at least one of hit rates (tactile or visual hits) was < 35%. Three sessions in total were removed for these reasons (from two mice).
Tactile detection task
Head-fixed mice were trained to perform a go/no-go tactile detection task. On Go trials, the whisker was deflected (0.15 s sinusoidal deflections at 20 Hz, ∼600 deg/s). If mice licked the right reward port following a tactile stimulus, a drop of water was released and it was scored as a hit trial. If mice failed to respond to a tactile stimulus, it was scored as a miss trial. On NoGo trials, the target whisker was not deflected. If mice licked during the answer period, it was scored as a false alarm. If mice withheld licking, it was scored as a correct rejection. Go and NoGo trials were randomly interleaved (subject to a limit of 4 consecutive trials of the same type), and no trial-start cue was presented. The answer, grace and censor periods were as described above for the cross-modal selection task. Tactile stimuli of the tactile detection task were slightly weaker compared with the cross-modal selection task in order to control task difficulties by making behavioral performance similar (∼75% correct).
Similar to the cross-modal selection task, the last 20 trials in each session were excluded, and sessions with performance < 60% correct or tactile hit rate < 35% were removed from subsequent analysis. Five PVcre; Ai32 mice performed the tactile detection task during inhibition experiments.
Surgery
Prior to behavioral testing, mice were implanted with clear-skull caps18 and metal headposts designed to expose a large area of the dorsal surface of the skull. During surgery, mice were anesthetized under isoflurane (1-2% in O2; Surgivet) and mounted in a stereotaxic apparatus (David Kopf Instruments) with a thermal blanket (Harvard Apparatus). Mice were given a subcutaneous injection of Marcaine or Lidocaine for local analgesia and an intraperitoneal injection of Ketoprofen to reduce inflammation. The scalp and periosteum over the dorsal surface of the skull were removed. To expose S2 on the left hemisphere, the left temporal muscle was detached and the bone ridge at the temporal-parietal junction was thinned using a dental drill. Headposts were fixed to the skull over the lambda structure using clear adhesive luting cement (C&B Metabond Quick Adhesive Cement System; Parkell). A thin layer of clear cement followed by an additional layer of cyanoacrylate glue (Krazy Glue) was applied to the entire surface of the exposed skull, leaving it largely transparent. To protect the clear skull from scratching, a silicone elastomer (Kwik-Cats) was applied prior to optogenetic experiments. Intrinsic signal imaging (ISI) was used to guide the whisker parts of S1 and S2 for neural recordings and optogenetic experiments87,88. Mice were lightly anesthetized with isoflurane (0.5-1%) and chlorprothixene. The C2 or C3 whisker was stimulated with a Piezo at 10 Hz. Since S2 is close to the auditory cortex, white noise was played during imaging.
For silicon probe recording, a small craniotomy (∼1 mm in diameter) over the recording site (always on the left hemisphere) was made (S1 and S2 determined by ISI; MM: 1.5 mm anterior, 1.0 mm lateral; ALM: 2.5 mm anterior, 1.5 mm lateral to bregma). The dental acrylic and skull was thinned using a dental drill and the remaining bone was removed with a tungsten needle or forceps. A separate, smaller craniotomy (∼0.6 mm in diameter) on the right hemisphere was made for implantation of a ground screw (0.6 mm anterior, 3.0 mm lateral to bregma; S1 trunk region). Additional craniotomies were usually made in new locations after finishing recordings in previous ones (12 mice; 1-4 recording sites per mouse).
Electrophysiology and data preprocessing
Linear 64-channel probes (H3, Cambridge NeuroTech) were coated with DiI (saturated) or DiD (5-10mg/mL) to histologically verify the site of recording post hoc. The silicon probe was inserted into the cortex either vertically (for MM and ALM) or at ∼ 40 degrees from vertical (for S1 and S2). After probe insertion, the brain was covered with a layer of 1.5% agarose and ACSF and was left for ∼10 minutes prior to recording.
Neural signals and behavioral timestamps were recorded using an Intan system (RHD2000 series multi-channel amplifier chip; Intan Technologies). Neural signals were sampled at 30 kHz. Kilosort was used for spike sorting89 and spike clusters were manually curated using Phy. Units were excluded from further analysis if the rate of inter-spike-interval violations within a 1.5 ms window was >0.5%, L-ratios were >0.1, the presence of spikes was <90% of the whole session, the cumulative drift of spike depth was >40 um (Unit Quality Metrics, Allen Institute).
For analyses about stimulus-evoked responses, neural spike rates were calculated in10 ms bins and smoothed with a Gaussian kernel (50 ms). For analyses about pre-stimulus activity, neural spike rates were calculated in 100 ms bins without smoothing. Spike rates of simultaneously recorded neurons were normalized for all population-level analyses including linear discriminant analysis and principal component analysis. We used soft normalization to make activity in a roughly unity range and to reduce the impact of units with low firing rate (normalized response = (response-mean(response))/(range(response)+5))27,45. In addition, to minimize movement effects on neural activity during the pre-stimulus window (−1 to 0 s from stimulus onset), trials with licking occurring in this window were removed (∼25%).
Optogenetic inhibition
PVcre; Ai32 mice implanted with clear-skull caps were given optogenetic inhibition experiments after behavioral task training (cross-modal selection or tactile detection). Laser stimuli (473 nm; MBL-III-473-100, Ultralasers) were directed to the brain via optic fibers (200 μm diameter, 0.22 NA; TM200FL1B, Thorlabs) positioned over (∼2 mm above) the cortical areas bilaterally (8-10 mW each side). For S1 and S2, the left whisker areas were guided by intrinsic signal imaging (as described above) and the right whisker areas were determined as the symmetric positions. Other targeted areas on the dorsal cortex included MM (1.5 mm anterior, 1.0 mm lateral to bregma), ALM (2.5 mm anterior, 1.5 mm lateral), AMM (2.5 mm anterior, 0.5 mm lateral), and PPC (1.94 mm posterior, 1.6 mm lateral). Sham sessions were identical to optogenetic inhibition sessions except that the dorsal cortex was covered by blackout cloth in order to not inhibit any brain areas. For each session, one of the cortical areas or the sham condition was randomly assigned for inhibition. A cone, blackout cloth and tape were used to shield the mouse’s eyes from scattered light due to the laser.
For the cross-modal sensory selection task, laser stimuli were delivered to the targeted brain areas in ∼30% of tactile and visual stimulus trials. In around half of these trials, optogenetic inhibition occurred before stimulus onset to suppress baseline activity (−0.8 to 0 s from stimulus onset, 40 Hz sinusoidal waveform with a 0.1 s linearly modulated ramp-down at the end). In the other half of these trials, optogenetic inhibition began simultaneously with the stimulus onset to suppress sensory-evoked activity (0 to 2 s from stimulus onset, 40 Hz sinusoidal waveform with a 0.2 s linearly modulated ramp-down at the end). Additionally, in a subset of trials (∼20%), laser stimuli were delivered alone. These “laser only” trials consisted of short (0.8 s) and long (2 s) trains of laser pulses that are identical to the laser stimuli in pre-stimulus-onset and post-stimulus-onset conditions respectively.
For the tactile detection task, laser stimuli were delivered in ∼30% of trials. Among these laser trials, Go trials consist of approximately half and half pre-stimulus-onset and post-stimulus-onset inhibition. Since there was no tactile stimulus in NoGo trials, short (0.8 s) and long (2 s) laser stimuli were delivered before and after trial onset respectively. Laser stimuli are identical to those used in the cross-modal selection task.
Baseline behavioral performance was measured by trials without laser stimuli and used to determine if a session passed the criteria of good performance (as described in the behavioral task section). In addition, sessions with laser catch rates >75% or > hit rates were removed from analysis because a high laser catch rate indicates that mice detected laser stimuli instead of tactile or visual stimuli (laser catch rate = 100*(# of laser only trials in which licking occurred) / (# of laser only trials total)).
Single-neuron discrimination analyses
Receiver-Operating Characteristic (ROC) analysis
ROC analysis was used to calculate how well trial-by-trial activity of a single neuron could discriminate certain conditions (e.g., tactile hit vs tactile correct rejection). The area under the ROC curve (AUC) represents the performance of an ideal observer in discriminating trials based on these conditions (MATLAB “perfcurve”). For discriminability of touch-evoked activity between task rules (Fig. 1h), tactile correct trials were split into tactile hits (respond-to-touch) and tactile correct rejections (respond-to-light). The analysis window was the first 150 ms after stimulus onset to minimize any movement effects resulting from licking. A Bonferroni corrected 95% confidence interval for AUC was obtained by bootstrap. For each time bin (10ms), if its 95% CI did not include the chance level (0.5), it was considered significant. We defined a unit as showing significant tHit-tCR selectivity when three consecutive time bins (>30 ms) of AUC values were significant.
For discriminability of pre-stimulus activity between task rules (Fig. 2c), correct trials were split based on block types (respond-to-touch: tactile hits and visual correct rejections; respond-to-light: visual hits and tactile correct rejections). The analysis window was the 100 ms window before stimulus onset. For discriminability of pre-stimulus activity between stimulus types (Fig. 2d), correct trials were split based on stimulus types rather than block types (tactile: tactile hits and tactile correct rejections; visual: visual hits and visual correct rejections). For discriminability of sensory-evoked activity between stimulus types (Fig. 2e), correct trials were split based on stimulus types, and the analysis window was the first 100 ms after stimulus onset rather than before stimulus onset. For Fig. 2c-e, the analysis window was one time bin (100 ms). If the Bonferroni corrected 95% CI for this time bin did not include the chance level (0.5), it was considered significant.
PSTH-based permutation test
To determine whether water reward affected touch-evoked activity in the cross-modal selection task, we compared the mean PSTHs for tactile hits and for tactile false alarms in which mice licked to the right water port following a tactile stimulus in the respond-to-light blocks (Fig. 1c). For each neuron, the Euclidean distance between the mean PSTHs for tactile hits and tactile false alarms was calculated (0 to 250 ms from stimulus onset). We then performed a permutation test on whether this Euclidean distance was significantly different from zero41,42. A p-value was then calculated using the distribution of resampled Euclidean distances. Significance was determined at the alpha = 0.05 level after Bonferroni correction for the number of neurons.
Population decoding analyses
We used linear discriminant analysis (LDA; MATLAB “fitcdiscr”) to measure how well population activity from simultaneously recorded neurons could decode (1) task rules (respond-to-touch vs respond-to-light) prior to stimulus delivery (−100 to 0 ms from the stimulus onset), (2) stimulus types (tactile vs visual stimuli) prior to stimulus delivery, and (3) stimulus types after stimulus onset (0 to 100 ms). All correct trials were used and classification accuracy was obtained using ten-fold cross validation (Fig. 3 and Fig. S3c,d). In addition, support vector machines (Fig. S3a; MATLAB “fitcsvm”) and Random Forests (Fig. S3b; MATLAB “TreeBagger” with 500 trees) were used to discriminate task rules prior to stimulus delivery. The shuffled data was generated by shuffling the labels for individual trials (e.g. block types).
We also applied LDA to determine how the pre-stimulus states shifted during rule transitions (Fig. 5 and Fig. S5). We used 90% of the correct trials as training data for task rules and the held-out 10% of correct trials to classify trials as having come from respond-to-touch or respond-to-light blocks. We also separately classified trials occurring in the “early transition” and “late transition” periods as having come from one or the other of the block types.
Distance between neural trajectories
We calculated the distance between tHit and tCR trajectories to determine how these trajectories diverged (Fig. 4a,b). For each session, we performed a Principal Components Analysis (PCA) using the trial-averaged tHit and tCR population spike rate responses (−100 to 150 ms from stimulus onset). Population responses for individual tHit and tCR trials were projected onto the top three principal component (PC) space. For each pair of tHit and tCR trials, the Euclidean distances between the neural states of tHit and tCR trajectories at each time point were calculated. The distances between tHit and tCR trajectories were averaged across these pairs in each session.
To investigate the relationship between a difference in pre-stimulus activity and a difference in subsequent sensory-evoked activity, the distances between tHit and tCR trajectories from all recording sessions (total 40 sessions; S1 [10], S2 [8], MM [9], ALM [13]) were ranked based on the distances before stimulus delivery (−100 to 0 ms). The mean tHit-tCR distances after stimulus onset (0 to 150 ms) between the top and bottom 50% groups were compared using a permutation test (Fig. 4c). Specifically, we calculated the Euclidean distance between the mean tHit-tCR distances for these two groups. The group labels were then randomly shuffled, and new mean tHit-tCR distances of the shuffled groups were obtained. The Euclidean distance between these shuffled mean tHit-tCR distances was calculated. This shuffling procedure was repeated 1,000 times, and then the p value was calculated (one-tailed; null hypothesis: no difference; distance >=0).
Subspace overlap
The subspace overlap between tHit and tCR trials was obtained through their variance alignment (Fig. 4d-f). For each session, the trial-averaged tHit activity was used to perform a PCA (0 to 150 ms from stimulus onset). The trial-averaged tCR activity was projected onto the top three principal component (PC) space (tCRtHit-subspace), and the variance explained was calculated. For normalization, a separated PCA was performed on the trial-averaged tCR activity, and its own (tCRtCR-subspace) variance explained was calculated. The subspace overlap was defined as the ratio of the variance explained of tCRtHit-subspace to the variance explained of tCRtCR-subspace. We chose the top three PCs because most of the variances of tHittHit-subspace (∼90%) and tCRtCR-subspace (∼85%) were captured.
To test if the subspace for processing tactile signals significantly changed under different rules, we compared the subspace overlap between tHit and tCR trials with a control group. Specifically, we randomly assigned tactile hit trials into equal sized reference and control groups. The tCR and tHit control group were projected to the PC space of the tHit reference group, and their subspace overlaps were compared (Fig. 4e). To calculate the separation of subspaces prior to stimulus delivery, pre-stimulus activity in tCR trials (−100 to 0 ms from stimulus onset) was projected to the PC space of the tHit reference group and the subspace overlap was calculated. In this analysis, we used tHit activity during stimulus delivery (0 to 150 ms from stimulus onset) to obtain reliable PCs. In addition, the subspace overlap could be overestimated when there were only few neurons in a session (low dimensionality). To avoid this issue, sessions having less than ten units were excluded from this analysis.
Stimulus and choice coding dimensions
For each session, n simultaneously recorded neurons created an n dimensional space. A coding dimension (CD) is defined as an nx1 vector that maximally separates the neural trajectories for different conditions47,48. For example, to estimate a stimulus CD in respond-to-touch blocks, we used trial-averaged trajectories for tactile and visual trials and calculated their difference at each time point . We then averaged vt during the analysis window (0 to 150 ms from stimulus onset) to obtain the stimulus CD.
The trial types used to calculate stimulus and choice CDs were:
To test if stimulus (choice) CDs changed with the task rules, we calculated the dot product between the stimulus (choice) CD in respond-to-touch blocks and the stimulus (choice) CD in respond-to-light blocks. The CDs here are unit vectors, so the magnitude of the dot product ranges from 0 (orthogonal) to 1 (aligned).
Stimulus sensitivity
For the cross-modal sensory selection task, the detection sensitivity for tactile stimuli was calculated as the difference of tactile hit rate and visual false alarm rate during the respond-to-touch blocks. The tactile hit rate was the probability of licking right in response to tactile stimuli, and the visual false alarm rate was the probability of licking right in response to visual stimuli. Correspondingly, the detection sensitivity for visual stimuli was determined by the difference of visual hit rate and tactile false alarm rate during the respond-to-light blocks. The visual hit rate was the probability of licking left in response to visual stimuli, and the tactile false alarm rate was the probability of licking left in response to tactile stimuli.
For the tactile detection task, the detection sensitivity for tactile stimuli was calculated as the difference of hit rate and false alarm rate. The hit rate was the probability of licking in the stimulus trials (Go trials), and the false alarm rate was the probability of licking in the no-stimulus trials (NoGo trials).
Statistics
We report data as mean ± standard error of the mean (s.e.m.) except where noted. Statistical tests were two-tailed unless otherwise noted. We made the Bonferroni correction for multiple comparisons across neurons in each cortical area (Fig. 1h and 2c-e).
We calculated confidence intervals using a nonparametric hierarchical bootstrap method90 to simulate the data generation process and to incorporate variability at different levels including mice, sessions, neurons, and trial types (1000 iterations). For population decoding analysis (Fig. 3-5 and Fig. S3-5), statistical tests were performed across sessions (e.g., a mean classification accuracy for task rules across sessions). For behavioral analysis during optogenetic inhibition (Fig. 6 and Fig. S6,7), statistical tests were performed across mice (e.g., a mean tactile sensitivity across mice).
Data and code availability
Data and MATLAB scripts used to analyze the data are available from the corresponding author upon request.
ACKNOWLEDGEMENTS
We thank William Olson for comments on the manuscript; Emily Lubin and Jae Hun Lee for technical assistance; and Genki Minamisawa for technical advice. This work was supported by a Government Scholarship to Study Abroad from the Ministry of Education of Taiwan to Y.-T. C., Seed Grant S-2021-GR-045 from The Kavli Foundation to D.H.O., and NIH grants R01NS089652 and 1R01NS104834-01 to D.H.O.
Footnotes
This version of the manuscript has been revised to incorporate feedback from peer reviewers.
REFERENCES
- 1.↵
- 2.↵
- 3.↵
- 4.↵
- 5.↵
- 6.↵
- 7.↵
- 8.↵
- 9.
- 10.
- 11.↵
- 12.↵
- 13.↵
- 14.↵
- 15.↵
- 16.↵
- 17.↵
- 18.↵
- 19.↵
- 20.↵
- 21.↵
- 22.↵
- 23.↵
- 24.↵
- 25.↵
- 26.
- 27.↵
- 28.↵
- 29.↵
- 30.↵
- 31.↵
- 32.↵
- 33.↵
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.↵
- 39.↵
- 40.↵
- 41.↵
- 42.↵
- 43.↵
- 44.↵
- 45.↵
- 46.↵
- 47.↵
- 48.↵
- 49.↵
- 50.↵
- 51.↵
- 52.
- 53.↵
- 54.↵
- 55.
- 56.
- 57.
- 58.↵
- 59.↵
- 60.
- 61.
- 62.↵
- 63.↵
- 64.↵
- 65.↵
- 66.↵
- 67.↵
- 68.
- 69.↵
- 70.↵
- 71.↵
- 72.↵
- 73.↵
- 74.↵
- 75.↵
- 76.↵
- 77.
- 78.↵
- 79.↵
- 80.↵
- 81.↵
- 82.↵
- 83.↵
- 84.↵
- 85.↵
- 86.↵
- 87.↵
- 88.↵
- 89.↵
- 90.↵