Inferring action-dependent outcome representations depends on anterior but not posterior medial orbitofrontal cortex

doi:10.1016/j.nlm.2018.09.008

Neurobiology of Learning and Memory

Volume 155, November 2018, Pages 463-473

https://doi.org/10.1016/j.nlm.2018.09.008 Get rights and content

Highlights

•
The medial orbitofrontal cortex (mOFC) of the rat is functionally heterogeneous.
•
Anterior vs. posterior mOFC has stronger connections with the accumbens core.
•
Anterior vs. posterior mOFC is critical for inferring unobservable action outcomes.
•
Anterior vs. posterior mOFC is more directly involved in goal-directed action.

Abstract

Although studies examining orbitofrontal cortex (OFC) often treat it as though it were functionally homogeneous, recent evidence has questioned this assumption. Not only are the various subregions of OFC (lateral, ventral, and medial) hetereogeneous, but there is further evidence of heterogeneity within those subregions. For example, several studies in both humans and monkeys have revealed a functional subdivision along the anterior-posterior gradient of the medial OFC (mOFC). Given our previous findings suggesting that, in rats, the mOFC is responsible for inferring the likelihood of unobservable action outcomes (Bradfield, Dezfouli, van Holstein, Chieng, & Balleine, 2015), and given the anterior nature of the placements of our prior manipulations, we decided to assess whether the rat mOFC also differs in connection and function along its anteroposterior axis. We first used retrograde tracing to compare the density of efferents from mOFC to several structures known to contribute to goal-directed action: the mediodorsal thalamus, basolateral amygdala, posterior dorsomedial striatum, nucleus accumbens core and ventral tegmental area. We then compared the functional effects of anterior versus posterior mOFC excitotoxic lesions on tests of Pavlovian-instrumental transfer, instrumental outcome devaluation and outcome-specific reinstatement. We found evidence that the anterior mOFC had greater connectivity with the accumbens core and greater functional involvement in goal-directed action than the posterior mOFC. Consistent with previous findings across species, therefore, these results suggest that the anterior and posterior mOFC of the rat are indeed functionally distinct, and that it is the anterior mOFC that is particularly critical for inferring unobservable action outcomes.

Introduction

The OFC has been argued to mediate abroad array of cognitive and behavioural functions in learning and decision-making, not all of which can be easily reconciled. This is due, in part, to the fact that the vast majority of studies examining the OFC have lacked specificity in detailing the particular subregion targeted (i.e. medial, ventral, or lateral OFC); something that is particularly true of studies into rodent OFC. One example is the well-established finding that OFC damage causes impairments in reversal learning (e.g. Izquierdo et al., 2004, Rolls et al., 1994, Schoenbaum et al., 2002, Schoenbaum et al., 2003). This effect was subsequently shown to be specific to lesions of the lateral portion of the OFC (lOFC), with medial OFC damage actually resulting in a facilitation rather than a deficit in reversal learning (Mar, Walker, Theobald, Eagle, & Robbins, 2011). Likewise, there have been several demonstrations of lOFC inactivation leaving instrumental outcome devaluation intact (Ostlund and Balleine, 2007, Parkes et al., 2017), whereas mOFC inactivation has been found to impair it (Bradfield et al., 2015). It is possible, therefore, that a number of seemingly inconsistent findings are in fact a result of functional heterogeneity across the OFC regions being manipulated. A recent review (Izquierdo, 2017) has added weight to this suggestion by detailing, with unprecedented specificity, the neuroanatomical placements described in studies of rodent OFC and how each subregion might be linked to its specific functions.

The impairment in outcome devaluation we observed as a result of mOFC inactivation was a part of a larger investigation into the function of the mOFC more generally (Bradfield et al., 2015). Specifically, we inactivated mOFC using both excitotoxic lesions and inhibitory (hM4Di) Designer Receptors Exclusively Activated by Designer Drugs (DREADDs) in an instrumental choice situation where food outcomes (pellets or sucrose) were either observable or unobservable. Inactivating the mOFC selectively impaired performance during tasks in which outcomes were unobservable, i.e., in which they had to be recalled from memory, including outcome devaluation and specific-Pavlovian-instrumental-transfer (specific PIT). In contrast, performance on tasks in which the outcomes were presented and so observable, i.e., reinforced devaluation, outcome-selective reinstatement, and instrumental contingency degradation tests, was intact. Together, these results suggest that the mOFC is critical for inferring the occurrence of outcomes when they are unobservable as opposed to when they need merely to be recognised in the environment.

Beyond localising our placements in the medial subregion of OFC, however, we did not explore any differences of function along its anterior-posterior gradient. This could be significant because there are several lines of evidence suggesting that a further anterior-posterior distinction might exist. First, although we did not intentionally target the anterior mOFC, the placements in our 2015 study did tend to omit its posterior regions in an attempt to avoid overlap with prelimbic cortex. By contrast, in a recent study Munster and Hauber (2017) used more posterior (and from our inspection of their lesion image (Fig. 2), more dorsal) mOFC lesion placements, and were unable to replicate the impairments we observed in outcome devaluation and specific PIT. Second, in her review spanning various rodent studies, Izquierdo (2017) suggested that the anterior and posterior regions of mOFC might be functionally distinct, proposing that unobservable outcome retrieval is restricted to anterior mOFC, whereas delay discounting involves the more posterior regions (Izquierdo, 2017; Fig. 3). Finally, a recent article has shown that the anterior and posterior sections of lateral OFC play functionally distinct roles in aspects of decision-making (Panayi & Killcross, 2018), highlighting the possibility of a similar distinction in medial OFC. Indeed, in other species and in humans especially, several studies have suggested the existence of functional distinctions between anterior and posterior OFC (e.g. Kringelbach and Rolls, 2004, Mansouri et al., 2017, Smith et al., 2010). One particularly interesting finding from a meta-analysis of human neuroimaging studies suggests that activity in anterior but not posterior OFC correlates with representations of more complex or abstract reinforcers (Kringelbach & Rolls, 2004). Putting the question of homologies aside for a moment, this function appears to align closely with our proposal that the mOFC is necessary to infer action outcomes from memory.

Based on these findings, therefore, it might be reasonable to expect that the anterior and posterior regions of rodent mOFC carry out functionally distinct roles within dissociable neural circuits. This was the hypothesis investigated in the current study. First, we explored whether there were any observable differences in the density of output pathways from anterior versus posterior mOFC by placing retrograde tracers into the basolateral amygdala (BLA), posterior dorsomedial striatum (pDMS), nucleus accumbens core (NAc core), the mediodorsal thalamus (MD), and ventral tegmental area (VTA) and then quantifying the number of retrogradely labelled neurons in each region of the mOFC. We chose these structures because they have all been reported to receive some degree of input from mOFC (Hoover & Vertes, 2011), and they have all been previously identified as critical for various aspects of goal-directed action (see Hart, Leung, & Balleine, 2014). We next compared the performance of rats with specific excitotoxic lesions of either anterior or posterior mOFC on instrumental tasks in which action outcomes are absent on test, including specific PIT and instrumental outcome devaluation, and a further task for which outcomes are present on test: outcome-selective reinstatement. We predicted that rats with anterior but not posterior mOFC lesions would display deficits in both specific PIT and outcome devaluation, as these tasks require rats to infer absent outcomes, which we hypothesise relies on the anterior mOFC. We further predicted that all rats would demonstrate intact performance on a test of outcome-selective reinstatement, as the outcomes are presented during this test, can be directly recognised and, therefore, do not need to be inferred.

Section snippets

Material and methods

Our first aim was to establish whether the output pathways of anterior vs. posterior mOFC to NAc core, pDMS, BLA, MD and VTA differed in density. Rats received unilateral injections of the retrograde tracer flurogold (FG) into the pDMS, NAc core, or VTA plus an injection of the retrograde tracer cholera toxin B (CTb) into the NAc core, BLA, MD or VTA. Ten days after surgery, rats were perfused and brains were processed for immunofluorescence identification of retrogradely labelled neurons in

Experiment 1. Comparison of afferents from anterior vs. posterior mOFC to pDMS, NAc core, BLA and MD using retrograde tracing

Of the 46 retrograde injections in 23 rats that were conducted, 16 injections were excluded from the analysis due to misplacement or spread of the tracer beyond the boundaries of the target structure. This left 28 injections in 20 rats for the subsequent analysis; 6 in the NAc core, 6 in the pDMS, 6 in the BLA, 4 in the MD and 6 in the VTA. The results of tracing from these injections are shown in Fig. 1. The top row (Fig. 1A–E) shows examples of retrograde labelling in the anterior mOFC

Discussion

Taken together, the current findings demonstrate that the anterior and posterior subregions of the rodent mOFC can be dissociated both with regard to the density of their projections to specific target regions and with regard to their functions. First, we used retrograde tracing to assess the density of projections from anterior and posterior mOFC to the BLA, pDMS, NAc core, MD and VTA and found that, whereas projections to DMS, MD and VTA were relatively similar across anterior and posterior

Conclusions

Together with the findings of our previous study (Bradfield et al., 2015) and those of Munster and Hauber (2017), the current findings suggest that it is the anterior portion of the mOFC that is critical for animals to infer action-dependent outcomes when they are unobservable, whereas the posterior mOFC subserves a distinct function, perhaps related to response effort. Overall, these findings add to a growing trend within the literature of producing more specificity and consistency with

Conflict of interest

The authors declare no conflicts of interest.

Acknowledgements

The research reported in the manuscript was supported by grants to BWB and LAB from the National Health and Medical Research Council (NHMRC);Project Grant 1087689 and Project Grant 1148244. BWB is supported by a Senior Principal Research Fellowship from the NHMRC of Australia, Research Fellowship 1079561.

References (34)

B.W. Balleine et al.
Goal-directed instrumetnal action, contingency and incentive learning and their cortical substrates
Neuropharmacology
(1998)
L.A. Bradfield et al.
Medial orbitofrontal cortex mediates outcome retrieval in partially observable task situations
Neuron
(2015)
L.H. Corbit et al.
The role of the prelimbic cortex in instrumental conditioning
Behavioural Brain Research
(2003)
A.M. Graybiel et al.
Toward a neurobiology of obsessive-compulsive disorder
Neuron
(2000)
G. Hart et al.
Dorsal and ventral streams: The distinct role of striatal subregions int eh acquisition and performance of goal-directed actions
Neurobiology of Learning and Memory
(2014)
S.R. Heilbronner et al.
Circuit-based corticostriatal homologies between rat and primate
Biological Psychiatry
(2016)
M.L. Kringelbach et al.
The functional neuroanatomy of the human orbitofrontal cortex: Evidence from neuroimaging and neuropsychology
Progress in Neurobiology
(2004)
L.A. Bradfield et al.
Obsessive-compulsive disorder as a failure to integrate goal-directed and habitual action control
L.H. Corbit et al.
The role of the nucleus accumbens in instrumental conditioning: Evidence of a functional dissociation between accumbens core and shell
Journal of Neuroscience
(2001)
P.L.A. Gabbott et al.
Prefrontal cortex in the rat: Projections to subcortical autonomic, motor, and limbic centers
The Journal of Comparative Neurology
(2005)

G. Hart et al.

Consolidation of goal-directed action depends on MAPK/ERK signaling in rodent prelimbic cortex

Journal of Neuroscience

(2016)

G. Hart et al.

Prefrontal cortico-striatal disconnection blocks the acquisition of goal-directed action

Journal of Neuroscience

(2018)

W.L. Hays

Statistics for the social sciences

(1973)

W.B. Hoover et al.

Projections of the medial orbital and ventral orbital cortex in the rat

Journal of Comparative Neurology

(2011)

A. Izquierdo

Functional heterogeneity within rat orbitofrontal cortex in reward learning and decision making

Journal of Neuroscience

(2017)

A.D. Izquierdo et al.

Bilateral orbital prefrotnal cortex lesions in rhesus monkeys disrupt choices guided by both reward value and reward contingency

Journal of Neuroscience

(2004)

T.V. Maia et al.

The neural bases of obsessive-compulsive disorder in children and adults

Development and Pscyhopathology

(2008)

Cited by (33)

Impairments in expression of devaluation in a Pavlovian goal-tracking task, but not a free operant devaluation task, after fentanyl exposure in female rats
2024, Behavioural Brain Research
In laboratory animals, there are numerous demonstrations that past exposure to drugs of abuse can lead to devaluation impairments weeks after the final drug exposure, with the majority of these demonstrations examining effects of exposure to psychostimulants. There has been minimal investigation into whether prior exposure to opiates can lead to devaluation impairments. Here, we first trained female rats that two separate cuelights predicted two different foods and measured Pavlovian goal-tracking responses (Experiment 1) or trained female rats to press two levers to earn two different foods and measured this operant response (Experiment 2). In both experiments, we subsequently gave the rats injections of fentanyl twice daily for 6 days, and then tested rats for conditioned responses after satiation on one of the foods 48-h after the final injection. We found that rats were impaired in the expression of devaluation in the Pavlovian task after fentanyl exposure, but were unimpaired in the expression of devaluation in the operant task. The pattern of results is most consistent with an impairment in lateral orbitofrontal cortex function, but additional research is needed to determine the neurobiological cause of this pattern of results.
Medial orbitofrontal neurotrophin systems integrate hippocampal input into outcome-specific value representations
2022, Cell Reports
In everyday life, we mentally represent possible consequences of our behaviors and integrate specific outcome values into existing knowledge to inform decisions. The medial orbitofrontal cortex (MO) is necessary to adapt behaviors when outcomes are not immediately available—when they and their values need to be envisioned. Nevertheless, neurobiological mechanisms remain unclear. We find that the neuroplasticity-associated neurotrophin receptor tropomyosin receptor kinase B (TrkB) is necessary for mice to integrate outcome-specific value information into choice behavior. This function appears attributable to memory updating (and not retrieval) and the stabilization of dendritic spines on excitatory MO neurons, which led us to investigate inputs to the MO. Ventral hippocampal (vHC)-to-MO projections appear conditionally necessary for value updating, involved in long-term aversion-based value memory updating. Furthermore, vHC-MO-mediated control of choice is TrkB dependent. Altogether, we reveal a vHC-MO connection by which specific value memories are updated, and we position TrkB within this functional circuit.
Outcome-selective reinstatement is predominantly context-independent, and associated with c-Fos activation in the posterior dorsomedial striatum
2022, Neurobiology of Learning and Memory
Citation Excerpt :
Finally, because little is known about the neural mechanisms underlying outcome-selective reinstatement, we aimed to identify its potential neural correlates by examining expression of the immediate early gene and activity marker c-Fos in various brain regions. One brain region that selective reinstatement has been demonstrated to depend upon is the posterior dorsomedial striatum (pDMS) (Yin et al., 2005) and one region it does not depend upon is the medial orbitofrontal cortex (mOFC) (Bradfield et al., 2015; Bradfield et al., 2018). Thus, we included both regions in our analysis with the expectation that c-Fos expression would reflect performance in the pDMS but not the mOFC.
Research from human and animal studies has found that after responding has been successfully reduced following treatment it can return upon exposure to certain contexts. An individual in recovery from alcohol use disorder, for example, might relapse to drinking upon visiting their favourite bar. However, most of these data have been derived from experiments involving a single (active) response, and the context-dependence of returned responding in situations involving choice between multiple actions and outcomes is less well-understood. We thus investigated how outcome-selective reinstatement – a procedure involving choice between two actions and outcomes – was affected by altering the physical context in rats. In Experiment 1, rats were trained over 6 days to press a left lever for one food outcome (pellets or sucrose) and a right lever for the other outcome. Then, rats received an extinction session in either the same context (A) as lever press training, or in a different context (B). Rats were tested immediately (5 min) after extinction in Context A or B such that there were four groups in total: AAA, ABB, ABA, and AAB. Reinstatement testing consisted of one food outcome being delivered ‘freely’ (i.e. unearned by lever pressing and unsignalled by cues) to the food magazine every 4 min in the following order: Sucrose, Pellet, Pellet, Sucrose. Selective reinstatement was considered intact if pellet delivery increased pressing selectively on the pellet lever, and sucrose delivery selectively increased pressing on the sucrose lever. This result (Reinstated > Nonreinstated) was observed for rats in group AAA and ABB, but not rats in groups ABA and AAB. Experiment 2 was conducted identically, except that rats received two extinction sessions over two days and tested one day later. This time, all groups demonstrated intact outcome-selective reinstatement regardless of context. Analysis of c-Fos expression in several brain regions revealed that only c-Fos expression in the posterior dorsomedial striatum (pDMS) was related to intact reinstatement performance. Overall, these results suggest that outcome-selective reinstatement is predominantly context-independent, and that intact reinstatement is related to neuronal activity in the pDMS.
Is the core function of orbitofrontal cortex to signal values or make predictions?
2021, Current Opinion in Behavioral Sciences
One dominant hypothesis about the function of the orbitofrontal cortex (OFC) is that the OFC signals the subjective values of possible outcomes to other brain areas for learning and decision making. This popular view generally neglects the fact that OFC is not necessary for simple value-based behavior (i.e. when values have been directly experienced). An alternative, emerging view suggests that OFC plays a more general role in representing structural information about the task or environment, derived from prior experience, and relevant to predicting behavioral outcomes, such as value. From this perspective, value signaling is simply one derivative of the core underlying function of OFC. New data in favor of both views have been accumulating rapidly. Here we review these new data in discussing the relative merits of these two ideas.
Inactivation of posterior but not anterior dorsomedial caudate-putamen impedes learning with self-administered nicotine stimulus in male rats
2021, Behavioural Brain Research
Citation Excerpt :
Using this stringent testing approach, we show that rats with lesions to the p-dmCPu had blunted nicotine-evoked goal-tracking in the form of dipper entry duration after the non-contingent nicotine infusion on Test 4 (Fig. 3D, black squares). These findings are consistent with the deficits seen in previous studies demonstrating that the inactivation of p-dmCPu impairs instrumental learning [43,46,55,56] and impairs the acquisition of learning with nicotine stimulus using the discriminated goal-tracking task [8]. Therefore, our findings further strengthen the understanding that p-dmCPu is central for a broad range of associative learning mechanisms that include learning involving pharmacological states and appetitive stimuli.
The rodent caudate-putamen is a large heterogeneous neural structure with distinct anatomical connections that differ in their control of learning processes. Previous research suggests that the anterior and posterior dorsomedial caudate-putamen (a- and p-dmCPu) differentially regulate associative learning with a non-contingent nicotine stimulus. The current study used bilateral NMDA-induced excitotoxic lesions to the a-dmCPu and p-dmCPu to determine the functional involvement of a-dmCPu and p-dmCPu in appetitive learning with contingent nicotine stimulus. Rats with a-dmCPu, p-dmCPu, or sham lesions were trained to lever-press for intravenous nicotine (0.03 mg/kg/inf) followed by access to sucrose 30 s later. After 1, 3, 9, and 20 nicotine-sucrose training sessions, appetitive learning in the form of a goal-tracking response was assessed using a non-contingent nicotine-alone test. All rats acquired nicotine self-administration and learned to retrieve sucrose from a receptacle at equal rates. However, rats with lesions to p-dmCPu demonstrated blunted learning of the nicotine-sucrose association. Our primary findings show that rats with lesions to p-dmCPu had a blunted goal-tracking response to a non-contingent nicotine administration after 20 consecutive days of nicotine-sucrose pairing. Our findings extend previous reports to a contingent model of nicotine self-administration and show that p-dmCPu is involved in associative learning with nicotine stimulus using a paradigm where rats voluntarily self-administer nicotine infusions that are paired with access to sucrose—a paradigm that closely resembles learning processes observed in humans.
Controlling one's world: Identification of sub-regions of primate PFC underlying goal-directed behavior
2021, Neuron
Citation Excerpt :
Areas 14 (rostral vmPFC/mOFC) and 14-25 (caudal vmPFC) were not specifically involved in the response to changes in A-O contingencies. The complete lack of effects of inactivation is consistent with rodent studies, in which lesions of a putative homolog of these regions, the anterior mOFC, impaired the effects of outcome devaluation but not contingency degradation (Bradfield et al., 2015, 2018). Although marmosets receiving overactivation of area 14 did not differentiate between degraded and non-degraded sessions, the finding that baseline responding was also affected prevents any firm conclusions concerning contingency degradation.
Impaired detection of causal relationships between actions and their outcomes can lead to maladaptive behavior. However, causal roles of specific prefrontal cortex (PFC) sub-regions and the caudate nucleus in mediating such relationships in primates are unclear. We inactivated and overactivated five PFC sub-regions, reversibly and pharmacologically: areas 24 (perigenual anterior cingulate cortex), 32 (medial PFC), 11 (anterior orbitofrontal cortex, OFC), 14 (rostral ventromedial PFC/medial OFC), and 14-25 (caudal ventromedial PFC) and the anteromedial caudate to examine their role in expressing learned action-outcome contingencies using a contingency degradation paradigm in marmoset monkeys. Area 24 or caudate inactivation impaired the response to contingency change, while area 11 inactivation enhanced it, and inactivation of areas 14, 32, or 14-25 had no effect. Overactivation of areas 11 and 24 impaired this response. These findings demonstrate the distinct roles of PFC sub-regions in goal-directed behavior and illuminate the candidate neurobehavioral substrates of psychiatric disorders, including obsessive-compulsive disorder.

View all citing articles on Scopus

View full text

Inferring action-dependent outcome representations depends on anterior but not posterior medial orbitofrontal cortex

Highlights

Abstract

Introduction

Section snippets

Material and methods

Experiment 1. Comparison of afferents from anterior vs. posterior mOFC to pDMS, NAc core, BLA and MD using retrograde tracing

Discussion

Conclusions

Conflict of interest

Acknowledgements

Neuropharmacology

Neuron

Behavioural Brain Research

Neuron

Neurobiology of Learning and Memory

Biological Psychiatry

Progress in Neurobiology

Obsessive-compulsive disorder as a failure to integrate goal-directed and habitual action control

The role of the nucleus accumbens in instrumental conditioning: Evidence of a functional dissociation between accumbens core and shell

Journal of Neuroscience

Prefrontal cortex in the rat: Projections to subcortical autonomic, motor, and limbic centers

The Journal of Comparative Neurology

Consolidation of goal-directed action depends on MAPK/ERK signaling in rodent prelimbic cortex

Journal of Neuroscience

Prefrontal cortico-striatal disconnection blocks the acquisition of goal-directed action

Journal of Neuroscience

Statistics for the social sciences

Projections of the medial orbital and ventral orbital cortex in the rat

Journal of Comparative Neurology

Functional heterogeneity within rat orbitofrontal cortex in reward learning and decision making

Journal of Neuroscience

Bilateral orbital prefrotnal cortex lesions in rhesus monkeys disrupt choices guided by both reward value and reward contingency

Journal of Neuroscience

The neural bases of obsessive-compulsive disorder in children and adults

Development and Pscyhopathology