Neural representation of behavioral outcomes in the orbitofrontal cortex

https://doi.org/10.1016/j.conb.2009.03.010Get rights and content

The orbitofrontal cortex (OFC) is important in processing rewards and other behavioral outcomes. Here, we review from a computational perspective recent progress in understanding this complex function. OFC neurons appear to represent abstract outcome values, which may facilitate the comparison of options, as well as concrete outcome attributes, such as flavor or location, which may enable predictive cues to access current outcome values in the face of dynamic modulation by internal state, context and learning. OFC can use reinforcement learning to generate outcome predictions; it can also generate outcome predictions using other mechanisms, including the evaluation of decision confidence or uncertainty. OFC neurons encode not only the mean expected outcome but also the variance, consistent with the idea that OFC uses a probabilistic population code to represent outcomes. We suggest that further attention to the nature of its representations and algorithms will be critical to further elucidating OFC function.

Introduction

The orbitofrontal cortex (OFC) was initially characterized as an area whose destruction profoundly impacted human personality, but, paradoxically, left no obvious deficits in standard cognitive tests (reviewed in [1]). Yet, through intensifying scrutiny over the last decade the function of the OFC has arisen from obscurity to take a central place in our understanding of learning and decision-making [2, 3]. Today, through a remarkable convergence of studies conducted in species ranging from rats to humans, OFC is widely conceived as a place where the ‘value’ of things is represented in the brain.

While the concept of ‘value’ may strike a hard-nosed neuroscientist as hopelessly fuzzy, this concept plays a central role in most behavioral theories of decision-making. In neuroeconomic theory, assignment of economic value allows qualitatively different goods to be compared in a single ‘universal currency’ [4]. In animal learning theory, the similar concept of ‘incentive value’ measures the ability of outcomes to motivate behavior [5, 6]. In machine learning theory, ‘state values’ and ‘action values’ are the principal targets of learning and action selection; by maximizing these values, agents learn optimal behavior [7]. By offering formal (i.e. quantitative) definitions of value and related concepts, these theoretical frameworks can help one to test and eventually to understand more precisely what the OFC does. That is because formal definitions can yield concrete predictions that are testable using traditional neurophysiological and behavioral measurements without resorting to semantic arguments about abstract terms [8].

While theoretical perspectives are helpful, they also bring on more work. In the light of theory, questions about OFC function become not only more clear but also more detailed and nuanced, opening up and demanding further experimental tests. Moreover, different theoretical frameworks present partially overlapping, sometimes incongruent, views that must eventually be reconciled. Finally, applying theories of behavior to the brain requires one to bridge the gap between the functional level that forms the basis for the theory and the level of neurophysiology. As famously framed by David Marr [9], two key pieces are needed to bridge between behavioral (computational) and neural (implementational) levels: first, understanding the nature of the neural code or representation; second, understanding the processes or algorithms used to create and utilize these representations.

This review will examine recent progress in OFC function in light of economic, psychological and computational theories of value. While we wholeheartedly acknowledge the convergence of many threads evidence, our main goal is to emphasize the ragged edges and emerging complexities. These become apparent especially when asking what exactly a neural representation within OFC might look like, and therefore our primary focus will be on recordings from individual OFC neurons in monkeys and rats, with secondary attention to lesions and neuroimaging studies. We will also review what we know about the origin of these representations, and touch upon the issue of how they are used.

Section snippets

Specific and abstract properties of neural representations in OFC

OFC was identified in monkeys as an area containing neurons that responded to ‘rewarding’ substances such as palatable foods but whose activity was not tied directly to their physical attributes: responses could be changed dramatically by associative learning and by the current hunger or satiety of the subject (reviewed by [10]). In this sense, OFC responses reflected something ‘subjective’ about the value of a reward.

A more precise operational definition of ‘subjective value’ can be phrased in

Dynamic updating of values: context, needs and learning

Central to the concept of values is that they can be dynamically modulated even when the objects of value themselves remain unchanged, and this is a property reflected in OFC. An important example of such dynamic modulation is how the value of a given option depends on the menu of alternatives, called the ‘reference frame’. A classic study showed that single neurons in monkey OFC change their response to a given reward depending on the relative value of an alternative reward [23]. This might

What are the components of value?

Value has a number of components and it is somewhat controversial at this moment which ones are represented together or separately in the OFC. First, values have a positive and negative component: value = benefit  cost. Human neuroimaging studies tend to indicate that rewards and losses/punishments are processed in distinct subregions of OFC, with lateral regions being more modulated by costs and medial regions by benefits [22, 30, 31]. However, the same neurons that respond to rewards can also

Computing outcome predictions using reinforcement learning

We have considered in some detail the properties of neural representations of outcomes or predicted outcomes in OFC. Along with this question of representation comes the question of how these representations, particularly the predictive ones, are generated. In particular, what algorithms can be used to obtain accurate predictions of outcomes? Reinforcement learning (RL) theory provides a normative framework for how to predict and obtain maximal values using a two-part procedure: first, learn

Computing outcome predictions using confidence estimates

The outcome value predictions considered in the reinforcement-learning framework above are generated by learning from experience. In principle, outcome predictions can be generated by other mechanisms. In many situations, a behavioral outcome depends on a decision that is subject to uncertainty arising from subjective limitations, such as imperfect perception or memory. In such a case, if the decision-maker can assess the quality of the internal representation on which a particular decision is

Conclusions

In this review we have emphasized recent progress and open questions in the function of the OFC from a computational perspective. Much evidence points to OFC as representing the ends or outcomes that motivate goal-directed behavior but much remains to be done to flesh out how these highly abstract entities are represented and computed at the level of individual neurons. We suggest that thinking more about how OFC represents information and the algorithms with which it generates and manipulates

References and recommended reading

Papers of particular interest, published within the period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

Acknowledgements

Supported by Champalimaud Foundation (ZFM) and the Whitehall Foundation (AK). We wish to thank G Schoenbaum, J Paton and members of our laboratories for helpful comments on the manuscript.

Reference (58)

  • M.C. Dorris et al.

    Activity in posterior parietal cortex is correlated with the relative subjective desirability of action

    Neuron

    (2004)
  • D.J. Barraclough et al.

    Prefrontal cortex and decision making in a mixed-strategy game

    Nat Neurosci

    (2004)
  • S.J. Ramus et al.

    Interactions between the orbitofrontal cortex and the hippocampal memory system during the storage of long-term memory

    Ann N Y Acad Sci

    (2007)
  • R.J. Dolan

    Keynote address: revaluing the orbital prefrontal cortex

    Ann N Y Acad Sci

    (2007)
  • E.A. Murray et al.

    What we know and do not know about the functions of the orbitofrontal cortex after 20 years of cross-species studies

    J Neurosci

    (2007)
  • G. Schoenbaum et al.

    Reconciling the roles of orbitofrontal cortex in reversal learning and the encoding of outcome expectancies

    Ann N Y Acad Sci

    (2007)
  • B.W. Balleine et al.

    The role of incentive learning in instrumental outcome revaluation by sensory-specific satiety

    Anim Learn Behav

    (1998)
  • R.A. Rescorla

    Instrumental responses become associated with reinforcers that differ in one feature

    Anim Learn Behav

    (1990)
  • R.S. Sutton et al.

    Reinforcement Learning an Introduction, in Adaptive Computation and Machine Learning

    (1998)
  • L.P. Sugrue et al.

    Choosing the greater of two goods: neural currencies for valuation and decision making

    Nat Rev Neurosci

    (2005)
  • D. Marr

    Vision: A Computational Investigation into the Human Representation and Processing of Visual Information

    (1982)
  • A. Tversky et al.

    The framing of decisions and the psychology of choice

    Science

    (1981)
  • C. Padoa-Schioppa et al.

    Neurons in the orbitofrontal cortex encode economic value

    Nature

    (2006)
  • K.A. Burke et al.

    The role of the orbitofrontal cortex in the pursuit of happiness and more specific rewards

    Nature

    (2008)
  • T. Hosokawa

    Neurons in the macaque orbitofrontal cortex code relative preference of both rewarding and aversive outcomes

    Neurosci Res

    (2007)
  • R. Kiani

    Object category structure in response patterns of neuronal population in monkey inferior temporal cortex

    J Neurophysiol

    (2007)
  • C.E. Feierstein

    Representation of spatial goals in rat orbitofrontal cortex

    Neuron

    (2006)
  • M.R. Roesch et al.

    Encoding of time-discounted rewards in orbitofrontal cortex is independent of value representation

    Neuron

    (2006)
  • H.L. Dean et al.

    Allocentric spatial referencing of neuronal activity in macaque posterior cingulate cortex

    J Neurosci

    (2006)
  • Cited by (56)

    • Selective impact of lateral orbitofrontal cortex inactivation on reinstatement of alcohol seeking in male Long-Evans rats

      2020, Neuropharmacology
      Citation Excerpt :

      However, our results are in alignment with the role of OFC in motivation for alcohol. Previous studies have demonstrated a role for OFC in encoding outcome value and signaling motivation (Balleine et al., 2011; Dalley et al., 2004; Kringelbach, 2005; Mainen and Kepecs, 2009; Moorman and Aston-Jones, 2014; Noonan et al., 2012; O'Doherty, 2007; Padoa-Schioppa, 2011; Rolls and Grabenhorst, 2008; Rudebeck and Murray, 2014; Schoenbaum et al., 2009; Schoenbaum et al., 2011; van Duuren et al., 2009; van Wingerden et al., 2010; Wallis, 2011; Walton et al., 2011). In particular, lOFC inactivation has been shown to reduce cue-and context-induced reinstatement of cocaine seeking (Arguello et al., 2017; Fuchs et al., 2004; Lasseter et al., 2009, 2011).

    • The role of the orbitofrontal cortex in alcohol use, abuse, and dependence

      2018, Progress in Neuro-Psychopharmacology and Biological Psychiatry
      Citation Excerpt :

      Since the focus of this review is primarily on the OFC in the context of alcohol use and dependence, I will not present a comprehensive discussion of the OFC more generally. I refer interested readers to some of the many excellent reviews on OFC structure and function (Balleine et al., 2011; Dalley et al., 2004; Izquierdo, 2017; Kringelbach, 2005; Kringelbach and Rolls, 2004; Mainen and Kepecs, 2009; McDannald et al., 2014b; Noonan et al., 2012; O'Doherty, 2007; Ongur and Price, 2000; Padoa-Schioppa, 2011; Price, 2007; Rolls and Deco, 2016; Rolls and Grabenhorst, 2008; Rudebeck and Murray, 2014; Schoenbaum et al., 2009; Schoenbaum et al., 2011; Stalnaker et al., 2015; Wallis, 2011; Walton et al., 2011). However, before focusing specifically on the OFC and alcohol, it is worth considering some of the modern conceptualizations of what the OFC is and what role it plays in motivated, goal-directed behavior outside the context of drugs or alcohol.

    View all citing articles on Scopus
    View full text