Elsevier

Brain Research Reviews

Volume 58, Issue 2, August 2008, Pages 322-339
Brain Research Reviews

Review
What is reinforced by phasic dopamine signals?

https://doi.org/10.1016/j.brainresrev.2007.10.007Get rights and content

Abstract

The basal ganglia have been associated with processes of reinforcement learning. A strong line of supporting evidence comes from the recording of dopamine (DA) neurones in behaving monkeys. Unpredicted, biologically salient events, including rewards cause a stereotypic short-latency (70–100 ms), short-duration (100–200 ms) burst of DA activity — the phasic response. This response is widely considered to represent reward prediction errors used as teaching signals in appetitive learning to promote actions that will maximise future reward acquisition. For DA signalling to perform this function, sensory processing afferent to DA neurones should discriminate unpredicted reward-related events. However, the comparative response latencies of DA neurones and orienting gaze-shifts indicate that phasic DA responses are triggered by pre-attentive sensory processing. Consequently, in circumstances where biologically salient events are both spatially and temporally unpredictable, it is unlikely their identity will be known at the time of DA signalling. The limited quality of afferent sensory processing and the precise timing of phasic DA signals, suggests that they may play a less direct role in ‘Law of Effect’ appetitive learning. Rather, the ‘time-stamp’ nature of the phasic response, in conjunction with the other signals likely to be present in the basal ganglia at the time of phasic DA input, suggests it may reinforce the discovery of unpredicted sensory events for which the organism is responsible. Furthermore, DA-promoted repetition of preceding actions/movements should enable the system to converge on those aspects of context and behavioural output that lead to the discovery of novel actions.

Introduction

When trying to discover the role of a specific component in a complicated system it is helpful to have some understanding of the system's overall function. The nigro-striatal and mesolimbic dopamine (DA) projections are important components of the basal ganglia, which are one of the brain's fundamental processing units. Thus, in this article we will start by considering briefly some prevalent ideas on basal ganglia involvement in action selection and reinforcement learning. Within this context we will proceed to evaluate the specific contribution of the phasic response of DA neurones to the processes of reinforcement learning. Although it is clear that DA projections ascending from the ventral midbrain target numerous structures, including intrinsic nuclei of the basal ganglia, frontal cortex, amygdala, hippocampus, septal area, several thalamic nuclei, and the habenula (Lindvall and Bjorklund, 1974), for the purpose of the present article we will restrict ourselves to discussion of how striatal function may be influenced by phasic DA input. Hopefully, a better understanding of this comparatively well characterised system will provide important clues concerning the role of phasic DA input to other target structures. This paper is an extension of ideas expressed in our recent perspectives article (Redgrave and Gurney, 2006).

Section snippets

Basal ganglia functions

In humans, basal ganglia dysfunction has been associated with numerous debilitating conditions including Parkinson's disease, Huntington's disease, Tourette's syndrome, schizophrenia, attention-deficit disorder, obsessive–compulsive disorder (Crossman, 2000, Fuxe et al., 2006, Ring and SerraMestres, 2002), and many of the addictions (Everitt and Robbins, 2005). However, as with investigating the roles of specific components, to understand and correctly interpret such malfunctions it would be

The source of short-latency visual input to the ventral midbrain

Since most experiments analysing the sensory properties of DA neurones have used visual stimuli (Morris et al., 2004, Morris et al., 2006, Ravel and Richmond, 2006, Satoh et al., 2003, Schultz, 1998, Schultz, 2006, Takikawa et al., 2004), we will concentrate on the potential sources of visual afferents to DA containing regions of the ventral midbrain.

In recent reviews of cortical visual processing (Rousselet et al., 2004, Thorpe and Fabre-Thorpe, 2001) Thorpe et al. indicated that signals

Visual perception in the dorsal midbrain

Electrophysiological responses of neurones in the mammalian superior colliculus are characterised by an exquisite sensitivity to spatially localised luminance changes in the retina (Grantyn, 1988, Sparks, 1986, Stein and Meredith, 1993, Wurtz and Albano, 1980). Such changes typically signify the appearance, disappearance or movement of something in the visual field. While there may be cell-types that can perform broad species specific stimulus classifications (de Gelder and Hadjikhani, 2006,

Visual perception in the ventral midbrain

Given that the superior colliculus is specialised to inform its efferent targets where something has occurred in the visual field, not what has occurred, it would be appropriate to pause and consider how DA neurones seem able to perform the detailed visual processing required to discriminate the complex conditioned visual stimuli that have been used to signal different reward magnitudes and probabilities (Fiorillo et al., 2003, Tobler et al., 2003, Tobler et al., 2005). Careful reading of

Agency determination and discovery of novel actions

The following discussion will extend our recent proposal (Redgrave and Gurney, 2006) that sensory driven phasic DA signals reinforce the discovery of events for which the animal, or more generally the agent, is responsible. Reinforcement of relevant signals within the presumed selection architecture of the basal ganglia (Fig. 2B) could enable the system to converge on aspects of context and behavioural output responsible for eliciting initially unpredicted sensory events.

The fog lifts a little?

Having identified a specific reinforcing role for DA in a connectional architecture whose function is to determine sources of agency and discover novel actions, we will now consider several hitherto puzzling issues.

Summary and conclusions

This paper is an elaboration of our recent proposal that sensory driven DA responses provide reinforcement signals required for the brain to discriminate the sensory events for which it is responsible (Redgrave and Gurney, 2006). As part of this process new responses required in specific circumstances to make events happen are discovered. A critical feature is that the proposed architecture requires a minimum of sensory processing to generate the necessary reinforcing signals. All the system

Acknowledgments

The authors would like to acknowledge the contributions of Paul Overton, Veronique Coizet, John McHaffie and Terry Stanford throughout the progressive evolution of these ideas. This review was written while the authors were in receipt of research funding from The Wellcome Trust (PR), BBSRC (PR), EPSRC (KG, PR), and the Marsden Fund of the Royal Society of New Zealand (JR, PR).

References (165)

  • DayanP. et al.

    Reward, motivation, and reinforcement learning

    Neuron

    (2002)
  • DeanP. et al.

    Event or emergency? Two response systems in the mammalian superior colliculus

    Trends Neurosci.

    (1989)
  • DeanerR.O. et al.

    Monkeys pay per view: adaptive valuation of social images by rhesus macaques

    Curr. Biol.

    (2005)
  • FreemanA.S.

    Firing properties of substantia nigra dopaminergic neurons in freely moving rats

    Life Sci.

    (1985)
  • FudgeJ.L. et al.

    Amygdaloid projections to ventromedial striatal subterritories in the primate

    Neuroscience

    (2002)
  • GraceA.A.

    The tonic phasic model of dopamine system regulation—its relevance for understanding how stimulant abuse can alter basal ganglia function

    Drug Alcohol Depend.

    (1995)
  • GreengardP. et al.

    Beyond the dopamine receptor: the DARPP-32/protein phosphatase-1 cascade

    Neuron

    (1999)
  • GrillnerS. et al.

    Mechanisms for selection of basic motor programs — roles for the striatum and pallidum

    Trends Neurosci.

    (2005)
  • GuarraciF.A. et al.

    An electrophysiological characterization of ventral tegmental area dopaminergic neurons during differential Pavlovian fear conditioning in the awake rabbit

    Behav. Brain Res.

    (1999)
  • HikosakaO.

    GABAergic output of the basal ganglia

    Prog. Brain Res.

    (2007)
  • HorvitzJ.C. et al.

    Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat

    Brain Res.

    (1997)
  • HylandB.I. et al.

    Firing modes of midbrain dopamine cells in the freely moving rat

    Neuroscience

    (2002)
  • IkedaT. et al.

    Reward-dependent gain and bias of visual responses in primate superior colliculus

    Neuron

    (2003)
  • KakadeS. et al.

    Dopamine: generalization and bonuses

    Neural. Netw.

    (2002)
  • KatsutaH. et al.

    Release from GABA(A) receptor-mediated inhibition unmasks interlaminar connection within superior colliculus in anesthetized adult rats

    Neurosci. Res.

    (2003)
  • KlopE.M. et al.

    In cat four times as many lamina I neurons project to the parabrachial nuclei and twice as many to the periaqueductal gray as to the thalamus

    Neuroscience

    (2005)
  • KobayashiS. et al.

    Influences of rewarding and aversive outcomes on activity in macaque lateral prefrontal cortex

    Neuron

    (2006)
  • LevesqueM. et al.

    Corticostriatal projections from layer V cells in rat are collaterals of long-range corticofugal axons

    Brain Res.

    (1996)
  • LismanJ.E. et al.

    The hippocampal-VTA loop: controlling the entry of information into long-term memory

    Neuron

    (2005)
  • MaedaH. et al.

    Effects of peripheral stimulation on the activity of neurons in the ventral tegmental area, substantia nigra and midbrain reticular formation of rats

    Brain Res. Bull.

    (1982)
  • MahonS. et al.

    Corticostriatal plasticity: life after the depression

    Trends Neurosci.

    (2004)
  • MarinO. et al.

    Evolution of the basal ganglia in tetrapods: a new perspective based on recent studies in amphibians

    Trends Neurosci.

    (1998)
  • McHaffieJ.G. et al.

    Subcortical loops through the basal ganglia

    Trends Neurosci.

    (2005)
  • McHaffieJ.G. et al.

    A direct projection from superior colliculus to substantia nigra pars compacta in the cat

    Neuroscience

    (2006)
  • MinkJ.W.

    The basal ganglia: focused selection and inhibition of competing motor programs

    Prog. Neurobiol.

    (1996)
  • MorrisG. et al.

    Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons

    Neuron

    (2004)
  • NakaharaH. et al.

    Dopamine neurons can represent context-dependent prediction error

    Neuron

    (2004)
  • O'DohertyJ.P. et al.

    Temporal difference models and reward-related learning in the human brain

    Neuron

    (2003)
  • OhmanA.

    The role of the amygdala in human fear: automatic detection of threat

    Psychoneuroendocrinology

    (2005)
  • OmelchenkoN. et al.

    Glutamate synaptic inputs to ventral tegmental area neurons in the rat derive primarily from subcortical sources

    Neuroscience

    (2007)
  • AlexanderG.E. et al.

    Parallel organization of functionally segregated circuits linking basal ganglia and cortex

    Ann. Rev. Neurosci.

    (1986)
  • ApicellaP. et al.

    Responses of tonically discharging neurons in the monkey striatum to primary rewards delivered during different behavioral states

    Exp. Brain Res.

    (1997)
  • BardoM.T. et al.

    Role of D1 and D2 receptors in novelty-maintained place preference

    Exp. Clin. Psychopharm.

    (1993)
  • BerridgeK.C.

    The debate over dopamine's role in reward: the case for incentive salience

    Psychopharmacol. (Berl.)

    (2007)
  • BerridgeK.C. et al.

    Sequential super-stereotypy of an instinctive fixed action pattern in hyper-dopaminergic mutant mice: a model of obsessive compulsive disorder and Tourette's

    BMC Biol.

    (2005)
  • BickfordM.E. et al.

    Collateral projections of predorsal bundle cells of the superior colliculus in the rat

    J. Comp. Neurol.

    (1989)
  • BlatterK. et al.

    Rewarding properties of visual stimuli

    Exp. Brain Res.

    (2006)
  • CalabresiP. et al.

    Dopamine and cAMP-regulated phosphoprotein 32 kDa controls both striatal long-term depression and long-term potentiation, opposing forms of synaptic plasticity

    J. Neurosci.

    (2000)
  • CentonzeD. et al.

    Dopaminergic control of synaptic plasticity in the dorsal striatum

    Eur. J. Neurosci.

    (2001)
  • ChevalierG. et al.

    Disinhibition as a basic process in the expression of striatal functions

    Trends Neurosci.

    (1990)
  • Cited by (0)

    View full text