What is reinforced by phasic dopamine signals?

doi:10.1016/j.brainresrev.2007.10.007

Brain Research Reviews

Volume 58, Issue 2, August 2008, Pages 322-339

https://doi.org/10.1016/j.brainresrev.2007.10.007 Get rights and content

Abstract

The basal ganglia have been associated with processes of reinforcement learning. A strong line of supporting evidence comes from the recording of dopamine (DA) neurones in behaving monkeys. Unpredicted, biologically salient events, including rewards cause a stereotypic short-latency (70–100 ms), short-duration (100–200 ms) burst of DA activity — the phasic response. This response is widely considered to represent reward prediction errors used as teaching signals in appetitive learning to promote actions that will maximise future reward acquisition. For DA signalling to perform this function, sensory processing afferent to DA neurones should discriminate unpredicted reward-related events. However, the comparative response latencies of DA neurones and orienting gaze-shifts indicate that phasic DA responses are triggered by pre-attentive sensory processing. Consequently, in circumstances where biologically salient events are both spatially and temporally unpredictable, it is unlikely their identity will be known at the time of DA signalling. The limited quality of afferent sensory processing and the precise timing of phasic DA signals, suggests that they may play a less direct role in ‘Law of Effect’ appetitive learning. Rather, the ‘time-stamp’ nature of the phasic response, in conjunction with the other signals likely to be present in the basal ganglia at the time of phasic DA input, suggests it may reinforce the discovery of unpredicted sensory events for which the organism is responsible. Furthermore, DA-promoted repetition of preceding actions/movements should enable the system to converge on those aspects of context and behavioural output that lead to the discovery of novel actions.

Introduction

When trying to discover the role of a specific component in a complicated system it is helpful to have some understanding of the system's overall function. The nigro-striatal and mesolimbic dopamine (DA) projections are important components of the basal ganglia, which are one of the brain's fundamental processing units. Thus, in this article we will start by considering briefly some prevalent ideas on basal ganglia involvement in action selection and reinforcement learning. Within this context we will proceed to evaluate the specific contribution of the phasic response of DA neurones to the processes of reinforcement learning. Although it is clear that DA projections ascending from the ventral midbrain target numerous structures, including intrinsic nuclei of the basal ganglia, frontal cortex, amygdala, hippocampus, septal area, several thalamic nuclei, and the habenula (Lindvall and Bjorklund, 1974), for the purpose of the present article we will restrict ourselves to discussion of how striatal function may be influenced by phasic DA input. Hopefully, a better understanding of this comparatively well characterised system will provide important clues concerning the role of phasic DA input to other target structures. This paper is an extension of ideas expressed in our recent perspectives article (Redgrave and Gurney, 2006).

Section snippets

Basal ganglia functions

In humans, basal ganglia dysfunction has been associated with numerous debilitating conditions including Parkinson's disease, Huntington's disease, Tourette's syndrome, schizophrenia, attention-deficit disorder, obsessive–compulsive disorder (Crossman, 2000, Fuxe et al., 2006, Ring and SerraMestres, 2002), and many of the addictions (Everitt and Robbins, 2005). However, as with investigating the roles of specific components, to understand and correctly interpret such malfunctions it would be

The source of short-latency visual input to the ventral midbrain

Since most experiments analysing the sensory properties of DA neurones have used visual stimuli (Morris et al., 2004, Morris et al., 2006, Ravel and Richmond, 2006, Satoh et al., 2003, Schultz, 1998, Schultz, 2006, Takikawa et al., 2004), we will concentrate on the potential sources of visual afferents to DA containing regions of the ventral midbrain.

In recent reviews of cortical visual processing (Rousselet et al., 2004, Thorpe and Fabre-Thorpe, 2001) Thorpe et al. indicated that signals

Visual perception in the dorsal midbrain

Electrophysiological responses of neurones in the mammalian superior colliculus are characterised by an exquisite sensitivity to spatially localised luminance changes in the retina (Grantyn, 1988, Sparks, 1986, Stein and Meredith, 1993, Wurtz and Albano, 1980). Such changes typically signify the appearance, disappearance or movement of something in the visual field. While there may be cell-types that can perform broad species specific stimulus classifications (de Gelder and Hadjikhani, 2006,

Visual perception in the ventral midbrain

Given that the superior colliculus is specialised to inform its efferent targets where something has occurred in the visual field, not what has occurred, it would be appropriate to pause and consider how DA neurones seem able to perform the detailed visual processing required to discriminate the complex conditioned visual stimuli that have been used to signal different reward magnitudes and probabilities (Fiorillo et al., 2003, Tobler et al., 2003, Tobler et al., 2005). Careful reading of

Agency determination and discovery of novel actions

The following discussion will extend our recent proposal (Redgrave and Gurney, 2006) that sensory driven phasic DA signals reinforce the discovery of events for which the animal, or more generally the agent, is responsible. Reinforcement of relevant signals within the presumed selection architecture of the basal ganglia (Fig. 2B) could enable the system to converge on aspects of context and behavioural output responsible for eliciting initially unpredicted sensory events.

The fog lifts a little?

Having identified a specific reinforcing role for DA in a connectional architecture whose function is to determine sources of agency and discover novel actions, we will now consider several hitherto puzzling issues.

Summary and conclusions

This paper is an elaboration of our recent proposal that sensory driven DA responses provide reinforcement signals required for the brain to discriminate the sensory events for which it is responsible (Redgrave and Gurney, 2006). As part of this process new responses required in specific circumstances to make events happen are discovered. A critical feature is that the proposed architecture requires a minimum of sensory processing to generate the necessary reinforcing signals. All the system

Acknowledgments

The authors would like to acknowledge the contributions of Paul Overton, Veronique Coizet, John McHaffie and Terry Stanford throughout the progressive evolution of these ideas. This review was written while the authors were in receipt of research funding from The Wellcome Trust (PR), BBSRC (PR), EPSRC (KG, PR), and the Marsden Fund of the Royal Society of New Zealand (JR, PR).

References (165)

ArbuthnottG.W. et al.
Space, time and dopamine
Trends Neurosci.
(2007)
BalleineB.W. et al.
Parallel incentive processing: an integrated view of amygdala function
Trends Neurosci.
(2006)
BarGadI. et al.
Information processing, dimensionality reduction and reinforcement learning in the basal ganglia
Prog. Neurobiol.
(2003)
BayerH.M. et al.
Midbrain dopamine neurons encode a quantitative reward prediction error signal
Neuron
(2005)
BevinsR.A. et al.
Novel-object place conditioning: behavioral and dopaminergic processes in expression of novelty reward
Behav. Brain Res.
(2002)
BlackJ. et al.
Reinforcement delay of one second severely impairs acquisition of brain self-stimulation
Brain Res.
(1985)
CalabresiP. et al.
Dopamine-mediated regulation of corticostriatal synaptic plasticity
Trends Neurosci.
(2007)
CoizetV. et al.
Nociceptive responses of midbrain dopaminergic neurones are modulated by the superior colliculus in the rat
Neuroscience
(2006)
CorbitL.H. et al.
The role of prelimbic cortex in instrumental conditioning
Behav. Brain Res.
(2003)
CorlettP.R. et al.
Prediction error during retrospective revaluation of causal associations in humans: fMRI evidence in favor of an associative model of learning
Neuron
(2004)

DayanP. et al.

Reward, motivation, and reinforcement learning

Neuron

(2002)

DeanP. et al.

Event or emergency? Two response systems in the mammalian superior colliculus

Trends Neurosci.

(1989)

DeanerR.O. et al.

Monkeys pay per view: adaptive valuation of social images by rhesus macaques

Curr. Biol.

(2005)

FreemanA.S.

Firing properties of substantia nigra dopaminergic neurons in freely moving rats

Life Sci.

(1985)

FudgeJ.L. et al.

Amygdaloid projections to ventromedial striatal subterritories in the primate

Neuroscience

(2002)

GraceA.A.

The tonic phasic model of dopamine system regulation—its relevance for understanding how stimulant abuse can alter basal ganglia function

Drug Alcohol Depend.

(1995)

GreengardP. et al.

Beyond the dopamine receptor: the DARPP-32/protein phosphatase-1 cascade

Neuron

(1999)

GrillnerS. et al.

Mechanisms for selection of basic motor programs — roles for the striatum and pallidum

Trends Neurosci.

(2005)

GuarraciF.A. et al.

An electrophysiological characterization of ventral tegmental area dopaminergic neurons during differential Pavlovian fear conditioning in the awake rabbit

Behav. Brain Res.

(1999)

HikosakaO.

GABAergic output of the basal ganglia

Prog. Brain Res.

(2007)

HorvitzJ.C. et al.

Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat

Brain Res.

(1997)

HylandB.I. et al.

Firing modes of midbrain dopamine cells in the freely moving rat

Neuroscience

(2002)

IkedaT. et al.

Reward-dependent gain and bias of visual responses in primate superior colliculus

Neuron

(2003)

KakadeS. et al.

Dopamine: generalization and bonuses

Neural. Netw.

(2002)

KatsutaH. et al.

Release from GABA(A) receptor-mediated inhibition unmasks interlaminar connection within superior colliculus in anesthetized adult rats

Neurosci. Res.

(2003)

KlopE.M. et al.

In cat four times as many lamina I neurons project to the parabrachial nuclei and twice as many to the periaqueductal gray as to the thalamus

Neuroscience

(2005)

KobayashiS. et al.

Influences of rewarding and aversive outcomes on activity in macaque lateral prefrontal cortex

Neuron

(2006)

LevesqueM. et al.

Corticostriatal projections from layer V cells in rat are collaterals of long-range corticofugal axons

Brain Res.

(1996)

LismanJ.E. et al.

The hippocampal-VTA loop: controlling the entry of information into long-term memory

Neuron

(2005)

MaedaH. et al.

Effects of peripheral stimulation on the activity of neurons in the ventral tegmental area, substantia nigra and midbrain reticular formation of rats

Brain Res. Bull.

(1982)

MahonS. et al.

Corticostriatal plasticity: life after the depression

Trends Neurosci.

(2004)

MarinO. et al.

Evolution of the basal ganglia in tetrapods: a new perspective based on recent studies in amphibians

Trends Neurosci.

(1998)

McHaffieJ.G. et al.

Subcortical loops through the basal ganglia

Trends Neurosci.

(2005)

McHaffieJ.G. et al.

A direct projection from superior colliculus to substantia nigra pars compacta in the cat

Neuroscience

(2006)

MinkJ.W.

The basal ganglia: focused selection and inhibition of competing motor programs

Prog. Neurobiol.

(1996)

MorrisG. et al.

Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons

Neuron

(2004)

NakaharaH. et al.

Dopamine neurons can represent context-dependent prediction error

Neuron

(2004)

O'DohertyJ.P. et al.

Temporal difference models and reward-related learning in the human brain

Neuron

(2003)

OhmanA.

The role of the amygdala in human fear: automatic detection of threat

Psychoneuroendocrinology

(2005)

OmelchenkoN. et al.

Glutamate synaptic inputs to ventral tegmental area neurons in the rat derive primarily from subcortical sources

Neuroscience

(2007)

AlexanderG.E. et al.

Parallel organization of functionally segregated circuits linking basal ganglia and cortex

Ann. Rev. Neurosci.

(1986)

ApicellaP. et al.

Responses of tonically discharging neurons in the monkey striatum to primary rewards delivered during different behavioral states

Exp. Brain Res.

(1997)

BardoM.T. et al.

Role of D1 and D2 receptors in novelty-maintained place preference

Exp. Clin. Psychopharm.

(1993)

BerridgeK.C.

The debate over dopamine's role in reward: the case for incentive salience

Psychopharmacol. (Berl.)

(2007)

BerridgeK.C. et al.

Sequential super-stereotypy of an instinctive fixed action pattern in hyper-dopaminergic mutant mice: a model of obsessive compulsive disorder and Tourette's

BMC Biol.

(2005)

BickfordM.E. et al.

Collateral projections of predorsal bundle cells of the superior colliculus in the rat

J. Comp. Neurol.

(1989)

BlatterK. et al.

Rewarding properties of visual stimuli

Exp. Brain Res.

(2006)

CalabresiP. et al.

Dopamine and cAMP-regulated phosphoprotein 32 kDa controls both striatal long-term depression and long-term potentiation, opposing forms of synaptic plasticity

J. Neurosci.

(2000)

CentonzeD. et al.

Dopaminergic control of synaptic plasticity in the dorsal striatum

Eur. J. Neurosci.

(2001)

ChevalierG. et al.

Disinhibition as a basic process in the expression of striatal functions

Trends Neurosci.

(1990)

Cited by (0)

View full text

ReviewWhat is reinforced by phasic dopamine signals?

Abstract

Introduction

Section snippets

Basal ganglia functions

The source of short-latency visual input to the ventral midbrain

Visual perception in the dorsal midbrain

Visual perception in the ventral midbrain

Agency determination and discovery of novel actions

The fog lifts a little?

Summary and conclusions

Acknowledgments

Trends Neurosci.

Trends Neurosci.

Prog. Neurobiol.

Neuron

Behav. Brain Res.

Brain Res.

Trends Neurosci.

Neuroscience

Behav. Brain Res.

Neuron

Neuron

Trends Neurosci.

Curr. Biol.

Life Sci.

Neuroscience

Drug Alcohol Depend.

Neuron

Trends Neurosci.

Behav. Brain Res.

Prog. Brain Res.

Brain Res.

Neuroscience

Neuron

Neural. Netw.

Neurosci. Res.

Neuroscience

Neuron

Brain Res.

Neuron

Brain Res. Bull.

Trends Neurosci.

Trends Neurosci.

Trends Neurosci.

Neuroscience

Prog. Neurobiol.

Neuron

Neuron

Neuron

Psychoneuroendocrinology

Neuroscience

Parallel organization of functionally segregated circuits linking basal ganglia and cortex

Ann. Rev. Neurosci.

Responses of tonically discharging neurons in the monkey striatum to primary rewards delivered during different behavioral states

Exp. Brain Res.

Role of D1 and D2 receptors in novelty-maintained place preference

Exp. Clin. Psychopharm.

The debate over dopamine's role in reward: the case for incentive salience

Psychopharmacol. (Berl.)

Sequential super-stereotypy of an instinctive fixed action pattern in hyper-dopaminergic mutant mice: a model of obsessive compulsive disorder and Tourette's

BMC Biol.

Collateral projections of predorsal bundle cells of the superior colliculus in the rat

J. Comp. Neurol.

Rewarding properties of visual stimuli

Exp. Brain Res.

Dopamine and cAMP-regulated phosphoprotein 32 kDa controls both striatal long-term depression and long-term potentiation, opposing forms of synaptic plasticity

J. Neurosci.

Dopaminergic control of synaptic plasticity in the dorsal striatum

Eur. J. Neurosci.

Disinhibition as a basic process in the expression of striatal functions

Trends Neurosci.

Review
What is reinforced by phasic dopamine signals?