Dopamine ramps are a consequence of reward prediction errors

Samuel J Gershman

doi:10.1162/NECO_a_00559

Neural Comput. 2014 Mar;26(3):467-71. doi: 10.1162/NECO_a_00559. Epub 2013 Dec 9.

Author

Samuel J Gershman¹

Affiliation

¹ Department of Brain and Cognitive Sciences, MIT, Cambridge, MA 02139, U.S.A. sjgershm@mit.edu.

PMID: 24320851
DOI: 10.1162/NECO_a_00559

Abstract

Temporal difference learning models of dopamine assert that phasic levels of dopamine encode a reward prediction error. However, this hypothesis has been challenged by recent observations of gradually ramping striatal dopamine levels as a goal is approached. This note describes conditions under which temporal difference learning models predict dopamine ramping. The key idea is representational: a quadratic transformation of proximity to the goal implies approximately linear ramping, as observed experimentally.
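
The representational idea can be illustrated with a minimal sketch. It is not the paper's model or simulation. It assumes that proximity to the goal rises from 0 to 1 in equal steps, that the value estimate is simply the squared proximity with a fixed weight of 1, that no reward arrives before the goal, and that the discount factor `gamma` is 1. The function name `td_errors` and its parameters are hypothetical.

```python
def td_errors(n_steps=20, gamma=1.0):
    """TD errors along a straight run toward the goal, before any reward."""
    step = 1.0 / n_steps
    # Assumption: proximity to the goal rises from 0 to 1 in equal steps.
    proximity = [i * step for i in range(n_steps + 1)]
    # Assumption: value estimate is quadratic in proximity, weight fixed at 1.
    value = [p ** 2 for p in proximity]
    # delta_t = r_t + gamma * V(x_{t+1}) - V(x_t), with r_t = 0 en route.
    return [gamma * value[t + 1] - value[t] for t in range(n_steps)]


deltas = td_errors()
# Differences between successive TD errors; constant means a linear ramp.
increments = [b - a for a, b in zip(deltas, deltas[1:])]
```

Under these assumptions the prediction error at proximity x with step size s is (x + s)^2 - x^2 = 2sx + s^2, which is linear in x, so `increments` is constant up to floating-point rounding. With `gamma` slightly below 1 an additional term (gamma - 1)x^2 appears, so the ramp is only approximately linear.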

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Animals
Corpus Striatum / physiology*
Dopamine / metabolism*
Goals
Learning / physiology*
Maze Learning / physiology
Models, Neurological*
Rats
Reward*
Space Perception / physiology

Substances

Dopamine