Dopamine ramps are a consequence of reward prediction errors

Neural Comput. 2014 Mar;26(3):467-71. doi: 10.1162/NECO_a_00559. Epub 2013 Dec 9.

Abstract

Temporal difference learning models of dopamine assert that phasic levels of dopamine encode a reward prediction error. However, this hypothesis has been challenged by recent observations of gradually ramping stratal dopamine levels as a goal is approached. This note describes conditions under which temporal difference learning models predict dopamine ramping. The key idea is representational: a quadratic transformation of proximity to the goal implies approximately linear ramping, as observed experimentally.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Corpus Striatum / physiology*
  • Dopamine / metabolism*
  • Goals
  • Learning / physiology*
  • Maze Learning / physiology
  • Models, Neurological*
  • Rats
  • Reward*
  • Space Perception / physiology

Substances

  • Dopamine