Adding prediction risk to the theory of reward learning

Kerstin Preuschoff; Peter Bossaerts

doi:10.1196/annals.1390.005

Ann N Y Acad Sci. 2007 May:1104:135-46. doi: 10.1196/annals.1390.005. Epub 2007 Mar 7.

Authors

Kerstin Preuschoff¹, Peter Bossaerts

Affiliation

¹ Computational and Neural Systems, California Institute of Technology, Pasadena, CA 91125, USA.

PMID: 17344526

Abstract

This article analyzes the simple Rescorla-Wagner learning rule from the vantage point of least squares learning theory. In particular, it suggests how measures of risk, such as prediction risk, can be used to adjust the learning constant in reinforcement learning. It argues that prediction risk is most effectively incorporated by scaling the prediction errors. This way, the learning rate needs adjusting only when the covariance between optimal predictions and past (scaled) prediction errors changes. Evidence is discussed that suggests that the dopaminergic system in the (human and nonhuman) primate brain encodes prediction risk, and that prediction errors are indeed scaled with prediction risk (adaptive encoding).
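As an illustration of the idea summarized in the abstract, the following minimal Python sketch runs a Rescorla-Wagner update in which each prediction error is divided by a running estimate of prediction risk before it is applied. It is not taken from the article: the function name, the parameters (alpha, beta, v0, risk0, min_risk), the use of an exponentially weighted average of squared prediction errors as the risk estimate, and the choice to scale by the square root of that estimate are all assumptions made for this example.

```python
import math


def risk_scaled_rescorla_wagner(rewards, alpha=0.1, beta=0.1,
                                v0=0.0, risk0=1.0, min_risk=1e-8):
    """Rescorla-Wagner learning with risk-scaled prediction errors.

    Hypothetical sketch, not the article's exact formulation.
    rewards  : iterable of observed rewards, one per trial
    alpha    : learning rate applied to the scaled prediction error
    beta     : learning rate of the prediction-risk estimate
    v0       : initial reward prediction
    risk0    : initial prediction risk (expected squared prediction error)
    min_risk : floor that keeps the scaling factor away from zero
    """
    value, risk = v0, risk0
    values, risks, scaled_errors = [], [], []
    for reward in rewards:
        # Ordinary Rescorla-Wagner prediction error.
        delta = reward - value
        # Assumed risk estimate: running average of squared errors.
        risk += beta * (delta ** 2 - risk)
        # Scale the error by the square root of the (floored) risk.
        scaled = delta / math.sqrt(max(risk, min_risk))
        # Update the prediction with the scaled error.
        value += alpha * scaled
        values.append(value)
        risks.append(risk)
        scaled_errors.append(scaled)
    return values, risks, scaled_errors
```

Calling risk_scaled_rescorla_wagner(rewards) on a list of per-trial rewards returns the trial-by-trial predictions, risk estimates, and scaled errors. Because the errors are normalized before use, alpha here acts on errors of roughly unit variance, which is one way to read the abstract's remark that the learning rate then needs adjusting only when the covariance between optimal predictions and scaled prediction errors changes.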

Publication types

Research Support, Non-U.S. Gov't
Review

MeSH terms

Animals
Association Learning
Brain / anatomy & histology*
Brain / physiology
Brain Mapping*
Conditioning, Classical*
Dopamine / metabolism
Humans
Learning*
Least-Squares Analysis
Neurons / metabolism
Probability
Reinforcement, Psychology
Reward
Risk

Substances

Dopamine