Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences

Sophie Bavard, Maël Lebreton, Mehdi Khamassi, Giorgio Coricelli, View ORCID ProfileStefano Palminteri
doi: https://doi.org/10.1101/295022
Sophie Bavard
1Laboratoire de Neurosciences Cognitives Computationnelles, Institut National de la Santé et Recherche Médicale, Paris, France
2Département d’Etudes Cognitives, Ecole Normale Supérieure, Paris, France
3Institut d’Etudes de la Cognition, Université de Paris Sciences et Lettres, Paris, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Maël Lebreton
4CREED lab, Amsterdam School of Economics, Faculty of Business and Economics, University of Amsterdam.
5Amsterdam Brain and Cognition, University of Amsterdam.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mehdi Khamassi
6Institut des Sciences de l'Information et de leurs Interactions, Sorbonne Universités, Paris France
7Institut des Systèmes Intelligents et Robotiques, Centre National de la Recherche Scientifique, Paris, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Giorgio Coricelli
8Departement of Economics, University of Southern California, Los Angels, USA
9Centro Mente e Cervello, Università di Trento, Trento, Italia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Stefano Palminteri
1Laboratoire de Neurosciences Cognitives Computationnelles, Institut National de la Santé et Recherche Médicale, Paris, France
2Département d’Etudes Cognitives, Ecole Normale Supérieure, Paris, France
3Institut d’Etudes de la Cognition, Université de Paris Sciences et Lettres, Paris, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Stefano Palminteri
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

In economics and in perceptual decision-making contextual effects are well documented, where decision weights are adjusted as a function of the distribution of stimuli. Yet, in reinforcement learning literature whether and how contextual information pertaining to decision states is integrated in learning algorithms has received comparably little attention. Here, in an attempt to fill this gap, we investigated reinforcement learning behavior and its computational substrates in a task where we orthogonally manipulated both outcome valence and magnitude, resulting in systematic variations in state-values. Over two experiments, model comparison indicated that subjects’ behavior is best accounted for by an algorithm which includes both reference point-dependence and range-adaptation – two crucial features of state-dependent valuation. In addition, we found state-dependent outcome valuation to progressively emerge over time, to be favored by increasing outcome information and to be correlated with explicit understanding of the task structure. Finally, our data clearly show that, while being locally adaptive (for instance in negative valence and small magnitude contexts), state-dependent valuation comes at the cost of seemingly irrational choices, when options are extrapolated out from their original contexts.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted April 06, 2018.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences
Sophie Bavard, Maël Lebreton, Mehdi Khamassi, Giorgio Coricelli, Stefano Palminteri
bioRxiv 295022; doi: https://doi.org/10.1101/295022
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences
Sophie Bavard, Maël Lebreton, Mehdi Khamassi, Giorgio Coricelli, Stefano Palminteri
bioRxiv 295022; doi: https://doi.org/10.1101/295022

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Animal Behavior and Cognition
Subject Areas
All Articles
  • Animal Behavior and Cognition (4235)
  • Biochemistry (9140)
  • Bioengineering (6784)
  • Bioinformatics (24005)
  • Biophysics (12131)
  • Cancer Biology (9537)
  • Cell Biology (13781)
  • Clinical Trials (138)
  • Developmental Biology (7638)
  • Ecology (11704)
  • Epidemiology (2066)
  • Evolutionary Biology (15513)
  • Genetics (10647)
  • Genomics (14327)
  • Immunology (9484)
  • Microbiology (22849)
  • Molecular Biology (9095)
  • Neuroscience (49003)
  • Paleontology (355)
  • Pathology (1483)
  • Pharmacology and Toxicology (2570)
  • Physiology (3848)
  • Plant Biology (8331)
  • Scientific Communication and Education (1471)
  • Synthetic Biology (2296)
  • Systems Biology (6193)
  • Zoology (1301)