PT - JOURNAL ARTICLE AU - Sakamoto, Kazuhiro AU - Okuzaki, Hidetake AU - Sato, Akinori AU - Mushiake, Hajime TI - Experience resetting in reinforcement learning facilitates exploration–exploitation transitions during a behavioral task for primates AID - 10.1101/2021.09.30.462676 DP - 2021 Jan 01 TA - bioRxiv PG - 2021.09.30.462676 4099 - http://biorxiv.org/content/early/2021/10/01/2021.09.30.462676.short 4100 - http://biorxiv.org/content/early/2021/10/01/2021.09.30.462676.full AB - The exploration–exploitation trade-off is a fundamental problem in re-inforcement learning. To study the neural mechanisms involved in this problem, a target search task in which exploration and exploitation phases appear alternately is useful. Monkeys well trained in this task clearly understand that they have entered the exploratory phase and quickly acquire new experiences by resetting their previous experiences. In this study, we used a simple model to show that experience resetting in the exploratory phase improves performance rather than decreasing the greediness of action selection, and we then present a neural network-type model enabling experience resetting.Competing Interest StatementThe authors have declared no competing interest.