RT Journal Article
SR Electronic
T1 Encoding of reinforcement along the hippocampal long axis and the transition from exploration to exploitation
JF bioRxiv
FD Cold Spring Harbor Laboratory
SP 2020.01.02.893255
DO 10.1101/2020.01.02.893255
A1 Alexandre Y. Dombrovski
A1 Beatriz Luna
A1 Michael N. Hallquist
YR 2020
UL http://biorxiv.org/content/early/2020/01/02/2020.01.02.893255.abstract
AB Hippocampal maps incorporate reward information, yet the functional contributions of the anterior and posterior hippocampal divisions (AH and PH) to reinforcement learning remain unclear. Here, we examined exploration and exploitation of a continuous unidimensional task with a basis function reinforcement learning model. In model-based fMRI analyses, we found doubly dissociated representations along the hippocampal long axis: state-wise reward prediction error signals in the PH (tail) and global value maximum signals in the AH (anterior body). PH prediction error signals predicted exploration whereas AH global value maximum signals predicted exploitation. AH-mediated exploitation depended on value representations compressed across episodes and options. PH responses to reinforcement were early and phasic while AH responses were delayed and evolved throughout learning. During choice, AH (head) displayed goal cell-like responses to the global value maximum. In summary, granular reinforcement representations in PH facilitate exploration and compressed representations of the value maximum in AH facilitate exploitation.