RT Journal Article SR Electronic T1 Encoding of reinforcement along the hippocampal long axis and the transition from exploration to exploitation JF bioRxiv FD Cold Spring Harbor Laboratory SP 2020.01.02.893255 DO 10.1101/2020.01.02.893255 A1 Alexandre Y. Dombrovski A1 Beatriz Luna A1 Michael N. Hallquist YR 2020 UL http://biorxiv.org/content/early/2020/01/02/2020.01.02.893255.abstract AB Hippocampal maps incorporate reward information, yet the functional contributions of the anterior and posterior hippocampal divisions (AH and PH) to reinforcement learning remain unclear. Here, we examined exploration and exploitation of a continuous unidimensional task with a basis function reinforcement learning model. In model-based fMRI analyses, we found doubly dissociated representations along the hippocampal long axis: state-wise reward prediction error signals in the PH (tail) and global value maximum signals in the AH (anterior body). PH prediction error signals predicted exploration whereas AH global value maximum signals predicted exploitation. AH-mediated exploitation depended on value representations compressed across episodes and options. PH responses to reinforcement were early and phasic while AH responses were delayed and evolved throughout learning. During choice, AH (head) displayed goal cell-like responses to the global value maximum. In summary, granular reinforcement representations in PH facilitate exploration and compressed representations of the value maximum in AH facilitate exploitation.