Investigating working memory updating processes of the human subcortex using 7 Tesla fMRI

The prefrontal-cortex basal ganglia working memory (PBWM) model (Hazy et al., 2007; O’Reilly & Frank, 2006) proposes that working memory representations are updated via a striatal gating mechanism but lacks conclusive empirical support for the postulated subcortical involvement. A growing body of research suggests that dopamine is also involved in working memory updating (Braver & Cohen, 2000; Cools & D’Esposito, 2011; D’Ardenne et al., 2012; Jongkees, 2020). In this study, we investigated subcortical–in particular, possible dopaminergic–involvement in working memory updating subprocesses using the reference-back task and ultra-high field 7 Tesla fMRI. Using a scanning protocol optimized for BOLD-sensitivity in the subcortex, we found no evidence of subcortical activation during working memory gate opening, which challenges the PBWM model’s striatal gating mechanism. However, during gate closing, subcortical activation was observed. Furthermore, a ready-to-update mode demonstrated large-spread subcortical activation, including basal ganglia nuclei, suggesting that the basal ganglia are engaged in general updating processes rather than specifically controlling the working memory gate. Evidence for activity in dopaminergic midbrain regions was also observed in both contrasts. Also, substituting new information into working memory elicited activation in dopamine-producing midbrain regions along with the striatum, thalamus, and prefrontal cortex, indicating engagement of the basal ganglia-thalamo-cortical loop possibly driven by dopaminergic activity. These findings expand our understanding of subcortical regions involved in working memory updating, providing additional insights into the role of the dopaminergic midbrain.


Introduction
Dopamine exerts a profound neuromodulatory impact on neural activity and is capable of shaping behavior at a large scale.In particular, dopamine plays a central role in cognitive control, and leading theories detail how dopamine networks influence cognitive functions (Cools & D'Esposito, 2011;Durstewitz & Seamans, 2008;Ott & Nieder, 2019).These theories assign key roles to the mesocortical and nigrostriatal dopaminergic pathways, which originate in the midbrain and project to the prefrontal cortex (PFC) and cingulate (for the mesocortical pathway), and the striatum (for the nigrostriatal pathway).These pathways are believed to promote either cognitive stability (protection from distractors) or flexibility (ability to incorporate/adapt to new information), respectively (Armbruster et al., 2012;Cools, 2006;Cools & D'Esposito, 2011;Durstewitz & Seamans, 2008;Goschke & Bolte, 2014).This balance is known as the stability-flexibility trade-off (Dreisbach, 2012;Dreisbach & Fröber, 2019;Hommel, 2015), influencing cognitive functions like working memory.For instance, maintaining stable working memory representations relies on PFC activity, the effectiveness of which is closely tied to cortical dopamine levels controlled by the mesocortical pathway (Durstewitz & Seamans, 2008).By contrast, the ability to flexibly update working memory representations is thought to be indirectly regulated by dopamine through the basal ganglia, implicating mesolimbic and nigrostriatal pathways (Chatham & Badre, 2015;Cohen et al., 2002).Several studies have provided initial support for working memory updating via the basal ganglia (Chatham & Badre, 2015;Murty et al., 2011;Nir-Cohen et al., 2020, 2023).Even though the processes involved in allowing new information into working memory (i.e., updating) depend on dopamine (Cools & D'Esposito, 2011;Goldman-Rakic, 1995), the neural basis of working memory updating remains unclear.
General working memory functioning has been robustly associated with the fronto-parietal network (FPN), which consists of the dorsolateral and medial PFC (dl/mPFC), and the posterior parietal cortex (PPC) (Jacob & Nieder, 2014;Mehta et al., 2000;Nir-Cohen et al., 2020, 2023;Vallentin et al., 2012).Additionally, several studies have implicated the basal ganglia in working memory encoding and maintenance (Bedwell et al., 2005;Chang et al., 2007), updating (Chatham & Badre, 2015;Murty et al., 2011), and gating (Nir-Cohen et al., 2020).However, the exact neural processes involved in the updating of working memory, especially those involving the subcortical structures, are still not fully understood.Much of this uncertainty can be attributed to the relatively low signal-to-noise ratios of functional magnetic resonance imaging (fMRI) scans within the subcortex, as well as the inherent biophysical characteristics of subcortical nuclei, such as the presence of iron.These factors pose challenges when using conventional fMRI protocols to investigate the subcortex.In this study, we addressed this gap by conducting ultra-high field 7 Tesla (T) fMRI to shed light on the subcortical structures thought to underlie working memory updating.
Possible mechanisms behind selectively gating new information into working memory, i.e., updating, are captured in the prefrontal-cortex basal-ganglia working memory (PBWM) model (Frank et al., 2001;Hazy et al., 2007;O'Reilly & Frank, 2006).Central to the PBWM model is its proposition of a dynamic gating mechanism, where activation of the striatum acts as the pivotal initiator to open the gate to working memory-resulting in disinhibition of the thalamus-which in turn modulates the stability of working memory representations in PFC.
Initial support for the PBWM model has been found in studies employing the referenceback paradigm (Rac-Lubashevsky & Kessler, 2016a) in combination with EEG (Rac-Lubashevsky & Kessler, 2018), dopaminergic manipulations (Jongkees, 2020), and fMRI (Nir- Cohen et al., 2020).The reference-back task is designed to identify the behavioral signatures of several subprocesses involved in working memory updating.These include working memory gate opening and closing, substituting new items into working memory, and a contrast between updating and maintenance modes (indicating readiness to either update or shield the contents of working memory, respectively).These processes are derived by presenting a stream of reference and comparison stimuli, which require updating and maintenance, respectively.Gate opening represents the active process of switching from a maintenance mode to a mode that facilitates updating.Thus, opening the gate takes place when cortical cell assemblies are prepared for new information to enter working memory in trials that may call for updating.Similarly, gate closing occurs when information needs to be maintained, and thus, the working memory gate should be closed and a maintenance mode activated in order to avoid interference.Substitution represents the process of replacing old information with new information and thus represents the actual act of updating working memory.The updating mode reflects an open-gate state (successive reference trials) independent of whether updating of information is required.
Nir- Cohen et al. (2020) found striatal activity associated with gate opening and substitution with fMRI at 3T.However, they found no evidence of subcortical involvement in the other aspects of working memory updating, like gate closing.This implies that distinct neural mechanisms control gate opening and gate closing.Nir-Cohen et al. suggested the involvement of the mesocortical pathway as a possible explanation for the lack of striatal activity during gate closing in line with the dual-state theory (Durstewitz & Seamans, 2008;O'Reilly, 2006), which postulates that mesocortical dopamine neurons of the ventral tegmental area (VTA) modify PFC neuronal firing to influence the stability of working memory representations.This implies the possibility of a direct dopaminergic mechanism originating in the midbrain governing the switch from updating mode to maintenance mode in gate closing.Also, in the PBWM model, dopamine may take a key role in the selective updating of working memory as the striatal pathways of the basal ganglia are abundant in dopamine receptors, enabling dopamine to affect basal ganglia processes critically.The neural origin of the striatal dopamine is found in the midbrain's dopaminergic cell populations, VTA, and substantia nigra (SN).In fact, a revised version of the PBWM model for Parkinson's disease patients (Moustafa, Sherman, & Frank, 2008) accounts for the interaction of dopamine with working memory by including the substantia nigra pars compacta (SNc), emphasizing the role of dopamine in working memory updating.
Since dopaminergic functioning in the brain is almost exclusively based on the activity of dopaminergic cell assemblies in the midbrain, the VTA, and SN-specifically SNc, these areas warrant special attention in explorations of dopamine-related processes.Initial fMRI evidence suggests a link between working memory and midbrain activity.There were indications of the involvement of the VTA and SN in the updating of context-related information (D'Ardenne et al., 2012).Additionally, the combined processing of information removal and updating (Murty et al., 2011) also implicated these midbrain structures.Moreover, Hazy, Frank, and O'Reilly (2006) provide their biologically based cognitive architecture onto the PBWM model and, in fact, suggest that dopaminergic signals arising from midbrain nuclei, specifically the VTA and SNc, modulate learning when striatal gating to mediate PFC activity should occur.
The indication that dopaminergic midbrain activity might play a pivotal role in updating working memory representations hints at a significant overlap in the brain mechanisms responsible for updating representations in working memory and updating learned reward-based values.The same neural circuits have been associated with updating of value-based decisionmaking and reinforcement learning (Corlett et al., 2022;Jocham et al., 2011;O'Reilly, 2006), in which the dopaminergic midbrain signal known as the reward prediction error (RPE) plays a key role (Montague et al., 1996;Schultz, 2013;Schultz et al., 1997).The RPE signal represents a value-updating signal that occurs when a received reward differs from what was expected according to reinforcement learning models (Miletić et al., 2021;O'Doherty et al., 2017;Sutton & Barto, 2018).The signal is encoded by phasic dopaminergic signals in the VTA and SNc (e.g., Cooper et al., 2012;Montague et al., 1996;O'Doherty et al., 2017;Schultz et al., 1997), targeting PFC representations via the striatum, following the same neural route that has been proposed to be involved in working memory updating (Cohen et al., 2002;Hazy et al., 2007;Murty et al., 2011;Ponzi, 2008).
In this study, we aimed to extend the work of Nir- Cohen et al. (2020) by exploring the neural correlates of working memory updating subprocesses, particularly focusing on the subcortex.To achieve this, we employed a scanning and analysis protocol to improve the signal quality from subcortical nuclei.This protocol included ultra-high field 7 T fMRI with a scanning protocol tailored to meet the requirements for imaging of the subcortex (de Hollander et al., 2017;Miletić et al., 2020) and individually parcellated masks of several subcortical nuclei (Bazin et al., 2020).While Nir-Cohen et al.'s results on striatal gate opening align with the PBWM model's predictions, they do not offer enough evidence to definitively rule out the subcortex's involvement in other working memory update subprocesses.Hence, the present study specifically aimed to shed light on subcortical involvement in working memory subprocessesassociated with gating, substitution, and being in an updating mode-and to discern contributions from several subcortical structures.In addition to investigating subcortical nuclei associated with the basal ganglia-thalamo-cortical loop, we hypothesized that midbrain nuclei containing dopaminergic neurons (VTA and SN) might play a crucial role in several working memory updating subprocesses, with their activation in each contrast indicating different neural mechanisms.Precisely, in light of the findings from Nir-Cohen et al. discussed earlier, we test whether the VTA demonstrates enhanced activity during gate closing as postulated by the dualstate theory (Durstewitz & Seamans, 2008).Furthermore, we aimed to investigate evidence of neural correlates of working memory substitution in the midbrain, specifically in the VTA and/or SN, which would support the idea that a phasic dopamine signal contributes to the act of changing working memory content, akin to the process of value updating following RPE.In contrast, VTA and/or SN activity in the gate opening process would lend support to theoretical accounts suggesting that a phasic dopamine signal opens the gate to working memory and, this way, facilitates (potential) subsequent working memory updating.Along these lines, activation in the VTA and/or SN during an updating mode may support the idea that dopamine signals are actively engaged in keeping the gates to working memory open.
Taken together, this 7T fMRI study was designed to shed light on subcortical-in particular dopaminergic-contributions to working memory updating subprocesses in the human brain.

Procedure
37 participants (20 female; mean age 26.65±5.72 years; age range 19 -39 years) took part in the study, which was approved by the ethical committee at the University of Amsterdam, the Netherlands and the Regional Committees for Medical and Health Research Ethics, Norway.All participants provided written informed consent and completed MRI screening forms to ensure they were eligible for scanning.The recruitment was conducted at the Norwegian University of Science and Technology.The participants had a corrected-to-normal vision and no history of epilepsy or overt clinical neuropsychiatric disease.Two participants' data were excluded because they had more than 30 percent non-responses or less than 70 percent accuracy on the referenceback task, suggesting that these participants were either disengaged or misunderstood the task instructions.

Reference-back task
To disentangle the various working memory updating subprocesses, participants completed the reference-back task (Rac-Lubashevsky & Kessler, 2016a).The task required participants to compare a presented stimulus to a reference stimulus held in working memory (i.e., the referent; Fig. 1).The color of the stimulus frame indicated whether to update the referent with the presented stimulus (on red frame/reference trials) or to maintain the existing referent (on blue frame/comparison trials).When the presented stimulus matched the referent, participants were instructed to respond "same" (by pressing the right key).When the presented stimulus did not match the referent, participants were instructed to respond "different" (by pressing the left key).In other words, reference trials required updating working memory because the current stimulus served as a referent for subsequent trials.By contrast, comparison trials did not require updating working memory.Working memory gate opening was needed in reference trials that followed comparison trials, and working memory gate closing was required in comparison trials that followed reference trials; hence, both involved a switch in the gate state.Accordingly, the gate state 'switched' or 'repeated' in each trial.This resulted in eight conditions: 2 (trial type: reference vs. comparison) × 2 (switch type: repeat vs. switch) × 2 (response: same vs.different), which formed the basis of the various reference-back contrasts.
In each block of the task, the trial sequence began with a reference trial that required no response.Following the reference trial, a fixation cross was displayed at the center of the display, followed by a framed letter ("X" or "O") presented for a duration of two seconds.Participants were instructed to respond with either "same" or "different" during the stimulus presentation phase.Between each pair of stimuli, a fixation cross was presented.The duration of the fixation cross varied randomly and was selected from a pseudo-exponential distribution, with possible durations of 0.75, 1.5, 2.5, or 3 seconds, in order to decorrelate the design matrix.After the stimulus presentation phase, an intertrial interval was introduced, also in the form of a fixation cross.This intertrial interval had a variable duration ranging from 0.52 to 2.77 seconds in order to ensure that each trial's total duration was set at four times the TR (Repetition Time), totaling 5.53 seconds per trial.This design resulted in a range of total fixation cross durations between 1.27 and 5.77 seconds for each trial.Within each block, all eight conditions formed by the combination of trial type, switch type, and response were presented 16 times, leading to a total of 256 trials (32 trials for each condition).
In accordance with Nir- Cohen et al. (2020), we defined four contrasts based on the eight reference-back conditions (Fig. 2): gate opening, gate closing, substitution, and updating mode.Gate opening was measured by the difference between reference/switch and reference/repeat trials.This is because the process of opening the gate to working memory should only take place in reference/switch trials, as no change in the gate state is required in repeated reference trials.Similarly, gate closing was measured by the difference between comparison/switch and comparison/repeat trials.Substitution takes place in reference/different trials.However, in order to isolate it from any gate-switching effects, the contrast only takes into account 'repeat' trials.Furthermore, in order to set apart the general effects of "same" and "different" responses, the difference between "same" and "different" responses in comparison trials was used as a baseline.Thus, the cost of substitution is indicated by a larger difference between "same" and 'different' responses in reference/repeat trials compared to comparison/repeat trials.Additionally, we computed the updating mode contrast (Nir-Cohen et al., 2020), operationalized as the difference between reference/repeat and comparison/repeat trials (Fig. 2).

Behavioral analysis
Statistical tests on all the conditions, including four a priori contrasts of interest, were conducted to examine the effects of different trial types and reference-back measures on mean response time (RT) and accuracy.
The first trial from each block, which did not require a response, was excluded, as well as any trials with response times faster than 0.150 seconds or slower than 3 seconds, following the exclusion criteria used in Boag et al. (2021).For RT analysis, error trials were excluded.
Linear mixed models were employed to assess the statistical significance of trial type (reference/comparison), switch type (switch/repeat), and response (same/different), along with the four contrasts (gate opening, gate closing, substitution, and updating mode) on mean RT and accuracy.General linear mixed models with a Gaussian link function were used for mean RT, and generalized linear mixed models with a probit link function were used for accuracy.Each model included trial type, switch type, and response as fixed effects, along with random intercepts for each participant.We used a significance criterion of alpha equivalent to .05.

MRI data acquisition
Each participant was scanned four times using a 7T Siemens MAGNETOM TERRA scanner (gradient strength = 80 mT/m at 200 T/m/s) equipped with a 32-channel Nova Medical's single channel transmit 32-channel receive head coil.Anatomical scans were collected in the first session, and the remaining sessions involved different functional scans.The data discussed in this article were collected in one of the functional sessions that included four functional runs in which the participants completed two different tasks.One task was the reference-back (Fig. 1), which consisted of two runs of 129 trials each.The other task was a reversal-learning task and is not analyzed in the present article.The order of the two tasks was randomized between subjects.
The experimental session comprised four functional echo-planar imaging (EPI) runs with four EPI volumes acquired with opposite phase encoding directions for susceptibility distortion correction.The functional data were collected using a single-echo 2D-EPI BOLD sequence with the following parameters: TR = 1380 ms, TE = 14 ms, MB = 2, GRAPPA = 3, voxel size = 1.5 mm isotropic, partial Fourier = 6/8, flip angle = 60°, MS mode = interleaved, FOV = 192 x 192 x 128 mm, matrix size = 128 x 128, BW = 1446 Hz/Px, slices = 82, phase encoding direction = A >> P, and echo spacing = 0.8 ms.Each of the two tasks of the functional session had a total of two runs, with each run lasting 13 minutes and 45 seconds, resulting in a total of four runs and 55 minutes.To help co-register the functional scans to the high-resolution data from the anatomical session, another anatomical MP2RAGE scan (1mm) was collected at the end of the functional session.
Physiological data, including heart rate and respiration, were recorded for all participants to assess the impact of physiological noise on the fMRI data.
The following preprocessing was performed for each of the two functional (BOLD) runs per task per participant.A reference volume and its skull-stripped version were generated by aligning and averaging 1 single-band reference (SBRefs).A B0-nonuniformity map (or fieldmap) was estimated based on two EPI references with opposing phase-encoding directions, with 3dQwarp (Cox & Hyde, 1997;AFNI 20160207).Based on the estimated susceptibility distortion, a corrected EPI reference was calculated for a more accurate co-registration with the anatomical reference.The BOLD reference was then co-registered to the T1w reference using bbregister (FreeSurfer 6.0.1), which implements boundary-based registration (Greve & Fischl, 2009).Coregistration was configured with six degrees of freedom.Head-motion parameters with respect to the BOLD reference (transformation matrices and six corresponding rotation and translation parameters) were estimated before any spatiotemporal filtering using mcflirt (FSL 5.0.9, Jenkinson et al., 2002).BOLD runs were slice-time corrected using 3dTshift from AFNI 20160207 (Cox & Hyde, 1997;RRID:SCR_005927).A reference volume and its skull-stripped version were generated using a custom methodology of fMRIPrep.The BOLD time-series (including slicetiming correction when applied) were resampled onto their original, native space by applying a single, composite transform to correct for head-motion and susceptibility distortions.These resampled BOLD time-series will be referred to as preprocessed BOLD in original space, or just preprocessed BOLD.Several confounding time-series were calculated based on the preprocessed BOLD: framewise displacement (FD), 'DVARS' (the spatial standard deviation of difference images), and three region-wise global signals.FD was computed using two formulations following Power (absolute sum of relative motions, Power et al., 2014) and Jenkinson (relative root mean square displacement between affines, Jenkinson et al., 2002).FD and DVARS were calculated for each functional run, both using their implementations in Nipype (following the definitions by Power et al., 2014).The three global signals were extracted within the CSF, the WM, and the whole-brain masks.Additionally, a set of physiological regressors was extracted to allow for component-based noise correction (CompCor, Behzadi et al., 2007).Principal components were estimated after high-pass filtering the preprocessed BOLD time-series (using a discrete cosine filter with 128 seconds cut-off) for the two CompCor variants: temporal (tCompCor) and anatomical (aCompCor).tCompCor components were then calculated from the top 2 percent variable voxels within the brain mask.For aCompCor, three probabilistic masks (CSF, WM, and combined CSF + WM) were generated in anatomical space.The implementation differs from that of Behzadi et al. (2007).Instead of eroding the masks by two pixels on BOLD space, the aCompCor masks were subtracted from a mask of pixels that likely contain a volume fraction of GM.This mask is obtained by dilating a GM mask extracted from the FreeSurfer's aseg segmentation, and it ensures that components were not extracted from voxels containing a minimal fraction of GM.Finally, these masks were resampled into BOLD space and binarized by thresholding at 0.99 (as in the original implementation).Components were also calculated separately within the WM and CSF masks.For each CompCor decomposition, the k components with the largest singular values were retained, such that the retained components' time-series are sufficient to explain 50 percent of variance across the nuisance mask (CSF, WM, combined, or temporal).The remaining components were excluded.The head-motion estimates calculated in the correction step were also placed within the corresponding confounds file.The confound time-series derived from head motion estimates and global signals were expanded with the inclusion of temporal derivatives and quadratic terms for each (Satterthwaite et al., 2013).Frames that exceeded a threshold of 0.5 mm FD or 1.5 standardized DVARS were annotated as motion outliers.All resamplings can be performed with a single interpolation step by composing all the pertinent transformations (i.e., head-motion transform matrices, susceptibility distortion correction when available, and co-registrations to anatomical and output spaces).Gridded (volumetric) resamplings were performed using antsApplyTransforms (ANTs), configured with Lanczos interpolation to minimize the smoothing effects of other kernels (Lanczos, 1964).Nongridded (surface) resamplings were performed using mri_vol2surf (FreeSurfer).Many internal operations of fMRIPrep use Nilearn 0.6.2(Abraham et al., 2014, RRID:SCR_001362), mostly within the functional processing workflow.

Regions-of-interest
For the selection of regions-of-interests (ROIs), we selected only subcortical masks for our ROI analyses given our main research aim centered on the subcortex.Second, we included individual subcortical masks derived from the MASSP automated parcellation algorithm (Bazin et al., 2020) to increase delineation accuracy on a subject level.Third, bilateral masks for each ROI were included to maximize regional specificity.Fourth, the pallidum was included with individual masks for its external (GPe) and internal (GPi) segments.Fifth, playing an important role in subcortical functioning in general and in basal ganglia processes in particular, the subthalamic nucleus (STN) mask was also selected for this study.Most importantly, we incorporated masks of the dopaminergic midbrain given our secondary research aim focused on the dopamine sources.This resulted in 7 masks (14 bilaterally).Hence, the MASSP algorithm (Bazin et al., 2020) was used to parcellate individual anatomical masks for the thalamus (Tha), striatum (Str), GPe and GPi, STN, SN, and VTA.
In addition, for a post-hoc ROI analysis we extended the dopaminergic midbrain masks utilizing the subdivision masks based on the probabilistic atlas from Pauli, Nili, and Tyszka (2018).These masks diverge from the delineations by MASSP (Bazin et al., 2020) in both shape and volume (Trutti et al., 2021), and were constructed with the aim of delineating two functionally different component nuclei of the VTA.Specifically, the nucleus of the VTA (VTAnc) and Parabrachial Pigmented (PBP) nucleus.The VTAnc refers to a small ventromedially located nucleus of the VTA, and the PBP mask represents a dorsolateral component nucleus of the VTA.Both VTAnc and PBP masks provided by Pauli et al. (2018) fall into the region defined as VTA by MASSP (Trutti et al., 2019).In addition, the authors put forward a subdivided SN, providing separate masks of the SNc and SNr, respectively (Pauli et al., 2018).Given that both VTA masks and the SNc are associated with dopaminergic cell populations, ROI analyses were repeated employing these three masks.
Furthermore, a supplementary exploration into the BOLD signal clusters within ROIs was carried out for comparison with Nir- Cohen et al. (2020).This analysis utilized the wholebrain statistical parametric maps (SPMs) derived from the whole-brain GLMs and, therefore, did not include individual parcellations but, aligning with Nir- Cohen et al. (2020), employed probabilistic masks of the striatal subdivision putamen (Pu) and caudate nucleus (Ca) from Pauli et al. (2018), instead of the entire striatum from MASSP.The remaining masks were taken from the probabilistic group-level masks from Bazin et al. (2020).See the appendix for the clusterbased ROI analysis elaboration (appendix A1.1).

fMRI statistical analysis
The aim of our 7T fMRI study was to shed light on subcortical-in particular possible dopaminergic-contributions to working memory updating subprocesses in the human brain.To achieve this, we extended Nir-Cohen and colleagues' (2020) fMRI data analysis procedure, with a specific emphasis on the subcortical regions.Our foundational objective was to compare the results obtained from a protocol optimized for BOLD sensitivity in the subcortex with the current empirical evidence, which has not definitively established the involvement of subcortical regions.This included a multi-step fMRI analysis.First, a whole-brain analysis was conducted to investigate brain activation by means of SPMs for each contrast on a whole-brain level.Second, ROI-wise GLM analysis with increased regional specificity due to extraction from unsmoothed data was conducted to explore the contribution of several subcortical nuclei in working memory updating.Details about the GLM analysis on both the whole-brain and region-specific levels are listed below.Third, a cluster-based ROI analysis was carried out to explore the presence of significant clusters within each ROI, akin to Nir-Cohen et al. (2020) (for detailed methodology, see A1.2 in the appendix).
A canonical double gamma hemodynamic response function (HRF) with temporal derivative was employed as the basis set for all methods of analysis (Glover, 1999).The design matrix was constructed to encompass the eight experimental conditions associated with the reference-back resulting from the 2 (trial type: reference vs. comparison) × 2 (switch type: repeat vs. switch) × 2 (response type: same vs.different) factorial design.Before conducting the GLM analysis, the functional data underwent high-pass filtering (Smith & Brady, 1997), and subsequently, spatial smoothing was applied using SUSAN (kernel-size full-width half-maximum = 4.5 mm).Notably, this spatial smoothing factor deviates from the one employed in the study by Nir-Cohen et al. in 2020, where a larger smoothing factor of 6 mm was used.This variation arises due to differences in image acquisition protocols and the improved image resolution and quality in our study, making such extensive smoothing unsuitable for our dataset (De Hollander et al., 2015).In addition to the task-specific regressors, our design matrix incorporated six motion parameters (comprising three translational and three rotational parameters), along with DVARS and FD estimates obtained during preprocessing.An 18-regressor RETROICOR model (Glover et al., 2000) was employed to model physiological noise.The model comprised a fourthorder phase Fourier expansion of the heart rate signal, a second-order phase expansion of the respiration signal, and a second-order phase Fourier expansion of the interaction between heart rate and respiration (Harvey et al., 2008).Additional regressors were used to account for heart rate variability (HRV; Chang et al., 2009) and respiratory volume per time unit (RVT; Birn et al., 2008;Harrison et al., 2021).The PhysIO toolbox (Kasper et al., 2017) implemented in the TAPAS software (Frässle et al., 2021) was employed to estimate the physiological regressors.For two participants, either both runs or only the second run of physiological data were not collected due to technical issues.In these cases, we substituted the first 20 components obtained through aCompCor, as described by Behzadi et al. (2007).Therefore, a total of 36 regressors were used in the model.
Whole-brain analyses were conducted using the FILM method from FSL FEAT (Jenkinson et al., 2012;Woolrich et al., 2001).These analyses took into account autocorrelated residuals.For the purpose of combining the GLMs at the run level for each task, fixed effects analyses were employed.Group-level models were subsequently estimated using FLAME1+2 from FSL (Woolrich et al., 2001).SPMs were generated to visualize the resulting group-level models.The maps were corrected for the family-wise error rate (FWER) using the random Gaussian field (GRF) procedure and a critical value of q < 0.05 (Nichols & Hayasaka, 2003), and a minimum cluster size (K E ) of 10 voxels.
In addition, the ROI GLM analysis was conducted as follows: Mean time-series data were extracted from each subcortical region of interest first, using probabilistic masks provided by the individual MASSP parcellation (Bazin et al., 2020) and second, using the probabilistic (non-individual) atlas by Pauli et al. (2018).Each voxel's contribution to the mean signal of the region was weighted based on its probability of belonging to that region.Subsequently, the timeseries data were transformed into percentage signal change values by dividing each timepoint by the mean signal of the time-series, multiplying the result by 100, and then subtracting 100.These time-series data were extracted from unsmoothed data to ensure regional specificity.The individual runs were concatenated for analysis.We infer exclusively from positive BOLD responses, given the disagreements surrounding negative BOLD responses (Schridde et al., 2008;Wade, 2002).The ROI GLMs thus encompassed ROI-wise GLM's fit on the mean time-series extracted from the unsmoothed functional data of each voxel.Bayesian one-sample t-tests were computed to investigate the evidence for each contrast based on the mean beta (i.e., signal change) derived from the ROI GLMs.For classification, we utilized Bayesian classification by Jeffreys (1998), where Bayes Factors between 1 and 3 indicate weak evidence, Bayes Factors between 3 and 10 indicate moderate evidence, and Bayes Factors greater than 10 indicate strong evidence.
Lastly, by means of providing comparability with the only other existing fMRI study employing the same experimental paradigm (Nir- Cohen et al., 2020), we also ran an additional 'cluster-based ROI analysis' which examined each ROI for clusters of activation based on the whole-brain GLMs after correcting for multiple comparisons across the ROI's voxels.Here, we investigated the involvement of specific brain regions previously associated with working memory, including the frontoparietal network and the basal ganglia in each working memory subprocess (for methods and results, see A1 in the appendix).

Behavioral Analysis
The results of the behavioral analyses of the reference-back task are presented in the appendix (Tab.A3 -A5).Descriptive statistics are reported in Table 1.
Figure 3 illustrates the group-averaged mean RT and accuracy for each condition of the experimental design.The overall mean RT was 0.86 seconds (mean RT for correct responses was 0.85 seconds), and the overall accuracy on the task was 94.7 percent, indicating that participants understood the task instructions.Correct responses were, on average, faster than errors (Δ = 0.169 s, t = 12.8, df = 149, p < .001,d = 0.61).
For mean RT, there were significant main effects of trial type, switch type, and response.Additionally, the two-way interactions but not the three-way interactions were significant.For accuracy, there were no significant main effects.However, we found two-way interactions between trial type and switch type paired with trial type and response.There was no three-way interaction (Tab.A3-A5, Fig. A2).
Responses were not substantially slower on comparison/switch (M = 0.829 s, SEM = 0.012) than comparison/repeat trials (M = 0.821 s, SEM = 0.008), suggesting no behavioral RT cost of closing the working memory gate.Interestingly, responses were more accurate on comparison/switch trials (M = 0.962, SEM = 0.005) compared to comparison/repeat trials (M = 0.933, SEM = 0.008), indicating that gate closing did not produce an accuracy cost but instead, these results suggest accuracy costs in keeping the gate open.
For the substitution contrast, the difference between "same" and "different" responses on repeated reference trials was significantly slower but not less accurate (RT: M = 0.960 s, SEM = 0.009; Accuracy: M = 0.953, SEM = 0.007) than the difference between "same" and "different" responses in repeated comparison trials (RT: M = 0.863 s; SEM = 0.007; Accuracy: M = 0.936, SEM = 0.008).This suggests only costs of response time but not accuracy in working memory substitution processing.Finally, responses were slower but more accurate on repeated reference trials (RT: M = 0.856 s, SEM = 0.016; Accuracy: M = 0.962, SEM = 0.005) responses compared with repeated comparison trials (RT: M = 0.821 s, SEM = 0.008; Accuracy: M = 0.933, SEM = 0.008), reflecting the response time cost of being in a general updating mode.

Whole-brain GLMs
Whole-brain analyses were conducted to explore the brain activations associated with gate opening, gate closing, substitution, and updating.

Gate Opening
Gate opening was associated with large bilateral clusters of activation across the cortex.Specifically, we found increased activation in frontal cortical regions, including the mPFC and dlPFC, as well as the somatosensory and motor cortices (Fig. 4; Tab. 2).Gate opening was also associated with increased activation in posterior parietal regions, including the precuneus cortex and left parietal lobe.The occipital lobe, including the left and right fusiform cortex, also showed increased activation during gate opening.Subcortically, gate opening was associated with clusters of increased activation in the thalamus, in particular, a large cluster in the left thalamus.These findings are consistent with Nir- Cohen et al. (2020).Additionally, we found a large cluster of activation in the brainstem and midbrain regions (see Fig. 8), covering areas such as the left and right red nucleus, periaqueductal gray, superior peduncle, and left subthalamic nucleus.

Gate Closing
Gate closing was associated with a few clusters of cortical activations, in line with Nir- Cohen et al. (2020).Our largest cluster covered much of the left PPC; another cluster occurred in the left dlPFC.In contrast to findings by Nir-Cohen et al., there was no activation in the right hemisphere except for relatively small activation clusters in the bilateral occipital fusiform gyrus.In addition, gate closing was associated with increased activity in the left inferior temporal gyrus and cerebellum (Fig. 5; Tab. 2).

Substitution
Substitution was associated with several large clusters of activation.There was increased activation in the premotor cortex and a substantial portion of the PFC, including BA8 (dlPFC), BA24, and BA32 (mPFC).There was increased activation in the pre-supplementary motor area (preSMA), the superior and inferior frontal gyrus, the left and right PPC, and the left inferior parietal lobule (IPL).The subcortex showed heightened activation in both the right and left striatal regions, including the caudate and putamen.A complete list of all active clusters is given in Table 2.The pattern of activation associated with substitution is broadly consistent with Nir- Cohen et al. (2020).However, we found a considerably greater extent of cortical activation, along with subcortical activation not observed by Nir-Cohen et al. (2020) (Fig. 6 and 8).

Updating Mode
The updating mode was associated with increased activation in frontal and posterior parietal regions.The frontal activation covered the left and right medial to dorsolateral prefrontal regions (from the preSMA, over the superior and middle frontal gyrus, to the inferior frontal gyrus).The posterior parietal cluster covered the IPL and stretched from the supramarginal gyrus, medially toward the precuneus cortex, and ventrally along the angular gyrus toward the medial temporal gyrus.There was increased activation in occipital regions, including the fusiform gyrus and the intra-and supracalcarine cortex.In the subcortex, clusters of increased activation were found in the left and right putamen, caudate nuclei, and the right thalamus.This pattern of activation is consistent with Nir- Cohen et al. (2020) but extends their findings to a broader range of cortical (Fig. 7; Tab. 2) and subcortical regions (Fig. 8).

ROI Analyses
We conducted three sets of ROI analyses, as described in 2.6.The main ROI analyses included ROI-wise GLMs based on the individually parcellated masks derived from MASSP (Bazin et al., 2020).In addition, we conducted an additional ROI analysis into subdivisions of the dopaminergic midbrain regions through the exploration of masks derived from a probabilistic atlas.Hence, the first ROI-wise GLMs ('ROI-wise GLMs on individually parcellated masks') emphasized individual delineations and thus provided high precision with regard to individual anatomy.While the post-hoc ROI-wise GLMs provided information on the possible differential involvement of subnuclei of the VTA and SN, however, at the cost of individual anatomical detail and will be referred to as 'ROI-wise GLMs based on VTA/SN subdivisions atlas' in the following.Results of the ROI analyses are illustrated in Figures 9 and 10, Tables 3 and 4.

ROI-wise GLMs on individually parcellated masks
ROI-wise GLMs analyses demonstrated that subcortical regions play a role in each experimental contrast.The main findings for each experimental contrast are summarized below and depicted in Figure 9 and Table 3.

Gate opening
During gate opening, subcortical activation was limited to the right thalamus.Bayesian analyses provided only weak evidence for activity in the right thalamus, contrasting the observation of extensive bilateral thalamus activation in the whole-brain contrast.Notably, we found moderate evidence against activity in any other ROI in the basal ganglia and midbrain, along with weak evidence against activity in the right thalamus, right GPe, and right GPi.

Gate closing
ROI-wise GLM results indicated moderate evidence for activity in the right GPe and weak evidence for activity in the right basal ganglia nuclei, namely right striatum and right STN, during gate closing.Moreover, bilateral midbrain activation of the SN was observed, indicated by weak evidence.

Substitution
During substitution, the results of ROI-wise GLMs confirmed the engagement of subcortical structures, although there were variations in their individual contributions.Moderate evidence of activity was observed in the striatum bilaterally, in the right STN, and the right VTA.Moreover, weak evidence indicated activity in the thalamus, left GPe, left GPi, and left STN.Overall, Bayesian analyses did not indicate strong evidence against activity in any subcortical ROI during the process of substitution

Updating mode
Being in an updating mode was associated with both basal ganglia and midbrain activity.Precisely, ROI-wise GLM results provided Bayes factors suggesting strong evidence for activity in the left GPe and right STN.Yet, only moderate evidence for the right GPe, left GPi, and right SN.Furthermore, weak evidence from the ROI GLMs indicated activity of the thalamus, striatum, left STN, and right VTA.Overall, there were no Bayes factors indicating strong evidence against activity related to the updating mode in any ROI.

ROI-wise GLMs based on VTA/SN subdivisions atlas
ROI-wise GLM analysis demonstrated that midbrain nuclei play a role in gate closing, substitution, and updating mode.The main findings for each experimental contrast are summarized below and depicted in Figure 10 and Table 4.

Gate opening
Confirming results from the preceding ROI-wise GLM analysis, ROI-wise GLMs using the probabilistic atlas masks implicated no evidence for activity in midbrain dopamine-producing nuclei in gate opening, supported by moderate evidence against activity in any mask from Pauli et al. (2018).

Gate closing
ROI-wise GLMs provided evidence for SNc activation during gate closing.There was strong evidence for activity in the right SNc and moderate evidence for activity in the left SNc.This high Bayes factor in favor of the right SNc suggests a dopaminergic involvement during gate closing.Furthermore, the results of the Bayesian analyses did not provide strong evidence against activity in any ROI during gate closing trials.

Substitution
In line with the results from the ROI analyses based on MASSP delineations (see above), results implicated activity in the right VTA.Specifically, the right PBP mask by Pauli et al. (2018) indicated activity with moderate evidence, suggesting an involvement of the dorsolateral nucleus of the VTA during substitution.In addition, BOLD signal change in the right SNc demonstrated a high Bayes factor.Both findings indicate dopaminergic involvement in the substitution process.

Updating mode
Midbrain masks by Pauli et al. (2018) demonstrated an extremely high Bayes factor for the right SNc-insinuating very strong evidence for increased activity-moderate evidence for activation in the left SNc, along with weak evidence for activity in the right PBP during updating mode.Again, there were no Bayes factors that indicated strong evidence against activity related to the updating mode in any ROI.
In summary, no subcortical engagement except for the thalamus was revealed in the gate opening contrast.However, ROI-GLMs indicated only weak evidence for the right thalamus during gate opening, rendering our results on gate-opening-related thalamic activity inconclusive.Notably, we observed moderate evidence against the involvement of most ROIs in the basal ganglia and midbrain during gate opening.Nevertheless, during gate closing, substitution, and updating mode, evidence did suggest activity in the basal ganglia and midbrain.It appears, however, that basal ganglia nuclei are differently engaged across gate closing, substitution, and updating mode contrasts.Reasonable evidence for striatal engagement was only found during working memory substitution trials.In the pallidum, gate closing-associated activity implicated the right GPe, and there was very strong evidence for the left GPe in updating mode.Additionally, results implicated the involvement of the right STN in updating mode and in substitution.
Also, the regions of the midbrain appear to be differently engaged across the three contrasts.The right SN was active in the updating mode while there was only weak evidence for increased SN activation during gate closing and none during substitution.However, using the probabilistic mask by Pauli et al. (2018), data indicated strong evidence for SNc activation during updating mode and strong evidence for SNc engagement during gate closing and substitution.Interestingly, evidence for engagement of the left SNc was substantially lower in all three contrasts.Furthermore, results implicated the right VTA mask by MASSP (Bazin et al., 2020) in substitution.Interestingly, considering the analyses using the probabilistic masks, this activation appears to be driven by the (right) PBP, a VTA component nucleus.

Discussion
The present study aimed to shed light on the neural substrates of working memory subprocesses, particularly focusing on the subcortex.As an extension of the work by Nir-Cohen et al. ( 2020), we employed the reference-back paradigm in conjunction with 7T fMRI, including a scanning and analysis protocol optimized for the subcortex, to precisely discern contributions from several subcortical structures to working memory updating subprocesses associated with gating, substitution, and being in an updating mode.In addition to investigating nuclei in the basal ganglia-thalamo-cortical loop, we hypothesized that midbrain nuclei containing dopaminergic neurons, the VTA, and SN might play a pivotal role in working memory updating subprocesses.
The whole-brain analysis not only revealed a substantially broader range of brain activation in both the cortex and subcortex but also provided more detail.Consistent with previous work (Nir -Cohen et al., 2020), we found that substitution and being in a general mode of updating show increased activation in all regions belonging to the frontoparietal network (FPN).Overall, the components of the FPN seem to be involved across all contrasts.However, our data suggests that each working memory subprocess differently engages the individual FPN components: posterior parietal regions play a greater role in gating, while substitution and updating primarily recruit the frontal regions and subcortex.This observation implies the involvement of diverse functional subregions in distinct working memory-updating subprocesses.Furthermore, our whole-brain analysis identified thalamic activation in the process of opening the gate to working memory, working memory substitution, and the updating modealigning with Nir-Cohen et al.'s findings.Nir-Cohen and colleagues found thalamic activity only in a region-of-interest (ROI) analysis and not their whole-brain analysis, demonstrating our enhanced signal and spatial resolution.Additionally, our whole-brain analysis revealed activation in distinct subcortical regions: the brainstem during gate opening, cerebellar regions during gate closing, and midbrain during gate opening, substitution, and updating mode.Notably, during substitution and updating mode, there was activation in striatal subregions-specifically the caudate nucleus and putamen-particularly extensive bilateral striatal activation during substitution.
We extend previous work by investigating individual masks of subcortical ROIs using the automated subcortical parcellation algorithm MASSP (Bazin et al., 2020), including regions associated with the dopaminergic midbrain-additionally supported by ROI analysis using masks from Pauli et al. (2018)-and consider each ROI bilaterally.Together, they revealed activation in the basal ganglia and midbrain during gate closing, substitution, and updating mode, albeit with differing degrees of engagement across these contrasts.Notably, no subcortical engagement was observed in gate opening, except for weak evidence for the right thalamus.This still might indicate a possible involvement of the thalamus during gate opening since the whole-brain analysis yielded large bilateral activation clusters in the thalamus.Furthermore, while the Bayesian analysis provided moderate evidence against any subcortical ROI in gate opening, the evidence against the right thalamus was nihil.All these factors render the results for thalamic involvement during gate opening inconclusive.However, it should be noted that activation limited to a thalamic subnucleus may explain the observed findings from the ROI analysis.Precisely, it is possible that an existing activation was negated as the BOLD signal change was averaged across all voxels included in a specific mask.Our data, therefore, did not definitively support nor contradict thalamic involvement in gate opening, underscoring the necessity for individually delineated masks for subnuclei in the thalamus in future studies.
Our findings challenge prominent neural theories of working memory gating by showing that, rather than controlling gating, the basal ganglia's role may be more specific to updating working memory representations with new information and an open-gate state.The reasons for this are twofold.First, there was an absence of evidence for striatal participation in gate opening.Second, we found midbrain and basal ganglia activation during substitution, suggesting a neural signature of dopaminergic activation of the basal ganglia-thalamo-cortical loop.These findings significantly advance our understanding of the cortical and subcortical neural basis of working memory subprocesses.

No support for striatal gate opening
The first aspect of the twofold revelation, indicating a role for the basal ganglia in working memory that differs from what was previously postulated, is the absence of evidence for striatal gate opening.We observed no increased activation in the striatum during gate opening, suggesting no active striatal involvement in opening the gate to working memory.In fact, ROIwise GLMs suggest evidence against the involvement of any basal ganglia nucleus in gate opening.This contrasts with Nir- Cohen et al. (2020) and challenges the PBWM model's (Frank et al., 2001;Hazy et al., 2007;O'Reilly & Frank, 2006) indirect striatal gating mechanism-in which a striatal dopaminergic signal is required to trigger gate opening-and other neurocomputational theories (Hazy et al., 2007;Jongkees, 2020).However, our finding is partially consistent with the dual-state theory's proposal that dopamine directly modulates PFC representations via mesocortical pathways (Durstewitz & Seamans, 2008) without involving the basal ganglia.Aligning with the dual-state theory (Durstewitz & Seamans, 2008), the whole-brain analysis did suggest the involvement of the dopaminergic midbrain in gate opening (Fig. 8), yet ROI-wise GLM as well as cluster-based ROI analyses could not further corroborate these findings.In fact, ROI-wise GLMs indicated moderate evidence against midbrain activity during the process of opening the gate.All analyses thus point to no basal ganglia involvement specific to gate opening to working memory.Instead, there is a possibility that mesocortical dopamine pathways could regulate gating.However, the evidence for the midbrain involvement is mixed, leaving the matter inconclusive and implying future work should follow up on the exact role of the dopaminergic midbrain in gate opening.
Another explanation for the lack of enhanced striatal activity in gate opening challenges the conceptualization of the gating mechanism in the reference-back task.The gate opening contrast includes reference trials that follow comparison trials regardless of the stimulus match condition.This means that actually gating new, relevant information into the PFC is only necessary on 50% of gate opening trials (i.e., on reference/switch/mismatch trials and not reference/switch/match trials; not to be confused with substitution contrast, which takes into account only repeated trials).Furthermore, in light of the PBWM's proposal that the basal ganglia sit in a gate-closed state by default, gate opening should take place on every reference trial.However, engaging in striatal gating every time a reference cue is encountered (i.e., on every reference trial) would be inefficient if the cue is not predictive of the subsequent updating, as is the case for the reference-back task.As a result, this conceptualization of gate opening would be expected to destabilize working memory representations, potentially reducing the accuracy of working memory-based decisions.In support of the idea that striatal gating in response to each reference cue would be an uneconomic brain process, we did not find lower accuracy in reference trials than in comparison trials.Since we find accuracy costs related to gate opening, our findings imply that the costs associated with gate opening might be driven by trial type/mode rather than gate-state switching.In light of the basal ganglia's default closed-gate state, these costs might represent attentional processes rather than working memory.
However, suppose a striatal gating process were to occur in response to every reference trial to facilitate working memory updating.In that case, there should also be striatal/basal ganglia activation in trials where the gate is opened independent of a switch in the gate state.This situation is partially represented in the updating mode, which contrasts repeated reference over repeated comparison trials.If the striatum is only involved when the gate switches from closed to open, no striatal activation should exist in an updating mode.
Intriguingly, we found that the updating mode was associated with strong evidence for basal ganglia engagement, along with strong evidence for the right SNc and right VTAsupported by the right PBP-activity.These findings suggest that repeated reference trials, hence repeated updating cues, engage the basal ganglia-which, in fact, is postulated for gate opening by the PBWM model-and it is not switching between trial types that cause this activation.Hence, there is a striatal mechanism that is engaged when updating is (repeatedly) required.However, this finding is inconsistent with Nir- Cohen et al. (2020), who found no evidence for any subcortical involvement in the updating mode contrast.In fact, the authors found activation contradictory to our results in both the updating mode and gate opening contrasts, rendering future work on the subcortical engagement in these working memory processes essential to understanding the exact neural mechanisms involved.Alternatively, it is also possible that the basal ganglia involvement in the updating mode reflects attentional processes (Cools et al., 2004;Leber et al., 2008;Schouwenburg et al., 2014) triggered by successive reference trials.
Taken together, our data aligns with the PBWM model, emphasizing the central role of the basal ganglia in working memory processes.However, we could not find any evidence for striatal involvement in the gate opening contrast, opposing crucial characteristics of the model and other neurocomputational theories (Frank et al., 2001;Hazy et al., 2007;Jongkees, 2020;O'Reilly & Frank, 2006).Our results suggest a more specific role for the basal ganglia in a readyto-update mode rather than controlling the gate to working memory.The contrasting patterns of activation observed in the basal ganglia, particularly the striatum, between the gate opening and updating mode in our dataset compared to the findings of Nir- Cohen et al. (2020) pose a significant challenge and emphasize the critical need for additional research in this area.

Dopaminergic involvement in working memory substitution
The second aspect of the twofold revelation, indicating a revised role for the basal ganglia in working memory, is the observation of basal ganglia and midbrain activation during substitution.Our findings suggest that the striatum-along with the rest of the basal ganglia and the thalamus-is more involved in the actual process of updating working memory representations ('substitution') than controlling the more general ready-to-update state ('updating mode').This observation is supported by the enhanced subcortical activity we found in substitution trials evident from ROI-wise GLMs and whole-brain analyses (as well as clusterbased ROI analysis, see appendix).Nir- Cohen et al. (2023) further affirm our observation through their modified version of the reference-back paradigm, which includes task switching, demonstrating that the gate opens only when truly necessary (i.e., when updating of the task that is held in working memory is required) and is associated with increased activity in the basal ganglia, thalamus, and midbrain.
Given the resemblance of the neural mechanisms between updating cortical working memory representations via the basal ganglia (Frank et al., 2001;Hazy et al., 2007;O'Reilly & Frank, 2006) and updating cortical value representations via a (dopaminergic) reward prediction error signal also via the basal ganglia (Montague et al., 1996;Schultz et al., 1997;Schultz, 2013), we expected both processes to share similar neural signatures.We found enhanced neural activation in the dopaminergic midbrain, basal ganglia, thalamus, and PFC when substituting old with new information in working memory.In greater detail, the whole-brain analysis showed large cortical and subcortical activation clusters.Further confirmation from ROI analyses indicated the involvement of the basal ganglia-thalamo-cortical loop in substituting information in working memory.Precisely, ROI-wise GLMs suggested evidence for increased activity in the striatum, right STN, and right VTA, along with activity in the right SNc and right PBP.Furthermore, the cluster-based ROI analysis revealed the involvement of the dlPFC and mPFC, thalamus, and basal ganglia, as well as VTA and SN, both bilaterally (see Tab. A5).
It is important to note the differences between the ROI-wise GLMs and the clusterbased ROI analysis.The cluster-based ROI analysis examines peak activation in a cluster-based approach, just like whole-brain GLMs, but limited to the ROI.Meanwhile, the ROI-wise GLMs consider the mean signal change over all voxels within a mask.Hence, it is possible that the observed signal change found for both left and right midbrain nuclei in cluster-based ROI analysis for substitution was canceled out in the ROI-wise GLMs, resulting in activation observed only in the right VTA, PBP, and SNc during substitution, but in none of the individually delineated SN mask from MASSP (Bazin et al., 2020), which also covers the region associated with the non-dopaminergic reticulata part of the SN (SNr).Nonetheless, all three analyses imply engagement of the basal ganglia-thalamo-cortical loop, along with dopaminergic neurons of the midbrain, and thus align with our hypothesis that the working memory updating subprocess concerning substituting working memory resembles the RPE, and thus the valueupdating network associated with reinforcement learning.To what extent these networks overlap precisely represents an exciting avenue for future research employing methods such as jointmodeling of functional and behavioral data across tasks (Palestro et al., 2018;Stevenson et al., 2024).
Finally, our results challenge the PBWM's prediction that a phasic dopamine signal controls the gating of information in working memory and instead hints toward phasic dopamine representing a signal to alter (in this framework, substitute) representations held in working memory.This suggests that phasic dopamine release signals updating cortical representations independent of the cognitive domain.
Taken together, our results provide convincing evidence for dopaminergic processing when substituting new information into working memory.Moreover, our results suggest shared mechanisms between working memory updating and value-based (reinforcement) learning, both of which update cortical representations in an adaptive manner.Additionally, these findings challenge the PBWM model's gating mechanism and suggest that phasic dopamine release, in fact, may signal the updating of cortical representations.

Subcortical involvement in gate closing
Based on the whole-brain and cluster-based ROI analyses, we found that the (left) PPC was the primary region involved in gate closing, without involvement from the basal ganglia.This finding is in line with Nir- Cohen et al. (2020) and the predictions of the PBWM model (Frank et al., 2001;Hazy et al., 2007;O'Reilly & Frank, 2006).However, it stands in contrast to Nir- Cohen et al. (2023), who found that the striatum was involved during distractor conditions that required active gate closing (by filtering out conflicting task cues).Intriguingly, our ROI-wise GLM analyses did reveal additional yet weak support for (right-lateralized) striatal activity during gate closing.Furthermore, our results provide weak evidence for lateralized right STN and SN activation and particularly strong evidence for the right GPe and right SNc activation during the process of gate closing.Given the findings across all analyses into the gate-closing process, we observed only little activation of the PFC.Still, large clusters of activation in the PPC, along with evidence against thalamic activations, it remains unlikely that the entire basal ganglia-thalamocortical loop is engaged during gate closing.Thus, the ROI-wise GLM-based evidence implicating parts of the right basal ganglia in gate closing may hint toward a different functional basal ganglia loop.This is supported by evidence against the activity of the basal ganglia output nucleus GPi and the thalamus in this contrast, suggesting no engagement of the two structures crucial for the signaling between the cortex and basal ganglia.
It is possible that the neural signature of gate closing represents the suppression of inappropriate actions (i.e., working memory updating).Interestingly, the dominance of the right hemisphere in the subcortex observed in the ROI GLMs contrasts with the predominantly left hemisphere findings from the whole-brain analysis.This difference in hemispheric dominance may suggest distinct lateralized functional roles in gate closing trials.On the one hand, inhibition tasks support a right-lateralized inhibitory network involving the striatum, GPe, and SN (Maizey et al., 2020).On the other hand, the authors observed minimal BOLD changes in the expected right inferior frontal gyrus associated with active ignorance of updating during the ignorance task.This observation is reminiscent of gate closing trials, possibly indicating an active suppression of updating working memory.In fact, a subset of GPe neurons -which exhibits the strongest evidence of activity in gate closing compared to any other subcortical ROI-projects to the striatum, where they powerfully suppress neurons of the direct and indirect pathway (Bevan, 2021;Glajch et al., 2016;Mallet et al., 2012Mallet et al., , 2016)).This may indicate an inhibition process within the basal ganglia in our data, ensuring no erroneous updating signal is transmitted through the basal ganglia-thalamo-cortical loop.
Hence, our findings suggest that the basal ganglia circuit is not engaged in an active gating process due to the lack of evidence involving essential parts of the basal ganglia-thalamocortical loop, but instead support preventing updating related signaling in the subcortex during trials requiring the protection of working memory content.As a result, findings imply the working memory gate is closed by default and that substituting working memory requires active signaling through the otherwise inhibited basal ganglia.Finally, no evidence for activity in the VTA was found in gate closing, proving no support for gate closing via the mesocortical pathway, as suggested by Nir-Cohen et al. (2020) and Durstewitz and Seamans (2008).However, ROI-wise GLMs indicate a potential role of the nigrostriatal pathway originating in the SNc during gate closing, which is a surprising finding.It might represent an erroneous, premature updating signal that engages the subsequent GPe-driven inhibition; however, this remains entirely speculative.
Taken together, our analyses provide evidence against a striatal gate-closing process yet implicate the involvement of other subcortical nuclei (right GPe and the right SNc) during gate closing, which might represent a process to inhibit false updating-related signaling from being transmitted to the cortex.

Functional subdivisions of the dopaminergic midbrain
Gate closing, substitution, and updating mode are differently associated with the activity of the individually parcellated midbrain ROIs VTA and SN.Yet, it remains unclear whether nigrostriatal, mesolimbic, or mesocortical cell populations were involved in the individual contrasts due to limitations in identifying the associated cell assemblies in the individual MRI scan.In the following, dopaminergic-midbrain-associated results across all ROI analyses are discussed in detail in order to potentially shed light on the functional spatial organization of the dopaminergic midbrain region.
For gate closing, in contrast to only very weak evidence from the individual masks by MASSP (Bazin et al., 2020), the use of the masks by Pauli et al. (2018) revealed bilateral involvement of the SNc in gate closing-particularly very strong evidence for the right SNcimplicating a role for the dopamine neurons of the SN in closing the gate to working memory.The right SNc was also engaged during substitution, in addition to the right PBP, as suggested by ROI-wise GLMs.Interestingly, in contrast to these ROI-wise GLM results for substitution, the cluster-based ROI analysis for substitution using the probabilistic VTA and SN masks from MASSP (Bazin et al., 2020) showed broad bilateral activation for both, while analysis using Pauli et al.'s (2018) masks implicated only the SNc, bilaterally (Tab.A2).Nonetheless, all three ROI analyses support the idea of dopaminergic modulation of working memory substitution.Overall, the cluster-based ROI analysis revealed activated clusters in the midbrain limited to the substitution contrast, with no activated midbrain clusters in updating mode.However, ROI-wise GLMs using the probabilistic atlas from Pauli et al. (2018) revealed strong evidence for the right SNc in updating mode, accompanied by evidence for the left SNc and right PBP.These findings suggest a pronounced involvement of a dopaminergic mechanism in this particular working memory updating mode.On the contrary, no evidence was observed for the VTA nucleus mask by Pauli et al. (2018) in any working memory subprocess, which may suggest that this mask does not cover neural populations in the VTA related to working memory updating.
These findings suggest initial evidence for nigrostriatal involvement in gate closing and dopaminergic influence in the control of substituting new information into working memory and a ready-to-update mode.However, the ROI-wise GLMs did not support the bilateral engagement of the SNc in substitution indicated by the cluster-based ROI results and instead additionally implicated the right PBP nucleus.On the one hand, this highlights the differences in the two ROI analysis approaches, using unsmoothed and smoothed data, respectively, as discussed earlier.On the other hand, it challenges the regional specificity suggested if only cluster-based ROI results were considered: Given the directly adjacent location of the two nuclei, it is unclear whether exclusively nigrostriatal neurons are involved during working memory substitution.This ambiguity highlights the necessity of delineating PBP and SNc masks, preferably based on the A10 and A9 cell groups, respectively, in the individual.This is crucial for accurately distinguishing between contributions from the mesolimbic and nigrostriatal pathways (Haber, 2003;Haber & Knutson, 2010), especially on an individual basis.
Nevertheless, the strong evidence from various ROI analyses for SNc activity in gate closing, substitution, and updating mode clearly points towards the involvement of the nigrostriatal pathway and, thus, the basal ganglia-thalamo-cortical loop associated with motor function.Along these lines, there lies great potential in investigating the individual striatal subdivision to shed light on the contributions of individual dopaminergic pathways in working memory updating processes.The results from both whole-brain GLM and cluster-based ROI analyses indicate that the putamen and caudate are active during substitution and updating mode (please refer to the appendix, Tab.A1).However, it remains unclear how helpful individual parcellations-based ROI-wise GLMs would be in answering this question if they become available.After all, findings implicate the dorsal striatum, which comprises both the dorsal regions of the putamen and the caudate, which is classically associated with the nigrostriatal pathway (Haber, 2003;Haber & Knutson, 2010).
Furthermore, the findings associated with working memory substitution suggest meaningful differences between the VTA masks from the two ROI sources.Despite the equal amount of evidence for the VTA and PBP in substitution and updating mode, distinct functional differences associated with the underlying neural populations have become apparent.Specifically, the MASSP VTA mapping includes a large number of voxels that appear to be involved in working memory substitution, while Pauli et al.'s (2018) VTA nucleus mask did not elicit activation.Apparently, it does not functionally overlap with the MASSP VTA voxels involved in working memory substitution.The latter is further supported by the finding implicating the right PBP in substitution, reinforcing that the substitution-related activity is located in the dorsolateral VTA.The dorsolateral region of the VTA contains the PBP nucleus of the VTA (Trutti et al., 2019).This again underscores the demand for enhanced delineation of the VTA, particularly in distinguishing the dorsolateral VTA functionally from the ventromedial site.
To summarize, follow-up analyses using different masks associated with dopaminergic cell populations of the midbrain revealed interesting insights.There was strong evidence for the role of the SNc in the process of substitution, supported by both ROI analysis approaches that complement each other's limitations.Also, during gate closing and updating mode, the SNc showed increased activation, suggesting nigrostriatal pathway engagement.Additionally, there was evidence of the right PBP nucleus activity for both substitution and updating mode.At the same time, no contrast revealed evidence for working memory updating-related activity in the VTA nucleus.These observations suggest that the evidence for activity limited to the right VTA using the MASSP masks (Bazin et al., 2020) was driven by activity in a region of the midbrain that is associated with both neighboring nuclei, PBP and SNc.The intricacy in the functional engagement by midbrain nuclei emphasizes the necessity for individually parcellated masks, in particular of the VTA, in future studies that provide more precision both with respect to functional and anatomical subdivisions.This becomes especially apparent when considering the likelihood of neighboring nuclei influencing the observed activity (de Hollander et al., 2017).
Consequently, the fewer trials compared to other work (Nir-Cohen et al., 2020, 2023) may have lowered the fMRI signal-to-noise ratio, potentially resulting in decreased efficacy in detecting neural markers associated with gate opening in the basal ganglia.Nevertheless, compared to other work, our protocol did unveil a substantially broader range of cortical and subcortical activation during other working memory processes, including the involvement of the right thalamus during gate opening, which is a critical part of the basal ganglia-thalamo-cortical loop.Consequently, it remains unclear if this accounts for the present findings.Future work employing ultra-high field fMRI using a higher trial-number design may shed light on more signals in the subcortical mechanisms involved in the process of gate opening.
Finally, the ROI-wise GLM analysis does not consider striatal subdivisions as these are not parcellated by MASSP.In light of the relatively large volume of the striatum as a whole, the association of the dorsal and ventral striatum with the nigrostriatal and mesolimbic pathways, respectively, in combination with the different functional profiles of the striatal subdivision (Haber, 2003), it would be intriguing to explore the contribution of individual striatal subdivision in working memory updating processes in the future, as discussed in the previous paragraph.

Conclusions
Our finding of the absence of evidence for striatal activity during the process of gate opening suggests the striatum is not crucially involved in gating new information into working memory.This finding challenges the prominent PBWM model's striatal working memory gating mechanism and previous empirical results (Nir-Cohen et al., 2020, 2023).Instead of striatal activity due to trial-type switching according to the PBWM model paired with the reference-back paradigm, our data revealed basal ganglia engagement associated with ready-to-update working memory mode ('updating mode'), suggesting a more specific role for the basal ganglia in a readyto-update mode rather than controlling the gate to working memory.Moreover, in line with previous work (Nir -Cohen et al., 2020), our findings implicate that the basal ganglia-thalamocortical loop is not engaged in gate closing, as evidence for basal ganglia output is lacking.Instead, our ROI analyses indicate the suppression of updating-related signaling in the subcortex during gate closing.Furthermore, our data indicates the involvement of dopamine-producing midbrain nuclei during the process of working memory gate closing and during a general readyto-update mode.
In addition, we found evidence suggesting that substituting new information into working memory is driven by dopaminergic pathways from the midbrain to the basal ganglia and involving the basal ganglia-thalamo-cortical loop.This finding is consistent with neural signatures of value updating triggered by dopaminergic signals via reward prediction errors in reinforcement learning, potentially pointing to a common network underlying the updating of cortical representations.However, our data does not provide enough detail to identify a specific dopaminergic pathway, such as the mesolimbic pathway.
This study furthers understanding of the neural mechanisms underlying working memory updating subprocesses in the human subcortex, providing additional insights into the role of the dopaminergic midbrain.Table 1.Hierarchical descriptive statistics for the behavioral data.Note.For all tests, the alternative hypothesis specifies that the population mean differs from 0. BF10 1-3 indicates weak evidence, BF10 3-10 moderate evidence and BF10 >10 strong evidence.

Figure 5 .
Figure 5. Statistical parametric map of the 'gate closing' contrast with a threshold determined using the FWER method (p<.05; corresponding to a threshold of z = 2.3).

Figure 6 .
Figure 6.Statistical parametric map of the 'substitution' contrast with a threshold determined using the FWER method (p<.05; corresponding to a threshold of z = 2.3).

Figure 7 .
Figure 7. Statistical parametric map of the 'updating mode' contrast with a threshold determined using the FWER method (p<.05; corresponding to a threshold of z = 2.3).

Figure 10 .
Figure 10.Results of the region-of-interest wise GLMs using the probabilistic atlas from Pauli et al. (2018).Error bars represent the 95% credible intervals.Abbreviations indicate ventral tegmental area nucleus (VTA), parabrachial pigmented nucleus (PBP), and substantia nigra pars compacta (SNc).* indicate moderate evidence and ** indicate strong evidence that the observed BOLD signal change is larger than 0.

Table 2 .
List of peak activation in MNI coordinates from the whole-brain analysis.

Table 3 .
Results from the Bayesian one-sample t-test on the beta values derived from the ROIwise GLMs are reported for each contrast and ROI.Moderate or higher evidence for ROI activity is indicated by bold font.Bayes factors favoring the alternative hypothesis are reported under BF 10 , and Bayes factors favoring the null hypothesis in column BF 01 .