Abstract
Working memory involves a series of functions: encoding a stimulus, maintaining or manipulating its representation over a delay, and finally making a behavioral response. While working memory engages dorsolateral prefrontal cortex (dlPFC), few studies have investigated whether these subfunctions are localized to different cortical depths in this region, and none have done so in humans. Here, we use high-resolution functional MRI to interrogate the layer specificity of neural activity during different epochs of a working memory task in dlPFC. We detect activity timecourses that follow the hypothesized patterns: superficial layers are preferentially active during the delay period, while deeper layers are preferentially active during the response. Results demonstrate that layer-specific fMRI can be used in higher-order brain regions to non-invasively map cognitive information processing along cortical circuitry in humans.
Introduction
Working memory, or the mental capacity to briefly store and manipulate information before acting on it, has been linked to the dorsolateral prefrontal cortex (dlPFC) in both humans and non-human primates (Courtney et al., 1998, 1997, D’Esposito et al., 1995, Goldman-Rakic, 1995). Like much of the cerebral cortex, dlPFC gray matter is organized into layers with distinct cytoarchitecture, connectivity and function. Early electrophysiological work in non-human primates suggested that different epochs of working memory tasks are preferentially associated with activity in different cortical layers (Sawaguchi et al., 1989, 1990). Specifically, it is thought that delay period activity is driven by recurrently connected networks of pyramidal cells in layer III (Goldman-Rakic, 1995), while response-related activity takes place predominantly in layer V (Arnsten et al., 2012, Opris et al., 2011, Wang et al., 2004). Yet direct evidence for this dissociation is limited, due in part to the challenge of separating activity recorded from distinct cortical layers. A recent study in macaques overcame this by using single probes capable of simultaneous recordings from multiple cortical depths (Bastos etal., 2018)
However, to date there is no empirical evidence for such a dissociation in humans, largely because conventional neuroimaging techniques lack the sensitivity and specificity to resolve cortical layers. Recent methodological advances in fMRI, including higher field strengths (i.e., 7 Tesla and above) combined with innovations in pulse sequences and contrast mechanisms, now allow for non-invasive, reliable measurements of cortical depth-dependent activity in humans. These advances have enabled layer-specific imaging in several primary cortices, including visual (Kok et al., 2016, Muckli et al., 2015, Polimeni et al., 2010), auditory (De Martino et al., 2015), and motor (Huber et al., 2017). But it is not yet clear if these techniques are sensitive and robust enough to be applied outside confirmatory studies of primary cortices.
Here, we develop and extend layer fMRI methods to distinguish depth-dependent activity in a region of human association cortex during a cognitive task. Specifically, we use simultaneously acquired blood oxygen level-dependent (BOLD) and cerebral blood volume (CBV) images of dlPFC during working memory. We show that during the delay period, activity is localized to superficial layers, and especially when the task calls for manipulating (as opposed to merely maintaining) information, while during the response period, activity is specifically localized to deeper layers. These results demonstrate the promise of high-resolution fMRI for mapping cognitive cortical circuitry in humans at the mesoscale.
Results
Task paradigm
To test our hypotheses about layerdependent activity in dlPFC, we adapted a well-validated verbal working memory paradigm (D’Esposito et al., 1999). The task contained two types of contrasts (Fig. 1a). In the first contrast (Fig. 1a, top), participants see a string of five random letters (e.g., ‘PXEDL’), then a cue instructing them to either rearrange the letters in alphabetical order (‘ALPHABETIZE’, manipulation condition) or to simply remember them in their original order (‘REMEMBER’, maintenance condition) over the course of a delay period, during which they see only a fixation cross. Finally, a probe letter comes onscreen (e.g, ‘L?’), and participants make a response to indicate the alphabetical or ordinal position of the probed letter. The second contrast (Fig. 1a, bottom) is identical to the first until the response period, at which point participants see either a true probe requiring a button press (e.g., ‘L?’, action condition), or a dummy probe (i.e., ‘∗?’, non-action condition), which indicates that no response is required and they can forget the information associated with that trial. All trials are thus matched for sensory input, with the only difference being the nature of the mental activity during the delay for the first contrast, or the presence or absence of action selection and execution during the response period for the second contrast.
Thus the task paradigm followed a 2×2×2 design, with trial type (manipulation/maintenance versus action/non-action), period (delay versus response), and cortical depth (superficial versus deep) as the three factors. We hypothesized a triple dissociation between trial type, period, and cortical depth, such that: (1) superficial layers would respond more strongly during the delay period of manipulation trials (as compared to the delay period of maintenance trials), and (2) deeper layers would respond more strongly during the response period of action trials (as compared to the response period of nonaction trials). See Fig. 1b for a schematic of the hypothesis. The strength of this experimental design is that we control for each layer’s timecourse of activity primarily by observing the same layer in a different condition, rather than directly comparing activity levels across layers; this helps avoid measurement biases associated with different cortical depths.
Data acquisition
Nine participants were scanned in a combined total of 13 functional imaging sessions. An additional 14 imaging sessions were conducted for piloting purposes and to collect anatomical data from the same participants (see Methods).
During each high-resolution functional run, we simultaneously measured changes in cerebral blood volume (CBV) and blood-oxygen-level dependent (BOLD) signal using the SS-SI-vascular space occupancy (VASO) method (Lu et al., 2003) with a 3D-EPI readout (Poser et al., 2010) on a 7 Tesla scanner. This method has been implemented to successfully demonstrate layer-specific activity in human motor cortex with good sensitivity and specificity (Huber et al., 2017). The conventional BOLD signal has poor spatial specificity at high resolutions, since it tends to be dominated by large veins at the pial surface. VASO, while it has a lower contrast-to-noise ratio, is a more quantitative measurement that is less biased toward superficial depths. In short, BOLD is more sensitive, while VASO is more specific. Jointly interpreting both contrasts allows for stronger inferences about the magnitude and timing of layer-specific activity than using either on its own (Huber et al.,2018, 2014).
We used two different acquisition protocols over the course of the study. The first had a nominal voxel resolution of 0.9 × 0.9 × 1.1mm (“axial readout protocol”), and was used to quantitatively compare activity timecourses from two distinct cortical depths (superficial versus deep) across participants. Later, we introduced a second, higher resolution acquisition with nominal voxel resolution of 0.76 × 0.76 × 0.99mm (“sagittal readout protocol”) to better visualize activity across different layers in individual participants.
Location of region of interest
Prefrontal cortex is large, and quite variable across individuals in its structure and functional anatomy. Unlike other cortical landmarks, such as the ‘hand knob’ of the primary motor cortex, functional subdivisions of dlPFC are difficult to pinpoint in individual participants using macroscale anatomical features. Therefore, regions of interest (ROIs) were selected for each participant on the basis of an online functional localizer conducted just prior to the experimental task runs (see Methods).
To better specify our macroscale position within dlPFC, we estimated and visualized the average ROI location across participants (Fig. 1c). Overlap across participants was generally high. The activity of all participants fell within a sphere of 9.5 mm radius (origin at RL 44.2 mm, AP −8.1 mm, IS 47.5 mm), and the peak overlap was centered on a region of the left middle frontal gyrus/superior frontal sulcus corresponding approximately to area 8a, rostral to the frontal eye field (Walker, 1940). In both humans and non-human primates, area 8a is distinguished from areas 8/8b by a more pronounced concentration of large pyramidal cells in layer IIIb, making its cytoarchitecture comparable to neighboring region 9/46d (Petrides and Pandya, 1999), another region classically implicated in WM tasks. It is precisely these pyramidal cells in deep layer III whose recurrent local excitation supports delay-related WM processes (Goldman-Rakic, 1995), making this region a logical place to test our hypotheses.
For each participant, two layers, superficial and deep, were drawn manually within the selected ROI (see Fig. S1 for layer masks for all participants scanned using the axial readout protocol). To ensure that this ROI drawing approach was robust, we conducted test-retest scans separated by several weeks on two participants. Results showed good overlap between ROIs derived from independent experimental sessions (Fig. S4, S5), indicating that the functional region in question can be reliably localized within participants, and the observed layer activity profiles are consistent across sessions. To better specify the position of our “superficial” and “deep” layers with respect to cortical laminae defined cytoarchitectonically, we normalized all available MRI-based anatomical contrasts to an existing histological image (Fig. S3). The boundary between our superficial and deeper layers fell approximately between layer III and layer IV.
Task performance
Participants performed well on the task (mean accuracy = 77.9 percent, s.d. = 14.4 percent, range = 58.8 – 95.0 percent; note that chance is approximately 20 percent) and there was no difference in accuracy on manipulation versus maintenance trials (paired t-test, t(14) = −0.55, p = 0.59). It is therefore unlikely that differences in difficulty between conditions confound the results.
Activity timecourses
Using data from six participants scanned during eight experimental sessions with the axial readout protocol, we observed layer-dependent activity time-courses that followed the hypothesized patterns: in superficial layers, activity was higher in manipulation relative to maintenance trials during the delay period, and in deeper layers, activity was higher in action versus non-action trials during the response period (Fig. 2). These patterns were visible in both contrasts (BOLD, Fig. 2a; and VASO, Fig. 2b). Below we summarize characteristics of these depth-dependent timecourses during the two main periods of interest, delay and response.
Delay-related activity
In superficial layers, delay-period activity was uniformly high during manipulation trials. This is evident in both BOLD and VASO contrasts, in trials labeled ‘alpha’, ‘action’ and ‘non-action’ (Fig. 2a and 2b, top row; recall that both action and non-action trials call for manipulation, and they are indistinguishable from one another until the probe appears). While it appears from BOLD data as though activity in superficial layers is slightly above baseline even during maintenance trials (Fig. 2a, top left), VASO data indicate little to no response during maintenance trials (apart from a brief initial uptick that may reflect a stimulus-driven sensory encoding signal; Fig. 2b, top left). This agrees with previous reports that human dlPFC is not strictly necessary when the task calls for maintenance only: repetitive transcranial magnetic stimulation (rTMS) to dlPFC selectively impairs manipulation but not maintenance (Postle et al., 2006), and lesions to dlPFC have no effect on maintenance (Mackey et al., 2016).
Compared to superficial layers, deeper layers are markedly less active during the delay. BOLD data appear to indicate some delay-related activity in deeper layers, again more so for manipulation than maintenance trials (Fig. 2a, bottom row). However, VASO data, which have higher spatial specificity, suggest little to no role for deeper layers during this period (Fig. 2b, bottom row). Thus, it seems that delay-related activity occurs predominantly (if not exclusively) in the superficial layers.
Response-related activity
During the response period, we observe the opposite pattern: activity in deeper layers is relatively high, but only in trials requiring an action during the response period. Deeper-layer activity peaks at the time of the motor response, which is approximately 6 seconds after the probe comes onscreen (reflecting hemodynamic delay). As expected, this peak is present in action but not non-action trials, and can be seen in both the BOLD (Fig. 2a, bottom right) and VASO (Fig. 2b, bottom right) contrasts.
As for superficial layers, it appears from BOLD data as though their activity remains high through the response period in action trials, while falling off in non-action trials (Fig. 2a, top right). However, this may simply be due to the superficial bias of BOLD, since VASO data (Fig. 2b, top right) indicate that superficial-layer activity is, if anything, suppressed at the response peak in both trial types. This confirms our prediction that response period is preferentially associated with activity in deeper cortical layers.
Quantification of differential activity
To directly compare activity between trial types, trial periods and cortical depth, we subtracted the average timecourse within each layer between the conditions of interest (i.e., for superficial layers, manipulations¬–maintenance; for deeper layers, action-non-action). Results from both BOLD and VASO confirm that for superficial layers, the difference between manipulation and maintenance peaks during the delay period (Fig. 3a, top and Fig 3b, top), while for deeper layers, the difference between action and non-action trials peaks at the time of the response (Fig. 3a, bottom and Fig. 3b, bottom).
The more quantitative nature of CBV, and the relative lack of depth-dependent biases, permit a direct comparison of VASO-derived activity levels across layers and trial periods. For each cortical depth, we calculated the average differential activity for manipulation over maintenance from measurements acquired during the delay period (timepoints 4, 5 and 6, corresponding to 12, 16 and 20 sec in trial time), and the average differential activity for action over non-action from measurements acquired during the response period (timepoints 7, 8 and 9, corresponding to 24, 28 and 32 sec in trial time). Comparing levels of differential activity (Fig. 3c), it is again evident that superficial layers are more sensitive to the delay-period contrast than the response contrast, while deeper layers are almost exclusively sensitive to the response contrast.
Visualization of depth-dependent activity
To better visualize the depth-dependent distribution of signal associated with different periods within the trial, we used a second, higher-resolution imaging protocol in which the field of view was a sagittal slab centered on dlPFC with in-plane resolution of 0.76 × 0.76mm. In these experiments, the task consisted exclusively of manipulation/maintenance trials, all requiring an active response (i.e., the first contrast type shown in Fig. 1a, top). Functional signals during manipulation and maintenance trials were investigated across cortical depths.
Layer-dependent activity could be detected in all individual participants imaged using this protocol (n = 5; Fig. 4). Manipulation evoked more activity than maintenance predominantly in superficial layers (green stripes), while signal associated with response (as compared to baseline; red stripes) was predominantly localized to deeper layers. These patterns were visible in both the BOLD (Fig. 4A) and VASO (Fig. 4B) contrasts. Layer ROIs for each participant are shown in Fig. 4C.
Discussion
By developing and optimizing state-of-the-art techniques in high-resolution fMRI for cognitive brain areas, we have achieved layer-specific imaging of cortical activity during a working memory task in human dlPFC. We used a three-factor design for which we had clear hypotheses about the location, magnitude and timing of neural activity, and detected timecourses at different cortical depths that followed the expected patterns. Namely, we observed delay-related manipulation activity that was predominantly localized to superficial layers, and response-related activity that was predominantly localized to deeper layers. While working memory has been known to engage dlPFC for decades, the degree to which its subfunctions were layer-specific had been hypothesized but not consistently shown, with few demonstrations even in nonhuman primates (though see Bastos et al. (2018) for recent evidence). Our data interrogate layer-specific functionality directly and non-invasively in humans.
The observed laminar specificity of distinct working memory operations can be interpreted in light of what is known about underlying neural circuitry. First, superficial activity during the delay period likely reflects recurrent excitatory connections. While in early parts of the cortical hierarchy, superficial layers give rise to feedforward connections, at the highest levels (i.e., PFC), laminar projections become more complex. Layer III expands and is the focus of extensive local, recurrent excitatory connections, as well as long-range recurrent connections, e.g., with parietal association cortex (Arnsten et al., 2012, Medalla and Barbas, 2006), which is also heavily involved in working memory. Recurrent excitation among these cells is a feature of their unique molecular profile, notably their preferential expression of n-methyl-d-aspartate (NMDA) receptors and specifically the NR2B subunit, whose slower kinetics allow for persistent firing over long delays; this has been predicted by computational models (Wang, 1999) and confirmed experimentally in primates (Wang et al., 2013).
Second, response-period activity in deeper layers likely reflects functions related to motor control, such as initiating a motor action, suppressing prepotent responses, or a feedback mechanism such as corollary discharge. dlPFC does not project directly to primary motor cortex; rather, it influences motor behavior polysynaptically via higher-order motor areas (Arikuni et al., 1988, Takada et al., 2004). Thus response-related activity in deeper layers may reflect output-circuit activity involving one or multiple such areas. Layer V cells also have dense projections to striatum, which likely also serve to guide movements (Arikuni and Kubota, 1986, Yeterian and Pandya, 1994).
Of note, schizophrenia is associated with altered genetics (reviewed in Arnsten et al. (2012), morphology (Garey et al., 1998, Glantz and Lewis, 2000) and function (Perlstein et al., 2001) in this very dlPFC circuitry. It is hypothesized that decreased delay-related activity in superficial layers, as well as disinhibition in deeper layers, may underlie the deficits in working memory and other cognitive functions seen in these patients. We expect that future studies using layer fMRI in populations with or at risk for schizophrenia will shed new light on the spatiotemporal dynamics of cognitive dysfunction in this illness.
From a methodological perspective, here we used advanced contrast mechanisms and balanced task design to offset differences in vascular physiology across cortical depths, which can introduce substantial biases and limit the interpretability of layer fMRI (Kay et al., 2018). In contrast to gradient-echo BOLD (GE-BOLD), CBV-weighted fMRI signal acquired with VASO allows appropriate separation of microvascular responses at a layer-dependent level (Goense et al., 2012, Kim and Kim, 2010). We avoid biases of different hemodynamic response functions (HRFs) across cortical depths (Petridou and Siero, 2017, Yacoub et al., 2006) by refraining from using statistical general linear model (GLM) deconvolution with predefined HRFs, and by restricting our interpreting to quantitative signal differences that are obtained at the same latency within identical task blocks. Additionally, we collected conventional GE-BOLD fMRI concomitantly with VASO data. The near-simultaneous acquisition of BOLD and VASO data allowed us to obtain a clean BOLD-corrected, CBV-weighed VASO signal. The higher sensitivity of BOLD compared to VASO was helpful in selecting the correct ROI, while the higher spatial specificity of VASO was helpful for interpreting signal across cortical depths.
This work has important implications for non-invasive, in vivo mapping of input-output and feedforward-feedback connections in the human neocortex. Outstanding methodological challenges include expanding spatial coverage without sacrificing resolution. Simultaneous imaging of dlPFC, premotor and primary motor cortices would allow for detecting information flow during response generation and execution. Expanding coverage to parietal and sensory areas as well as neighboring prefrontal areas would allow for characterizing interactions that support stimulus perception, information storage and manipulation during the encoding and delay periods. Given current trade-offs between field of view and voxel size, it is still difficult to resolve individual cytoarchi-tectonic layers (hence we are limited here to drawing conclusions about “superficial” versus “deeper” layers (Fig. S3), rather than the six distinct canonical laminae). But we expect that the ever-advancing tools of high-field fMRI data acquisition and analysis will ultimately transform our understanding of cognition in the awake, behaving human brain.
Funding
The research was funded by the National Institute of Mental Health Intramural Research Program (ZIAMH002783).
Authors’ contributions
E.S.F. conceptualized the study. E.S.F., L.H., and D.C.J. designed the experimental paradigm. L.H. designed and optimized the data acquisition and analysis methodology. L.H. and E.S.F. collected the data and performed the analyses. E.S.F., L.H., and D.C.J. generated data visualizations. P.A.B. supervised study design and interpretation. E.S.F. wrote the original draft, with contributions from L.H. and revisions from D.C.J. and P.A.B.
Competing interests
No authors have competing interests to declare.
Data and materials availability
Data and code will be made available online at publication <link to be inserted here>.
This preprint is formatted based on a LATEXclass by Ricardo Henriques.
Acknowledgements
We thank Amy Arnsten for guidance on experimental design and interpretation. We thank Benedikt Poser and Dimo Ivanov for the 3D-EPI readout that is used in the VASO sequence used here. We thank Andrew Harry Hall and Kenny Chung for administrative support of human volunteer scanning. We thank Sriranga Kashyap for helpful tips on adjusting manual initial registration used to generate Fig. 1c and Fig. S4-5. Portions of this study used the high-performance computational capabilities of the Biowulf Linux cluster at the National Institutes of Health, Bethesda, MD (biowulf.nih.gov).