Abstract
Left hemisphere damage in adulthood often leads to linguistic deficits, but many cases of early damage leave linguistic processing preserved, and a functional language system can develop in the right hemisphere. To explain this early apparent equipotentiality of the two hemispheres for language, some have proposed that the language system is bilateral during early development and only becomes left-lateralized with age. We examined language lateralization using functional magnetic resonance imaging with two large pediatric cohorts (total n=273 children ages 4-16; n=107 adults). Strong, adult-level left-hemispheric lateralization (in activation volume and response magnitude) was evident by age 4. Thus, although the right hemisphere can take over language function in some cases of early brain damage, and although some features of the language system do show protracted development (magnitude of language response and strength of inter-regional correlations in the language network), the left-hemisphere bias for language is robustly present by 4 years of age. These results call for alternative accounts of early equipotentiality of the two hemispheres for language.
Significance Statement Language is the most canonical function that shows a strong hemispheric asymmetry in adult brains. However, whether the language system is already lateralized to the left hemisphere early in development has long been debated, given that early left-hemisphere damage often leaves language processing unimpaired. We examined the developmental trajectory of language lateralization in two large-scale pediatric datasets using robust individual-subject fMRI approaches. We found that the language system exhibits adult-like left-hemispheric lateralization by age 4, although other aspects of the neural infrastructure for language show a clear change between age 4 and late childhood. These findings challengethe claim that the language system is bilateral during early development and call for alternative accounts of early hemispheric equipotentiality for language.
Introduction
In a truly incredible feat, approximately six months after they are born, human babies begin to recognize words for common objects1, and toward their first birthday, they utter their first words2. Over the following couple of years, their vocabularies explode and they learn to combine words into phrases and sentences. By about 4 years of age, typically developing children can understand and express complex ideas through language3,4. But is the brain infrastructure for language the same in these young but already quite competent language users as in adults?
In adult brains, language processing, including both comprehension and production, draws on a specialized network of frontal and temporal areas5–7. In the vast majority of individuals, this network is lateralized to the left hemisphere, which manifests as a) stronger and more spatially extensive responses to language in the left hemisphere (see8 for data from >800 individuals), and b) a greater likelihood of linguistic deficits (aphasia) following left hemisphere damage in adulthood9–11. One important controversy about the development of language-processing mechanisms concerns the degree of left-hemispheric lateralization of the language network in children.
According to one influential proposal, the language system starts out as bilateral and only becomes left-lateralized with age12. This proposal is motivated by the apparent equipotentiality of the two hemispheres for language early in life: damage to the left hemisphere in childhood can leave linguistic functions preserved, with the right hemisphere homotopic areas taking over13–16 (cf.17,18). This proposal also makes predictions about language development in typical brains, which can be evaluated using functional brain imaging. In particular, the fronto-temporal networks in the two hemispheres should be similar in their responses to language during early development. Indeed, some neuroimaging studies that have examined responses to language from about age 4 years onwards have reported more bilateral responses at younger ages and more strongly left-lateralized responses with age19–26. However, this empirical claim is controversial: other studies have reported already left-lateralized responses to language in children27–29, by four years of age30.
The complexity of the empirical landscape may have to do with i) the predominant reliance of past fMRI studies on the traditional group-averaging approach, which suffers from low sensitivity, low functional resolution, and low interpretability31–33, and ii) the diversity of experimental paradigms, which makes comparisons across studies challenging. Some paradigms further conflate language processing with lower-level speech perception or with general task demands, both of which draw on bilateral systems even in adults (speech perception34; task demands35). For example, many language experiments involve not only comprehension of language but also meta-linguistic judgments or comprehension questions, which are known to recruit the domain-general Multiple Demand network36. If these meta-linguistic tasks are more difficult for children compared to adults, they may recruit the Multiple Demand network to a greater degree, which would lead to the appearance of more bilateral responses in children (we return to this issue in the Discussion).
Here, we characterize the development of the language network across two independent pediatric cohorts (Dataset 1: 206 children, aged 4-14, and 91 adults; Dataset 2: 67 children, aged 4-16, and 16 adults). We use a robust individual-subject fMRI approach (‘precision fMRI’31,33,37,38) and an extensively validated language ‘localizer’ paradigm8,31,39, which robustly isolates the language areas from both lower-level speech areas and domain-general areas sensitive to task demands 40,41. Although our main focus is on lateralization, we also examine other properties of the language network, for which the developmental trajectory remains also debated in the developmental neuroscience literature. These include the language network’s topography (e.g., how early does the frontal component of the language network emerge?22,24,42,43), its selectivity for language, and the degree of functional connectivity among the different language regions22,44.
To foreshadow the results, across both datasets, we found that left-hemisphere lateralization based on (1) response magnitude and (2) volume of activation was already adult-like by four years of age (see SI-10 for generalization to a third, previously published dataset25). In contrast, other aspects of the brain organization for language show a clear developmental trajectory: both the magnitude of response to language and the strength of inter-regional functional correlations increase from early to middle to late childhood, at which point they reach adult levels. These findings suggest that the capacity of the right-hemisphere frontal and temporal areas to support language processing in cases of early left-hemisphere damage is not due to the language system being fully or more bilateral in young children. In addition, these results establish the normative developmental trajectory for key individual-level neural markers of language processing, thus laying the groundwork for future investigations of linguistic processing at earlier ages and in developmental populations with language disorders.
Results
All participants performed an extensively validated language localizer task31 based on a contrast between listening to short age-appropriate stories/passages (the critical condition) and a perceptually similar condition where linguistic content is not comprehensible (the control condition) (Figure 1; see Methods-Section2 and SI-1 for details). This localizer has been shown to engage language areas, and not to engage other nearby functional areas, such as the lower-level speech perception areas34,45 and areas of the domain-general Multiple Demand network46,47. All statistical analyses were performed on the neural measures of language processing extracted from individual activation maps from the language localizer (see Methods-Section7), including three measures of language lateralization: inter-hemispheric difference in the magnitude of neural (BOLD signal) response, in the volume of activation, and, for completeness, in the strength of inter-regional functional correlations during naturalistic cognition (resting state).
1. By age 4, the left-hemisphere language network shows adult-like functional topography
Because of past claims that the frontal component of the language network exhibits protracted development43,49, we first examined the general topography of the language responses across our different age groups, to decide which component(s) to focus on for the critical questions about language lateralization. As shown in Table 1 and Figure 2A, reliable responses to language processing—in both temporal and frontal components—are robustly present in children, including in the youngest pediatric group examined here (4-6 year-olds).
In particular, consistent with what has been reported previously in adults (e.g., see39 for the results for the adult group in Dataset 1), participants in each pediatric group showed a reliably stronger response during the Language condition compared to the Control condition (Figure 2B). This effect held across the LH language network as a whole in both datasets and every age group (all ps<0.001; Table 1). The effect also held for the temporal and frontal components of the network separately, again in both datasets and every age group (all ps<0.001, except the early group in Dataset 2 (n=6): ps<0.01; Table 1). (For the parallel analyses of the homotopic RH language network, see SI-2.)
These results suggest that we can examine both the frontal and temporal components of the language network for our critical question of language lateralization across childhood, instead of restricting our analyses to the temporal component.
2. The language network exhibits an adult-like left-hemispheric bias by age 4
In adults, the left-hemispheric (LH) bias for language manifests as a stronger (higher magnitude) and more spatially extensive (greater volume of significant voxels) response to language in the LH language areas. We examined both of these measures for each age group across development.
The effect of hemisphere on the magnitude of the Language>Control effect held in both datasets and all age groups (ps<0.05; Table 2A, Figure 2B), except the middle childhood group in Dataset 1 and the early childhood group in Dataset 2. The effect of hemisphere on the volume of activation for the Language>Control contrast also held in both datasets and in all age groups (ps<0.05; Table 2B, Figure 2C), except the early childhood group in Dataset 2. The lack of significant effects in the early childhood group in Dataset 2 is likely due to a small sample size of n=6 (note though that the mean inter-hemispheric differences are similar in size to other groups; e.g., in the middle childhood group, there are, on average, 32% more significant Language>Control voxels in the LH than the RH, and in the early childhood group, there are 37% more voxels in the LH than the RH). For the groups that showed lateralized responses at the network level, the effects also held for the temporal and frontal components of the network separately, for both the magnitude of the Language>Control effect and the volume of activation effect (all ps<0.001; SI-3).
In addition to exhibiting left-lateralized responses during language tasks, adults also manifest a left-hemispheric bias in the strength of inter-regional correlations within the language network during naturalistic cognition39,46,47,50. In Dataset 1, which had naturalistic (resting state) data, the LH regions were more robustly inter-correlated than the RH regions in all groups (ps<0.01; Table 2C, Figure 2D), except the early childhood group (see SI-4 for evidence that participants in all pediatric groups showed reliable inter-regional correlations for both local, within-lobe pairs and distant, frontal-to-temporal, pairs: all ps<0.001). So, the degree of lateralization as measured by inter-regional functional correlations is the only measure that shows a potential developmental change, with a hemispheric bias not emerging until middle childhood (but see next section and Discussion).
To test for age-related changes in lateralization, we used measures that control for overall group differences in the magnitude of effects, which vary with age (Figure 2B-D; Section 3). A standard way to control for these differences is to use laterality index (LI) measures, which are calculated by dividing the difference (in some effect of interest) between the left and right hemispheres by the sum of the effects in the left and right hemispheres21,51,52. As shown in Figure 3 and Table 3, we find no significant association between age and degree of lateralization in any of the three measures (magnitude and volume of activation for the Language>Control contrast or strength of inter-regional correlations for resting state data) when treating age as a continuous variable. Similarly, in pairwise group comparisons between children (in different age groups) and adults, we did not find evidence of significant differences in any lateralization measures, across datasets. The few significant group differences that we observed among the childhood groups did not follow the predicted age trajectory (e.g., the middle childhood group in Dataset 1 showed slightly lower lateralization compared to both the early childhood group and the late childhood group).
3. Response magnitude and inter-regional correlations increase between ages 4 and 16 and reach maturity by late childhood
In addition to lateralization, we examined developmental changes in overall response magnitude and strength of inter-regional correlations. These properties of the language network showed gradual development across childhood (Figure 4), as evident in both the analyses that treated age as a continuous variable and the analyses that treated age group as a categorical variable.
The analyses where age was treated as a continuous variable revealed a positive effect of age on the magnitude of the Language>Control contrast in Dataset 1 (b = 0.13, SE = 0.02, t = 5.66, p < 0.001; Figure 4A); this effect was also positive though not significant in Dataset 2 (b = 0.065, SE = 0.034, t = 1.91, p = 0.06; Figure 4). In line with these results, in Dataset 1, the Language>Control effect was larger in the late childhood and adult groups compared to the early and middle childhood groups (ps<0.001; Table 4A); the early and middle childhood groups did not differ significantly, and the late childhood group also did not differ from adults. In Dataset 2, the qualitative pattern was overall similar—with responses increasing from early to middle to late childhood and not showing a further increase between late childhood and adulthood—but none of the pairwise comparisons reached significance (Table 2A). This pattern of increasing response magnitude was also similar in the RH language network, across the two datasets (SI-2).
Similarly, for inter-regional correlations, the analysis where age was treated as a continuous variable revealed a positive effect of age (b = 0.02, SE = 0.00, t = 8.20, p < 0.001; Figure 4B, Table 4B). In line with this result, the correlations were stronger in the late childhood and adult groups compared to the early and middle childhood groups (ps<0.001; Table 4B; see SI-5 for evidence that these effects were not driven by differences related to the amount of head motion53); the early and middle childhood groups did not differ significantly, and the late childhood group did not differ from adults.
Discussion
We investigated development of lateralization and overall organization of the fronto-temporal language network using two relatively large independent pediatric fMRI datasets and extensively validated language ‘localizer’ tasks31. Using robust individual-subject analyses, we found strong evidence for the presence of an adult-like left-hemisphere (LH) bias in the magnitude of response and volume of activation during language processing even in our youngest pediatric cohorts (4-6 year-olds), although we also found evidence that other features of the language network show a clear developmental change. In the remainder of the Discussion, we position these findings in the context of prior literature and discuss their broader implications.
Language processing is strongly lateralized by 4 years of age and does not show a change between early childhood and adulthood
One important claim in the literature has been that the language network starts out as a bilateral system and only becomes lateralized to the left hemisphere with age12. This hypothesis was put forward to explain a difference in the presence of language deficits following damage to the LH in mature brains vs. child brains. In particular, LH damage in adulthood typically results in aphasia54,55. In stark contrast, damage to the LH in children and even some adolescents can leave language largely preserved (see15,56 for reviews). In such cases, a typical-like language system in the homotopic right-hemisphere areas appears to support language processing13–16,57–59. These findings have led to arguments that RH frontal and temporal areas support language processing alongside the LH areas in developing brains, and until a certain point in the developmental trajectory can take over language function60.
Indeed, developing brains show remarkable plasticity and can reorganize in response to injury or atypical experience. For example, in individuals born blind, the occipital cortex responds to auditory stimuli61, olfaction62, and even language63. But this plasticity, evident under atypical development conditions, need not imply the lack of functional biases in typical brain development, which is the focus of the current study.
The hypothesis whereby the language system starts out bilateral during early development makes a clear prediction that the LH and RH language areas should respond similarly to language in children, in contrast to adults, where the LH areas show stronger and more extensive responses during language processing compared to the RH areas. Several studies have indeed purported to show more bilateral responses to language in children and increasing LH-lateralization with age (e.g.,25; see20 for a review). However, we do not replicate this developmental pattern. Instead, across two large-scale datasets, we find that a) the language network is strongly lateralized already by age 4 years (complementing some earlier studies; e.g.,27,28,30), and b) the degree of language lateralization does not change across development. In other words, even in young children, language areas in the left hemisphere respond more strongly to language and are larger in size, compared to their RH homotopic areas. This evidence rules out the possibility that the language system is supported by a fully or more bilateral network in early childhood.
Why have some prior studies found more bilateral responses to language in childhood? Perhaps the most important contributor to these discrepancies is the use of paradigms that conflate linguistic and general task demands. In particular, many language paradigms that require overt responses from children, especially those used commonly in the clinical literature (e.g., the verb generation task19,64), tax both language processing and general cognitive demands associated with performing the task. Task-related cognitive demands recruit the domain-general Multiple Demand (MD) network35,65—a bilateral network of frontal and parietal areas whose left frontal component is adjacent to, but distinct from, the language areas41,47,66,67; (see36 for evidence that language paradigms accompanied by task demands recruit both the language and the MD network in adults). Given that most tasks are more difficult for younger children, they may recruit the MD network to a greater extent. Because the MD network is bilateral, its greater recruitment will manifest as more bilateral responses at younger ages. In other words, what looks like an increase in the degree of LH lateralization of the language network may instead reflect age-related reduction in the reliance on the (bilateral) MD network, and increased reliance on the (lateralized) language network.
We believe this issue affects an influential study by Olulade and colleagues25, who used a paradigm that included both linguistic demands and task demands (judging whether a sentence was true in the critical, forward-speech condition and deciding whether a tone was present at the end of the stimulus in the control, backward-speech condition), with the task in the critical condition being more difficult. When Olulade et al.’s data are preprocessed and modeled through a pipeline that helps restrict the analyses to the language areas (cf. examining activity within large anatomical masks that encompass both language and MD cortex) and when more standard lateralization measures are used—the results are similar to what we observe in the current study: little to no developmental change in language lateralization in children 4 years and older (SI-10). To the extent that a slight change in the LI scores is present in the Olulade et al.’s data (due to an increase in LH activity with age rather than a decrease in RH activity, as argued by Olulade et al.), this pattern does not replicate in either of our two datasets.
Future work on the development of the language network should take care to separate the language network from the domain-general Multiple Demand network, and other networks known to be functionally distinct from the language network, including in children30. One way to do this is to adopt paradigms that do not include task demands and focus on passive comprehension (e.g., contrasting passive comprehension of language with perceptually matched control conditions31,48). If a task is included, care should be taken to a) ensure that the task in the control condition is at least as difficult as in the critical condition, b) separate the task temporally from the language-processing component and model it separately (as in Dataset 2 in the current study), and/or c) use spatial priors for language-responsive areas based on large-scale datasets from validated paradigms (e.g.,8).
Reconciling early left-hemispheric language lateralization with early equipotentiality of the two hemispheres for language
If language processing is already lateralized to the left hemisphere by age 4, how do we square this with evidence from early LH brain damage, which at least in some cases leaves language processing unimpaired? Existing evidence unequivocally shows that the two hemispheres are equipotential for language early in life (e.g.,16,58). Evidently, however, this equipotentiality does not manifest as similar responses in the two hemispheres to language. One possibility is that brain imaging studies, including ours, are measuring language responses too late in the developmental trajectory: maybe language processing is bilateral between age ∼6 months, when linguistic abilities start to emerge1 and some early age: say, 1-3 years. Some evidence exists of left-hemispheric lateralization prior to age 4, but the youngest children in those cohorts (2-3 year-olds) may still be too old. Alternatively, early equipotentiality of the two hemispheres for language may be mediated by a different mechanism.
Language processing robustly recruits RH homotopic areas across the lifespan8,68 see SI-2 for evidence from the current study), although with consistently lower and less spatially extensive responses in the RH compared to the LH language areas. One reason why these RH areas can take on language function early but not later in life may be that, with age, the RH areas become specialized for storing and processing certain kinds of non-linguistic information (e.g., social information69,70) whereas the LH areas remain language-selective40.
Another possibility has to do with age-related changes in the patterns of inter-regional functional correlations. In our data, we found no hemispheric bias in the strength of inter-regional correlations during naturalistic cognition in the youngest group: similarly strong inter-regional integration in the left-and right-hemisphere language networks may at least partially underlie their equipotentiality for language early in life.
Finally, as noted above, early equipotentiality of the two hemispheres for language— accompanied by bilateral responses to language—may only be present under conditions of atypical brain development (e.g., in brains affected by early strokes, epilepsy or other neurological conditions, and in brains of individuals with different perceptual and/or motor experiences, as in the case of congenital blindness). Examination of brains in typically developing and neurologically healthy children who suffer a physical brain injury to the left hemisphere would help evaluate this possibility.
The magnitude of response to language and the strength of inter-regional correlations increase across development and reach maturity by late childhood
Although linguistic abilities show a high degree of sophistication by age ∼4 years (e.g.,3), they continue to develop well into the late teens71. In line with this protracted developmental trajectory, two aspects of the brain organization for language processing showed a developmental change. First, the strength of the response during language comprehension relative to a control condition increased from early, to middle, to late childhood, with no difference between the late-childhood and adult groups. This pattern was highly consistent between the two datasets, despite differences in the populations tested and details of the experimental materials and paradigm (Figure 1). Second, the strength of inter-regional correlations during naturalistic cognition (resting state) showed an increase across development (including after controlling for head motion; SI-5), also reaching adult-like levels by late childhood.
Developmental changes in both the magnitude of response to language and in the strength of inter-regional correlations have been reported in past studies (e.g.,22,24,42,72–76). However, to the best of our knowledge, none of those studies have relied on a validated language localizer paradigm, which differentiates language areas from nearby areas of functionally distinct networks (e.g.,46,47,67), and on activation measures that have been established to be reliable within individuals (e.g.,77). Differentiating language areas from other functional areas is critical given the differences in the developmental trajectories of different cognitive abilities. For example, in contrast to linguistic abilities, which are already quite advanced by age 4-5 years, executive functions show protracted development reaching maturity in the late teens to early 20s (e.g.,78). Separating language and Multiple Demand areas in the current study allowed us to unambiguously attribute the neural changes we observe to the maturation of linguistic abilities and their underlying substrates (see also30).
What does the developmental increase in response magnitude to language reflect? One possibility is that it reflects relatively experience-independent biological maturation of the underlying neural circuits (e.g.,79,80). Another, non-mutually exclusive possibility is that this increase reflects experience-related changes in linguistic ability. Our knowledge of language, which encompasses knowledge of words, constructions, and rules for combining those in new ways81,82, continues to expand across childhood and into adulthood. With this expansion come improvements in our ability to extract meaning from linguistic inputs. One way to distinguish between the biological vs. language-experiential accounts is to examine changes in response to language in adults as they acquire a new language. Such individuals have fully mature neural circuits but show a gradual behavioral change as they learn a language. Malik-Moraleda, Jouravlev et al.83 reported an investigation of polyglots—individuals with some proficiency in five or more languages—where they found that neural responses to language scale with proficiency levels. Their finding provides indirect support for the idea that stronger responses in the language network reflect better linguistic ability gained through language-specific experiences, although they do not rule out a contribution of biological factors. Relating age-related neural changes to linguistic ability directly is an important avenue for future work. It is also worth noting that the magnitude of response and the strength of inter-regional correlations are only moderately correlated (r = 0.32; SI-9), which may suggest that age-related changes in these two properties are associated with distinct biological and/or cognitive changes.
Limitations
The current study is limited in several ways. First, we adopted a cross-sectional approach. Tracking changes in the neural infrastructure of language processing longitudinally may offer greater sensitivity in detecting age- and experience-related changes. Second, we did not attempt to match the age groups on any variables that may affect neural responses (e.g., motion, performance on language measures). This concern is somewhat ameliorated by the inclusion of two datasets and the observation of similar patterns across them; nevertheless, more carefully matched samples should be examined in future work. Third, all the pediatric data come from English, which is not representative of the world’s languages (e.g.,84), leaving generalization to other languages an important future direction. Finally, we have here focused on the language network—brain areas that support high-level language comprehension and production. Complementary investigations of age-related changes in lower-level speech perception areas34,45 and speech articulation areas85,86, which support the earlier-emerging abilities that scaffold language development, are another important avenue of future work.
Open questions about the neural basis of language development
Several important questions remain about the development of the language network. First, when and how do the language brain areas emerge? The youngest children in the current dataset are ∼4 years old. This is also true of most past studies of neural language development (e.g.,20,21,87; cf.30,88). By age 4, children can understand and express complex ideas through language. What the field sorely lacks are neural data from infants and toddlers (age range: 6 months to ∼3 years old). A number of studies have examined responses to speech sounds in sleeping infants during the first few days/weeks of life (e.g.,89,90). However, until age ∼6 months1,91, infants do not reliably derive meaning from linguistic inputs—a core computation that the language network in adults putatively supports. Understanding how early responses to speech sounds may give rise to the language circuits a few months later remains unknown, as probing neural responses to meaningful language during the first years of life using spatially precise brain-imaging methods like fMRI poses substantial technical obstacles92,93.
Second, how does specialization for language processing develop and how does it relate to individual differences in language function? In adults, the brain areas that support linguistic comprehension and production are strongly specialized for language relative to diverse non-linguistic inputs and tasks, from general executive functions5,94, to math and logic67,95–97, to computer code processing98,99, to music5,100, to observing others’ actions, facial expressions, and gestures101–103, to social cognition70. How does this specialization emerge? Do brain areas that selectively support language in adults perform different or additional functions earlier in life?
The fact that the language system is closely juxtaposed with other large-scale brain networks41,47,79 may suggest that at earlier points in development (and perhaps evolution) the language system is not segregated from some of these nearby areas. Hiersche et al.30 suggest that by age ∼3 years, the language system is robustly segregated from the domain-general Multiple Demand network, which has been linked to fluid reasoning and general intelligence. However, testing other non-linguistic functions relative to language processing, and at earlier ages, will be critical.
Third, we, as a field, still lack an understanding of what is special about the left hemisphere: i.e., why in most typical individuals does language processing end up being localized to the left? Aso, why does language processing appear to be more bilateral in many individuals with developmental brain disorders104,105 (see106 for a meta-analysis) and in some other populations with atypical developmental experiences, like congenitally blind individuals107? Hypotheses about the left-hemisphere bias for language are plentiful (e.g.,108), but getting a clear answer has proven challenging. Understanding whether language processing is already lateralized in infants and/or toddlers may place some constraints on this hypothesis space.
In conclusion, we have shown that the human language system is strongly lateralized to the left hemisphere even in young children, which challenges the hypothesis that the language system is bilateral in childhood12,60. Although the level of response to language and the inter-connectedness of the language network continues to increase into late childhood, adult-like lateralization to the left hemisphere is already present by age 4-5 years.
Materials and Methods
1. Participants
For all child participants, a parent or guardian provided written informed consent, and the child provided assent; adult participants provided written informed consent, in accordance with the Committee on the Use of Humans as Experimental Subjects at the Massachusetts Institute of Technology.
Inclusion criteria
Across datasets, child and adult participants were included in the analysis based on the following criteria: 1) being a native speaker of the language used in the experiment (English for all groups except for the adult group in Dataset 1; diverse languages for the adult group in Dataset 1, as described in39), 2) having no diagnosis of neurological disorder, and 3) having normal or corrected-to-normal vision. For Dataset 1, it was additionally ensured that child participants were born full-term (>37 weeks), had no history of brain injury and hospitalizations, and were not using psychotropic medications.
Dataset 1
After excluding 28 child participants from group-level analyses due to excessive motion (Methods-Section5; for summary of motion outliers by group, see https://osf.io/3mvpx/), the final Dataset 1 included 206 children (ages 4-14; average = 10.69, st. dev. = 3.18; 106 females; 178 right-handed) and 91 adults (ages 19-45, average = 27.40, st. dev. = 5.5; 48 females; 84 right-handed). The pediatric data came from three projects that each focused on a different age range and did not concern the development of the language network (for details, see https://osf.io/3mvpx/?view_only=803ba79e635d4e0da1698e0a6367d7e6): early childhood (n=41; ages 4-6, average = 5.78, st. dev. = 0.69), middle childhood (n=57; ages 7-10, average = 9.00, st. dev. = 0.74), and late childhood (n=108; ages 12-14, average = 13.45, st. dev. = 0.69).
Dataset 2
After excluding 31 child participants and 0 adult participants from group-level analyses due to excessive motion (Methods-Section5; for summary of motion outliers by group, see https://osf.io/3mvpx/), the final Dataset 2 included 67 children (ages 4-16; average = 11.5, st. dev. = 1.4; 29 females) and 16 adults (ages 18-60, average = 43.8, st. dev. = 12.1; 7 females). To enable parallel analyses across the two datasets, the pediatric data in this dataset were divided into three age groups, to approximately match the division in Dataset 1: early childhood (n=6; ages 4-6, average = 5.4, st. dev. = 0.46), middle childhood (n=38; ages 7-9, average = 7.8, st. dev. = 0.65), and late childhood (n=23; ages 10-16 average = 11.5, st. dev. = 1.38).
A subset of the children and adults in Dataset 2 wore a total light-exclusion blindfold during the fMRI scan to reduce visual cortex responses to visual input during the scan, for a different study63 (children: n=14 [early = 2; middle = 7; late = 5], ages 4-16, average =8.85 , st. dev. = 2.23; adults: n=16, ages 18-59, average = 43.8, st. dev. =12.08). Blindfolded and non-blindfolded children are treated as a single group for the current study, which focuses on responses in higher-cognitive networks.
Subsets of Datasets 1 and 2 were included in several published studies39,107,109–111), whose goals differed from the current study and did not concern the development of the language network.
2. Language localizer task
As shown in Figure 1, four variants of the language localizer were used, with the control condition varying between speech played in reverse, acoustically degraded speech, and speech in an unfamiliar foreign language (see SI-1 for details). Each participant completed 6-12 blocks per condition across 1-4 scanning runs (for the early childhood group in Dataset 1, where a single run was used, the run was manually split into two equal-sized runs to enable across-runs cross-validation, as needed for the analyses; see Critical individual-level neural measures). Participants were excluded from the language localizer task analysis if more than 40% of the acquired volumes (across both runs) were identified as outlier volumes during preprocessing (as detailed below).
In addition to the language localizer task, most participants in Dataset 1 performed a resting state scan (described below) and most participants performed additional tasks that were unrelated to the current study.
3. Resting state scan
The majority of participants in Dataset 1 performed a resting state scan. The participants in the early childhood group were asked to keep their eyes open and to think about anything that came to mind. A fixation cross was shown on the screen, but the children were not instructed to keep their eyes on it. The scan lasted 5 min 15 s. The participants in the middle and late childhood groups were asked to keep their eyes on the fixation cross and to let their mind wander. The scan lasted 6 min and 22 s for the Middle Childhood group and 5 min and 46 s for the Late Childhood group. For the Adult group, following66, participants were asked to close their eyes but to stay awake and to let their mind wander; the projector was turned off, and the lights were dimmed. The scan lasted 5 min.
Participants were excluded from the resting state analysis if more than 40% of the acquired resting state volumes were identified as outlier volumes during preprocessing (as detailed below). Of the 206 and 91 adults that were included in the language localizer analyses, 189 children (ages 4 – 14, average = 10.93, st. dev. = 3.15, 97 females) and 90 adults (ages 19-45, average = 27.38, st. dev. = 5.54, 47 females) were included in the resting state analyses (n=6 early and n=14 middle participants were excluded because of excessive motion). The pediatric participants included: early childhood (n=35; ages 4-6, average = 5.78, st. dev. = 0.69), middle childhood (n=47; ages 7-10, average = 9.04, st. dev. = 0.78), and late childhood (n=107; ages 12-14, average = 13.44, st. dev. = 0.68).
4. fMRI data acquisition
All data were collected on a 3-Tesla Siemens Prisma scanner at the Athinoula A. Martinos Imaging Center at the McGovern Institute for Brain Research at MIT.
Dataset 1
Early Childhood
Whole-head, high-resolution T1-weighted multi-echo MPRAGE structural images were acquired in 176 sagittal slices (TR = 2,530 ms, TE = 1.64 ms/3.5 ms/5.36 ms/7.22 ms, TI = 1,400 ms, flip angle = 7°, resolution = 1.00 mm isotropic). Whole-brain functional blood oxygenation level dependent (BOLD) data were acquired using an echoplanar imaging (EPI) T2*-weighted sequence in 41 transverse slices (2 mm thick) in an interleaved order (TR = 2,500 ms, TE = 30 ms, flip angle = 90°, bandwidth = 2,298 Hz/Px, echo spacing = 0.5 ms, FoV = 192 mm, phase encoding A > P direction, in-plane resolution = 3 mm × 3 mm). Resting state scans were acquired using an EPI T2*-weighted sequence in 41 sagittal slices (3 mm thick) in an interleaved order with a 10% distance factor (TR = 520 ms, TE 1 = 3.41 ms, TE 2 = 5.97 ms, flip angle = 55°, bandwidth = 1,502 Hz/Px, FoV = 192 mm, phase encoding A > P direction, in-plane resolution = 3 mm x 3 mm). Prospective acquisition correction (PACE) was used to adjust the position of the gradients based on the participant’s motion one TR back.
Middle Childhood
Whole-head, high-resolution T1-weighted multi-echo MPRAGE structural images were acquired in 176 sagittal slices (TR = 2,530 ms, TE = 1.69 ms/3.55 ms/5.41 ms/7.27 ms, TI = 1,400 ms, flip angle = 7°, resolution = 1.00 mm isotropic). Whole-brain functional BOLD data were acquired using an EPI T2* weighted sequence in 32 near-axial slices (4 mm thick) in an interleaved order with a 10% distance factor (TR = 2,000 ms, TE = 30 ms, flip angle = 90°, bandwidth = 1,860 Hz/Px, echo spacing = 0.63 mm, FoV = 200 mm, phase encoding A > P direction, in-plane resolution = 2.1 mm x 2.1 mm). Resting state scans were acquired using an EPI T2*-weighted sequence in 40 sagittal slices (3 mm thick) in an interleaved order with a 10% distance factor (TR = 2,500ms, TE = 30 ms, flip angle = 90°, bandwidth = 2,380 Hz/Px, echo spacing = 0.49 mm, FoV = 210 mm, phase encoding A > P direction, in-plane resolution = 3 mm x 3 mm). PACE was used to adjust the position of the gradients based on the participant’s motion one TR back112.
Late Childhood
Whole-head, high-resolution T1-weighted multi-echo MPRAGE structural images were collected in 320 sagittal slices (TR = 4,000 ms, TE = 1.06 ms, flip angle = 2°, resolution = 2.00 mm isotropic). Whole-brain functional BOLD data were acquired using an EPI T2*-weighted sequence in 72 near-axial slices (4 mm thick) in an interleaved order with a 10% distance factor and using GRAPPA with an acceleration factor of 2 (TR = 1,000 ms, TE = 37.2 ms, flip angle = 63°, bandwidth = 2,290 Hz/Px, echo spacing = 0.58 mm, FoV = 208 mm, phase encoding A > P direction, matrix size = 96 × 96, in-plane resolution = 2 mm x 2 mm). Resting state scans were acquired using an EPI T2*-weighted sequence in 72 axial slices (2 mm thick) in an interleaved order with a 0% distance factor (TR = 800 ms, TE = 37 ms, flip angle = 52°, bandwidth = 2,290 Hz/Px, echo spacing = 0.58 mm, FoV = 208 mm, phase encoding A > P direction, in-plane resolution = 2 mm x 2 mm).
Adults
Whole-head, high-resolution T1-weighted multi-echo MPRAGE structural images were collected in 179 sagittal slices (TR = 2,530 ms, TE = 3.48 ms, resolution = 1 mm isotropic). Whole-brain functional BOLD data were acquired using an EPI T2*-weighted sequence in 31 near-axial slices (4 mm thick) in an interleaved order with a 10% distance factor and using GRAPPA with an acceleration factor of 2 (TR = 2,000 ms, TE = 30 ms, flip angle = 90°, bandwidth = 1,578 Hz/px, echo spacing = 0.72 ms, FoV = 200 mm, phase encoding A > P direction, matrix size = 96 x 96, in-plane resolution = 2.1 mm x 2.1 mm).
Dataset 2
Whole-head, high-resolution T1-weighted structural images were collected using one of two custom 32-channel head coils made for children113, a 12-channel coil, or the standard Siemens 32-channel head coil in 176 interleaved sagittal slices (TR = 2,530 ms, TE= 1.64/3.5 ms/5.36 ms/7.22 ms, resolution = 1 mm isotropic). Whole-brain functional BOLD data were collected in 32 near-axial slices (3 mm thick) in an interleaved order with a 20% distance factor (TR = 2,000 ms, TE = 30 ms, flip angle = 90°, bandwidth = 2298 Hz/Px, custom pediatric coils: FoV = 192 mm; standard 32-channel head coil: FOV = 256 mm; 12-channel head coil: FOV = unavailable, phase encoding A>P direction, matrix size = 64×64, in-plane resolution = 3 mm × 3 mm). PACE was used to adjust the position of the gradients based on the participant’s motion one TR back112.
5. fMRI data preprocessing and first-level modeling
fMRI data were preprocessed and analyzed using SPM12 (release 7487), CONN EvLab module (release 19b) and custom MATLAB scripts. Each participant’s functional and structural data were converted from DICOM to NIFTI format. All functional scans were co-registered and resampled using B-spline interpolation to the first scan of the first session. Potential outlier scans were identified from the resulting subject-motion estimates as well as from BOLD signal indicators using default thresholds in the CONN pre-processing pipeline (5 st. dev. or more above the mean in global BOLD signal change or framewise displacement values above 0.9 mm). Functional and structural data were independently normalized into a common space (the Montreal Neurological Institute (MNI) template, IXI549Space) using the SPM12 unified segmentation and normalization procedure with a reference functional image computed as the mean functional image after realignment across all time points, omitting outlier scans. The output data were resampled to a common bounding box between MNI-space coordinates (−90, −126, and −72) and (90, 90, and 108), using 2 mm isotropic voxels and fourth-order spline interpolation for the functional data and 1 mm isotropic voxels and tri-linear interpolation for the structural data. Lastly, the functional data were smoothed spatially using spatial convolution with a 4 mm full-width half-maximum (FWHM) Gaussian kernel.
For the critical and control conditions of the language localizer task, effects were estimated using a general linear model (GLM) in which each experimental condition was modeled with a boxcar function convolved with the canonical hemodynamic response function (HRF) (fixation was modeled implicitly). Temporal autocorrelations in the BOLD signal timeseries were accounted for by a combination of high-pass filtering with a 128 s cutoff and whitening using an AR (0.2) model (first-order autoregressive model linearized around the coefficient a = 0.2) to approximate the observed covariance of the functional data in the context of restricted maximum likelihood (ReML) estimation. In addition to main condition effects, other model parameters in the GLM design included first-order temporal derivatives for each condition (for modeling spatial variability in the HRF delays) as well as nuisance regressors to control for the effect on the BOLD signal of slow linear drifts, subject-motion parameters, and outlier scans.
The resting state data were pre-processed using the CONN toolbox with default parameters unless stated otherwise. First, to remove noise resulting from signal fluctuations originating from non-neuronal sources (for example, cardiac or respiratory activity), the first five BOLD signal time points extracted from the white matter and cerebrospinal fluid (CSF) were regressed out of each voxel’s time course. White matter and CSF voxels were identified based on segmentation of the anatomical image114. Second, the residual signal was band-pass filtered at 0.008–0.09 Hz to preserve only low-frequency signal fluctuations115.
6. Functional ROI (fROI) definition
For each participant, functional regions of interest (fROIs) were defined using the Group-constrained Subject-Specific (GcSS) approach31. For the language network in the left hemisphere (LH), we used five parcels derived from a group-level representation of the language localizer data in 220 adult participants (independent of the adult sample in the current study) and used in much past work (e.g.,8,50,70,98,102,105,116,117, inter alia). These parcels include three regions in the left frontal cortex (two in the inferior frontal gyrus (LIFG and LIFGorb) and one in the middle frontal gyrus (LMFG)) and two regions in the left temporal cortex (LAntTemp and LPostTemp). Individual fROIs were defined by selecting—within each parcel—the 10% of most localizer-responsive voxels based on the t-values for the Language>Control contrast (see8 for evidence that fROIs defined in this way are similar to fROIs based on a fixed statistical significance threshold). We additionally defined a set of language-responsive areas in the right hemisphere (RH). Following past work (e.g.,39), we projected the LH parcels onto the right hemisphere and selected the 10% of most localizer-responsive voxels, as in the LH. (We chose to use parcels derived from adults in order to be able to directly compare critical neural measures between children and adults, but see SI-6 for evidence that parcels derived from the pediatric data directly are similar.)
7. Critical individual-level neural measures of language processing
Statistical analyses were performed on a set of individual-level neural measures of language processing, including i) the magnitude of neural response, ii) the volume of activation, and iii) the strength of inter-regional functional correlations during naturalistic cognition (resting state).
Response magnitude
We extracted the responses (in percent BOLD signal change) from each individually defined language fROI (averaging the responses across the voxels in each fROI) to each condition (Language (Intact/Forward) and Control (Degraded/Backward/Foreign language)) relative to the fixation baseline. To ensure independence between the data used to define the fROIs and to estimate their response magnitudes, we used an across-runs cross-validation procedure (e.g.,32). Response magnitude was averaged across run splits, resulting in one value per particpant for statistical analyses.
Volume of activation
Following past work (e.g.,39,77,105), we extracted the total number of significant voxels for each fROI above the uncorrected p<0.001 for DS1 and p<0.01 for DS2 threshold for Language>Control contrast. At these thresholds, most participants showed suprathreshold voxels.
Inter-region functional correlation
For the participants with resting state data, we extracted the BOLD signal timeseries from each individually defined language fROI (averaging the responses across the voxels in each fROI) during the resting state scan. We then computed Pearson’s moment correlation coefficients between the timeseries for each pair of fROIs (45 pairwise correlations among the 10 language fROIs, 5 fROIs in each hemisphere). These correlations were Fisher-transformed to improve normality and decrease biases in averaging118.
Lateralization
To determine the degree of LH-lateralization, we used the following formula to calculate the lateralization index for each of the five bilateral language regions: (LH – RH) / (LH + RH) for the response magnitude, volume of activation, and inter-regional functional correlation measures. For response magnitude, we use the t-values119 for the Language>Control contrast averaged across the voxels in each fROI and then across the five fROIs for each hemisphere in each participant. We used t-values to minimize the number of participants with negative contrast values; even so, a few participants showed negative average t-values in one or both hemispheres; to be able to compute the LI values, we used a standard approach of baseline correction, where the largest negative value was added to each value separately for DS1 and DS2120. For volume, the total number of significant voxels per each parcel was used. For the inter-regional functional correlation, the LH and RH values represented the mean correlation among all fROI pairs in each hemisphere.
8. Statistical Analyses
We asked three research questions about the development of the language network, as described next. The analyses were identical across the two datasets, except for analyses of inter-region functional correlations, which were only performed for Dataset 1.
1) Do children (of different ages) show adult-like topography of the left hemisphere language network?
Do children show a reliable response to language relative to the control condition in the LH language network overall, and in the temporal and frontal components separately? For each age group in each dataset, we fit a linear mixed-effects regression model predicting the BOLD response (estimated in each region as described in Critical individual-level neural measures) from condition (Language vs. Control) with random intercepts for participants and fROIs (all of the analyses, including model specification, are available at: https://osf.io/j582b). For completeness, we additionally fit the same model for the RH homotope of the language network (SI-2).
2) Do children (of different ages) show adult-like left hemispheric bias?
First, we tested the effects of hemisphere on the three critical neural measures: i) Magnitude of the Language>Control contrast extracted as described above; ii) Volume of activation for the Language>Control contrast extracted by calculating for each participant in each dataset, the number of language-responsive voxels summed across all parcels, the temporal parcels, and the frontal parcels in each hemisphere. Based on the examination of individual whole-brain activation maps, we chose the p<0.001 uncorrected whole-brain threshold for Dataset 1 and the p<0.01 uncorrected whole-brain threshold for Dataset 2 (the activations were generally weaker for Dataset 2); iii) Strength of inter-regional correlations in the language network was obtained by extracting average pairwise within-hemisphere correlations as described above.
For each of these three critical measures, we fit a linear mixed-effects regression model to test for the effects of age. In one set of analyses, we treated age as a continuous variable and focused on the pediatric population; in a complementary set of analyses, we treated age group (early, middle, late childhood, and adult) as a categorical variable. For the latter analyses, we compared each pediatric group to the adult group using mixed-effects regression models that include age group as a fixed effect. additionally compared the three pediatric groups to each other. We additionally fit the same model separately for the temporal and frontal components.
Next, we asked whether/how the different properties of the language network change over the course of development, controlling for absolute differences in the magnitude of effects and potential task confounds using the lateralization index following prior work (e.g.,39,77,105,121). In particular, we used the following formula on the LH and RH network Language>Control effects, volumes of activation, and inter-regional correlation coefficients obtained as described above: (LH – RH) / (LH + RH); the resulting values vary from 1 (exclusively LH activations) to -1 (exclusively RH activations).
We examined the effect of age on the LIs based on magnitude, volume, and inter-regional correlation, both continuously and categorically. For magnitude values, there were two subjects with negative values (one in DS1 and one in DS2). To allow for LI calculation, we transformed the mean values for each hemisphere by adding the absolute value of the lowest negative number across all fROIs to all values separately for DS1 and DS2. We then fit a linear regression model predicting each LI measure (based on magnitude, volume, and inter-regional correlation) from age as a continuous variable. We then re-ran the linear regressions with age as a categorical variable.
3) Are there age-related differences in the magnitude of response and inter-regional correlations?
To better understand developmental changes in other properties of the language regions, we evaluated the effects of age on (1) the magnitude of activation in the Language>Control contrast and (2) the strength of inter-regional correlations within the language network during resting state. We fit linear mixed-effects models to predict both effects from age (with separate models using age as a continuous and categorical variable), with participants and fROIs as random effects. For the categorical analysis, we then performed pairwise comparisons of the adjusted means for age using Tukey’s method for multiple comparison adjustments and to examine the direction of effects.
All analyses were performed in R v3.5.0 (R Core Team, 2013), using identical statistical thresholds (p < 0.05), and random effect structures (using the package lme4122. Significance of fixed effects in the models was tested in an ANOVA and fitted with restricted maximum likelihood (REML) using the package lmerTest123. Degrees of freedom were estimated using the Satterthwaite approximation123,124.
In addition to the analyses reported in the main text, we performed a version of the analyses where the total number of outlier volumes (see fMRI data preprocessing and first-level modeling) was used as a covariate. The results of the analyses are reported in SI-5 The patterns of results were not affected by the inclusion of this covariate.
CRedIT Statement
Ola Ozernov-Palchik*: Conceptualization, Methodology, Investigation (DS1), Data curation, Formal analysis, Visualization, Writing – original draft
Amanda M. O’Brien*: Conceptualization, Methodology, Formal analysis, Visualization, Writing – original draft
Elizabeth Lee: Data curation; Validation
Hilary Richardson: Methodology, Investigation (DS2), Data curation, Writing – Review & Editing
Rachel Romeo: Methodology, Investigation (DS1), Data curation, Writing – Review & Editing Benjamin Lipkin: Data curation
Hannah Small: Data curation
Jimmy Capella: Investigation (DS1), Data curation
Alfonso Nieto-Castañón: Data curation, Validation, Software
Rebecca Saxe: Methodology, Supervision, Writing – Review & Editing
John D. E. Gabrieli: Methodology, Supervision, Writing – Review & Editing
Evelina Fedorenko: Conceptualization, Methodology, Writing – original draft, Supervision, Project administration
Acknowledgements and Funding Sources
OO was supported by NIH award F32-HD100064 from NICHD. AO was supported by NIH awards T32-DC000038 and F31-DC020864 from NIDCD and the Hock E. Tan and K. Lisa Yang Center for Autism Research. JDEG was supported by the William and Flora Hewlett Foundation (#4429), the Walton Family Foundation, Chan Zuckerberg Initiative for the Reach Every Reader project, and the Halis Fund for Dyslexia Research at MIT. RS was supported by the David and Lucile Packard Foundation (#2008-333024), NIH award R01-MH096914-05 from NIMH, and the Ellison Medical Foundation. EF was supported by NIH awards DC016607 and DC016950 from NIDCD, and NS121471 from NINDS, and funds from the McGovern Institute for Brain Research, the Simons Center for the Social Brain, and the Department of Brain and Cognitive Sciences at MIT. We are grateful to Atsushi Takahashi and Steve Shannon from the Athinoula A. Martinos Imaging Center at MIT. We thank the participants and families for making this research possible.
Footnotes
Classification: Major: Biological Sciences; Minor: Psychological and Cognitive Sciences
The paper was slightly revised based on comments received from colleagues.
References
- 1.↵
- 2.↵
- 3.↵
- 4.↵
- 5.↵
- 6.
- 7.↵
- 8.↵
- 9.↵
- 10.
- 11.↵
- 12.↵
- 13.↵
- 14.
- 15.↵
- 16.↵
- 17.↵
- 18.↵
- 19.↵
- 20.↵
- 21.↵
- 22.↵
- 23.
- 24.↵
- 25.↵
- 26.↵
- 27.↵
- 28.↵
- 29.↵
- 30.↵
- 31.↵
- 32.↵
- 33.↵
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.↵
- 39.↵
- 40.↵
- 41.↵
- 42.↵
- 43.↵
- 44.↵
- 45.↵
- 46.↵
- 47.↵
- 48.↵
- 49.↵
- 50.↵
- 51.↵
- 52.↵
- 53.↵
- 54.↵
- 55.↵
- 56.↵
- 57.↵
- 58.↵
- 59.↵
- 60.↵
- 61.↵
- 62.↵
- 63.↵
- 64.↵
- 65.↵
- 66.↵
- 67.↵
- 68.↵
- 69.↵
- 70.↵
- 71.↵
- 72.↵
- 73.
- 74.
- 75.
- 76.↵
- 77.↵
- 78.↵
- 79.↵
- 80.↵
- 81.↵
- 82.↵
- 83.↵
- 84.↵
- 85.↵
- 86.↵
- 87.↵
- 88.↵
- 89.↵
- 90.↵
- 91.↵
- 92.↵
- 93.↵
- 94.↵
- 95.↵
- 96.
- 97.↵
- 98.↵
- 99.↵
- 100.↵
- 101.↵
- 102.↵
- 103.↵
- 104.↵
- 105.↵
- 106.↵
- 107.↵
- 108.↵
- 109.↵
- 110.
- 111.↵
- 112.↵
- 113.↵
- 114.↵
- 115.↵
- 116.↵
- 117.↵
- 118.↵
- 119.↵
- 120.↵
- 121.↵
- 122.↵
- 123.↵
- 124.↵