Using stochastic language models (SLM) to map lexical, syntactic, and phonological information processing in the brain

Alessandro Lopopolo; Stefan L Frank; Antal van den Bosch; Roel M Willems

doi:10.1371/journal.pone.0177794

Using stochastic language models (SLM) to map lexical, syntactic, and phonological information processing in the brain

PLoS One. 2017 May 18;12(5):e0177794. doi: 10.1371/journal.pone.0177794. eCollection 2017.

Authors

Alessandro Lopopolo¹, Stefan L Frank¹, Antal van den Bosch^{1

2}, Roel M Willems^{1

3

4}

Affiliations

¹ Centre for Language Studies, Radboud University Nijmegen, Nijmegen, the Netherlands.
² Meertens Institute, Royal Netherlands Academy of Science and Arts, Amsterdam, the Netherlands.
³ Donders Institute, Radboud University Nijmegen, Nijmegen, the Netherlands.
⁴ Max Planck Institute for Psycholinguistics, Nijmegen, the Netherlands.

Abstract

Language comprehension involves the simultaneous processing of information at the phonological, syntactic, and lexical level. We track these three distinct streams of information in the brain by using stochastic measures derived from computational language models to detect neural correlates of phoneme, part-of-speech, and word processing in an fMRI experiment. Probabilistic language models have proven to be useful tools for studying how language is processed as a sequence of symbols unfolding in time. Conditional probabilities between sequences of words are at the basis of probabilistic measures such as surprisal and perplexity which have been successfully used as predictors of several behavioural and neural correlates of sentence processing. Here we computed perplexity from sequences of words and their parts of speech, and their phonemic transcriptions. Brain activity time-locked to each word is regressed on the three model-derived measures. We observe that the brain keeps track of the statistical structure of lexical, syntactic and phonological information in distinct areas.

MeSH terms

Adolescent
Adult
Brain / diagnostic imaging
Brain / physiology*
Brain Mapping*
Female
Humans
Image Processing, Computer-Assisted
Language*
Magnetic Resonance Imaging
Male
Mental Processes / physiology*
Models, Neurological*
Stochastic Processes
Young Adult

Grants and funding

The work presented here was funded by Netherlands Organisation for Scientific Research (NWO) Gravitation Grant 024.001.006 to the Language in Interaction Consortium and by NWO Vidi grant (NWO-Vidi 276-89-007).