Using stochastic language models (SLM) to map lexical, syntactic, and phonological information processing in the brain

PLoS One. 2017 May 18;12(5):e0177794. doi: 10.1371/journal.pone.0177794. eCollection 2017.

Abstract

Language comprehension involves the simultaneous processing of information at the phonological, syntactic, and lexical level. We track these three distinct streams of information in the brain by using stochastic measures derived from computational language models to detect neural correlates of phoneme, part-of-speech, and word processing in an fMRI experiment. Probabilistic language models have proven to be useful tools for studying how language is processed as a sequence of symbols unfolding in time. Conditional probabilities between sequences of words are at the basis of probabilistic measures such as surprisal and perplexity which have been successfully used as predictors of several behavioural and neural correlates of sentence processing. Here we computed perplexity from sequences of words and their parts of speech, and their phonemic transcriptions. Brain activity time-locked to each word is regressed on the three model-derived measures. We observe that the brain keeps track of the statistical structure of lexical, syntactic and phonological information in distinct areas.

MeSH terms

  • Adolescent
  • Adult
  • Brain / diagnostic imaging
  • Brain / physiology*
  • Brain Mapping*
  • Female
  • Humans
  • Image Processing, Computer-Assisted
  • Language*
  • Magnetic Resonance Imaging
  • Male
  • Mental Processes / physiology*
  • Models, Neurological*
  • Stochastic Processes
  • Young Adult

Grants and funding

The work presented here was funded by Netherlands Organisation for Scientific Research (NWO) Gravitation Grant 024.001.006 to the Language in Interaction Consortium and by NWO Vidi grant (NWO-Vidi 276-89-007).