Abstract
The bulk of support for predictive coding models has come from their ability to simulate known perceptual or neuronal phenomena; fewer attempts have been made to identify a reliable neural signature of predictive coding. Here we propose that the N300 component of the event-related potential (ERP), occurring 250-350 ms post-stimulus-onset, may be such a signature of perceptual hypothesis testing operating at the scale of whole objects and scenes. We show that N300 amplitudes are smaller to representative (“good exemplars”) than to less representative (“bad exemplars”) items from natural scene categories. Integrating these results with patterns observed for objects, we establish that, across a variety of visual stimuli, the N300 is responsive to statistical regularity, that is, the degree to which the input is “expected” (either explicitly or implicitly) by the system on the basis of prior knowledge: statistically regular images, which entail reduced prediction error, evoke a reduced response. Moreover, we show that the measure is context-dependent; the N300 is sensitive to category representativeness only when stimuli are congruent with a category pre-cue, not when they are incongruent with it, suggesting that the component reflects the ease with which an image matches the current hypothesis generated by the visual system. We therefore argue that the N300 is the best candidate to date for an index of perceptual hypothesis testing, whereby incoming sensory information about complex visual objects and scenes is assessed against contextual predictions generated in mid-level visual areas.
Significance Statement
Predictive coding models postulate that our perception of visual sensory input is guided by prior knowledge and the situational context, such that perception is facilitated when the input matches expectation and hence produces less prediction error. Here, we show that an electrophysiological measure, the N300, matches the features hypothesized for a measure of predictive coding: complex scenes (like objects) elicit less N300 activity when they are statistically regular (e.g., more representative of their categories), in a manner that is itself context dependent. We thus show that the N300 provides a window into the interaction of context, prediction, and visual perception.
Competing Interest Statement
The authors have declared no competing interest.