Multistage audiovisual integration of speech: dissociating identification and detection

Exp Brain Res. 2011 Feb;208(3):447-57. doi: 10.1007/s00221-010-2495-9. Epub 2010 Dec 25.

Abstract

Speech perception integrates auditory and visual information. This is evidenced by the McGurk illusion, in which seeing the talking face influences the auditory phonetic percept, and by the audiovisual detection advantage, in which seeing the talking face influences the detectability of the acoustic speech signal. Here, we show that identification of phonetic content and detection can be dissociated as speech-specific and non-specific audiovisual integration effects, respectively. To this end, we employed sine wave speech (SWS), a synthetically modified, impoverished speech signal that only observers informed of its speech-like nature recognize as speech. While the McGurk illusion occurred only for informed observers, the audiovisual detection advantage occurred for naïve observers as well. This finding supports a multistage account of audiovisual integration of speech in which the many attributes of the audiovisual speech signal are integrated by separate integration processes.

Publication types

  • Comparative Study

MeSH terms

  • Acoustic Stimulation / methods*
  • Adult
  • Audiovisual Aids
  • Auditory Perception / physiology
  • Female
  • Humans
  • Male
  • Phonetics
  • Photic Stimulation / methods*
  • Psychomotor Performance / physiology*
  • Speech Perception / physiology*
  • Visual Perception / physiology*
  • Young Adult