Trends in Neurosciences
OpinionTemporal coherence and attention in auditory scene analysis
Section snippets
The auditory scene analysis problem
Humans and other animals routinely detect, identify and track sounds coming from a particular source (e.g. someone's voice, a conspecific call) among sounds emanating from other sources (e.g. other voices, heterospecific calls, ambient music and street traffic) (Figure 1). The apparent ease with which they determine which components and attributes in a sound mixture arise from the same source belies the complexity of the underlying biological processes. By analogy with the scene segmentation
Temporal coherence in auditory scene analysis
Problems inherent to auditory scene analysis are similar to those found in visual scene analysis. However, there are a few notable unique aspects. In particular, whereas natural and artificial visual scenes often contain a large proportion of static or slow-moving elements, auditory scenes are essentially dynamic, containing many fast-changing, relatively brief acoustic events (referred to as tokens in Box 1) 30, 31. Therefore, an essential aspect of auditory scene analysis is the linking over
Is streaming a pre-attentive process?
A widely held view by which has emerged from electrophysiological studies in humans 28, 29, 67, 68, 69, 70, is that auditory streams are formed pre-attentively in the auditory system, much like the extraction of low-level features in early pre-cortical stages. Depending on the listener's intentions and guided by representations of previously encountered auditory objects (or streams) that are now stored in memory, attention would simply serve to enhance the perception of a particular stream in
Summary
Here, we proposed two ideas within an overall framework to explain the perception of auditory scenes. The first is that auditory stream formation is critically dependent on the temporal coherence between neural responses to sounds in the auditory cortex. Specifically, when stimulus-induced cortical responses are temporally coherent, the features they represent can potentially become perceptually unified (or bound) as one stream, distinct from other temporally incoherent responses. This
Acknowledgments
This work was supported by the following grants to the authors: NIH R0107657, MURI N000141010278, AFOSR FA9550-09-1-0234 and NSF CAREER award IIS-0846112.
Glossary
- Auditory scene analysis
- processes by which sequential and concurrent acoustic events are analyzed and organized into auditory streams.
- Auditory stream
- series of sounds perceived by the listener as a coherent entity and, as such, can be selectively attended to among other sounds. The word ‘stream’ emphasizes the fact that sounds usually unfold over time. Although sounds coming from different physical sound sources typically form separate streams, this is not always the case. For example, a choir
References (104)
How the brain separates sounds
Trends Cogn. Sci.
(2004)Processing of complex stimuli and natural scenes in the auditory cortex
Curr. Opin. Neurobiol.
(2004)Spectral processing and sound source determination
Int. Rev. Neurobiol.
(2005)Breaking the wave: effects of attention and learning on concurrent sound perception
Hear. Res.
(2007)The role of auditory cortex in the formation of auditory streams
Hear. Res.
(2007)Modeling the auditory scene: predictive regularity representations and perceptual objects
Trends Cogn. Sci.
(2009)- et al.
Behind the scenes of auditory perception
Curr. Opin. Neurobiol.
(2010) - et al.
Pitch, harmonicity and concurrent sound segregation: Psychoacoustical and neurophysiological findings
Hearing Res.
(2010) Neural correlates of auditory stream segregation in primary auditory cortex of the awake monkey
Hear. Res.
(2001)Perceptual organization of tone sequences in the auditory cortex of awake macaques
Neuron
(2005)
Perceptual organization of sound begins in the auditory periphery
Curr. Biol.
Spectral processing in the auditory cortex
Int. Rev. Neurobiol.
Binaural response-specific bands in primary auditory cortex (AI) of the cat: topographical organization orthogonal to isofrequency contours
Brain Res.
Temporal coherence in the perceptual organization and cortical representation of auditory scenes
Neuron
Multivariate receptive field mapping in marmoset auditory cortex
J. Neurosci. Methods
A multilevel and cross-modal approach towards neuronal mechanisms of auditory streaming
Brain Res.
Object-based auditory and visual attention
Trends Cogn. Sci.
Auditory attention – focusing the searchlight on sound
Curr. Opin. Neurobiol.
Synchrony: a neuronal mechanism for attentional selection?
Curr. Opin. Neurobiol.
Spike times make sense
Trends Neurosci.
Untangling invariant object recognition
Trends Cogn. Sci.
Auditory Scene Analysis: The Perceptual Organization of Sound
Some experiments on the recognition of speech, with one and two ears
J. Acoust. Soc. Am.
The cocktail party problem: what is it? How can it be solved? And why should animal behaviorists study it?
J. Comp. Psychol.
The cocktail party problem
Curr. Biol.
Evaluating the benefit of hearing aids in solving the cocktail party problem
Trends Amplif.
Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
Sound source perception and stream segregation in nonhuman vertebrate animals
Neurophysiological mechanisms involved in auditory perceptual organization
Front Neurosci.
Toward a neurophysiological theory of auditory stream segregation
Psychol. Bull.
Stream segregation and peripheral channeling
Mus. Percep.
Computer simulation of auditory stream segregation in alternating-tone sequences
J. Acoust. Soc. Am.
A model of auditory streaming
J. Acoust. Soc. Am.
Neurodynamics for auditory stream segregation: tracking sounds in the mustached bat's natural environment
Network
Primitive auditory stream segregation: a neurophysiological study in the songbird forebrain
J. Neurophysiol.
Auditory stream segregation in monkey auditory cortex: effects of frequency separation, presentation rate, and tone duration
J. Acoust. Soc. Am.
Auditory stream segregation in the songbird forebrain: effects of time intervals on responses to interleaved tone sequences
Brain Behav. Evol.
An investigation of the auditory streaming effect using event-related brain potentials
Psychophysiol.
The role of attention in the formation of auditory streams
Percept. Psychophys.
Temporal low-order statistics of natural sounds
Adv. Neural Inf. Process. Syst.
Modulation spectra of natural sounds and ethological theories of auditory processing
J. Acoust. Soc. Am.
The role of spectral and periodicity cues in auditory stream segregation, measured using a temporal discrimination task
J. Acoust. Soc. Am.
Auditory stream segregation on the basis of amplitude-modulation rate
J. Acoust. Soc. Am.
Factors influencing sequential stream segregation
Acta Acustica
The relation between auditory temporal interval processing and sequential stream segregation examined with stimulus laterality differences
J. Percept. Psychophys.
Topography of excitatory bandwidth in cat primary auditory cortex: single-neuron versus multiple-neuron recordings
J. Neurophysiol.
Order and disorder in auditory cortical maps
Curr. Opin. Neurobiol.
Ripple analysis in the ferret primary auditory cortex. III. Topographic and columnar distribution of ripple response parameters
Aud. Neurosci.
Analysis of dynamic spectra in ferret primary auditory cortex. I. Characteristics of single-unit responses to moving ripple spectra
J. Neurophysiol.
Analysis of dynamic spectra in ferret primary auditory cortex. II. Prediction of unit responses to arbitrary dynamic spectra
J. Neurophysiol.
Cited by (311)
Slow neural oscillations explain temporal fluctuations in distractibility
2023, Progress in NeurobiologyFunctional network properties of the auditory cortex
2023, Hearing ResearchAtypical cortical processing of bottom-up speech binding cues in children with autism spectrum disorders
2023, NeuroImage: ClinicalCross-Modal Interactions Between Auditory Attention and Oculomotor Control
2024, Journal of Neuroscience