bioRxiv
New Results

A biologically oriented algorithm for spatial sound segregation

Kenny F Chou, Virginia Best, H Steven Colburn, Kamal Sen
doi: https://doi.org/10.1101/2020.11.04.368548
Kenny F Chou
1Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
  • For correspondence: kfchou@bu.edu
Virginia Best
2Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts, United States of America
H Steven Colburn
1Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Kamal Sen
1Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America

Abstract

Listening in an acoustically cluttered scene remains a difficult task for both machines and hearing-impaired listeners. Normal-hearing listeners accomplish this task with relative ease by segregating the scene into its constituent sound sources, then selecting and attending to a target source. An assistive listening device that mimics the biological mechanisms underlying this behavior may provide an effective solution for those who have difficulty listening in acoustically cluttered environments (e.g., a cocktail party). Here, we present a binaural sound segregation algorithm based on a hierarchical network model of the auditory system. In the algorithm, binaural sound inputs first drive populations of neurons tuned to specific spatial locations and frequencies. Lateral inhibition then sharpens the spatial response of the neurons. Finally, the spiking responses of neurons in the output layer are reconstructed into audible waveforms via a novel reconstruction method. We evaluate the performance of the algorithm using psychoacoustic measures in normal-hearing listeners. This two-microphone algorithm is shown to provide listeners with a perceptual benefit similar to that of a 16-microphone acoustic beamformer in a difficult listening task. Unlike deep-learning approaches, the proposed algorithm is biologically interpretable and does not need to be trained on large datasets. This study presents a biologically based algorithm for sound source segregation as well as a method to reconstruct highly intelligible audio signals from spiking models.
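
The lateral-inhibition step mentioned in the abstract can be illustrated with a small rate-based sketch. This is not the authors' spiking network: the Gaussian tuning curve, the 40-degree tuning width, the five preferred azimuths, and the helper names gaussian_spatial_tuning and lateral_inhibition are all assumptions chosen for illustration. The sketch only shows the general idea that subtracting the activity of competing spatial channels narrows the response around the source location.

```python
import numpy as np

def gaussian_spatial_tuning(source_az, preferred_az, width_deg=40.0):
    """Hypothetical broad Gaussian tuning of each spatial channel (degrees)."""
    return np.exp(-0.5 * ((source_az - preferred_az) / width_deg) ** 2)

def lateral_inhibition(responses, strength=1.0):
    """Inhibit each spatial channel by the mean of the other channels.

    responses: non-negative array of shape (n_locations, n_freqs).
    Inhibition acts within each frequency channel, followed by
    rectification so that rates stay non-negative.
    """
    n_locations = responses.shape[0]
    total = responses.sum(axis=0, keepdims=True)
    others_mean = (total - responses) / (n_locations - 1)
    return np.maximum(responses - strength * others_mean, 0.0)

# Assumed layout: five spatial channels, three frequency channels,
# and a single source located straight ahead (0 degrees).
preferred_az = np.array([-90.0, -45.0, 0.0, 45.0, 90.0])
tuning = gaussian_spatial_tuning(0.0, preferred_az)
responses = np.outer(tuning, np.ones(3))
sharpened = lateral_inhibition(responses)

# Ratio of the on-target channel to its neighbor, before and after.
contrast_before = responses[2, 0] / responses[1, 0]
contrast_after = sharpened[2, 0] / sharpened[1, 0]
```

In this toy setting the channels tuned to ±90 degrees are silenced entirely, and the on-target channel stands out more strongly against its ±45-degree neighbors after inhibition than before. The strength parameter is a free choice here and is not taken from the paper.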

Author Summary

Animals and humans can navigate complex auditory environments with relative ease, attending to certain sounds while suppressing others. Typically, different sounds originate from different spatial locations. This paper presents an algorithmic model that performs sound segregation based on how animals make use of this spatial information at various stages of the auditory pathway. We show that this two-microphone algorithm provides as much benefit to normal-hearing listeners as a multi-microphone algorithm. Unlike mathematical and machine-learning approaches, our model is fully interpretable and does not require training with large datasets. Such an approach may benefit the design of machine hearing algorithms. To interpret the spike trains generated in the model, we designed a method to recover sounds from model spikes with high intelligibility. This method can be applied to spiking neural networks for audio-related applications, or to interpret each node within a spiking model of the auditory cortex.

Competing Interest Statement

The authors have declared no competing interest.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted November 04, 2020.
A biologically oriented algorithm for spatial sound segregation
Kenny F Chou, Virginia Best, H Steven Colburn, Kamal Sen
bioRxiv 2020.11.04.368548; doi: https://doi.org/10.1101/2020.11.04.368548

Subject Area

  • Neuroscience