Computational strategy for accelerating robust sound source detection in dynamic scenes

Kevin D. Donohue, Paul M. Griffioen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Efficient sound source detection and location with microphone arrays is important for many applications, including teleconferencing, surveillance, and smart rooms. While the steered response power algorithms exhibit robust performance relative to other approaches, their applications are limited by the high computational load required. For dynamic auditory scenes, the entire space must be scanned at regular intervals due to moving sound sources switching between active and inactive states. This paper introduces a time segmentation and parallelization strategy to speed up the steered response power algorithm for dynamic auditory scenes with multiple speech sources. The primary application targeted by this work is for immersive arrays and off-line auditory scene analysis with beamforming for speaker separation in cocktail party environments. Results from a Monte Carlo simulation with 6 speech sources in a mildly reverberant environment demonstrate a speed-up factor of 45, with a modest loss in the number of detections and a significant reduction in anomalous detections. Experimental results with real recordings demonstrate a performance consistent with those of the simulation.

Original languageEnglish
Title of host publicationConference Proceedings - IEEE SOUTHEASTCON
ISBN (Electronic)9781479965854
DOIs
StatePublished - Nov 7 2014
EventIEEE SoutheastCon 2014 - Lexington, United States
Duration: Mar 13 2014Mar 16 2014

Publication series

NameConference Proceedings - IEEE SOUTHEASTCON
ISSN (Print)1091-0050
ISSN (Electronic)1558-058X

Conference

ConferenceIEEE SoutheastCon 2014
Country/TerritoryUnited States
CityLexington
Period3/13/143/16/14

Bibliographical note

Publisher Copyright:
© 2014 IEEE.

Keywords

  • MATLAB
  • Steered Response Power
  • cocktail party
  • microphone arrays
  • parallel processing
  • sound source detection

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Software
  • Electrical and Electronic Engineering
  • Control and Systems Engineering
  • Signal Processing

Fingerprint

Dive into the research topics of 'Computational strategy for accelerating robust sound source detection in dynamic scenes'. Together they form a unique fingerprint.

Cite this