Distributed microphone systems can be used to enhance intelligibility for a speaker of interest (SOI) in a noisy environment with multiple speech sources (the cocktail party scenario). For finite microphone distributions, however, interfering speech sources leak into the beamformed signal and degrade intelligibility. This article introduces an auditory-inspired post-processing algorithm for beamformed signals that uses spectrotemporal cues to enhance SOI intelligibility. Spatial power ratios obtained by beamforming on multiple locations are used to identify and mask out time-frequency regions dominated by the interfering speech. Performance results based on planar microphone array simulations show consistent increases in the Speech Intelligibility Index (SII) over the beamformed signal for various speaker configurations using 2 to 16 microphones. In cases of critically low SII (< 0.25), interference masking raises the SII to beyond 0.3 for the 2-microphone case and to above 0.5 for the 16-microphone case. Experimental recordings were also made, and examples are presented. These recordings show improvements consistent with the simulations.
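The masking step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the beamformer has already been steered to the SOI location and to each interferer location, giving one power value per time-frequency cell for each location. The ratio definition (SOI power over total power), the binary mask, the 0.5 threshold, and all function names are assumptions chosen for illustration.

```python
from typing import List, Sequence

# Hypothetical layout: grid[frame][frequency_bin] holds one value per cell.
Grid = List[List[float]]


def spatial_power_ratio(soi_power: Grid,
                        interferer_powers: Sequence[Grid],
                        eps: float = 1e-12) -> Grid:
    """Per-cell fraction of beamformed power attributed to the SOI location.

    Assumed definition: P_soi / (P_soi + sum of interferer powers).
    """
    ratio = []
    for t, row in enumerate(soi_power):
        out_row = []
        for f, p_soi in enumerate(row):
            p_int = sum(grid[t][f] for grid in interferer_powers)
            # eps avoids division by zero in silent cells
            out_row.append(p_soi / (p_soi + p_int + eps))
        ratio.append(out_row)
    return ratio


def binary_mask(ratio: Grid, threshold: float = 0.5) -> Grid:
    """Keep (1.0) cells where the SOI dominates, mask out (0.0) the rest."""
    return [[1.0 if r >= threshold else 0.0 for r in row] for row in ratio]


def apply_mask(soi_spectrum, mask: Grid):
    """Scale each time-frequency cell of the SOI beamformer output by the mask."""
    return [[x * m for x, m in zip(x_row, m_row)]
            for x_row, m_row in zip(soi_spectrum, mask)]


# Toy example: 2 frames x 2 frequency bins, one interfering location.
soi = [[4.0, 1.0], [0.0, 9.0]]
interferers = [[[1.0, 3.0], [2.0, 1.0]]]
ratio = spatial_power_ratio(soi, interferers)
mask = binary_mask(ratio)
masked = apply_mask([[1 + 1j, 2.0], [3.0, 4j]], mask)
```

In this toy case the SOI dominates the first cell of frame 0 and the second cell of frame 1, so only those two cells survive. A practical system might use a soft mask or a frequency-dependent threshold; the abstract does not say which choice the authors made.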
|Title of host publication||Conference Proceedings - IEEE SOUTHEASTCON|
|State||Published - Nov 7 2014|
|Event||IEEE SoutheastCon 2014 - Lexington, United States (Mar 13 2014 → Mar 16 2014)|
|Name||Conference Proceedings - IEEE SOUTHEASTCON|
|Conference||IEEE SoutheastCon 2014|
|Period||3/13/14 → 3/16/14|
Bibliographical note
Publisher Copyright: © 2014 IEEE.
ASJC Scopus subject areas
- Computer Networks and Communications
- Electrical and Electronic Engineering
- Control and Systems Engineering
- Signal Processing