TY - GEN
T1 - Audio signal delay estimation using partial whitening
AU - Donohue, Kevin D.
AU - Agrinsoni, Alvin
AU - Hannemann, Jens
PY - 2007
Y1 - 2007
N2 - This work examines time and frequency domain implementations for estimating delays between acoustic signals arriving at spatially distributed microphones. A parametric variant of the phase-only transform (PHAT) is introduced for partially whitening the signal before estimating the delay. The PHAT variant is referred to as the PHAT-β and is shown to be advantageous when processing signals corrupted by both independent noise and reverberation effects. Simulations show superior performance for the time-domain implementation under conditions of independent noise for time-limited broadband signals, achieving low estimation errors at signal-to-noise ratios 8 to 13 dB lower than that required for a frequency-domain implementation. Extensive Monte Carlo simulations are also performed for the time-domain delay estimator using the PHAT-β on speech signals corrupted by reverberation and independent noise. Performance metrics include percent anomalous detections as well as the root mean square estimation error. Results show that partial whitening leads to significant improvements over zero or total whitening (as in the case of the standard PHAT). Simulations indicate that robust performance can be achieved for β values near 0.4 when both reverberations and independent noises are present.
AB - This work examines time and frequency domain implementations for estimating delays between acoustic signals arriving at spatially distributed microphones. A parametric variant of the phase-only transform (PHAT) is introduced for partially whitening the signal before estimating the delay. The PHAT variant is referred to as the PHAT-β and is shown to be advantageous when processing signals corrupted by both independent noise and reverberation effects. Simulations show superior performance for the time-domain implementation under conditions of independent noise for time-limited broadband signals, achieving low estimation errors at signal-to-noise ratios 8 to 13 dB lower than that required for a frequency-domain implementation. Extensive Monte Carlo simulations are also performed for the time-domain delay estimator using the PHAT-β on speech signals corrupted by reverberation and independent noise. Performance metrics include percent anomalous detections as well as the root mean square estimation error. Results show that partial whitening leads to significant improvements over zero or total whitening (as in the case of the standard PHAT). Simulations indicate that robust performance can be achieved for β values near 0.4 when both reverberations and independent noises are present.
UR - http://www.scopus.com/inward/record.url?scp=34547677902&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34547677902&partnerID=8YFLogxK
U2 - 10.1109/SECON.2007.342946
DO - 10.1109/SECON.2007.342946
M3 - Conference contribution
AN - SCOPUS:34547677902
SN - 1424410290
SN - 9781424410293
T3 - Conference Proceedings - IEEE SOUTHEASTCON
SP - 466
EP - 471
BT - 2007 IEEE SoutheastCon
T2 - 2007 IEEE SoutheastCon
Y2 - 22 March 2007 through 25 March 2007
ER -