TY - GEN
T1 - Multichannel speech recognition using distributed microphone signal fusion strategies
AU - Trawicki, Marek B.
AU - Johnson, Michael T.
AU - Ji, An
AU - Osiejuk, Tomasz S.
PY - 2012
Y1 - 2012
AB - Multichannel fusion strategies are presented for the distributed microphone recognition environment, for the task of song-type recognition in a multichannel songbird dataset. The signals are first fused together based on various heuristics, including their amplitudes, variances, physical distance, or squared distance, before passing the enhanced single-channel signal into the speech recognition system. The intensity-weighted fusion strategy achieved the highest overall recognition accuracy of 94.4%. By combining the noisy distributed microphone signals in an intelligent way that is proportional to the information contained in the signals, speech recognition systems can achieve higher recognition accuracies.
UR - http://www.scopus.com/inward/record.url?scp=84872141158&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84872141158&partnerID=8YFLogxK
DO - 10.1109/ICALIP.2012.6376789
M3 - Conference contribution
AN - SCOPUS:84872141158
SN - 9781467301718
T3 - ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings
SP - 1146
EP - 1150
BT - ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings
T2 - 2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012
Y2 - 16 July 2012 through 18 July 2012
ER -