Multichannel speech recognition using distributed microphone signal fusion strategies

Marek B. Trawicki, Michael T. Johnson, An Ji, Tomasz S. Osiejuk

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Multichannel fusion strategies are presented for the distributed microphone recognition environment, for the task of song-type recognition in a multichannel songbird dataset. The signals are first fused together based on various heuristics, including their amplitudes, variances, physical distance, or squared distance, before passing the enhanced single-channel signal into the speech recognition system. The intensity-weighted fusion strategy achieved the highest overall recognition accuracy of 94.4%. By combining the noisy distributed microphone signals in an intelligent way that is proportional to the information contained in the signals, speech recognition systems can achieve higher recognition accuracies.

Original languageEnglish
Title of host publicationICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings
Pages1146-1150
Number of pages5
DOIs
StatePublished - 2012
Event2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012 - Shanghai, China
Duration: Jul 16 2012Jul 18 2012

Publication series

NameICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings

Conference

Conference2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012
Country/TerritoryChina
CityShanghai
Period7/16/127/18/12

ASJC Scopus subject areas

  • Language and Linguistics
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Multichannel speech recognition using distributed microphone signal fusion strategies'. Together they form a unique fingerprint.

Cite this