Abstract
This paper examines the use of multi-band reconstructed phase spaces as models for phoneme classification. Sub-banding reconstructed phase spaces combines linear, frequency-based techniques with a nonlinear modeling approach to speech recognition. Experiments comparing the effects of filtering speech signals for both reconstructed phase space and traditional speech recognition approaches are presented. These experiments study the use of two non-overlapping sub-bands for isolated phoneme classification on the TIMIT corpus. It is shown that while classification accuracy using Mel frequency cepstral coefficients as features does not improve with, sub-banding, the accuracy increases from 36.1% to 42.0% using sub-banded reconstructed phase spaces to model the phonemes.
Original language | English |
---|---|
Pages | 634-637 |
Number of pages | 4 |
State | Published - 2004 |
Event | 2004 7th International Conference on Signal Processing Proceedings (ICSP'04) - Beijing, China Duration: Aug 31 2004 → Sep 4 2004 |
Conference
Conference | 2004 7th International Conference on Signal Processing Proceedings (ICSP'04) |
---|---|
Country/Territory | China |
City | Beijing |
Period | 8/31/04 → 9/4/04 |
ASJC Scopus subject areas
- Software
- Signal Processing
- Computer Science Applications