A comparison of reconstructed phase spaces and cepstral coefficients for multi-band phoneme classification

Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson

Research output: Contribution to conferencePaperpeer-review

2 Scopus citations

Abstract

This paper examines the use of multi-band reconstructed phase spaces as models for phoneme classification. Sub-banding reconstructed phase spaces combines linear, frequency-based techniques with a nonlinear modeling approach to speech recognition. Experiments comparing the effects of filtering speech signals for both reconstructed phase space and traditional speech recognition approaches are presented. These experiments study the use of two non-overlapping sub-bands for isolated phoneme classification on the TIMIT corpus. It is shown that while classification accuracy using Mel frequency cepstral coefficients as features does not improve with, sub-banding, the accuracy increases from 36.1% to 42.0% using sub-banded reconstructed phase spaces to model the phonemes.

Original languageEnglish
Pages634-637
Number of pages4
StatePublished - 2004
Event2004 7th International Conference on Signal Processing Proceedings (ICSP'04) - Beijing, China
Duration: Aug 31 2004Sep 4 2004

Conference

Conference2004 7th International Conference on Signal Processing Proceedings (ICSP'04)
Country/TerritoryChina
CityBeijing
Period8/31/049/4/04

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'A comparison of reconstructed phase spaces and cepstral coefficients for multi-band phoneme classification'. Together they form a unique fingerprint.

Cite this