Joint frequency domain and reconstructed phase space features for speech recognition

Andrew C. Lindgren, Michael T. Johnson, Richard J. Povinelli

Research output: Contribution to journalConference articlepeer-review

8 Scopus citations

Abstract

A novel method for speech recognition is presented, utilizing nonlinear/chaotic signal processing techniques to extract time-domain based, reconstructed phase space features. This work examines the incorporation of trajectory information into this model as well as the combination of both MFCC and RPS feature sets into one joint feature vector. The results demonstrate that integration of trajectory information increases the recognition accuracy of the typical RPS feature set, and when MFCC and RPS feature sets are combined, improvement is made over the baseline. This result suggests that the features extracted using these nonlinear techniques contain different discriminatory information than the features extracted from linear approaches alone.

Original languageEnglish
Pages (from-to)I533-I536
JournalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume1
StatePublished - 2004
EventProceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing - Montreal, Que, Canada
Duration: May 17 2004May 21 2004

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Joint frequency domain and reconstructed phase space features for speech recognition'. Together they form a unique fingerprint.

Cite this