Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Time-domain isolated phoneme classification using reconstructed phase spaces

  • Michael T. Johnson
  • , Richard J. Povinelli
  • , Andrew C. Lindgren
  • , Jinjin Ye
  • , Xiaolin Liu
  • , Kevin M. Indrebo

Producción científica: Articlerevisión exhaustiva

39 Citas (Scopus)

Resumen

This paper introduces a novel time-domain approach to modeling and classifying speech phoneme waveforms. The approach is based on statistical models of reconstructed phase spaces, which offer significant theoretical benefits as representations that are known to be topologically equivalent to the state dynamics of the underlying production system. The lag and dimension parameters of the reconstruction process for speech are examined in detail, comparing common estimation heuristics for these parameters with corresponding maximum likelihood recognition accuracy over the TIMIT data set. Overall accuracies are compared with a Mel-frequency cepstral baseline system across five different phonetic classes within TIMIT, and a composite classifier using both cepstral and phase space features is developed. Results indicate that although the accuracy of the phase space approach by itself is still currently below that of baseline cepstral methods, a combined approach is capable of increasing speaker independent phoneme accuracy.

Idioma originalEnglish
Páginas (desde-hasta)458-465
Número de páginas8
PublicaciónIEEE Transactions on Speech and Audio Processing
Volumen13
N.º4
DOI
EstadoPublished - jul 2005

ASJC Scopus subject areas

  • Software
  • Acoustics and Ultrasonics
  • Computer Vision and Pattern Recognition
  • Electrical and Electronic Engineering

Huella

Profundice en los temas de investigación de 'Time-domain isolated phoneme classification using reconstructed phase spaces'. En conjunto forman una huella única.

Citar esto