Resumen
This paper presents a time-aligned singular value decomposition (SVD) analysis for speaker identification. SVD analysis has been used for fast spectral matching based on a global representation of an entire utterance. We incorporate temporal normalization directly into the decomposition by using a dynamic time warping (DTW) path to time-align the rows of the feature matrix prior to SVD analysis. Speaker identification results using the TI-46 database indicates that the time-aligned SVD significantly improves accuracy for most threshold choices.
| Idioma original | English |
|---|---|
| Páginas (desde-hasta) | IV/4160 |
| Publicación | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
| Volumen | 4 |
| Estado | Published - 2002 |
| Evento | 2002 IEEE International Conference on Acoustic, Speech, and Signal Processing - Orlando, FL, United States Duración: may 13 2002 → may 17 2002 |
ASJC Scopus subject areas
- Software
- Signal Processing
- Electrical and Electronic Engineering