Abstract
Frame alignments can be computed by different methods in GMM-based speaker verification. By incorporating a phonetic Gaussian mixture model (PGMM), we are able to compare the performance using alignments extracted from the deep neural networks (DNN) and the conventional hidden Markov model (HMM) in digit-prompted speaker verification. Based on the different characteristics of these two alignments, we present a novel content verification method to improve the system security without much computational overhead. Our experiments on the RSR2015 Part-3 digit-prompted task show that, the DNN-based alignment performs on par with the HMM alignment. The results also demonstrate the effectiveness of the proposed Kullback-Leibler (KL) divergence based scoring to reject speech with incorrect pass-phrases.
Original language | English |
---|---|
Title of host publication | 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings |
Pages | 1467-1472 |
Number of pages | 6 |
ISBN (Electronic) | 9789881476852 |
DOIs | |
State | Published - Jul 2 2018 |
Event | 10th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Honolulu, United States Duration: Nov 12 2018 → Nov 15 2018 |
Publication series
Name | 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings |
---|
Conference
Conference | 10th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 |
---|---|
Country/Territory | United States |
City | Honolulu |
Period | 11/12/18 → 11/15/18 |
Bibliographical note
Publisher Copyright:© 2018 APSIPA organization.
ASJC Scopus subject areas
- Information Systems