TY - GEN
T1 - Features for phoneme independent speaker identification
AU - Wang, Jianglin
AU - Ji, An
AU - Johnson, Michael T.
N1 - Copyright:
Copyright 2013 Elsevier B.V., All rights reserved.
PY - 2012
Y1 - 2012
N2 - This paper describes a unique cross-phoneme speaker identification experiment, using deliberately mismatched phoneme sets for training and testing. The underlying goal is to identify features that represent broad individually unique characteristics rather than those that represent phonetic differences, as are more typical of modern speaker identification and verification systems. A wide range of features are proposed and evaluated within this context using a Gaussian Mixture Model framework. The results show that log-area ratio has better phonetic independence than MFCCs, that residual phase carries substantial speaker information, and identifies several other features that also have usefulness for speaker identification independent of phonetic content.
AB - This paper describes a unique cross-phoneme speaker identification experiment, using deliberately mismatched phoneme sets for training and testing. The underlying goal is to identify features that represent broad individually unique characteristics rather than those that represent phonetic differences, as are more typical of modern speaker identification and verification systems. A wide range of features are proposed and evaluated within this context using a Gaussian Mixture Model framework. The results show that log-area ratio has better phonetic independence than MFCCs, that residual phase carries substantial speaker information, and identifies several other features that also have usefulness for speaker identification independent of phonetic content.
UR - http://www.scopus.com/inward/record.url?scp=84872155626&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84872155626&partnerID=8YFLogxK
U2 - 10.1109/ICALIP.2012.6376788
DO - 10.1109/ICALIP.2012.6376788
M3 - Conference contribution
AN - SCOPUS:84872155626
SN - 9781467301718
T3 - ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings
SP - 1141
EP - 1145
BT - ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings
T2 - 2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012
Y2 - 16 July 2012 through 18 July 2012
ER -