TY - GEN
T1 - Automatic song-type classification and speaker identification of norwegian ortolan bunting (Emberiza Hortulana) vocalizations
AU - Trawicki, Marek B.
AU - Johnson, Michael T.
AU - Osiejuk, Tomasz S.
N1 - Copyright:
Copyright 2008 Elsevier B.V., All rights reserved.
PY - 2005
Y1 - 2005
N2 - This paper presents an approach to song-type classification and speaker identification of Norwegian Ortolan Bunting (Emberiza Hortulana) vocalizations using traditional human speech processing methods. Hidden Markov Models (HMMs) are used for both tasks, with features including Mel-Frequency Cepstral Coefficients (MFCCs), log energy, and delta (velocity) and delta-delta (acceleration) coefficients. Vocalizations were tested using leave-one-out cross-validation. Classification accuracy for 5 song-types is 92.4%, dropping to 63.6% as the number and similarity of the songs increases. Song-type dependent speaker identification rates peak at 98.7%, with typical accuracies of 80-95% and a low end at 76.2% as the number of speakers increases. These experiments fit into a larger framework of research working towards methods for acoustic censusing of endangered species populations and more automated bioacoustic analysis methods.
AB - This paper presents an approach to song-type classification and speaker identification of Norwegian Ortolan Bunting (Emberiza Hortulana) vocalizations using traditional human speech processing methods. Hidden Markov Models (HMMs) are used for both tasks, with features including Mel-Frequency Cepstral Coefficients (MFCCs), log energy, and delta (velocity) and delta-delta (acceleration) coefficients. Vocalizations were tested using leave-one-out cross-validation. Classification accuracy for 5 song-types is 92.4%, dropping to 63.6% as the number and similarity of the songs increases. Song-type dependent speaker identification rates peak at 98.7%, with typical accuracies of 80-95% and a low end at 76.2% as the number of speakers increases. These experiments fit into a larger framework of research working towards methods for acoustic censusing of endangered species populations and more automated bioacoustic analysis methods.
UR - http://www.scopus.com/inward/record.url?scp=33749047066&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33749047066&partnerID=8YFLogxK
U2 - 10.1109/MLSP.2005.1532913
DO - 10.1109/MLSP.2005.1532913
M3 - Conference contribution
AN - SCOPUS:33749047066
SN - 0780395174
SN - 9780780395176
T3 - 2005 IEEE Workshop on Machine Learning for Signal Processing
SP - 277
EP - 282
BT - 2005 IEEE Workshop on Machine Learning for Signal Processing
T2 - 2005 IEEE Workshop on Machine Learning for Signal Processing
Y2 - 28 September 2005 through 30 September 2005
ER -