TY - JOUR
T1 - Generalized perceptual linear prediction features for animal vocalization analysis
AU - Clemins, Patrick J.
AU - Johnson, Michael T.
PY - 2006
Y1 - 2006
N2 - A new feature extraction model, generalized perceptual linear prediction (gPLP), is developed to calculate a set of perceptually relevant features for digital signal analysis of animal vocalizations. The gPLP model is a generalized adaptation of the perceptual linear prediction model, popular in human speech processing, which incorporates perceptual information such as frequency warping and equal loudness normalization into the feature extraction process. Since such perceptual information is available for a number of animal species, this new approach integrates that information into a generalized model to extract perceptually relevant features for a particular species. To illustrate, qualitative and quantitative comparisons are made between the species-specific model, generalized perceptual linear prediction (gPLP), and the original PLP model using a set of vocalizations collected from captive African elephants (Loxodonta africana) and wild beluga whales (Delphinapterus leucas). The models that incorporate perceptual information outperform the original human-based models in both visualization and classification tasks.
AB - A new feature extraction model, generalized perceptual linear prediction (gPLP), is developed to calculate a set of perceptually relevant features for digital signal analysis of animal vocalizations. The gPLP model is a generalized adaptation of the perceptual linear prediction model, popular in human speech processing, which incorporates perceptual information such as frequency warping and equal loudness normalization into the feature extraction process. Since such perceptual information is available for a number of animal species, this new approach integrates that information into a generalized model to extract perceptually relevant features for a particular species. To illustrate, qualitative and quantitative comparisons are made between the species-specific model, generalized perceptual linear prediction (gPLP), and the original PLP model using a set of vocalizations collected from captive African elephants (Loxodonta africana) and wild beluga whales (Delphinapterus leucas). The models that incorporate perceptual information outperform the original human-based models in both visualization and classification tasks.
UR - http://www.scopus.com/inward/record.url?scp=33745762631&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33745762631&partnerID=8YFLogxK
U2 - 10.1121/1.2203596
DO - 10.1121/1.2203596
M3 - Article
C2 - 16875249
AN - SCOPUS:33745762631
SN - 0001-4966
VL - 120
SP - 527
EP - 534
JO - Journal of the Acoustical Society of America
JF - Journal of the Acoustical Society of America
IS - 1
ER -