Generalized perceptual linear prediction features for animal vocalization analysis

Patrick J. Clemins, Michael T. Johnson

Research output: Contribution to journalArticlepeer-review

32 Scopus citations

Abstract

A new feature extraction model, generalized perceptual linear prediction (gPLP), is developed to calculate a set of perceptually relevant features for digital signal analysis of animal vocalizations. The gPLP model is a generalized adaptation of the perceptual linear prediction model, popular in human speech processing, which incorporates perceptual information such as frequency warping and equal loudness normalization into the feature extraction process. Since such perceptual information is available for a number of animal species, this new approach integrates that information into a generalized model to extract perceptually relevant features for a particular species. To illustrate, qualitative and quantitative comparisons are made between the species-specific model, generalized perceptual linear prediction (gPLP), and the original PLP model using a set of vocalizations collected from captive African elephants (Loxodonta africana) and wild beluga whales (Delphinapterus leucas). The models that incorporate perceptional information outperform the original human-based models in both visualization and classification tasks.

Original languageEnglish
Pages (from-to)527-534
Number of pages8
JournalJournal of the Acoustical Society of America
Volume120
Issue number1
DOIs
StatePublished - 2006

ASJC Scopus subject areas

  • Arts and Humanities (miscellaneous)
  • Acoustics and Ultrasonics

Fingerprint

Dive into the research topics of 'Generalized perceptual linear prediction features for animal vocalization analysis'. Together they form a unique fingerprint.

Cite this