Abstract
This paper compares the performance of acoustic-to-articulatory inversion for both L1 and L2 speakers of English, as a function of the number of Gaussian Mixtures used in the inversion model. The inversion system is based on an HMM-GMM approach and is implemented on the Marquette Electromagnetic Articulography corpus of Mandarin Accented English (EMAMAE) including 20 native English speakers and 19 native Mandarin speakers of English. Results indicate that for Mandarin speakers 12 Gaussian mixtures and for L1 American English speakers 11 Gaussian mixtures give the lowest Root-Mean-Squared error (RMSE) and highest correlation between the estimated and actual articulatory pattern.
Original language | English |
---|---|
Title of host publication | 2018 IEEE International Symposium on Signal Processing and Information Technology, ISSPIT 2018 |
ISBN (Electronic) | 9781538675687 |
DOIs | |
State | Published - Jul 2 2018 |
Event | 2018 IEEE International Symposium on Signal Processing and Information Technology, ISSPIT 2018 - Louisville, United States Duration: Dec 6 2018 → Dec 8 2018 |
Publication series
Name | 2018 IEEE International Symposium on Signal Processing and Information Technology, ISSPIT 2018 |
---|---|
Volume | 2019-January |
Conference
Conference | 2018 IEEE International Symposium on Signal Processing and Information Technology, ISSPIT 2018 |
---|---|
Country/Territory | United States |
City | Louisville |
Period | 12/6/18 → 12/8/18 |
Bibliographical note
Publisher Copyright:© 2018 IEEE.
Keywords
- Acoustic-to-Articulatory Inversion
- Articulatory Features
- Electro-Magnetic Articulography
- Gaussian Mixture Model
- Hidden Markov Model
ASJC Scopus subject areas
- Signal Processing
- Computer Science Applications
- Computer Networks and Communications