Abstract
Dysarthria is a speech disorder often characterized by slow speech with reduced intelligibility. Automated assessment of the severity-level and intelligibility of dysarthric speech can improve the efficiency and reliability of clinical assessment as well as benefit automatic speech recognition systems (ASR). However, in order to evaluate them, there are not sentence-level severity and intelligibility label. We only have access to speaker-per-level severity and intelligibility labels. This is a problem as dysarthric talkers might be able to produce some intelligible utterances due to frequent use and short utterances. Therefore, label based analysis might not be very accurate. To address this problem, we explore methods to estimate the severity-level and speech intelligibility in dysarthria given discrete speaker-level labeling in the training set. To accomplish this, we propose a machine learning based method using one-dimensional Convolutional Neural Networks (1-D CNN). The TORGO dataset is used to test the performance of the proposed method, with the UASpeech dataset used for Transfer learning (TL). To evaluate, an Averaged Ranking Score (ARS) and intelligibility probability distribution are used. Our findings demonstrate that the proposed method can assess speakers based on severity-level and intelligibility to provide a more granular analysis of factors underlying speech intelligibility deficits associated with dysarthria.
Original language | English |
---|---|
Title of host publication | Speech and Computer - 23rd International Conference, SPECOM 2021, Proceedings |
Editors | Alexey Karpov, Rodmonga Potapova |
Pages | 670-679 |
Number of pages | 10 |
DOIs | |
State | Published - 2021 |
Event | 23rd International Conference on Speech and Computer, SPECOM 2021 - Virtual, Online Duration: Sep 27 2021 → Sep 30 2021 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 12997 LNAI |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 23rd International Conference on Speech and Computer, SPECOM 2021 |
---|---|
City | Virtual, Online |
Period | 9/27/21 → 9/30/21 |
Bibliographical note
Publisher Copyright:© 2021, Springer Nature Switzerland AG.
Funding
Acknowledgments. This work was supported by National Institutes of Health under NIDCD R15 DC017296-01.
Funders | Funder number |
---|---|
National Institutes of Health (NIH) | NIDCD R15 DC017296-01 |
Keywords
- Convolutional Neural Network
- Dysarthria
- Dysarthric speech severity
- Intelligibility assessment
- Transfer learning
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science