Increasing the Precision of Dysarthric Speech Intelligibility and Severity Level Estimate

Mohammad Soleymanpour, Michael T. Johnson, Jeffrey Berry

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Dysarthria is a speech disorder often characterized by slow speech with reduced intelligibility. Automated assessment of the severity-level and intelligibility of dysarthric speech can improve the efficiency and reliability of clinical assessment as well as benefit automatic speech recognition systems (ASR). However, in order to evaluate them, there are not sentence-level severity and intelligibility label. We only have access to speaker-per-level severity and intelligibility labels. This is a problem as dysarthric talkers might be able to produce some intelligible utterances due to frequent use and short utterances. Therefore, label based analysis might not be very accurate. To address this problem, we explore methods to estimate the severity-level and speech intelligibility in dysarthria given discrete speaker-level labeling in the training set. To accomplish this, we propose a machine learning based method using one-dimensional Convolutional Neural Networks (1-D CNN). The TORGO dataset is used to test the performance of the proposed method, with the UASpeech dataset used for Transfer learning (TL). To evaluate, an Averaged Ranking Score (ARS) and intelligibility probability distribution are used. Our findings demonstrate that the proposed method can assess speakers based on severity-level and intelligibility to provide a more granular analysis of factors underlying speech intelligibility deficits associated with dysarthria.

Original languageEnglish
Title of host publicationSpeech and Computer - 23rd International Conference, SPECOM 2021, Proceedings
EditorsAlexey Karpov, Rodmonga Potapova
Pages670-679
Number of pages10
DOIs
StatePublished - 2021
Event23rd International Conference on Speech and Computer, SPECOM 2021 - Virtual, Online
Duration: Sep 27 2021Sep 30 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12997 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference23rd International Conference on Speech and Computer, SPECOM 2021
CityVirtual, Online
Period9/27/219/30/21

Bibliographical note

Publisher Copyright:
© 2021, Springer Nature Switzerland AG.

Funding

Acknowledgments. This work was supported by National Institutes of Health under NIDCD R15 DC017296-01.

FundersFunder number
National Institutes of Health (NIH)NIDCD R15 DC017296-01

    Keywords

    • Convolutional Neural Network
    • Dysarthria
    • Dysarthric speech severity
    • Intelligibility assessment
    • Transfer learning

    ASJC Scopus subject areas

    • Theoretical Computer Science
    • General Computer Science

    Fingerprint

    Dive into the research topics of 'Increasing the Precision of Dysarthric Speech Intelligibility and Severity Level Estimate'. Together they form a unique fingerprint.

    Cite this