Abstract
Dysarthria is a speech disorder often characterized by slow speech with reduced intelligibility. This preliminary study investigates suprasegmental characteristics between typical and dysarthric speakers at varying severity levels, with the long-term goal of improving methods for dysarthric speech synthesis/augmentation and enhancement. First, we aim to analyze phonemes, speaking rate and pause characteristics of typical and dysarthric speech using the phoneme- and word-level alignment information extracted by Montreal Forced Aligner (MFA). Then, pitch and intensity declination trends and range analysis are conducted. The pitch and intensity declination are measured by fitting a regression line. These analyses are conducted on dysarthric speech in TORGO, containing 8 dysarthric speakers involved with cerebral palsy or amyotrophic lateral sclerosis and 7 age- and gender-matched typical speakers. These results are important for the development of dysarthric speech synthesis, augmentation to statistically model and evaluate characteristics such as pause, speaking rate, pitch, and intensity.
| Original language | English |
|---|---|
| Title of host publication | 2021 11th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2021 |
| Pages | 52-56 |
| Number of pages | 5 |
| ISBN (Electronic) | 9781665427869 |
| DOIs | |
| State | Published - 2021 |
| Event | 11th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2021 - Virtual, Bucharest, Romania Duration: Oct 13 2021 → Oct 15 2021 |
Publication series
| Name | 2021 11th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2021 |
|---|
Conference
| Conference | 11th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2021 |
|---|---|
| Country/Territory | Romania |
| City | Virtual, Bucharest |
| Period | 10/13/21 → 10/15/21 |
Bibliographical note
Publisher Copyright:© 2021 IEEE.
Funding
ACKNOWLEDGEMENT This work was supported by NationalInstitutes of Health underNIDCD R15 DC017296-01. This work was supported by NationalInstitutesof Health underNIDCD R15 DC017296-01.
| Funders | Funder number |
|---|---|
| NationalInstitutesof Health underNIDCD | |
| National Institutes of Health (NIH) | R15 DC017296-01 |
Keywords
- Dysarthria
- F0 declination
- Forced alignment
- Speech analysis
ASJC Scopus subject areas
- Language and Linguistics
- Human-Computer Interaction
- Signal Processing
- Safety, Risk, Reliability and Quality
- Communication