Resumen
Dysarthria is a speech disorder often characterized by slow speech with reduced intelligibility. This preliminary study investigates suprasegmental characteristics between typical and dysarthric speakers at varying severity levels, with the long-term goal of improving methods for dysarthric speech synthesis/augmentation and enhancement. First, we aim to analyze phonemes, speaking rate and pause characteristics of typical and dysarthric speech using the phoneme- and word-level alignment information extracted by Montreal Forced Aligner (MFA). Then, pitch and intensity declination trends and range analysis are conducted. The pitch and intensity declination are measured by fitting a regression line. These analyses are conducted on dysarthric speech in TORGO, containing 8 dysarthric speakers involved with cerebral palsy or amyotrophic lateral sclerosis and 7 age- and gender-matched typical speakers. These results are important for the development of dysarthric speech synthesis, augmentation to statistically model and evaluate characteristics such as pause, speaking rate, pitch, and intensity.
| Idioma original | English |
|---|---|
| Título de la publicación alojada | 2021 11th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2021 |
| Páginas | 52-56 |
| Número de páginas | 5 |
| ISBN (versión digital) | 9781665427869 |
| DOI | |
| Estado | Published - 2021 |
| Evento | 11th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2021 - Virtual, Bucharest, Romania Duración: oct 13 2021 → oct 15 2021 |
Serie de la publicación
| Nombre | 2021 11th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2021 |
|---|
Conference
| Conference | 11th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2021 |
|---|---|
| País/Territorio | Romania |
| Ciudad | Virtual, Bucharest |
| Período | 10/13/21 → 10/15/21 |
Nota bibliográfica
Publisher Copyright:© 2021 IEEE.
Financiación
ACKNOWLEDGEMENT This work was supported by NationalInstitutes of Health underNIDCD R15 DC017296-01. This work was supported by NationalInstitutesof Health underNIDCD R15 DC017296-01.
| Financiadores | Número del financiador |
|---|---|
| NationalInstitutesof Health underNIDCD | |
| National Institutes of Health (NIH) | R15 DC017296-01 |
ASJC Scopus subject areas
- Language and Linguistics
- Human-Computer Interaction
- Signal Processing
- Safety, Risk, Reliability and Quality
- Communication