Improved Recognition of the Speech of People with Parkinson's Who Stutter

Na, Jonghwan; Zheng, Xiuwen; Lee, Bowon; Hasegawa-Johnson, Mark

doi:10.1109/ICASSP49660.2025.10889229

Abstract

Stuttering is a speech disorder often associated with neurological conditions, including Parkinson's disease (PD). Despite advancements in modern automatic speech recognition (ASR) technologies, today's systems still face challenges in accurately recognizing dysarthric speech, particularly when stuttering is present. In this study, we propose a novel stuttered speech data augmentation approach to improve dysarthric speech recognition. We utilize typical speech data from LibriSpeech to generate artificial stuttered speech by applying Voice Activity Detection and Forced Alignment techniques to accurately identify word boundaries, and integrating an adaptive stuttering filter to simulate severe stuttering patterns. Additionally, dysarthric speech data from individuals with PD, collected by the Speech Accessibility Project (SAP), is integrated into the model. Our experimental results demonstrate that the proposed augmentation approach outperforms existing methods in enhancing the recognition of stuttered speech. Furthermore, fine-tuning the ASR systems with SAP data yields additional performance improvements for both stuttering and non-stuttering individuals with PD.
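The augmentation idea in the abstract (locate word boundaries, then insert stutter-like events at them) can be illustrated with a minimal sketch. It is not the authors' adaptive stuttering filter, whose details the abstract does not give. It assumes word boundaries are already available as sample indices (for example from a forced aligner) and imitates only one stuttering pattern, sound repetition, by repeating the onset of selected words. The function name `add_repetitions` and all of its parameters are hypothetical.

```python
from typing import List, Sequence, Tuple


def add_repetitions(
    samples: Sequence[float],
    word_bounds: Sequence[Tuple[int, int]],
    stutter_words: Sequence[int],
    fraction: float = 0.3,
    repeats: int = 2,
    pause_len: int = 0,
) -> List[float]:
    """Repeat the onset of selected words to imitate sound repetitions.

    Assumptions (not from the paper): `word_bounds` holds sorted,
    non-overlapping (start, end) sample indices, and `stutter_words`
    lists the indices into `word_bounds` that should be stuttered.
    """
    out: List[float] = []
    cursor = 0
    chosen = set(stutter_words)
    for idx, (start, end) in enumerate(word_bounds):
        # Copy the audio between the previous word and this one unchanged.
        out.extend(samples[cursor:start])
        if idx in chosen:
            # Take the first `fraction` of the word as the repeated onset.
            onset_len = max(1, int((end - start) * fraction))
            onset = list(samples[start:start + onset_len])
            for _ in range(repeats):
                out.extend(onset)
                out.extend([0.0] * pause_len)  # short silence after each repeat
        # Emit the full word after any inserted repetitions.
        out.extend(samples[start:end])
        cursor = end
    # Copy whatever audio follows the last word.
    out.extend(samples[cursor:])
    return out


if __name__ == "__main__":
    audio = [float(i) for i in range(10)]
    stuttered = add_repetitions(audio, [(2, 6)], [0], fraction=0.5, pause_len=1)
    print(len(audio), len(stuttered))
```

Because the transcript of the source utterance is known, a pipeline like this can pair each augmented waveform with a target transcript for ASR training. Whether the target should include the repeated fragments is a design choice that the abstract does not specify.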

Keywords

stuttering; automatic speech recognition; accessibility; dysarthria; data augmentation

Title: Improved Recognition of the Speech of People with Parkinson's Who Stutter

Authors: Na, Jonghwan; Zheng, Xiuwen; Lee, Bowon; Hasegawa-Johnson, Mark

DOI: 10.1109/ICASSP49660.2025.10889229

Publication date: 2025

Type: Proceedings Paper

Journal name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings