Fine-tuning Strategies for Automatic Speech Recognition of Low-Resource Speech with Autism Spectrum Disorder

Park, Yeseul; Lee, Bowon

doi:10.21437/Interspeech.2025-2535

상세 보기

Fine-tuning Strategies for Automatic Speech Recognition of Low-Resource Speech with Autism Spectrum Disorder

Park, Yeseul;
Lee, Bowon

Citations

WEB OF SCIENCE

0

Citations

SCOPUS

0

초록

Individuals with autism spectrum disorder (ASD) exhibit unique speech patterns that challenge conventional automatic speech recognition (ASR) systems. However, research on ASD-adapted ASR models remains limited. This study explores fine-tuning strategies for ASD-specific ASR models using Whisper, comparing full fine-tuning, selective fine-tuning, adapter tuning, and LoRA-based fine-tuning. Experiments using a small-scale Korean ASD speech dataset demonstrate that adapter tuning and LoRA significantly reduce the character error rate (CER) while reducing trainable parameters. In case of Whisper-small, adapter tuning and LoRA improve the CER by 7.22% and 10.14% over full fine-tuning, respectively. Furthermore, LoRA improved CER by 10.35% and 10.15% with Whisper-base and Whisper-large-v2 models compared to full fine-tuning. These results demonstrate the adaptation efficiency and effectiveness of LoRA for low resource ASD speech dataset.

키워드

Speech recognition; accessibility; autism spectrum disorder; parameter-efficient learning

제목: Fine-tuning Strategies for Automatic Speech Recognition of Low-Resource Speech with Autism Spectrum Disorder

저자: Park, Yeseul; Lee, Bowon

DOI: 10.21437/Interspeech.2025-2535

발행일: 2025

유형: Proceedings Paper

저널명: INTERSPEECH 2025

페이지: 1858 ~ 1862