A Study on Improving the Performance of Contact Centers' Speech Recognition Using Simulated Data

  • Bak, Huiyong
  • Na, Jonghwan
Citations

SCOPUS

0

초록

Contact center speech recognition enhances service efficiency. However, developing an ASR system for contact centers presents several challenges. Because of the long-form nature of contact center speech data, it is typically segmented into individual sentences for model training. However, ASR models trained on such segmented data often show degraded performance when transcribing real-world contact center speech. To address this issue, we propose two simple yet effective data augmentation methods: simulated downsizing and simulated randomness. The generated data enables ASR models to better handle original-form contact center speech. We evaluate the effectiveness of the proposed augmentation techniques on publicly available AI-HUB contact center speech datasets. Experimental results demonstrate that both methods significantly improve the recognition performance of ASR models on original-form contact center speech. © 2025 IEEE.

키워드

automatic speech recognitioncontact centersdata augmentation
제목
A Study on Improving the Performance of Contact Centers' Speech Recognition Using Simulated Data
저자
Bak, HuiyongNa, Jonghwan
DOI
10.1109/ICCE-Asia67487.2025.11263765
발행일
2025
유형
Conference paper
저널명
2025 IEEE/IEIE International Conference on Consumer Electronics-Asia, ICCE-Asia 2025