A Study on Improving the Performance of Contact Centers' Speech Recognition Using Simulated Data

Bak, Huiyong; Na, Jonghwan

doi:10.1109/ICCE-Asia67487.2025.11263765

상세 보기

A Study on Improving the Performance of Contact Centers' Speech Recognition Using Simulated Data

Bak, Huiyong;
Na, Jonghwan

Citations

SCOPUS

0

초록

Contact center speech recognition enhances service efficiency. However, developing an ASR system for contact centers presents several challenges. Because of the long-form nature of contact center speech data, it is typically segmented into individual sentences for model training. However, ASR models trained on such segmented data often show degraded performance when transcribing real-world contact center speech. To address this issue, we propose two simple yet effective data augmentation methods: simulated downsizing and simulated randomness. The generated data enables ASR models to better handle original-form contact center speech. We evaluate the effectiveness of the proposed augmentation techniques on publicly available AI-HUB contact center speech datasets. Experimental results demonstrate that both methods significantly improve the recognition performance of ASR models on original-form contact center speech. © 2025 IEEE.

키워드

automatic speech recognition; contact centers; data augmentation

제목: A Study on Improving the Performance of Contact Centers' Speech Recognition Using Simulated Data

저자: Bak, Huiyong; Na, Jonghwan

DOI: 10.1109/ICCE-Asia67487.2025.11263765

발행일: 2025

유형: Conference paper

저널명: 2025 IEEE/IEIE International Conference on Consumer Electronics-Asia, ICCE-Asia 2025