An Effective Learning Method for Automatic Speech Recognition in Korean CI Patients' Speech

Jeong, Jiho; Mondol, S. I. M. M. Raton; Kim, Yeon Wook; Lee, Sangmin

doi:10.3390/electronics10070807

상세 보기

An Effective Learning Method for Automatic Speech Recognition in Korean CI Patients' Speech

Jeong, Jiho;
Mondol, S. I. M. M. Raton;
Kim, Yeon Wook;
Lee, Sangmin

Citations

WEB OF SCIENCE

1

Citations

SCOPUS

2

초록

The automatic speech recognition (ASR) model usually requires a large amount of training data to provide better results compared with the ASR models trained with a small amount of training data. It is difficult to apply the ASR model to non-standard speech such as that of cochlear implant (CI) patients, owing to privacy concerns or difficulty of access. In this paper, an effective finetuning and augmentation ASR model is proposed. Experiments compare the character error rate (CER) after training the ASR model with the basic and the proposed method. The proposed method achieved a CER of 36.03% on the CI patient's speech test dataset using only 2 h and 30 min of training data, which is a 62% improvement over the basic method.

키워드

speech recognition; finetuning; CI patients; augmentation

제목: An Effective Learning Method for Automatic Speech Recognition in Korean CI Patients' Speech

저자: Jeong, Jiho; Mondol, S. I. M. M. Raton; Kim, Yeon Wook; Lee, Sangmin

DOI: 10.3390/electronics10070807

발행일: 2021-04

유형: Article

저널명: ELECTRONICS

권: 10

호: 7