An Optimized EMG Encoder to minimize soft speech loss for speech to EMG conversions

KIM DEOKHWAN

상세 보기

An Optimized EMG Encoder to minimize soft speech loss for speech to EMG conversions

KIM DEOKHWAN

초록

Electromyography (EMG) to speech conversions is a standard problem to facilitate speech impaired individuals for communication via EMG (EMG to Speech). However, dataset acquisition is a cumbersome process and highly dependent on acquisition configuration. The availability of EMG signals can be made by tackling the inverse problem (Speech to EMG). In this paper, we propose an optimized EMG encoder which enhanced EMG feature extraction and in turn leads to improvements in soft speech units’ representations. To validate the efficacy of our proposed enhanced EMG encoder, we utilized state-of-the-art speech to EMG generative adversarial network (STE-GANs). We witnessed a significant improvements in synthesized EMG signals after utilizing proposed EMG encoder which improves soft speech losses by producing enhanced speech units during training of STE-GANs. The extensive results are presented on public dataset.

제목: An Optimized EMG Encoder to minimize soft speech loss for speech to EMG conversions

저자: KIM DEOKHWAN

학회명: 2024 IEEE International Conference on Big Data and Smart Computing (BigComp)

개최지: Sukosol Hotel, Bangkok

학회 개최일: 2024-02-18 ~ 2024-02-21