An Optimized EMG Encoder to minimize soft speech loss for speech to EMG conversions

Citations

WEB OF SCIENCE

1
Citations

SCOPUS

2

초록

Electromyography (EMG) to speech conversions is a standard problem to facilitate speech impaired individuals for communication via EMG (EMG to Speech). However, dataset acquisition is a cumbersome process and highly dependent on acquisition configuration. The availability of EMG signals can be made by tackling the inverse problem (Speech to EMG). In this paper, we propose an optimized EMG encoder which enhanced EMG feature extraction and in turn leads to improvements in soft speech units' representations. To validate the efficacy of our proposed enhanced EMG encoder, we utilized state-of-the-art speech to EMG generative adversarial network (STE-GANs). We witnessed a significant improvements in synthesized EMG signals after utilizing proposed EMG encoder which improves soft speech losses by producing enhanced speech units during training of STE-GANs. The extensive results are presented on public dataset.

키워드

Electromyography (EMG)Speech ProcessingGenerative Adversarial NetworksHuBERT
제목
An Optimized EMG Encoder to minimize soft speech loss for speech to EMG conversions
저자
Ullah, ShanKim, Deok-Hwan
DOI
10.1109/BigComp60711.2024.00041
발행일
2024
유형
Proceedings Paper
저널명
2024 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, IEEE BIGCOMP 2024
페이지
215 ~ 218