An Optimized EMG Encoder to minimize soft speech loss for speech to EMG conversions

Ullah, Shan; Kim, Deok-Hwan

doi:10.1109/BigComp60711.2024.00041

상세 보기

An Optimized EMG Encoder to minimize soft speech loss for speech to EMG conversions

Ullah, Shan;
Kim, Deok-Hwan

Citations

WEB OF SCIENCE

1

Citations

SCOPUS

2

초록

Electromyography (EMG) to speech conversions is a standard problem to facilitate speech impaired individuals for communication via EMG (EMG to Speech). However, dataset acquisition is a cumbersome process and highly dependent on acquisition configuration. The availability of EMG signals can be made by tackling the inverse problem (Speech to EMG). In this paper, we propose an optimized EMG encoder which enhanced EMG feature extraction and in turn leads to improvements in soft speech units' representations. To validate the efficacy of our proposed enhanced EMG encoder, we utilized state-of-the-art speech to EMG generative adversarial network (STE-GANs). We witnessed a significant improvements in synthesized EMG signals after utilizing proposed EMG encoder which improves soft speech losses by producing enhanced speech units during training of STE-GANs. The extensive results are presented on public dataset.

키워드

Electromyography (EMG); Speech Processing; Generative Adversarial Networks; HuBERT

제목: An Optimized EMG Encoder to minimize soft speech loss for speech to EMG conversions

저자: Ullah, Shan; Kim, Deok-Hwan

DOI: 10.1109/BigComp60711.2024.00041

발행일: 2024

유형: Proceedings Paper

저널명: 2024 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, IEEE BIGCOMP 2024

페이지: 215 ~ 218