Emotion Recognition in Speech Signals through Graph Neural Networks Integration:A GCN-GAT Hybrid Model

Emotion Recognition in Speech Signals through Graph Neural Networks Integration:A GCN-GAT Hybrid Model

초록

Our research aims to enhance the modeling of speech signals for more effective extraction of node features and analysis of relationships between nodes. To achieve this, we model speech signals as cyclic or linear graphs. Our model combines layers of Graph Convolutional Networks (GCN) and Graph Attention Networks (GAT) to leverage their respective strengths in processing graph data. Specifically, we utilize GCN to aggregate information from neighboring nodes, which helps capture local relationships among nodes. Additionally, we employ GAT mechanisms to assign varying attention weights to different neighboring nodes, facilitating a better capture of complex global relationships between nodes. In our experiments, we validate our approach using the IEMOCAP dataset and demonstrate comparable performance to state-of-the-art models in emotion recognition tasks. This research outcome provides new insights and methodologies for further exploration in the field of speech signal processing.

제목
Emotion Recognition in Speech Signals through Graph Neural Networks Integration:A GCN-GAT Hybrid Model
제목 (타언어)
Emotion Recognition in Speech Signals through Graph Neural Networks Integration:A GCN-GAT Hybrid Model
저자
KIM DEOKHWAN
학회명
2024 한국차세대컴퓨팅학회 춘계학술대회
개최지
국립한국교통대학교
학회 개최일
2024-04-25 ~ 2024-04-27