PathoVoiceFAI : Enhancing Voice Pathology Classification in Human Voices

초록

Voice pathology classification has become one of the primary objectives of research in biomedical engineering. This paper proposes PathoVoiceFAI, a technique that enhances the multiclass pathology classification by leveraging the power of attention layers and appropriate fusioning technique to fuse the multimodal inputs. The preliminary results show that use of mid-level fusion with attention layers improves the classification accuracy by 5% in comparison to the standard decision-level fusion technique. This highlights the effect of powerful feature extraction in enhancing the classification outcomes for application in clinical environment.

제목
PathoVoiceFAI : Enhancing Voice Pathology Classification in Human Voices
저자
KIM DEOKHWAN
학회명
The 10th International Conference on Next Generation Computing (ICNGC2024)
개최지
Holy Angel University, Philippines
학회 개최일
2024-11-20 ~ 2024-11-23