Controllable Diffusion Model for Generating Multimodal Biometric Images

Nguyen, Quoc Dung; Kim, Hakil

doi:10.1109/CAI64502.2025.00143

상세 보기

Controllable Diffusion Model for Generating Multimodal Biometric Images

Nguyen, Quoc Dung;
Kim, Hakil

Citations

WEB OF SCIENCE

0

Citations

SCOPUS

0

초록

High-quality, diversified, and large-scale datasets are crucial for creating reliable deep-learning models for biometric applications. Unfortunately, there is a shortage of well-labeled data. This paper introduces a text-conditional biometric imaging generation framework, addressing the complexities associated with multi-modality considerations. The proposed framework harnesses cutting-edge diffusion probabilistic models to produce multi-modal biometric images at high resolutions, seamlessly aligning with biometric language prompts. The experimental results unequivocally validate the efficacy of the proposed framework in generating a diverse array of highly realistic synthetic biometric images while consistently maintaining a commendable level of fidelity when juxtaposed with their respective reference datasets. The contributions of this study offer substantial potential for propelling advancements in biometric imaging research. © 2025 IEEE.

키워드

data augmentation; diffusion models; image synthesis; multi-model generation

제목: Controllable Diffusion Model for Generating Multimodal Biometric Images

저자: Nguyen, Quoc Dung; Kim, Hakil

DOI: 10.1109/CAI64502.2025.00143

발행일: 2025

유형: Proceedings Paper

저널명: 2025 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI

페이지: 803 ~ 808