Controllable Diffusion Model for Generating Multimodal Biometric Images

Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

High-quality, diversified, and large-scale datasets are crucial for creating reliable deep-learning models for biometric applications. Unfortunately, there is a shortage of well-labeled data. This paper introduces a text-conditional biometric imaging generation framework, addressing the complexities associated with multi-modality considerations. The proposed framework harnesses cutting-edge diffusion probabilistic models to produce multi-modal biometric images at high resolutions, seamlessly aligning with biometric language prompts. The experimental results unequivocally validate the efficacy of the proposed framework in generating a diverse array of highly realistic synthetic biometric images while consistently maintaining a commendable level of fidelity when juxtaposed with their respective reference datasets. The contributions of this study offer substantial potential for propelling advancements in biometric imaging research. © 2025 IEEE.

키워드

data augmentationdiffusion modelsimage synthesismulti-model generation
제목
Controllable Diffusion Model for Generating Multimodal Biometric Images
저자
Nguyen, Quoc DungKim, Hakil
DOI
10.1109/CAI64502.2025.00143
발행일
2025
유형
Proceedings Paper
저널명
2025 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI
페이지
803 ~ 808