A Mixed-Precision Approach to a Preconditioned Eigensolver for Efficient Density Functional Calculations on AI-Focused GPUs

Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

Recent advances in graphics processing units (GPUs) have substantially broadened the applicability of computational chemistry. Nevertheless, the double-precision (FP64) operations conventionally required in quantum chemistry remain inefficient on artificial intelligence (AI)-focused GPUs, which are predominantly optimized for lower precision arithmetic. Moreover, the limited memory capacity of such GPUs necessitates algorithmic adaptations to traditional approaches in density functional theory (DFT). In this work, we propose a systematic mixed precision strategy for the iterative matrix diagonalization procedure, which constitutes the principal bottleneck in real-space DFT calculations. By selectively employing single-precision (FP32) in most computational steps and even adopting brain floating point (BF16) in the preconditioning stage, we demonstrate that numerical accuracy can be preserved with negligible degradation. Building on these observations, we developed a mixed precision eigensolver and validated its performance across material systems spanning a wide range of sizes. Our results show that the proposed method achieves up to 10x speedup in the diagonalization step relative to FP64, while extending the feasible system size by approximately 50%. Comprehensive validation across multiple GPU architectures further confirms that AI-focused GPUs can deliver performance comparable to that of high-end high-performance computing (HPC)-focused GPUs, thereby substantially improving accessibility to large-scale electronic structure simulations.

키워드

GRAPHICAL PROCESSING UNITSQUANTUM-CHEMISTRYFOCK MATRIX
제목
A Mixed-Precision Approach to a Preconditioned Eigensolver for Efficient Density Functional Calculations on AI-Focused GPUs
저자
Woo, JeheonChoi, Sunghwan
DOI
10.1021/acs.jctc.5c01800
발행일
2026-02-24
유형
Article
저널명
Journal of Chemical Theory and Computation
22
4
페이지
1687 ~ 1698