Stable Quantization-Aware Training with Adaptive Gradient Clipping

Citations

SCOPUS

3

초록

Quantization-Aware Training (QAT) uses batch normalization (BN) folding during fine-tuning, so it may not use the normalization effects of the BN layer. HAWQ, which achieved SOTA with QAT, has significant accuracy degradation as the training becomes longer. In this paper, we apply Adaptive Gradient Clipping (AGC) to stable quantization-aware training and improve accuracy by adding Dropout. Moreover, we have ablation studies about AGC. © 2023 IEEE.

제목
Stable Quantization-Aware Training with Adaptive Gradient Clipping
저자
Park, JihoonLee, SeunghyunSong, Byung-Cheol
DOI
10.1109/ICEIC57457.2023.10049939
발행일
2023
유형
Conference paper
저널명
2023 International Conference on Electronics, Information, and Communication, ICEIC 2023