상세 보기
Stable Quantization-Aware Training with Adaptive Gradient Clipping
- Park, Jihoon;
- Lee, Seunghyun;
- Song, Byung-Cheol
Citations
SCOPUS
3초록
Quantization-Aware Training (QAT) uses batch normalization (BN) folding during fine-tuning, so it may not use the normalization effects of the BN layer. HAWQ, which achieved SOTA with QAT, has significant accuracy degradation as the training becomes longer. In this paper, we apply Adaptive Gradient Clipping (AGC) to stable quantization-aware training and improve accuracy by adding Dropout. Moreover, we have ablation studies about AGC. © 2023 IEEE.
- 제목
- Stable Quantization-Aware Training with Adaptive Gradient Clipping
- 저자
- Park, Jihoon; Lee, Seunghyun; Song, Byung-Cheol
- 발행일
- 2023
- 유형
- Conference paper
- 저널명
- 2023 International Conference on Electronics, Information, and Communication, ICEIC 2023