Balanced knowledge distillation for one-stage object detector

Citations

WEB OF SCIENCE

4
Citations

SCOPUS

4

초록

The latest knowledge distillation (KD) methods have successfully supervised a student model to have a better representation using intermediate layers of a teacher model. However, the previous KD methods did not obtain generalized knowledge for various object scales from a one-stage object detector because the one-stage object detector has a structural property that uses several intermediate layers to extract objects of various scales. In other words, the previous KD methods could not distill and transfer knowledge to intermediate layers of one-stage object detectors in a balanced way. Therefore, we propose a shared knowledge encoder and an averaged prototype transfer to remove or mitigate the distillation and transfer imbalances that adversely affect the KD process. Experimental results show that the proposed KD method outperforms the state-of-the-art methods. For instance, the proposed method provides about 1.3% and 2.2% higher accuracy than the baseline on the PASCAL VOC and MS COCO datasets, respectively.(c) 2022 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

키워드

Knowledge distillationOne-stage object detector
제목
Balanced knowledge distillation for one-stage object detector
저자
Lee, SungwookLee, SeunghyunSong, Byung Cheol
DOI
10.1016/j.neucom.2022.05.087
발행일
2022-08-21
유형
Article
저널명
Neurocomputing
500
페이지
394 ~ 404