Bridging the Knowledge Gap via Transformer-Based Multi-Layer Correlation Learning

Citations

WEB OF SCIENCE

0
Citations

SCOPUS

1

초록

We tackle a multi-layer knowledge distillation problem between deep models with heterogeneous architectures. The main challenges of that are the mismatches of the feature maps in terms of the resolution or semantic levels. To resolve this, we propose a novel transformer-based multi-layer correlation knowledge distillation (TMC-KD) method in order to bridge the knowledge gap between a pair of networks. Our method aims to narrow the relational knowledge gaps between teacher and student models by minimizing the local and global feature correlations. Based on extensive comparisons with the recent KD methods on classification and detection tasks, we prove the effectiveness and usefulness of our TMC-KD method.

키워드

Correlation learningimage classificationknowledge distillationmodel compressionobject detectiontransformer-based learningDISTILLATION
제목
Bridging the Knowledge Gap via Transformer-Based Multi-Layer Correlation Learning
저자
Bak, Hun-BeomBae, Seung-Hwan
DOI
10.1109/ACCESS.2024.3387859
발행일
2024
유형
Article
저널명
IEEE Access
12
페이지
53533 ~ 53548