Self-supervised Knowledge Distillation Using Singular Value Decomposition

Citations

WEB OF SCIENCE

106
Citations

SCOPUS

58

초록

To solve deep neural network (DNN)'s huge training dataset and its high computation issue, so-called teacher-student (T-S) DNN which transfers the knowledge of T-DNN to S-DNN has been proposed. However, the existing T-S-DNN has limited range of use, and the knowledge of T-DNN is insufficiently transferred to S-DNN. To improve the quality of the transferred knowledge from T-DNN, we propose a new knowledge distillation using singular value decomposition (SVD). In addition, we define a knowledge transfer as a self-supervised task and suggest a way to continuously receive information from T-DNN. Simulation results show that a S-DNN with a computational cost of 1/5 of the T-DNN can be up to 1.1% better than the T-DNN in terms of classification accuracy. Also assuming the same computational cost, our S-DNN outperforms the S-DNN driven by the state-of-the-art distillation with a performance advantage of 1.79%. code is available on https://github.com/sseung0703/SSKD_SVD.

키워드

Statistical methods and learningOptimization methodsRecognition: detectionCategorizationIndexingMatching
제목
Self-supervised Knowledge Distillation Using Singular Value Decomposition
저자
Lee, Seung HyunKim, Dae HaSong, Byung Cheol
DOI
10.1007/978-3-030-01231-1_21
발행일
2018
유형
Proceedings Paper
저널명
Lecture Notes in Computer Science
11210
페이지
339 ~ 354