An Efficient and Fast Filter Pruning Method for Object Detection in Embedded Systems

Ko, Hyunjun; Kang, Jin-Ku; Kim, Yongwoo

doi:10.1109/AICAS59952.2024.10595873

상세 보기

An Efficient and Fast Filter Pruning Method for Object Detection in Embedded Systems

Ko, Hyunjun;
Kang, Jin-Ku;
Kim, Yongwoo

Citations

WEB OF SCIENCE

1

Citations

SCOPUS

1

초록

Recently, CNN-based networks have exhibited high performance in computer vision. On the other hand, due to the networks becoming deeper and wider, it is hard to implement the model in real-time embedded environments. To overcome the drawback, filter pruning has been widely studied for neural network compression. Filter pruning does not need any special hardware or software because it removes filters of CNN and accelerates inference without any special software or hardware. In this paper, we proposed efficient and fast filter pruning (EFFP), which focuses on reducing the training computation resources and searching optimal pruned networks. The success stems from two significant improvements upon other pruning methods. (1) Short training time: In the pruning stage, we make redundant filters to zero to make the output feature map the same as a lightweight model, and (2) adjust the change of redundancy using regrowing: It is difficult to get an optimal pruned model by pruning redundant filters at once. Therefore, we use the pruning/regrowing method to gradually remove unimportant filters to avoid permanently pruning important filters to get an optimal lightweight model. Experimental results indicate that EFFP can reduce the FLOPs and parameters more efficiently and faster than other pruning methods on the object detection model. The inference time is measured on NVIDIA Jetson Xavier NX. As a result, we improve mAP and inference time by a maximum of 45 % compared to other pruning methods.

키워드

CNN; Network compression; Filter pruning; Object detection; Inference time

제목: An Efficient and Fast Filter Pruning Method for Object Detection in Embedded Systems

저자: Ko, Hyunjun; Kang, Jin-Ku; Kim, Yongwoo

DOI: 10.1109/AICAS59952.2024.10595873

발행일: 2024

유형: Proceedings Paper

저널명: 2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024

페이지: 204 ~ 207