Accelerating a computer vision algorithm on a mobile SoC using CPU-GPU co-processing ? A case study on face detection

초록

Recently, mobile devices have become equipped with sophisticated hardware components such as a heterogeneous multi-core SoC that consists of a CPU, GPU, and DSP. This provides opportunities to realize computationally-intensive computer vision applications using General Purpose GPU (GPGPU) programming tools such as Open Graphics Library for Embedded System (OpenGL ES) and Open Computing Language (OpenCL). As a case study, the aim of this paper was to accelerate the Viola-Jones face detection algorithm which is computationally expensive and limited in use on mobile devices due to irregular memory access and imbalanced workloads resulting in low performance regarding the processing time. To solve the above challenges, the proposed method of this study adapted CPU?GPU task parallelism, sliding window parallelism, scale image parallelism, dynamic allocation of threads, and local memory optimization to improve the computational time. The experimental results show that the proposed method achieved a 3.3~6.29 times increased computational time compared to the well-optimized OpenCV implementation on a CPU. The proposed method can be adapted to other applications using mobile GPUs and CPUs.

제목
Accelerating a computer vision algorithm on a mobile SoC using CPU-GPU co-processing ? A case study on face detection
저자
HAKIL KIM
학회명
2016 IEEE/ACM 38th IEEE International Conference on Mobile Software Engineering and Systems
개최지
미국 텍사스 오스틴
학회 개최일
2016-05-14 ~ 2016-05-22