Real-time DNN Model Partitioning for QoE Enhancement in Mobile Vision Applications

Citations

WEB OF SCIENCE

13
Citations

SCOPUS

15

초록

As deep learning technology advances, mobile vision applications such as augmented reality or autonomous vehicles are widespread. The quality of experience (QoE) of such applications highly depends on hardware specification of mobile device, dynamic service requests, stochastic network status and characteristics of DNN model. In this paper, we propose an algorithm called RT-DMP that jointly optimizes DNN model partitioning and process/network resources adapting to system dynamics by leveraging virtual queue-based Lyapunov optimization framework. The RT-DMP jointly makes decisions on (i) partition point between a mobile device and an MEC server, (ii) mobile GPU clock frequency, and (iii) transmission rate through the wireless network every time slot. We theoretically show that RT-DMP optimally strikes the balance among three QoE metrics that are energy consumption, throughput and end-to-end latency, which has not been addressed in existing studies. Finally, we demonstrate the performance and feasibility of RT-DMP via trace-driven simulations and real testbed based on Nvidia Jetson TX2 and a high-end MEC server.

키워드

CLOUD
제목
Real-time DNN Model Partitioning for QoE Enhancement in Mobile Vision Applications
저자
Lim, Jeong-AKim, Yeongjin
DOI
10.1109/SECON55815.2022.9918600
발행일
2022
유형
Proceedings Paper
저널명
2022 19TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON)
페이지
407 ~ 415