DNN Model Partitioning in AI-Based Mobile Services 


Vol. 47, No. 6, pp. 818-825, Jun. 2022
DOI: 10.7840/kics.2022.47.6.818


  Abstract

With the advancement of wireless network technology, real-time mobile vision applications such as object detection and image analysis on mobile devices are used in various fields. Such applications leverage mobile edge computing (MEC) to use high-accuracy deep learning models, but suffer from low quality of experience (QoE) due to network overhead. To tackle this, deep model partitioning has emerged, which splits inference processing between a mobile device and an MEC server. Existing works propose deep learning model partitioning algorithms that improve only one or two of three metrics (end-to-end latency, energy consumption, and frames per second (fps)) to enhance the QoE of mobile vision applications. In this paper, we propose an algorithm, RT-DMP, that jointly controls (i) the model partitioning point, (ii) the number of frames to be processed among the input frames, and (iii) the GPU clock frequency of the mobile device to improve all three metrics. Using trace-driven simulation, we verify that RT-DMP reduces energy consumption by 90.2% compared to the mobile processing algorithm and improves processed fps by 91.8% compared to the MEC algorithm.
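To make the idea of joint control concrete, the following is a minimal illustrative sketch, not the paper's RT-DMP algorithm. It exhaustively searches over a partition point, a GPU clock frequency, and a number of frames processed per second under a toy cost model. Everything in it is an assumption made for illustration: the function name `joint_search`, the per-layer inputs, the dynamic energy model `kappa * f^2 * cycles`, the sequential-processing constraint `frames * latency <= 1`, and the utility function that trades processed frames against energy.

```python
from itertools import product


def joint_search(mobile_cycles, server_secs, tx_bits, freqs_hz, input_fps,
                 bandwidth_bps, deadline_s, kappa=1e-27, tx_power_w=1.0,
                 energy_weight=1.0):
    """Brute-force joint choice of (partition point, GPU frequency, frames/s).

    Hypothetical toy model (all assumptions, not taken from the paper):
      mobile_cycles[i]: GPU cycles layer i needs on the mobile device
      server_secs[i]:   seconds layer i needs on the MEC server
      tx_bits[p]:       bits uploaded when splitting before layer p
                        (tx_bits[0] is the raw input, tx_bits[n] the result)
    Layers [0, p) run on the device, layers [p, n) run on the server.
    """
    n = len(mobile_cycles)
    best = None
    for p, f, k in product(range(n + 1), freqs_hz, range(1, input_fps + 1)):
        local_cycles = sum(mobile_cycles[:p])
        t_mobile = local_cycles / f
        t_tx = tx_bits[p] / bandwidth_bps
        t_server = sum(server_secs[p:])
        latency = t_mobile + t_tx + t_server
        # Assumed constraints: per-frame deadline, and k frames processed
        # one after another must fit into one second.
        if latency > deadline_s or k * latency > 1.0:
            continue
        # Assumed energy per frame: dynamic GPU energy plus radio energy.
        energy = kappa * f ** 2 * local_cycles + tx_power_w * t_tx
        # Assumed utility: reward processed frames, penalize energy spent.
        utility = k - energy_weight * k * energy
        if best is None or utility > best["utility"]:
            best = {"partition": p, "freq_hz": f, "frames": k,
                    "latency_s": latency, "energy_j": energy,
                    "utility": utility}
    return best  # None when no configuration satisfies the constraints


if __name__ == "__main__":
    choice = joint_search(
        mobile_cycles=[2e7, 2e7, 2e7],
        server_secs=[0.001, 0.001, 0.001],
        tx_bits=[8e6, 1e6, 4e6, 0],
        freqs_hz=[5e8, 1e9],
        input_fps=30,
        bandwidth_bps=1e8,
        deadline_s=0.2,
    )
    print(choice)
```

In this made-up instance the intermediate output after the first layer is much smaller than the raw input, so splitting after that layer beats both pure offloading (partition 0) and pure on-device processing (partition 3). A practical controller would replace the exhaustive loop and the toy cost model with measured traces and a more efficient decision rule.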


  Cite this article

[IEEE Style]

J. Lim and Y. Kim, "DNN Model Partitioning in AI-Based Mobile Services," The Journal of Korean Institute of Communications and Information Sciences, vol. 47, no. 6, pp. 818-825, 2022. DOI: 10.7840/kics.2022.47.6.818.

[ACM Style]

Jeong-A Lim and Yeongjin Kim. 2022. DNN Model Partitioning in AI-Based Mobile Services. The Journal of Korean Institute of Communications and Information Sciences, 47, 6, (2022), 818-825. DOI: 10.7840/kics.2022.47.6.818.

[KICS Style]

Jeong-A Lim and Yeongjin Kim, "DNN Model Partitioning in AI-Based Mobile Services," The Journal of Korean Institute of Communications and Information Sciences, vol. 47, no. 6, pp. 818-825, 6. 2022. (https://doi.org/10.7840/kics.2022.47.6.818)