학술논문

Nimbus: Towards Latency-Energy Efficient Task Offloading for AR Services
Document Type
Periodical
Source
IEEE Transactions on Cloud Computing IEEE Trans. Cloud Comput. Cloud Computing, IEEE Transactions on. 11(2):1530-1545 Jun, 2023
Subject
Computing and Processing
Communication, Networking and Broadcast Technologies
Task analysis
Cloud computing
Performance evaluation
Image edge detection
Real-time systems
Edge computing
Energy consumption
augmented reality
optimization
resource management
cloud computing
Language
ISSN
2168-7161
2372-0018
Abstract
Widespread adoption of mobile augmented reality (AR) and virtual reality (VR) applications depends on their smoothness and immersiveness. Modern AR applications applying computationally intensive computer vision algorithms can burden today's mobile devices, and cause high energy consumption and/or poor performance. To tackle this challenge, it is possible to offload part of the computation to nearby devices at the edge. However, this calls for smart task placement strategies in order to efficiently use the resources of the edge infrastructure. In this paper, we introduce Nimbus — a task placement and offloading solution for a multi-tier, edge-cloud infrastructure where deep learning tasks are extracted from the AR application pipeline and offloaded to nearby GPU-powered edge devices. Our aim is to minimize the latency experienced by end-users and the energy costs on mobile devices. Our multifaceted evaluation, based on benchmarked performance of AR tasks, shows the efficacy of our solution. Overall, Nimbus reduces the task latency by $\sim 4\times$∼4× and the energy consumption by $\sim$∼77% for real-time object detection in AR applications. We also benchmark three variants of our offloading algorithm, disclosing the trade-off of centralized versus distributed execution.