학술논문

Towards Improved Power Management in Cloud GPUs
Document Type
Periodical
Source
IEEE Computer Architecture Letters IEEE Comput. Arch. Lett. Computer Architecture Letters. 22(2):141-144 Dec, 2023
Subject
Computing and Processing
Graphics processing units
Cloud computing
Power system management
Servers
Clocks
Monitoring
Performance evaluation
Power management
graphics processors
super (very large) computers
servers
design for power delivery limits
Language
ISSN
1556-6056
1556-6064
2473-2575
Abstract
As modern server GPUs are increasingly power intensive, better power management mechanisms can significantly reduce the power consumption, capital costs, and carbon emissions in large cloud datacenters. This letter uses diverse datacenter workloads to study the power management capabilities of modern GPUs. We find that current GPU management mechanisms have limited compatibility and monitoring support under cloud virtualization. They have sub-optimal, imprecise, and non-intuitive implementations of Dynamic Voltage and Frequency Scaling (DVFS) and power capping. Consequently, efficient GPU power management is not widely deployed in clouds today. To address these issues, we make actionable recommendations for GPU vendors and researchers.