Academic Paper

Machine Learning for Power, Energy, and Thermal Management on Multicore Processors: A Survey
Document Type
Periodical
Source
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst.), 39(1):101-116, Jan. 2020
Subject
Components, Circuits, Devices and Systems
Computing and Processing
Machine learning
Thermal management
Power demand
Cooling
Optimization
Multicore processing
Hardware
Energy efficiency
machine learning (ML)
multicore systems
neural networks
on-chip resource management
power management
reinforcement learning
thermal management
Language
ISSN
0278-0070
1937-4151
Abstract
Due to high integration density and the roadblock of voltage scaling, modern multicore processors experience higher power densities than those of previous technology nodes. When unattended, this issue might lead to temperature hot spots, which in turn may cause nonuniform aging, accelerate chip failure, impair reliability, and reduce the performance of the system. This paper presents an overview of several research efforts that propose to use machine learning (ML) techniques for power and thermal management on single-core and multicore processors. Traditional power and thermal management techniques rely on a priori knowledge of the chip’s thermal model, as well as information about the workloads/applications to be executed (e.g., transient and average power consumption). However, this a priori information is not always available, and even when it is, it cannot reflect the spatial and temporal uncertainties and variations that come from the environment, the hardware, or the workloads/applications. In contrast, techniques based on ML can potentially adapt to varying system conditions and workloads, learning from past events in order to improve themselves as the environment changes, resulting in improved management decisions.