학술논문
Dynamic Precision-Scalable Thermal Mapping Algorithm for Three Dimensional Systolic-Array Based Neural Network Accelerator
Document Type
Periodical
Source
IEEE Transactions on Consumer Electronics IEEE Trans. Consumer Electron. Consumer Electronics, IEEE Transactions on. 70(1):757-769 Feb, 2024
Subject
Language
ISSN
0098-3063
1558-4127
1558-4127
Abstract
Nowadays, the systolic-array based accelerator has been used widely for the neural-network applications. Multiple systolic-array based accelerator chips can be stacked by the 3D IC technology to improve the performance of the neural-network applications. However, the 3D accelerator increases the power density and causes the overheating. To avoid the overheating, the sacrifice of the performance for the 3D accelerator under the thermal limitations is important. In this work, a dynamic precision-scalable thermal mapping algorithm (DPSTM) is proposed to change the active processing elements with different data precisions in the 3D accelerators dynamically. The goal is to control the power density and peak temperature of the 3D accelerator. Compared with the related works, DPSTM can reduce 29%-77% and 7%-73% latencies in AlexNet and ResNet-18 with 92-95°C thermal limitations.