학술논문

ALPINE: Analog In-Memory Acceleration With Tight Processor Integration for Deep Learning

Document Type

Periodical

Author

Klein, J.; Boybat, I.; Qureshi, Y.M.; Dazzi, M.; Levisse, A.; Ansaloni, G.; Zapater, M.; Sebastian, A.; Atienza, D.

Source

IEEE Transactions on Computers IEEE Trans. Comput. Computers, IEEE Transactions on. 72(7):1985-1998 Jul, 2023

Subject

Computing and Processing
Hardware
Computational modeling
Computer architecture
Biological system modeling
In-memory computing
Reduced instruction set computing
Recurrent neural networks
AI accelerators
architectural exploration
artificial neural networks
gem5
neuromorphic computing

Language

ISSN

0018-9340
1557-9956
2326-3814

Abstract

Analog in-memory computing (AIMC) cores offers significant performance and energy benefits for neural network inference with respect to digital logic (e.g., CPUs). AIMCs accelerate matrix-vector multiplications, which dominate these applications’ run-time. However, AIMC-centric platforms lack the flexibility of general-purpose systems, as they often have hard-coded data flows and can only support a limited set of processing functions. With the goal of bridging this gap in flexibility, we present a novel system architecture that tightly integrates analog in-memory computing accelerators into multi-core CPUs in general-purpose systems. We developed a powerful gem5-based full system-level simulation framework into the gem5-X simulator, ALPINE, which enables an in-depth characterization of the proposed architecture. ALPINE allows the simulation of the entire computer architecture stack from major hardware components to their interactions with the Linux OS. Within ALPINE, we have defined a custom ISA extension and a software library to facilitate the deployment of inference models. We showcase and analyze a variety of mappings of different neural network types, and demonstrate up to 20.5x/20.8x performance/energy gains with respect to a SIMD-enabled ARM CPU implementation for convolutional neural networks, multi-layer perceptrons, and recurrent neural networks.

Online Access

Full Text (IEEE) Web of Science JCR 저널정보 Scopus Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송