Academic paper

FlexBlock: A Flexible DNN Training Accelerator With Multi-Mode Block Floating Point Support
Document Type
Periodical
Source
IEEE Transactions on Computers, 72(9):2522-2535, Sep. 2023
Subject
Computing and Processing
Training
Tensors
Hardware
Arithmetic
Parallel processing
Deep learning
Scalability
Block floating point
DNN training accelerator
low precision training
precision scalability
Language
English
ISSN
0018-9340
1557-9956
2326-3814
Abstract
When training deep neural networks (DNNs), expensive floating point arithmetic units are used in GPUs or custom neural processing units (NPUs). To reduce the burden of floating point arithmetic, the community has started exploring more efficient data representations, e.g., block floating point (BFP). The BFP format allows a group of values to share an exponent, which effectively reduces the memory footprint and enables cheaper fixed point arithmetic for multiply-accumulate (MAC) operations. However, existing BFP-based DNN accelerators target a specific precision, making them less versatile. In this paper, we present FlexBlock, a DNN training accelerator with three BFP modes, possibly different among activation, weight, and gradient tensors. By configuring FlexBlock to a lower BFP precision, the number of MACs handled by the core increases by up to 4× in 8-bit mode or 16× in 4-bit mode compared to 16-bit mode. To reach this theoretical upper bound, FlexBlock maximizes core utilization across various precision levels and layer types, and allows dynamic precision control to keep throughput at its peak without sacrificing training accuracy. We evaluate the effectiveness of FlexBlock using representative DNNs on the CIFAR, ImageNet, and WMT14 datasets. As a result, training in FlexBlock significantly improves training speed by 1.5∼5.3× and energy efficiency by 2.4∼7.0× compared to other training accelerators.
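The abstract's core idea, a block of values sharing one exponent while each value keeps only a small fixed-point mantissa, can be sketched in software. This is a minimal illustration of generic BFP quantization, not FlexBlock's hardware datapath; the function name `to_bfp` and the block-wise max-exponent policy are assumptions for illustration only.

```python
import math

def to_bfp(block, mantissa_bits):
    """Quantize a block of floats to block floating point (BFP):
    all values share one exponent; each keeps a signed fixed-point
    mantissa of `mantissa_bits` bits. Returns the dequantized floats
    so the rounding error can be compared against the originals.
    (Illustrative sketch, not FlexBlock's actual scheme.)"""
    # Shared exponent: taken from the largest magnitude in the block,
    # so the biggest value just fits in the mantissa range.
    max_mag = max(abs(v) for v in block)
    if max_mag == 0.0:
        return [0.0] * len(block)
    shared_exp = math.floor(math.log2(max_mag))
    # Step size of one mantissa LSB under the shared exponent.
    scale = 2.0 ** (shared_exp - (mantissa_bits - 1))
    limit = 2 ** (mantissa_bits - 1) - 1
    # Round each value to a fixed-point mantissa and clamp to range;
    # a hardware MAC would multiply these integer mantissas directly.
    mantissas = [max(-limit, min(limit, round(v / scale))) for v in block]
    # Dequantize: mantissa times shared scale recovers the approximation.
    return [m * scale for m in mantissas]
```

Small values in a block with a large outlier lose precision under the shared exponent, which is why accelerators like FlexBlock expose multiple mantissa widths rather than a single fixed precision.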