학술논문

YOLOv3- Tiny's Weight Size Reduction using Pruning and Quantization

Document Type

Conference

Author

Endrawati, D.N.; Ibad, S.I.; Syafalni, I.; Sutisna, N.; Mulyawan, R.; Adiono, T.

Source

2021 15th International Conference on Telecommunication Systems, Services, and Applications (TSSA) Telecommunication Systems, Services, and Applications (TSSA), 2021 15th International Conference on. :1-5 Nov, 2021

Subject

Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Fields, Waves and Electromagnetics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Training
Quantization (signal)
Image edge detection
Memory management
Real-time systems
Telecommunications
Iterative methods
YOLOv3-Tiny
Pruning
Quantization

Language

Abstract

CNN has been widely applied to various technology fields, like real-time edge devices. One of the architecture detection objects based on CNN is YOLO, a real-time detection object with a high level of accuracy. Detection objects on the edge device itself are expected to have speed, accuracy, and less memory needed to carry out the detection process. In this paper, YOLOv3- Tiny will be pruned and quantized to reduce the size of the data from the weight without significant reduction of accuracy. Experimental results show that YOLOv3- Tiny weight data decreased up to 85.7% compared to the weight data before the pruning process, while in the quantization mode. Moreover, the iterative pruning method can reduce the weight up to 43%.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송