학술논문

A Fast Inter Mode Decision Approach Based on Machine Learning for Video Compressor
Document Type
Conference
Source
2024 IEEE 7th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), 2024 IEEE 7th. 7:1287-1290 Mar, 2024
Subject
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Robotics and Control Systems
Signal Processing and Analysis
Logistic regression
Machine learning algorithms
Costs
Quantization (signal)
Rate-distortion
Machine learning
Media
H.264/H.265
Mode Decision
Rate Distortion
Inter Frame
Block Partition
Language
ISSN
2689-6621
Abstract
When making mode decisions of inter frames in H.264 [1], the encoder must perform exhaustive calculations of the costs of Rate Distortion (RD) on all modes and find the optimal one. This procedure has high computational complexity for many redundant calculations. A Machine Learning based mode division strategy is proposed in this paper, which can accurately and quickly predict unnecessary modes and improve the overall encoding speed. The strategy uses the logistic regression model [2] and includes three features: the Quantization Parameter (QP), the variance, and the best RD cost of the 16x16 macroblock. The experimental results on different video scenes show that compared with the existing fast mode in x264, the speed is increased by 13.5% and the quality is improved by 0.15%.