학술논문

RDONet: Rate-Distortion Optimized Learned Image Compression with Variable Depth
Document Type
Conference
Source
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) CVPRW Computer Vision and Pattern Recognition Workshops (CVPRW), 2022 IEEE/CVF Conference on. :1758-1762 Jun, 2022
Subject
Computing and Processing
Measurement
Video coding
Training
Image coding
Rate-distortion
Video compression
Pattern recognition
Language
ISSN
2160-7516
Abstract
Rate-distortion optimization (RDO) is responsible for large gains in image and video compression. While RDO is a standard tool in traditional image and video coding, it is not yet widely used in novel end-to-end trained neural methods. The major reason is that the decoding function is trained once and does not have free parameters. In this paper, we present RDONet, a network containing state-of-the-art components, which is perceptually optimized and capable of rate-distortion optimization. With this network, we are able to outperform VVC Intra on MS-SSIM and two different perceptual LPIPS metrics. This paper is part of the CLIC challenge, where we participate under the team name RDONet FAU.