Academic paper

GRIT: GAN Residuals for Paired Image-to-Image Translation
Document Type
Conference
Source
2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp. 4953-4963, Jan. 2024
Subject
Computing and Processing
Training
Image synthesis
Impedance matching
Image edge detection
Noise
Proposals
Spatial resolution
Algorithms
Generative models for image, video, 3D, etc.
Computational photography, image and video synthesis
Language
English
ISSN
2642-9381
Abstract
Current image-to-image translation (I2I) frameworks rely heavily on reconstruction losses, where the output must match a given ground-truth image. An adversarial loss is commonly used as a secondary term, mainly to add realism to the output. Although I2I translation frameworks have more supervisory signals than unconditional GANs, their outputs show more artifacts and do not reach the same level of realism. We study this photo-realism gap between I2I translation and unconditional GAN frameworks. Based on our observations, we propose a modified architecture and training objective to close it. Our proposal relaxes the role of reconstruction losses so that they act as regularizers instead of doing all the heavy lifting, as is common in current I2I frameworks. Furthermore, the proposed formulation decouples the optimization of the reconstruction and adversarial objectives and removes pixel-wise constraints on the final output, allowing a set of stochastic but realistic variations of any target output image. Our project page can be accessed at cs.umd.edu/~sakshams/grit.
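The abstract's key idea — a reconstruction loss that only regularizes an intermediate prediction, while the adversarial loss acts on a residual-augmented final output free of pixel-wise constraints — can be sketched as follows. This is a minimal, framework-agnostic illustration in NumPy, not the authors' actual implementation; the names `base_out`, `residual`, and `disc` (a stand-in critic) are assumptions for illustration only.

```python
import numpy as np

def grit_style_losses(base_out, residual, target, disc):
    """Hypothetical sketch of the decoupled objective described in the abstract.

    The reconstruction (L1) loss constrains only the base prediction, acting
    as a regularizer. The adversarial loss sees base_out + residual, so the
    final output carries no pixel-wise constraint and can vary stochastically.
    """
    final_out = base_out + residual
    # Reconstruction loss on the base prediction only (regularizer).
    rec_loss = np.abs(base_out - target).mean()
    # Generator-side adversarial loss: maximize the critic score on the
    # final output (disc is any callable returning per-sample realism scores).
    adv_loss = -disc(final_out).mean()
    return rec_loss, adv_loss, final_out
```

With a toy critic such as `disc = lambda x: x.mean(axis=(1, 2, 3))`, one can verify that a nonzero residual changes the adversarial term while leaving the reconstruction term untouched — the decoupling the abstract describes.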