학술논문

Revisiting Self-Similarity: Structural Embedding for Image Retrieval

Document Type

Conference

Author

Lee, Seongwon; Lee, Suhyeon; Seong, Hongje; Kim, Euntai

Source

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) CVPR Computer Vision and Pattern Recognition (CVPR), 2023 IEEE/CVF Conference on. :23412-23421 Jun, 2023

Subject

Computing and Processing
Convolutional codes
Visualization
Computer vision
Image coding
Fuses
Image retrieval
Image representation
Recognition: Categorization
detection
retrieval

Language

ISSN

2575-7075

Abstract

Despite advances in global image representation, existing image retrieval approaches rarely consider geometric structure during the global retrieval stage. In this work, we revisit the conventional self-similarity descriptor from a convolutional perspective, to encode both the visual and structural cues of the image to global image representation. Our proposed network, named Structural Embedding Network (SENet), captures the internal structure of the images and gradually compresses them into dense self-similarity descriptors while learning diverse structures from various images. These self-similarity descriptors and original image features are fused and then pooled into global embedding, so that global embedding can represent both geometric and visual cues of the image. Along with this novel structural embedding, our proposed network sets new state-of-the-art performances on several image retrieval benchmarks, convincing its robustness to look-alike distractors. The code and models are available: https://github.com/sungonce/SENet.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송