Academic Article

PiGLET: Pixel-Level Grounding of Language Expressions With Transformers
Document Type
Periodical
Source
IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(10):12206-12221, Oct. 2023
Subject
Computing and Processing
Bioengineering
Grounding
Visualization
Task analysis
Image segmentation
Annotations
Transformers
Natural languages
Panoptic segmentation
referring expression segmentation
visual grounding
Language
ISSN
0162-8828
2160-9292
1939-3539
Abstract
This paper proposes Panoptic Narrative Grounding, a spatially fine-grained and general formulation of the natural language visual grounding problem. We establish an experimental framework for the study of this new task, including new ground truth and metrics. We propose PiGLET, a novel multi-modal Transformer architecture, to tackle the Panoptic Narrative Grounding task and to serve as a stepping stone for future work. We exploit the intrinsic semantic richness of an image by including panoptic categories, and we approach visual grounding at a fine-grained level using segmentations. In terms of ground truth, we propose an algorithm to automatically transfer Localized Narratives annotations to specific regions in the panoptic segmentations of the MS COCO dataset. PiGLET achieves a performance of 63.2 absolute Average Recall points. By leveraging the rich language information in the Panoptic Narrative Grounding benchmark on MS COCO, PiGLET obtains an improvement of 0.4 Panoptic Quality points over its base method on the panoptic segmentation task. Finally, we demonstrate the generalizability of our method to other natural language visual grounding problems, such as Referring Expression Segmentation. PiGLET is competitive with the previous state of the art on RefCOCO, RefCOCO+, and RefCOCOg.