학술논문

LVOS: A Benchmark for Long-term Video Object Segmentation

Document Type

Conference

Author

Hong, Lingyi; Chen, Wenchao; Liu, Zhongying; Zhang, Wei; Guo, Pinxue; Chen, Zhaoyu; Zhang, Wenqiang

Source

2023 IEEE/CVF International Conference on Computer Vision (ICCV) ICCV Computer Vision (ICCV), 2023 IEEE/CVF International Conference on. :13434-13446 Oct, 2023

Subject

Computing and Processing
Signal Processing and Analysis
Computer vision
Analytical models
Codes
Heuristic algorithms
Computational modeling
Object segmentation
Benchmark testing

Language

ISSN

2380-7504

Abstract

Existing video object segmentation (VOS) benchmarks focus on short-term videos which just last about 3-5 seconds and where objects are visible most of the time. These videos are poorly representative of practical applications, and the absence of long-term datasets restricts further investigation of VOS on the application in realistic scenarios. So, in this paper, we present a new benchmark dataset named LVOS, which consists of 220 videos with a total duration of 421 minutes. To the best of our knowledge, LVOS is the first densely annotated long-term VOS dataset. The videos in our LVOS last 1.59 minutes on average, which is 20 times longer than videos in existing VOS datasets. Each video includes various attributes, especially challenges deriving from the wild, such as long-term reappearing and cross-temporal similar objeccts. Based on LVOS, we assess existing video object segmentation algorithms and propose a Diverse Dynamic Memory network (DDMemory) that consists of three complementary memory banks to exploit temporal information adequately. The experimental results demonstrate the strength and weaknesses of prior methods, pointing promising directions for further study. Data and code are available at https://lingyihongfd.github.io/lvos.github.io/.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송