학술논문

Understanding 3D Object Articulation in Internet Videos
Document Type
Conference
Source
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) CVPR Computer Vision and Pattern Recognition (CVPR), 2022 IEEE/CVF Conference on. :1589-1599 Jun, 2022
Subject
Computing and Processing
Computers
Computer vision
Three-dimensional displays
Shape
Surveillance
Internet
Pattern recognition
3D from single images; Scene analysis and understanding
Language
ISSN
2575-7075
Abstract
We propose to investigate detecting and characterizing the 3D planar articulation of objects from ordinary RGB videos. While seemingly easy for humans, this problem poses many challenges for computers. Our approach is based on a top-down detection system that finds planes that can be articulated. This approach is followed by optimizing for a 3D plane that explains a sequence of detected articulations. We show that this system can be trained on a combination of videos and 3D scan datasets. When tested on a dataset of challenging Internet videos and the Charades dataset, our approach obtains strong performance.