학술논문

6D Object Pose Estimation in Cluttered Scenes from RGB Images
Document Type
Academic Journal
Source
Journal of Computer Science and Technology. June, 2022, Vol. 37 Issue 3, p719, 12 p.
Subject
Algorithm
Algorithms
Language
English
ISSN
1000-9000
Abstract
We propose a feature-fusion network for pose estimation directly from RGB images without any depth information in this study. First, we introduce a two-stream architecture consisting of segmentation and regression streams. The segmentation stream processes the spatial embedding features and obtains the corresponding image crop. These features are further coupled with the image crop in the fusion network. Second, we use an efficient perspective-n-point (E-PnP) algorithm in the regression stream to extract robust spatial features between 3D and 2D keypoints. Finally, we perform iterative refinement with an end-to-end mechanism to improve the estimation performance. We conduct experiments on two public datasets of YCB-Video and the challenging Occluded-LineMOD. The results show that our method outperforms state-of-the-art approaches in both the speed and the accuracy. Keywords two-stream network, 6D pose estimation, fusion feature
1 Introduction The 6D object pose estimation has been widely used in computer vision tasks in daily life, such as robot grasping and manipulation, autonomous navigation and augmented/mixed reality, with [...]