학술논문

PVGNet: A Bottom-Up One-Stage 3D Object Detector with Integrated Multi-Level Features

Document Type

Conference

Author

Miao, Zhenwei; Chen, Jikai; Pan, Hongyu; Zhang, Ruiwen; Liu, Kaixuan; Hao, Peihan; Zhu, Jun; Wang, Yang; Zhan, Xin

Source

2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) CVPR Computer Vision and Pattern Recognition (CVPR), 2021 IEEE/CVF Conference on. :3278-3287 Jun, 2021

Subject

Computing and Processing
Training
Three-dimensional displays
Laser radar
Quantization (signal)
Merging
Object detection
Detectors

Language

ISSN

2575-7075

Abstract

Quantization-based methods are widely used in LiDAR points 3D object detection for its efficiency in extracting context information. Unlike image where the context information is distributed evenly over the object, most LiDAR points are distributed along the object boundary, which means the boundary features are more critical in LiDAR points 3D detection. However, quantization inevitably introduces ambiguity during both the training and inference stages. To alleviate this problem, we propose a one-stage and voting-based 3D detector, named Point-Voxel-Grid Network (PVGNet). In particular, PVGNet extracts point, voxel and grid-level features in a unified backbone architecture and produces point-wise fusion features. It segments Li-DAR points into foreground and background, predicts a 3D bounding box for each foreground point, and performs group voting to get the final detection results. Moreover, we observe that instance-level point imbalance due to occlusion and observation distance also degrades the detection performance. A novel instance-aware focal loss is proposed to alleviate this problem and further improve the detection ability. We conduct experiments on the KITTI and Waymo datasets. Our proposed PVGNet outperforms previous state-of-the-art methods and ranks at the top of KITTI 3D/BEV detection leaderboards.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송