학술논문

Objaverse: A Universe of Annotated 3D Objects

Document Type

Conference

Author

Deitke, Matt; Schwenk, Dustin; Salvador, Jordi; Weihs, Luca; Michel, Oscar; VanderBilt, Eli; Schmidt, Ludwig; Ehsanit, Kiana; Kembhavi, Aniruddha; Farhadi, Ali

Source

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) CVPR Computer Vision and Pattern Recognition (CVPR), 2023 IEEE/CVF Conference on. :13142-13153 Jun, 2023

Subject

Computing and Processing
Training
Solid modeling
Computer vision
Analytical models
Three-dimensional displays
Computational modeling
Benchmark testing
Datasets and evaluation

Language

ISSN

2575-7075

Abstract

Massive data corpora like WebText, Wikipedia, Conceptual Captions, WebImageText, and LAION have propelled recent dramatic progress in AI. Large neural models trained on such datasets produce impressive results and top many of today's benchmarks. A notable omisslion within this family of large-scale datasets is 3D data. Despite considerable interest and potential applications in 3D vision, datasets of high-fidelity 3D models continue to be mid-sized with limited diversity of object categories. Addressing this gap, we present Objaverse 1.0, a large dataset of objects with 800K + (and growing) 3D models with descriptive captions, tags, and animations. Objaverse improves upon present day 3D repositories in terms of scale, number of categories, and in the visual diversity of instances within a category. We demonstrate the large potential of Objaverse via four diverse applications: training generative 3D models, improving tail category segmentation on the LVIS benchmark, training open-vocabulary object-navigation models for Embodied AI, and creating a new benchmark for robustness analysis of vision models. Objaverse can open new directions for research and enable new applications across the field of AI.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송