Academic Paper

Home Action Genome: Cooperative Compositional Action Understanding

Document Type

Conference

Author

Rai, Nishant; Chen, Haofeng; Ji, Jingwei; Desai, Rishi; Kozuka, Kazuki; Ishizaka, Shun; Adeli, Ehsan; Niebles, Juan Carlos

Source

2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11179-11188, Jun. 2021

Subject

Computing and Processing
Location awareness
Learning systems
Computer vision
Annotations
Image color analysis
Genomics
Data visualization

Language

ISSN

2575-7075

Abstract

Existing research on action recognition treats activities as monolithic events occurring in videos. Recently, the benefits of formulating actions as a combination of atomic actions have shown promise in improving action understanding with the emergence of datasets containing such annotations, allowing us to learn representations capturing this information. However, there remains a lack of studies that extend action composition and leverage multiple viewpoints and multiple modalities of data for representation learning. To promote research in this direction, we introduce Home Action Genome (HOMAGE): a multi-view action dataset with multiple modalities and viewpoints supplemented with hierarchical activity and atomic action labels together with dense scene composition labels. Leveraging rich multi-modal and multi-view settings, we propose Cooperative Compositional Action Understanding (CCAU), a cooperative learning framework for hierarchical action recognition that is aware of compositional action elements. CCAU shows consistent performance improvements across all modalities. Furthermore, we demonstrate the utility of co-learning compositions in few-shot action recognition by achieving 28.6% mAP with just a single sample.

