학술논문

Continual Egocentric Activity Recognition With Foreseeable-Generalized Visual–IMU Representations
Document Type
Periodical
Source
IEEE Sensors Journal IEEE Sensors J. Sensors Journal, IEEE. 24(8):12934-12945 Apr, 2024
Subject
Signal Processing and Analysis
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Robotics and Control Systems
Task analysis
Sensors
Training
Human activity recognition
Wearable sensors
Videos
Optimization
Continual learning
human activity recognition (HAR)
multimodal network
wearable sensors
Language
ISSN
1530-437X
1558-1748
2379-9153
Abstract
The rapid advancement of wearable sensors has significantly facilitated data collection in our daily lives. Human activity recognition (HAR), a prominent research area in wearable technology, has made substantial progress in recent years. However, the existing efforts often overlook the issue of functional scalability in models, making it challenging for deep models to adapt to application scenarios that require continuous evolution. Furthermore, when employing conventional continual learning techniques, we have observed an imbalance between visual-based and inertial-measurement-unit (IMU) sensing modalities during joint optimization, which hampers model generalization and poses a significant challenge. To obtain a generalized representation more adapted to continual tasks, we propose a motivational optimization scheme to address the limited generalization caused by the modal imbalance, enabling foreseeable generalization in a visual–IMU multimodal network. To prevent the forgetting of previously learned activities, we introduce a robust representation estimation technique and a pseudo representation (PR) generation strategy for continual learning. Experimental results on the egocentric activity dataset UESTC-MMEA-CL demonstrate the effectiveness of our proposed method. Furthermore, our method effectively leverages the generalization capabilities of IMU-based modal representations, outperforming state-of-the-art methods in various task settings.