학술논문

Enhanced Spatio- Temporal Image Encoding for Online Human Activity Recognition

Document Type

Conference

Author

Mokhtari, Nassim; Fer, Vincent; Nedelec, Alexis; Gilles, Marlene; de Loor, Pierre

Source

2023 International Conference on Machine Learning and Applications (ICMLA) ICMLA Machine Learning and Applications (ICMLA), 2023 International Conference on. :884-889 Dec, 2023

Subject

Computing and Processing
Engineering Profession
Robotics and Control Systems
Signal Processing and Analysis
Training
Image coding
Three-dimensional displays
Image recognition
Time series analysis
Focusing
Streaming media
3D Skeleton Data
Spatio-temporal Image En-coding
Motion Energy
Online Action Recognition
Human Activity Recognition
Deep learning

Language

ISSN

1946-0759

Abstract

Human Activity Recognition (HAR) based on sen-sors data can be seen as a time series classification problem where the challenge is to handle both spatial and temporal dependencies, while focusing on the most relevant data variations. It can be done using 3D skeleton data extracted from a RGB+D camera. In this work, we propose to improve the spatio-temporal image encoding of 3D skeletons captured from a Kinect sensor, by studying the concept of motion energy which focuses mainly on skeleton joints that are the most solicited for an action. This encoding allows us to achieve a better discrimination for the detection of online activities by focusing on the most significant parts of the actions. The article presents this new encoding and its application for HAR using a deep learning model trained on the encoded 3D skeleton data. For this purpose, we proposed to investigate the knowledge transferability of several pre-trained CNNs provided by Keras. The article shows a significant improvement of the accuracy of the learning according to the state of the art.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송