학술논문

Enhanced Spatio- Temporal Image Encoding for Online Human Activity Recognition
Document Type
Conference
Source
2023 International Conference on Machine Learning and Applications (ICMLA) ICMLA Machine Learning and Applications (ICMLA), 2023 International Conference on. :884-889 Dec, 2023
Subject
Computing and Processing
Engineering Profession
Robotics and Control Systems
Signal Processing and Analysis
Training
Image coding
Three-dimensional displays
Image recognition
Time series analysis
Focusing
Streaming media
3D Skeleton Data
Spatio-temporal Image En-coding
Motion Energy
Online Action Recognition
Human Activity Recognition
Deep learning
Language
ISSN
1946-0759
Abstract
Human Activity Recognition (HAR) based on sen-sors data can be seen as a time series classification problem where the challenge is to handle both spatial and temporal dependencies, while focusing on the most relevant data variations. It can be done using 3D skeleton data extracted from a RGB+D camera. In this work, we propose to improve the spatio-temporal image encoding of 3D skeletons captured from a Kinect sensor, by studying the concept of motion energy which focuses mainly on skeleton joints that are the most solicited for an action. This encoding allows us to achieve a better discrimination for the detection of online activities by focusing on the most significant parts of the actions. The article presents this new encoding and its application for HAR using a deep learning model trained on the encoded 3D skeleton data. For this purpose, we proposed to investigate the knowledge transferability of several pre-trained CNNs provided by Keras. The article shows a significant improvement of the accuracy of the learning according to the state of the art.