학술논문

Adversarial Unsupervised Domain Adaptation for Hand Gesture Recognition Using Thermal Images
Document Type
Periodical
Source
IEEE Sensors Journal IEEE Sensors J. Sensors Journal, IEEE. 23(4):3493-3504 Feb, 2023
Subject
Signal Processing and Analysis
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Robotics and Control Systems
Task analysis
Gesture recognition
Adaptation models
Feature extraction
Thermal sensors
Data models
Training
Adversarial learning
deep learning (DL)
hand gesture recognition
unsupervised domain adaptation (UDA)
Language
ISSN
1530-437X
1558-1748
2379-9153
Abstract
Hand gesture recognition has a wide range of applications, including in the automotive and industrial sectors, health assistive systems, authentication, and so on. Thermal images are more resistant to environmental changes than red–green–blue (RGB) images for hand gesture recognition. However, one disadvantage of using thermal images for the aforementioned task is the scarcity of labeled thermal datasets. To tackle this problem, we propose a method that combines unsupervised domain adaptation (UDA) techniques with deep-learning (DL) technology to remove the need for labeled data in the learning process. There are several types and methods for implementing UDA, with adversarial UDA being one of the most common. In this article, the first time in this field, we propose a novel adversarial UDA model that uses channel attention and bottleneck layers to learn domain-invariant features across RGB and thermal domains. Thus, the proposed model leverages the information from the labeled RGB data to solve the hand gesture recognition task using thermal images. We evaluate the proposed model on two hand gesture datasets, namely, Sign Digit Classification and Alphabet Gesture Classification, and compare it to other benchmark models in terms of accuracy, model size, and model parameters. Our model outperforms the other state-of-the-art methods on the Sign Digit Classification and Alphabet Gesture Classification datasets and achieves 91.32% and 80.91% target test accuracy, respectively.