학술논문

Semantic Data Augmentation for Long-tailed Facial Expression Recognition
Document Type
Conference
Source
2023 8th International Conference on Computer and Communication Systems (ICCCS) Computer and Communication Systems (ICCCS), 2023 8th International Conference on. :1052-1055 Apr, 2023
Subject
Communication, Networking and Broadcast Technologies
Computing and Processing
Computer vision
Communication systems
Face recognition
Computational modeling
Semantics
Medical services
Fatigue
facial expression recognition
data augmentation
long tailed recognition
VAE-GAN
Language
Abstract
Facial Expression Recognition has a wide application prospect in social robotics, health care, driver fatigue monitoring, and many other practical scenarios. Automatic recognition of facial expressions has been extensively studied by the Computer Vision research society. But Facial Expression Recognition in real-world is still a challenging task, partially due to the long-tailed distribution of the dataset. Many recent studies use data augmentation for Long-Tailed Recognition tasks. In this paper, we propose a novel semantic augmentation method. By introducing randomness into the encoding of the source data in the latent space of VAE-GAN, new samples are generated. Then, for facial expression recognition in RAF-DB dataset, we use our augmentation method to balance the long-tailed distribution. Our method can be used in not only FER tasks, but also more diverse data-hungry scenarios.