학술논문

Self-Supervise Reinforcement Learning Method for Vacant Parking Space Detection Based on Task Consistency and Corrupted Rewards
Document Type
Periodical
Source
IEEE Transactions on Intelligent Transportation Systems IEEE Trans. Intell. Transport. Syst. Intelligent Transportation Systems, IEEE Transactions on. 25(2):1346-1363 Feb, 2024
Subject
Transportation
Aerospace
Communication, Networking and Broadcast Technologies
Computing and Processing
Robotics and Control Systems
Signal Processing and Analysis
Task analysis
Training
Noise measurement
Feature extraction
Detectors
Transfer learning
Semantics
Task-consistency
reinforcement learning
learn from corrupted rewards
domain transfer
Language
ISSN
1524-9050
1558-0016
Abstract
This paper proposes a novel task-consistency learning method that enables us to train a vacant space detection network (target task) based on the logic consistency with the semantic outcomes from a flow-based motion behavior classifier (source task) in a parking lot. Note that the source task can introduce false detection during task-consistency learning, which implies noisy rewards or supervision. The target network can be trained in a reinforcement learning setting by appropriately designing the reward mechanism upon semantic consistency. We also introduce a novel symmetric constraint to detect corrupted samples and reduce the effect of noisy rewards. Unlike conventional corrupted learning methods that use only training losses to identify corrupted samples, our symmetric constraint also explores the relationship among training samples to improve performance. Compared with conventional supervised detection methods, the main contribution of our work is the ability to learn a vacant space detector via semantic consistency rather than supervised labels. The dynamic learning property allows the proposed detector to be easily deployed and updated in various lots without heavy human loads. Experiments demonstrate that our noisy task consistency mechanism can be successfully applied to train a vacant space detector from scratch.