학술논문

MARS: Multiagent Reinforcement Learning for Spatial—Spectral and Temporal Feature Selection in EEG-Based BCI
Document Type
Periodical
Source
IEEE Transactions on Systems, Man, and Cybernetics: Systems IEEE Trans. Syst. Man Cybern, Syst. Systems, Man, and Cybernetics: Systems, IEEE Transactions on. 54(5):3084-3096 May, 2024
Subject
Signal Processing and Analysis
Robotics and Control Systems
Power, Energy and Industry Applications
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
General Topics for Engineers
Feature extraction
Electroencephalography
Task analysis
Robots
Reinforcement learning
Mars
Data mining
Brain–computer interface (BCI)
feature selection
motor imagery (MI)
multiagent reinforcement learning (RL)
Language
ISSN
2168-2216
2168-2232
Abstract
In recent years, deep learning methods have shown promising capabilities for extracting informative and discriminative features from electroencephalography (EEG) data. However, several studies have reported that the feature selection process followed by feature extraction can be beneficial to achieve further performance improvement. Even though a recent work achieved promising results by using the single-agent reinforcement learning (RL)-based framework to select task-relevant features in the temporal domain, it still failed to consider other significant features in the spatial-spectral domain. To overcome such limitations, we propose a cooperative multiagent RL-based framework (MARS) that performs feature selection in both the spatial–spectral and temporal domains simultaneously for a motor imagery (MI)-EEG classification task. In this framework, we enable our RL agents to collaborate with each other as a team to solve a complex multiobjective feature selection problem. Furthermore, we adopt a counterfactual advantage function to overcome the free-rider problem, which is associated with the credit assignment issue in multiagent cases. To assess the MARS framework, we conduct extensive experiments with two public MI datasets under subject-dependent and subject-independent scenarios and we apply the MARS to different backbone networks. The experimental results demonstrate that our MARS outperforms other competing methods in terms of mean accuracy and achieves statistically significant improvements.