학술논문

Learning a Policy for Pursuit-Evasion Games Using Spiking Neural Networks and the STDP Algorithm

Document Type

Conference

Author

Ramezanlou, Mohammad T.; Schwartz, Howard; Lambadaris, Ioannis; Barbeau, Michel; Naqvi, Syed Hassan Raza

Source

2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC) Systems, Man, and Cybernetics (SMC), 2023 IEEE International Conference on. :1918-1925 Oct, 2023

Subject

Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Computing and Processing
General Topics for Engineers
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Training
Pedestrians
Heuristic algorithms
Neural networks
Dynamics
Games
Game theory
Active Target Defense
Spiking Neural Network
Spike-Timing-Dependent Plasticity

Language

ISSN

2577-1655

Abstract

Pursuit-Evasion (PE) games are regarded as a major platform for game theory. In this kind of game, an agent called an evader tries to escape from another agent called a pursuer. Active Target Defense (ATD) is a derivative of PE games, attracting attention recently. In an ATD game, the evader, often called an invader, strives to capture a moving target. The pursuer, called a defender, tries to intercept the invader. This paper implements the Spike-Timing-Dependent Plasticity (STDP) algorithm to train two Spiking Neural Networks (SNNs) to find a suitable solution for the ATD problem in decentralized situations. One of the SNNs is used to control the invader, while the other controls the defender. The performance is compared with the analytical solution for the pedestrian model. The results showed that an SNN can learn the optimal capture point only using relative velocities and line of sight.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송