학술논문

Frequency Agile Anti-Interference Technology Based on Reinforcement Learning Using Long Short-Term Memory and Multi-Layer Historical Information Observation.

Document Type

Article

Author

Shi, Weihao; Guo, Shanhong; Cong, Xiaoyu; Sheng, Weixing; Yan, Jing; Chen, Jinkun

Source

Remote Sensing. Dec2023, Vol. 15 Issue 23, p5467. 20p.

Subject

*REINFORCEMENT learning
*RADAR interference
*PARTIALLY observable Markov decision processes
*COLLECTIVE memory
*MILITARY electronics

Language

ISSN

2072-4292

Abstract

In modern electronic warfare, radar intelligence has become increasingly crucial when dealing with complex interference environments. This paper combines radar agile frequency technology with reinforcement learning to achieve adaptive frequency hopping for radar anti-jamming. Unlike traditional reinforcement learning with Markov decision processes (MDPs), the interaction between radar and jammers occurs within the partially observable Markov decision processes (POMDPs). In this context, the partial observation information available to the agent does not strictly satisfy the Markov property. This paper uses multiple layers of historical observation information to solve this problem. Historical observations can be viewed as a time series, and time-sensitive networks are employed to extract the temporal information embedded within the observations. In addition, the reward function is optimized to facilitate the faster learning of the agent in the jammer sweep environment. This simulation shows that the optimization of the agent state, network structure, and reward function can effectively help the radar to resist jamming. [ABSTRACT FROM AUTHOR]

Online Access

EBSCOHost PDF Full Text (ProQuest Central) Full Text (Gale Academic Onefile) Web of Science JCR 저널정보 Scopus Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송