학술논문

Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial
Document Type
Periodical
Source
IEEE Communications Surveys & Tutorials IEEE Commun. Surv. Tutorials Communications Surveys & Tutorials, IEEE. 23(2):1226-1252 Jan, 2021
Subject
Communication, Networking and Broadcast Technologies
Signal Processing and Analysis
Tutorials
Wireless networks
Games
Computational modeling
Training
5G mobile communication
Reinforcement learning
AI-enabled wireless networks
deep reinforcement learning (DRL)
multi-agent reinforcement learning (MARL)
model-based reinforcement learning (MBRL)
decentralized networks
Language
ISSN
1553-877X
2373-745X
Abstract
Deep Reinforcement Learning (DRL) has recently witnessed significant advances that have led to multiple successes in solving sequential decision-making problems in various domains, particularly in wireless communications. The next generation of wireless networks is expected to provide scalable, low-latency, ultra-reliable services empowered by the application of data-driven Artificial Intelligence (AI). The key enabling technologies of future wireless networks, such as intelligent meta-surfaces, aerial networks, and AI at the edge, involve more than one agent which motivates the importance of multi-agent learning techniques. Furthermore, cooperation is central to establishing self-organizing, self-sustaining, and decentralized networks. In this context, this tutorial focuses on the role of DRL with an emphasis on deep Multi-Agent Reinforcement Learning (MARL) for AI-enabled wireless networks. The first part of this paper will present a clear overview of the mathematical frameworks for single-agent RL and MARL. The main idea of this work is to motivate the application of RL beyond the model-free perspective which was extensively adopted in recent years. Thus, we provide a selective description of RL algorithms such as Model-Based RL (MBRL) and cooperative MARL and we highlight their potential applications in future wireless networks. Finally, we overview the state-of-the-art of MARL in fields such as Mobile Edge Computing (MEC), Unmanned Aerial Vehicles (UAV) networks, and cell-free massive MIMO, and identify promising future research directions. We expect this tutorial to stimulate more research endeavors to build scalable and decentralized systems based on MARL.