학술논문

Optimal D2D Learning-Based Neighbor Selection in mmWave Networks using Gittins Indices
Document Type
Conference
Source
2023 19th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob) Wireless and Mobile Computing, Networking and Communications (WiMob), 2023 19th International Conference on. :149-154 Jun, 2023
Subject
Communication, Networking and Broadcast Technologies
Computing and Processing
Signal Processing and Analysis
Performance evaluation
Wireless communication
Training
Analytical models
Computational modeling
Stochastic processes
Reinforcement learning
device-to-device (D2D) communication
millimeter wave (mmWave) communication
5G Proximity Services (ProSe)
performance modeling and analysis
Gittins Indices
Multi-Armed-Bandit (MAB)
neighbor selection
Language
ISSN
2160-4894
Abstract
Device-to-Device (D2D) communication helps in increasing the coverage range and throughput in millimeter Wave (mmWave) networks. The performance of the mmWave D2D communication mainly depends on selecting the best neighbor device through a process of Beamforming Training (BT). The process of BT involves pointing beams of multiple resolutions at different angles to select the best neighbor device in terms of link quality. While the sender exhaustively performs BT with all the neighbors in naive BT, the Reinforcement Learning (RL) based techniques, on the other hand, employ exploration/exploitation strategies to intelligently search for the best neighbor.The state-of-the-art RL algorithms typically yield sub-optimal performance, by trading-off computational tractability over optimality. Tractable optimal solving methods are therefore critical in improving the performance of neighbor selection through an intelligent BT process. To this end, this work studies the problem structure of the mmWave BT neighbor selection process formulated as a Multi-Armed Bandit (MAB) problem and provides a simple and scalable optimal method to select mmWave D2D neighbors with the objective of maximizing throughput performance. With extensive simulations, the efficacy of the proposed framework is demonstrated.