학술논문

3D UAV Trajectory and Data Collection Optimisation Via Deep Reinforcement Learning
Document Type
Periodical
Source
IEEE Transactions on Communications IEEE Trans. Commun. Communications, IEEE Transactions on. 70(4):2358-2371 Apr, 2022
Subject
Communication, Networking and Broadcast Technologies
Trajectory
Wireless networks
Resource management
Optimization
Data collection
Throughput
Three-dimensional displays
UAV-assisted wireless network
trajectory
data collection
and deep reinforcement learning
Language
ISSN
0090-6778
1558-0857
Abstract
Unmanned aerial vehicles (UAVs) are now beginning to be deployed for enhancing the network performance and coverage in wireless communication. However, due to the limitation of their on- board power and flight time, it is challenging to obtain an optimal resource allocation scheme for the UAV-assisted Internet of Things (IoT). In this paper, we design a new UAV-assisted IoT system relying on the shortest flight path of the UAVs while maximising the amount of data collected from IoT devices. Then, a deep reinforcement learning-based technique is conceived for finding the optimal trajectory and throughput in a specific coverage area. After training, the UAV has the ability to autonomously collect all the data from user nodes at a significant total sum-rate improvement while minimising the associated resources used. Numerical results are provided to highlight how our techniques strike a balance between the throughput attained, trajectory, and the time spent. More explicitly, we characterise the attainable performance in terms of the UAV trajectory, the expected reward and the total sum-rate.