Multi-UAV Assisted Offloading Optimization: A Game Combined Reinforcement Learning Approach

Ang Gao; Qi Wang; Kaiyue Chen; Wei Liang

doi:10.1109/LCOMM.2021.3078469

School of Electronics and Information

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

8 Citations (Scopus)

Abstract

Although unmanned aerial vehicles (UAVs) have attracted much attention by providing aerial relays to massive ground users (GUs) for tasks offloading, there still exist several issues, such as the unbalance of tasks size and trajectory optimization related to energy efficiency and obstacles avoidance. The letter models the multi-UAV assisted offloading system as two separate problems optimized by a potential game combined reinforcement learning algorithm, i.e., potential game for service assignment, and deep deterministic policy gradient (DDPG) for trajectory planning. The former largely reduces the convergence time, and the latter can search the best action in a continuous domain. The numerical results show that the proposed approach has great advantages in minimizing offloading delay, enhancing energy efficiency and avoiding obstacles.

Original language	English
Article number	9426943
Pages (from-to)	2629-2633
Number of pages	5
Journal	IEEE Communications Letters
Volume	25
Issue number	8
DOI	https://doi.org/10.1109/LCOMM.2021.3078469
Publication status	Published - Aug 2021


Cite this

@article{2e418d07b8dd4db1aeaf3fd52bb138f3,

title = "Multi-UAV Assisted Offloading Optimization: A Game Combined Reinforcement Learning Approach",

abstract = "Although unmanned aerial vehicles (UAVs) have attracted much attention by providing aerial relays to massive ground users (GUs) for tasks offloading, there still exist several issues, such as the unbalance of tasks size and trajectory optimization related to energy efficiency and obstacles avoidance. The letter models the multi-UAV assisted offloading system as two separate problems optimized by a potential game combined reinforcement learning algorithm, i.e., potential game for service assignment, and deep deterministic policy gradient (DDPG) for trajectory planning. The former largely reduces the convergence time, and the latter can search the best action in a continuous domain. The numerical results show that the proposed approach has great advantages in minimizing offloading delay, enhancing energy efficiency and avoiding obstacles.",

keywords = "DDPG, DRL, Offloading, potential game",

author = "Ang Gao and Qi Wang and Kaiyue Chen and Wei Liang",

note = "Publisher Copyright: {\textcopyright} 1997-2012 IEEE.",

year = "2021",

month = aug,

doi = "10.1109/LCOMM.2021.3078469",

language = "English",

volume = "25",

pages = "2629--2633",

journal = "IEEE Communications Letters",

issn = "1089-7798",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "8",

}

TY - JOUR

T1 - Multi-UAV Assisted Offloading Optimization

T2 - A Game Combined Reinforcement Learning Approach

AU - Gao, Ang

AU - Wang, Qi

AU - Chen, Kaiyue

AU - Liang, Wei

PY - 2021/8

Y1 - 2021/8

N2 - Although unmanned aerial vehicles (UAVs) have attracted much attention by providing aerial relays to massive ground users (GUs) for tasks offloading, there still exist several issues, such as the unbalance of tasks size and trajectory optimization related to energy efficiency and obstacles avoidance. The letter models the multi-UAV assisted offloading system as two separate problems optimized by a potential game combined reinforcement learning algorithm, i.e., potential game for service assignment, and deep deterministic policy gradient (DDPG) for trajectory planning. The former largely reduces the convergence time, and the latter can search the best action in a continuous domain. The numerical results show that the proposed approach has great advantages in minimizing offloading delay, enhancing energy efficiency and avoiding obstacles.

AB - Although unmanned aerial vehicles (UAVs) have attracted much attention by providing aerial relays to massive ground users (GUs) for tasks offloading, there still exist several issues, such as the unbalance of tasks size and trajectory optimization related to energy efficiency and obstacles avoidance. The letter models the multi-UAV assisted offloading system as two separate problems optimized by a potential game combined reinforcement learning algorithm, i.e., potential game for service assignment, and deep deterministic policy gradient (DDPG) for trajectory planning. The former largely reduces the convergence time, and the latter can search the best action in a continuous domain. The numerical results show that the proposed approach has great advantages in minimizing offloading delay, enhancing energy efficiency and avoiding obstacles.

KW - DDPG

KW - DRL

KW - Offloading

KW - potential game

UR - http://www.scopus.com/inward/record.url?scp=85105867046&partnerID=8YFLogxK

U2 - 10.1109/LCOMM.2021.3078469

DO - 10.1109/LCOMM.2021.3078469

M3 - Article

AN - SCOPUS:85105867046

SN - 1089-7798

VL - 25

SP - 2629

EP - 2633

JO - IEEE Communications Letters

JF - IEEE Communications Letters

IS - 8

M1 - 9426943

ER -
