TY - JOUR
T1 - Decentralized offloading strategies based on reinforcement learning for multi-access edge computing
AU - Hu, Chunyang
AU - Li, Jingchen
AU - Shi, Haobin
AU - Ning, Bin
AU - Gu, Qiong
N1 - Publisher Copyright:
© 2021 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2021/9
Y1 - 2021/9
N2 - Using reinforcement learning technologies to learn offloading strategies for multi-access edge computing systems has been developed by researchers. However, large-scale systems are unsuitable for reinforcement learning, due to their huge state spaces and offloading behaviors. For this reason, this work introduces the centralized training and decentralized execution mechanism, designing a decentralized reinforcement learning model for multi-access edge computing systems. Considering a cloud server and several edge servers, we separate the training and execution in the reinforcement learning model. The execution happens in edge devices of the system, and edge servers need no communication. Conversely, the training process occurs at the cloud device, which causes a lower transmission latency. The developed method uses a deep deterministic policy gradient algorithm to optimize offloading strategies. The simulated experiment shows that our method can learn the offloading strategy for each edge device efficiently.
AB - Using reinforcement learning technologies to learn offloading strategies for multi-access edge computing systems has been developed by researchers. However, large-scale systems are unsuitable for reinforcement learning, due to their huge state spaces and offloading behaviors. For this reason, this work introduces the centralized training and decentralized execution mechanism, designing a decentralized reinforcement learning model for multi-access edge computing systems. Considering a cloud server and several edge servers, we separate the training and execution in the reinforcement learning model. The execution happens in edge devices of the system, and edge servers need no communication. Conversely, the training process occurs at the cloud device, which causes a lower transmission latency. The developed method uses a deep deterministic policy gradient algorithm to optimize offloading strategies. The simulated experiment shows that our method can learn the offloading strategy for each edge device efficiently.
KW - Deep reinforcement learning
KW - Multi-access edge computing
KW - Task offloading
UR - http://www.scopus.com/inward/record.url?scp=85114111923&partnerID=8YFLogxK
U2 - 10.3390/info12090343
DO - 10.3390/info12090343
M3 - 文章
AN - SCOPUS:85114111923
SN - 2078-2489
VL - 12
JO - Information (Switzerland)
JF - Information (Switzerland)
IS - 9
M1 - 343
ER -