TY - JOUR
T1 - Content caching policy for 5G network based on asynchronous advantage actor-critic method
AU - Shi, Zhuoyang
AU - Li, Lixin
AU - Xu, Yang
AU - Li, Xu
AU - Chen, Wei
AU - Han, Zhu
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019
Y1 - 2019
N2 - Nowadays content caching at base stations (BSs) has attracted more and more attention in 5G networks with the ability of saving resources and reducing data traffic. However, in practice, it's a challenge to design a caching policy intelligently due to the limited storage capacity as well as time and space varying users' requests. In this paper, we propose an algorithm based on asynchronous advantage actor-critic (A3C) to solve the content caching problem. Considering some cooperative BSs, with each BS having a cache, every BS can fetch contents from either neighboring BSs or the backbone network, with different degrees of expenditure. In order to learn the optimal caching and sharing policy, the online A3C-based algorithm is designed to minimize the total transmission cost without knowing content popularity distribution. To evaluate the proposed algorithm, we compare the performance with the classical caching policies, including Least Recently Used (LRU), Least Frequently Used (LFU), Adaptive Replacement Cache (ARC) and one distributed algorithm in the literature. The simulation results show that the proposed A3C-based algorithm can achieve a low transmission cost and improve the convergence rate in the dynamic environment.
AB - Nowadays content caching at base stations (BSs) has attracted more and more attention in 5G networks with the ability of saving resources and reducing data traffic. However, in practice, it's a challenge to design a caching policy intelligently due to the limited storage capacity as well as time and space varying users' requests. In this paper, we propose an algorithm based on asynchronous advantage actor-critic (A3C) to solve the content caching problem. Considering some cooperative BSs, with each BS having a cache, every BS can fetch contents from either neighboring BSs or the backbone network, with different degrees of expenditure. In order to learn the optimal caching and sharing policy, the online A3C-based algorithm is designed to minimize the total transmission cost without knowing content popularity distribution. To evaluate the proposed algorithm, we compare the performance with the classical caching policies, including Least Recently Used (LRU), Least Frequently Used (LFU), Adaptive Replacement Cache (ARC) and one distributed algorithm in the literature. The simulation results show that the proposed A3C-based algorithm can achieve a low transmission cost and improve the convergence rate in the dynamic environment.
KW - Asynchronous advantage actor-critic
KW - Content caching
KW - Deep reinforcement learning
KW - Transmission cost
UR - http://www.scopus.com/inward/record.url?scp=85081955925&partnerID=8YFLogxK
U2 - 10.1109/GLOBECOM38437.2019.9014268
DO - 10.1109/GLOBECOM38437.2019.9014268
M3 - 会议文章
AN - SCOPUS:85081955925
SN - 2334-0983
JO - Proceedings - IEEE Global Communications Conference, GLOBECOM
JF - Proceedings - IEEE Global Communications Conference, GLOBECOM
M1 - 9014268
T2 - 2019 IEEE Global Communications Conference, GLOBECOM 2019
Y2 - 9 December 2019 through 13 December 2019
ER -