Content Caching Policy Based on GAN and Distributional Reinforcement Learning

Haipeng Weng, Lixin Li, Qianqian Cheng, Wei Chen, Zhu Han

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

To reduce content transmission power and network load pressure, content caching technology based on a large number of small base stations (SBSs) is considered to be an effective solution. However, due to the limited cache capacity and unknown content popularity, how to design an intelligent content caching policy has become a great challenge. In this paper, we propose a generative adversarial network (GAN) based on the distributional deep Q-Network (DDQN) algorithm, named QGAN, to learn the content caching policy. A content caching network that contains several cooperative SBSs is considered in the case of unknown content popularity, where each SBS fetches cached content from the adjacent SBS or cloud. Moreover, compared with three classical content caching policies and one reinforcement learning algorithm, the performance of the QGAN algorithm is verified. The simulation results show that the convergence rate is improved and the transmission cost is reduced with the proposed algorithm.

Original languageEnglish
Title of host publication2020 IEEE International Conference on Communications, ICC 2020 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728150895
DOIs
StatePublished - Jun 2020
Event2020 IEEE International Conference on Communications, ICC 2020 - Dublin, Ireland
Duration: 7 Jun 202011 Jun 2020

Publication series

NameIEEE International Conference on Communications
Volume2020-June
ISSN (Print)1550-3607

Conference

Conference2020 IEEE International Conference on Communications, ICC 2020
Country/TerritoryIreland
CityDublin
Period7/06/2011/06/20

Keywords

  • Content caching
  • distributional reinforcement learning (DRL)
  • generative adversarial network (GAN)

Fingerprint

Dive into the research topics of 'Content Caching Policy Based on GAN and Distributional Reinforcement Learning'. Together they form a unique fingerprint.

Cite this