A Dynamic Power Allocation Scheme in Power-Domain NOMA using Actor-Critic Reinforcement Learning

Shaomin Zhang, Lixin Li, Jiaying Yin, Wei Liang, Xu Li, Wei Chen, Zhu Han

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

18 Scopus citations

Abstract

Non-orthogonal multiple access (NOMA) is one of the most promising technologies in the next-generation cellular communication. However, the effective power allocation strategy has always been a problem that needs to be solved in power-domain NOMA. In this paper, we propose a reinforcement learning (RL) method to solve the power allocation problem. In particular, in the power-domain NOMA, the base station (BS) simultaneously transmits data to the user under the constraint of the sum power. Considering that the power allocation assigned by the BS to each user can be used to optimize the energy efficient (EE) of the entire system, we propose the RL algorithm framework of the Actor-Critic to dynamically select the power allocation coefficient. A parameterized strategy is constructed in the Actor part, and then the Critic part evaluates it, and finally the Actor part adjust the strategy according to the feedback from the Critic part. Numerical results indicate that the proposed scheme can efficiently improve the EE of the entire system.

Original languageEnglish
Title of host publication2018 IEEE/CIC International Conference on Communications in China, ICCC 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages719-723
Number of pages5
ISBN (Electronic)9781538670057
DOIs
StatePublished - 2 Jul 2018
Event2018 IEEE/CIC International Conference on Communications in China, ICCC 2018 - Beijing, China
Duration: 16 Aug 201818 Aug 2018

Publication series

Name2018 IEEE/CIC International Conference on Communications in China, ICCC 2018

Conference

Conference2018 IEEE/CIC International Conference on Communications in China, ICCC 2018
Country/TerritoryChina
CityBeijing
Period16/08/1818/08/18

Keywords

  • Actor-Critic
  • energy efficiency
  • NOMA
  • power allocation
  • reinforcement learning

Fingerprint

Dive into the research topics of 'A Dynamic Power Allocation Scheme in Power-Domain NOMA using Actor-Critic Reinforcement Learning'. Together they form a unique fingerprint.

Cite this