TY - GEN
T1 - A Dynamic Power Allocation Scheme in Power-Domain NOMA using Actor-Critic Reinforcement Learning
AU - Zhang, Shaomin
AU - Li, Lixin
AU - Yin, Jiaying
AU - Liang, Wei
AU - Li, Xu
AU - Chen, Wei
AU - Han, Zhu
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/7/2
Y1 - 2018/7/2
N2 - Non-orthogonal multiple access (NOMA) is one of the most promising technologies for next-generation cellular communication. However, effective power allocation remains an open problem in power-domain NOMA. In this paper, we propose a reinforcement learning (RL) method to solve the power allocation problem. In particular, in power-domain NOMA, the base station (BS) transmits data to all users simultaneously under a sum-power constraint. Since the power allocated by the BS to each user can be tuned to optimize the energy efficiency (EE) of the entire system, we propose an Actor-Critic RL framework to dynamically select the power allocation coefficients. A parameterized policy is constructed in the Actor part, the Critic part evaluates it, and the Actor part then adjusts the policy according to the Critic's feedback. Numerical results indicate that the proposed scheme efficiently improves the EE of the entire system.
AB - Non-orthogonal multiple access (NOMA) is one of the most promising technologies for next-generation cellular communication. However, effective power allocation remains an open problem in power-domain NOMA. In this paper, we propose a reinforcement learning (RL) method to solve the power allocation problem. In particular, in power-domain NOMA, the base station (BS) transmits data to all users simultaneously under a sum-power constraint. Since the power allocated by the BS to each user can be tuned to optimize the energy efficiency (EE) of the entire system, we propose an Actor-Critic RL framework to dynamically select the power allocation coefficients. A parameterized policy is constructed in the Actor part, the Critic part evaluates it, and the Actor part then adjusts the policy according to the Critic's feedback. Numerical results indicate that the proposed scheme efficiently improves the EE of the entire system.
KW - Actor-Critic
KW - energy efficiency
KW - NOMA
KW - power allocation
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85063085424&partnerID=8YFLogxK
U2 - 10.1109/ICCChina.2018.8641248
DO - 10.1109/ICCChina.2018.8641248
M3 - Conference contribution
AN - SCOPUS:85063085424
T3 - 2018 IEEE/CIC International Conference on Communications in China, ICCC 2018
SP - 719
EP - 723
BT - 2018 IEEE/CIC International Conference on Communications in China, ICCC 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2018 IEEE/CIC International Conference on Communications in China, ICCC 2018
Y2 - 16 August 2018 through 18 August 2018
ER -