Learning-Aided Resource Allocation for Pattern Division Multiple Access-Based SWIPT Systems

Lixin Li; Hui Ma; Huan Ren; Qianqian Cheng; Dawei Wang; Tong Bai; Zhu Han

doi:10.1109/LWC.2020.3023108

School of Electronics and Information

Research output: Contribution to journal › Article › peer-review

11 Citations (Scopus)

Abstract

In this letter, a learning-aided resource allocation scheme based on the constrained Markov decision process (CMDP) is proposed to improve the average network energy efficiency (EE) with the constrained quality of service (QoS) in the pattern division multiple access (PDMA)-based simultaneous wireless information and power transfer (SWIPT) system. In order to solve the formulated CMDP resource allocation problem, the Lagrange duality is adopted to transform CMDP into an unconstrained Markov decision process (MDP). Due to the instability of the practical system, the Deep Q Network (DQN)-based CMDP scheme is proposed to obtain the optimal solution. The simulation results verify the proposed scheme converges faster than the benchmark in terms of increasing average network EE.
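The Lagrange-duality step mentioned in the abstract can be illustrated with a generic CMDP relaxation. This is only a sketch under assumed notation, not the letter's own formulation: the policy \pi, discount factor \gamma, per-step EE reward r, per-step QoS measure c, threshold C_{\min}, and multiplier \lambda are all illustrative symbols that do not appear in the abstract.

```latex
% Generic CMDP: maximize long-term reward subject to a long-term QoS constraint
\max_{\pi}\; J_r(\pi) = \mathbb{E}_{\pi}\!\left[\sum_{t=0}^{\infty} \gamma^{t}\, r(s_t, a_t)\right]
\quad \text{s.t.} \quad
J_c(\pi) = \mathbb{E}_{\pi}\!\left[\sum_{t=0}^{\infty} \gamma^{t}\, c(s_t, a_t)\right] \ge C_{\min}

% Lagrangian with multiplier \lambda \ge 0
L(\pi, \lambda) = J_r(\pi) + \lambda \bigl( J_c(\pi) - C_{\min} \bigr)

% For fixed \lambda, the term -\lambda C_{\min} is constant in \pi, so maximizing
% L over \pi is an unconstrained MDP with the modified per-step reward
r_{\lambda}(s, a) = r(s, a) + \lambda\, c(s, a)

% Dual problem: the multiplier is tuned in an outer minimization
\min_{\lambda \ge 0}\; \max_{\pi}\; L(\pi, \lambda)
```

Under this reading, a DQN would be trained on the modified reward r_{\lambda} for the inner maximization while \lambda is adjusted in an outer loop; how the letter actually couples the two is not stated in the abstract.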

Original language	English
Article number	9193910
Pages (from-to)	131-135
Number of pages	5
Journal	IEEE Wireless Communications Letters
Volume	10
Issue number	1
DOI	https://doi.org/10.1109/LWC.2020.3023108
Publication status	Published - Jan 2021

Cite this

@article{bf4866307df64c6e933a3fcc01a1ba22,

title = "Learning-Aided Resource Allocation for Pattern Division Multiple Access-Based SWIPT Systems",

abstract = "In this letter, a learning-aided resource allocation scheme based on the constrained Markov decision process (CMDP) is proposed to improve the average network energy efficiency (EE) with the constrained quality of service (QoS) in the pattern division multiple access (PDMA)-based simultaneous wireless information and power transfer (SWIPT) system. In order to solve the formulated CMDP resource allocation problem, the Lagrange duality is adopted to transform CMDP into an unconstrained Markov decision process (MDP). Due to the instability of the practical system, the Deep Q Network (DQN)-based CMDP scheme is proposed to obtain the optimal solution. The simulation results verify the proposed scheme converges faster than the benchmark in terms of increasing average network EE.",

keywords = "constrained Markov decision process (CMDP), deep Q network (DQN), pattern division multiple access (PDMA), Simultaneous wireless information and power transfer (SWIPT)",

author = "Lixin Li and Hui Ma and Huan Ren and Qianqian Cheng and Dawei Wang and Tong Bai and Zhu Han",

note = "Publisher Copyright: {\textcopyright} 2012 IEEE.",

year = "2021",

month = jan,

doi = "10.1109/LWC.2020.3023108",

language = "English",

volume = "10",

pages = "131--135",

journal = "IEEE Wireless Communications Letters",

issn = "2162-2337",

publisher = "IEEE Communications Society",

number = "1",

}

TY - JOUR

T1 - Learning-Aided Resource Allocation for Pattern Division Multiple Access-Based SWIPT Systems

AU - Li, Lixin

AU - Ma, Hui

AU - Ren, Huan

AU - Cheng, Qianqian

AU - Wang, Dawei

AU - Bai, Tong

AU - Han, Zhu

PY - 2021/1

Y1 - 2021/1

N2 - In this letter, a learning-aided resource allocation scheme based on the constrained Markov decision process (CMDP) is proposed to improve the average network energy efficiency (EE) with the constrained quality of service (QoS) in the pattern division multiple access (PDMA)-based simultaneous wireless information and power transfer (SWIPT) system. In order to solve the formulated CMDP resource allocation problem, the Lagrange duality is adopted to transform CMDP into an unconstrained Markov decision process (MDP). Due to the instability of the practical system, the Deep Q Network (DQN)-based CMDP scheme is proposed to obtain the optimal solution. The simulation results verify the proposed scheme converges faster than the benchmark in terms of increasing average network EE.

AB - In this letter, a learning-aided resource allocation scheme based on the constrained Markov decision process (CMDP) is proposed to improve the average network energy efficiency (EE) with the constrained quality of service (QoS) in the pattern division multiple access (PDMA)-based simultaneous wireless information and power transfer (SWIPT) system. In order to solve the formulated CMDP resource allocation problem, the Lagrange duality is adopted to transform CMDP into an unconstrained Markov decision process (MDP). Due to the instability of the practical system, the Deep Q Network (DQN)-based CMDP scheme is proposed to obtain the optimal solution. The simulation results verify the proposed scheme converges faster than the benchmark in terms of increasing average network EE.

KW - constrained Markov decision process (CMDP)

KW - deep Q network (DQN)

KW - pattern division multiple access (PDMA)

KW - Simultaneous wireless information and power transfer (SWIPT)

UR - http://www.scopus.com/inward/record.url?scp=85099502899&partnerID=8YFLogxK

U2 - 10.1109/LWC.2020.3023108

DO - 10.1109/LWC.2020.3023108

M3 - Article

AN - SCOPUS:85099502899

SN - 2162-2337

VL - 10

SP - 131

EP - 135

JO - IEEE Wireless Communications Letters

JF - IEEE Wireless Communications Letters

IS - 1

M1 - 9193910

ER -
