摘要
In this letter, a learning-aided resource allocation scheme based on the constrained Markov decision process (CMDP) is proposed to improve the average network energy efficiency (EE) with the constrained quality of service (QoS) in the pattern division multiple access (PDMA)-based simultaneous wireless information and power transfer (SWIPT) system. In order to solve the formulated CMDP resource allocation problem, the Lagrange duality is adopted to transform CMDP into an unconstrained Markov decision process (MDP). Due to the instability of the practical system, the Deep Q Network (DQN)-based CMDP scheme is proposed to obtain the optimal solution. The simulation results verify the proposed scheme converges faster than the benchmark in terms of increasing average network EE.
| 源语言 | 英语 |
|---|---|
| 文章编号 | 9193910 |
| 页(从-至) | 131-135 |
| 页数 | 5 |
| 期刊 | IEEE Wireless Communications Letters |
| 卷 | 10 |
| 期 | 1 |
| DOI | |
| 出版状态 | 已出版 - 1月 2021 |
联合国可持续发展目标
此成果有助于实现下列可持续发展目标:
-
可持续发展目标 7 经济适用的清洁能源
指纹
探究 'Learning-Aided Resource Allocation for Pattern Division Multiple Access-Based SWIPT Systems' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver