Skip to main navigation Skip to search Skip to main content

Learning-Aided Resource Allocation for Pattern Division Multiple Access-Based SWIPT Systems

  • Northwestern Polytechnical University Xian
  • Queen Mary University of London
  • University of Houston

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

In this letter, a learning-aided resource allocation scheme based on the constrained Markov decision process (CMDP) is proposed to improve the average network energy efficiency (EE) with the constrained quality of service (QoS) in the pattern division multiple access (PDMA)-based simultaneous wireless information and power transfer (SWIPT) system. In order to solve the formulated CMDP resource allocation problem, the Lagrange duality is adopted to transform CMDP into an unconstrained Markov decision process (MDP). Due to the instability of the practical system, the Deep Q Network (DQN)-based CMDP scheme is proposed to obtain the optimal solution. The simulation results verify the proposed scheme converges faster than the benchmark in terms of increasing average network EE.

Original languageEnglish
Article number9193910
Pages (from-to)131-135
Number of pages5
JournalIEEE Wireless Communications Letters
Volume10
Issue number1
DOIs
StatePublished - Jan 2021

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Simultaneous wireless information and power transfer (SWIPT)
  • constrained Markov decision process (CMDP)
  • deep Q network (DQN)
  • pattern division multiple access (PDMA)

Fingerprint

Dive into the research topics of 'Learning-Aided Resource Allocation for Pattern Division Multiple Access-Based SWIPT Systems'. Together they form a unique fingerprint.

Cite this