TY - JOUR
T1 - Reinforcement Learning Based RSS-Threshold Optimization for D2D-Aided HTC/MTC in Dense NOMA Systems
AU - Zhang, Shangwei
AU - Wang, Xiao
AU - Shi, Zhenjiang
AU - Liu, Jiajia
N1 - Publisher Copyright: © 2002-2012 IEEE.
PY - 2023/10/1
Y1 - 2023/10/1
N2 - To fulfill the stringent requirements brought by human-type communication (HTC) along with massive machine-type communication (MTC), device-to-device (D2D) and non-orthogonal multiple access (NOMA) techniques will inevitably be incorporated into dense cellular networks to cater to massive connectivity and maintain high spectral efficiency. However, such a combination may lead to very complex network topologies and bring challenges in resource allocation, interference management, and transmission mode selection. Because the received signal strength (RSS) is an important factor in cellular and D2D mode selection, it can affect multi-access mode determination in D2D-aided HTC/MTC dense NOMA systems. Therefore, the RSS threshold of each cell has a great impact on system performance and should be carefully tuned. To this end, we formulate the RSS-threshold selection problem as a decentralized partially observable Markov decision process to maximize the performance of downlink and uplink communications. Accordingly, we employ a multi-agent reinforcement learning based scheme wherein each small base station acts as an agent and chooses the optimal RSS threshold to achieve the maximum sum rate by continuously interacting with the environment. Extensive simulation results reveal that our proposed scheme can improve the system sum rate and coverage by enhancing the connectivity of massive HTC and MTC devices via D2D and NOMA techniques.
AB - To fulfill the stringent requirements brought by human-type communication (HTC) along with massive machine-type communication (MTC), device-to-device (D2D) and non-orthogonal multiple access (NOMA) techniques will inevitably be incorporated into dense cellular networks to cater to massive connectivity and maintain high spectral efficiency. However, such a combination may lead to very complex network topologies and bring challenges in resource allocation, interference management, and transmission mode selection. Because the received signal strength (RSS) is an important factor in cellular and D2D mode selection, it can affect multi-access mode determination in D2D-aided HTC/MTC dense NOMA systems. Therefore, the RSS threshold of each cell has a great impact on system performance and should be carefully tuned. To this end, we formulate the RSS-threshold selection problem as a decentralized partially observable Markov decision process to maximize the performance of downlink and uplink communications. Accordingly, we employ a multi-agent reinforcement learning based scheme wherein each small base station acts as an agent and chooses the optimal RSS threshold to achieve the maximum sum rate by continuously interacting with the environment. Extensive simulation results reveal that our proposed scheme can improve the system sum rate and coverage by enhancing the connectivity of massive HTC and MTC devices via D2D and NOMA techniques.
KW - device-to-device
KW - machine-type communications
KW - non-orthogonal multiple access
KW - received signal strength
UR - http://www.scopus.com/inward/record.url?scp=85149401631&partnerID=8YFLogxK
U2 - 10.1109/TWC.2023.3244192
DO - 10.1109/TWC.2023.3244192
M3 - Article
AN - SCOPUS:85149401631
SN - 1536-1276
VL - 22
SP - 6489
EP - 6503
JO - IEEE Transactions on Wireless Communications
JF - IEEE Transactions on Wireless Communications
IS - 10
ER -