TY - GEN
T1 - Optimal transmit antenna selection strategy for MIMO wiretap channel based on deep reinforcement learning
AU - Hu, Youbing
AU - Li, Lixin
AU - Yin, Jiaying
AU - Zhang, Huisheng
AU - Liang, Wei
AU - Gao, Ang
AU - Han, Zhu
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/7/2
Y1 - 2018/7/2
N2 - Antenna selection is often used for physical layer security to implement secure communications. However, due to the rapid changes of the main channel and the feedback delay of the channel state information (CSI), the transmitter obtains outdated CSI, and the outdated CSI leads to the outdated optimal transmit antenna. In order to improve the security of the system based on outdated CSI, in this paper, we propose a deep reinforcement learning framework of Deep Q Network (DQN) to predict the optimal transmit antenna in the multiple input multiple output (MIMO) wiretap channel. The legitimate receiver receives the pilot signals from each transmitting antenna, and the signal-To-noise ratio (SNR) of the pilot signals transmitted by each transmitting antenna can be obtained through maximal ratio combining. And then the legitimate receiver uses the DQN to predict the transmitting antenna at the next moment according to these SNRs. The simulation results show that DQN algorithm can effectively predict the optimal antenna at the next moment, and reduce the secrecy outage probability of MIMO wiretap system, compared with the traditional algorithm.
AB - Antenna selection is often used for physical layer security to implement secure communications. However, due to the rapid changes of the main channel and the feedback delay of the channel state information (CSI), the transmitter obtains outdated CSI, and the outdated CSI leads to the outdated optimal transmit antenna. In order to improve the security of the system based on outdated CSI, in this paper, we propose a deep reinforcement learning framework of Deep Q Network (DQN) to predict the optimal transmit antenna in the multiple input multiple output (MIMO) wiretap channel. The legitimate receiver receives the pilot signals from each transmitting antenna, and the signal-To-noise ratio (SNR) of the pilot signals transmitted by each transmitting antenna can be obtained through maximal ratio combining. And then the legitimate receiver uses the DQN to predict the transmitting antenna at the next moment according to these SNRs. The simulation results show that DQN algorithm can effectively predict the optimal antenna at the next moment, and reduce the secrecy outage probability of MIMO wiretap system, compared with the traditional algorithm.
KW - Deep Q Network
KW - MIMO wiretap channel
KW - physical layer security
KW - transmit antenna selection
UR - http://www.scopus.com/inward/record.url?scp=85063108002&partnerID=8YFLogxK
U2 - 10.1109/ICCChina.2018.8641085
DO - 10.1109/ICCChina.2018.8641085
M3 - 会议稿件
AN - SCOPUS:85063108002
T3 - 2018 IEEE/CIC International Conference on Communications in China, ICCC 2018
SP - 803
EP - 807
BT - 2018 IEEE/CIC International Conference on Communications in China, ICCC 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2018 IEEE/CIC International Conference on Communications in China, ICCC 2018
Y2 - 16 August 2018 through 18 August 2018
ER -