TY - GEN
T1 - Research on Industrial Cyber Range Based on Multi-agent Cooperative Optimization
AU - Miao, Shangting
AU - Li, Yang
AU - Pan, Quan
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2022
Y1 - 2022
N2 - Combining the characteristics of multi-agents and the prototype structure of the industrial cyber range (ICR), a multi-agent cooperative optimization ICR routing algorithm is proposed. It is used to relieve the pressure of the upper computer computing interaction between the information layer of the industrial control network and the field layer. Use the dependency relationship between environment perception and interactive decision-making of multi-agent reinforcement learning to establish an ICR multi-agent cooperative model, decompose ICR into the central brain of the industrial control network and distributed intelligent routing modules, abstract each module into an agent, and apply it to the industrial control network. The learning of historical data realizes the optimal feedback of the agent to the Part I environment and computing resource requirements, and improves the congestion environment of the partial industrial control network. The experimental results show that, compared with the Q-Learning algorithm, multi-agent cooperative optimization of the ICR routing algorithm improves the utility of the industrial control network in the Part I area to a certain extent.
AB - Combining the characteristics of multi-agents and the prototype structure of the industrial cyber range (ICR), a multi-agent cooperative optimization ICR routing algorithm is proposed. It is used to relieve the pressure of the upper computer computing interaction between the information layer of the industrial control network and the field layer. Use the dependency relationship between environment perception and interactive decision-making of multi-agent reinforcement learning to establish an ICR multi-agent cooperative model, decompose ICR into the central brain of the industrial control network and distributed intelligent routing modules, abstract each module into an agent, and apply it to the industrial control network. The learning of historical data realizes the optimal feedback of the agent to the Part I environment and computing resource requirements, and improves the congestion environment of the partial industrial control network. The experimental results show that, compared with the Q-Learning algorithm, multi-agent cooperative optimization of the ICR routing algorithm improves the utility of the industrial control network in the Part I area to a certain extent.
KW - Agent
KW - Cooperative optimization
KW - Industrial cyber range
UR - http://www.scopus.com/inward/record.url?scp=85130966987&partnerID=8YFLogxK
U2 - 10.1007/978-981-16-9492-9_83
DO - 10.1007/978-981-16-9492-9_83
M3 - 会议稿件
AN - SCOPUS:85130966987
SN - 9789811694912
T3 - Lecture Notes in Electrical Engineering
SP - 843
EP - 851
BT - Proceedings of 2021 International Conference on Autonomous Unmanned Systems, ICAUS 2021
A2 - Wu, Meiping
A2 - Niu, Yifeng
A2 - Gu, Mancang
A2 - Cheng, Jin
PB - Springer Science and Business Media Deutschland GmbH
T2 - International Conference on Autonomous Unmanned Systems, ICAUS 2021
Y2 - 24 September 2021 through 26 September 2021
ER -