基于序贯博弈多智能体强化学习的综合模块化航空电子系统重构方法

Translated title of the contribution: Integrated Modular Avionics System Reconstruction Method Based on Sequential Game Multi-agent Reinforcement Learning

Tao Zhang, Wen Tao Zhang, Ling Dai, Jing Yi Chen, Li Wang, Qian Ru Wei

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Dynamic reconfiguration is an efficient fault-tolerant approach for integrated modular avionics(IMA) systems. The reconfiguration blueprint defines the application migration and resource reconfiguration scheme in the system failure environment, which is the key to reconfiguring and recovering the system function with minimum cost. How to generate effective reconfiguration blueprints rapidly and automatically in complex multi-level associated failure modes is the difficulty. This paper proposes an IMA system reconfiguration method based on sequential game multi-agent reinforcement learning to solve the problem. The sequential game model is introduced in this method. We define the application software needs to be migrated as the agent in the game. The sequence of sequential game is determined according to the priority of the application software. Aiming at the problem of competition and cooperation among multiple agents in the process of sequential game, the algorithm introduces policy gradient of reinforcement learning and optimizes the reconfiguration effect by controlling the action selection probability in interaction with the environment. The policy gradient Monte Carlo tree search algorithm based on biased estimation is applied to update game strategy, which solves the problems of oscillation, difficulty in convergence, long calculation time of the traditional policy gradient algorithm. Experimental results indicate that compared with differential evolution and Q-learning methods, the proposed algorithm has significant advantages in convergence and efficiency.

Translated title of the contributionIntegrated Modular Avionics System Reconstruction Method Based on Sequential Game Multi-agent Reinforcement Learning
Original languageChinese (Traditional)
Pages (from-to)954-966
Number of pages13
JournalTien Tzu Hsueh Pao/Acta Electronica Sinica
Volume50
Issue number4
DOIs
StatePublished - Apr 2022

Fingerprint

Dive into the research topics of 'Integrated Modular Avionics System Reconstruction Method Based on Sequential Game Multi-agent Reinforcement Learning'. Together they form a unique fingerprint.

Cite this