Enhancing Simultaneous Arrival at a Dynamic Target: A Hybrid Approach with Proximal Policy Optimization and Expert Rules

Yifei Lei, Jinwen Hu, Zhao Xu, Chenqi Gao, Jiatong Li

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

Abstract

This paper addresses the complex challenge of ensuring simultaneous arrival of multiple agents at a dynamic target, a critical requirement for operations such as roundups and saturation attacks. Traditional guidance systems have faced significant hurdles due to inaccurate flight time estimates and difficulties adapting to high-speed maneuvers. To overcome these limitations, we introduce a novel framework that utilizes Proximal Policy Optimization (PPO) and expert rules. This approach leverages distributed computing to enable autonomous decision-making among agents, thereby simplifying the deep reinforcement learning model and reducing computational overhead, which enhances scalability and adaptability. Additionally, we incorporate an angular velocity reward into the reward function, improving the predictability and effectiveness of maneuvers, particularly for targets with high-speed and irregular trajectories. The proposed methods have been rigorously tested through numerous simulations and high-fidelity scenarios, confirming their robustness and superior performance over traditional and enhanced proportional guidance systems.
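For readers unfamiliar with PPO, the following is a minimal sketch of the standard clipped surrogate objective that PPO maximizes. It is the textbook form of the algorithm, not the authors' implementation: the abstract gives no details of their network, reward weights, or expert rules, so the function name `ppo_clipped_objective`, its arguments, and the default `clip_eps=0.2` are illustrative assumptions.

```python
import math


def ppo_clipped_objective(new_logp, old_logp, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective (a quantity to maximize).

    new_logp / old_logp: log-probabilities of the taken actions under the
    current and the data-collecting policy (hypothetical inputs).
    advantages: advantage estimates for the same samples.
    clip_eps: clipping range; 0.2 is a common default, assumed here.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        # Probability ratio between the current and the old policy.
        ratio = math.exp(lp_new - lp_old)
        # Restrict the ratio to [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the pessimistic (smaller) of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)
```

In a setup like the one the abstract describes, the advantage estimates would be computed from the shaped reward (including the angular velocity term), while the expert rules would act outside this objective. How the two are combined is not stated in the abstract.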

Original language: English
Title of host publication: 2024 18th International Conference on Control, Automation, Robotics and Vision, ICARCV 2024
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 43-48
Number of pages: 6
ISBN (Electronic): 9798331518493
State: Published - 2024
Event: 18th International Conference on Control, Automation, Robotics and Vision, ICARCV 2024 - Dubai, United Arab Emirates
Duration: 12 Dec 2024 to 15 Dec 2024

Publication series

Name: 2024 18th International Conference on Control, Automation, Robotics and Vision, ICARCV 2024

Conference

Conference: 18th International Conference on Control, Automation, Robotics and Vision, ICARCV 2024
Country/Territory: United Arab Emirates
City: Dubai
Period: 12/12/24 to 15/12/24
