摘要
This paper addresses the coverage path planning problem when an unmanned aerial vehicle (UAV) surveys an unknown site composed of multiple isolated areas. The problem is typically NP-hard and cannot be easily solved, especially when considering the scale of each area. By decomposing the problem into two cascaded sub-problems—1) covering a specific polygon area and 2) determining the optimal visiting order of different areas—an approximate solution can be found more efficiently. First, the target areas are approximated as convex polygons, and the coverage pattern is designed based on four control points. Then, the optimal visiting order is determined based on a state defined by area indices and control points. We propose two different optimization methods to solve this problem. The first method is a direct extension of the genetic algorithm, using a customized coding method. The second method is a reinforcement learning-based (RL-based) approach that solves the problem as a variant of the Traveling Salesman Problem (TSP) through end-to-end policy training. The simulation results indicate that the proposed methods can provide solutions to the multiple-area coverage problem with competitive optimality and efficiency.
源语言 | 英语 |
---|---|
页(从-至) | 1-12 |
页数 | 12 |
期刊 | IEEE Transactions on Cognitive and Developmental Systems |
DOI | |
出版状态 | 已接受/待刊 - 2024 |