Multi-object aerodynamic design optimization using deep reinforcement learning

Xinyu Hui; Hui Wang; Wenqiang Li; Junqiang Bai; Fei Qin; Guoqiang He

doi:10.1063/5.0058088

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

13 Citations (Scopus)

Abstract

Aerodynamic design optimization is a key aspect in aircraft design. The further evolution of advanced aircraft derivatives requires a powerful optimization toolbox. Reinforcement learning (RL) is a powerful optimization tool but has rarely been utilized in the aerodynamic design. It can potentially obtain results similar to those of a human designer, by accumulating experience from training. In this work, a popular RL method called proximal policy optimization (PPO) is proposed to investigate multi-object aerodynamic design optimization. By observing the aerodynamic performances of different airfoils, the PPO updates a reasonable policy to generate the optimal airfoils in a single step. In a Pareto optimization problem with constraints, the PPO requires only 15% of the computational time of the non-dominated sorted genetic algorithm (II) to achieve the same accuracy. The results from testing show that the agent learns a policy that can achieve ∼4.3%-10.1% improvements of the aerodynamic performance compared with the results of baseline.

Original language	English
Article number	085311
Journal	AIP Advances
Volume	11
Issue number	8
DOI	https://doi.org/10.1063/5.0058088
Publication status	Published - 1 Aug 2021

Cite this

@article{9b2362787bd84fefb89c7b7b9bc8de83,

title = "Multi-object aerodynamic design optimization using deep reinforcement learning",

abstract = "Aerodynamic design optimization is a key aspect in aircraft design. The further evolution of advanced aircraft derivatives requires a powerful optimization toolbox. Reinforcement learning (RL) is a powerful optimization tool but has rarely been utilized in the aerodynamic design. It can potentially obtain results similar to those of a human designer, by accumulating experience from training. In this work, a popular RL method called proximal policy optimization (PPO) is proposed to investigate multi-object aerodynamic design optimization. By observing the aerodynamic performances of different airfoils, the PPO updates a reasonable policy to generate the optimal airfoils in a single step. In a Pareto optimization problem with constraints, the PPO requires only 15% of the computational time of the non-dominated sorted genetic algorithm (II) to achieve the same accuracy. The results from testing show that the agent learns a policy that can achieve ∼4.3%-10.1% improvements of the aerodynamic performance compared with the results of baseline.",

author = "Xinyu Hui and Hui Wang and Wenqiang Li and Junqiang Bai and Fei Qin and Guoqiang He",

note = "Publisher Copyright: {\textcopyright} 2021 Author(s).",

year = "2021",

month = aug,

day = "1",

doi = "10.1063/5.0058088",

language = "English",

volume = "11",

journal = "AIP Advances",

issn = "2158-3226",

publisher = "American Institute of Physics",

number = "8",

}

TY - JOUR

T1 - Multi-object aerodynamic design optimization using deep reinforcement learning

AU - Hui, Xinyu

AU - Wang, Hui

AU - Li, Wenqiang

AU - Bai, Junqiang

AU - Qin, Fei

AU - He, Guoqiang

PY - 2021/8/1

Y1 - 2021/8/1

N2 - Aerodynamic design optimization is a key aspect in aircraft design. The further evolution of advanced aircraft derivatives requires a powerful optimization toolbox. Reinforcement learning (RL) is a powerful optimization tool but has rarely been utilized in the aerodynamic design. It can potentially obtain results similar to those of a human designer, by accumulating experience from training. In this work, a popular RL method called proximal policy optimization (PPO) is proposed to investigate multi-object aerodynamic design optimization. By observing the aerodynamic performances of different airfoils, the PPO updates a reasonable policy to generate the optimal airfoils in a single step. In a Pareto optimization problem with constraints, the PPO requires only 15% of the computational time of the non-dominated sorted genetic algorithm (II) to achieve the same accuracy. The results from testing show that the agent learns a policy that can achieve ∼4.3%-10.1% improvements of the aerodynamic performance compared with the results of baseline.

AB - Aerodynamic design optimization is a key aspect in aircraft design. The further evolution of advanced aircraft derivatives requires a powerful optimization toolbox. Reinforcement learning (RL) is a powerful optimization tool but has rarely been utilized in the aerodynamic design. It can potentially obtain results similar to those of a human designer, by accumulating experience from training. In this work, a popular RL method called proximal policy optimization (PPO) is proposed to investigate multi-object aerodynamic design optimization. By observing the aerodynamic performances of different airfoils, the PPO updates a reasonable policy to generate the optimal airfoils in a single step. In a Pareto optimization problem with constraints, the PPO requires only 15% of the computational time of the non-dominated sorted genetic algorithm (II) to achieve the same accuracy. The results from testing show that the agent learns a policy that can achieve ∼4.3%-10.1% improvements of the aerodynamic performance compared with the results of baseline.

UR - http://www.scopus.com/inward/record.url?scp=85112352458&partnerID=8YFLogxK

U2 - 10.1063/5.0058088

DO - 10.1063/5.0058088

M3 - Article

AN - SCOPUS:85112352458

SN - 2158-3226

VL - 11

JO - AIP Advances

JF - AIP Advances

IS - 8

M1 - 085311

ER -
