Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving

Lin Chi Wu, Zengjie Zhang, Sofie Haesaert, Zhiqiang Ma, Zhiyong Sun

科研成果: 书/报告/会议事项章节会议稿件同行评审

2 引用 (Scopus)

摘要

Reinforcement learning (RL) is an effective approach to motion planning in autonomous driving, where an optimal driving policy can be automatically learned using the interaction data with the environment. Nevertheless, the reward function for an RL agent, which is significant to its performance, is challenging to determine. The conventional work mainly focuses on rewarding safe driving states but does not incorporate the awareness of risky driving behaviors of the vehicles. In this paper, we investigate how to use risk-aware reward shaping to leverage the training and test performance of RL agents in autonomous driving. Based on the essential requirements that prescribe the safety specifications for general autonomous driving in practice, we propose additional reshaped reward terms that encourage exploration and penalize risky driving behaviors. A simulation study in OpenAI Gym indicates the advantage of risk-aware reward shaping for various RL agents. Also, we point out that proximal policy optimization (PPO) is likely to be the best RL method that works with risk-aware reward shaping.

源语言英语
主期刊名IECON 2023 - 49th Annual Conference of the IEEE Industrial Electronics Society
出版商IEEE Computer Society
ISBN(电子版)9798350331820
DOI
出版状态已出版 - 2023
活动49th Annual Conference of the IEEE Industrial Electronics Society, IECON 2023 - Singapore, 新加坡
期限: 16 10月 202319 10月 2023

出版系列

姓名IECON Proceedings (Industrial Electronics Conference)
ISSN(印刷版)2162-4704
ISSN(电子版)2577-1647

会议

会议49th Annual Conference of the IEEE Industrial Electronics Society, IECON 2023
国家/地区新加坡
Singapore
时期16/10/2319/10/23

指纹

探究 'Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving' 的科研主题。它们共同构成独一无二的指纹。

引用此