Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving

Lin Chi Wu; Zengjie Zhang; Sofie Haesaert; Zhiqiang Ma; Zhiyong Sun

doi:10.1109/IECON51785.2023.10312462

Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving

Lin Chi Wu, Zengjie Zhang, Sofie Haesaert, Zhiqiang Ma, Zhiyong Sun

School of Astronautics

Eindhoven University of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Scopus citations

Abstract

Reinforcement learning (RL) is an effective approach to motion planning in autonomous driving, where an optimal driving policy can be automatically learned using the interaction data with the environment. Nevertheless, the reward function for an RL agent, which is significant to its performance, is challenging to determine. The conventional work mainly focuses on rewarding safe driving states but does not incorporate the awareness of risky driving behaviors of the vehicles. In this paper, we investigate how to use risk-aware reward shaping to leverage the training and test performance of RL agents in autonomous driving. Based on the essential requirements that prescribe the safety specifications for general autonomous driving in practice, we propose additional reshaped reward terms that encourage exploration and penalize risky driving behaviors. A simulation study in OpenAI Gym indicates the advantage of risk-aware reward shaping for various RL agents. Also, we point out that proximal policy optimization (PPO) is likely to be the best RL method that works with risk-aware reward shaping.

Original language	English
Title of host publication	IECON 2023 - 49th Annual Conference of the IEEE Industrial Electronics Society
Publisher	IEEE Computer Society
ISBN (Electronic)	9798350331820
DOIs	https://doi.org/10.1109/IECON51785.2023.10312462
State	Published - 2023
Event	49th Annual Conference of the IEEE Industrial Electronics Society, IECON 2023 - Singapore, Singapore Duration: 16 Oct 2023 → 19 Oct 2023

Publication series

Name	IECON Proceedings (Industrial Electronics Conference)
ISSN (Print)	2162-4704
ISSN (Electronic)	2577-1647

Conference

Conference	49th Annual Conference of the IEEE Industrial Electronics Society, IECON 2023
Country/Territory	Singapore
City	Singapore
Period	16/10/23 → 19/10/23

Keywords

autonomous driving
motion planning
reinforcement learning
reward shaping
risk awareness

Access to Document

10.1109/IECON51785.2023.10312462

Cite this

@inproceedings{1539ef7a55e54cc48fb524f86ea8d079,

title = "Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving",

abstract = "Reinforcement learning (RL) is an effective approach to motion planning in autonomous driving, where an optimal driving policy can be automatically learned using the interaction data with the environment. Nevertheless, the reward function for an RL agent, which is significant to its performance, is challenging to determine. The conventional work mainly focuses on rewarding safe driving states but does not incorporate the awareness of risky driving behaviors of the vehicles. In this paper, we investigate how to use risk-aware reward shaping to leverage the training and test performance of RL agents in autonomous driving. Based on the essential requirements that prescribe the safety specifications for general autonomous driving in practice, we propose additional reshaped reward terms that encourage exploration and penalize risky driving behaviors. A simulation study in OpenAI Gym indicates the advantage of risk-aware reward shaping for various RL agents. Also, we point out that proximal policy optimization (PPO) is likely to be the best RL method that works with risk-aware reward shaping.",

keywords = "autonomous driving, motion planning, reinforcement learning, reward shaping, risk awareness",

author = "Wu, {Lin Chi} and Zengjie Zhang and Sofie Haesaert and Zhiqiang Ma and Zhiyong Sun",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 49th Annual Conference of the IEEE Industrial Electronics Society, IECON 2023 ; Conference date: 16-10-2023 Through 19-10-2023",

year = "2023",

doi = "10.1109/IECON51785.2023.10312462",

language = "英语",

series = "IECON Proceedings (Industrial Electronics Conference)",

publisher = "IEEE Computer Society",

booktitle = "IECON 2023 - 49th Annual Conference of the IEEE Industrial Electronics Society",

}

Wu, LC, Zhang, Z, Haesaert, S, Ma, Z & Sun, Z 2023, Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving. in IECON 2023 - 49th Annual Conference of the IEEE Industrial Electronics Society. IECON Proceedings (Industrial Electronics Conference), IEEE Computer Society, 49th Annual Conference of the IEEE Industrial Electronics Society, IECON 2023, Singapore, Singapore, 16/10/23. https://doi.org/10.1109/IECON51785.2023.10312462

Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving. / Wu, Lin Chi; Zhang, Zengjie; Haesaert, Sofie et al.
IECON 2023 - 49th Annual Conference of the IEEE Industrial Electronics Society. IEEE Computer Society, 2023. (IECON Proceedings (Industrial Electronics Conference)).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving

AU - Wu, Lin Chi

AU - Zhang, Zengjie

AU - Haesaert, Sofie

AU - Ma, Zhiqiang

AU - Sun, Zhiyong

PY - 2023

Y1 - 2023

N2 - Reinforcement learning (RL) is an effective approach to motion planning in autonomous driving, where an optimal driving policy can be automatically learned using the interaction data with the environment. Nevertheless, the reward function for an RL agent, which is significant to its performance, is challenging to determine. The conventional work mainly focuses on rewarding safe driving states but does not incorporate the awareness of risky driving behaviors of the vehicles. In this paper, we investigate how to use risk-aware reward shaping to leverage the training and test performance of RL agents in autonomous driving. Based on the essential requirements that prescribe the safety specifications for general autonomous driving in practice, we propose additional reshaped reward terms that encourage exploration and penalize risky driving behaviors. A simulation study in OpenAI Gym indicates the advantage of risk-aware reward shaping for various RL agents. Also, we point out that proximal policy optimization (PPO) is likely to be the best RL method that works with risk-aware reward shaping.

AB - Reinforcement learning (RL) is an effective approach to motion planning in autonomous driving, where an optimal driving policy can be automatically learned using the interaction data with the environment. Nevertheless, the reward function for an RL agent, which is significant to its performance, is challenging to determine. The conventional work mainly focuses on rewarding safe driving states but does not incorporate the awareness of risky driving behaviors of the vehicles. In this paper, we investigate how to use risk-aware reward shaping to leverage the training and test performance of RL agents in autonomous driving. Based on the essential requirements that prescribe the safety specifications for general autonomous driving in practice, we propose additional reshaped reward terms that encourage exploration and penalize risky driving behaviors. A simulation study in OpenAI Gym indicates the advantage of risk-aware reward shaping for various RL agents. Also, we point out that proximal policy optimization (PPO) is likely to be the best RL method that works with risk-aware reward shaping.

KW - autonomous driving

KW - motion planning

KW - reinforcement learning

KW - reward shaping

KW - risk awareness

UR - http://www.scopus.com/inward/record.url?scp=85179509126&partnerID=8YFLogxK

U2 - 10.1109/IECON51785.2023.10312462

DO - 10.1109/IECON51785.2023.10312462

M3 - 会议稿件

AN - SCOPUS:85179509126

T3 - IECON Proceedings (Industrial Electronics Conference)

BT - IECON 2023 - 49th Annual Conference of the IEEE Industrial Electronics Society

PB - IEEE Computer Society

T2 - 49th Annual Conference of the IEEE Industrial Electronics Society, IECON 2023

Y2 - 16 October 2023 through 19 October 2023

ER -

Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this