TY - GEN
T1 - Robust Quadrupedal Locomotion via Risk-Averse Policy Learning
AU - Shi, Jiyuan
AU - Bai, Chenjia
AU - He, Haoran
AU - Han, Lei
AU - Wang, Dong
AU - Zhao, Bin
AU - Zhao, Mingguo
AU - Li, Xiu
AU - Li, Xuelong
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The robustness of legged locomotion is crucial for quadrupedal robots in challenging terrains. Recently, Reinforcement Learning (RL) has shown promising results in legged locomotion, and various methods attempt to integrate privileged distillation, scene modeling, and external sensors to improve the generalization and robustness of locomotion policies. However, these methods struggle to handle uncertain scenarios such as abrupt terrain changes or unexpected external forces. In this paper, we consider a novel risk-sensitive perspective to enhance the robustness of legged locomotion. Specifically, we employ a distributional value function learned by quantile regression to model the aleatoric uncertainty of environments, and perform risk-averse policy learning by optimizing the worst-case scenarios via a risk distortion measure. Extensive experiments in both simulation environments and on a real Aliengo robot demonstrate that our method is efficient in handling various external disturbances, and the resulting policy exhibits improved robustness in harsh and uncertain situations in legged locomotion.
AB - The robustness of legged locomotion is crucial for quadrupedal robots in challenging terrains. Recently, Reinforcement Learning (RL) has shown promising results in legged locomotion, and various methods attempt to integrate privileged distillation, scene modeling, and external sensors to improve the generalization and robustness of locomotion policies. However, these methods struggle to handle uncertain scenarios such as abrupt terrain changes or unexpected external forces. In this paper, we consider a novel risk-sensitive perspective to enhance the robustness of legged locomotion. Specifically, we employ a distributional value function learned by quantile regression to model the aleatoric uncertainty of environments, and perform risk-averse policy learning by optimizing the worst-case scenarios via a risk distortion measure. Extensive experiments in both simulation environments and on a real Aliengo robot demonstrate that our method is efficient in handling various external disturbances, and the resulting policy exhibits improved robustness in harsh and uncertain situations in legged locomotion.
UR - http://www.scopus.com/inward/record.url?scp=85194490892&partnerID=8YFLogxK
U2 - 10.1109/ICRA57147.2024.10610086
DO - 10.1109/ICRA57147.2024.10610086
M3 - Conference contribution
AN - SCOPUS:85194490892
T3 - Proceedings - IEEE International Conference on Robotics and Automation
SP - 11459
EP - 11466
BT - 2024 IEEE International Conference on Robotics and Automation, ICRA 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE International Conference on Robotics and Automation, ICRA 2024
Y2 - 13 May 2024 through 17 May 2024
ER -