Efficient deep reinforcement learning through policy transfer

Tianpei Yang; Jianye Hao; Zhaopeng Meng; Zongzhang Zhang; Yujing Hu; Yingfeng Chen; Changjie Fan; Weixun Wang; Zhaodong Wang; Jiajie Peng

Efficient deep reinforcement learning through policy transfer

Tianpei Yang, Jianye Hao, Zhaopeng Meng, Zongzhang Zhang, Yujing Hu, Yingfeng Chen, Changjie Fan, Weixun Wang, Zhaodong Wang, Jiajie Peng

School of Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

6 Scopus citations

Abstract

Transfer Learning (TL) has shown great potential to accelerate Reinforcement Learning (RL) by leveraging prior knowledge from past learned policies of relevant tasks. Existing TL approaches either explicitly computes the similarity between tasks or select appropriate source policies to provide guided explorations for the target task. However, how to directly optimize the target policy by alternatively utilizing knowledge from appropriate source policies without explicitly measuring the similarity is currently missing. In this paper, we propose a novel Policy Transfer Framework (PTF) by taking advantage of this idea. PTF learns when and which source policy is the best to reuse for the target policy and when to terminate it by modeling multi-policy transfer as the option learning problem. PTF can be easily combined with existing deep RL approaches. Experimental results show it significantly accelerates the learning process and outperforms state-of-the-art policy transfer methods in both discrete and continuous action spaces.

Original language	English
Title of host publication	Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020
Editors	Bo An, Amal El Fallah Seghrouchni, Gita Sukthankar
Publisher	International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
Pages	2053-2055
Number of pages	3
ISBN (Electronic)	9781450375184
State	Published - 2020
Event	19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020 - Virtual, Auckland, New Zealand Duration: 19 May 2020 → …

Publication series

Name	Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
Volume	2020-May
ISSN (Print)	1548-8403
ISSN (Electronic)	1558-2914

Conference

Conference	19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020
Country/Territory	New Zealand
City	Virtual, Auckland
Period	19/05/20 → …

Keywords

Policy reuse
Policy transfer
Reinforcement learning

Cite this

Yang, T., Hao, J., Meng, Z., Zhang, Z., Hu, Y., Chen, Y., Fan, C., Wang, W., Wang, Z., & Peng, J. (2020). Efficient deep reinforcement learning through policy transfer. In B. An, A. El Fallah Seghrouchni, & G. Sukthankar (Eds.), Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020 (pp. 2053-2055). (Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS; Vol. 2020-May). International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS).

Yang, Tianpei ; Hao, Jianye ; Meng, Zhaopeng et al. / Efficient deep reinforcement learning through policy transfer. Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020. editor / Bo An ; Amal El Fallah Seghrouchni ; Gita Sukthankar. International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), 2020. pp. 2053-2055 (Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS).

@inproceedings{2e590d332aad402f927fde981ce802d1,

title = "Efficient deep reinforcement learning through policy transfer",

abstract = "Transfer Learning (TL) has shown great potential to accelerate Reinforcement Learning (RL) by leveraging prior knowledge from past learned policies of relevant tasks. Existing TL approaches either explicitly computes the similarity between tasks or select appropriate source policies to provide guided explorations for the target task. However, how to directly optimize the target policy by alternatively utilizing knowledge from appropriate source policies without explicitly measuring the similarity is currently missing. In this paper, we propose a novel Policy Transfer Framework (PTF) by taking advantage of this idea. PTF learns when and which source policy is the best to reuse for the target policy and when to terminate it by modeling multi-policy transfer as the option learning problem. PTF can be easily combined with existing deep RL approaches. Experimental results show it significantly accelerates the learning process and outperforms state-of-the-art policy transfer methods in both discrete and continuous action spaces.",

keywords = "Policy reuse, Policy transfer, Reinforcement learning",

author = "Tianpei Yang and Jianye Hao and Zhaopeng Meng and Zongzhang Zhang and Yujing Hu and Yingfeng Chen and Changjie Fan and Weixun Wang and Zhaodong Wang and Jiajie Peng",

note = "Publisher Copyright: {\textcopyright} 2020 International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS). All rights reserved.; 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020 ; Conference date: 19-05-2020",

year = "2020",

language = "英语",

series = "Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS",

publisher = "International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)",

pages = "2053--2055",

editor = "Bo An and {El Fallah Seghrouchni}, Amal and Gita Sukthankar",

booktitle = "Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020",

}

Yang, T, Hao, J, Meng, Z, Zhang, Z, Hu, Y, Chen, Y, Fan, C, Wang, W, Wang, Z & Peng, J 2020, Efficient deep reinforcement learning through policy transfer. in B An, A El Fallah Seghrouchni & G Sukthankar (eds), Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020. Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS, vol. 2020-May, International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), pp. 2053-2055, 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020, Virtual, Auckland, New Zealand, 19/05/20.

Efficient deep reinforcement learning through policy transfer. / Yang, Tianpei; Hao, Jianye; Meng, Zhaopeng et al.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020. ed. / Bo An; Amal El Fallah Seghrouchni; Gita Sukthankar. International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), 2020. p. 2053-2055 (Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS; Vol. 2020-May).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Efficient deep reinforcement learning through policy transfer

AU - Yang, Tianpei

AU - Hao, Jianye

AU - Meng, Zhaopeng

AU - Zhang, Zongzhang

AU - Hu, Yujing

AU - Chen, Yingfeng

AU - Fan, Changjie

AU - Wang, Weixun

AU - Wang, Zhaodong

AU - Peng, Jiajie

PY - 2020

Y1 - 2020

N2 - Transfer Learning (TL) has shown great potential to accelerate Reinforcement Learning (RL) by leveraging prior knowledge from past learned policies of relevant tasks. Existing TL approaches either explicitly computes the similarity between tasks or select appropriate source policies to provide guided explorations for the target task. However, how to directly optimize the target policy by alternatively utilizing knowledge from appropriate source policies without explicitly measuring the similarity is currently missing. In this paper, we propose a novel Policy Transfer Framework (PTF) by taking advantage of this idea. PTF learns when and which source policy is the best to reuse for the target policy and when to terminate it by modeling multi-policy transfer as the option learning problem. PTF can be easily combined with existing deep RL approaches. Experimental results show it significantly accelerates the learning process and outperforms state-of-the-art policy transfer methods in both discrete and continuous action spaces.

AB - Transfer Learning (TL) has shown great potential to accelerate Reinforcement Learning (RL) by leveraging prior knowledge from past learned policies of relevant tasks. Existing TL approaches either explicitly computes the similarity between tasks or select appropriate source policies to provide guided explorations for the target task. However, how to directly optimize the target policy by alternatively utilizing knowledge from appropriate source policies without explicitly measuring the similarity is currently missing. In this paper, we propose a novel Policy Transfer Framework (PTF) by taking advantage of this idea. PTF learns when and which source policy is the best to reuse for the target policy and when to terminate it by modeling multi-policy transfer as the option learning problem. PTF can be easily combined with existing deep RL approaches. Experimental results show it significantly accelerates the learning process and outperforms state-of-the-art policy transfer methods in both discrete and continuous action spaces.

KW - Policy reuse

KW - Policy transfer

KW - Reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85096692016&partnerID=8YFLogxK

M3 - 会议稿件

AN - SCOPUS:85096692016

T3 - Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS

SP - 2053

EP - 2055

BT - Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020

A2 - An, Bo

A2 - El Fallah Seghrouchni, Amal

A2 - Sukthankar, Gita

PB - International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)

T2 - 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020

Y2 - 19 May 2020

ER -

Yang T, Hao J, Meng Z, Zhang Z, Hu Y, Chen Y et al. Efficient deep reinforcement learning through policy transfer. In An B, El Fallah Seghrouchni A, Sukthankar G, editors, Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020. International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS). 2020. p. 2053-2055. (Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS).

Efficient deep reinforcement learning through policy transfer

Abstract

Publication series

Conference

Keywords

Other files and links

Fingerprint

Cite this