SCRL: Self-supervised Continual Reinforcement Learning for Domain Adaptation

Yuyang Fang, Bin Guo, Jiaqi Liu, Kaixing Zhao, Yasan Ding, Na Wang, Zhiwen Yu

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Deep reinforcement learning outperforms humans in many tasks. However, in reality, the scenarios of agent deployment often change, and the performance of models may therefore degrade, resulting in unreasonable decisions in new scenarios. Therefore, it is important for the model to own the evolution ability in new scenarios. Although recent attempts have studied the continual training of models through reward signals or self-supervised learning, on the one hand, previously used reward signals in the target domain are generally difficult to obtain and on the other hand, the performance of self-supervised evolution is always limited. In addition, adapting to the target domain usually results in forgetting the source domain. To overcome the above problems, in this paper, we propose a self-supervised continual reinforcement learning method, called SCRL, which combines reinforcement learning tasks with self-supervised learning auxiliary tasks and weight regularizes. Specifically, in the source domain, we first jointly train different objectives of reinforcement and self-supervised learning, which enables to share the same encoder. The Fisher information matrix is then calculated to record the importance of the model parameters to the source domain. In the target domain, the encoder is used for the reinforcement learning task and is updated by self-supervised learning under the control of the Fisher regularizer. Our extensive experiments on four continuous control tasks from the DeepMind Control suite showed that in the absence of reward signals, the proposed SCRL can effectively adapt the model to the target domain without catastrophically forgetting the source domain.

源语言英语
主期刊名Proceedings - 2023 International Conference on Artificial Intelligence of Things and Systems, AIoTSys 2023
出版商Institute of Electrical and Electronics Engineers Inc.
55-63
页数9
ISBN(电子版)9798350312270
DOI
出版状态已出版 - 2023
活动2023 International Conference on Artificial Intelligence of Things and Systems, AIoTSys 2023 - Xi�an, 中国
期限: 19 10月 202322 10月 2023

出版系列

姓名Proceedings - 2023 International Conference on Artificial Intelligence of Things and Systems, AIoTSys 2023

会议

会议2023 International Conference on Artificial Intelligence of Things and Systems, AIoTSys 2023
国家/地区中国
Xi�an
时期19/10/2322/10/23

指纹

探究 'SCRL: Self-supervised Continual Reinforcement Learning for Domain Adaptation' 的科研主题。它们共同构成独一无二的指纹。

引用此