TY - GEN
T1 - Path planning for autonomous mobile robot using transfer learning-based Q-learning
AU - Wu, Shengshuai
AU - Hu, Jinwen
AU - Zhao, Chunhui
AU - Pan, Quan
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/11/27
Y1 - 2020/11/27
N2 - Transfer learning is the process of reusing the experience of agents in source tasks to improve the performance in new target tasks. In recent years, transfer learning has received more and more attention over the reinforcement learning settings. However, when applied to reinforcement learning, many problems will be exposed, such as how is the target task different from the source task, if the mappings between the tasks are required and what knowledge is transferred. Transfer learning algorithms have mainly been applied in discrete gridworld tasks. We first introduce the traditional Q-learning to the transfer algorithm based on the measurement of distance between two MDPs. Further, inspired by Q(lambad) algorithm, this paper investigates the improved Q-learning transfer algorithm to improve the learning efficiency. Finally, the simulation is shown to verify the effectiveness of the proposed algorithms.
AB - Transfer learning is the process of reusing the experience of agents in source tasks to improve the performance in new target tasks. In recent years, transfer learning has received more and more attention over the reinforcement learning settings. However, when applied to reinforcement learning, many problems will be exposed, such as how is the target task different from the source task, if the mappings between the tasks are required and what knowledge is transferred. Transfer learning algorithms have mainly been applied in discrete gridworld tasks. We first introduce the traditional Q-learning to the transfer algorithm based on the measurement of distance between two MDPs. Further, inspired by Q(lambad) algorithm, this paper investigates the improved Q-learning transfer algorithm to improve the learning efficiency. Finally, the simulation is shown to verify the effectiveness of the proposed algorithms.
KW - Path planning
KW - Q-learning
KW - Reinforcement learning
KW - Transfer
UR - http://www.scopus.com/inward/record.url?scp=85099004578&partnerID=8YFLogxK
U2 - 10.1109/ICUS50048.2020.9274821
DO - 10.1109/ICUS50048.2020.9274821
M3 - 会议稿件
AN - SCOPUS:85099004578
T3 - Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020
SP - 88
EP - 93
BT - Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd International Conference on Unmanned Systems, ICUS 2020
Y2 - 27 November 2020 through 28 November 2020
ER -