Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning

Chenjia Bai, Lingxiao Wang, Jianye Hao, Zhuoran Yang, Bin Zhao, Zhen Wang, Xuelong Li

Research output: Contribution to journal › Article › peer-review

7 Scopus citations
