Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning

Chenjia Bai, Lingxiao Wang, Jianye Hao, Zhuoran Yang, Bin Zhao, Zhen Wang, Xuelong Li

Research output: Contribution to journal › Article › peer-review

7 Citations (Scopus)

Fingerprint

Explore the research topics of 'Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning'. Together they form a unique fingerprint.

Computer Science