Controlling underestimation bias in reinforcement learning via minmax operation

Fanghui HUANG, Yixin HE, Yu ZHANG, Xinyang DENG, Wen JIANG

Research output: Contribution to journal, Article, peer-reviewed

2 Citations (Scopus)

Fingerprint

Explore the research topics of 'Controlling underestimation bias in reinforcement learning via minmax operation'. Together they form a unique fingerprint.

Computer Science

Neuroscience

Chemical Engineering

Psychology