Aligning Human Intent from Imperfect Demonstrations with Confidence-Based Inverse Soft-Q Learning
Xizhou Bu, Wenjuan Li, Zhengxiong Liu, Zhiqiang Ma, Panfeng Huang
Research output: Contribution to journal › Article › peer-review