A novel policy based on action confidence limit to improve exploration efficiency in reinforcement learning

Fanghui Huang, Xinyang Deng, Yixin He, Wen Jiang

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Fingerprint

Dive into the research topics of 'A novel policy based on action confidence limit to improve exploration efficiency in reinforcement learning'. Together they form a unique fingerprint.

Computer Science