TY - JOUR
T1 - Localizing state space for visual reinforcement learning in noisy environments
AU - Cheng, Jing
AU - Li, Jingchen
AU - Shi, Haobin
AU - Zhang, Tao
N1 - Publisher Copyright:
© 2025
PY - 2025/9/15
Y1 - 2025/9/15
N2 - Learning robust policies is a central goal of the visual reinforcement learning community. In practical applications, noise in the environment leads to larger variance in a reinforcement learning agent's perception. This work introduces a non-differentiable module into deep reinforcement learning to localize the state space for agents, by which the impact of noise is greatly reduced and the learned policy can be explained implicitly. The proposed model leverages a hard attention module for localization, while an additional reinforcement learning process is built to update the localization module. We analyze the relationship between the non-differentiable module and the agent, regarding the whole training process as a hierarchical multi-agent reinforcement learning model and ensuring the convergence of policies through centralized evaluation. Moreover, to couple the localization policy with the behavior policy, we modify the evaluation process, achieving more direct coordination between them. The proposed method enables the agent to localize its observation or state in an explainable way, learning more advanced and robust policies by ignoring irrelevant data or changes in noisy environments. That is, it enhances reinforcement learning's ability to reject disturbances. Several experiments in simulation environments and on a robot arm suggest that our localization module can be embedded into existing reinforcement learning models to enhance them in many respects.
AB - Learning robust policies is a central goal of the visual reinforcement learning community. In practical applications, noise in the environment leads to larger variance in a reinforcement learning agent's perception. This work introduces a non-differentiable module into deep reinforcement learning to localize the state space for agents, by which the impact of noise is greatly reduced and the learned policy can be explained implicitly. The proposed model leverages a hard attention module for localization, while an additional reinforcement learning process is built to update the localization module. We analyze the relationship between the non-differentiable module and the agent, regarding the whole training process as a hierarchical multi-agent reinforcement learning model and ensuring the convergence of policies through centralized evaluation. Moreover, to couple the localization policy with the behavior policy, we modify the evaluation process, achieving more direct coordination between them. The proposed method enables the agent to localize its observation or state in an explainable way, learning more advanced and robust policies by ignoring irrelevant data or changes in noisy environments. That is, it enhances reinforcement learning's ability to reject disturbances. Several experiments in simulation environments and on a robot arm suggest that our localization module can be embedded into existing reinforcement learning models to enhance them in many respects.
KW - Deep reinforcement learning
KW - Explainable reinforcement learning
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=105005276343&partnerID=8YFLogxK
U2 - 10.1016/j.engappai.2025.110998
DO - 10.1016/j.engappai.2025.110998
M3 - Article
AN - SCOPUS:105005276343
SN - 0952-1976
VL - 156
JO - Engineering Applications of Artificial Intelligence
JF - Engineering Applications of Artificial Intelligence
M1 - 110998
ER -