TY - JOUR
T1 - TMAC
T2 - a Transformer-based partially observable multi-agent communication method
AU - Li, Xuesi
AU - Xue, Shuai
AU - He, Ziming
AU - Shi, Haobin
N1 - Publisher Copyright:
Copyright 2025 Li et al.
PY - 2025
Y1 - 2025
N2 - Effective communication plays a crucial role in coordinating the actions of multiple agents. Within the realm of multi-agent reinforcement learning, agents have the ability to share information with one another through communication channels, leading to enhanced learning outcomes and successful goal attainment. Agents are limited by their observations and communication ranges due to increasingly complex location arrangements, making multi-agent collaboration based on communication increasingly difficult. In this article, for multi-agent communication in some partially observable scenarios, we propose a Transformer-based Partially Observable MultiAgent Communication algorithm (TMAC), which improves agents extracting features and generating output messages. Meanwhile, a self-message fusing module is proposed to obtain features from multiple sources. Therefore, agents can achieve better collaboration through communication. At the same time, we performed experimental verification in the surviving and the StarCraft Multi-Agent Challenge (SMAC) environments where agents had limited local observation and could only communicate with neighboring agents. In two test environments, our method achieves an improvement in performance 6% and 10% over the baseline algorithm, respectively.
AB - Effective communication plays a crucial role in coordinating the actions of multiple agents. Within the realm of multi-agent reinforcement learning, agents have the ability to share information with one another through communication channels, leading to enhanced learning outcomes and successful goal attainment. Agents are limited by their observations and communication ranges due to increasingly complex location arrangements, making multi-agent collaboration based on communication increasingly difficult. In this article, for multi-agent communication in some partially observable scenarios, we propose a Transformer-based Partially Observable MultiAgent Communication algorithm (TMAC), which improves agents extracting features and generating output messages. Meanwhile, a self-message fusing module is proposed to obtain features from multiple sources. Therefore, agents can achieve better collaboration through communication. At the same time, we performed experimental verification in the surviving and the StarCraft Multi-Agent Challenge (SMAC) environments where agents had limited local observation and could only communicate with neighboring agents. In two test environments, our method achieves an improvement in performance 6% and 10% over the baseline algorithm, respectively.
KW - Attention mechanism
KW - Communication
KW - Multi-agent reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=105003043945&partnerID=8YFLogxK
U2 - 10.7717/peerj-cs.2758
DO - 10.7717/peerj-cs.2758
M3 - 文章
AN - SCOPUS:105003043945
SN - 2376-5992
VL - 11
SP - 1
EP - 20
JO - PeerJ Computer Science
JF - PeerJ Computer Science
M1 - e2758
ER -