Inducing Coordination in Multi-Agent Repeated Game through Hierarchical Gifting Policies

Mingze Lv; Jiaqi Liu; Bin Guo; Yasan Ding; Yun Zhang; Zhiwen Yu

doi:10.1109/MASS58611.2023.00041

Inducing Coordination in Multi-Agent Repeated Game through Hierarchical Gifting Policies

Mingze Lv, Jiaqi Liu, Bin Guo, Yasan Ding, Yun Zhang, Zhiwen Yu

计算机学院

Northwestern Polytechnical University Xian

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

Coordination, i.e., multiple autonomous agents in a system to achieve a common goal, is critical for distributed systems since it can increase the overall reward among all agents. However, The dynamic environment and selfish agents pose challenges to learning coordination behavior from historical interaction data in a long-term interaction environment. Previous works mostly focus on one-shot or short-term distributed agent interaction environments, which often leads to selfish or lazy behavior in long-term interaction environments, i.e., prioritizing individual optimal strategies over cooperative strategies. This behavior is mainly due to the lack of historical memory or incomplete use of historical interaction data to guide the current interaction strategy. In this paper, we propose a hierarchical peer-rewarding mechanism, hierarchical gifting, that allows each agent to dynamically assign some of their rewards to other agents based on historical interaction data and guide the agents towards more coordinated behavior while ensuring that agents remain selfish and decentralized. Specifically, we first propose an auxiliary opponent modeling task so that agents can infer opponents' types through historical interaction trajectories. In addition, we design a hierarchical gifting strategy that dynamically changes during execution based on known opponents' types. We employ a theoretical framework that captures the benefit of hierarchical gifting in converging to the coordinated behavior by characterizing the equilibria's basins of attraction in a dynamical system. With hierarchical gifting, we demonstrate increased coordinated behavior of different risk, general-sum coordination games to the prosocial equilibrium both via numerical analysis and experiments.

源语言	英语
主期刊名	Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023
出版商	Institute of Electrical and Electronics Engineers Inc.
页	279-287
页数	9
ISBN（电子版）	9798350324334
DOI	https://doi.org/10.1109/MASS58611.2023.00041
出版状态	已出版 - 2023
活动	20th IEEE International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023 - Toronto, 加拿大期限: 25 9月 2023 → 27 9月 2023

出版系列

姓名	Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023

会议

会议	20th IEEE International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023
国家/地区	加拿大
市	Toronto
时期	25/09/23 → 27/09/23

访问文件

10.1109/MASS58611.2023.00041

其它文件与链接

链接到 Scopus 的出版物

引用此

Lv, M., Liu, J., Guo, B., Ding, Y., Zhang, Y., & Yu, Z. (2023). Inducing Coordination in Multi-Agent Repeated Game through Hierarchical Gifting Policies. 在 Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023 (页码 279-287). (Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/MASS58611.2023.00041

Lv, Mingze ; Liu, Jiaqi ; Guo, Bin 等. / Inducing Coordination in Multi-Agent Repeated Game through Hierarchical Gifting Policies. Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023. Institute of Electrical and Electronics Engineers Inc., 2023. 页码 279-287 (Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023).

@inproceedings{24b225856d80410fb449bed9bca5699f,

title = "Inducing Coordination in Multi-Agent Repeated Game through Hierarchical Gifting Policies",

abstract = "Coordination, i.e., multiple autonomous agents in a system to achieve a common goal, is critical for distributed systems since it can increase the overall reward among all agents. However, The dynamic environment and selfish agents pose challenges to learning coordination behavior from historical interaction data in a long-term interaction environment. Previous works mostly focus on one-shot or short-term distributed agent interaction environments, which often leads to selfish or lazy behavior in long-term interaction environments, i.e., prioritizing individual optimal strategies over cooperative strategies. This behavior is mainly due to the lack of historical memory or incomplete use of historical interaction data to guide the current interaction strategy. In this paper, we propose a hierarchical peer-rewarding mechanism, hierarchical gifting, that allows each agent to dynamically assign some of their rewards to other agents based on historical interaction data and guide the agents towards more coordinated behavior while ensuring that agents remain selfish and decentralized. Specifically, we first propose an auxiliary opponent modeling task so that agents can infer opponents' types through historical interaction trajectories. In addition, we design a hierarchical gifting strategy that dynamically changes during execution based on known opponents' types. We employ a theoretical framework that captures the benefit of hierarchical gifting in converging to the coordinated behavior by characterizing the equilibria's basins of attraction in a dynamical system. With hierarchical gifting, we demonstrate increased coordinated behavior of different risk, general-sum coordination games to the prosocial equilibrium both via numerical analysis and experiments.",

keywords = "Coordination, Game Theory, Multi-agent Reinforcement Learning, Multi-agent Systems",

author = "Mingze Lv and Jiaqi Liu and Bin Guo and Yasan Ding and Yun Zhang and Zhiwen Yu",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 20th IEEE International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023 ; Conference date: 25-09-2023 Through 27-09-2023",

year = "2023",

doi = "10.1109/MASS58611.2023.00041",

language = "英语",

series = "Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "279--287",

booktitle = "Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023",

}

Lv, M, Liu, J, Guo, B, Ding, Y, Zhang, Y & Yu, Z 2023, Inducing Coordination in Multi-Agent Repeated Game through Hierarchical Gifting Policies. 在 Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023. Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023, Institute of Electrical and Electronics Engineers Inc., 页码 279-287, 20th IEEE International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023, Toronto, 加拿大, 25/09/23. https://doi.org/10.1109/MASS58611.2023.00041

Inducing Coordination in Multi-Agent Repeated Game through Hierarchical Gifting Policies. / Lv, Mingze; Liu, Jiaqi; Guo, Bin 等.
Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023. Institute of Electrical and Electronics Engineers Inc., 2023. 页码 279-287 (Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Inducing Coordination in Multi-Agent Repeated Game through Hierarchical Gifting Policies

AU - Lv, Mingze

AU - Liu, Jiaqi

AU - Guo, Bin

AU - Ding, Yasan

AU - Zhang, Yun

AU - Yu, Zhiwen

PY - 2023

Y1 - 2023

N2 - Coordination, i.e., multiple autonomous agents in a system to achieve a common goal, is critical for distributed systems since it can increase the overall reward among all agents. However, The dynamic environment and selfish agents pose challenges to learning coordination behavior from historical interaction data in a long-term interaction environment. Previous works mostly focus on one-shot or short-term distributed agent interaction environments, which often leads to selfish or lazy behavior in long-term interaction environments, i.e., prioritizing individual optimal strategies over cooperative strategies. This behavior is mainly due to the lack of historical memory or incomplete use of historical interaction data to guide the current interaction strategy. In this paper, we propose a hierarchical peer-rewarding mechanism, hierarchical gifting, that allows each agent to dynamically assign some of their rewards to other agents based on historical interaction data and guide the agents towards more coordinated behavior while ensuring that agents remain selfish and decentralized. Specifically, we first propose an auxiliary opponent modeling task so that agents can infer opponents' types through historical interaction trajectories. In addition, we design a hierarchical gifting strategy that dynamically changes during execution based on known opponents' types. We employ a theoretical framework that captures the benefit of hierarchical gifting in converging to the coordinated behavior by characterizing the equilibria's basins of attraction in a dynamical system. With hierarchical gifting, we demonstrate increased coordinated behavior of different risk, general-sum coordination games to the prosocial equilibrium both via numerical analysis and experiments.

AB - Coordination, i.e., multiple autonomous agents in a system to achieve a common goal, is critical for distributed systems since it can increase the overall reward among all agents. However, The dynamic environment and selfish agents pose challenges to learning coordination behavior from historical interaction data in a long-term interaction environment. Previous works mostly focus on one-shot or short-term distributed agent interaction environments, which often leads to selfish or lazy behavior in long-term interaction environments, i.e., prioritizing individual optimal strategies over cooperative strategies. This behavior is mainly due to the lack of historical memory or incomplete use of historical interaction data to guide the current interaction strategy. In this paper, we propose a hierarchical peer-rewarding mechanism, hierarchical gifting, that allows each agent to dynamically assign some of their rewards to other agents based on historical interaction data and guide the agents towards more coordinated behavior while ensuring that agents remain selfish and decentralized. Specifically, we first propose an auxiliary opponent modeling task so that agents can infer opponents' types through historical interaction trajectories. In addition, we design a hierarchical gifting strategy that dynamically changes during execution based on known opponents' types. We employ a theoretical framework that captures the benefit of hierarchical gifting in converging to the coordinated behavior by characterizing the equilibria's basins of attraction in a dynamical system. With hierarchical gifting, we demonstrate increased coordinated behavior of different risk, general-sum coordination games to the prosocial equilibrium both via numerical analysis and experiments.

KW - Coordination

KW - Game Theory

KW - Multi-agent Reinforcement Learning

KW - Multi-agent Systems

UR - http://www.scopus.com/inward/record.url?scp=85178514695&partnerID=8YFLogxK

U2 - 10.1109/MASS58611.2023.00041

DO - 10.1109/MASS58611.2023.00041

M3 - 会议稿件

AN - SCOPUS:85178514695

T3 - Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023

SP - 279

EP - 287

BT - Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 20th IEEE International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023

Y2 - 25 September 2023 through 27 September 2023

ER -

Lv M, Liu J, Guo B, Ding Y, Zhang Y, Yu Z. Inducing Coordination in Multi-Agent Repeated Game through Hierarchical Gifting Policies. 在 Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023. Institute of Electrical and Electronics Engineers Inc. 2023. 页码 279-287. (Proceedings - 2023 IEEE 20th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023). doi: 10.1109/MASS58611.2023.00041

Inducing Coordination in Multi-Agent Repeated Game through Hierarchical Gifting Policies

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此