Skip to main navigation Skip to search Skip to main content

Dueling Network Architecture for Multi-Agent Deep Deterministic Policy Gradient

  • Northwestern Polytechnical University Xian
  • Institute of Army Aviation

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Recently, reinforcement learning has made remarkable achievements in the fields of natural science, engineering, medicine and operational research. Reinforcement learning addresses sequence problems and considers long-term returns. This long-term view of reinforcement learning is critical to find the optimal solution of many problems. The existing multi-agent reinforcement learning methods usually update the value function of state action slowly, and the reward value of agents is low. This paper presents a Dueling Multi-Agent Deep Deterministic Policy Gradient (MADDPG) method based on MADDPG, which modifies critic's network structure. The main work is to add two subnetworks behind the critic network of the traditional MADDPG method. This method allows the critic network to update its parameters faster and receive higher rewards. Finally, in order to verify the validity of the network structure, the improved framework is compared with the traditional MADDPG, DQN and DDPG methods in the simulation environment.

Original languageEnglish
Title of host publication2021 IEEE 4th International Conference on Computer and Communication Engineering Technology, CCET 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages163-168
Number of pages6
ISBN (Electronic)9781665438902
DOIs
StatePublished - 13 Aug 2021
Event4th IEEE International Conference on Computer and Communication Engineering Technology, CCET 2021 - Virtual, Beijing, China
Duration: 13 Aug 202115 Aug 2021

Publication series

Name2021 IEEE 4th International Conference on Computer and Communication Engineering Technology, CCET 2021

Conference

Conference4th IEEE International Conference on Computer and Communication Engineering Technology, CCET 2021
Country/TerritoryChina
CityVirtual, Beijing
Period13/08/2115/08/21

Keywords

  • Deep learning
  • Dueling network
  • multi-agent system
  • neural networks
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Dueling Network Architecture for Multi-Agent Deep Deterministic Policy Gradient'. Together they form a unique fingerprint.

Cite this