Coordination in multiagent reinforcement learning systems by virtual reinforcement signals

M. A.S. Kamal, Junichi Murata

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

Abstract

This paper presents a novel method for on-line coordination in multiagent reinforcement learning systems. In this method, a reinforcement-learning agent learns to select its actions by estimating system dynamics in terms of both the natural reward for task achievement and the virtual reward for cooperation. The virtual reward for cooperation is determined dynamically by a coordinating agent, which estimates it from the change in the degree of cooperation of all agents using a separate reinforcement-learning process. This technique provides adaptive coordination, requires less communication, and ensures that agents are cooperative. The validity of virtual rewards for convergence in learning is verified, and the proposed method is tested on two different simulated domains to illustrate its significance. The empirical performance of the coordinated system compared with the uncoordinated system illustrates its advantages for multiagent systems.
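The reward structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a tabular Q-learning agent, assumes the natural and virtual rewards are simply summed, and replaces the learned coordinating agent with a fixed rule proportional to the change in the degree of cooperation. The names `QAgent`, `virtual_reward_from_change`, and `scale` are hypothetical and do not come from the paper.

```python
import random
from collections import defaultdict


class QAgent:
    """Tabular Q-learning agent (an assumed learner; the abstract names none)."""

    def __init__(self, actions, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.alpha = alpha      # learning rate
        self.gamma = gamma      # discount factor
        self.epsilon = epsilon  # exploration probability
        self.q = defaultdict(float)  # maps (state, action) -> value estimate
        self.rng = random.Random(seed)

    def act(self, state):
        # Epsilon-greedy selection over the current value estimates.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, natural_reward, virtual_reward, next_state):
        # Assumption: the task reward and the cooperation reward are summed.
        reward = natural_reward + virtual_reward
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        td_error = reward + self.gamma * best_next - self.q[(state, action)]
        self.q[(state, action)] += self.alpha * td_error


def virtual_reward_from_change(prev_degree, new_degree, scale=1.0):
    """Hypothetical stand-in for the coordinating agent.

    The abstract says the coordinator learns this signal with a separate
    reinforcement-learning process; here a fixed proportional rule is used
    purely for illustration.
    """
    return scale * (new_degree - prev_degree)
```

In this sketch, each agent would call `update` once per step, passing the environment's task reward as `natural_reward` and the coordinator's output as `virtual_reward`, so that cooperation shapes the same value estimates that drive action selection.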

Original language: English
Pages (from-to): 181-191
Number of pages: 11
Journal: International Journal of Knowledge-Based and Intelligent Engineering Systems
Volume: 11
Issue number: 3
Publication status: Published - 2007

All Science Journal Classification (ASJC) codes

  • Software
  • Control and Systems Engineering
  • Artificial Intelligence
