Towards adaptive multi-robot coordination based on resource expenditure velocity

Dan Erusalimchik, Gal A. Kaminka

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the research area of multi-robot systems, several researchers have reported on consistent success in using heuristic measures to improve loose coordination in teams, by minimizing coordination costs using various heuristic techniques. While these heuristic methods has proven successful in several domains, they have never been formalized, nor have they been put in context of existing work on adaptation and learning. As a result, the conditions for their use remain unknown. We posit that in fact all of these different heuristic methods are instances of reinforcement learning in a one-stage MDP game, with the specific heuristic functions used as rewards. We show that a specific reward function-which we call Effectiveness Index (EI)-is an appropriate reward function for learning to select between coordination methods. EI estimates the resource-spending velocity by a coordination algorithm, and allows minimization of this velocity using familiar reinforcement learning algorithms (in our case, Q-learning in one-stage MDP). The paper analytically and empirically argues for the use of EI by proving that under certain conditions, maximizing this reward leads to greater utility in the task. We report on initial experiments that demonstrate that EI indeed overcomes limitations in previous work, and outperforms it in different cases.

Original languageEnglish
Title of host publicationIntelligent Autonomous Systems 10, IAS 2008
Pages288-297
Number of pages10
DOIs
StatePublished - 2008
Event10th International Conference on Intelligent Autonomous Systems, IAS 2008 - Baden-Baden, Germany
Duration: 23 Jul 200825 Jul 2008

Publication series

NameIntelligent Autonomous Systems 10, IAS 2008

Conference

Conference10th International Conference on Intelligent Autonomous Systems, IAS 2008
Country/TerritoryGermany
CityBaden-Baden
Period23/07/0825/07/08

Keywords

  • Coordination
  • Multi-robot systems
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Towards adaptive multi-robot coordination based on resource expenditure velocity'. Together they form a unique fingerprint.

Cite this