Toward Policy Explanations for Multi-Agent Reinforcement Learning

Kayla Boggess, Sarit Kraus, Lu Feng

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Scopus citations

Abstract

Advances in multi-agent reinforcement learning (MARL) enable sequential decision making for a range of exciting multi-agent applications such as cooperative AI and autonomous driving. Explaining agent decisions is crucial for improving system transparency, increasing user satisfaction, and facilitating human-agent collaboration. However, existing works on explainable reinforcement learning mostly focus on the single-agent setting and are not suitable for addressing challenges posed by multi-agent environments. We present novel methods to generate two types of policy explanations for MARL: (i) policy summarization about the agent cooperation and task sequence, and (ii) language explanations to answer queries about agent behavior. Experimental results on three MARL domains demonstrate the scalability of our methods. A user study shows that the generated explanations significantly improve user performance and increase subjective ratings on metrics such as user satisfaction.

Original languageEnglish
Title of host publicationProceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022
EditorsLuc De Raedt, Luc De Raedt
PublisherInternational Joint Conferences on Artificial Intelligence
Pages109-115
Number of pages7
ISBN (Electronic)9781956792003
DOIs
StatePublished - 2022
Event31st International Joint Conference on Artificial Intelligence, IJCAI 2022 - Vienna, Austria
Duration: 23 Jul 202229 Jul 2022

Publication series

NameIJCAI International Joint Conference on Artificial Intelligence
ISSN (Print)1045-0823

Conference

Conference31st International Joint Conference on Artificial Intelligence, IJCAI 2022
Country/TerritoryAustria
CityVienna
Period23/07/2229/07/22

Bibliographical note

Publisher Copyright:
© 2022 International Joint Conferences on Artificial Intelligence. All rights reserved.

Funding

This work was supported in part by U.S. National Science Foundation grant CCF-1942836, U.S. Office of Naval Research grant N00014-18-1-2829, Israel Science Foundation grant 1958/20, EU Project TAILOR grant 992215, and by the Data Science Institute at Bar-Ilan University. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the grant sponsors.

FundersFunder number
Data Science Institute at Bar-Ilan University
National Science FoundationCCF-1942836
Office of Naval ResearchN00014-18-1-2829
Emory University992215
Israel Science Foundation1958/20

    Fingerprint

    Dive into the research topics of 'Toward Policy Explanations for Multi-Agent Reinforcement Learning'. Together they form a unique fingerprint.

    Cite this