Abstract
Advances in multi-agent reinforcement learning (MARL) enable sequential decision making for a range of exciting multi-agent applications such as cooperative AI and autonomous driving. Explaining agent decisions is crucial for improving system transparency, increasing user satisfaction, and facilitating human-agent collaboration. However, existing works on explainable reinforcement learning mostly focus on the single-agent setting and are not suitable for addressing challenges posed by multi-agent environments. We present novel methods to generate two types of policy explanations for MARL: (i) policy summarization about the agent cooperation and task sequence, and (ii) language explanations to answer queries about agent behavior. Experimental results on three MARL domains demonstrate the scalability of our methods. A user study shows that the generated explanations significantly improve user performance and increase subjective ratings on metrics such as user satisfaction.
Original language | English |
---|---|
Title of host publication | Proceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022 |
Editors | Luc De Raedt, Luc De Raedt |
Publisher | International Joint Conferences on Artificial Intelligence |
Pages | 109-115 |
Number of pages | 7 |
ISBN (Electronic) | 9781956792003 |
DOIs | |
State | Published - 2022 |
Event | 31st International Joint Conference on Artificial Intelligence, IJCAI 2022 - Vienna, Austria Duration: 23 Jul 2022 → 29 Jul 2022 |
Publication series
Name | IJCAI International Joint Conference on Artificial Intelligence |
---|---|
ISSN (Print) | 1045-0823 |
Conference
Conference | 31st International Joint Conference on Artificial Intelligence, IJCAI 2022 |
---|---|
Country/Territory | Austria |
City | Vienna |
Period | 23/07/22 → 29/07/22 |
Bibliographical note
Publisher Copyright:© 2022 International Joint Conferences on Artificial Intelligence. All rights reserved.
Funding
This work was supported in part by U.S. National Science Foundation grant CCF-1942836, U.S. Office of Naval Research grant N00014-18-1-2829, Israel Science Foundation grant 1958/20, EU Project TAILOR grant 992215, and by the Data Science Institute at Bar-Ilan University. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the grant sponsors.
Funders | Funder number |
---|---|
Data Science Institute at Bar-Ilan University | |
National Science Foundation | CCF-1942836 |
Office of Naval Research | N00014-18-1-2829 |
Emory University | 992215 |
Israel Science Foundation | 1958/20 |