TY - GEN
T1 - To teach or not to teach? Decision making under uncertainty in ad hoc teams
AU - Stone, Peter
AU - Kraus, Sarit
PY - 2010
Y1 - 2010
N2 - In typical multiagcnt teamwork settings, the teammates are either programmed together, or are otherwise provided with standard communication languages and coordination protocols. In contrast, this paper presents an ad hoc team setting in which the teammates are not pre-coordinated, yet still must work together in order to achieve their common goal(s). We represent a specific instance of this scenario, in which a teammate has limited action capabilities and a fixed and known behavior, as a finite-horizon, cooperative k-armed bandit. In addition to motivating and studying this novel ad hoc teamwork scenario, the paper contributes to the κ-armed bandits literature by characterizing the conditions under which certain actions are potentially optimal, and by presenting a polynomial dynamic programming algorithm that solves for the optimal action when the arm payoffs come from a discrete distribution.
AB - In typical multiagcnt teamwork settings, the teammates are either programmed together, or are otherwise provided with standard communication languages and coordination protocols. In contrast, this paper presents an ad hoc team setting in which the teammates are not pre-coordinated, yet still must work together in order to achieve their common goal(s). We represent a specific instance of this scenario, in which a teammate has limited action capabilities and a fixed and known behavior, as a finite-horizon, cooperative k-armed bandit. In addition to motivating and studying this novel ad hoc teamwork scenario, the paper contributes to the κ-armed bandits literature by characterizing the conditions under which certain actions are potentially optimal, and by presenting a polynomial dynamic programming algorithm that solves for the optimal action when the arm payoffs come from a discrete distribution.
KW - Autonomous agents
KW - Coordination
KW - Multiagent systems
UR - http://www.scopus.com/inward/record.url?scp=84875873713&partnerID=8YFLogxK
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:84875873713
SN - 9781617387715
T3 - Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
SP - 117
EP - 124
BT - 9th International Joint Conference on Autonomous Agents and Multiagent Systems 2010, AAMAS 2010
PB - International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
T2 - 9th International Joint Conference on Autonomous Agents and Multiagent Systems 2010, AAMAS 2010
Y2 - 10 May 2010
ER -