TY - GEN
T1 - Targeted opponent modeling of memory-bounded agents
AU - Chakraborty, Doran
AU - Agmon, Noa
AU - Stone, Peter
PY - 2013
Y1 - 2013
N2 - In a repeated game, a memory-bounded agent selects its next action by basing its policy on a fixed window of past L plays. Traditionally, approaches that attempt to model memory-bounded agents, do so by modeling them based on the past L joint actions. Since the number of possible L sized joint actions grows exponentially with L, these approaches are restricted to modeling agents with a small L. This paper explores an alternative, more efficient mechanism for modeling memory-bounded agents based on high-level features derived from the past L plays. Called Targeted Opponent Modeler against Memory-Bounded Agents, or Tommba, our approach successfully models memory-bounded agents, in a sample efficient manner, given a priori knowledge of a feature set that includes the correct features. Tommba is fully implemented, with successful empirical results in a couple of challenging surveillance based tasks.
AB - In a repeated game, a memory-bounded agent selects its next action by basing its policy on a fixed window of past L plays. Traditionally, approaches that attempt to model memory-bounded agents, do so by modeling them based on the past L joint actions. Since the number of possible L sized joint actions grows exponentially with L, these approaches are restricted to modeling agents with a small L. This paper explores an alternative, more efficient mechanism for modeling memory-bounded agents based on high-level features derived from the past L plays. Called Targeted Opponent Modeler against Memory-Bounded Agents, or Tommba, our approach successfully models memory-bounded agents, in a sample efficient manner, given a priori knowledge of a feature set that includes the correct features. Tommba is fully implemented, with successful empirical results in a couple of challenging surveillance based tasks.
KW - Learning
KW - Memory-bounded agents
KW - Modeling
UR - http://www.scopus.com/inward/record.url?scp=84944701639&partnerID=8YFLogxK
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:84944701639
SN - 9781943580125
T3 - AAMAS 2013 Workshop on Adaptive and Learning Agents, ALA 2013
BT - AAMAS 2013 Workshop on Adaptive and Learning Agents, ALA 2013
PB - AAMAS
T2 - AAMAS 2013 Workshop on Adaptive and Learning Agents, ALA 2013
Y2 - 6 May 2013 through 7 May 2013
ER -