Efficient agents for cliff-edge environments with a large set of decision options

Ron Katz, Sarit Kraus

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Scopus citations

Abstract

This paper proposes an efficient agent for competing in Cliff Edge (CE) environments, such as sealed-bid auctions, dynamic pricing and the ultimatum game. The agent competes in one-shot CE interactions repeatedly, each time against a different human opponent, and its performance is evaluated based on all the interactions in which it participates. The agent, which learns the general pattern of the population's behavior, does not apply any examples of previous interactions in the environment, neither of other competitors nor its own. We propose a generic approach which competes in different CE environments under the same configuration, with no knowledge about the specific rules of each environment. The underlying mechanism of the proposed agent is a new meta-algorithm, Deviated Virtual Learning (DVL), which extends existing methods to efficiently cope with environments comprising a large number of optional decisions at each decision point. Experiments comparing the performance of the proposed algorithm with algorithms taken from the literature, as well as another intuitive meta-algorithm, reveal a significant superiority of the former in average payoff and stability. In addition, the agent performed better than human competitors executing the same task.

Original languageEnglish
Title of host publicationProceedings of the Fifth International Joint Conference on Autonomous Agents and Multiagent Systems
Pages697-704
Number of pages8
DOIs
StatePublished - 2006
EventFifth International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS - Hakodate, Japan
Duration: 8 May 200612 May 2006

Publication series

NameProceedings of the International Conference on Autonomous Agents
Volume2006

Conference

ConferenceFifth International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
Country/TerritoryJapan
CityHakodate
Period8/05/0612/05/06

Keywords

  • Opponent modeling
  • Reinforcement learning
  • Sealed-bid auctions
  • Ultimatum game

Fingerprint

Dive into the research topics of 'Efficient agents for cliff-edge environments with a large set of decision options'. Together they form a unique fingerprint.

Cite this