Blind search for atari-like online planning revisited

Alexander Shleyfman, Alexander Tuisov, Carmel Domshlak

Research output: Contribution to journalConference articlepeer-review


Similarly to the classical AI planning, the Atari 2600 games supported in the Arcade Learning Environment all feature a fully observable (RAM) state and actions that have deterministic effect. At the same time, the problems in ALE are given only implicitly, via a simulator, a priori precluding exploiting most of the modern classical planning techniques. Despite that, Lipovetzky et al. [2015] recently showed how online planning for Atari-like problems can be effectively addressed using IW(i), a blind state-space search algorithm that employs a certain form of structural similarity-based pruning. We show that the effectiveness of the blind statespace search for Atari-like online planning can be pushed even further by focusing the search using both structural state similarity and the relative myopic value of the states. We also show that the planning effectiveness can be further improved by considering online planning for the Atari games as a multiarmed bandit style competition between the various actions available at the state planned for, and not purely as a classical planning style action sequence optimization problem.

Original languageEnglish
Pages (from-to)3251-3257
Number of pages7
JournalIJCAI International Joint Conference on Artificial Intelligence
StatePublished - 2016
Externally publishedYes
Event25th International Joint Conference on Artificial Intelligence, IJCAI 2016 - New York, United States
Duration: 9 Jul 201615 Jul 2016


Dive into the research topics of 'Blind search for atari-like online planning revisited'. Together they form a unique fingerprint.

Cite this