"Superstition" in the Network: Deep Reinforcement Learning Plays Deceptive Games

Matthew Stephenson, Philip Bontrager^*, Ahmed Khalifa, Damien Anderson, Christoph Salge, Julian Togelius

^*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

Abstract

Deep reinforcement learning has learned to play many games well, but failed on others. To better characterize the modes and reasons of failure of deep reinforcement learners, we test the widely used Asynchronous Actor-Critic (A2C) algorithm on four deceptive games, which are specially designed to provide challenges to game-playing agents. These games are implemented in the General Video Game AI framework, which allows us to compare the behavior of reinforcement learning-based agents with planning agents based on tree search. We find that several of these games reliably deceive deep reinforcement learners, and that the resulting behavior highlights the shortcomings of the learning algorithm. The particular ways in which agents fail differ from how planning-based agents fail, further illuminating the character of these algorithms. We propose an initial typology of deceptions which could help us better understand pitfalls and failure modes of (deep) reinforcement learning.

Original language	English
Title of host publication	Fifteenth Annual AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment
Editors	Gillian Smith, Levi Lelis
Pages	10-16
Volume	15
Edition	1
Publication status	Published - Oct 2019
Event	Fifteenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment - Georgia Institute of Technology in Atlanta, Georgia, United States Duration: 8 Oct 2019 → 12 Oct 2019 Conference number: 15 https://ojs.aaai.org/index.php/AIIDE/issue/view/247

Conference

Conference	Fifteenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment
Abbreviated title	AIIDE-19
Country/Territory	United States
City	Atlanta, Georgia
Period	8/10/19 → 12/10/19
Internet address	https://ojs.aaai.org/index.php/AIIDE/issue/view/247

Access to Document

https://ojs.aaai.org/index.php/AIIDE/article/view/5218/5074

Cite this

Stephenson, M., Bontrager, P., Khalifa, A., Anderson, D., Salge, C., & Togelius, J. (2019). "Superstition" in the Network: Deep Reinforcement Learning Plays Deceptive Games. In G. Smith, & L. Lelis (Eds.), Fifteenth Annual AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (1 ed., Vol. 15, pp. 10-16) https://ojs.aaai.org/index.php/AIIDE/article/view/5218/5074

@inproceedings{bb1972230df44f66bcfc19f1f3b306c4,

title = "{"}Superstition{"} in the Network: Deep Reinforcement Learning Plays Deceptive Games",

abstract = "Deep reinforcement learning has learned to play many games well, but failed on others. To better characterize the modes and reasons of failure of deep reinforcement learners, we test the widely used Asynchronous Actor-Critic (A2C) algorithm on four deceptive games, which are specially designed to provide challenges to game-playing agents. These games are implemented in the General Video Game AI framework, which allows us to compare the behavior of reinforcement learning-based agents with planning agents based on tree search. We find that several of these games reliably deceive deep reinforcement learners, and that the resulting behavior highlights the shortcomings of the learning algorithm. The particular ways in which agents fail differ from how planning-based agents fail, further illuminating the character of these algorithms. We propose an initial typology of deceptions which could help us better understand pitfalls and failure modes of (deep) reinforcement learning.",

author = "Matthew Stephenson and Philip Bontrager and Ahmed Khalifa and Damien Anderson and Christoph Salge and Julian Togelius",

note = "Publisher Copyright: Copyright {\textcopyright} 2019, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.; Fifteenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, AIIDE-19 ; Conference date: 08-10-2019 Through 12-10-2019",

year = "2019",

month = oct,

language = "English",

volume = "15",

pages = "10--16",

editor = "Gillian Smith and Levi Lelis",

booktitle = "Fifteenth Annual AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment",

edition = "1",

url = "https://ojs.aaai.org/index.php/AIIDE/issue/view/247",

}

Stephenson, M, Bontrager, P, Khalifa, A, Anderson, D, Salge, C & Togelius, J 2019, "Superstition" in the Network: Deep Reinforcement Learning Plays Deceptive Games. in G Smith & L Lelis (eds), Fifteenth Annual AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment. 1 edn, vol. 15, pp. 10-16, Fifteenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, Georgia, United States, 8/10/19. <https://ojs.aaai.org/index.php/AIIDE/article/view/5218/5074>

"Superstition" in the Network: Deep Reinforcement Learning Plays Deceptive Games. / Stephenson, Matthew; Bontrager, Philip; Khalifa, Ahmed et al.
Fifteenth Annual AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment. ed. / Gillian Smith; Levi Lelis. Vol. 15 1. ed. 2019. p. 10-16.


TY - GEN

T1 - "Superstition" in the Network

T2 - Fifteenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment

AU - Stephenson, Matthew

AU - Bontrager, Philip

AU - Khalifa, Ahmed

AU - Anderson, Damien

AU - Salge, Christoph

AU - Togelius, Julian

N1 - Conference code: 15

PY - 2019/10

Y1 - 2019/10

N2 - Deep reinforcement learning has learned to play many games well, but failed on others. To better characterize the modes and reasons of failure of deep reinforcement learners, we test the widely used Asynchronous Actor-Critic (A2C) algorithm on four deceptive games, which are specially designed to provide challenges to game-playing agents. These games are implemented in the General Video Game AI framework, which allows us to compare the behavior of reinforcement learning-based agents with planning agents based on tree search. We find that several of these games reliably deceive deep reinforcement learners, and that the resulting behavior highlights the shortcomings of the learning algorithm. The particular ways in which agents fail differ from how planning-based agents fail, further illuminating the character of these algorithms. We propose an initial typology of deceptions which could help us better understand pitfalls and failure modes of (deep) reinforcement learning.

AB - Deep reinforcement learning has learned to play many games well, but failed on others. To better characterize the modes and reasons of failure of deep reinforcement learners, we test the widely used Asynchronous Actor-Critic (A2C) algorithm on four deceptive games, which are specially designed to provide challenges to game-playing agents. These games are implemented in the General Video Game AI framework, which allows us to compare the behavior of reinforcement learning-based agents with planning agents based on tree search. We find that several of these games reliably deceive deep reinforcement learners, and that the resulting behavior highlights the shortcomings of the learning algorithm. The particular ways in which agents fail differ from how planning-based agents fail, further illuminating the character of these algorithms. We propose an initial typology of deceptions which could help us better understand pitfalls and failure modes of (deep) reinforcement learning.

M3 - Conference article in proceeding

VL - 15

SP - 10

EP - 16

BT - Fifteenth Annual AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment

A2 - Smith, Gillian

A2 - Lelis, Levi

Y2 - 8 October 2019 through 12 October 2019

ER -