Monte Carlo Tree Search in Simultaneous Move Games with Applications to Goofspiel

Marc Lanctot, Viliam Lisý, Mark H. M. Winands

Research output: Chapter in Book/Report/Conference proceedingChapterAcademic

Abstract

Monte carlo tree search (mcts) has become a widely popular sampled-based search algorithm for two-player games with perfect information. When actions are chosen simultaneously, players may need to mix between their strategies. In this paper, we discuss the adaptation of mcts to simultaneous move games. We introduce a new algorithm, online outcome sampling (oos), that approaches a nash equilibrium strategy over time. We compare both head-to-head performance and exploitability of several mcts variants in goofspiel. We show that regret matching and oos perform best and that all variants produce less exploitable strategies than uct.
Original languageEnglish
Title of host publicationComputer Games
Subtitle of host publicationWorkshop on Computer Games, CGW 2013, Held in Conjunction with the 23rd International Conference on Artificial Intelligence, IJCAI 2013, Beijing, China, August 3, 2013, Revised Selected Papers
PublisherSpringer
Pages28-43
Number of pages16
ISBN (Print)978-3-319-05428-5
DOIs
Publication statusPublished - 2014

Publication series

SeriesCommunications in Computer and Information Science
Volume408

Cite this