Monte Carlo Tree Search in Lines of Action

Mark H. M. Winands; Yngvi Björnsson; Jahn-Takeshi Saito

doi:10.1109/TCIAIG.2010.2061050

Mark H. M. Winands^*, Yngvi Björnsson, Jahn-Takeshi Saito

^*Corresponding author for this work

Networks and Strategic Optimization

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

The success of Monte Carlo tree search (MCTS) in many games where alpha-beta-based search has failed naturally raises the question of whether Monte Carlo simulations will eventually also outperform traditional game-tree search in game domains where alpha-beta-based search is now successful. The forte of alpha-beta-based search is highly tactical deterministic game domains with a small to moderate branching factor, where efficient yet knowledge-rich evaluation functions can be applied effectively. In this paper, we describe an MCTS-based program for playing the game Lines of Action (LOA), a highly tactical slow-progression game exhibiting many of the properties that are difficult for MCTS. The program uses an improved MCTS variant that allows it both to prove the game-theoretical value of nodes in a search tree and to focus its simulations better using domain knowledge. This results in simulations that are superior in both handling tactics and ensuring game progression. Using the improved MCTS variant, our program is able to outperform even the world's strongest alpha-beta-based LOA program. This is an important milestone for MCTS because the traditional game-tree search approach has been considered better suited for playing LOA.

Original language	English
Pages (from-to)	239-250
Journal	IEEE Transactions on Computational Intelligence and AI in Games
Volume	2
Issue number	4
DOIs	https://doi.org/10.1109/TCIAIG.2010.2061050
Publication status	Published - Dec 2010

Keywords

Game-tree solver
Lines of Action (LOA)
Monte Carlo tree search (MCTS)

Cite this

@article{d954d42849914221a89e46583ff69757,

title = "Monte Carlo Tree Search in Lines of Action",

abstract = "The success of Monte Carlo tree search (MCTS) in many games where alpha-beta-based search has failed naturally raises the question of whether Monte Carlo simulations will eventually also outperform traditional game-tree search in game domains where alpha-beta-based search is now successful. The forte of alpha-beta-based search is highly tactical deterministic game domains with a small to moderate branching factor, where efficient yet knowledge-rich evaluation functions can be applied effectively. In this paper, we describe an MCTS-based program for playing the game Lines of Action (LOA), a highly tactical slow-progression game exhibiting many of the properties that are difficult for MCTS. The program uses an improved MCTS variant that allows it both to prove the game-theoretical value of nodes in a search tree and to focus its simulations better using domain knowledge. This results in simulations that are superior in both handling tactics and ensuring game progression. Using the improved MCTS variant, our program is able to outperform even the world's strongest alpha-beta-based LOA program. This is an important milestone for MCTS because the traditional game-tree search approach has been considered better suited for playing LOA.",

keywords = "Game-tree solver, Lines of Action (LOA), Monte Carlo tree search (MCTS)",

author = "Winands, {Mark H. M.} and Yngvi Bj{\"o}rnsson and Jahn-Takeshi Saito",

year = "2010",

month = dec,

doi = "10.1109/TCIAIG.2010.2061050",

language = "English",

volume = "2",

pages = "239--250",

journal = "IEEE Transactions on Computational Intelligence and AI in Games",

issn = "1943-068X",

publisher = "IEEE",

number = "4",

}

TY - JOUR

T1 - Monte Carlo Tree Search in Lines of Action

AU - Winands, Mark H. M.

AU - Björnsson, Yngvi

AU - Saito, Jahn-Takeshi

PY - 2010/12

Y1 - 2010/12

N2 - The success of Monte Carlo tree search (MCTS) in many games where alpha-beta-based search has failed naturally raises the question of whether Monte Carlo simulations will eventually also outperform traditional game-tree search in game domains where alpha-beta-based search is now successful. The forte of alpha-beta-based search is highly tactical deterministic game domains with a small to moderate branching factor, where efficient yet knowledge-rich evaluation functions can be applied effectively. In this paper, we describe an MCTS-based program for playing the game Lines of Action (LOA), a highly tactical slow-progression game exhibiting many of the properties that are difficult for MCTS. The program uses an improved MCTS variant that allows it both to prove the game-theoretical value of nodes in a search tree and to focus its simulations better using domain knowledge. This results in simulations that are superior in both handling tactics and ensuring game progression. Using the improved MCTS variant, our program is able to outperform even the world's strongest alpha-beta-based LOA program. This is an important milestone for MCTS because the traditional game-tree search approach has been considered better suited for playing LOA.

KW - Game-tree solver

KW - Lines of Action (LOA)

KW - Monte Carlo tree search (MCTS)

U2 - 10.1109/TCIAIG.2010.2061050

DO - 10.1109/TCIAIG.2010.2061050

M3 - Article

SN - 1943-068X

VL - 2

SP - 239

EP - 250

JO - IEEE Transactions on Computational Intelligence and AI in Games

JF - IEEE Transactions on Computational Intelligence and AI in Games

IS - 4

ER -