Monte-Carlo Tree Search in Board Games

Mark H. M. Winands

doi:10.1007/978-981-4560-50-4_27

Monte-Carlo Tree Search in Board Games

Mark H. M. Winands

Networks and Strategic Optimization

Research output: Chapter in Book/Report/Conference proceeding › Chapter › Academic

181 Downloads (Pure)

Abstract

Monte-carlo tree search (mcts) is a best-first search method guided by the results of monte-carlo simulations. It is based on randomized exploration of the search space. Using the results of previous explorations, the method gradually builds up a game tree in memory and successively becomes better at accurately estimating the values of the most promising moves. Mcts has substantially advanced the state of the art in board games such as go, amazons, hex, chinese checkers, kriegspiel, and lines of action.this chapter gives an overview of popular and effective enhancements for board game playing mcts agents. First, it starts by describing the structure of mcts and giving pseudocode. It also addresses how to adjust mcts to prove the game-theoretic value of a board position. Next, popular enhancements such as rave, progressive bias, progressive widening, and prior knowledge, which improve the simulation in the tree part of mcts, are discussed in detail. Subsequently, enhancements such as mast, n-grams, and evaluation function-based strategies are explained for improving the simulation outside the tree. As modern computers have nowadays multiple cores, this chapter mentions techniques to parallelize mcts in a straightforward but effective way. Finally, approaches to deal with imperfect information and stochasticity in an mcts context are discussed as well.

Original language	English
Title of host publication	Handbook of Digital Games and Entertainment Technologies
Editors	Ryohei Nakatsu, Matthias Rauterberg, Paolo Ciancarini
Publisher	Springer
Pages	47-76
ISBN (Electronic)	978-981-4560-50-4
ISBN (Print)	978-981-4560-49-8
DOIs	https://doi.org/10.1007/978-981-4560-50-4_27
Publication status	Published - 2017

Access to Document

10.1007/978-981-4560-50-4_27

Full text Final published version, 642 KBLicence: Taverne

Cite this

@inbook{51a1aea9d74347b88f2d356075a7fe1c,

title = "Monte-Carlo Tree Search in Board Games",

abstract = "Monte-carlo tree search (mcts) is a best-first search method guided by the results of monte-carlo simulations. It is based on randomized exploration of the search space. Using the results of previous explorations, the method gradually builds up a game tree in memory and successively becomes better at accurately estimating the values of the most promising moves. Mcts has substantially advanced the state of the art in board games such as go, amazons, hex, chinese checkers, kriegspiel, and lines of action.this chapter gives an overview of popular and effective enhancements for board game playing mcts agents. First, it starts by describing the structure of mcts and giving pseudocode. It also addresses how to adjust mcts to prove the game-theoretic value of a board position. Next, popular enhancements such as rave, progressive bias, progressive widening, and prior knowledge, which improve the simulation in the tree part of mcts, are discussed in detail. Subsequently, enhancements such as mast, n-grams, and evaluation function-based strategies are explained for improving the simulation outside the tree. As modern computers have nowadays multiple cores, this chapter mentions techniques to parallelize mcts in a straightforward but effective way. Finally, approaches to deal with imperfect information and stochasticity in an mcts context are discussed as well.",

author = "Winands, {Mark H. M.}",

year = "2017",

doi = "10.1007/978-981-4560-50-4_27",

language = "English",

isbn = "978-981-4560-49-8",

pages = "47--76",

editor = "Ryohei Nakatsu and Matthias Rauterberg and Paolo Ciancarini",

booktitle = "Handbook of Digital Games and Entertainment Technologies",

publisher = "Springer",

address = "United States",

}

TY - CHAP

T1 - Monte-Carlo Tree Search in Board Games

AU - Winands, Mark H. M.

PY - 2017

Y1 - 2017

N2 - Monte-carlo tree search (mcts) is a best-first search method guided by the results of monte-carlo simulations. It is based on randomized exploration of the search space. Using the results of previous explorations, the method gradually builds up a game tree in memory and successively becomes better at accurately estimating the values of the most promising moves. Mcts has substantially advanced the state of the art in board games such as go, amazons, hex, chinese checkers, kriegspiel, and lines of action.this chapter gives an overview of popular and effective enhancements for board game playing mcts agents. First, it starts by describing the structure of mcts and giving pseudocode. It also addresses how to adjust mcts to prove the game-theoretic value of a board position. Next, popular enhancements such as rave, progressive bias, progressive widening, and prior knowledge, which improve the simulation in the tree part of mcts, are discussed in detail. Subsequently, enhancements such as mast, n-grams, and evaluation function-based strategies are explained for improving the simulation outside the tree. As modern computers have nowadays multiple cores, this chapter mentions techniques to parallelize mcts in a straightforward but effective way. Finally, approaches to deal with imperfect information and stochasticity in an mcts context are discussed as well.

AB - Monte-carlo tree search (mcts) is a best-first search method guided by the results of monte-carlo simulations. It is based on randomized exploration of the search space. Using the results of previous explorations, the method gradually builds up a game tree in memory and successively becomes better at accurately estimating the values of the most promising moves. Mcts has substantially advanced the state of the art in board games such as go, amazons, hex, chinese checkers, kriegspiel, and lines of action.this chapter gives an overview of popular and effective enhancements for board game playing mcts agents. First, it starts by describing the structure of mcts and giving pseudocode. It also addresses how to adjust mcts to prove the game-theoretic value of a board position. Next, popular enhancements such as rave, progressive bias, progressive widening, and prior knowledge, which improve the simulation in the tree part of mcts, are discussed in detail. Subsequently, enhancements such as mast, n-grams, and evaluation function-based strategies are explained for improving the simulation outside the tree. As modern computers have nowadays multiple cores, this chapter mentions techniques to parallelize mcts in a straightforward but effective way. Finally, approaches to deal with imperfect information and stochasticity in an mcts context are discussed as well.

U2 - 10.1007/978-981-4560-50-4_27

DO - 10.1007/978-981-4560-50-4_27

M3 - Chapter

SN - 978-981-4560-49-8

SP - 47

EP - 76

BT - Handbook of Digital Games and Entertainment Technologies

A2 - Nakatsu, Ryohei

A2 - Rauterberg, Matthias

A2 - Ciancarini, Paolo

PB - Springer

ER -