MCTS-Minimax Hybrids

Hendrik Baier; Mark H. M. Winands

doi:10.1109/TCIAIG.2014.2366555

MCTS-Minimax Hybrids

Hendrik Baier^*, Mark H. M. Winands

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Monte Carlo tree search (MCTS) is a sampling-based search algorithm that is state of the art in a variety of games. In many domains, its Monte Carlo rollouts of entire games give it a strategic advantage over traditional depth-limited minimax search with pruning. These rollouts can often detect long-term consequences of moves, freeing the programmer from having to capture these consequences in a heuristic evaluation function. But due to its highly selective tree, MCTS runs a higher risk than full-width minimax search of missing individual moves and falling into traps in tactical situations. This paper proposes MCTS-minimax hybrids that integrate shallow minimax searches into the MCTS framework. Three approaches are outlined, using minimax in the selection/expansion phase, the rollout phase, and the backpropagation phase of MCTS. Without assuming domain knowledge in the form of evaluation functions, these hybrid algorithms are a first step towards combining the strategic strength of MCTS and the tactical strength of minimax. We investigate their effectiveness in the test domains of Connect-4, Breakthrough, Othello, and Catch the Lion, and relate this performance to the tacticality of the domains.

Original language	English
Pages (from-to)	167-179
Journal	IEEE Transactions on Computational Intelligence and AI in Games
Volume	7
Issue number	2
DOIs	https://doi.org/10.1109/TCIAIG.2014.2366555
Publication status	Published - Jun 2015

Keywords

Artificial intelligence
computational intelligence
games
game tree search
Monte Carlo methods
planning

Access to Document

10.1109/TCIAIG.2014.2366555

Cite this

@article{23238f8397004109beff4a550c59147c,

title = "MCTS-Minimax Hybrids",

abstract = "Monte Carlo tree search (MCTS) is a sampling-based search algorithm that is state of the art in a variety of games. In many domains, its Monte Carlo rollouts of entire games give it a strategic advantage over traditional depth-limited minimax search with pruning. These rollouts can often detect long-term consequences of moves, freeing the programmer from having to capture these consequences in a heuristic evaluation function. But due to its highly selective tree, MCTS runs a higher risk than full-width minimax search of missing individual moves and falling into traps in tactical situations. This paper proposes MCTS-minimax hybrids that integrate shallow minimax searches into the MCTS framework. Three approaches are outlined, using minimax in the selection/expansion phase, the rollout phase, and the backpropagation phase of MCTS. Without assuming domain knowledge in the form of evaluation functions, these hybrid algorithms are a first step towards combining the strategic strength of MCTS and the tactical strength of minimax. We investigate their effectiveness in the test domains of Connect-4, Breakthrough, Othello, and Catch the Lion, and relate this performance to the tacticality of the domains.",

keywords = "Artificial intelligence, computational intelligence, games, game tree search, Monte Carlo methods, planning",

author = "Hendrik Baier and Winands, {Mark H. M.}",

year = "2015",

month = jun,

doi = "10.1109/TCIAIG.2014.2366555",

language = "English",

volume = "7",

pages = "167--179",

journal = "IEEE Transactions on Computational Intelligence and AI in Games",

issn = "1943-068X",

publisher = "IEEE",

number = "2",

}

TY - JOUR

T1 - MCTS-Minimax Hybrids

AU - Baier, Hendrik

AU - Winands, Mark H. M.

PY - 2015/6

Y1 - 2015/6

N2 - Monte Carlo tree search (MCTS) is a sampling-based search algorithm that is state of the art in a variety of games. In many domains, its Monte Carlo rollouts of entire games give it a strategic advantage over traditional depth-limited minimax search with pruning. These rollouts can often detect long-term consequences of moves, freeing the programmer from having to capture these consequences in a heuristic evaluation function. But due to its highly selective tree, MCTS runs a higher risk than full-width minimax search of missing individual moves and falling into traps in tactical situations. This paper proposes MCTS-minimax hybrids that integrate shallow minimax searches into the MCTS framework. Three approaches are outlined, using minimax in the selection/expansion phase, the rollout phase, and the backpropagation phase of MCTS. Without assuming domain knowledge in the form of evaluation functions, these hybrid algorithms are a first step towards combining the strategic strength of MCTS and the tactical strength of minimax. We investigate their effectiveness in the test domains of Connect-4, Breakthrough, Othello, and Catch the Lion, and relate this performance to the tacticality of the domains.

AB - Monte Carlo tree search (MCTS) is a sampling-based search algorithm that is state of the art in a variety of games. In many domains, its Monte Carlo rollouts of entire games give it a strategic advantage over traditional depth-limited minimax search with pruning. These rollouts can often detect long-term consequences of moves, freeing the programmer from having to capture these consequences in a heuristic evaluation function. But due to its highly selective tree, MCTS runs a higher risk than full-width minimax search of missing individual moves and falling into traps in tactical situations. This paper proposes MCTS-minimax hybrids that integrate shallow minimax searches into the MCTS framework. Three approaches are outlined, using minimax in the selection/expansion phase, the rollout phase, and the backpropagation phase of MCTS. Without assuming domain knowledge in the form of evaluation functions, these hybrid algorithms are a first step towards combining the strategic strength of MCTS and the tactical strength of minimax. We investigate their effectiveness in the test domains of Connect-4, Breakthrough, Othello, and Catch the Lion, and relate this performance to the tacticality of the domains.

KW - Artificial intelligence

KW - computational intelligence

KW - games

KW - game tree search

KW - Monte Carlo methods

KW - planning

U2 - 10.1109/TCIAIG.2014.2366555

DO - 10.1109/TCIAIG.2014.2366555

M3 - Article

SN - 1943-068X

VL - 7

SP - 167

EP - 179

JO - IEEE Transactions on Computational Intelligence and AI in Games

JF - IEEE Transactions on Computational Intelligence and AI in Games

IS - 2

ER -