Time Management for Monte Carlo Tree Search

Hendrik Baier; Mark H. M. Winands

doi:10.1109/TCIAIG.2015.2443123

Time Management for Monte Carlo Tree Search

Hendrik Baier^*, Mark H. M. Winands

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Monte Carlo Tree Search (MCTS) is a popular approach for tree search in a variety of games. While MCTS allows for fine-grained time control, not much has been published on time management for MCTS programs under tournament conditions. This paper first investigates the effects of various time-management strategies on playing strength in the challenging game of Go. A number of domain-independent strategies are then tested in the domains Connect-4, Breakthrough, Othello, and Catch the Lion. We consider strategies taken from the literature as well as newly proposed and improved ones. Strategies include both semi-dynamic strategies that decide about time allocation for each search before it is started, and dynamic strategies that influence the duration of each move search while it is already running. Furthermore, we analyze the effects of time management strategies on the distribution of time over the moves of an average game, allowing us to partly explain their performance. In the experiments, the domain-independent strategy STOP provides a significant improvement over the state of the art in Go, and is the most effective time management strategy tested in all five domains.

Original language	English
Pages (from-to)	301-314
Journal	IEEE Transactions on Computational Intelligence and AI in Games
Volume	8
Issue number	3
DOIs	https://doi.org/10.1109/TCIAIG.2015.2443123
Publication status	Published - Sept 2016

Keywords

Artificial intelligence
game tree search
Monte Carlo Tree Search
time management

Access to Document

10.1109/TCIAIG.2015.2443123

Cite this

@article{c548014515d14f2db26bbd15476b443a,

title = "Time Management for Monte Carlo Tree Search",

abstract = "Monte Carlo Tree Search (MCTS) is a popular approach for tree search in a variety of games. While MCTS allows for fine-grained time control, not much has been published on time management for MCTS programs under tournament conditions. This paper first investigates the effects of various time-management strategies on playing strength in the challenging game of Go. A number of domain-independent strategies are then tested in the domains Connect-4, Breakthrough, Othello, and Catch the Lion. We consider strategies taken from the literature as well as newly proposed and improved ones. Strategies include both semi-dynamic strategies that decide about time allocation for each search before it is started, and dynamic strategies that influence the duration of each move search while it is already running. Furthermore, we analyze the effects of time management strategies on the distribution of time over the moves of an average game, allowing us to partly explain their performance. In the experiments, the domain-independent strategy STOP provides a significant improvement over the state of the art in Go, and is the most effective time management strategy tested in all five domains.",

keywords = "Artificial intelligence, game tree search, Monte Carlo Tree Search, time management",

author = "Hendrik Baier and Winands, {Mark H. M.}",

year = "2016",

month = sep,

doi = "10.1109/TCIAIG.2015.2443123",

language = "English",

volume = "8",

pages = "301--314",

journal = "IEEE Transactions on Computational Intelligence and AI in Games",

issn = "1943-068X",

publisher = "IEEE",

number = "3",

}

TY - JOUR

T1 - Time Management for Monte Carlo Tree Search

AU - Baier, Hendrik

AU - Winands, Mark H. M.

PY - 2016/9

Y1 - 2016/9

N2 - Monte Carlo Tree Search (MCTS) is a popular approach for tree search in a variety of games. While MCTS allows for fine-grained time control, not much has been published on time management for MCTS programs under tournament conditions. This paper first investigates the effects of various time-management strategies on playing strength in the challenging game of Go. A number of domain-independent strategies are then tested in the domains Connect-4, Breakthrough, Othello, and Catch the Lion. We consider strategies taken from the literature as well as newly proposed and improved ones. Strategies include both semi-dynamic strategies that decide about time allocation for each search before it is started, and dynamic strategies that influence the duration of each move search while it is already running. Furthermore, we analyze the effects of time management strategies on the distribution of time over the moves of an average game, allowing us to partly explain their performance. In the experiments, the domain-independent strategy STOP provides a significant improvement over the state of the art in Go, and is the most effective time management strategy tested in all five domains.

AB - Monte Carlo Tree Search (MCTS) is a popular approach for tree search in a variety of games. While MCTS allows for fine-grained time control, not much has been published on time management for MCTS programs under tournament conditions. This paper first investigates the effects of various time-management strategies on playing strength in the challenging game of Go. A number of domain-independent strategies are then tested in the domains Connect-4, Breakthrough, Othello, and Catch the Lion. We consider strategies taken from the literature as well as newly proposed and improved ones. Strategies include both semi-dynamic strategies that decide about time allocation for each search before it is started, and dynamic strategies that influence the duration of each move search while it is already running. Furthermore, we analyze the effects of time management strategies on the distribution of time over the moves of an average game, allowing us to partly explain their performance. In the experiments, the domain-independent strategy STOP provides a significant improvement over the state of the art in Go, and is the most effective time management strategy tested in all five domains.

KW - Artificial intelligence

KW - game tree search

KW - Monte Carlo Tree Search

KW - time management

U2 - 10.1109/TCIAIG.2015.2443123

DO - 10.1109/TCIAIG.2015.2443123

M3 - Article

SN - 1943-068X

VL - 8

SP - 301

EP - 314

JO - IEEE Transactions on Computational Intelligence and AI in Games

JF - IEEE Transactions on Computational Intelligence and AI in Games

IS - 3

ER -