Ancestor-Based α-β Bounds for Monte-Carlo Tree Search

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

1 Downloads (Pure)

Abstract

Upper Confidence bounds applied to Trees (UCT) is the default selection policy in Monte-Carlo Tree Search (MCTS), yet it overlooks the strategic use of ancestral node information. Consequently, UCT approaches each decision level as an independent Multi-Armed Bandit problem, disregarding the results achieved along the path that led to the current state. Consequently, it treats decisions as separate in the tree, without integrating the historical context of previous choices. This paper introduces an enhancement to UCT for two-player, deterministic zero-sum games by integrating insights from α−β pruning-a method that increases minimax search efficiency through selective pruning. We propose a revised selection policy that leverages ancestor node data, mirroring α−β pruning’s principle, to refine sample-based search. Our experiments with this enhanced method reveal performance gains in Breakthrough, Mini Shogi, and GoMoku, highlighting the effectiveness of incorporating ancestor search results into the MCTS selection processes.
Original languageEnglish
Title of host publication2024 IEEE Conference on Games (CoG)
PublisherIEEE
Pages1-4
ISBN (Electronic)979-8-3503-5067-8
ISBN (Print)979-8-3503-5068-5
DOIs
Publication statusPublished - 5 Aug 2024
Event2024 IEEE Conference on Games - Milan, Italy
Duration: 5 Aug 20248 Aug 2024
https://2024.ieee-cog.org/

Conference

Conference2024 IEEE Conference on Games
Abbreviated titleIEEE CoG 2024
Country/TerritoryItaly
CityMilan
Period5/08/248/08/24
Internet address

Fingerprint

Dive into the research topics of 'Ancestor-Based α-β Bounds for Monte-Carlo Tree Search'. Together they form a unique fingerprint.

Cite this