Spatial State-Action Features for General Games

Dennis J. N. J. Soemers; Eric Piette; Matthew Stephenson; Cameron Browne

doi:https://doi.org/10.1016/j.artint.2023.103937

Spatial State-Action Features for General Games

Dennis J. N. J. Soemers^*, Eric Piette, Matthew Stephenson, Cameron Browne

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

In many board games and other abstract games, patterns have been used as features that can guide automated game-playing agents. Such patterns or features often represent particular configurations of pieces, empty positions, etc., which may be relevant for a game's strategies. Their use has been particularly prevalent in the game of Go, but also many other games used as benchmarks for AI research. In this paper, we formulate a design and efficient implementation of spatial state-action features for general games. These are patterns that can be trained to incentivise or disincentivise actions based on whether or not they match variables of the state in a local area around action variables. We provide extensive details on several design and implementation choices, with a primary focus on achieving a high degree of generality to support a wide variety of different games using different board geometries or other graphs. Secondly, we propose an efficient approach for evaluating active features for any given set of features. In this approach, we take inspiration from heuristics used in problems such as SAT to optimise the order in which parts of patterns are matched and prune unnecessary evaluations. This approach is defined for a highly general and abstract description of the problem—phrased as optimising the order in which propositions of formulas in disjunctive normal form are evaluated—and may therefore also be of interest to other types of problems than board games. An empirical evaluation on 33 distinct games in the Ludii general game system demonstrates the efficiency of this approach in comparison to a naive baseline, as well as a baseline based on prefix trees, and demonstrates that the additional efficiency significantly improves the playing strength of agents using the features to guide search.

Original language	English
Article number	103937
Number of pages	32
Journal	Artificial Intelligence
Volume	321
DOIs	https://doi.org/10.1016/j.artint.2023.103937
Publication status	Published - Aug 2023

Keywords

AI and games
General game playing
Ordering propositions
Pattern matching

Access to Document

https://doi.org/10.1016/j.artint.2023.103937Licence: CC BY

Cite this

@article{6cf15862f80a4e0e831deb41f7586729,

title = "Spatial State-Action Features for General Games",

abstract = "In many board games and other abstract games, patterns have been used as features that can guide automated game-playing agents. Such patterns or features often represent particular configurations of pieces, empty positions, etc., which may be relevant for a game's strategies. Their use has been particularly prevalent in the game of Go, but also many other games used as benchmarks for AI research. In this paper, we formulate a design and efficient implementation of spatial state-action features for general games. These are patterns that can be trained to incentivise or disincentivise actions based on whether or not they match variables of the state in a local area around action variables. We provide extensive details on several design and implementation choices, with a primary focus on achieving a high degree of generality to support a wide variety of different games using different board geometries or other graphs. Secondly, we propose an efficient approach for evaluating active features for any given set of features. In this approach, we take inspiration from heuristics used in problems such as SAT to optimise the order in which parts of patterns are matched and prune unnecessary evaluations. This approach is defined for a highly general and abstract description of the problem—phrased as optimising the order in which propositions of formulas in disjunctive normal form are evaluated—and may therefore also be of interest to other types of problems than board games. An empirical evaluation on 33 distinct games in the Ludii general game system demonstrates the efficiency of this approach in comparison to a naive baseline, as well as a baseline based on prefix trees, and demonstrates that the additional efficiency significantly improves the playing strength of agents using the features to guide search.",

keywords = "AI and games, General game playing, Ordering propositions, Pattern matching",

author = "Soemers, {Dennis J. N. J.} and Eric Piette and Matthew Stephenson and Cameron Browne",

note = "Funding Information: This research is funded by the European Research Council as part of the Digital Ludeme Project (ERC Consolidator Grant # 771292 ) led by Cameron Browne at Maastricht University's Department of Advanced Computing Sciences. We wish to thank Walter Crist, C{\'e}dric Piette, Chiara Sironi, and Mark Winands for helpful pointers to related work. This work was carried out on the Dutch national e-infrastructure with the support of SURF Cooperative (grant no. EINF-1133 ). This work used the Dutch national e-infrastructure with the support of the SURF cooperative using grant no. EINF-4028 . This publication is part of the project “Evaluation of Trained AIs for General Game Playing” (with project number EINF-4028 ) of the research programme Computing Time on National Computer Facilities which is (partly) financed by the Dutch Research Council (NWO). Publisher Copyright: {\textcopyright} 2023 The Author(s)",

year = "2023",

month = aug,

doi = "https://doi.org/10.1016/j.artint.2023.103937",

language = "English",

volume = "321",

journal = "Artificial Intelligence",

issn = "0004-3702",

publisher = "Elsevier Science",

}

TY - JOUR

T1 - Spatial State-Action Features for General Games

AU - Soemers, Dennis J. N. J.

AU - Piette, Eric

AU - Stephenson, Matthew

AU - Browne, Cameron

N1 - Funding Information: This research is funded by the European Research Council as part of the Digital Ludeme Project (ERC Consolidator Grant # 771292 ) led by Cameron Browne at Maastricht University's Department of Advanced Computing Sciences. We wish to thank Walter Crist, Cédric Piette, Chiara Sironi, and Mark Winands for helpful pointers to related work. This work was carried out on the Dutch national e-infrastructure with the support of SURF Cooperative (grant no. EINF-1133 ). This work used the Dutch national e-infrastructure with the support of the SURF cooperative using grant no. EINF-4028 . This publication is part of the project “Evaluation of Trained AIs for General Game Playing” (with project number EINF-4028 ) of the research programme Computing Time on National Computer Facilities which is (partly) financed by the Dutch Research Council (NWO). Publisher Copyright: © 2023 The Author(s)

PY - 2023/8

Y1 - 2023/8

N2 - In many board games and other abstract games, patterns have been used as features that can guide automated game-playing agents. Such patterns or features often represent particular configurations of pieces, empty positions, etc., which may be relevant for a game's strategies. Their use has been particularly prevalent in the game of Go, but also many other games used as benchmarks for AI research. In this paper, we formulate a design and efficient implementation of spatial state-action features for general games. These are patterns that can be trained to incentivise or disincentivise actions based on whether or not they match variables of the state in a local area around action variables. We provide extensive details on several design and implementation choices, with a primary focus on achieving a high degree of generality to support a wide variety of different games using different board geometries or other graphs. Secondly, we propose an efficient approach for evaluating active features for any given set of features. In this approach, we take inspiration from heuristics used in problems such as SAT to optimise the order in which parts of patterns are matched and prune unnecessary evaluations. This approach is defined for a highly general and abstract description of the problem—phrased as optimising the order in which propositions of formulas in disjunctive normal form are evaluated—and may therefore also be of interest to other types of problems than board games. An empirical evaluation on 33 distinct games in the Ludii general game system demonstrates the efficiency of this approach in comparison to a naive baseline, as well as a baseline based on prefix trees, and demonstrates that the additional efficiency significantly improves the playing strength of agents using the features to guide search.

AB - In many board games and other abstract games, patterns have been used as features that can guide automated game-playing agents. Such patterns or features often represent particular configurations of pieces, empty positions, etc., which may be relevant for a game's strategies. Their use has been particularly prevalent in the game of Go, but also many other games used as benchmarks for AI research. In this paper, we formulate a design and efficient implementation of spatial state-action features for general games. These are patterns that can be trained to incentivise or disincentivise actions based on whether or not they match variables of the state in a local area around action variables. We provide extensive details on several design and implementation choices, with a primary focus on achieving a high degree of generality to support a wide variety of different games using different board geometries or other graphs. Secondly, we propose an efficient approach for evaluating active features for any given set of features. In this approach, we take inspiration from heuristics used in problems such as SAT to optimise the order in which parts of patterns are matched and prune unnecessary evaluations. This approach is defined for a highly general and abstract description of the problem—phrased as optimising the order in which propositions of formulas in disjunctive normal form are evaluated—and may therefore also be of interest to other types of problems than board games. An empirical evaluation on 33 distinct games in the Ludii general game system demonstrates the efficiency of this approach in comparison to a naive baseline, as well as a baseline based on prefix trees, and demonstrates that the additional efficiency significantly improves the playing strength of agents using the features to guide search.

KW - AI and games

KW - General game playing

KW - Ordering propositions

KW - Pattern matching

U2 - https://doi.org/10.1016/j.artint.2023.103937

DO - https://doi.org/10.1016/j.artint.2023.103937

M3 - Article

SN - 0004-3702

VL - 321

JO - Artificial Intelligence

JF - Artificial Intelligence

M1 - 103937

ER -