Two learning algorithms for forward pruning

L Kocsis*, HJ van den Herik, JWHM Uiterwijk

*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › Peer-reviewed

Abstract

The article investigates two learning algorithms for forward pruning. The TS-FPV algorithm uses a tabu-search (TS) algorithm to explore the space of forward-pruning vectors (FPVs), focusing on critical FPVs. The RL-FPF algorithm is a reinforcement-learning (RL) algorithm for forward-pruning functions (FPFs) that uses a gradient-descent update rule. The two algorithms are tested using the chess program CRAFTY. The criteria used for evaluation are the size of the search tree and the quality of the move. The experimental results show that both algorithms are able to tune a forward-pruning scheme with a better overall performance than a comparable full-width search. The main result is that the FPFs obtained from RL-FPF outperform the best FPVs resulting from TS-FPV.
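To illustrate the idea of a forward-pruning vector, the sketch below runs alpha-beta search over a toy game tree and skips moves whose static evaluation falls outside the search window by more than a depth-indexed margin taken from an FPV. The tree, the evaluation function, and the margin rule are illustrative assumptions; the paper's actual FPVs parameterize the pruning scheme inside CRAFTY, not this toy search.

```python
# Hedged sketch: alpha-beta search with forward pruning controlled by a
# depth-indexed forward-pruning vector (FPV). Everything here (the toy
# tree, the static evaluation, the margin rule) is an illustrative
# assumption, not the paper's exact scheme.

# Toy game tree: internal nodes list their children; leaves carry scores
# from the root player's perspective.
TREE = {"root": ["a", "b"], "a": ["a1", "a2"], "b": ["b1", "b2"]}
LEAVES = {"a1": 3, "a2": 5, "b1": -1, "b2": 9}

def evaluate(node):
    """Static evaluation: exact at leaves, a crude average for internal nodes."""
    if node in LEAVES:
        return LEAVES[node]
    return sum(evaluate(c) for c in TREE[node]) / len(TREE[node])

def search(node, depth, alpha, beta, maximizing, fpv, stats):
    """Minimax alpha-beta with forward pruning: a child is skipped when its
    static evaluation misses the window by more than the FPV margin for
    the current depth."""
    stats["nodes"] += 1
    if depth == 0 or node in LEAVES:
        return evaluate(node)
    margin = fpv[min(depth, len(fpv) - 1)]
    if maximizing:
        best = float("-inf")
        for child in TREE[node]:
            # Forward pruning: the child looks too weak to raise alpha.
            if evaluate(child) + margin <= alpha:
                stats["pruned"] += 1
                continue
            best = max(best, search(child, depth - 1, alpha, beta, False, fpv, stats))
            alpha = max(alpha, best)
            if alpha >= beta:          # ordinary beta cutoff
                break
        return best
    else:
        best = float("inf")
        for child in TREE[node]:
            # Forward pruning: the child looks too strong to lower beta.
            if evaluate(child) - margin >= beta:
                stats["pruned"] += 1
                continue
            best = min(best, search(child, depth - 1, alpha, beta, True, fpv, stats))
            beta = min(beta, best)
            if alpha >= beta:          # ordinary alpha cutoff
                break
        return best

# Pruned search with a tight FPV vs. an effectively full-width search.
stats_fpv = {"nodes": 0, "pruned": 0}
value = search("root", 2, float("-inf"), float("inf"), True, [0, 1, 2], stats_fpv)
stats_full = {"nodes": 0, "pruned": 0}
value_full = search("root", 2, float("-inf"), float("inf"), True, [1000] * 3, stats_full)
print(value, stats_fpv["nodes"], value_full, stats_full["nodes"])
```

On this toy tree both searches return the same minimax value while the pruned search visits fewer nodes, which is the trade-off the two learning algorithms tune: how much of the tree can be cut without degrading move quality.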
Original language: English
Pages (from-to): 165-181
Journal: ICGA Journal
Volume: 26
Issue number: 3
DOIs
Publication status: Published - Sept 2003