Graph kernels and Gaussian processes for relational reinforcement learning

Kurt Driessens; Jan Ramon; Thomas Gaertner

doi:10.1007/s10994-006-8258-y

Graph kernels and Gaussian processes for relational reinforcement learning

Kurt Driessens^*, Jan Ramon, Thomas Gaertner

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Rrl is a relational reinforcement learning system based on q-learning in relational state-action spaces. It aims to enable agents to learn how to act in an environment that has no natural representation as a tuple of constants. For relational reinforcement learning, the learning algorithm used to approximate the mapping between state-action pairs and their so called q(uality)-value has to be very reliable, and it has to be able to handle the relational representation of state-action pairs. In this paper we investigate the use of gaussian processes to approximate the q-values of state-action pairs. In order to employ gaussian processes in a relational setting we propose graph kernels as a covariance function between state-action pairs. The standard prediction mechanism for gaussian processes requires a matrix inversion which can become unstable when the kernel matrix has low rank. These instabilities can be avoided by employing qr-factorization. This leads to better and more stable performance of the algorithm and a more efficient incremental update mechanism. Experiments conducted in the blocks world and with the tetris game show that gaussian processes with graph kernels can compete with, and often improve on, regression trees and instance based regression as a generalization algorithm for rrl.

Original language	English
Pages (from-to)	91-119
Journal	Machine Learning
Volume	64
Issue number	1-3
DOIs	https://doi.org/10.1007/s10994-006-8258-y
Publication status	Published - Sept 2006
Externally published	Yes

Keywords

reinforcement learning
relational learning
graph kernels
Gaussian processes

Access to Document

10.1007/s10994-006-8258-y

Cite this

@article{bf7117eadbc340f6b94945fae10609c4,

title = "Graph kernels and Gaussian processes for relational reinforcement learning",

abstract = "Rrl is a relational reinforcement learning system based on q-learning in relational state-action spaces. It aims to enable agents to learn how to act in an environment that has no natural representation as a tuple of constants. For relational reinforcement learning, the learning algorithm used to approximate the mapping between state-action pairs and their so called q(uality)-value has to be very reliable, and it has to be able to handle the relational representation of state-action pairs. In this paper we investigate the use of gaussian processes to approximate the q-values of state-action pairs. In order to employ gaussian processes in a relational setting we propose graph kernels as a covariance function between state-action pairs. The standard prediction mechanism for gaussian processes requires a matrix inversion which can become unstable when the kernel matrix has low rank. These instabilities can be avoided by employing qr-factorization. This leads to better and more stable performance of the algorithm and a more efficient incremental update mechanism. Experiments conducted in the blocks world and with the tetris game show that gaussian processes with graph kernels can compete with, and often improve on, regression trees and instance based regression as a generalization algorithm for rrl.",

keywords = "reinforcement learning, relational learning, graph kernels, Gaussian processes",

author = "Kurt Driessens and Jan Ramon and Thomas Gaertner",

year = "2006",

month = sep,

doi = "10.1007/s10994-006-8258-y",

language = "English",

volume = "64",

pages = "91--119",

journal = "Machine Learning",

issn = "0885-6125",

publisher = "Springer",

number = "1-3",

}

TY - JOUR

T1 - Graph kernels and Gaussian processes for relational reinforcement learning

AU - Driessens, Kurt

AU - Ramon, Jan

AU - Gaertner, Thomas

PY - 2006/9

Y1 - 2006/9

N2 - Rrl is a relational reinforcement learning system based on q-learning in relational state-action spaces. It aims to enable agents to learn how to act in an environment that has no natural representation as a tuple of constants. For relational reinforcement learning, the learning algorithm used to approximate the mapping between state-action pairs and their so called q(uality)-value has to be very reliable, and it has to be able to handle the relational representation of state-action pairs. In this paper we investigate the use of gaussian processes to approximate the q-values of state-action pairs. In order to employ gaussian processes in a relational setting we propose graph kernels as a covariance function between state-action pairs. The standard prediction mechanism for gaussian processes requires a matrix inversion which can become unstable when the kernel matrix has low rank. These instabilities can be avoided by employing qr-factorization. This leads to better and more stable performance of the algorithm and a more efficient incremental update mechanism. Experiments conducted in the blocks world and with the tetris game show that gaussian processes with graph kernels can compete with, and often improve on, regression trees and instance based regression as a generalization algorithm for rrl.

AB - Rrl is a relational reinforcement learning system based on q-learning in relational state-action spaces. It aims to enable agents to learn how to act in an environment that has no natural representation as a tuple of constants. For relational reinforcement learning, the learning algorithm used to approximate the mapping between state-action pairs and their so called q(uality)-value has to be very reliable, and it has to be able to handle the relational representation of state-action pairs. In this paper we investigate the use of gaussian processes to approximate the q-values of state-action pairs. In order to employ gaussian processes in a relational setting we propose graph kernels as a covariance function between state-action pairs. The standard prediction mechanism for gaussian processes requires a matrix inversion which can become unstable when the kernel matrix has low rank. These instabilities can be avoided by employing qr-factorization. This leads to better and more stable performance of the algorithm and a more efficient incremental update mechanism. Experiments conducted in the blocks world and with the tetris game show that gaussian processes with graph kernels can compete with, and often improve on, regression trees and instance based regression as a generalization algorithm for rrl.

KW - reinforcement learning

KW - relational learning

KW - graph kernels

KW - Gaussian processes

U2 - 10.1007/s10994-006-8258-y

DO - 10.1007/s10994-006-8258-y

M3 - Article

SN - 0885-6125

VL - 64

SP - 91

EP - 119

JO - Machine Learning

JF - Machine Learning

IS - 1-3

ER -