TY - GEN
T1 - Environment Descriptions for Usability and Generalisation in Reinforcement Learning
AU - Soemers, Dennis J. N. J.
AU - Samothrakis, Spyridon
AU - Driessens, Kurt
AU - Winands, Mark H.M.
PY - 2025
Y1 - 2025
N2 - The majority of current reinforcement learning (RL) research involves training and deploying agents in environments that are implemented by engineers in general-purpose programming languages and more advanced frameworks such as CUDA or JAX. This makes the application of RL to novel problems of interest inaccessible to small organisations or private individuals with insufficient engineering expertise. This position paper argues that, to enable more widespread adoption of RL, it is important for the research community to shift focus towards methodologies where environments are described in user-friendly domain-specific or natural languages. Aside from improving the usability of RL, such language-based environment descriptions may also provide valuable context and boost the ability of trained agents to generalise to unseen environments within the set of all environments that can be described in any language of choice.
AB - The majority of current reinforcement learning (RL) research involves training and deploying agents in environments that are implemented by engineers in general-purpose programming languages and more advanced frameworks such as CUDA or JAX. This makes the application of RL to novel problems of interest inaccessible to small organisations or private individuals with insufficient engineering expertise. This position paper argues that, to enable more widespread adoption of RL, it is important for the research community to shift focus towards methodologies where environments are described in user-friendly domain-specific or natural languages. Aside from improving the usability of RL, such language-based environment descriptions may also provide valuable context and boost the ability of trained agents to generalise to unseen environments within the set of all environments that can be described in any language of choice.
U2 - 10.5220/0013247300003890
DO - 10.5220/0013247300003890
M3 - Conference article in proceeding
VL - 3
SP - 983
EP - 992
BT - Proceedings of the 17th International Conference on Agents and Artificial Intelligence
ER -