Exploring the Context of Recurrent Neural Network based Conversational Agents

Raffaele Piccini*, Gerasimos Spanakis

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

Abstract

Conversational agents have begun to rise in both the academic (in terms of research) and commercial (in terms of applications) world. This paper investigates the task of building a non-goal-driven conversational agent using neural network generative models and analyzes how the conversation context is handled. It compares a simpler Encoder-Decoder with a Hierarchical Recurrent Encoder-Decoder architecture, which includes an additional module that models the context of the conversation using information from previous utterances. We found that the hierarchical model was able to extract relevant context information and include it in the generation of the output. However, it performed worse (by 35-40%) than the simple Encoder-Decoder model in terms of both grammatically correct output and meaningful responses. Despite these results, experiments demonstrate that conversations about similar topics appear close to each other in the context space due to the increased frequency of specific topic-related words, leaving promising directions for future research on how the context of a conversation can be exploited.
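
To illustrate the architecture contrast described above, the following is a minimal sketch of a Hierarchical Recurrent Encoder-Decoder, assuming a PyTorch-style implementation with GRU cells. The class name, hyperparameters, and tensor shapes are illustrative assumptions for exposition, not the authors' actual model or training setup.

```python
import torch
import torch.nn as nn

class HREDContextSketch(nn.Module):
    """Illustrative HRED sketch: an utterance-level encoder feeds a
    dialogue-level (context) encoder, whose final state conditions the
    decoder. All sizes are hypothetical defaults, not the paper's values."""

    def __init__(self, vocab_size=10000, emb_dim=128, utt_hidden=256, ctx_hidden=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.utterance_enc = nn.GRU(emb_dim, utt_hidden, batch_first=True)
        self.context_enc = nn.GRU(utt_hidden, ctx_hidden, batch_first=True)
        self.decoder = nn.GRU(emb_dim + ctx_hidden, utt_hidden, batch_first=True)
        self.out = nn.Linear(utt_hidden, vocab_size)

    def forward(self, dialogue, response):
        # dialogue: (batch, n_utterances, utt_len) token ids of previous turns
        # response: (batch, resp_len) token ids of the target turn (teacher forcing)
        b, n_utt, utt_len = dialogue.shape
        # Encode each previous utterance independently; keep its final hidden state.
        flat = dialogue.view(b * n_utt, utt_len)
        _, utt_state = self.utterance_enc(self.embed(flat))      # (1, b*n_utt, utt_hidden)
        utt_state = utt_state.squeeze(0).view(b, n_utt, -1)      # (b, n_utt, utt_hidden)
        # The context encoder runs over the sequence of utterance vectors.
        _, ctx_state = self.context_enc(utt_state)               # (1, b, ctx_hidden)
        ctx = ctx_state.squeeze(0)                               # (b, ctx_hidden)
        # Condition every decoder step on the dialogue-level context vector.
        resp_emb = self.embed(response)                          # (b, resp_len, emb_dim)
        ctx_rep = ctx.unsqueeze(1).expand(-1, response.size(1), -1)
        dec_out, _ = self.decoder(torch.cat([resp_emb, ctx_rep], dim=-1))
        return self.out(dec_out)                                 # logits over the vocabulary

# Example usage with random token ids (shapes only; no trained weights).
model = HREDContextSketch()
dialogue = torch.randint(0, 10000, (2, 3, 12))   # 2 dialogues, 3 previous utterances of 12 tokens
response = torch.randint(0, 10000, (2, 15))      # target responses of 15 tokens
logits = model(dialogue, response)               # (2, 15, 10000)
```

A plain Encoder-Decoder, by contrast, would drop the `context_enc` module and condition the decoder only on the encoding of the most recent utterance; the extra dialogue-level recurrence is what lets the hierarchical model carry topic information across turns.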

Original language: English
Title of host publication: Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART
Editors: A. P. Rocha, L. Steels, J. van den Herik
Publisher: SCITEPRESS
Pages: 347-356
Number of pages: 10
ISBN (Print): 978-989-758-350-6
DOIs
Publication status: Published - Feb 2019

Keywords

  • Conversational Agents
  • Hierarchical Recurrent Encoder Decoder
  • Recurrent Neural Networks
