Incremental processing of noisy user utterances in the spoken language understanding task

Stefan Constantin; Jan Niehues; Alex Waibel

doi:10.18653/V1/D19-5535

Advanced Computing Sciences

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

Abstract

The state-of-the-art neural network architectures make it possible to create spoken language understanding systems with high quality and fast processing time. One major challenge for real-world applications is the high latency of these systems caused by triggered actions with high execution times. If an action can be separated into subactions, the reaction time of the systems can be improved through incremental processing of the user utterance and starting subactions while the utterance is still being uttered. In this work, we present a model-agnostic method to achieve high quality in processing incrementally produced partial utterances. Based on clean and noisy versions of the ATIS dataset, we show how to create datasets with our method to create low-latency natural language understanding components. We get improvements of up to 47.91 absolute percentage points in the metric F1-score.
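
The idea of "incrementally produced partial utterances" can be illustrated with a minimal sketch. It assumes a partial utterance is simply a word-level prefix of the full utterance; the function name `partial_utterances` and the example sentence are hypothetical, and this is not necessarily how the authors build their datasets.

```python
from typing import List


def partial_utterances(utterance: str) -> List[str]:
    """Return every word-level prefix of an utterance, shortest first.

    Assumption: tokens are separated by whitespace, and a "partial
    utterance" is the text heard so far at each word boundary.
    """
    tokens = utterance.split()
    return [" ".join(tokens[: i + 1]) for i in range(len(tokens))]


if __name__ == "__main__":
    # Hypothetical ATIS-style request, used only for illustration.
    for prefix in partial_utterances("show flights from boston to denver"):
        print(prefix)
```

An understanding component that receives each prefix in turn could, in principle, start a subaction as soon as a prefix carries enough information, rather than waiting for the end of the utterance.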

Original language	English
Title of host publication	Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT)
Publisher	Association for Computational Linguistics (ACL)
Pages	265-274
Number of pages	10
ISBN (Electronic)	9781950737840
DOIs	https://doi.org/10.18653/V1/D19-5535
Publication status	Published - 2019
Event	5th Workshop on Noisy User-Generated Text - Hong Kong, China; Duration: 4 Nov 2019 → 4 Nov 2019; Conference number: 5

Workshop

Workshop	5th Workshop on Noisy User-Generated Text
Abbreviated title	W-NUT@EMNLP 2019
Country/Territory	China
City	Hong Kong
Period	4/11/19 → 4/11/19

Access to Document

10.18653/V1/D19-5535 (Licence: CC BY)

http://arxiv.org/abs/1909.13790

Cite this

@inproceedings{feb02684e6b3406c894cc86a4f84b3b0,

title = "Incremental processing of noisy user utterances in the spoken language understanding task",

abstract = "The state-of-the-art neural network architectures make it possible to create spoken language understanding systems with high quality and fast processing time. One major challenge for real-world applications is the high latency of these systems caused by triggered actions with high execution times. If an action can be separated into subactions, the reaction time of the systems can be improved through incremental processing of the user utterance and starting subactions while the utterance is still being uttered. In this work, we present a model-agnostic method to achieve high quality in processing incrementally produced partial utterances. Based on clean and noisy versions of the ATIS dataset, we show how to create datasets with our method to create low-latency natural language understanding components. We get improvements of up to 47.91 absolute percentage points in the metric F1-score.",

author = "Stefan Constantin and Jan Niehues and Alex Waibel",

note = "Funding Information: This work has been conducted in the SecondHands project which has received funding from the European Union{\textquoteright}s Horizon 2020 Research and Innovation programme (call:H2020-ICT-2014-1, RIA) under grant agreement No 643950. Publisher Copyright: {\textcopyright} 2019 Association for Computational Linguistics; 5th Workshop on Noisy User-Generated Text, W-NUT@EMNLP 2019 ; Conference date: 04-11-2019 Through 04-11-2019",

year = "2019",

doi = "10.18653/V1/D19-5535",

language = "English",

pages = "265--274",

booktitle = "Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT)",

publisher = "Association for Computational Linguistics (ACL)",

address = "United States",

}

Constantin, S, Niehues, J & Waibel, A 2019, Incremental processing of noisy user utterances in the spoken language understanding task. in Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT). Association for Computational Linguistics (ACL), pp. 265-274, 5th Workshop on Noisy User-Generated Text, Hong Kong, China, 4/11/19. https://doi.org/10.18653/V1/D19-5535

Incremental processing of noisy user utterances in the spoken language understanding task. / Constantin, Stefan; Niehues, Jan; Waibel, Alex.
Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT). Association for Computational Linguistics (ACL), 2019. p. 265-274.

TY - GEN

T1 - Incremental processing of noisy user utterances in the spoken language understanding task

AU - Constantin, Stefan

AU - Niehues, Jan

AU - Waibel, Alex

N1 - Conference code: 5

PY - 2019

Y1 - 2019

N2 - The state-of-the-art neural network architectures make it possible to create spoken language understanding systems with high quality and fast processing time. One major challenge for real-world applications is the high latency of these systems caused by triggered actions with high execution times. If an action can be separated into subactions, the reaction time of the systems can be improved through incremental processing of the user utterance and starting subactions while the utterance is still being uttered. In this work, we present a model-agnostic method to achieve high quality in processing incrementally produced partial utterances. Based on clean and noisy versions of the ATIS dataset, we show how to create datasets with our method to create low-latency natural language understanding components. We get improvements of up to 47.91 absolute percentage points in the metric F1-score.

AB - The state-of-the-art neural network architectures make it possible to create spoken language understanding systems with high quality and fast processing time. One major challenge for real-world applications is the high latency of these systems caused by triggered actions with high execution times. If an action can be separated into subactions, the reaction time of the systems can be improved through incremental processing of the user utterance and starting subactions while the utterance is still being uttered. In this work, we present a model-agnostic method to achieve high quality in processing incrementally produced partial utterances. Based on clean and noisy versions of the ATIS dataset, we show how to create datasets with our method to create low-latency natural language understanding components. We get improvements of up to 47.91 absolute percentage points in the metric F1-score.

U2 - 10.18653/V1/D19-5535

DO - 10.18653/V1/D19-5535

M3 - Conference article in proceeding

SP - 265

EP - 274

BT - Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT)

PB - Association for Computational Linguistics (ACL)

T2 - 5th Workshop on Noisy User-Generated Text

Y2 - 4 November 2019 through 4 November 2019

ER -