Modeling Confidence in Sequence-to-Sequence Models

Jan Niehues, Ngoc-Quan Pham

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review


Recently, significant improvements have been achieved in various natural language processing tasks using neural sequence-to-sequence models. While aiming for the best generation quality is important, ultimately it is also necessary to develop models that can assess the quality of their own output. In this work, we propose to use the similarity between training and test conditions as a measure of model confidence. We investigate methods using this similarity alone as well as methods combining it with the posterior probability. While traditionally only target tokens are annotated with confidence measures, we also investigate methods to annotate source tokens with confidence. By learning an internal alignment model, we can significantly improve confidence projection over using state-of-the-art external alignment tools. We evaluate the proposed methods on downstream confidence estimation for machine translation (MT). We show improvements on segment-level confidence estimation as well as on confidence estimation for source tokens. In addition, we show that the same methods can also be applied to other tasks using sequence-to-sequence models. On the automatic speech recognition (ASR) task, we are able to find 60% of the errors by looking at 20% of the data.
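As context for the posterior-probability baseline the abstract mentions, the following is a minimal sketch (not the paper's similarity-based method) of how segment-level confidence can be derived from per-token log-probabilities and used to select the least-confident fraction of segments for review, as in the 20%-of-data error-finding setup. All names, the toy log-probability values, and the averaging scheme are illustrative assumptions.

```python
import math

def segment_confidence(token_logprobs):
    # A simple segment-level confidence score: the average
    # token log-probability (higher = more confident).
    return sum(token_logprobs) / len(token_logprobs)

def select_for_review(segments, fraction=0.2):
    # Rank segments by confidence and return the least-confident
    # `fraction` of them for manual inspection.
    ranked = sorted(segments, key=lambda s: s["confidence"])
    k = max(1, math.ceil(fraction * len(segments)))
    return ranked[:k]

# Hypothetical model outputs: per-token log-probabilities per segment.
outputs = [
    {"id": 0, "logprobs": [-0.1, -0.2, -0.1]},
    {"id": 1, "logprobs": [-2.3, -1.9, -2.5]},  # noticeably uncertain
    {"id": 2, "logprobs": [-0.3, -0.4]},
    {"id": 3, "logprobs": [-1.5, -0.2, -0.9]},
    {"id": 4, "logprobs": [-0.2, -0.1]},
]
segments = [
    {"id": o["id"], "confidence": segment_confidence(o["logprobs"])}
    for o in outputs
]
flagged = select_for_review(segments, fraction=0.2)
print([s["id"] for s in flagged])  # prints [1]: the least-confident 20%
```

The paper's contribution replaces or augments this posterior score with a train/test similarity measure; the triage loop over ranked segments stays the same.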

Original language: English
Title of host publication: Proceedings of the 12th International Conference on Natural Language Generation (INLG 2019)
Editors: Kees van Deemter, Chenghua Lin, Hiroya Takamura
Publisher: Association for Computational Linguistics
Number of pages: 9
Publication status: Published - 2019
