Combining quality estimation and automatic post-editing to enhance machine translation output

R. Chatterjee, M. Negri, M. Turchi, F. Blain, L. Specia

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

9 Citations (Scopus)

Abstract

We investigate different strategies for combining quality estimation (QE) and automatic postediting (APE) to improve the output of machine translation (MT) systems. The joint contribution of the two technologies is analyzed in different settings, in which QE serves as either: i) an activator of APE corrections, or ii) a guidance to APE corrections, or iii) a selector of the final output to be returned to the user. In the first case (QE as activator), sentence-level predictions on the raw MT output quality are used to trigger its automatic correction when the estimated (TER) scores are below a certain threshold. In the second case (QE as guidance), word-level binary quality predictions (“good”/“bad”) are used to inform APE about problematic words in the MT output that should be corrected. In the last case (QE as selector), both sentence- and word-level quality predictions are used to identify the most accurate translation between the original MT output and its post-edited version. For the sake of comparison, the underlying APE technologies explored in our evaluation are both phrase-based and neural. Experiments are carried out on the English-German data used for the QE/APE shared tasks organized within the First Conference on Machine Translation (WMT 2016). Our evaluation shows positive but mixed results, with higher performance observed when word-level QE is used as a selector for neural APE applied to the output of a phrase-based MT system. Overall, our findings motivate further investigation on QE technologies. By reducing the gap between the performance of current solutions and “oracle” results, QE could significantly add to competitive APE technologies.
Original languageEnglish
Title of host publicationAMTA 2018 - 13th Conference of the Association for Machine Translation in the Americas, Proceedings
EditorsColin Cherry, Graham Neubig
Pages26-38
Number of pages13
Volume1
Publication statusPublished - 2018
Externally publishedYes
EventThe 13th Conference of The Association for Machine Translation in the Americas 2018 - Boston, United States
Duration: 17 Mar 201821 Mar 2018

Conference

ConferenceThe 13th Conference of The Association for Machine Translation in the Americas 2018
Abbreviated titleAMTA 2018
Country/TerritoryUnited States
CityBoston
Period17/03/1821/03/18

Keywords

  • Quality estimation
  • Automatic post editing

Fingerprint

Dive into the research topics of 'Combining quality estimation and automatic post-editing to enhance machine translation output'. Together they form a unique fingerprint.

Cite this