Abstract
We investigate different strategies for combining quality estimation (QE) and automatic postediting (APE) to improve the output of machine translation (MT) systems. The joint contribution of the two technologies is analyzed in different settings, in which QE serves as either: i) an activator of APE corrections, or ii) a guidance to APE corrections, or iii) a selector of the final output to be returned to the user. In the first case (QE as activator), sentence-level predictions on the raw MT output quality are used to trigger its automatic correction when the estimated (TER) scores are below a certain threshold. In the second case (QE as guidance), word-level binary quality predictions (“good”/“bad”) are used to inform APE about problematic words in the MT output that should be corrected. In the last case (QE as selector), both sentence- and word-level quality predictions are used to identify the most accurate translation between the original MT output and its post-edited version. For the sake of comparison, the underlying APE technologies explored in our evaluation are both phrase-based and neural. Experiments are carried out on the English-German data used for the QE/APE shared tasks organized within the First Conference on Machine Translation (WMT 2016). Our evaluation shows positive but mixed results, with higher performance observed when word-level QE is used as a selector for neural APE applied to the output of a phrase-based MT system. Overall, our findings motivate further investigation on QE technologies. By reducing the gap between the performance of current solutions and “oracle” results, QE could significantly add to competitive APE technologies.
Original language | English |
---|---|
Title of host publication | AMTA 2018 - 13th Conference of the Association for Machine Translation in the Americas, Proceedings |
Editors | Colin Cherry, Graham Neubig |
Pages | 26-38 |
Number of pages | 13 |
Volume | 1 |
Publication status | Published - 2018 |
Externally published | Yes |
Event | The 13th Conference of The Association for Machine Translation in the Americas 2018 - Boston, United States Duration: 17 Mar 2018 → 21 Mar 2018 |
Conference
Conference | The 13th Conference of The Association for Machine Translation in the Americas 2018 |
---|---|
Abbreviated title | AMTA 2018 |
Country/Territory | United States |
City | Boston |
Period | 17/03/18 → 21/03/18 |
Keywords
- Quality estimation
- Automatic post editing