Multiword units translation evaluation in machine translation: another pain in the neck?
Contributo in Atti di convegno
Data di Pubblicazione:
2016
Abstract:
Recent studies have highlighted that the translation of Multiword Units (MWUs) by Machine Translation (MT) is still an open challenge, whatever the adopted approach (statistical, rule-based or example- based). The difficulties in translating automatically this recurrent, complex and varied lexical phenomenon originate from its lexical, syntactic, semantic, pragmatic and/or statistical but also translational idiomaticity. It is widely acknowledged that in order to achieve significant improvements in Machine Translation and translation technologies it is important to develop resources, which can be used both for Statistical Machine Translation (SMT) training and evaluation purposes. There is therefore, the need to develop linguistic re- sources, mainly parallel corpora annotated with MWUs which can help improve the MT quality in particular as regards translation of MWUs in context and discontinuous MWUs. In this paper, we analyse the state of the art concerning MWU-aware MT evaluation metrics, the availability of both benchmarking resources and annotation guidelines and procedures.
Tipologia CRIS:
4.1 Contributo in Atti di convegno
Keywords:
Machine Translation, multiword unit, MT quality
Elenco autori:
Monti, Johanna; Todirascu, Amalia
Link alla scheda completa:
Link al Full Text:
Titolo del libro:
Workshop proceedings for: Multi-word Units in Machine Translation and Translation Technology (Organised at Europhras 2015)