GENder-IT: An Annotated English-Italian Parallel Challenge Set for Cross-Linguistic Natural Gender Phenomena
Contributo in Atti di convegno
Data di Pubblicazione:
2021
Abstract:
Languages differ in terms of the absence or presence of gender features, the number of gender classes and whether and where gender features are explicitly marked. These cross-linguistic differences can lead to ambiguities that are difficult to resolve, especially for sentence-level MT systems. The identification of ambiguity and its subsequent resolution is a challenging task for which currently there aren't any specific resources or challenge sets available. In this paper, we introduce gENder-IT, an English--Italian challenge set focusing on the resolution of natural gender phenomena by providing word-level gender tags on the English source side and multiple gender alternative translations, where needed, on the Italian target side.
Tipologia CRIS:
4.1 Contributo in Atti di convegno
Keywords:
gENder-IT corpus, Machine translation, ambiguity, gender bias
Elenco autori:
Vanmassenhove, Eva; Monti, Johanna
Link alla scheda completa:
Titolo del libro:
Proceedings of the 3rd Workshop on Gender Bias in Natural Language Processing