Skip to Main Content (Press Enter)

Logo UNIOR
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Persone
  • Strutture

UNIFIND
Logo UNIOR

|

UNIFIND

unior.it
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Persone
  • Strutture

Large Language Models as Legal Translators of Arabic Legislation: Do ChatGPT and Gemini Care for Context and Terminology?

Contributo in Atti di convegno
Data di Pubblicazione:
2024
Abstract:
Accurate translation of terminology and adaptation to in-context information is a pillar to high quality translation. Recently, there is a remarkable interest towards the use and the evaluation of Large Language Models (LLMs) particularly for Machine Translation tasks. Nevertheless, despite their recent advancement and ability to understand and generate human-like language, these LLMs are still far from perfect, especially in domain-specific scenarios, and need to be thoroughly investigated. This is particularly evident in automatically translating legal terminology from Arabic into English and French, where, beyond the inherent complexities of legal language and specialised translations, technical limitations of LLMs further hinder accurate generation of text. In this paper, we present a preliminary evaluation of two evolving LLMs, namely GPT-4 Generative Pre-trained Transformer and Gemini, as legal translators of Arabic legislatives to test their accuracy and the extent to which they care for context and terminology across two language pairs (AR→EN / AR→FR). The study targets the evaluation of Zero-Shot prompting for in-context and out-of-context scenarios of both models relying on a gold standard dataset, verified by professional translators who are also experts in the field. We evaluate the results applying the Multidimensional Quality Metrics to classify translation errors. Moreover, we also evaluate the general LLMs outputs to verify their correctness, consistency, and completeness.In general, our results show that the models are far from perfect and recall for more fine-tuning efforts using specialised terminological data in the legal domain from Arabic into English and French.
Tipologia CRIS:
4.1 Contributo in Atti di convegno
Keywords:
Large Language Models, Arabic Machine Translation, Legal Terminology Resources, Arabic Legislation.
Elenco autori:
Elfqih, Ait; Khadija, ; Monti, Johanna
Autori di Ateneo:
MONTI JOHANNA
Link alla scheda completa:
https://unora.unior.it/handle/11574/232781
Link al Full Text:
https://unora.unior.it//retrieve/handle/11574/232781/178327/2024.arabicnlp-1.10.pdf
Titolo del libro:
Proceedings of The Second Arabic Natural Language Processing Conference
  • Dati Generali

Dati Generali

URL

https://aclanthology.org/2024.arabicnlp-1.10
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.1.0