Language Resources for Italian: towards the Development of a Corpus of Annotated Italian Multiword Expressions
Conference Paper
Publication Date:
2016
abstract:
This paper describes the first resource annotated for multiword expressions (MWEs) in Italian. Two versions of this dataset have been prepared: the first with a fast markup list of out-of-context MWEs, and the second with an in-context annotation, where the MWEs are entered with their contexts. The paper also discusses annotation issues and reports the inter-annotator agreement for both types of annotations. Finally, the results of the first exploitation of the new resource, namely the automatic extraction of Italian MWEs, are presented.
Iris type:
4.1 Contributo in Atti di convegno
Keywords:
multiword expressions, Italian,
List of contributors:
Taslimipoor, Shiva; de Santis, Anna; Cherchi, manuela; Mitkov, ruslan; Monti, Johanna
Book title:
Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016)