Awarded by Europeana
- 2021
- This proposal, from a team based at the University of Naples 'L'Orientale', aims to create a dataset for Named Entity Recognition (NER) and Term Extraction covering archeological terms in Italian and English in the Europeana Archeology collection. NER is the process of identifying named entities, such as people or locations, in unstructured text. Term Extraction is similar, but focuses on finding specialised terms, in this case from the archeology domain. Vocabularies like Getty and CIDOC CRM will be considered. The final dataset could be used to develop and evaluate AI/ML-based technologies for NER in the archeology domain.
Reviewers particularly appreciated the clear structure and maturity of the proposal: a mock dataset had already been built using Europeana’s APIs to test the proposed approach. They also saw the bilingual aspect and the scarcity of similar open resources for the archeology field as particularly valuable.
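
To make the distinction between NER and Term Extraction concrete, the following is a minimal sketch of how a sentence might be labelled in such a dataset. It is not the project's method: the tiny `GAZETTEER`, the `TERM` and `LOC` labels, the BIO tagging scheme and the helper names `tokenize` and `bio_tag` are all assumptions made for illustration. A real dataset would draw its terms from controlled vocabularies and rely on human annotation.

```python
import re

# Hypothetical mini-gazetteer (an assumption for illustration only).
# Keys are lower-cased token sequences; values are made-up labels:
# LOC for a named entity (a place), TERM for a domain-specific term.
GAZETTEER = {
    ("amphora",): "TERM",
    ("red-figure", "krater"): "TERM",
    ("pompeii",): "LOC",
}


def tokenize(text):
    """Split text into words (keeping internal hyphens) and punctuation."""
    return re.findall(r"\w+(?:-\w+)*|[^\w\s]", text)


def bio_tag(tokens):
    """Label tokens with BIO tags by longest-match lookup in GAZETTEER."""
    tags = ["O"] * len(tokens)
    lowered = [t.lower() for t in tokens]
    i = 0
    while i < len(tokens):
        best = None
        # Prefer the longest gazetteer entry that starts at position i.
        for entry, label in GAZETTEER.items():
            n = len(entry)
            if tuple(lowered[i:i + n]) == entry:
                if best is None or n > len(best[0]):
                    best = (entry, label)
        if best is None:
            i += 1
            continue
        entry, label = best
        tags[i] = "B-" + label  # first token of the span
        for j in range(i + 1, i + len(entry)):
            tags[j] = "I-" + label  # continuation tokens
        i += len(entry)
    return list(zip(tokens, tags))


if __name__ == "__main__":
    for token, tag in bio_tag(tokenize("A red-figure krater found in Pompeii.")):
        print(f"{token}\t{tag}")
```

In this sketch, "red-figure krater" is labelled as a two-token domain term and "Pompeii" as a location, while all other tokens receive the `O` (outside) tag. Dictionary lookup is only a baseline; the point of an annotated dataset is to train and evaluate models that go beyond it.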