Data-driven Update of AGROVOC Using Agricultural Text Corpora

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

Abstract

AGROVOC is a well-known multilingual controlled vocabulary covering the fields of agriculture, forestry, fisheries, and food. It is used for dataset annotation, indexing of literature, and automated text tagging, and its effective use depends on its continuous update. Currently, updates are done manually by a dispersed community of editors. In this paper, we present work towards automated update recommendations using large corpora of agricultural text (such as the AGRIS database). The work is based on the extraction of agricultural concept mentions from text through the deployment of custom trained Named Entity Recognition models and the exploitation of Graph Neural Networks to recommend concept and relation additions towards predicting future AGROVOC states. The research questions and methodology are presented together with the results of an initial experiment. The next steps and future research directions are outlined. This work forms part of a PhD research on monitoring and predicting changes in knowledge graphs utilising textual data.

Original languageEnglish
Title of host publicationProceedings of HAICTA 2022
Pages260-265
Number of pages6
Volume3293
Publication statusPublished - 2022
Event10th International Conference on ICT in Agriculture, Food & Environment - Athens, Greece
Duration: 22 Sept 202225 Sept 2022
Conference number: 10
https://2022.haicta.gr/

Conference

Conference10th International Conference on ICT in Agriculture, Food & Environment
Abbreviated titleHAICTA 2022
Country/TerritoryGreece
CityAthens
Period22/09/2225/09/22
Internet address

Keywords

  • AGROVOC
  • Graph Neural Networks
  • Named Entity Recognition
  • knowledge graph
  • update

Cite this