Extracting Event Data from Document-Driven Enterprise Systems

Diego Calvanese, Mieke Jans*, Tahir Emre Kalayci, Marco Montali

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

Abstract

The preparation of input event data is one of the most critical phases in process mining projects. Different frameworks have been developed to offer methodologies and/or supporting toolkits for data preparation. One of these frameworks, called OnProm, relies on sophisticated semantic technologies to extract event logs from relational databases. The toolkit consists of a series of general steps, meant to work on arbitrary, legacy databases. However, in many settings, the input database is not a legacy one but is structured with conceptually understandable object types and relationships that can be effectively employed to support business users in the extraction process. This is, for example, the case for document-driven enterprise systems. In this paper, we focus on this class of systems and propose a guided approach, erprep, to support a group of business and technical users in setting up OnProm with minimal effort. We demonstrate the approach in a real-life use case.
Original languageEnglish
Title of host publicationAdvanced Information Systems Engineering - 35th International Conference, CAiSE 2023, Zaragoza, Spain, June 12-16, 2023 Proceedings
EditorsMarta Indulska, Iris Reinhartz-Berger, Carlos Cetina, Oscar Pastor
PublisherSpringer, Cham
Pages193-209
Number of pages17
ISBN (Electronic)978-3-031-34560-9
ISBN (Print)9783031345593
DOIs
Publication statusPublished - 8 Jun 2023
Event35th International Conference on Advanced Information Systems Engineering - Zaragoza, Spain
Duration: 12 Jun 202316 Jun 2023
Conference number: 35
https://caise23.svit.usj.es/

Publication series

SeriesLecture Notes in Computer Science
Volume13901
ISSN0302-9743

Conference

Conference35th International Conference on Advanced Information Systems Engineering
Abbreviated titleCAiSE 2023
Country/TerritorySpain
CityZaragoza
Period12/06/2316/06/23
Internet address

Keywords

  • Data preparation
  • ERP systems
  • event log extraction
  • Ontology-based event modeling

Cite this