TY - JOUR
T1 - Getting the Data in Shape for Your Process Mining Analysis
T2 - An In-Depth Analysis of the Pre-Analysis Stage
AU - Pradhan, Shameer K.
AU - Jans, Mieke
AU - Martin, Niels
N1 - Funding Information:
This study was supported by the Special Research Fund (BOF) of Hasselt University under Grant Nos. BOF21OWB22 and BOF24TT02.
data source:
PY - 2025/2/10
Y1 - 2025/2/10
N2 - Process mining enables organizations to analyze the data stored in their information systems and derive insights regarding their business processes. However, raw data needs to be converted into a format that can be fed into process mining algorithms. Various pre-analysis activities can be performed on the raw data, such as imperfection removal or granularity level change. Although pre-analysis activities play a crucial role in process mining, there is currently a limited overview available regarding their scope and the extent of their examination. This study presents a systematic literature review of the pre-analysis activities in process mining projects. To better understand this stage and its current state of research, we explore which activities constitute the pre-analysis stage, their goals, the applied research methodologies, the proposed research outcomes, and the data used to evaluate the research outcomes. We identify 15 pre-analysis activities and concepts, e.g., data extraction, generation, and cleaning. We also discover that design science research is the methodology and methods are the primary research outcome in previous studies. We also realize that the proposed outcomes have been evaluated using only real-life data most of the time. This study reveals that research on pre-analysis is a growing field of interest in process mining.
AB - Process mining enables organizations to analyze the data stored in their information systems and derive insights regarding their business processes. However, raw data needs to be converted into a format that can be fed into process mining algorithms. Various pre-analysis activities can be performed on the raw data, such as imperfection removal or granularity level change. Although pre-analysis activities play a crucial role in process mining, there is currently a limited overview available regarding their scope and the extent of their examination. This study presents a systematic literature review of the pre-analysis activities in process mining projects. To better understand this stage and its current state of research, we explore which activities constitute the pre-analysis stage, their goals, the applied research methodologies, the proposed research outcomes, and the data used to evaluate the research outcomes. We identify 15 pre-analysis activities and concepts, e.g., data extraction, generation, and cleaning. We also discover that design science research is the methodology and methods are the primary research outcome in previous studies. We also realize that the proposed outcomes have been evaluated using only real-life data most of the time. This study reveals that research on pre-analysis is a growing field of interest in process mining.
KW - data preprocessing
KW - event log
KW - Process mining
KW - process mining pre-analysis
U2 - 10.1145/3712587
DO - 10.1145/3712587
M3 - Article
SN - 0360-0300
VL - 57
SP - 1
EP - 37
JO - ACM Computing Surveys
JF - ACM Computing Surveys
IS - 6
M1 - 159
ER -