Back to Search Start Over

Integrated detection and localization of concept drifts in process mining with batch and stream trace clustering support.

Authors :
de Sousa, Rafael Gaspar
Meira Neto, Antonio Carlos
Fantinato, Marcelo
Peres, Sarajane Marques
Reijers, Hajo Alexander
Source :
Data & Knowledge Engineering. Jan2024, Vol. 149, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Process mining can help organizations by extracting knowledge from event logs. However, process mining techniques often assume business processes are stationary, while actual business processes are constantly subject to change because of the complexity of organizations and their external environment. Thus, addressing process changes over time – known as concept drifts – allows for a better understanding of process behavior and can provide a competitive edge for organizations, especially in an online data stream scenario. Current approaches to handling process concept drift focus primarily on detecting and locating concept drifts, often through an integrated, albeit offline, approach. However, part of these integrated approaches rely on complex data structures related to tree-based process models, usually discovered through algorithms whose results are influenced by specific heuristic rules. Moreover, most of the proposed approaches have not been tested on public true concept drift-labeled event logs commonly used as benchmark, making comparative analysis difficult. In this article, we propose an online approach to detect and localize concept drifts in an integrated way using batch and stream trace clustering support. In our approach, cluster models provide input information for both concept drift detection and localization methods. Each cluster abstracts a behavior profile underlying the process and reveals descriptive information about the discovered concept drifts. Experiments with benchmark synthetic event logs with different control-flow changes, as well as with real-world event logs, showed that our approach, when relying on the same clustering model, is competitive in relation to baselines concept drift detection method. In addition, the experiment showed our approach is able to correctly locate the concept drifts detected and allows the analysis of such concept drifts through different process behavior profiles. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0169023X
Volume :
149
Database :
Academic Search Index
Journal :
Data & Knowledge Engineering
Publication Type :
Academic Journal
Accession number :
174466169
Full Text :
https://doi.org/10.1016/j.datak.2023.102253