Back to Search Start Over

Comparing Trace Similarity Metrics Across Logs and Evaluation Measures

Authors :
Indulska, Marta
Reinhartz-Berger, Iris
Cetina, Carlos
Pastor, Oscar
Back, Christoffer Olling
Simonsen, Jakob Grue
Indulska, Marta
Reinhartz-Berger, Iris
Cetina, Carlos
Pastor, Oscar
Back, Christoffer Olling
Simonsen, Jakob Grue
Source :
Back , C O & Simonsen , J G 2023 , Comparing Trace Similarity Metrics Across Logs and Evaluation Measures . in M Indulska , I Reinhartz-Berger , C Cetina & O Pastor (eds) , Advanced Information Systems Engineering : 35th International Conference, CAiSE 2023, Zaragoza, Spain, June 12–16, 2023, Proceedings . Springer , Lecture Notes in Computer Science , vol. 13901) , pp. 226-242 , 35th International Conference on Advanced Information Systems Engineering, CAiSE 2023 , Zaragoza , Spain , 12/06/2023 .
Publication Year :
2023

Abstract

Trace similarity is a prerequisite for several process mining tasks, e.g. identifying process variants and anomalies. Many similarity metrics have been presented in the literature, but the similarity metric itself is seldom subject to controlled evaluation. Instead, they are usually demonstrated in conjunction with downstream tasks, e.g. process model discovery, and often evaluated qualitatively or with limited comparison. In this paper, we isolate similarity metrics from downstream tasks and compare them wrt. evaluation measures adapted from metric learning and clustering literature. We present a comparison of 18 similarity metrics across 4 evaluation measures and 12 event logs. Friedman and Nemenyi tests for statistical significance show that certain similarity metrics consistently outperform on some evaluation measures, but their mean rank varies across evaluation measures. One similarity metric based on a weighted eventually-follows relation does stand out as consistently outperforming, and the simplest n-gram similarity metrics also perform well. Our results demonstrate that choice of evaluation measures will determine the contours of the metric that are revealed. This study may be harnessed as a baseline for benchmarking future work on trace similarity, and describes tools for quantitative evaluation that we hope will inspire empirical rigor in future work.

Details

Database :
OAIster
Journal :
Back , C O & Simonsen , J G 2023 , Comparing Trace Similarity Metrics Across Logs and Evaluation Measures . in M Indulska , I Reinhartz-Berger , C Cetina & O Pastor (eds) , Advanced Information Systems Engineering : 35th International Conference, CAiSE 2023, Zaragoza, Spain, June 12–16, 2023, Proceedings . Springer , Lecture Notes in Computer Science , vol. 13901) , pp. 226-242 , 35th International Conference on Advanced Information Systems Engineering, CAiSE 2023 , Zaragoza , Spain , 12/06/2023 .
Notes :
English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1439557191
Document Type :
Electronic Resource