TY - JOUR
T1 - Case notion discovery and recommendation
T2 - automated event log building on databases
AU - de Murillas, E. González López
AU - Reijers, H.A.
AU - van der Aalst, W.M.P.
PY - 2020/7/1
Y1 - 2020/7/1
N2 - Process mining techniques use event logs as input. When analyzing complex databases, these event logs can be built in many ways. Events need to be grouped into traces corresponding to a case. Different groupings provide different views on the data. Building event logs is usually a time-consuming, manual task. This paper provides a precise view on the case notion on databases, which enables the automatic computation of event logs. Also, it provides a way to assess event log quality, used to rank event logs with respect to their interestingness. The computational cost of building an event log can be avoided by predicting the interestingness of a case notion, before the corresponding event log is computed. This makes it possible to give recommendations to users, so they can focus on the analysis of the most promising process views. Finally, the accuracy of the predictions and the quality of the rankings generated by our unsupervised technique are evaluated in comparison to the existing regression techniques as well as to state-of-the-art learning to rank algorithms from the information retrieval field. The results show that our prediction technique succeeds at discovering interesting event logs and provides valuable recommendations to users about the perspectives on which to focus the efforts during the analysis.
AB - Process mining techniques use event logs as input. When analyzing complex databases, these event logs can be built in many ways. Events need to be grouped into traces corresponding to a case. Different groupings provide different views on the data. Building event logs is usually a time-consuming, manual task. This paper provides a precise view on the case notion on databases, which enables the automatic computation of event logs. Also, it provides a way to assess event log quality, used to rank event logs with respect to their interestingness. The computational cost of building an event log can be avoided by predicting the interestingness of a case notion, before the corresponding event log is computed. This makes it possible to give recommendations to users, so they can focus on the analysis of the most promising process views. Finally, the accuracy of the predictions and the quality of the rankings generated by our unsupervised technique are evaluated in comparison to the existing regression techniques as well as to state-of-the-art learning to rank algorithms from the information retrieval field. The results show that our prediction technique succeeds at discovering interesting event logs and provides valuable recommendations to users about the perspectives on which to focus the efforts during the analysis.
KW - Case notion
KW - Database
KW - Event log
KW - Process mining
KW - Ranking
KW - Recommendation
UR - http://www.scopus.com/inward/record.url?scp=85077181148&partnerID=8YFLogxK
U2 - 10.1007/s10115-019-01430-6
DO - 10.1007/s10115-019-01430-6
M3 - Article
AN - SCOPUS:85077181148
SN - 0219-1377
VL - 62
SP - 2539
EP - 2575
JO - Knowledge and Information Systems
JF - Knowledge and Information Systems
IS - 7
ER -