Prefix Imputation of Orphan Events in Event Stream Processing

Rashid Zaman (Corresponding author), Marwan Hassani (Corresponding author), Boudewijn F. van Dongen

Research output: Contribution to journalArticleAcademicpeer-review

4 Citations (Scopus)
45 Downloads (Pure)

Abstract

In the context of process mining, event logs consist of process instances called cases. Conformance checking is a process mining task that inspects whether a log file is conformant with an existing process model. This inspection is additionally quantifying the conformance in an explainable manner. Online conformance checking processes streaming event logs by having precise insights into the running cases and timely mitigating non-conformance, if any. State-of-the-art online conformance checking approaches bound the memory by either delimiting storage of the events per case or limiting the number of cases to a specific window width. The former technique still requires unbounded memory as the number of cases to store is unlimited, while the latter technique forgets running, not yet concluded, cases to conform to the limited window width. Consequently, the processing system may later encounter events that represent some intermediate activity as per the process model and for which the relevant case has been forgotten, to be referred to as orphan events. The naïve approach to cope with an orphan event is to either neglect its relevant case for conformance checking or treat it as an altogether new case. However, this might result in misleading process insights, for instance, overestimated non-conformance. In order to bound memory yet effectively incorporate the orphan events into processing, we propose an imputation of missing-prefix approach for such orphan events. Our approach utilizes the existing process model for imputing the missing prefix. Furthermore, we leverage the case storage management to increase the accuracy of the prefix prediction. We propose a systematic forgetting mechanism that distinguishes and forgets the cases that can be reliably regenerated as prefix upon receipt of their future orphan event. We evaluate the efficacy of our proposed approach through multiple experiments with synthetic and three real event logs while simulating a streaming setting. Our approach achieves considerably higher realistic conformance statistics than the state of the art while requiring the same storage.

Original languageEnglish
Article number705243
Number of pages21
JournalFrontiers in Big Data
Volume4
DOIs
Publication statusPublished - 6 Oct 2021

Bibliographical note

Funding Information:
The authors have received funding within the BPR4GDPR6 project from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 787149.

Funding

The authors have received funding within the BPR4GDPR6 project from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 787149.

FundersFunder number
European Union's Horizon 2020 - Research and Innovation Framework Programme
European Union's Horizon 2020 - Research and Innovation Framework Programme787149

    Keywords

    • event stream processing
    • online conformance checking
    • online process mining
    • prefix imputation
    • prefix-alignments

    Fingerprint

    Dive into the research topics of 'Prefix Imputation of Orphan Events in Event Stream Processing'. Together they form a unique fingerprint.

    Cite this