Abstract
Process discovery, one of the key challenges in process mining, aims at discovering process models from process execution data stored in event logs. Most discovery algorithms assume that all data in an event log conform to correct execution of the process, and hence, incorporate all behaviour in their resulting process model. However, in real event logs, noise and irrelevant infrequent behaviour are often present. Incorporating such behaviour results in complex, incomprehensible process models concealing the correct and/or relevant behaviour of the underlying process. In this paper, we propose a novel general-purpose filtering method that exploits observed conditional probabilities between sequences of activities. The method has been implemented in both the ProM toolkit and the RapidProM framework. We evaluate our approach using real and synthetic event data. The results show that the proposed method accurately removes irrelevant behaviour and, indeed, improves process discovery results.
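To give a rough feel for filtering on conditional probabilities, the following is a minimal sketch and not the implementation from the paper or from ProM/RapidProM. It assumes a simplified setting in which only the single preceding activity is conditioned on, whereas the abstract refers to sequences of activities. The function name `filter_traces`, the `threshold` parameter, and the list-of-lists log representation are hypothetical choices made for illustration.

```python
from collections import Counter, defaultdict


def filter_traces(log, threshold=0.1):
    """Keep traces in which every step has P(next | previous) >= threshold.

    Assumption: `log` is a list of traces, each trace a list of activity
    labels. Probabilities are estimated from directly-follows counts in
    the log itself; this is a simplification for illustration only.
    """
    # Count how often each activity is directly followed by another.
    follows = defaultdict(Counter)
    for trace in log:
        for prev, nxt in zip(trace, trace[1:]):
            follows[prev][nxt] += 1

    def prob(prev, nxt):
        # Observed conditional probability of `nxt` given `prev`.
        total = sum(follows[prev].values())
        return follows[prev][nxt] / total if total else 0.0

    # Drop any trace that contains at least one improbable step.
    return [
        trace
        for trace in log
        if all(prob(p, n) >= threshold for p, n in zip(trace, trace[1:]))
    ]


# Hypothetical toy log: nine regular traces and one infrequent variant.
log = [["a", "b", "c"]] * 9 + [["a", "c", "b"]]
filtered = filter_traces(log, threshold=0.2)
```

In this toy log, `c` follows `a` in only one of ten cases, so with a threshold of 0.2 the variant trace is dropped and the nine regular traces remain. Whether to drop whole traces or only the offending events is a design choice this sketch does not settle.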
Original language | English |
---|---|
Title of host publication | Business Process Management Workshops - BPM 2017 International Workshops, Revised Papers |
Place of Publication | Berlin |
Publisher | Springer |
Pages | 216-229 |
Number of pages | 14 |
ISBN (Print) | 9783319740294 |
Publication status | Published - 2018 |
Event | 15th International Conference on Business Process Management (BPM 2017) - Barcelona, Spain; Duration: 10 Sept 2017 → 15 Sept 2017; Conference number: 15; https://bpm2017.cs.upc.edu/ |
Publication series
Name | Lecture Notes in Business Information Processing |
---|---|
Volume | 308 |
ISSN (Print) | 1865-1348 |
Conference
Conference | 15th International Conference on Business Process Management (BPM 2017) |
---|---|
Abbreviated title | BPM 2017 |
Country/Territory | Spain |
City | Barcelona |
Period | 10/09/17 → 15/09/17 |
Keywords
- Noise filtering
- Outlier detection
- Process discovery
- Process mining