Abstract
Event correlation is a cornerstone for process discovery over event logs crossing multiple data sources. The computed correlation rules and process instances will greatly help us to unleash the power of process mining. However, exploring all possible event correlations over a log could be time consuming, especially when the log is large. State-of-The-Art methods based on MapReduce designed to handle this challenge have offered significant performance improvements over standalone implementations. However, all existing techniques are still based on a conventional generating-And-pruning scheme. Therefore, event partitioning across multiple machines is often inefficient. In this paper, following the principle of filtering-And-verification, we propose a new algorithm, called RF-GraP, which provides a more efficient correlation over distributed systems. We present the detailed implementation of our approach and conduct a quantitative evaluation using the Spark platform. Experimental results demonstrate that the proposed method is indeed efficient. Compared to the state-of-The-Art, we are able to achieve significant performance speedups with obviously less network communication.
| Original language | English |
|---|---|
| Title of host publication | 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017 |
| Place of Publication | Piscataway |
| Publisher | Institute of Electrical and Electronics Engineers |
| Number of pages | 10 |
| ISBN (Electronic) | 978-1-5090-6610-0 |
| DOIs | |
| Publication status | Published - 10 Jul 2017 |
| Event | 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017 - Madrid, Spain Duration: 14 May 2017 → 17 May 2017 |
Conference
| Conference | 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017 |
|---|---|
| Country/Territory | Spain |
| City | Madrid |
| Period | 14/05/17 → 17/05/17 |
Keywords
- Big Data
- Data Partitioning
- Data-Intensive Computing
- Event Correlation
- Process Mining
- Service Computing
Fingerprint
Dive into the research topics of 'Efficient event correlation over distributed systems'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver