Efficient event correlation over distributed systems

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

9 Citations (Scopus)
1 Downloads (Pure)

Abstract

Event correlation is a cornerstone for process discovery over event logs crossing multiple data sources. The computed correlation rules and process instances will greatly help us to unleash the power of process mining. However, exploring all possible event correlations over a log could be time consuming, especially when the log is large. State-of-The-Art methods based on MapReduce designed to handle this challenge have offered significant performance improvements over standalone implementations. However, all existing techniques are still based on a conventional generating-And-pruning scheme. Therefore, event partitioning across multiple machines is often inefficient. In this paper, following the principle of filtering-And-verification, we propose a new algorithm, called RF-GraP, which provides a more efficient correlation over distributed systems. We present the detailed implementation of our approach and conduct a quantitative evaluation using the Spark platform. Experimental results demonstrate that the proposed method is indeed efficient. Compared to the state-of-The-Art, we are able to achieve significant performance speedups with obviously less network communication.

Original languageEnglish
Title of host publication2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017
Place of PublicationPiscataway
PublisherInstitute of Electrical and Electronics Engineers
Number of pages10
ISBN (Electronic)978-1-5090-6610-0
DOIs
Publication statusPublished - 10 Jul 2017
Event17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017 - Madrid, Spain
Duration: 14 May 201717 May 2017

Conference

Conference17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017
CountrySpain
CityMadrid
Period14/05/1717/05/17

Keywords

  • Big Data
  • Data Partitioning
  • Data-Intensive Computing
  • Event Correlation
  • Process Mining
  • Service Computing

Fingerprint Dive into the research topics of 'Efficient event correlation over distributed systems'. Together they form a unique fingerprint.

  • Cite this

    Cheng, L., van Dongen, B. F., & van der Aalst, W. M. P. (2017). Efficient event correlation over distributed systems. In 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017 [7973683] Institute of Electrical and Electronics Engineers. https://doi.org/10.1109/CCGRID.2017.94