Efficient event correlation over distributed systems

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    12 Citations (Scopus)
    1 Downloads (Pure)

    Abstract

    Event correlation is a cornerstone for process discovery over event logs crossing multiple data sources. The computed correlation rules and process instances will greatly help us to unleash the power of process mining. However, exploring all possible event correlations over a log could be time consuming, especially when the log is large. State-of-The-Art methods based on MapReduce designed to handle this challenge have offered significant performance improvements over standalone implementations. However, all existing techniques are still based on a conventional generating-And-pruning scheme. Therefore, event partitioning across multiple machines is often inefficient. In this paper, following the principle of filtering-And-verification, we propose a new algorithm, called RF-GraP, which provides a more efficient correlation over distributed systems. We present the detailed implementation of our approach and conduct a quantitative evaluation using the Spark platform. Experimental results demonstrate that the proposed method is indeed efficient. Compared to the state-of-The-Art, we are able to achieve significant performance speedups with obviously less network communication.

    Original languageEnglish
    Title of host publication2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017
    Place of PublicationPiscataway
    PublisherInstitute of Electrical and Electronics Engineers
    Number of pages10
    ISBN (Electronic)978-1-5090-6610-0
    DOIs
    Publication statusPublished - 10 Jul 2017
    Event17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017 - Madrid, Spain
    Duration: 14 May 201717 May 2017

    Conference

    Conference17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017
    Country/TerritorySpain
    CityMadrid
    Period14/05/1717/05/17

    Keywords

    • Big Data
    • Data Partitioning
    • Data-Intensive Computing
    • Event Correlation
    • Process Mining
    • Service Computing

    Fingerprint

    Dive into the research topics of 'Efficient event correlation over distributed systems'. Together they form a unique fingerprint.

    Cite this