DB-XES: enabling process discovery in the large

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review


    Abstract

    Dealing with the abundance of event data is one of the main process discovery challenges. Current process discovery techniques can efficiently handle imported event log files that fit in the computer's memory. Once data files get bigger, scalability quickly drops, since the speed of data access becomes a limiting factor. This paper proposes a new technique based on relational database technology as a solution for scalable process discovery. A relational database is used both for storing event data (i.e., we move the location of the data) and for pre-processing the event data (i.e., we move some computations from analysis time to insertion time). To this end, we first introduce DB-XES, a database schema that resembles the standard XES structure; we provide a transparent way to access event data stored in DB-XES; and we show how this greatly reduces the memory requirements of state-of-the-art process discovery techniques. Second, we show how to move the computation of intermediate data structures to the database engine, reducing the time required during process discovery. The work presented in this paper is implemented in the ProM tool, and a range of experiments demonstrates the feasibility of our approach.
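The idea of moving computation from analysis time to insertion time can be illustrated with a minimal sketch. It is not the DB-XES schema or the ProM implementation: the `event` and `dfr` tables, the trigger, and the helper functions are all hypothetical, and the choice of the directly-follows relation as the intermediate structure is an assumption (the abstract does not name the structure). The sketch uses SQLite through Python's standard `sqlite3` module and assumes that the events of a trace are inserted in order.

```python
import sqlite3

# Hypothetical, simplified schema (NOT the actual DB-XES schema).
SCHEMA = """
CREATE TABLE event (
    trace_id TEXT    NOT NULL,
    position INTEGER NOT NULL,   -- index of the event within its trace
    activity TEXT    NOT NULL,
    PRIMARY KEY (trace_id, position)
);
CREATE TABLE dfr (               -- precomputed directly-follows counts
    source TEXT    NOT NULL,
    target TEXT    NOT NULL,
    freq   INTEGER NOT NULL,
    PRIMARY KEY (source, target)
);
-- Keep dfr up to date whenever an event is inserted (insertion-time work).
CREATE TRIGGER update_dfr AFTER INSERT ON event
WHEN NEW.position > 0
BEGIN
    INSERT OR IGNORE INTO dfr (source, target, freq)
    SELECT activity, NEW.activity, 0
    FROM event
    WHERE trace_id = NEW.trace_id AND position = NEW.position - 1;

    UPDATE dfr SET freq = freq + 1
    WHERE target = NEW.activity
      AND source = (SELECT activity FROM event
                    WHERE trace_id = NEW.trace_id
                      AND position = NEW.position - 1);
END;
"""


def build_store():
    """Create an in-memory database with the hypothetical schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def insert_trace(conn, trace_id, activities):
    """Insert the events of one trace in order; the trigger updates dfr."""
    conn.executemany(
        "INSERT INTO event (trace_id, position, activity) VALUES (?, ?, ?)",
        [(trace_id, i, act) for i, act in enumerate(activities)],
    )


def directly_follows(conn):
    """Read the precomputed relation; no scan over the event table is needed."""
    rows = conn.execute("SELECT source, target, freq FROM dfr")
    return {(source, target): freq for source, target, freq in rows}


if __name__ == "__main__":
    conn = build_store()
    insert_trace(conn, "t1", ["a", "b", "c"])
    insert_trace(conn, "t2", ["a", "b", "d"])
    print(directly_follows(conn))
```

In this sketch a discovery algorithm would only query the small `dfr` table at analysis time, so the event data itself never has to be loaded into memory.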

    Original language: English
    Title of host publication: Data-Driven Process Discovery and Analysis
    Subtitle of host publication: 6th IFIP WG 2.6 International Symposium, SIMPDA 2016, Graz, Austria, December 15-16, 2016, Revised Selected Papers
    Editors: P. Ceravolo, C. Guetl, S. Rinderle-Ma
    Place of Publication: Dordrecht
    Publisher: Springer
    Pages: 53-77
    Number of pages: 25
    ISBN (Electronic): 978-3-319-74161-1
    ISBN (Print): 978-3-319-74160-4
    Publication status: Published - 2018
    Event: 6th International Symposium on Data-Driven Process Discovery and Analysis (SIMPDA 2016) - Graz, Austria
    Duration: 15 Dec 2016 to 16 Dec 2016
    Conference number: 6

    Publication series

    Name: Lecture Notes in Business Information Processing
    Volume: 307
    ISSN (Print): 1865-1348

    Conference

    Conference: 6th International Symposium on Data-Driven Process Discovery and Analysis (SIMPDA 2016)
    Abbreviated title: SIMPDA 2016
    Country/Territory: Austria
    City: Graz
    Period: 15/12/16 to 16/12/16

    Keywords

    • Big event data
    • Process discovery
    • Process mining
    • Relational database
