Alignment-based trace clustering

T. Chatain, J. Carmona, B.F. van Dongen

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    11 Citations (Scopus)

    Abstract

    A novel method to cluster event log traces is presented in this paper. In contrast to the approaches in the literature, the clustering approach of this paper assumes an additional input: a process model that describes the current process. The core idea of the algorithm is to use model traces as centroids of the clusters detected, computed from a generalization of the notion of alignment. This way, model explanations of observed behavior are the driving force to compute the clusters, instead of current model agnostic approaches, e.g., which group log traces merely on their vector-space similarity. We believe alignment-based trace clustering provides results more useful for stakeholders. Moreover, in case of log incompleteness, noisy logs or concept drift, they can be more robust for dealing with highly deviating traces. The technique of this paper can be combined with any clustering technique to provide model explanations to the clusters computed. The proposed technique relies on encoding the individual alignment problems into the (pseudo-)Boolean domain, and has been implemented in our tool DarkSider that uses an open-source solver.

    Original languageEnglish
    Title of host publicationConceptual Modeling - 36th International Conference, ER 2017, Proceedings
    EditorsH.C. Mayr, G. Guizzardi, H. Ma, O. Pastor
    PublisherSpringer
    Pages295-308
    Number of pages14
    ISBN (Print)9783319699035
    DOIs
    Publication statusPublished - 2017
    Event36th International Conference on Conceptual Modeling, (ER2017) - Universitat Politecnica de Valencia, Valencia, Spain
    Duration: 6 Nov 20179 Nov 2017
    Conference number: 36
    http://er2017.pros.webs.upv.es/

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume10650 LNCS
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349
    NameInformation Systems and Applications, series LNISA
    Volume10650

    Conference

    Conference36th International Conference on Conceptual Modeling, (ER2017)
    Abbreviated titleER 2017
    CountrySpain
    CityValencia
    Period6/11/179/11/17
    Internet address

    Fingerprint Dive into the research topics of 'Alignment-based trace clustering'. Together they form a unique fingerprint.

    Cite this