Using n-grams for the automated clustering of structural models

    Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

    28 Citaten (Scopus)
    1 Downloads (Pure)

    Samenvatting

    Model comparison and clustering are important for dealing with many models in data analysis and exploration, e.g. in domain model recovery or model repository management. Particularly in structural models, information is captured not only in model elements (e.g. in names and types) but also in the structural context, i.e. the relation of one element to the others. Some approaches involve a large number of models ignoring the structural context of model elements; others handle very few (typically two) models applying sophisticated structural techniques. In this paper we address both aspects and extend our previous work on model clustering based on vector space model, with a technique for incorporating structural context in the form of n-grams. We compare the n-gram accuracy on two datasets of Ecore metamodels in AtlanMod Zoo: small random samples using up to trigrams and a larger one (∼100 models) up to bigrams.

    Originele taal-2Engels
    TitelSOFSEM 2017: Theory and Practice of Computer Science - 43rd International Conference on Current Trends in Theory and Practice of Computer Science, Proceedings
    UitgeverijSpringer
    Pagina's510-524
    Aantal pagina's15
    ISBN van geprinte versie9783319519623
    DOI's
    StatusGepubliceerd - 2017
    Evenement43rd Conference on Current Trends in Theory and Practice of Computer Science, (SOFSEM 2017), Januari 16-20, 2017, Limerick, Ireland - Limerick, Ierland
    Duur: 16 jan. 201720 jan. 2017

    Publicatie series

    NaamLecture Notes in Computer Science
    Volume10139
    ISSN van geprinte versie03029743
    ISSN van elektronische versie16113349

    Congres

    Congres43rd Conference on Current Trends in Theory and Practice of Computer Science, (SOFSEM 2017), Januari 16-20, 2017, Limerick, Ireland
    Land/RegioIerland
    StadLimerick
    Periode16/01/1720/01/17

    Vingerafdruk

    Duik in de onderzoeksthema's van 'Using n-grams for the automated clustering of structural models'. Samen vormen ze een unieke vingerafdruk.

    Citeer dit