Doorgaan naar hoofdnavigatie Doorgaan naar zoeken Ga verder naar hoofdinhoud

Efficient pattern mining of uncertain data with sampling

  • T. Calders
  • , C. Garboni
  • , B. Goethals

    Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

    Samenvatting

    Mining frequent itemsets from transactional datasets is a well known problem with good algorithmic solutions. In the case of uncertain data, however, several new techniques have been proposed. Unfortunately, these proposals often suffer when a lot of items occur with many different probabilities. Here we propose an approach based on sampling by instantiating "possible worlds" of the uncertain data, on which we subsequently run optimized frequent itemset mining algorithms. As such we gain efficiency at a surprisingly low loss in accuracy. These is confirmed by a statistical and an empirical evaluation on real and synthetic data.
    Originele taal-2Engels
    TitelAdvances in Knowledge Discovery and Data Mining (14th Pacific-Asia Conference, PAKDD 2010, Hyderabad, India, June 21-24, 2010. Proceedings, Part I)
    RedacteurenM.J. Zaki, J.X. Yu, B. Ravindran, V. Pudi
    Plaats van productieBerlin
    UitgeverijSpringer
    Pagina's480-487
    ISBN van geprinte versie978-3-642-13656-6
    DOI's
    StatusGepubliceerd - 2010

    Publicatie series

    NaamLecture Notes in Computer Science
    Volume6118
    ISSN van geprinte versie0302-9743

    Vingerafdruk

    Duik in de onderzoeksthema's van 'Efficient pattern mining of uncertain data with sampling'. Samen vormen ze een unieke vingerafdruk.

    Citeer dit