Identifying hidden contexts in classification

I. Zliobaite

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Academic; peer-reviewed

    5 Citations (Scopus)

    Abstract

    In this study we investigate how to identify hidden contexts from the data in classification tasks. Contexts are artifacts in the data that do not predict the class label directly. For instance, in a speech recognition task, speakers might have different accents, which do not directly discriminate between the spoken words. Identifying hidden contexts is treated as a data preprocessing task, which can help to build more accurate classifiers tailored to particular contexts and give insight into the data structure. We present three techniques for identifying hidden contexts, each of which hides class label information from the input data and partitions the data using clustering. We form a collection of performance measures to ensure that the resulting contexts are valid. We evaluate the performance of the proposed techniques on thirty real datasets. We present a case study illustrating how the identified contexts can be used to build specialized, more accurate classifiers.
    Original language: English
    Title of host publication: Advances in Knowledge Discovery and Data Mining (15th Pacific-Asia Conference, PAKDD 2011, Shenzhen, China, May 24-27, 2011, Proceedings, Part I)
    Editors: J.Z. Huang, L. Cao, J. Srivastava
    Place of publication: Berlin
    Publisher: Springer
    Pages: 277-288
    Print ISBN: 978-3-642-20840-9
    Status: Published - 2011

    Publication series

    Name: Lecture Notes in Computer Science
    Volume: 6634
    Print ISSN: 0302-9743
