TY - GEN
T1 - A Clustering-Inspired Quality Measure for Exceptional Preferences Mining - Design Choices and Consequences.
AU - Verhaegh, Ruben Franciscus Adrianus
AU - Kiezebrink, Jacco Johannes Egbert
AU - Nusteling, Frank
AU - Rio, Arnaud Wander André
AU - Bendicsek, Márton Bendegúz
AU - Duivesteijn, Wouter
AU - Schouten, Rianne Margaretha
PY - 2022
Y1 - 2022
N2 - Exceptional Preferences Mining (EPM) combines the research fields of Preference Learning and Exceptional Model Mining. It is a local pattern mining task, where we try to find coherent subgroups of the dataset featuring unusual preferences between a fixed set of labels. We introduce a new quality measure for Exceptional Preferences Mining, inspired by concepts from Clustering. On top of that, we draw conclusions on two design choices that must necessarily be made whenever one defines a quality measure for any version of Exceptional Model Mining: on the one hand, exceptional behavior is easily (spuriously) found in tiny subgroups, so what is the best way to compensate for that; on the other hand, when gauging exceptionality of a subgroup’s behavior, what does one use as reference for the normal behavior? We find that the choice of correction factor not only influences the subgroup size but it also effects the presumed exceptionality of found subgroups. The entropy function allows for detecting exceptional subgroups of a meaningful size, both when a candidate subgroup is evaluated against its complement and against the entire dataset.
AB - Exceptional Preferences Mining (EPM) combines the research fields of Preference Learning and Exceptional Model Mining. It is a local pattern mining task, where we try to find coherent subgroups of the dataset featuring unusual preferences between a fixed set of labels. We introduce a new quality measure for Exceptional Preferences Mining, inspired by concepts from Clustering. On top of that, we draw conclusions on two design choices that must necessarily be made whenever one defines a quality measure for any version of Exceptional Model Mining: on the one hand, exceptional behavior is easily (spuriously) found in tiny subgroups, so what is the best way to compensate for that; on the other hand, when gauging exceptionality of a subgroup’s behavior, what does one use as reference for the normal behavior? We find that the choice of correction factor not only influences the subgroup size but it also effects the presumed exceptionality of found subgroups. The entropy function allows for detecting exceptional subgroups of a meaningful size, both when a candidate subgroup is evaluated against its complement and against the entire dataset.
KW - Exceptional model mining
KW - Exceptional preferences mining
KW - Label ranking
KW - Pattern mining
KW - Preference learning
UR - https://www.scopus.com/pages/publications/85142710188
U2 - 10.1007/978-3-031-18840-4_31
DO - 10.1007/978-3-031-18840-4_31
M3 - Conference contribution
SN - 9783031188398
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 429
EP - 444
BT - Discovery Science
A2 - Pascal, Poncelet
A2 - Ienco, Dino
PB - Springer
T2 - 25th International Conference, DS 2022
Y2 - 10 October 2022 through 12 October 2022
ER -