TY - JOUR
T1 - A tree augmented classifier based on Extreme Imprecise Dirichlet Model
AU - Corani, G.
AU - de Campos, C.P.
PY - 2010
Y1 - 2010
N2 - We present TANC, a TAN classifier (tree-augmented naive) based on imprecise probabilities. TANC models prior near-ignorance via the Extreme Imprecise Dirichlet Model (EDM). A first contribution of this paper is the experimental comparison between EDM and the global Imprecise Dirichlet Model using the naive credal classifier (NCC), with the aim of showing that EDM is a sensible approximation of the global IDM. TANC is able to deal with missing data in a conservative manner by considering all possible completions (without assuming them to be missing-at-random), but avoiding an exponential increase of the computational time. By experiments on real data sets, we show that TANC is more reliable than the Bayesian TAN and that it provides better performance compared to previous TANs based on imprecise probabilities. Yet, TANC is sometimes outperformed by NCC because the learned TAN structures are too complex; this calls for novel algorithms for learning the TAN structures, better suited for an imprecise probability classifier.
AB - We present TANC, a TAN classifier (tree-augmented naive) based on imprecise probabilities. TANC models prior near-ignorance via the Extreme Imprecise Dirichlet Model (EDM). A first contribution of this paper is the experimental comparison between EDM and the global Imprecise Dirichlet Model using the naive credal classifier (NCC), with the aim of showing that EDM is a sensible approximation of the global IDM. TANC is able to deal with missing data in a conservative manner by considering all possible completions (without assuming them to be missing-at-random), but avoiding an exponential increase of the computational time. By experiments on real data sets, we show that TANC is more reliable than the Bayesian TAN and that it provides better performance compared to previous TANs based on imprecise probabilities. Yet, TANC is sometimes outperformed by NCC because the learned TAN structures are too complex; this calls for novel algorithms for learning the TAN structures, better suited for an imprecise probability classifier.
KW - Classification
KW - Classifier
KW - Imprecise Dirichlet Model
KW - Naive credal
KW - TANC
UR - http://www.scopus.com/inward/record.url?scp=79960128281&partnerID=8YFLogxK
U2 - 10.1016/j.ijar.2010.08.007
DO - 10.1016/j.ijar.2010.08.007
M3 - Article
SN - 0888-613X
VL - 51
SP - 1053
EP - 1068
JO - International Journal of Approximate Reasoning
JF - International Journal of Approximate Reasoning
IS - 9
ER -