Samenvatting
In this paper, we provide a unified view of generative classification and sequence
probability estimation. The context-tree weighting (CTW) algorithm, which was
originally proposed as an optimum universal source coding algorithm, is thus ex
tended to the field of pattern recognition. While focusing on binary classification
problems in this paper, we propose single tree and multiple tree CT’~V classifiers,
and derive decision criteria for both classifiers. A few ad-hoc approaches to order
the features in a single tree and divide the features into different trees are pre
sented. As an example in our numerical study, we consider spain email filtering
problem. Numerical results show that the approaches proposed in this paper
clearly outperforms the naive Bayes classifier, which is a conventional generative
classifier.
Originele taal-2 | Engels |
---|---|
Titel | Proceedings of the 28th Symposium on Information Theory in the Benelux, 24-25 May 2007, Enschede, The Netherlands |
Redacteuren | H. Cronie, H. Hoeksema, R. Veldhuis, F. Hoeksema |
Plaats van productie | Enschede |
Uitgeverij | Universiteit Twente |
Pagina's | 321-328 |
ISBN van geprinte versie | 90-365-2509-1 |
Status | Gepubliceerd - 2007 |
Evenement | conference; 28th Symposium on Information Theory in the Benelux, Enschede, The Netherlands; 2007-05-24; 2007-05-25 - Duur: 24 mei 2007 → 25 mei 2007 |
Congres
Congres | conference; 28th Symposium on Information Theory in the Benelux, Enschede, The Netherlands; 2007-05-24; 2007-05-25 |
---|---|
Periode | 24/05/07 → 25/05/07 |
Ander | 28th Symposium on Information Theory in the Benelux, Enschede, The Netherlands |