Efficient model sharing for scalable collaborative classification

Odysseas Papapetrou, Wolf Siberski, Stefan Siersdorfer

Research output: Contribution to journalArticleAcademicpeer-review

2 Citations (Scopus)

Abstract

We propose a novel collaborative approach for document classification, combining the knowledge of multiple users for improved organization of data such as individual document repositories or emails. To this end, we distribute locally built classification models in a network of participating users, and combine the shared classifiers into more powerful meta models. In order to increase the propagation efficiency, we apply a method for selecting the most discriminative model components and transmitting them to other participants. In our experiments on four large standard collections for text classification we study the resulting tradeoffs between network cost and classification accuracy. The experimental results show that the proposed model propagation has negligible communication costs and substantially outperforms current approaches with respect to efficiency and classification quality.

Original languageEnglish
Pages (from-to)384-398
Number of pages15
JournalPeer-to-Peer Networking and Applications
Volume8
Issue number3
DOIs
Publication statusPublished - 1 May 2015
Externally publishedYes

Keywords

  • Clustering, classification and association rules
  • Data mining
  • Peer-to-peer

Fingerprint

Dive into the research topics of 'Efficient model sharing for scalable collaborative classification'. Together they form a unique fingerprint.

Cite this