Clustering-structure representative sampling from graph streams

J. Zhang, Kaijie Zhu, Y. Pei, G.H.L. Fletcher, M. Pechenizkiy

Research output: Contribution to conference › Paper › Academic

Abstract

Most existing sampling algorithms on graphs (i.e., network-structured data) focus on sampling from memory-resident static graphs and assume that the entire graph is always available. However, the graphs encountered in modern applications are often too large and/or too dynamic to be processed with limited memory.
Furthermore, existing sampling techniques are inadequate for preserving the inherent clustering structure, which is an essential property of complex networks.
To tackle these problems, we propose a new sampling algorithm that dynamically maintains a representative sample and is capable of retaining clustering structure in graph streams at any time.
The performance of the proposed algorithm is evaluated through empirical experiments on real-world networks. The experimental results show that our proposed CPIES algorithm can produce clustering-structure representative samples and outperforms current online sampling algorithms.
Original language: English
Publication status: Published - 2017
