Incremental subspace clustering over multiple data streams

Qi Zhang, Jinze Liu, Wei Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

Data streams are often locally correlated, with a subset of streams exhibiting coherent patterns over a subset of time points. Subspace clustering can discover clusters of objects in different subspaces. However, traditional sub-space clustering algorithms for static data sets are not readily used for incremental clustering, and is very expensive for frequent re-clustering over dynamically changing stream data. In this paper, we present an efficient incremental sub-space clustering algorithm for multiple streams over sliding windows. Our algorithm detects all the δ-CC-Clusters, which capture the coherent changing patterns among a set of streams over a set of time points. δ-CC-Clusters are incrementally generated by traversing a directed acyclic graph pDAG. We propose efficient insertion and deletion operations to update thepDAG dynamically. In addition, effective pruning techniques are applied to reduce the search space. Experiments on real data sets demonstrate the performance of our algorithm.

Original languageEnglish
Title of host publicationProceedings of the 7th IEEE International Conference on Data Mining, ICDM 2007
Pages727-732
Number of pages6
DOIs
StatePublished - 2007
Event7th IEEE International Conference on Data Mining, ICDM 2007 - Omaha, NE, United States
Duration: Oct 28 2007Oct 31 2007

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Conference

Conference7th IEEE International Conference on Data Mining, ICDM 2007
Country/TerritoryUnited States
CityOmaha, NE
Period10/28/0710/31/07

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'Incremental subspace clustering over multiple data streams'. Together they form a unique fingerprint.

Cite this