TY - GEN
T1 - Incremental subspace clustering over multiple data streams
AU - Zhang, Qi
AU - Liu, Jinze
AU - Wang, Wei
PY - 2007
Y1 - 2007
N2 - Data streams are often locally correlated, with a subset of streams exhibiting coherent patterns over a subset of time points. Subspace clustering can discover clusters of objects in different subspaces. However, traditional sub-space clustering algorithms for static data sets are not readily used for incremental clustering, and is very expensive for frequent re-clustering over dynamically changing stream data. In this paper, we present an efficient incremental sub-space clustering algorithm for multiple streams over sliding windows. Our algorithm detects all the δ-CC-Clusters, which capture the coherent changing patterns among a set of streams over a set of time points. δ-CC-Clusters are incrementally generated by traversing a directed acyclic graph pDAG. We propose efficient insertion and deletion operations to update thepDAG dynamically. In addition, effective pruning techniques are applied to reduce the search space. Experiments on real data sets demonstrate the performance of our algorithm.
AB - Data streams are often locally correlated, with a subset of streams exhibiting coherent patterns over a subset of time points. Subspace clustering can discover clusters of objects in different subspaces. However, traditional sub-space clustering algorithms for static data sets are not readily used for incremental clustering, and is very expensive for frequent re-clustering over dynamically changing stream data. In this paper, we present an efficient incremental sub-space clustering algorithm for multiple streams over sliding windows. Our algorithm detects all the δ-CC-Clusters, which capture the coherent changing patterns among a set of streams over a set of time points. δ-CC-Clusters are incrementally generated by traversing a directed acyclic graph pDAG. We propose efficient insertion and deletion operations to update thepDAG dynamically. In addition, effective pruning techniques are applied to reduce the search space. Experiments on real data sets demonstrate the performance of our algorithm.
UR - http://www.scopus.com/inward/record.url?scp=49749129993&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=49749129993&partnerID=8YFLogxK
U2 - 10.1109/ICDM.2007.100
DO - 10.1109/ICDM.2007.100
M3 - Conference contribution
AN - SCOPUS:49749129993
SN - 0769530184
SN - 9780769530185
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 727
EP - 732
BT - Proceedings of the 7th IEEE International Conference on Data Mining, ICDM 2007
T2 - 7th IEEE International Conference on Data Mining, ICDM 2007
Y2 - 28 October 2007 through 31 October 2007
ER -