Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Incremental subspace clustering over multiple data streams

  • Qi Zhang
  • , Jinze Liu
  • , Wei Wang

Producción científica: Conference contributionrevisión exhaustiva

11 Citas (Scopus)

Resumen

Data streams are often locally correlated, with a subset of streams exhibiting coherent patterns over a subset of time points. Subspace clustering can discover clusters of objects in different subspaces. However, traditional sub-space clustering algorithms for static data sets are not readily used for incremental clustering, and is very expensive for frequent re-clustering over dynamically changing stream data. In this paper, we present an efficient incremental sub-space clustering algorithm for multiple streams over sliding windows. Our algorithm detects all the δ-CC-Clusters, which capture the coherent changing patterns among a set of streams over a set of time points. δ-CC-Clusters are incrementally generated by traversing a directed acyclic graph pDAG. We propose efficient insertion and deletion operations to update thepDAG dynamically. In addition, effective pruning techniques are applied to reduce the search space. Experiments on real data sets demonstrate the performance of our algorithm.

Idioma originalEnglish
Título de la publicación alojadaProceedings of the 7th IEEE International Conference on Data Mining, ICDM 2007
Páginas727-732
Número de páginas6
DOI
EstadoPublished - 2007
Evento7th IEEE International Conference on Data Mining, ICDM 2007 - Omaha, NE, United States
Duración: oct 28 2007oct 31 2007

Serie de la publicación

NombreProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (versión impresa)1550-4786

Conference

Conference7th IEEE International Conference on Data Mining, ICDM 2007
País/TerritorioUnited States
CiudadOmaha, NE
Período10/28/0710/31/07

ASJC Scopus subject areas

  • General Engineering

Huella

Profundice en los temas de investigación de 'Incremental subspace clustering over multiple data streams'. En conjunto forman una huella única.

Citar esto