Revealing true subspace clusters in high dimensions

  • Jinze Liu
  • , Karl Strohmaier
  • , Wei Wang

Producción científica: Conference contributionrevisión exhaustiva

4 Citas (Scopus)

Resumen

Subspace clustering is one of the best approaches for discovering meaningful clusters in high dimensional space. One cluster in high dimensional space may be transcribed into multiple distinct maximal clusters by projecting onto different subspaces. A direct consequence of clustering independently in each subspace is an overwhelmingly large set of overlapping clusters which may be significantly similar. To reveal the true underlying clusters, we propose a similarity measurement of the overlapping clusters. We adopt the model of Gaussian tailed hyper-rectangles to capture the distribution of any subspace cluster. A set of experiments on a synthetic dataset demonstrates the effectiveness of our approach. Application to real gene expression data also reveals impressive meta-clusters expected by biologists.

Idioma originalEnglish
Título de la publicación alojadaProceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004
EditoresR. Rastogi, K. Morik, M. Bramer, X. Wu
Páginas463-466
Número de páginas4
EstadoPublished - 2004
EventoProceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004 - Brighton, United Kingdom
Duración: nov 1 2004nov 4 2004

Serie de la publicación

NombreProceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004

Conference

ConferenceProceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004
País/TerritorioUnited Kingdom
CiudadBrighton
Período11/1/0411/4/04

ASJC Scopus subject areas

  • General Engineering

Huella

Profundice en los temas de investigación de 'Revealing true subspace clusters in high dimensions'. En conjunto forman una huella única.

Citar esto