OP-cluster: Clustering by tendency in high dimensional space

Jinze Liu, Wei Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

161 Scopus citations

Abstract

Clustering is the process of grouping a set of objects into classes of similar objects. Because of unknownness of the hidden patterns in the data sets, the definition of similarity is very subtle. Until recently, similarity measures are typically based on distances, e.g Euclidean distance and cosine distance. In this paper, we propose a flexible yet powerful clustering model, namely OP-Cluster (Order Preserving Cluster). Under this new model, two objects are similar on a subset of dimensions if the values of these two objects induce the same relative order of those dimensions. Such a cluster might arise when the expression levels of (coregulated) genes can rise or fall synchronously in response to a sequence of environment stimuli. Hence, discovery of OP-Cluster is essential in revealing significant gene regulatory networks. A deterministic algorithm is designed and implemented to discover all the significant OP-Clusters. A set of extensive experiments has been done on several real biological data sets to demonstrate its effectiveness and efficiency in detecting co-regulated patterns.

Original languageEnglish
Title of host publicationProceedings - 3rd IEEE International Conference on Data Mining, ICDM 2003
Pages187-194
Number of pages8
StatePublished - 2003
Event3rd IEEE International Conference on Data Mining, ICDM '03 - Melbourne, FL, United States
Duration: Nov 19 2003Nov 22 2003

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Conference

Conference3rd IEEE International Conference on Data Mining, ICDM '03
Country/TerritoryUnited States
CityMelbourne, FL
Period11/19/0311/22/03

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'OP-cluster: Clustering by tendency in high dimensional space'. Together they form a unique fingerprint.

Cite this