Simultaneous pattern and data hiding in unsupervised learning

Jie Wang, Jun Zhang, Lian Liu, Dianwei Han

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

How to control the level of knowledge disclosure and secure certain confidential patterns is a subtask comparable to confidential data hiding in privacy preserving data mining. We propose a technique to simultaneously hide data values and confidential patterns without undesirable side effects on distorting nonconfidential patterns. We use non-negative matrix factorization technique to distort the original dataset and preserve its overall characteristics. A factor swapping method is designed to hide particular confidential patterns for k-means clustering. The effectiveness of this novel hiding technique is examined on a benchmark dataset. Experimental results indicate that our technique can produce a single modified dataset to achieve both pattern and data value hiding. Under certain constraints on the nonnegative matrix factorization iterations, an optimal solution can be computed in which the user-specified confidential memberships or relationships are hidden without undesirable alterations on nonconfidential patterns.

Original languageEnglish
Title of host publicationICDM Workshops 2007 - Proceedings of the 17th IEEE International Conference on Data Mining Workshops
Pages729-734
Number of pages6
DOIs
StatePublished - 2007
Event17th IEEE International Conference on Data Mining Workshops, ICDM Workshops 2007 - Omaha, NE, United States
Duration: Oct 28 2007Oct 31 2007

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Conference

Conference17th IEEE International Conference on Data Mining Workshops, ICDM Workshops 2007
Country/TerritoryUnited States
CityOmaha, NE
Period10/28/0710/31/07

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'Simultaneous pattern and data hiding in unsupervised learning'. Together they form a unique fingerprint.

Cite this