Simultaneous pattern and data hiding in supervised learning

Pengpeng Lin, Jun Zhang, Xiwei Wang, Art Shindhelm

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The ability to hide private data and confidential patterns from potential adversaries while still maintaining data mining value is an important aspect in privacy preserving data mining. In this paper, we study a nonnegative matrix factorization technique, where we show how to define objective functions and derive corresponding multiplicative update functions. We then use that knowledge to propose a data value perturbation scheme that hides data values but still keeps the data pattern to a large degree. Based on the proposed data value perturbation scheme, we develop a dual data hiding scheme which not only hides data but also hides individual sample's class membership. The essential idea is to use an indicator matrix as a guide for the update process. The performance of the proposed schemes are examined on benchmark datasets for both utility value and data perturbation degree. The empirical results show that the data values are well perturbed and our schemes are capable of hiding a data sample's class membership without side effects. At the end, we draw some interesting conclusions and layout potential future work.

Original languageEnglish
Title of host publicationProceedings of the 2012 IEEE 13th International Conference on Information Reuse and Integration, IRI 2012
Pages385-392
Number of pages8
DOIs
StatePublished - 2012
Event2012 IEEE 13th International Conference on Information Reuse and Integration, IRI 2012 - Las Vegas, NV, United States
Duration: Aug 8 2012Aug 10 2012

Publication series

NameProceedings of the 2012 IEEE 13th International Conference on Information Reuse and Integration, IRI 2012

Conference

Conference2012 IEEE 13th International Conference on Information Reuse and Integration, IRI 2012
Country/TerritoryUnited States
CityLas Vegas, NV
Period8/8/128/10/12

Keywords

  • Classification
  • Indicator Matrix
  • NMF
  • PPDM

ASJC Scopus subject areas

  • Information Systems

Fingerprint

Dive into the research topics of 'Simultaneous pattern and data hiding in supervised learning'. Together they form a unique fingerprint.

Cite this