Constrained nonnegative matrix factorization based data distortion techniques study of data privacy and utility

Nirmal Thapa, Peng Peng Lin, Lian Liu, Jie Wang, Jun Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the rise of data mining techniques came across the problem of privacy disclosure, that is why it has become one of the top priorities as far as designing the data mining techniques is concerned. In this paper, we briefly discuss the Nonnegative Matrix Factorization (NMF) and the motivation behind using NMF for data representation. We provide the mathematical derivation for NMF with some additional constraints. Based on the mathematical derivations, we propose a couple of novel data distortion strategies. The first technique is called the Constrained Nonnegative Matrix Factorization (CMF) and the second one is Sparsified CNMF. We study the distortion level of each of these algorithms with the other matrix based techniques like SVD and NMF. K-means is used to study the data utility of the two proposed methods. Our experimental results show that, in comparison with standard data distortion techniques, the proposed schemes are very effective in achieving a good tradeoff between data privacy and data utility, and affords a feasible solution to protect sensitive information and promise higher accuracy in decision making. We investigate utility of the perturbed data based on the results from the original data.

Original languageEnglish
Title of host publicationDATA 2012 - Proceedings of the International Conference on Data Technologies and Applications
Pages51-56
Number of pages6
StatePublished - 2012
Event1st International Conference on Data Technologies and Applications, DATA 2012 - Rome, Italy
Duration: Jul 25 2012Jul 27 2012

Publication series

NameDATA 2012 - Proceedings of the International Conference on Data Technologies and Applications

Conference

Conference1st International Conference on Data Technologies and Applications, DATA 2012
Country/TerritoryItaly
CityRome
Period7/25/127/27/12

Keywords

  • Constrained NMF
  • Data distortion
  • NMF
  • Nonnegative matrix factorization
  • SVD

ASJC Scopus subject areas

  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Constrained nonnegative matrix factorization based data distortion techniques study of data privacy and utility'. Together they form a unique fingerprint.

Cite this