Privacy-Preserving and Outsourced Multi-user K-Means Clustering

Fang Yu Rao, Bharath K. Samanthula, Elisa Bertino, Xun Yi, Dongxi Liu

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

23 Scopus citations

Abstract

Many techniques for privacy-preserving data mining (PPDM) have been investigated over the past decade. Such techniques, however, usually impose heavy computational and communication costs on the participating parties, so entities with limited resources may have to refrain from participating in the PPDM process. One promising solution to this issue is to outsource the tasks to a cloud environment. In this paper, we propose a novel and efficient solution to privacy-preserving outsourced distributed clustering (PPODC) for multiple users, based on the k-means clustering algorithm. The main novelty of our solution lies in avoiding the secure division operations required in computing cluster centers, which it achieves through efficient transformation techniques. In addition, we discuss two strategies, namely offline computation and pipelined execution, that aim to boost the performance of our protocol. We implement our protocol on a cluster of 16 nodes and demonstrate, through extensive experiments using a real dataset, how our two strategies combined with parallelism can significantly improve the performance of our protocol.
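The abstract does not spell out the transformation that avoids division. As a rough plaintext illustration of why division can be avoided at all, the sketch below keeps each cluster as a pair (component-wise sum, member count) instead of a computed center, and compares distances by cross-multiplying with the counts. This is an assumption made for illustration, not necessarily the paper's technique; it involves no encryption, and the names `scaled_sq_dist` and `nearest_cluster` are hypothetical.

```python
from typing import List, Sequence, Tuple

# A cluster is kept as (component-wise sum of its members, number of members).
# The center would be sum / count, but it is never computed here.
Cluster = Tuple[Sequence[int], int]


def scaled_sq_dist(x: Sequence[int], cluster: Cluster) -> int:
    """Return count^2 * ||x - center||^2 using only integer add/multiply."""
    s, n = cluster
    return sum((n * xi - si) ** 2 for xi, si in zip(x, s))


def nearest_cluster(x: Sequence[int], clusters: List[Cluster]) -> int:
    """Index of the closest cluster; assumes every count is positive."""
    best = 0
    for j in range(1, len(clusters)):
        d_best = scaled_sq_dist(x, clusters[best])
        d_j = scaled_sq_dist(x, clusters[j])
        n_best = clusters[best][1]
        n_j = clusters[j][1]
        # d_j / n_j^2 < d_best / n_best^2  <=>  d_j * n_best^2 < d_best * n_j^2
        if d_j * n_best ** 2 < d_best * n_j ** 2:
            best = j
    return best


# Two clusters whose (uncomputed) centers are (1, 2) and (10, 10).
clusters = [([2, 4], 2), ([30, 30], 3)]
print(nearest_cluster([2, 2], clusters))
print(nearest_cluster([9, 9], clusters))
```

Because every step is an integer addition, multiplication, or comparison, the same comparison could in principle be expressed over ciphertexts that support those operations. How the actual protocol does this securely is described in the paper, not here.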

Original language: English
Title of host publication: Proceedings - 2015 IEEE Conference on Collaboration and Internet Computing, CIC 2015
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 80-89
Number of pages: 10
ISBN (Electronic): 9781509000890
State: Published - Mar 1 2016
Event: 1st IEEE International Conference on Collaboration and Internet Computing, CIC 2015 - Hangzhou, China
Duration: Oct 28 2015 to Oct 30 2015

Publication series

Name: Proceedings - 2015 IEEE Conference on Collaboration and Internet Computing, CIC 2015

Other

Other: 1st IEEE International Conference on Collaboration and Internet Computing, CIC 2015
Country/Territory: China
City: Hangzhou
Period: 10/28/15 to 10/30/15

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Computer Networks and Communications

Keywords

  • Cloud computing
  • Encrypted data
  • K-means clustering
  • Privacy
