Privacy preserving data mining

Research output: Contribution to journalArticlepeer-review

469 Scopus citations

Abstract

In this paper we address the issue of privacy preserving data mining. Specifically, we consider a scenario in which two parties owning confidential databases wish to run a data mining algorithm on the union of their databases, without revealing any unnecessary information. Our work is motivated by the need both to protect privileged information and to enable its use for research or other purposes. The above problem is a specific example of secure multi-party computation and, as such, can be solved using known generic protocols. However, data mining algorithms are typically complex and, furthermore, the input usually consists of massive data sets. The generic protocols in such a case are of no practical use and therefore more efficient protocols are required. We focus on the problem of decision tree learning with the popular ID3 algorithm. Our protocol is considerably more efficient than generic solutions and demands both very few rounds of communication and reasonable bandwidth.

Original languageEnglish
Pages (from-to)177-206
Number of pages30
JournalJournal of Cryptology
Volume15
Issue number3
DOIs
StatePublished - Jun 2003
Externally publishedYes

Keywords

  • Data mining
  • Decision trees
  • Oblivious polynomial evaluation
  • Oblivious transfer
  • Secure two-party computation

Fingerprint

Dive into the research topics of 'Privacy preserving data mining'. Together they form a unique fingerprint.

Cite this