Privacy Preserving Data Mining

Research output: Contribution to journal › Article › peer-review

Abstract

In this paper we address the issue of privacy preserving data mining. Specifically, we consider a scenario in which two parties owning confidential databases wish to run a data mining algorithm on the union of their databases, without revealing any unnecessary information. Our work is motivated by the need both to protect privileged information and to enable its use for research or other purposes. The above problem is a specific example of secure multi-party computation and, as such, can be solved using known generic protocols. However, data mining algorithms are typically complex and, furthermore, the input usually consists of massive data sets. The generic protocols in such a case are of no practical use and therefore more efficient protocols are required. We focus on the problem of decision tree learning with the popular ID3 algorithm. Our protocol is considerably more efficient than generic solutions and demands both very few rounds of communication and reasonable bandwidth.
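As a point of reference for the ID3 step mentioned above, the following is a minimal, non-private sketch of how ID3 picks a split attribute by information gain when the two databases are simply concatenated. It is not the protocol from the paper: it shows only the plain computation that a privacy-preserving protocol would need to reproduce without pooling the data. The function names (`entropy`, `information_gain`, `best_attribute`) and the toy records in `party_a` and `party_b` are hypothetical and chosen for illustration.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, attribute, target):
    """Entropy reduction obtained by splitting `rows` on `attribute`."""
    base = entropy([r[target] for r in rows])
    # Group the target labels by the value each row takes for `attribute`.
    partitions = {}
    for r in rows:
        partitions.setdefault(r[attribute], []).append(r[target])
    # Weighted average entropy of the partitions after the split.
    remainder = sum(len(p) / len(rows) * entropy(p) for p in partitions.values())
    return base - remainder


def best_attribute(rows, attributes, target):
    """ID3 splitting rule: choose the attribute with the largest information gain."""
    return max(attributes, key=lambda a: information_gain(rows, a, target))


# Hypothetical toy databases held by the two parties (illustration only).
party_a = [
    {"outlook": "sunny", "windy": "no", "play": "no"},
    {"outlook": "sunny", "windy": "yes", "play": "no"},
]
party_b = [
    {"outlook": "rainy", "windy": "no", "play": "yes"},
    {"outlook": "rainy", "windy": "yes", "play": "yes"},
]

# Non-private baseline: pool the records, which is exactly what the
# two parties in the abstract's scenario want to avoid doing.
union = party_a + party_b
chosen = best_attribute(union, ["outlook", "windy"], "play")
```

In this toy data `outlook` separates the classes perfectly while `windy` carries no information, so the rule selects `outlook`. In the setting the abstract describes, each party would hold only its own records, and the same choice would have to be reached without either side revealing them.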
Original language: American English
Pages (from-to): 177-206
Journal: Journal of Cryptology
Volume: 15
Issue number: 3
State: Published - 2002

Bibliographical note

STAR Lab, Intertrust Technologies, 4750 Patrick Henry Drive, Santa Clara, CA 95054, U.S.A. bpinkas@intertrust.com, benny@pinkas.net
