Abstract
Clustering is a fundamental problem in data analysis. In differentially private clustering, the goal is to identify k cluster centers without disclosing information on individual data points. Despite significant research progress, the problem had so far resisted practical solutions. In this work we aim at providing simple implementable differentially private clustering algorithms that provide utility when the data is”easy, ” e.g., when there exists a significant separation between the clusters. We propose a framework that allows us to apply non-private clustering algorithms to the easy instances and privately combine the results. We are able to get improved sample complexity bounds in some cases of Gaussian mixtures and k-means. We complement our theoretical analysis with an empirical evaluation on synthetic data.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 38th International Conference on Machine Learning, ICML 2021 |
| Publisher | ML Research Press |
| Pages | 2049-2059 |
| Number of pages | 11 |
| ISBN (Electronic) | 9781713845065 |
| State | Published - 2021 |
| Externally published | Yes |
| Event | 38th International Conference on Machine Learning, ICML 2021 - Virtual, Online Duration: 18 Jul 2021 → 24 Jul 2021 |
Publication series
| Name | Proceedings of Machine Learning Research |
|---|---|
| Volume | 139 |
| ISSN (Electronic) | 2640-3498 |
Conference
| Conference | 38th International Conference on Machine Learning, ICML 2021 |
|---|---|
| City | Virtual, Online |
| Period | 18/07/21 → 24/07/21 |
Bibliographical note
Publisher Copyright:Copyright © 2021 by the author(s)
Fingerprint
Dive into the research topics of 'Differentially-Private Clustering of Easy Instances'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver