Estimating Numerical Distributions under Local Differential Privacy

Zitao Li, Tianhao Wang, Milan Lopuhaä-Zwakenberg, Ninghui Li, Boris Škoric

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

90 Citations (Scopus)

Abstract

When collecting information, local differential privacy (LDP) relieves the concern of privacy leakage from users' perspective, as user's private information is randomized before sent to the aggregator. We study the problem of recovering the distribution over a numerical domain while satisfying LDP. While one can discretize a numerical domain and then apply the protocols developed for categorical domains, we show that taking advantage of the numerical nature of the domain results in better trade-off of privacy and utility. We introduce a new reporting mechanism, called the square wave (SW) mechanism, which exploits the numerical nature in reporting. We also develop an Expectation Maximization with Smoothing (EMS) algorithm, which is applied to aggregated histograms from the SW mechanism to estimate the original distributions. Extensive experiments demonstrate that our proposed approach, SW with EMS, consistently outperforms other methods in a variety of utility metrics.

Original languageEnglish
Title of host publicationSIGMOD 2020 - Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data
PublisherAssociation for Computing Machinery, Inc.
Pages621-635
Number of pages15
ISBN (Electronic)9781450367356
DOIs
Publication statusPublished - 14 Jun 2020
Event2020 ACM SIGMOD International Conference on Management of Data, SIGMOD 2020 - Portland, United States
Duration: 14 Jun 202019 Jun 2020

Conference

Conference2020 ACM SIGMOD International Conference on Management of Data, SIGMOD 2020
Country/TerritoryUnited States
CityPortland
Period14/06/2019/06/20

Funding

This project is supported by NSF grant 1640374, NWO grant 628.001.026, and NSF grant 1931443. We thank the anonymous reviewers for their helpful suggestions.

Keywords

  • density estimation
  • local differential privacy

Fingerprint

Dive into the research topics of 'Estimating Numerical Distributions under Local Differential Privacy'. Together they form a unique fingerprint.

Cite this