Abstract
When collecting information, local differential privacy (LDP) relieves the concern of privacy leakage from users' perspective, as user's private information is randomized before sent to the aggregator. We study the problem of recovering the distribution over a numerical domain while satisfying LDP. While one can discretize a numerical domain and then apply the protocols developed for categorical domains, we show that taking advantage of the numerical nature of the domain results in better trade-off of privacy and utility. We introduce a new reporting mechanism, called the square wave (SW) mechanism, which exploits the numerical nature in reporting. We also develop an Expectation Maximization with Smoothing (EMS) algorithm, which is applied to aggregated histograms from the SW mechanism to estimate the original distributions. Extensive experiments demonstrate that our proposed approach, SW with EMS, consistently outperforms other methods in a variety of utility metrics.
Original language | English |
---|---|
Title of host publication | SIGMOD 2020 - Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data |
Publisher | Association for Computing Machinery, Inc. |
Pages | 621-635 |
Number of pages | 15 |
ISBN (Electronic) | 9781450367356 |
DOIs | |
Publication status | Published - 14 Jun 2020 |
Event | 2020 ACM SIGMOD International Conference on Management of Data, SIGMOD 2020 - Portland, United States Duration: 14 Jun 2020 → 19 Jun 2020 |
Conference
Conference | 2020 ACM SIGMOD International Conference on Management of Data, SIGMOD 2020 |
---|---|
Country/Territory | United States |
City | Portland |
Period | 14/06/20 → 19/06/20 |
Funding
This project is supported by NSF grant 1640374, NWO grant 628.001.026, and NSF grant 1931443. We thank the anonymous reviewers for their helpful suggestions.
Keywords
- density estimation
- local differential privacy