Differential privacy for Bayesian inference through posterior sampling
Journal article, 2017

Differential privacy formalises privacy-preserving mechanisms that provide access to a database. Can Bayesian inference be used directly to provide private access to data? The answer is yes: under certain conditions on the prior, sampling from the posterior distribution can lead to a desired level of privacy and utility. For a uniform treatment, we define differential privacy over arbitrary data set metrics, outcome spaces and distribution families. This allows us to also deal with non-i.i.d or non-tabular data sets. We then prove bounds on the sensitivity of the posterior to the data, which delivers a measure of robustness. We also show how to use posterior sampling to provide differentially private responses to queries, within a decision-theoretic framework. Finally, we provide bounds on the utility of answers to queries and on the ability of an adversary to distinguish between data sets. The latter are complemented by a novel use of Le Cam's method to obtain lower bounds on distinguishability. Our results hold for arbitrary metrics, including those for the common definition of differential privacy. For specific choices of the metric, we give a number of examples satisfying our assumptions. © 2017 C Dimitrakakis, B. Nelson, Z. Zhang, A. Mitrokotsa, B. I. P. Rubinstein.

Privacy preserving

Distinguishability

Tabular data Engineering main heading: Inference engines

Data privacy Bayesian inference

Lower bounds

Differential privacies

Engineering controlled terms: Bayesian networks

Decision-theoretic

Posterior distributions

Author

Christos Dimitrakakis

Chalmers, Computer Science and Engineering (Chalmers), Computing Science (Chalmers)

B.E.N. Nelson

Google Inc.

Zuhe Zhang

University of Melbourne

Aikaterini Mitrokotsa

Chalmers, Computer Science and Engineering (Chalmers), Networks and Systems (Chalmers)

B. Rubinstein

University of Melbourne

Journal of Machine Learning Research

1532-4435 (ISSN) 1533-7928 (eISSN)

Vol. 18 1 March 2017

Subject Categories (SSIF 2011)

Computer and Information Science

More information

Latest update

4/20/2018