CodedPrivateML: A Fast and Privacy-Preserving Framework for Distributed Machine Learning

Submitted by admin on Mon, 10/28/2024 - 01:24

How to train a machine learning model while keeping the data private and secure? We present CodedPrivateML, a fast and scalable approach to this critical problem. CodedPrivateML keeps both the data and the model information-theoretically private, while allowing efficient parallelization of training across distributed workers. We characterize CodedPrivateML’s privacy threshold and prove its convergence for logistic (and linear) regression. Furthermore, via extensive experiments on Amazon EC2, we demonstrate that CodedPrivateML provides significant speedup over cryptographic approaches based on multi-party computing (MPC).

Privacy and Security of Information Systems

READ ON IEEE Xplore

Jinhyun So

Başak Güler

A. Salman Avestimehr