Computer science
Differential privacy
Stochastic gradient descent algorithm
Asynchronous communication
Information privacy
Leakage (economics)
Private information retrieval
Data sharing
Information sensitivity
Information leakage
Artificial noise
Data mining
Machine learning
Deep learning
Data modeling
Artificial intelligence
Computer security
Artificial neural network
Computer network
Database
Macroeconomics
Economics
Medicine
Channel (broadcasting)
Transmitter
Alternative medicine
Pathology
Authors
Qi Zhao, Chuan Zhao, Shujie Cui, Shan Jing, Zhenxiang Chen
Abstract
Large-scale data training is vital to the generalization performance of deep learning (DL) models. However, collecting data directly is associated with an increased risk of privacy disclosure, particularly in sensitive fields such as healthcare, finance, and genomics. To protect training data privacy, collaborative deep learning (CDL) has been proposed to enable joint training across multiple data owners while providing a reliable privacy guarantee. However, recent studies have shown that CDL is vulnerable to several attacks that can reveal sensitive information about the original training data. One of the most powerful attacks exploits the leakage from gradient sharing during the collaborative training process. In this study, we present a new CDL framework, PrivateDL, to effectively protect private training data against leakage from gradient sharing. Unlike the conventional training process, which trains on private data directly, PrivateDL allows effective transfer of relational knowledge from sensitive data to public data in a privacy-preserving way, and enables participants to jointly learn local models based on the public data with noise-preserving labels. In this way, PrivateDL establishes a privacy gap between the local models and the private datasets, thereby ensuring privacy against attacks launched on the local models through gradient sharing. Moreover, we propose a new algorithm called Distributed Aggregation Stochastic Gradient Descent, which is designed to improve the efficiency and accuracy of CDL, especially in the asynchronous training mode. Experimental results demonstrate that PrivateDL preserves data privacy with reasonable performance overhead.
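The abstract describes the two mechanisms only at a high level, so the following Python sketches are illustrative assumptions rather than the authors' actual algorithms. The first sketch shows one common way to transfer knowledge from a model trained on private data to a public dataset through noise-protected labels, so that local models never train on the private examples directly; the noisy_labels helper, the Laplace noise, and the epsilon parameter are hypothetical choices made for this sketch.

import numpy as np

rng = np.random.default_rng(0)

def noisy_labels(teacher_logits, epsilon=1.0):
    # Teacher scores come from a model trained on PRIVATE data; Laplace noise with
    # scale 1/epsilon is added before the argmax, so the released hard labels do not
    # expose the exact scores. (epsilon is a hypothetical privacy knob in this sketch.)
    noise = rng.laplace(scale=1.0 / epsilon, size=teacher_logits.shape)
    return np.argmax(teacher_logits + noise, axis=1)

# Toy example: teacher predictions for 5 PUBLIC samples over 3 classes.
public_logits = rng.normal(size=(5, 3))
labels_for_local_training = noisy_labels(public_logits, epsilon=0.5)
print(labels_for_local_training)  # local models then train on (public data, noisy labels)

The second sketch gives a minimal, simulated view of asynchronous gradient aggregation in the spirit of distributed SGD; it is not the paper's Distributed Aggregation Stochastic Gradient Descent, whose exact update rule the abstract does not specify. The linear model, learning rate, and random "arrival" of worker gradients are assumptions made purely for illustration.

import numpy as np

rng = np.random.default_rng(1)

def local_gradient(w, X, y):
    # Gradient of mean squared error for a linear model X @ w ≈ y.
    return 2.0 * X.T @ (X @ w - y) / len(y)

# Three workers, each holding a shard of (public) training data.
shards = [(rng.normal(size=(32, 4)), rng.normal(size=32)) for _ in range(3)]
w = np.zeros(4)
lr = 0.05

for step in range(100):
    # Simulated asynchrony: only a random subset of workers reports in this round.
    arrived = [i for i in range(3) if rng.random() < 0.7] or [0]
    grads = [local_gradient(w, *shards[i]) for i in arrived]
    w -= lr * np.mean(grads, axis=0)  # the aggregator averages and applies the update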