Adversarial Learning Targeting Deep Neural Network Classification: A Comprehensive Review of Defenses Against Attacks

对抗制 深度学习 人工智能 人工神经网络 计算机科学 深层神经网络 机器学习 计算机安全
作者
David J. Miller,Zhen Xiang,George Kesidis
出处
期刊:Proceedings of the IEEE [Institute of Electrical and Electronics Engineers]
卷期号:108 (3): 402-433 被引量:237
标识
DOI:10.1109/jproc.2020.2970615
摘要

With wide deployment of machine learning (ML)-based systems for a variety of applications including medical, military, automotive, genomic, multimedia, and social networking, there is great potential for damage from adversarial learning (AL) attacks. In this article, we provide a contemporary survey of AL, focused particularly on defenses against attacks on deep neural network classifiers. After introducing relevant terminology and the goals and range of possible knowledge of both attackers and defenders, we survey recent work on test-time evasion (TTE), data poisoning (DP), backdoor DP, and reverse engineering (RE) attacks and particularly defenses against the same. In so doing, we distinguish robust classification from anomaly detection (AD), unsupervised from supervised, and statistical hypothesis-based defenses from ones that do not have an explicit null (no attack) hypothesis. We also consider several scenarios for detecting backdoors. We provide a technical assessment for reviewed works, including identifying any issues/limitations, required hyperparameters, needed computational complexity, as well as the performance measures evaluated and the obtained quality. We then delve deeper, providing novel insights that challenge conventional AL wisdom and that target unresolved issues, including: robust classification versus AD as a defense strategy; the belief that attack success increases with attack strength, which ignores susceptibility to AD; small perturbations for TTE attacks: a fallacy or a requirement; validity of the universal assumption that a TTE attacker knows the ground-truth class for the example to be attacked; black, gray, or white-box attacks as the standard for defense evaluation; and susceptibility of query-based RE to an AD defense. We also discuss attacks on the privacy of training data. We then present benchmark comparisons of several defenses against TTE, RE, and backdoor DP attacks on images. The article concludes with a discussion of continuing research directions, including the supreme challenge of detecting attacks whose goal is not to alter classification decisions, but rather simply to embed, without detection, “fake news” or other false content.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
管某发布了新的文献求助10
刚刚
小羊败下阵来完成签到,获得积分10
2秒前
问霖完成签到,获得积分10
2秒前
3秒前
3秒前
3秒前
4秒前
番茄炒蛋完成签到,获得积分10
4秒前
阿华关注了科研通微信公众号
5秒前
6秒前
8秒前
王懒懒完成签到 ,获得积分10
9秒前
Xenia发布了新的文献求助10
9秒前
哈基米发布了新的文献求助10
10秒前
大白发布了新的文献求助10
10秒前
sunzyu发布了新的文献求助10
10秒前
11秒前
12秒前
科研通AI2S应助两只晕虾采纳,获得10
12秒前
liangbingyan完成签到,获得积分20
12秒前
12秒前
16秒前
王美祥发布了新的文献求助10
17秒前
Zhe完成签到,获得积分10
17秒前
所所应助1111采纳,获得10
17秒前
18秒前
迷人的问枫完成签到,获得积分20
18秒前
言非离完成签到,获得积分10
19秒前
19秒前
唠叨的完成签到,获得积分10
20秒前
撒西不理发布了新的文献求助10
22秒前
南西完成签到,获得积分10
23秒前
may完成签到,获得积分10
23秒前
小蜜蜂完成签到,获得积分10
24秒前
24秒前
小小鹿完成签到,获得积分10
26秒前
无期完成签到,获得积分10
27秒前
可爱的函函应助ZJH采纳,获得10
28秒前
科目三应助喜悦的毛豆采纳,获得10
28秒前
高分求助中
Invited Discussant 63O and 64O 1000
Ideology and Meaning-Making under the Putin Regime 750
Petrology and Plate Tectonics 500
Writing Systems 500
A Handbook of User Experience Research & Design in Libraries 400
Understanding Modeling and Simulation of Polymerization Reactions 400
Direct and Iterative Linear System Solvers 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 计算机科学 化学工程 生物化学 物理 内科学 复合材料 催化作用 光电子学 物理化学 电极 细胞生物学 基因 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6901146
求助须知:如何正确求助?哪些是违规求助? 8595856
关于积分的说明 18249289
捐赠科研通 6301541
什么是DOI,文献DOI怎么找? 3062371
关于科研通互助平台的介绍 2083423
邀请新用户注册赠送积分活动 2040242