TrojanZoo: Towards Unified, Holistic, and Practical Evaluation of Neural Backdoors

后门 计算机科学 可解释性 计算机安全 特洛伊木马 稳健性(进化) 威胁模型 人工智能 机器学习 生物化学 化学 基因
作者
Ren Pang,Zheng Gang Zhang,Gao Xiangshan,Zhaohan Xi,Shouling Ji,Peng Cheng,Ting Wang
出处
期刊:Cornell University - arXiv
标识
DOI:10.48550/arxiv.2012.09302
摘要

Neural backdoors represent one primary threat to the security of deep learning systems. The intensive research has produced a plethora of backdoor attacks/defenses, resulting in a constant arms race. However, due to the lack of evaluation benchmarks, many critical questions remain under-explored: (i) what are the strengths and limitations of different attacks/defenses? (ii) what are the best practices to operate them? and (iii) how can the existing attacks/defenses be further improved? To bridge this gap, we design and implement TROJANZOO, the first open-source platform for evaluating neural backdoor attacks/defenses in a unified, holistic, and practical manner. Thus far, focusing on the computer vision domain, it has incorporated 8 representative attacks, 14 state-of-the-art defenses, 6 attack performance metrics, 10 defense utility metrics, as well as rich tools for in-depth analysis of the attack-defense interactions. Leveraging TROJANZOO, we conduct a systematic study on the existing attacks/defenses, unveiling their complex design spectrum: both manifest intricate trade-offs among multiple desiderata (e.g., the effectiveness, evasiveness, and transferability of attacks). We further explore improving the existing attacks/defenses, leading to a number of interesting findings: (i) one-pixel triggers often suffice; (ii) training from scratch often outperforms perturbing benign models to craft trojan models; (iii) optimizing triggers and trojan models jointly greatly improves both attack effectiveness and evasiveness; (iv) individual defenses can often be evaded by adaptive attacks; and (v) exploiting model interpretability significantly improves defense robustness. We envision that TROJANZOO will serve as a valuable platform to facilitate future research on neural backdoors.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
hoojack发布了新的文献求助10
1秒前
6秒前
7秒前
田様应助科研通管家采纳,获得10
8秒前
星辰大海应助科研通管家采纳,获得10
8秒前
充电宝应助科研通管家采纳,获得10
8秒前
wanci应助科研通管家采纳,获得10
8秒前
田様应助科研通管家采纳,获得10
8秒前
benben应助科研通管家采纳,获得10
8秒前
小二郎应助科研通管家采纳,获得10
8秒前
Owen应助科研通管家采纳,获得10
8秒前
Hello应助科研通管家采纳,获得10
9秒前
科研通AI2S应助科研通管家采纳,获得10
9秒前
科研小白鼠完成签到,获得积分10
13秒前
马艺帆完成签到,获得积分10
13秒前
雨声完成签到,获得积分10
14秒前
春一又木完成签到,获得积分10
15秒前
英姑应助杏杏采纳,获得10
18秒前
爱听歌火龙果完成签到,获得积分10
20秒前
Orange应助monica采纳,获得10
20秒前
22秒前
淡淡瓜子完成签到 ,获得积分10
22秒前
23秒前
summer给summer的求助进行了留言
26秒前
27秒前
28秒前
yuri关注了科研通微信公众号
33秒前
36秒前
火神杯完成签到,获得积分10
43秒前
45秒前
jyl发布了新的文献求助10
47秒前
49秒前
复杂函完成签到,获得积分10
53秒前
Daisy发布了新的文献求助10
53秒前
54秒前
58秒前
Zhong发布了新的文献求助10
59秒前
59秒前
所所应助Daisy采纳,获得10
1分钟前
1分钟前
高分求助中
Thermodynamic data for steelmaking 3000
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Cross-Cultural Psychology: Critical Thinking and Contemporary Applications (8th edition) 800
Counseling With Immigrants, Refugees, and Their Families From Social Justice Perspectives pages 800
マンネンタケ科植物由来メロテルペノイド類の網羅的全合成/Collective Synthesis of Meroterpenoids Derived from Ganoderma Family 500
Electrochemistry 500
Broflanilide prolongs the development of fall armyworm Spodoptera frugiperda by regulating biosynthesis of juvenile hormone 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2370705
求助须知:如何正确求助?哪些是违规求助? 2079265
关于积分的说明 5206124
捐赠科研通 1806447
什么是DOI,文献DOI怎么找? 901690
版权声明 558148
科研通“疑难数据库(出版商)”最低求助积分说明 481418