Poison Attack and Poison Detection on Deep Source Code Processing Models

计算机科学 源代码 计算机安全 深度学习 编码(集合论) 可解释性 脆弱性(计算) 代码库 人工智能 程序设计语言 集合(抽象数据类型)
作者
Jia Li,Zhuo Li,Huangzhao Zhang,Ge Li,Zhi Jin,Xing Hu,Xin Xia
出处
期刊:ACM Transactions on Software Engineering and Methodology [Association for Computing Machinery]
卷期号:33 (3): 1-31 被引量:7
标识
DOI:10.1145/3630008
摘要

In the software engineering (SE) community, deep learning (DL) has recently been applied to many source code processing tasks, achieving state-of-the-art results. Due to the poor interpretability of DL models, their security vulnerabilities require scrutiny. Recently, researchers have identified an emergent security threat to DL models, namely, poison attacks . The attackers aim to inject insidious backdoors into DL models by poisoning the training data with poison samples. The backdoors mean that poisoned models work normally with clean inputs but produce targeted erroneous results with inputs embedded with specific triggers. By using triggers to activate backdoors, attackers can manipulate poisoned models in security-related scenarios (e.g., defect detection) and lead to severe consequences. To verify the vulnerability of deep source code processing models to poison attacks, we present a poison attack approach for source code named CodePoisoner as a strong imaginary enemy. CodePoisoner can produce compilable and functionality-preserving poison samples and effectively attack deep source code processing models by poisoning the training data with poison samples. To defend against poison attacks, we further propose an effective poison detection approach named CodeDetector . CodeDetector can automatically identify poison samples in the training data. We apply CodePoisoner and CodeDetector to six deep source code processing models, including defect detection, clone detection, and code repair models. The results show that ❶ CodePoisoner conducts successful poison attacks with a high attack success rate (average: 98.3%, maximum: 100%). It validates that existing deep source code processing models have a strong vulnerability to poison attacks. ❷ CodeDetector effectively defends against multiple poison attack approaches by detecting (maximum: 100%) poison samples in the training data. We hope this work can help SE researchers and practitioners notice poison attacks and inspire the design of more advanced defense techniques.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
liushoujia完成签到,获得积分0
刚刚
等等发布了新的文献求助10
1秒前
1501929468完成签到,获得积分10
1秒前
方方方完成签到,获得积分10
1秒前
strike完成签到,获得积分0
2秒前
小鱼完成签到,获得积分10
2秒前
ncuwzq完成签到,获得积分10
2秒前
pypi_2020完成签到,获得积分10
2秒前
拼豆豆发布了新的文献求助10
3秒前
张占完成签到,获得积分10
3秒前
yueyueyeu完成签到,获得积分10
3秒前
互助开心应助飞飞飞采纳,获得10
3秒前
kexin完成签到,获得积分10
4秒前
木木完成签到,获得积分10
4秒前
飘逸的高山完成签到,获得积分20
4秒前
xuhang完成签到,获得积分10
5秒前
Silence完成签到,获得积分0
5秒前
Chris完成签到,获得积分10
6秒前
dan1029完成签到,获得积分10
6秒前
feng8848完成签到,获得积分10
6秒前
6秒前
小齐爱科研完成签到,获得积分10
6秒前
微纳组刘同完成签到,获得积分10
7秒前
开着飞机骑拖拉机完成签到,获得积分10
7秒前
kerwin发布了新的文献求助30
7秒前
乐乐应助永远的得胜同志采纳,获得10
7秒前
李浩完成签到,获得积分10
7秒前
思源应助小余采纳,获得10
7秒前
困困完成签到,获得积分10
7秒前
芽卉完成签到,获得积分10
8秒前
狂野土豆完成签到 ,获得积分10
8秒前
李爱国应助超级小凝采纳,获得10
8秒前
8秒前
等等完成签到,获得积分10
8秒前
搜集达人应助爱学习采纳,获得10
8秒前
bkagyin应助复杂斓采纳,获得30
9秒前
hou完成签到,获得积分10
9秒前
zhiqi完成签到,获得积分10
10秒前
112233完成签到,获得积分10
10秒前
mufulee发布了新的文献求助30
11秒前
高分求助中
GL 2 A method for assessing the in-place cleanability of food processing equipment, Fourth Edition, December 2023 3000
Annie Ernaux: De la perte au corps glorieux 600
Writing Systems 500
类器官构建与应用:从基础到前沿 500
Electric Vehicle Powertrains Design Fundamentals, Components, and Applications 400
Handbook on Planning and Climate Change Adaptation 400
Optical Coating Design with the Essential Macleod 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6807493
求助须知:如何正确求助?哪些是违规求助? 8524384
关于积分的说明 18145243
捐赠科研通 6131122
什么是DOI,文献DOI怎么找? 3028460
邀请新用户注册赠送积分活动 2005053
关于科研通互助平台的介绍 2001953