An Empirical Study on Heterogeneous Defect Prediction Approaches

可解释性 计算机科学 公制(单位) 集合(抽象数据类型) 数据挖掘 领域(数学) 软件度量 软件 实证研究 机器学习 软件错误 预测建模 选择(遗传算法) 班级(哲学) 性能指标 转化(遗传学) 人工智能 软件开发 软件质量 程序设计语言 统计 数学 纯数学 管理 化学 运营管理 经济 基因 生物化学
作者
Haowen Chen,Xiao‐Yuan Jing,Zhiqiang Li,Di Wu,Peng Yi,Zhiguo Huang
出处
期刊:IEEE Transactions on Software Engineering [IEEE Computer Society]
卷期号:47 (12): 2803-2822 被引量:49
标识
DOI:10.1109/tse.2020.2968520
摘要

Software defect prediction has always been a hot research topic in the field of software engineering owing to its capability of allocating limited resources reasonably. Compared with cross-project defect prediction (CPDP), heterogeneous defect prediction (HDP) further relaxes the limitation of defect data used for prediction, permitting different metric sets to be contained in the source and target projects. However, there is still a lack of a holistic understanding of existing HDP studies due to different evaluation strategies and experimental settings. In this paper, we provide an empirical study on HDP approaches. We review the research status systematically and compare the HDP approaches proposed from 2014 to June 2018. Furthermore, we also investigate the feasibility of HDP approaches in CPDP. Through extensive experiments on 30 projects from five datasets, we have the following findings: (1) metric transformation-based HDP approaches usually result in better prediction effects, while metric selection-based approaches have better interpretability. Overall, the HDP approach proposed by Li et al. (CTKCCA) currently has the best performance. (2) Handling class imbalance problems can boost the prediction effects, but the improvements are usually limited. In addition, utilizing mixed project data cannot improve the performance of HDP approaches consistently since the label information in the target project is not used effectively. (3) HDP approaches are feasible for cross-project defect prediction in which the source and target projects have the same metric set.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
科研通AI6.3应助斑ban采纳,获得10
3秒前
晚月恋冬发布了新的文献求助10
4秒前
8秒前
六六六关注了科研通微信公众号
9秒前
隐形曼青应助yt采纳,获得10
9秒前
10秒前
科研通AI6.1应助鲤鱼平蓝采纳,获得10
10秒前
乐乐应助Wang采纳,获得10
11秒前
晚月恋冬发布了新的文献求助10
12秒前
15秒前
17秒前
17秒前
zipzhang完成签到 ,获得积分10
17秒前
18秒前
读研有点小难应助逸风望采纳,获得10
18秒前
20秒前
情怀应助科研通管家采纳,获得10
20秒前
大模型应助科研通管家采纳,获得10
20秒前
研友_VZG7GZ应助科研通管家采纳,获得10
20秒前
星辰大海应助科研通管家采纳,获得10
20秒前
今后应助科研通管家采纳,获得10
20秒前
彭于晏应助科研通管家采纳,获得10
20秒前
情怀应助科研通管家采纳,获得10
20秒前
20秒前
20秒前
打打应助欣欣子采纳,获得10
20秒前
21秒前
21秒前
Mor711发布了新的文献求助10
23秒前
FF发布了新的文献求助10
25秒前
luxia完成签到 ,获得积分10
25秒前
朴素凡阳发布了新的文献求助10
26秒前
Ava应助胡先生采纳,获得10
27秒前
大气的夜绿完成签到,获得积分10
28秒前
saner32完成签到,获得积分10
28秒前
28秒前
29秒前
saner32发布了新的文献求助10
31秒前
幽默平安发布了新的文献求助10
32秒前
32秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
Petrology and Plate Tectonics,2025 400
Burger's Medicinal Chemistry and Drug Discovery 400
A Step-by-Step Guide to Qualitative Data Coding 2nd Edition 400
Impact of Storage Orientation and Duration on Prefilled Syringe Performance: Break-Loose and Glide Forces, and Injection Time Across Multiple Time Points 360
Programming for Chemical Engineers Using C, C++, and MATLAB 320
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6679174
求助须知:如何正确求助?哪些是违规求助? 8425715
关于积分的说明 18009406
捐赠科研通 5895894
什么是DOI,文献DOI怎么找? 2980558
邀请新用户注册赠送积分活动 1956457
关于科研通互助平台的介绍 1889092