计算机科学
软件度量
软件错误
Java
源代码
软件质量
软件
随机森林
预测建模
决策树
数据挖掘
编码(集合论)
软件工程
软件开发
机器学习
程序设计语言
集合(抽象数据类型)
作者
Rebro, Dominik Arne,Stanislav Chren,Bruno Rossi
标识
DOI:10.1145/3555776.3577809
摘要
In current research, there are contrasting results about the applicability of software source code metrics as features for defect prediction models. The goal of the paper is to evaluate the adoption of software metrics in models for software defect prediction, identifying the impact of individual source code metrics. With an empirical study on 275 release versions of 39 Java projects mined from GitHub, we compute 12 software metrics and collect software defect information. We train and compare three defect classification models. The results across all projects indicate that Decision Tree (DT) and Random Forest (RF) classifiers show the best results. Among the highest-performing individual metrics are NOC, NPA, DIT, and LCOM5. While other metrics, such as CBO, do not bring significant improvements to the models.
科研通智能强力驱动
Strongly Powered by AbleSci AI