假阳性悖论
计算机科学
可用性
排名(信息检索)
预警系统
特征(语言学)
优先次序
机器学习
软件
集合(抽象数据类型)
人工智能
数据挖掘
比例(比率)
工程类
哲学
物理
程序设计语言
管理科学
人机交互
电信
量子力学
语言学
作者
Mingshuang Qing,Xia‐Ting Feng,Jun Luo,Wanmin Huang,Jingui Zhang,Ping Wang,Yong Fan,Xiuting Ge,Yanglin Pan
标识
DOI:10.1109/qrs-c55045.2021.00103
摘要
Static analysis tools (SATs) can automatically detect software defects by analyzing the software source code. However, there are a large number of false positives in the warnings generated by SATs, which hinder the usability of SATs. To address this problem, this paper proposes an approach to constructing a machine learning-based warning prioritization model. Our approach explores the 91 warning features and obtains an optimal feature set with 25 features. Subsequently, the machine learning model is applied to learn a warning prioritization model based on the optimal feature set. To evaluate the warning prioritization model, we conduct the experimental evaluation on a real, open source, and large-scale warning dataset. The experimental results show that our approach is better than the random ranking algorithm. On average, in comparison with the random ranking algorithm, our approach reduces the number of false positives by 30.56% – 48.18% when developers check the warning list to find each true defect.
科研通智能强力驱动
Strongly Powered by AbleSci AI