计算机科学
帕斯卡(单位)
人工智能
目标检测
判别式
基本事实
模式识别(心理学)
显著性图
计算机视觉
监督学习
机器学习
对象(语法)
分类器(UML)
图像(数学)
人工神经网络
程序设计语言
作者
Danpei Zhao,Zhichao Yuan,Zhenwei Shi,Fengying Xie
出处
期刊:Neurocomputing
[Elsevier BV]
日期:2021-03-20
卷期号:455: 431-440
被引量:3
标识
DOI:10.1016/j.neucom.2021.03.047
摘要
Abstract Even though weakly-supervised object detection (WSOD) has become an effective method to relieve the heavy work of labeling, there are still difficult problems to be solved. WSOD method represented by a Multiple Instance Learning (MIL) have some common problems including running slowly and focusing on discriminative parts rather than the whole object, which will lead to false detection. To improve the efficiency and accuracy, we propose a single-shot weakly-supervised object detection model guided by empirical saliency model (SSWOD). As human vision always focuses on the most attracting parts of the image, saliency maps can usually guide our model to locate the most promising object areas. By this way, our model takes the saliency areas as pseudo ground-truths to realize the WSOD task with only class labels. Moreover, empirical saliency is designed to refine the pseudo ground-truth and improve the detection. Our new framework not only realizes a one-step detection without region proposals, but also reduces computational consumption. Experiments on PASCAL VOC 2007 & 2012 benchmarks demonstrate that SSWOD is 8 times faster and 5 times smaller than previous approaches, surpassing the state-of-the-art WSOD methods by 6.1% mean average precision (mAP).
科研通智能强力驱动
Strongly Powered by AbleSci AI