Weakly Supervised Object Detection Using Proposal- and Semantic-Level Relationships

计算机科学 帕斯卡(单位) 人工智能 机器学习 视觉推理 目标检测 图形 过程(计算) 背景(考古学) 深度学习 视觉对象识别的认知神经科学 对象(语法) 卷积神经网络 模式识别(心理学) 理论计算机科学 古生物学 操作系统 程序设计语言 生物
作者
Dingwen Zhang,Wenyuan Zeng,Jie-Ru Yao,Junwei Han
出处
期刊:IEEE Transactions on Pattern Analysis and Machine Intelligence [Institute of Electrical and Electronics Engineers]
卷期号:44 (6): 3349-3363 被引量:22
标识
DOI:10.1109/tpami.2020.3046647
摘要

In recent years, weakly supervised object detection has attracted great attention in the computer vision community. Although numerous deep learning-based approaches have been proposed in the past few years, such an ill-posed problem is still challenging and the learning performance is still behind the expectation. In fact, most of the existing approaches only consider the visual appearance of each proposal region but ignore to make use of the helpful context information. To this end, this paper introduces two levels of context into the weakly supervised learning framework. The first one is the proposal-level context, i.e., the relationship of the spatially adjacent proposals. The second one is the semantic-level context, i.e., the relationship of the co-occurring object categories. Therefore, the proposed weakly supervised learning framework contains not only the cognition process on the visual appearance but also the reasoning process on the proposal- and semantic-level relationships, which leads to the novel deep multiple instance reasoning framework. Specifically, built upon a conventional CNN-based network architecture, the proposed framework is equipped with two additional graph convolutional network-based reasoning models to implement object location reasoning and multi-label reasoning within an end-to-end network training procedure. Comprehensive experiments on the widely used PASCAL VOC and MS COCO benchmarks have been implemented, which demonstrate the superior capacity of the proposed approach when compared with other state-of-the-art methods and baseline models.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
建议保存本图,每天支付宝扫一扫(相册选取)领红包
实时播报
刚刚
1秒前
自信筮发布了新的文献求助10
3秒前
4秒前
chaojikeyan发布了新的文献求助10
4秒前
情怀应助yatou采纳,获得10
5秒前
8秒前
他方世界完成签到,获得积分10
9秒前
chaojikeyan完成签到,获得积分10
11秒前
11秒前
沉默凌寒发布了新的文献求助10
12秒前
满意的嵩发布了新的文献求助10
17秒前
like1994发布了新的文献求助10
17秒前
无花果应助JianminLuo采纳,获得10
19秒前
19秒前
大媛媛完成签到,获得积分10
19秒前
科研通AI2S应助蔺丹翠采纳,获得10
25秒前
hua应助自信筮采纳,获得10
26秒前
大媛媛发布了新的文献求助10
27秒前
molly雨轩发布了新的文献求助10
27秒前
豪豪发布了新的文献求助50
29秒前
华仔应助周斌采纳,获得10
29秒前
30秒前
李健应助jjq采纳,获得10
31秒前
34秒前
GuSiwen发布了新的文献求助10
35秒前
LeoYiS214发布了新的文献求助10
36秒前
ding应助小白采纳,获得10
38秒前
怕黑道消完成签到 ,获得积分10
39秒前
39秒前
cllcx完成签到,获得积分10
40秒前
天天快乐应助洁净的亦竹采纳,获得30
40秒前
leptin完成签到 ,获得积分10
40秒前
爆米花应助qqqyy采纳,获得10
41秒前
一个小菜鸡完成签到 ,获得积分10
42秒前
chengzugen发布了新的文献求助10
43秒前
华仔应助曲书文采纳,获得10
44秒前
善良元龙完成签到,获得积分10
45秒前
47秒前
彩色夜阑完成签到,获得积分10
48秒前
高分求助中
Teaching Social and Emotional Learning in Physical Education 1000
Guide to Using WVASE Spectroscopic Ellipsometry Data Acquisition and Analysis Software 600
Multifunctionality Agriculture: A New Paradigm for European Agriculture and Rural Development 500
grouting procedures for ground source heat pump 500
ANDA Litigation: Strategies and Tactics for Pharmaceutical Patent Litigators Second 版本 500
中国志愿服务发展报告(2022~2023) 300
The Commercialization of Pharmaceutical Patents in China (Asian Commercial, Financial and Economic Law and Policy series) 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2335491
求助须知:如何正确求助?哪些是违规求助? 2022558
关于积分的说明 5063977
捐赠科研通 1772859
什么是DOI,文献DOI怎么找? 887336
版权声明 555736
科研通“疑难数据库(出版商)”最低求助积分说明 472842