FGAHOI: Fine-Grained Anchors for Human-Object Interaction Detection

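Keywords: Computer science; Artificial intelligence; Object detection; Merge (version control); Task (project management); Object (grammar); Machine learning; Pattern recognition (psychology); Information retrieval; Management; Economics
Authors
Shuailei Ma, Yuefeng Wang, Shanze Wang, Ying Wei
Source
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence [Institute of Electrical and Electronics Engineers]
Volume/Issue/Pages: 1-16 | Citations: 4
Identifier
DOI: 10.1109/tpami.2023.3331738
Abstract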

Human-Object Interaction (HOI) detection, an important problem in computer vision, requires locating human-object pairs and identifying the interactions between them. An HOI instance spans a larger range in space, scale, and task than an individual object instance, which makes its detection more susceptible to noisy backgrounds. To reduce the disturbance of noisy backgrounds on HOI detection, it is necessary to use the input image information to generate fine-grained anchors, which are then leveraged to guide the detection of HOI instances. This raises two challenges: (i) how to extract pivotal features from images with complex background information remains an open question; (ii) how to semantically align the extracted features with the query embeddings is also difficult. In this paper, a novel end-to-end transformer-based framework (FGAHOI) is proposed to alleviate these problems. FGAHOI comprises three dedicated components: multi-scale sampling (MSS), hierarchical spatial-aware merging (HSAM), and a task-aware merging mechanism (TAM). MSS extracts features of humans, objects, and interaction areas from noisy backgrounds for HOI instances of various scales. HSAM and TAM then semantically align and merge the extracted features and query embeddings, first from the hierarchical spatial perspective and then from the task perspective. In addition, a novel Stage-wise Training Strategy is designed to reduce the training pressure caused by the complexity of the tasks FGAHOI performs. We also propose two ways to measure the difficulty of HOI detection and a novel dataset, HOI-SDC, for two challenges of HOI instance detection: Uneven Distributed Area in Human-Object Pairs and Long Distance Visual Modeling of Human-Object Pairs. Experiments are conducted on three benchmarks: HICO-DET, HOI-SDC, and V-COCO. Our model outperforms the state-of-the-art HOI detection methods, and extensive ablations reveal the merits of our proposed contributions.