Multistage Spatio-Temporal Networks for Robust Sketch Recognition

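Computer science, Artificial intelligence, Recurrent neural network, Pattern recognition (psychology), Convolutional neural network, Sketch, Feature (linguistics), Artificial neural network, Algorithm, Linguistics, Philosophy
Authors
Hanhui Li, Xudong Jiang, Boliang Guan, Ruomei Wang, Nadia Magnenat Thalmann
Source
Journal: IEEE Transactions on Image Processing [Institute of Electrical and Electronics Engineers]
Volume: 31, pages 2683-2694. Citations: 16
Identifier
DOI: 10.1109/tip.2022.3160240
Abstract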

Sketch recognition relies on two types of information: spatial contexts, such as the local structures in images, and temporal contexts, such as the order of strokes. Existing methods usually adopt convolutional neural networks (CNNs) to model spatial contexts and recurrent neural networks (RNNs) to model temporal contexts. However, most of them combine spatial and temporal features through late fusion or a single-stage transformation, which is prone to losing the informative details in sketches. To tackle this problem, we propose a novel framework built on multi-stage interaction and refinement of spatial and temporal features. Specifically, given a sketch represented by a stroke array, we first generate a temporal-enriched image (TEI), a pseudo-color image that retains the temporal order of strokes, to overcome the difficulty CNNs have in leveraging temporal information. We then construct a dual-branch network in which an RNN branch and a CNN branch process the stroke array and the TEI, respectively. In the early stages of our network, considering the limited ability of RNNs to capture spatial structures, we use multiple enhancement modules to enhance the stroke features with the TEI features. In the last stage, we propose a spatio-temporal enhancement module that refines stroke features and TEI features in a joint feature space. Furthermore, we propose a bidirectional temporal-compatible unit, which adaptively merges features in opposite temporal orders, to help RNNs handle abrupt strokes. Comprehensive experimental results on QuickDraw and TU-Berlin demonstrate that the proposed method is a robust and efficient solution for sketch recognition.
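
To make the idea of a temporal-enriched image concrete, the following is a minimal sketch of one way to rasterize a stroke array into a pseudo-color image whose colors encode drawing order. The abstract does not specify the actual TEI encoding, so everything here is an assumption made for illustration: the function name `temporal_enriched_image`, the input layout (a list of strokes, each a sequence of (x, y) points in [0, 1]), and the color scheme (red rises and blue falls with drawing time, green marks ink).

```python
import numpy as np


def temporal_enriched_image(strokes, size=64):
    """Rasterize strokes into a pseudo-color image that encodes drawing order.

    strokes: list of strokes, each a sequence of (x, y) points in [0, 1],
             listed in the order they were drawn (hypothetical input layout).
    Returns a float32 array of shape (size, size, 3); background pixels are 0.
    """
    image = np.zeros((size, size, 3), dtype=np.float32)

    # Flatten all strokes into one time-ordered list of line segments.
    segments = []
    for stroke in strokes:
        points = np.asarray(stroke, dtype=np.float32)
        segments.extend(zip(points[:-1], points[1:]))

    count = len(segments)
    # Sample each segment densely enough that no pixel along it is skipped.
    alphas = np.linspace(0.0, 1.0, 2 * size, dtype=np.float32)

    for index, (start, end) in enumerate(segments):
        # Normalized drawing time covered by this segment.
        t0, t1 = index / count, (index + 1) / count
        t = t0 + (t1 - t0) * alphas

        # Interpolate positions along the segment and map them to pixels.
        xy = start[None, :] * (1.0 - alphas[:, None]) + end[None, :] * alphas[:, None]
        cols = np.clip(np.rint(xy[:, 0] * (size - 1)), 0, size - 1).astype(int)
        rows = np.clip(np.rint(xy[:, 1] * (size - 1)), 0, size - 1).astype(int)

        # Assumed color code: red = time, green = ink mask, blue = 1 - time.
        image[rows, cols, 0] = t
        image[rows, cols, 1] = 1.0
        image[rows, cols, 2] = 1.0 - t

    return image


# Two horizontal strokes: the top edge is drawn first, the bottom edge second.
strokes = [[(0.0, 0.0), (1.0, 0.0)], [(1.0, 1.0), (0.0, 1.0)]]
tei = temporal_enriched_image(strokes, size=64)
```

In this sketch a later stroke receives a larger red value than an earlier one, so a CNN operating on the image can in principle distinguish drawing order from color alone. Where strokes overlap, later ink overwrites earlier ink, which is a simplification of this illustration rather than a property claimed by the paper.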