Multi-virtual View Scoring Network for 3D Hand Pose Estimation from a Single Depth Image

计算机科学 特征(语言学) 人工智能 计算机视觉 偏移量(计算机科学) 姿势 过程(计算) 虚拟映像 编码(内存) 点云 架空(工程) 模式识别(心理学) 哲学 语言学 程序设计语言 操作系统
作者
Yingli Tian,Chen Li,Tian Lan
出处
期刊:Communications in computer and information science 卷期号:: 147-164
标识
DOI:10.1007/978-981-99-9109-9_15
摘要

3D hand pose estimation is a crucial subject in the domain of computer vision. Recently researchers transform a single depth image into multiple virtual view depth images. By projecting a single depth image through point cloud transformation and using the depth images of multiple virtual views together for hand pose estimation, these methods can effectively improve the estimation accuracy. However, current methods have issues with distorted generated depth images, insufficient usage of the depth image of each view, and high computational overhead. To overcome these problems, we introduce a multi-virtual view scoring network (MVSN). Our proposed MVSN consists of a single virtual view estimation module, virtual view feature encoding module, and virtual view scoring module. To generate an intermediate feature map suitable for virtual view scoring, the single virtual view estimation module uses a feature map offset loss function and enhance information interaction between channels in the backbone network. The virtual view feature encoding module adopts a two-branch structure to capture information about all joints and single joints from the intermediate feature map, respectively. This structure effectively improves model sensitivity to each view, better integrates information from each virtual view, and obtains a more appropriate scoring feature for each virtual view. The virtual view scoring module scores each view based on the scoring feature, and gives a higher score to the more accurately estimated virtual view. We also propose a dynamic virtual view removal strategy to remove poor quality views in the training process. Our model is tested on the NYU and ICVL datasets, and the mean joint error is 6.21 mm and 4.53 mm, respectively, exhibiting better estimation accuracy than existing methods.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
xiaozheng完成签到,获得积分10
1秒前
万能图书馆应助jjj采纳,获得10
5秒前
Zo完成签到,获得积分10
7秒前
勤恳的雪卉完成签到,获得积分10
8秒前
Coral.完成签到 ,获得积分10
9秒前
SciGPT应助zhouzhou打工人采纳,获得10
11秒前
12秒前
wpz完成签到,获得积分10
13秒前
一次过发布了新的文献求助10
16秒前
23秒前
宫城完成签到,获得积分10
28秒前
风中半山完成签到 ,获得积分20
28秒前
Jasper应助喵星采纳,获得10
33秒前
37秒前
li发布了新的文献求助10
42秒前
45秒前
46秒前
Akim应助娇气的友易采纳,获得10
49秒前
55秒前
56秒前
Hello应助江流有声采纳,获得10
59秒前
化云完成签到,获得积分0
1分钟前
三楼发布了新的文献求助10
1分钟前
香蕉觅云应助crowling采纳,获得10
1分钟前
背后寒烟完成签到 ,获得积分10
1分钟前
1分钟前
深情安青应助Nolan采纳,获得30
1分钟前
77完成签到,获得积分10
1分钟前
crowling完成签到,获得积分10
1分钟前
英俊的铭应助冷艳的咖啡采纳,获得10
1分钟前
angeldrn完成签到,获得积分10
1分钟前
1分钟前
领导范儿应助科研通管家采纳,获得10
1分钟前
今后应助科研通管家采纳,获得30
1分钟前
所所应助科研通管家采纳,获得10
1分钟前
李健应助科研通管家采纳,获得10
1分钟前
1分钟前
充电宝应助科研通管家采纳,获得10
1分钟前
李健应助li采纳,获得10
1分钟前
深情安青应助漫漫采纳,获得10
1分钟前
高分求助中
Teaching Social and Emotional Learning in Physical Education 900
Plesiosaur extinction cycles; events that mark the beginning, middle and end of the Cretaceous 800
Recherches Ethnographiques sue les Yao dans la Chine du Sud 500
Two-sample Mendelian randomization analysis reveals causal relationships between blood lipids and venous thromboembolism 500
Chinese-English Translation Lexicon Version 3.0 500
[Lambert-Eaton syndrome without calcium channel autoantibodies] 460
Wisdom, Gods and Literature Studies in Assyriology in Honour of W. G. Lambert 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2394150
求助须知:如何正确求助?哪些是违规求助? 2097973
关于积分的说明 5286523
捐赠科研通 1825434
什么是DOI,文献DOI怎么找? 910174
版权声明 559960
科研通“疑难数据库(出版商)”最低求助积分说明 486453