人工智能
姿势
计算机科学
计算机视觉
估计员
一致性(知识库)
能见度
像素
跟踪(教育)
图形
光流
关节式人体姿态估计
三维姿态估计
模式识别(心理学)
数学
图像(数学)
理论计算机科学
光学
物理
心理学
统计
教育学
作者
Yalong Jiang,Wenrui Ding,Hongguang Li,Zheru Chi
标识
DOI:10.1109/tip.2024.3405339
摘要
In this paper, we propose a novel framework for multi-person pose estimation and tracking on challenging scenarios. In view of occlusions and motion blurs which hinder the performance of pose tracking, we proposed to model humans as graphs and perform pose estimation and tracking by concentrating on the visible parts of human bodies which are informative about complete skeletons under incomplete observations. Specifically, the proposed framework involves three parts: (i) A Sparse Key-point Flow Estimating Module (SKFEM) and a Hierarchical Graph Distance Minimizing Module (HGMM) for estimating pixel-level and human-level motion, respectively; (ii) Pixel-level appearance consistency and human-level structural consistency are combined in measuring the visibility scores of body joints. The scores guide the pose estimator to predict complete skeletons by observing high-visibility parts, under the assumption that visible and invisible parts are inherently correlated in human part graphs. The pose estimator is iteratively fine-tuned to achieve this capability; (iii) Multiple historical frames are combined to benefit tracking which is implemented using HGMM. The proposed approach not only achieves state-of-the-art performance on PoseTrack datasets but also contributes to significant improvements in other tasks such as human-related anomaly detection.
科研通智能强力驱动
Strongly Powered by AbleSci AI