计算机科学
期限(时间)
人工智能
估计
计算机视觉
工程类
物理
系统工程
量子力学
作者
Bo Wang,Jian Li,Yang Yu,Li Liu,Zhenping Sun,Dewen Hu
标识
DOI:10.1109/tpami.2025.3572489
摘要
Considering that scene flow estimation has the capability of the spatial domain to focus but lacks the coherence of the temporal domain, this study proposes long-term scene flow estimation (LSFE), a comprehensive task that can simultaneously capture the fine-grained and long-term 3D motion in an online manner. We introduce SceneTracker, the first LSFE network that adopts an iterative approach to approximate the optimal 3D trajectory. The network dynamically and simultaneously indexes and constructs appearance correlation and depth residual features. Transformers are then employed to explore and utilize long-range connections within and between trajectories. With detailed experiments, SceneTracker shows superior capabilities in addressing 3D spatial occlusion and depth noise interference, highly tailored to the needs of the LSFE task. We build a real-world evaluation dataset, LSFDriving, for the LSFE field and use it in experiments to further demonstrate the advantage of SceneTracker in generalization abilities.
科研通智能强力驱动
Strongly Powered by AbleSci AI