弹道
计算机科学
噪音(视频)
降噪
人工智能
过程(计算)
高斯过程
噪声测量
先验概率
生成模型
行人
骨料(复合)
机器学习
高斯分布
高斯噪声
扩散过程
还原(数学)
传感器融合
公制(单位)
算法
计算机视觉
模式识别(心理学)
融合
扩散
生成语法
混合模型
隐马尔可夫模型
作者
Yanghong Liu,Xingping Dong,Yutian Lin,Mang Ye,Kaihao Zhang,Bo Du
标识
DOI:10.1109/tpami.2025.3645918
摘要
Pedestrian behavior exhibits inherent multi-modality, necessitating predictions that balance accuracy and diversity to adapt effectively to various complex scenarios. However, conventional noise addition in diffusion models is often aimless and unguided, leading to redundant noise reduction steps and the generation of uncontrollable samples. To address these issues, we propose a Prior Condition-Guided Diffusion Model (CGD-TraP) for multi-modal pedestrian trajectory prediction. Instead of directly adding Gaussian noise to trajectories at each timestep during the forward process, our approach leverages internal intention and external interaction to guide noise estimation. Specifically, we design two specialized modules to extract and aggregate intention and interaction features. These features are then adaptively fused through a spatial-temporal fusion based on selective state space, which estimates a controllable noisy trajectory distribution. By optimizing the noise addition process in a more controlled and efficient manner, our method ensures that the denoising process is effectively guided, resulting in predictions that are both accurate and diverse. Extensive experiments on the ETH-UCY, SDD, and NBA datasets demonstrate that CGD-TraP surpasses state-of-the-art diffusion-based and other generative methods, achieving superior efficiency, accuracy, and diversity.
科研通智能强力驱动
Strongly Powered by AbleSci AI