A multimodal stepwise-coordinating framework for pedestrian trajectory prediction

行人 弹道 计算机科学 人工智能 工程类 运输工程 物理 天文
作者
Yijun Wang,Zhiqiang Guo,Chang Xu,Jianxin Lin
出处
期刊:Knowledge Based Systems [Elsevier BV]
卷期号:299: 112038-112038
标识
DOI:10.1016/j.knosys.2024.112038
摘要

Pedestrian trajectory prediction from the first-person view has still been considered one of the challenging problems in automatic driving due to the difficulty of understanding and predicting pedestrian actions. Observing that pedestrian motion naturally contains the repetitive pattern of the gait cycle and global intention information, we design a Multimodal Stepwise-Coordinating Network, namely MSCN, to sufficiently leverage the underlying human motion properties. Specifically, we first design a multimodal spatial-frequency encoder, which encodes the periodicity of pedestrian motion with a frequency-domain enhanced Transformer and other visual information with a spatial-domain Transformer. Then, we propose a stepwise-coordinating decoder structure, which leverages both local and global information in sequence decoding through a two-stage decoding process. After generating a coarse sequence from the stepwise trajectory predictor, we design a coordinator to aggregate the corresponding representations used to generate the coarse sequence. Subsequently, the coordinator learns to output a refined sequence through a knowledge distillation process based on the aggregated representations. In this way, MSCN can adequately capture the representations of short-term motion behaviors, thus modeling better long-term sequence prediction. Extensive experiments show that the proposed model can achieve significant improvements over state-of-the-art approaches on the PIE and JAAD datasets by 16.1% and 16.4% respectively.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
LOVER完成签到 ,获得积分10
刚刚
sadscv发布了新的文献求助10
2秒前
方远锋完成签到,获得积分10
3秒前
今后应助胡萝卜叶子采纳,获得30
6秒前
7秒前
7秒前
8秒前
暖暖完成签到 ,获得积分10
8秒前
10秒前
丘比特应助白图采纳,获得10
11秒前
Nzoth完成签到,获得积分10
12秒前
vic303发布了新的文献求助10
13秒前
lyl发布了新的文献求助10
14秒前
Hello应助sadscv采纳,获得30
14秒前
木雨完成签到 ,获得积分10
17秒前
haki发布了新的文献求助10
17秒前
emberflow完成签到,获得积分10
19秒前
19秒前
JamesPei应助乌梅橘子茶采纳,获得10
19秒前
布曲发布了新的文献求助10
23秒前
应应完成签到,获得积分10
25秒前
26秒前
Alexbirchurros完成签到 ,获得积分10
27秒前
29秒前
土冂足各发布了新的文献求助10
32秒前
Akim应助木雨采纳,获得10
32秒前
哈哈完成签到 ,获得积分10
35秒前
36秒前
爆米花应助厚厚采纳,获得30
37秒前
38秒前
科研通AI5应助科研通管家采纳,获得30
38秒前
huqianxue关注了科研通微信公众号
39秒前
卡卡西应助科研通管家采纳,获得10
39秒前
852应助科研通管家采纳,获得100
39秒前
bkagyin应助科研通管家采纳,获得10
39秒前
只A不B应助科研通管家采纳,获得10
39秒前
伶俐绿柏完成签到,获得积分10
39秒前
脑洞疼应助科研通管家采纳,获得10
39秒前
田様应助科研通管家采纳,获得10
39秒前
科研通AI5应助科研通管家采纳,获得10
39秒前
高分求助中
Introduction to Strong Mixing Conditions Volumes 1-3 500
Tip60 complex regulates eggshell formation and oviposition in the white-backed planthopper, providing effective targets for pest control 400
Optical and electric properties of monocrystalline synthetic diamond irradiated by neutrons 320
共融服務學習指南 300
Essentials of Pharmacoeconomics: Health Economics and Outcomes Research 3rd Edition. by Karen Rascati 300
Peking Blues // Liao San 300
E-commerce live streaming impact analysis based on stimulus-organism response theory 260
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3801455
求助须知:如何正确求助?哪些是违规求助? 3347217
关于积分的说明 10332634
捐赠科研通 3063494
什么是DOI,文献DOI怎么找? 1681768
邀请新用户注册赠送积分活动 807719
科研通“疑难数据库(出版商)”最低求助积分说明 763867