亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Learning-enabled decision-making for autonomous driving: framework and methodology

笑声 心理学 计算机科学 人机交互 美学 社会心理学 数学教育 艺术
作者
Zhiyu Huang
标识
DOI:10.32657/10356/172842
摘要

The growing adoption of autonomous vehicles (AVs) holds the promise of transforming transportation systems, enhancing traffic safety, and supporting environmental sustainability. Despite significant advancements in advanced perception systems for AVs, the effectiveness of these vehicles in real-world scenarios depends critically on advanced decision-making systems. Therefore, a critical challenge lies in developing safe, robust, and human-like decision-making systems for achieving higher levels of vehicle autonomy. Harnessing the power of artificial intelligence (AI) and machine learning (ML) is pivotal in developing such advanced decision-making systems. These techniques offer promising solutions for tackling key challenges in autonomous driving systems, including scalability, adaptability, and alignment with human driver behaviors. \n \nThis thesis presents a comprehensive framework and a series of learning-based methodologies for decision-making in AVs, with the objective of improving the scalability, adaptability, and alignment of their decision-making systems. The learning-enabled decision-making framework consists of three essential modules: world model, actor, and cost. These modules work collaboratively to provide a unified solution for autonomous machine intelligence and can be learned individually or jointly from human driving data. The methodologies proposed in this thesis focus on three key areas: (1) creating ML-driven actors and cost functions that align with human values, (2) developing ML-based world models capable of accurately forecasting complex human interactions, and (3) designing ML-based decision-making architectures that seamlessly integrate these components. \n \nInitially, we explore each module individually. For the actor module, we propose an approach based on reinforcement learning (RL) that incorporates human prior knowledge into the training process. By combining imitation learning (IL) from human expert demonstrations with RL, a novel learning framework is developed to align agent behavior with human expectations. The results show improved sample efficiency, enabling the agent to navigate complex urban environments safely, while also generating distinct driving styles that closely resemble human driving behaviors. In the cost module, we focus on learning personalized cost functions underlying human driving behaviors using inverse reinforcement learning (IRL). We propose a maximum entropy IRL algorithm that infers the reward function based on a structural assumption about discrete latent intentions guiding continuous low-level control actions. The personalized cost learning method outperforms general cost modeling methods, leading to a more human-like driving experience. Regarding the world model module, the goal is to accurately represent the complexity and dynamics of human behaviors in real-world driving environments. We introduce a joint multi-agent prediction model based on Transformers, leveraging hierarchical game theory to model interactions among agents in driving scenarios. The model employs a novel Transformer decoder structure and a learning process that regulates agent behavior to respond to other agents' behaviors. Extensive experiments validate the model's state-of-the-art prediction accuracy and its capability to jointly reason about ego agent motion plans and other agents' behaviors. \n \nFollowing individual assessments, unified predictive decision-making architectures are presented, harmonizing the three modules. A comprehensive behavior planning framework that integrates all three core modules is proposed. It generates diverse trajectory proposals, forecasts multi-modal futures for other agents, and evaluates candidate plans using a learned cost function. Importantly, this framework includes a conditional motion predictor (CMP) that forecasts other agents' responses to the potential actions of the AV. The CMP improves prediction accuracy and facilitates the selection of human-like behaviors. Moreover, we propose a differentiable and integrated prediction-planning framework that enables end-to-end training of the entire decision-making system. The framework incorporates a differentiable nonlinear optimizer as the motion planner, allowing the gradients to flow between the three core modules and directly optimizing the planning performance. The experimental results demonstrate superior performance in both open-loop and closed-loop testing, effectively handling complex urban driving scenarios. \n \nThis thesis not only makes significant contributions to the advancement of autonomous driving systems but also lays the foundation for future research in learning-based decision-making for a wide range of intelligent systems. It paves the way for safe and human-centered autonomy, while driving the evolution of intelligent transportation systems.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Akim应助魔幻乘云采纳,获得10
1秒前
Thrain发布了新的文献求助10
1秒前
Leonard发布了新的文献求助10
2秒前
姆姆没买完成签到 ,获得积分0
4秒前
4秒前
魔幻乘云完成签到,获得积分20
12秒前
13秒前
Hello应助Thrain采纳,获得10
16秒前
干净傲霜完成签到 ,获得积分10
18秒前
魔幻乘云发布了新的文献求助10
18秒前
竹筏过海完成签到,获得积分0
20秒前
科研通AI6应助科研通管家采纳,获得10
24秒前
科研通AI6应助科研通管家采纳,获得10
24秒前
25秒前
兼听则明应助科研通管家采纳,获得100
25秒前
慕青应助科研通管家采纳,获得10
25秒前
酷波er应助科研通管家采纳,获得10
25秒前
深情安青应助科研通管家采纳,获得10
25秒前
香蕉觅云应助科研通管家采纳,获得10
25秒前
坤桑完成签到 ,获得积分10
27秒前
山野完成签到 ,获得积分10
28秒前
31秒前
竺七完成签到 ,获得积分10
32秒前
siwei发布了新的文献求助30
35秒前
GD发布了新的文献求助50
40秒前
量子星尘发布了新的文献求助10
42秒前
44秒前
47秒前
55秒前
金毛上将完成签到,获得积分10
56秒前
57秒前
咕噜关注了科研通微信公众号
58秒前
沉默寡严完成签到,获得积分10
58秒前
传奇3应助Wzh采纳,获得30
1分钟前
luxixi完成签到,获得积分10
1分钟前
(# ̄д ̄)完成签到,获得积分10
1分钟前
顺利的源智完成签到 ,获得积分10
1分钟前
1分钟前
落叶捎来讯息完成签到 ,获得积分10
1分钟前
荆玉豪完成签到 ,获得积分10
1分钟前
高分求助中
Aerospace Standards Index - 2025 10000
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Video: Lagrangian coherent structures in the flow field of a fluidic oscillator 2000
Clinical Microbiology Procedures Handbook, Multi-Volume, 5th Edition 1000
Teaching Language in Context (Third Edition) 1000
List of 1,091 Public Pension Profiles by Region 961
流动的新传统主义与新生代农民工的劳动力再生产模式变迁 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 纳米技术 计算机科学 内科学 化学工程 复合材料 物理化学 基因 遗传学 催化作用 冶金 量子力学 光电子学
热门帖子
关注 科研通微信公众号,转发送积分 5449605
求助须知:如何正确求助?哪些是违规求助? 4557722
关于积分的说明 14264820
捐赠科研通 4480856
什么是DOI,文献DOI怎么找? 2454582
邀请新用户注册赠送积分活动 1445382
关于科研通互助平台的介绍 1421087