DriveGPT4: Interpretable End-to-End Autonomous Driving Via Large Language Model

End-to-end principle; Computer science; Language model; End of history; Artificial intelligence; Political science; Politics; Law

Authors

Zhenhua Xu,Yujia Zhang,Enze Xie,Zhao Zhen,Yong Guo,Kenneth K. Wong,Zhenguo Li,Hengshuang Zhao

Source

Journal: IEEE Robotics and Automation Letters [Institute of Electrical and Electronics Engineers]
Date: 2024-08-07 Volume (Issue): 9 (10): 8186-8193 Citations: 312

Identifier

DOI: 10.1109/lra.2024.3440097

Abstract

Multimodal large language models (MLLMs) have emerged as a prominent area of interest within the research community, given their proficiency in handling and reasoning with non-textual data, including images and videos. This study seeks to extend the application of MLLMs to the realm of autonomous driving by introducing DriveGPT4, a novel interpretable end-to-end autonomous driving system based on LLMs. Capable of processing multi-frame video inputs and textual queries, DriveGPT4 facilitates the interpretation of vehicle actions, offers pertinent reasoning, and effectively addresses a diverse range of questions posed by users. Furthermore, DriveGPT4 predicts low-level vehicle control signals in an end-to-end fashion. These advanced capabilities are achieved through the utilization of a bespoke visual instruction tuning dataset, specifically tailored for autonomous driving applications, in conjunction with a mix-finetuning training strategy. DriveGPT4 represents the pioneering effort to leverage LLMs for the development of an interpretable end-to-end autonomous driving solution. Evaluations conducted on the BDD-X dataset showcase the superior qualitative and quantitative performance of DriveGPT4. Additionally, the fine-tuning of domain-specific data enables DriveGPT4 to yield close or even improved results in terms of autonomous driving grounding when contrasted with GPT4-V.
