发布文献求助

Monocular Road Scene Bird’s Eye View Prediction via Big Kernel-Size Encoder and Spatial-Channel Transform Module

计算机科学人工智能计算机视觉频道（广播）单眼编码器核（代数）数学计算机网络组合数学操作系统

作者

Zhongyu Rao,Hai Wang,Long Chen,Yubo Lian,Yilin Zhong,Ze Liu,Yingfeng Cai

出处

期刊：IEEE Transactions on Intelligent Transportation Systems [Institute of Electrical and Electronics Engineers]
日期：2023-03-16 卷期号：24 (7): 7138-7148 被引量：5

标识

DOI：10.1109/tits.2023.3253554

摘要

A detailed representation of the surrounding road scene is crucial for an autonomous driving system. yellow The camera-based Bird's Eye View map has been a popular solution to present the surrounding information, due to its low cost and rich spatial context information. Most of the existing methods predict the BEV map based on the depth-estimation or the trivial homography method, which may cause the error propagation and the absence of content. To overcome these drawbacks, we propose a novel end-to-end framework that employs the front monocular image to predict the road layout and vehicle occupancy. In particular, to capture the long-range feature, we redesign a CNN encoder with a large kernel size to extract the image features. For reducing the big difference between the front image features and the top-down features, we propose a novel Spatial-Channel projection module to convert the front map into the top-down space. Additionally, concerning the correlation between front view and top-down view, we propose the Dual Cross-view Transformer module to refine the top-down view feature maps and strengthen the transformation. Extensive evaluations on the KITTI and Argoverse datasets present that the proposed model achieves the state-of-the-art results for both datasets. Furthermore, the proposed model runs in 37 FPS on a single GPU, demonstrating the generation of a real-time BEV map. The code will be published at https://github.com/raozhongyu/BEV_LKA.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

⚡ 2026年影响因子、分区 已更新！ (2026-6-17)

更新

📰 新增『新锐期刊分区』 (2026-3-24)

更新

💬 新增更精细的自定义提醒设置 (2026-1-4)

新增

🕒 每天60秒读懂世界·精选全球要闻 (2026-1-2)

新增

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 大西瓜完成签到，获得积分10

2秒前; Hello的应助被Polling采纳，获得10

3秒前; 漫天飞雪_寒江孤影完成签到，获得积分10

3秒前; 科研通AI6.1上传了应助文件

3秒前; fys131415发布了新的文献求助10

3秒前; 小马甲的应助被明理的鼠标采纳，获得10

6秒前; 小二郎的应助被共和国采纳，获得10

7秒前; ding的应助被yy采纳，获得10

8秒前; 在水一方上传了应助文件

9秒前; 小蘑菇上传了应助文件

9秒前; 一只小羊发布了新的文献求助30

11秒前; Copyright上传了应助文件

13秒前; 大西瓜发布了新的文献求助10

14秒前; 坦率尔琴发布了新的文献求助10

14秒前; 领导范儿上传了应助文件

15秒前; Owen上传了应助文件

15秒前; JamesPei的应助被Aryatarg采纳，获得10

16秒前; 顾矜上传了应助文件

17秒前; 柔弱的静芙完成签到，获得积分10

17秒前; 声声完成签到，获得积分10

18秒前; SY1005完成签到，获得积分10

19秒前; lgj发布了新的文献求助10

20秒前; 富贵发布了新的文献求助10

21秒前; ding上传了应助文件

21秒前; 核桃发布了新的文献求助10

22秒前; 完美世界的应助被今晚打老虎采纳，获得10

23秒前; 赫兹完成签到，获得积分10

24秒前; yy发布了新的文献求助10

26秒前; 感动的梦菲关注了科研通微信公众号

27秒前; JamesPei上传了应助文件

27秒前; zzz完成签到，获得积分10

28秒前; lgj完成签到，获得积分10

29秒前; mingxi发布了新的文献求助20

31秒前; 酷波er的应助被svsv采纳，获得10

31秒前; 小马甲上传了应助文件

32秒前; Aryatarg发布了新的文献求助10

34秒前; Humab668完成签到，获得积分10

36秒前; 任性的傲柏完成签到，获得积分10

36秒前; 执玉笛完成签到，获得积分10

37秒前; Vicky完成签到，获得积分10

37秒前

高分求助中: 液晶指向矢仿真分析数据集 8888; Invited Discussant 63O and 64O 1000; Ideology and Meaning-Making under the Putin Regime 750; Advanced Memory Technology 500; Petrology and Plate Tectonics 500; Writing Systems 500; A Handbook of User Experience Research & Design in Libraries 400

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 6865118; 求助须知：如何正确求助？哪些是违规求助？ 8567935; 关于积分的说明 18217564; 捐赠科研通 6234526; 什么是DOI，文献DOI怎么找？ 3049159; 关于科研通互助平台的介绍 2051064; 邀请新用户注册赠送积分活动 2026871

今日热心研友

个性的紫菜

殷勤的紫槐

祝我有座钻石山

含糊的寻雪

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2026 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：821889395【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通