发布文献求助

Multi-Modal Feature Pyramid Transformer for RGB-Infrared Object Detection

人工智能计算机科学计算机视觉 RGB颜色模型棱锥（几何）变压器特征（语言学）模式识别（心理学）情态动词目标检测特征提取模式工程类数学哲学社会学电气工程语言学电压化学高分子化学社会科学几何学

作者

Yaohui Zhu,Xiaoyu Sun,Miao Wang,Hua Huang

出处

期刊：IEEE Transactions on Intelligent Transportation Systems [Institute of Electrical and Electronics Engineers]
日期：2023-04-19 卷期号：24 (9): 9984-9995 被引量：84

标识

DOI：10.1109/tits.2023.3266487

摘要

RGB-Infrared multi-modal object detection utilizes diverse and complementary information, showing some advantages in intelligent transportation field. The main challenge of RGB-Infrared object detection is how to fuse the two modalities. The difficulty of fusion is reflected in two aspects: 1) large visual differences between modalities make it difficult to learn effective complementary features, 2) some misaligned RGB-Infrared images increase the difficulty of fusion. To this end, based on feature pyramid commonly used in object detection, we propose Multi-modal Feature Pyramid Transformer (MFPT) to fuse the two modalities. The proposed MFPT learns semantic and modal complementary information to enhance each modal features via intra-modal feature pyramid transformer and inter-modal feature pyramid transformer. The intra-modal feature pyramid transformer enables features to interact across space and scales, improving the semantic representations of features in each modality. The inter-modal feature pyramid transformer conducts feature interaction between modalities, enabling each modality to learn complementary features from other modalities. Meanwhile, the inter-modal feature pyramid transformer can also learn distance independent dependencies between modalities, which are not sensitive to misaligned images. Furthermore, a local attention mechanism is introduced within different windows into MFPT to achieve efficient correlation between regions of different scales or different modalities. Experimental results on two RGB-Infrared detection datasets demonstrate the proposed method is superior to state-of-the-art methods.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

⚡ 2026年影响因子、分区 已更新！ (2026-6-17)

更新

📰 新增『新锐期刊分区』 (2026-3-24)

更新

💬 新增更精细的自定义提醒设置 (2026-1-4)

新增

🕒 每天60秒读懂世界·精选全球要闻 (2026-1-2)

新增

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: dida完成签到，获得积分10

刚刚; 科研通AI6.4的应助被yeee采纳，获得10

1秒前; Orange上传了应助文件

2秒前; 香蕉觅云的应助被Siran采纳，获得10

2秒前; Akim的应助被八荒来犬采纳，获得10

3秒前; 感动冬灵完成签到，获得积分10

4秒前; Nole上传了应助文件

5秒前; Tianz发布了新的文献求助100

6秒前; 眼睛大的一一的应助被lllzz采纳，获得10

6秒前; Lucas的应助被削土豆牛腩采纳，获得10

6秒前; 乐乐的应助被野性的沛儿采纳，获得10

6秒前; Lucas的应助被直率雪曼采纳，获得10

7秒前; 小蘑菇的应助被Longkun_Li采纳，获得20

8秒前; 丘比特上传了应助文件

8秒前; Onlyone完成签到，获得积分10

8秒前; 乐乐上传了应助文件

8秒前; 彭于晏上传了应助文件

9秒前; 脑洞疼的应助被兴奋幻桃采纳，获得10

9秒前; 仁爱晓兰发布了新的文献求助10

11秒前; 糖豆包子发布了新的文献求助10

11秒前; 顾矜的应助被Firsterchao采纳，获得10

12秒前; 大胆的寻菡完成签到，获得积分10

12秒前; zyc1111111完成签到，获得积分10

12秒前; Akim上传了应助文件

13秒前; htt发布了新的文献求助10

14秒前; 黑猫黑猫发布了新的文献求助10

14秒前; 流浪完成签到，获得积分10

14秒前; apollo3232完成签到，获得积分0

14秒前; 眼睛大的一一的应助被坚定的雪枫采纳，获得10

14秒前; gujianhua发布了新的文献求助10

14秒前; mysci发布了新的文献求助10

14秒前; 科研通AI6.2的应助被茕茕采纳，获得30

16秒前; 英俊的铭上传了应助文件

16秒前; 失眠世平完成签到，获得积分10

17秒前; 千寻完成签到，获得积分10

18秒前; 八荒来犬发布了新的文献求助10

19秒前; 可爱的函函的应助被高高采纳，获得10

19秒前; bogula1112完成签到，获得积分10

20秒前; 酷波er的应助被miao采纳，获得10

20秒前; SciGPT的应助被zhu采纳，获得10

21秒前

高分求助中: (应助此贴封号)【重要！！请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000; 2026年中国辛酸癸酸聚乙二醇甘油酯行业市场现状调查及投资机会研判报告 1000; 2026年中国辛酸癸酸聚乙二醇甘油酯行业市场规模及竞争格局分析报告 1000; 模型平均及其应用 900; Nondestructive Testing Handbook: Vol. 4, Thermal and Infrared Testing (IR), 4th ed 800; Évora na Idade Média 555; 作者名：Kristopher P. Plain，悉尼大学的，目前只能查到其四篇论文，想找到其博士论文 550

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 7344372; 求助须知：如何正确求助？哪些是违规求助？ 8956947; 关于积分的说明 19018167; 捐赠科研通 6996267; 什么是DOI，文献DOI怎么找？ 3219764; 关于科研通互助平台的介绍 2384735; 邀请新用户注册赠送积分活动 2199918

今日热心研友

大力的冬萱

行走的荷尔蒙

认真的不评

无情的聪健

昏睡的蟠桃

学术文献互助

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2026 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：821889395【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通