MA-YOLO: a multi-attention object detection network for remote sensing images

计算机科学 目标检测 骨干网 特征提取 光学(聚焦) 人工智能 无人机 特征(语言学) 深度学习 可扩展性 推论 数据挖掘 遥感 计算机视觉 模式识别(心理学) 计算机网络 语言学 哲学 物理 数据库 生物 光学 遗传学 地质学
作者
Qingzeng Song,Maorui Hou,Yongjiang Xue,Guanghao Jin
出处
期刊:Journal of Electronic Imaging [SPIE - International Society for Optical Engineering]
卷期号:33 (01)
标识
DOI:10.1117/1.jei.33.1.013006
摘要

In recent years, deep learning-based objects detection algorithms have demonstrated exceptional performance in natural environments. These algorithms have been extensively used in various remote sensing applications, which include the detection of structures and roads as well as flood and earthquake disasters. In these applications, remote sensing images may be captured by satellites, drones, and other equipment. Compared with conventional images, they often feature substantial occlusion, intricate backgrounds and numerous small targets, which are difficult to detect because of the high resolution and large data volume. The existing algorithms focus on detection accuracy or speed, which often fail to achieve a balance between these. To solve this problem, we proposed a single-stage object detection algorithm MA-YOLO based on YOLOv4. We first design a backbone network aimed to enhance feature extraction capabilities while maintaining inference speed. Second, we introduced a parallel attention mechanism, which is to improve the detection performance of small targets. Finally, we applied an attention mechanism to the path aggregation network, which is to enhance the fusion effect of multi-scale features for detecting multi-scale targets. To validate the efficacy of our proposed approach, we evaluated MA-YOLO on three datasets: DIOR, RSOD, and NWPU VHR-10. The experimental results show that our proposed network achieves detection accuracy of 68.87%, 94.13%, and 93.77% on these datasets while ensuring the reasoning speed of 28.4 frames per second and realizes the effective balance between detection accuracy and speed.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
JT发布了新的文献求助30
刚刚
viauue9完成签到,获得积分10
2秒前
Yyy发布了新的文献求助10
2秒前
111完成签到 ,获得积分10
2秒前
3秒前
吃老鼠的鱼完成签到,获得积分10
3秒前
冷傲凉面发布了新的文献求助10
4秒前
脑洞疼应助七七采纳,获得10
5秒前
超帅pzc发布了新的文献求助10
5秒前
fddd完成签到 ,获得积分10
6秒前
正版小魏发布了新的文献求助10
6秒前
8秒前
9秒前
顾矜应助hua采纳,获得10
9秒前
didi发布了新的文献求助10
9秒前
CipherSage应助成就的道天采纳,获得10
10秒前
FashionBoy应助成就的道天采纳,获得10
10秒前
10秒前
10秒前
罗_应助成就的道天采纳,获得10
10秒前
Mike完成签到,获得积分10
11秒前
11秒前
小鱼儿完成签到,获得积分10
11秒前
Akim应助自信的水壶采纳,获得10
11秒前
Yyy完成签到,获得积分10
12秒前
西扬发布了新的文献求助10
13秒前
14秒前
狂野雁丝发布了新的文献求助20
15秒前
16秒前
16秒前
17秒前
酷炫的八宝粥应助yourhonor采纳,获得10
18秒前
缥缈夏寒应助等等采纳,获得10
18秒前
JT完成签到,获得积分10
21秒前
七七发布了新的文献求助10
21秒前
正直的煎蛋完成签到,获得积分10
22秒前
22秒前
HXY发布了新的文献求助10
22秒前
斯文败类应助zsc采纳,获得10
24秒前
高分求助中
Teaching Social and Emotional Learning in Physical Education 900
Plesiosaur extinction cycles; events that mark the beginning, middle and end of the Cretaceous 800
Recherches Ethnographiques sue les Yao dans la Chine du Sud 500
Two-sample Mendelian randomization analysis reveals causal relationships between blood lipids and venous thromboembolism 500
Chinese-English Translation Lexicon Version 3.0 500
[Lambert-Eaton syndrome without calcium channel autoantibodies] 440
Wisdom, Gods and Literature Studies in Assyriology in Honour of W. G. Lambert 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2389535
求助须知:如何正确求助?哪些是违规求助? 2095504
关于积分的说明 5277664
捐赠科研通 1822681
什么是DOI,文献DOI怎么找? 909020
版权声明 559530
科研通“疑难数据库(出版商)”最低求助积分说明 485732