An improved YOLOv7 network using RGB-D multi-modal feature fusion for tea shoots detection

RGB颜色模型 人工智能 计算机科学 情态动词 特征提取 特征(语言学) 计算机视觉 模式识别(心理学) 语言学 哲学 化学 高分子化学
作者
Yuanwei Wu,Jianneng Chen,Shunkai Wu,Hui Li,He Li,Runmao Zhao,Chuanyu Wu
出处
期刊:Computers and Electronics in Agriculture [Elsevier]
卷期号:216: 108541-108541 被引量:1
标识
DOI:10.1016/j.compag.2023.108541
摘要

Due to the increasing scarcity of tea pickers, the implementation of intelligent harvesting for premium tea is a crucial prerequisite for the sustainable development of the premium tea industry. The initial step towards achieving intelligent and precise harvesting is the accurate detection of tender shoots, which consist of one bud and one leaf. However, accurately identifying tea shoots poses a challenging visual task due to their small size, variable shapes, as well as similar colors and backgrounds. The existing model, based on RGB images, can only detect partial targets. To address this issue and further enhance the detection of tea buds, this study proposes the utilization of multi-modal features encompassing red, green, blue, and depth (RGB-D) for identification. In addition, a unidirectional complementary multi-modal fusion method is introduced to minimize the adverse effects caused by low-quality depth information. Firstly, an RGB-D dataset comprising high-quality tea leaves is constructed, and the samples are carefully calibrated. Subsequently, an enhanced end-to-end RGB-D multi-modal object detection network, referred to as YOLO-RGBDtea, is developed based on You Only Look Once version 7 (YOLOv7). This model incorporates a parallel lightweight depth image feature extraction backbone network and incorporates a self-attention mechanism to prioritize contextual information. Lastly, a cross-modal spatial attention fusion module (CSFM) is devised to collaboratively integrate depth features with RGB features in a unidirectional manner. The experimental results reveal that YOLO-RGBDtea achieves an AP50 of 91.12% when confronting complex outdoor tea shoots, exhibiting significant performance improvements compared to YOLOv7, especially in scenarios involving small targets, overlapping target groups, and highly overexposed images. Notably, the parameter increment in YOLO-RGBDtea compared to the original YOLOv7 model is merely 17.8%, and the additional components can be seamlessly transferred to other models. Overall, this study introduces a straightforward yet effective multi-modal fusion method that bears theoretical and practical significance in advancing the detection of high-quality tea shoots in complex outdoor environments.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
拼搏诗翠发布了新的文献求助10
1秒前
MW发布了新的文献求助10
1秒前
嘟嘟噜完成签到,获得积分10
1秒前
2秒前
fireking_sid发布了新的文献求助10
3秒前
4秒前
Catalysis123发布了新的文献求助10
5秒前
传奇3应助嘿嘿采纳,获得10
6秒前
hokin33发布了新的文献求助30
7秒前
诸缘郡发布了新的文献求助10
9秒前
ding应助fireking_sid采纳,获得10
9秒前
希望天下0贩的0应助思垢采纳,获得10
11秒前
cctv18应助李霞采纳,获得10
12秒前
个性的紫菜应助lgjswxf采纳,获得10
12秒前
14秒前
15秒前
16秒前
思源应助孤独沛菡采纳,获得10
17秒前
毁灭世界关注了科研通微信公众号
17秒前
19秒前
zdz完成签到,获得积分20
19秒前
19秒前
zhongying完成签到 ,获得积分10
20秒前
23秒前
23秒前
24秒前
虚心以丹完成签到,获得积分10
25秒前
忧心的翅膀完成签到,获得积分10
26秒前
26秒前
孤独沛菡发布了新的文献求助10
29秒前
benben应助诸缘郡采纳,获得10
30秒前
30秒前
半山完成签到 ,获得积分10
32秒前
柯一一应助某某采纳,获得10
35秒前
35秒前
FashionBoy应助wxy采纳,获得10
37秒前
失眠的大雁完成签到,获得积分10
39秒前
41秒前
言言发布了新的文献求助10
43秒前
Tigher完成签到,获得积分10
45秒前
高分求助中
The three stars each : the Astrolabes and related texts 1070
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Hieronymi Mercurialis Foroliviensis De arte gymnastica libri sex: In quibus exercitationum omnium vetustarum genera, loca, modi, facultates, & ... exercitationes pertinet diligenter explicatur Hardcover – 26 August 2016 900
Sport in der Antike 800
De arte gymnastica. The art of gymnastics 600
少脉山油柑叶的化学成分研究 530
Sport in der Antike Hardcover – March 1, 2015 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2404258
求助须知:如何正确求助?哪些是违规求助? 2102893
关于积分的说明 5307159
捐赠科研通 1830555
什么是DOI,文献DOI怎么找? 912123
版权声明 560502
科研通“疑难数据库(出版商)”最低求助积分说明 487683