YOLO-TBD: Tea Bud Detection with Triple-Branch Attention Mechanism and Self-Correction Group Convolution

机制（生物学）群（周期表）园艺计算机科学数学化学生物物理有机化学量子力学

作者

Zhongyuan Liu,Zhuo Li,Chunwang Dong,Jiafeng Li

出处

期刊：Industrial Crops and Products [Elsevier BV]
日期：2025-02-01 卷期号：226: 120607-120607 被引量：17

链接

doi.orgdoi.org

标识

DOI：10.1016/j.indcrop.2025.120607

摘要

Automatic Tea Bud Detection (TBD) is one of the core technologies in intelligent tea-picking systems Since the tea buds are small, dense, highly overlapped, and their colors are close to the background, accurate tea bud detection faces great challenges. In this paper, a tea bud detection method, named as YOLO-TBD, is proposed, which adopts YOLOv8 as the basic framework. Firstly, the Path Aggregation Feature Pyramid Network (PAFPN) in YOLOv8 is improved by incorporating the features from the 2nd layer into the PAFPN network. This modification enables better utilization of low-level features, such as texture and color information, thereby enhancing the network’s feature representation ability. Secondly, a Triple-Branch Attention Mechanism (TBAM) is designed and integrated into the output of the backbone network and the C2f module. This attention mechanism strengthens the features of the tea bud objects and suppresses background noise through feature channel interactions, without increasing the model parameters. Finally, a Self-Correction Group Convolution (SCGC) is proposed, which replaces the conventional convolution in the C2f module. This convolution establishes long-range spatial and channel dependencies around each spatial position, enabling a larger receptive field and better contextual information capture with fewer parameters, thereby mitigating false detections and missed detections of tea bud objects. The proposed modules are integrated into the YOLOv8 network architecture, resulting in the construction of three detection models with different parameters, namely YOLO-TBD-L, YOLO-TBD-M and YOLO-TBD-S, respectively. Experimental results on our self-built tea bud detection dataset and the publicly available GWHD_2021 dataset demonstrate that, compared with current methods, the proposed YOLO-TBD-L method can attain a state-of-the-art accuracy, with mAP value reaching 87.04 % and 94.5 %, respectively. And the proposed YOLO-TBD-S model achieves comparable detection accuracy to the YOLOv8-L model with much lower model parameters and computational complexity. • The Path Aggregation Feature Pyramid Network (PAFPN) in YOLOv8 is improved, in which the 2nd layer features are also fed into the network, to fully exploit the texture and color information contained in the low-level features. • A Triple-Branch Attention Mechanism (TBAM) is designed, which employs a dual-branch structure to capture cross-dimensional interactions and the remaining branch is utilized to compute the similarity between each pixel in the feature maps and its adjacent pixels. • A Self-Correction Group Convolution (SCGC) is proposed, which establishes long-range spatial and channel dependencies around each spatial position.

求助该文献

最长约 10秒，即可获得该文献文件

YOLO-TBD: Tea Bud Detection with Triple-Branch Attention Mechanism and Self-Correction Group Convolution

今日热心研友