LFN-YOLO: precision underwater small object detection via a lightweight reparameterized approach

目标检测计算机科学稳健性（进化）水下特征提取人工智能卷积（计算机科学）特征（语言学）推论卷积神经网络人工神经网络数据挖掘模式识别（心理学）联营边缘检测适应性实时计算机器学习机器视觉计算机视觉假警报视觉对象识别的认知神经科学杂乱脉冲响应计算机工程对象（语法）棱锥（几何）图像处理深度学习上下文图像分类长方体特征向量计算资源边缘设备计算智能网络模型声纳软件部署背景减法解算器

作者

Mingxin Liu,Yujie Wu,Ruixin Li,Cong Lin

出处

期刊：Frontiers in Marine Science [Frontiers Media]
日期：2025-01-23 卷期号：11 被引量：4

链接

doi.org doaj.orgdoi.org

标识

DOI：10.3389/fmars.2024.1513740

摘要

Underwater object detection plays a significant role in fisheries resource assessment and ecological environment protection. However, traditional underwater object detection methods struggle to achieve accurate detection in complex underwater environments with limited computational resources. This paper proposes a lightweight underwater object detection network called LightFusionNet-YOLO (LFN-YOLO). First, we introduce the reparameterization technique RepGhost to reduce the number of parameters while enhancing training and inference efficiency. This approach effectively minimizes precision loss even with a lightweight backbone network. Then, we replaced the standard depthwise convolution in the feature extraction network with SPD-Conv, which includes an additional pooling layer to mitigate detail loss. This modification effectively enhances the detection performance for small objects. Furthermore, We employed the Generalized Feature Pyramid Network (GFPN) for feature fusion in the network's neck, enhancing the network's adaptability to features of varying scales. Finally, we design a new detection head, CLLAHead, which reduces computational costs and strengthens the robustness of the model through cross-layer local attention. At the same time, the DFL loss function is introduced to reduce regression and classification errors. Experiments conducted on public datasets, including URPC, Brackish, and TrashCan, showed that the mAP@0.5 reached 74.1%, 97.5%, and 66.2%, respectively, with parameter sizes and computational complexities of 2.7M and 7.2 GFLOPs, and the model size is only 5.9 Mb. Compared to mainstream vision models, our model demonstrates superior performance. Additionally, deployment on the NVIDIA Jetson AGX Orin edge computing device confirms its high real-time performance and suitability for underwater applications, further showcasing the exceptional capabilities of LFN-YOLO.

求助该文献

最长约 10秒，即可获得该文献文件

LFN-YOLO: precision underwater small object detection via a lightweight reparameterized approach

今日热心研友