目标检测                        
                
                                
                        
                            计算机科学                        
                
                                
                        
                            稳健性(进化)                        
                
                                
                        
                            水下                        
                
                                
                        
                            特征提取                        
                
                                
                        
                            人工智能                        
                
                                
                        
                            卷积(计算机科学)                        
                
                                
                        
                            特征(语言学)                        
                
                                
                        
                            推论                        
                
                                
                        
                            卷积神经网络                        
                
                                
                        
                            人工神经网络                        
                
                                
                        
                            数据挖掘                        
                
                                
                        
                            模式识别(心理学)                        
                
                                
                        
                            联营                        
                
                                
                        
                            边缘检测                        
                
                                
                        
                            适应性                        
                
                                
                        
                            实时计算                        
                
                                
                        
                            机器学习                        
                
                                
                        
                            机器视觉                        
                
                                
                        
                            计算机视觉                        
                
                                
                        
                            假警报                        
                
                                
                        
                            视觉对象识别的认知神经科学                        
                
                                
                        
                            杂乱                        
                
                                
                        
                            脉冲响应                        
                
                                
                        
                            计算机工程                        
                
                                
                        
                            对象(语法)                        
                
                                
                        
                            棱锥(几何)                        
                
                                
                        
                            图像处理                        
                
                                
                        
                            深度学习                        
                
                                
                        
                            上下文图像分类                        
                
                                
                        
                            长方体                        
                
                                
                        
                            特征向量                        
                
                                
                        
                            计算资源                        
                
                                
                        
                            边缘设备                        
                
                                
                        
                            计算智能                        
                
                                
                        
                            网络模型                        
                
                                
                        
                            声纳                        
                
                                
                        
                            软件部署                        
                
                                
                        
                            背景减法                        
                
                                
                        
                            解算器                        
                
                        
                    
            作者
            
                Mingxin Liu,Yujie Wu,Ruixin Li,Cong Lin            
         
                    
        
    
            
            标识
            
                                    DOI:10.3389/fmars.2024.1513740
                                    
                                
                                 
         
        
                
            摘要
            
            Underwater object detection plays a significant role in fisheries resource assessment and ecological environment protection. However, traditional underwater object detection methods struggle to achieve accurate detection in complex underwater environments with limited computational resources. This paper proposes a lightweight underwater object detection network called LightFusionNet-YOLO (LFN-YOLO). First, we introduce the reparameterization technique RepGhost to reduce the number of parameters while enhancing training and inference efficiency. This approach effectively minimizes precision loss even with a lightweight backbone network. Then, we replaced the standard depthwise convolution in the feature extraction network with SPD-Conv, which includes an additional pooling layer to mitigate detail loss. This modification effectively enhances the detection performance for small objects. Furthermore, We employed the Generalized Feature Pyramid Network (GFPN) for feature fusion in the network's neck, enhancing the network's adaptability to features of varying scales. Finally, we design a new detection head, CLLAHead, which reduces computational costs and strengthens the robustness of the model through cross-layer local attention. At the same time, the DFL loss function is introduced to reduce regression and classification errors. Experiments conducted on public datasets, including URPC, Brackish, and TrashCan, showed that the mAP@0.5 reached 74.1%, 97.5%, and 66.2%, respectively, with parameter sizes and computational complexities of 2.7M and 7.2 GFLOPs, and the model size is only 5.9 Mb. Compared to mainstream vision models, our model demonstrates superior performance. Additionally, deployment on the NVIDIA Jetson AGX Orin edge computing device confirms its high real-time performance and suitability for underwater applications, further showcasing the exceptional capabilities of LFN-YOLO.
         
            
 
                 
                
                    
                    科研通智能强力驱动
Strongly Powered by AbleSci AI