目标检测
计算机科学
对象(语法)
计算机视觉
人工智能
模式识别(心理学)
作者
Canhui Zhang,Cong Gu,Qiqiang Duan,Shuo Zhao
标识
DOI:10.1109/jsen.2025.3568844
摘要
With the development of deep convolutional neural networks, significant progress has been made in object detection over the past few years. Current detection algorithms demonstrate high accuracy for large and medium-sized objects. However, the detection of small objects remains a significant challenge due to their limited pixel area, scarcity of distinctive features, low contrast against complex backgrounds, and increased sensitivity to boundary changes. These limitations reduce both accuracy and inference speed in small object detection tasks. To address these challenges, we propose a Small Object Enhancement Network based on YOLOv8, named SOE-YOLO. The network is designed to enhance both feature extraction and semantic understanding of small objects. SOE-YOLO integrates two key modules: the Attention-Guided Multi-Path Spatial Pyramid Pooling (AMSPP) module and the Cascaded Semantic Embedding (CSE) module. The AMSPP module initially extracts multiscale information through multi-path spatial pyramid pooling. To guide the network in focusing on the features of smaller objects, we introduce the Marginal Attention (MA) mechanism, which efficiently and accurately assigns corresponding weights to each feature point. Subsequently, the CSE module stacks multiple Semantic Embedding Units (SEU), progressively incorporating higher-level semantic information into high-resolution feature maps. This strategy effectively combines the spatial details of small objects with abstract, high-level semantic knowledge, enhancing the model’s ability to discern complex relationships between small objects and backgrounds. Extensive experiments on benchmark datasets demonstrate the effectiveness of SOE-YOLO, which outperforms other state-of-the-art detectors in terms of both detection accuracy and robustness for small object detection.
科研通智能强力驱动
Strongly Powered by AbleSci AI