WildMDT-YOLO: A multi-scale wildlife detection model for complex environments

计算机科学适应性野生动物目标检测人工智能数据挖掘特征提取一般化快照（计算机存储）特征（语言学）模式识别（心理学）卷积神经网络任务（项目管理）机器学习对象（语法）稳健性（进化）随机森林基线（sea）航空影像领域（数学分析）计算机视觉野生动物保护野生动物走廊过度拟合频道（广播）条件随机场

作者

Chao Li,Youbo Pang,Xianhang Liu,Zhipeng Yu,Minchao Sun

出处

期刊：Computer Vision and Image Understanding [Elsevier BV]
日期：2026-02-04 卷期号：265: 104678-104678

标识

DOI：10.1016/j.cviu.2026.104678

摘要

The deterioration of the global ecological environment and increasing human activities pose severe threats to wildlife survival, making reliable detection methods crucial for wildlife protection and monitoring. However, existing detection methods often encounter inadequate feature extraction in complex settings such as tree occlusion, strong illumination, and low-light environments. To address this challenge, this paper proposes WildMDT-YOLO, an improved YOLOv8n-based wildlife detection method. The model introduces a Multi-scale Focusing Diffusion Network (MFDN) that enhances contextual information across scales through feature focusing and diffusion mechanisms. A novel detection head employs shared convolutions to reduce model parameters while task alignment and interactive feature extraction improve both classification and localization accuracy. The integration of Deformable Convolutional Networks v3 (DCNv3) and Mixed Local Channel Attention (MLCA) mechanism further enhances adaptability to wildlife species with complex shapes and varying scales. Experimental results show that WildMDT-YOLO achieves 92.6% mean average precision (mAP), a 3.1% increase over baseline YOLOv8n, while reducing parameters by 17.1%. Cross-dataset evaluation on Snapshot Serengeti demonstrates robust generalization capability, achieving 89.2% mAP and maintaining 83.6% small object detection performance despite significant domain shift between ecosystems. This model provides an effective tool for improving monitoring efficiency and accuracy in wildlife conservation. • Proposes WildMDT-YOLO (YOLOv8n-based) with 92.6% mAP (+3.1% vs YOLOv8n) and 17.1% fewer parameters for wildlife detection. • Introduces MFDN and DTADH; boosts small object detection to 87.4% AP (+5.8% vs baseline) in complex environments. • Integrates DCNv3 and MLCA to enhance adaptability to wildlife with complex shapes and varying scales. • Achieves 89.2% mAP and 83.6% small-object AP on Snapshot Serengeti, showing robust cross-ecosystem generalization.

求助该文献

WildMDT-YOLO: A multi-scale wildlife detection model for complex environments

今日热心研友