计算机科学
适应性
野生动物
目标检测
人工智能
数据挖掘
特征提取
一般化
快照(计算机存储)
特征(语言学)
模式识别(心理学)
卷积神经网络
任务(项目管理)
机器学习
对象(语法)
稳健性(进化)
随机森林
基线(sea)
航空影像
领域(数学分析)
计算机视觉
野生动物保护
野生动物走廊
过度拟合
频道(广播)
条件随机场
作者
Chao Li,Youbo Pang,Xianhang Liu,Zhipeng Yu,Minchao Sun
标识
DOI:10.1016/j.cviu.2026.104678
摘要
The deterioration of the global ecological environment and increasing human activities pose severe threats to wildlife survival, making reliable detection methods crucial for wildlife protection and monitoring. However, existing detection methods often encounter inadequate feature extraction in complex settings such as tree occlusion, strong illumination, and low-light environments. To address this challenge, this paper proposes WildMDT-YOLO, an improved YOLOv8n-based wildlife detection method. The model introduces a Multi-scale Focusing Diffusion Network (MFDN) that enhances contextual information across scales through feature focusing and diffusion mechanisms. A novel detection head employs shared convolutions to reduce model parameters while task alignment and interactive feature extraction improve both classification and localization accuracy. The integration of Deformable Convolutional Networks v3 (DCNv3) and Mixed Local Channel Attention (MLCA) mechanism further enhances adaptability to wildlife species with complex shapes and varying scales. Experimental results show that WildMDT-YOLO achieves 92.6% mean average precision (mAP), a 3.1% increase over baseline YOLOv8n, while reducing parameters by 17.1%. Cross-dataset evaluation on Snapshot Serengeti demonstrates robust generalization capability, achieving 89.2% mAP and maintaining 83.6% small object detection performance despite significant domain shift between ecosystems. This model provides an effective tool for improving monitoring efficiency and accuracy in wildlife conservation. • Proposes WildMDT-YOLO (YOLOv8n-based) with 92.6% mAP (+3.1% vs YOLOv8n) and 17.1% fewer parameters for wildlife detection. • Introduces MFDN and DTADH; boosts small object detection to 87.4% AP (+5.8% vs baseline) in complex environments. • Integrates DCNv3 and MLCA to enhance adaptability to wildlife with complex shapes and varying scales. • Achieves 89.2% mAP and 83.6% small-object AP on Snapshot Serengeti, showing robust cross-ecosystem generalization.
科研通智能强力驱动
Strongly Powered by AbleSci AI