计算机科学
人工智能
解耦(概率)
特征提取
目标检测
特征(语言学)
遥感
模式识别(心理学)
计算机视觉
语义特征
遥感应用
先验概率
蒸馏
上下文图像分类
航空影像
高斯分布
混合模型
特征模型
数据挖掘
特征向量
探测器
高光谱成像
数据建模
非线性系统
作者
Yuhang Song,Boya Zhao,Yuanfeng Wu,Xiushan Bai,Bing Zhang
标识
DOI:10.1109/tgrs.2025.3622546
摘要
Deep learning-based object detection in remote sensing images often achieves high performance but has high model complexity, hindering edge deployment. Knowledge distillation offers a potential solution for model lightweighting. However, most existing distillation methods are designed for natural scene images and are not suitable for the spatial heterogeneity, semantic discrepancies, and scale diversity of satellite and aerial images (remote sensing imagery). Therefore, this paper proposes a non-uniform knowledge distillation based on feature decoupling (NKDFD). NKDFD incorporates spatial priors derived from 2D Gaussian distributions during feature distillation to achieve spatial decoupling of foreground and background features. Independent feature adaptation modules are applied to the decoupled features, enabling non-uniform feature extraction and knowledge transfer for distinct attributes. This design mitigates potential detection pattern discrepancies between teacher and student models resulting from spatial heterogeneity. A Kolmogorov-Arnold network (KAN)-based semantic decoupling and selection module is used due to its strong nonlinear modeling capability, reducing interference from semantic discrepancies. A cross-scale feature interaction module is developed based on multiscale feature decoupling to enhance multiscale feature learning. Extensive experiments on the DOTA, DIOR, RSOD, and VHR10 datasets demonstrate NKDFD’s consistent improvements for different detectors (single-stage, two-stage, anchor-based, and anchor-free). Comparisons with 13 mainstream distillation methods on DOTA and RSOD show NKDFD’s superiority. It achieves the highest mAP of 37.7% (RetinaNet) and 44.8% (ATSS) on DOTA, and 48.8% (RetinaNet) and 54.0% (ATSS) on RSOD. Further analysis validates the necessity of non-uniform processing, showing that distinct features (e.g., foreground/background) require independent adapters; otherwise, the output distribution is not statistically significantly different from that of random data.
科研通智能强力驱动
Strongly Powered by AbleSci AI