计算机科学
一致性(知识库)
卷积(计算机科学)
残余物
流离失所(心理学)
编码(集合论)
人工智能
帧(网络)
棱锥(几何)
光流
运动(物理)
运动估计
计算机视觉
代表(政治)
领域(数学)
班级(哲学)
模式识别(心理学)
数据一致性
目标检测
灵活性(工程)
参考坐标系
测距
运动检测
依赖关系(UML)
航向(导航)
高动态范围
作者
Chuiyi Deng,Yanyin Guo,Xiang Xu,Zhuoyi Zhao,Yixin Xia,Runxuan An,Junwei Li,Antonio Plaza
标识
DOI:10.1109/tgrs.2026.3657842
摘要
Motion Infrared Small Target Detection (MIRSTD) leverages multi-frame temporal dependencies to improve detection robustness. However, existing methods have difficulty modeling global consistency and achieving precise alignment in complex motion and large displacement scenarios, leading to dispersed target representations and higher error rates. To address these challenges, we propose Dynamic Query Aligner (DQAligner), which introduces global random large-displacement augmentation and a cross-scale bidirectional shared attention mechanism to enhance inter-frame consistency. A dynamic receptive field pyramid deformable convolution decomposes complex multi-scale motions, enabling precise target alignment. Furthermore, class query memory serves as the generalized residual form of deformable convolution, which iteratively learns dynamic query representations to facilitate global target localization within each frame and maintain semantic consistency across frames. DQAligner achieves a paradigm shift from rigid alignment to flexible matching, and significantly boosts detection performance in large displacement and dynamic scenarios. Experiments on extensive stationary and moving platform datasets show that DQAligner outperforms existing methods, especially under complex motion and low signal-noise-rate conditions. Code will be available at https://github.com/dengfa02/DQAligner_MIRSTD.
科研通智能强力驱动
Strongly Powered by AbleSci AI