计算机科学
人工智能
初始化
下游(制造业)
特征(语言学)
钥匙(锁)
编码(内存)
安全性令牌
特征提取
像素
解码方法
频域
构造(python库)
原始数据
计算机视觉
嵌入
分类
信息抽取
图像(数学)
机器学习
块(置换群论)
信息模型
图像处理
上游(联网)
人机交互
数据挖掘
模式识别(心理学)
聚类分析
领域(数学分析)
任务分析
作者
Boyu Zhao,Wei Li,Junjie Wang,Yuxiang Zhang,Hong Yang,Haitao Zhao,Ran Tao,Qian Du
标识
DOI:10.1109/tpami.2025.3639595
摘要
Frequency domain analysis reveals fundamental image patterns difficult to observe in raw pixel values, while avoiding redundant information in original image processing. Although recent remote sensing foundation models (FMs) have made progress in leveraging spatial and spectral information, they have limitations in fully utilizing frequency characteristics that capture hidden features. Existing FMs that incorporate frequency properties often struggle to maintain connections with the original image content, creating a semantic gap that affects downstream performance. To address these challenges, we propose the All-in-One Spectral-Spatial-Frequency Awareness Foundation Model (Alliance), a framework that effectively integrates information across all three domains. Alliance introduces several key innovations: (1) a progressive frequency decoding mechanism inspired by human visual cognition that minimizes multi-domain information gaps while preserving connections between general image information and frequency characteristics, progressively reconstructing from low to mid to high frequencies to extract patterns difficult to observe in raw pixel values; (2) a triple-domain fusion attention module that separately processes amplitude, phase, and spectral-spatial relationships for comprehensive feature integration; and (3) frequency embedding with frequency-aware Cls token initialization and frequency-specific mask token initialization that achieves fine-grained modeling of different frequency band information. Additionally, to evaluate FMs generalizability, we construct the Yellow River dataset, a large-scale multi-temporal collection that introduces challenging cross-domain tasks and establishes more rigorous standards for FMs assessment. Extensive experiments across six downstream tasks demonstrate Alliance's superior performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI