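Short-term memory
Term (time)
Geography
Cartography
Transformer
Artificial intelligence
Computer science
Computer vision
Remote sensing
Engineering
Electrical engineering
Physics
Artificial neural network
Quantum mechanics
Voltage
Recurrent neural network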
Authors
Swalpa Kumar Roy,Ali Jamali,Koushik Biswas,Danfeng Hong,Pedram Ghamisi
Source
Journal: International Journal of Applied Earth Observation and Geoinformation
Date: 2025-09-01
Volume/Issue: 143: 104801
Identifier
DOI:10.1016/j.jag.2025.104801
Abstract
Scene classification plays a critical role in remote sensing image analysis, with numerous methods based on Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) developed to improve performance on high-resolution remote sensing (HRRS) imagery. However, existing models struggle with several key challenges, including effectively capturing fine-grained local features and modeling long-range spatial dependencies in complex scenes. These limitations reduce the discriminative power of the extracted features, which is critical for HRRS image classification. To overcome these issues, our study aims to design a unified model that jointly leverages local information extraction, global context modeling, and long-range dependency learning. We propose a novel architecture, ViCxLSTM, designed to enhance feature discriminability for HRRS scene classification. ViCxLSTM is a hybrid model that integrates a Local Pattern Unit (comprising convolutional layers and Fourier Transforms), an extended Long Short-Term Memory module (xLSTM), and a Vision Transformer. This integrated architecture enables the model to capture a wide range of spatial patterns, from local textures to long-range dependencies and global contextual relationships. Experimental evaluations show that ViCxLSTM achieves superior classification performance across diverse land use datasets, outperforming several state-of-the-art models, including ResNet-50, ResNet-101, ResNet-152, ViT, LeViT, CrossViT, DeepViT, and CaiT. The code will be made freely accessible at https://github.com/aj1365/ViCxLSTM.
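Illustrative note: the abstract describes the Local Pattern Unit only as a combination of convolutional layers and Fourier Transforms. The NumPy sketch below is a rough illustration of that general idea, not the authors' implementation (see their repository for that). It pairs a spatial convolution branch with a frequency-domain filtering branch on a single-channel image. The function names, the fixed Laplacian kernel, the low-pass mask, and the `keep_fraction` parameter are all assumptions made for illustration; in a trained model these would presumably be learned layers.

```python
import numpy as np

def conv2d_same(x, kernel):
    """Naive single-channel 2-D cross-correlation with zero padding ('same' size).
    Assumes an odd-sized kernel."""
    kh, kw = kernel.shape
    ph, pw = kh // 2, kw // 2
    padded = np.pad(x, ((ph, ph), (pw, pw)))  # zero padding by default
    out = np.zeros_like(x, dtype=float)
    for i in range(kh):
        for j in range(kw):
            out += kernel[i, j] * padded[i:i + x.shape[0], j:j + x.shape[1]]
    return out

def fourier_lowpass(x, keep_fraction=0.25):
    """Filter in the frequency domain: FFT, keep a centered low-frequency
    window, then inverse FFT. The mask is fixed here (hypothetical choice)."""
    h, w = x.shape
    spectrum = np.fft.fftshift(np.fft.fft2(x))  # move the DC term to the center
    cy, cx = h // 2, w // 2
    ry = max(1, int(h * keep_fraction / 2))
    rx = max(1, int(w * keep_fraction / 2))
    mask = np.zeros((h, w))
    mask[cy - ry:cy + ry + 1, cx - rx:cx + rx + 1] = 1.0
    filtered = np.fft.ifft2(np.fft.ifftshift(spectrum * mask))
    return filtered.real

def local_pattern_sketch(x):
    """Stack a spatial (convolution) branch and a spectral (Fourier) branch."""
    laplacian = np.array([[0.0, 1.0, 0.0],
                          [1.0, -4.0, 1.0],
                          [0.0, 1.0, 0.0]])
    spatial = conv2d_same(x, laplacian)   # responds to local intensity changes
    spectral = fourier_lowpass(x)         # smooth, image-wide structure
    return np.stack([spatial, spectral])  # shape: (2, H, W)

# Usage: a random 8x8 "image" yields two feature maps of the same size.
features = local_pattern_sketch(np.random.default_rng(0).random((8, 8)))
print(features.shape)
```

The two branches are complementary: the convolution responds to fine local detail, while the Fourier branch operates on the whole image at once. How ViCxLSTM actually fuses these signals, and how the xLSTM and Vision Transformer stages consume them, is not specified in the abstract.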