高光谱成像
计算机科学
人工智能
上下文图像分类
对偶(语法数字)
模式识别(心理学)
集合(抽象数据类型)
遥感
图像(数学)
地质学
文学类
艺术
程序设计语言
作者
Boao Qin,Shou Feng,Chunhui Zhao,Wei Li,Ran Tao,Jun Zhou
标识
DOI:10.1109/tgrs.2025.3549049
摘要
In recent years, language-supervised vision models have demonstrated impressive potential in learning open-world concepts. Some research has introduced this learning paradigm to the hyperspectral image (HSI) processing domain; however, there has been limited work integrating textual information into the hyperspectral open-set recognition task. To fill this gap, we leverage textual supervision information in open-set HSI classification (HSIC) and propose a language-enhanced dual-level contrastive learning network (LDCLNet). Specifically, we introduce a linguistic mode with prior knowledge as a supervised signal to enhance the metric distances between closed-set samples and provide supplementary semantic information for open-set samples. Second, a dual-level visual-language (V-L) contrastive learning (CL) approach, which can align visual and language embeddings separately at the instance level and manifold level, is proposed to establish a more accurate link between visual and language representations. Finally, a distance-refined open-set recognition method is proposed, which aims to effectively discover unknown class samples during testing by refining predictions of known and unknown classes. Extensive experiments and analysis on three public HSI datasets validate the effectiveness of LDCLNet.
科研通智能强力驱动
Strongly Powered by AbleSci AI