计算机科学
语义学(计算机科学)
班级(哲学)
人工智能
代表(政治)
一般化
特征(语言学)
上下文图像分类
情报检索
样品(材料)
文字嵌入
弹丸
图像(数学)
模式识别(心理学)
嵌入
化学
政治学
数学
法学
程序设计语言
语言学
有机化学
色谱法
政治
哲学
数学分析
作者
Jie Chen,Ya Guo,Jingru Zhu,Geng Sun,Dengda Qin,Min Deng,Huimin Liu
标识
DOI:10.1109/tgrs.2022.3219726
摘要
Few-shot remote sensing scene classification (FSRSSC) has been used for new class recognition in the presence of a limited number of labeled samples. The representation vector (prototype) of categories obtained using images only confronts some challenges, such as insufficient generalization when the number of samples is too small. To address this problem, we propose a new FSRSSC method based on prototype networks, named CNSPN, which combines semantic information of class names (name of the scene categories, such as aircraft, harbor, and bridge). First, CNSPN extracts semantics for class names using a pre-trained word-embedding model, which enriches the feature representation ability of the category at the source. Then, an enhanced fusion prototype is generated by fusing the semantic information of text and visual information in the image through a multimodal prototype fusion module (MPFM). Finally, the query image is classified by measuring the distance between the query sample and the visual prototype, and between the query sample and the fusion prototype. Comparative experiments on the NWPU-RESISC45 and RSD46-WHU datasets show that the proposed method significantly improves FSRSSC performance. Code is available at https://github.com/RS-CSU/CNSPN.git.
科研通智能强力驱动
Strongly Powered by AbleSci AI