隐藏字幕
计算机科学
任务(项目管理)
图像(数学)
上下文图像分类
人工智能
班级(哲学)
模式识别(心理学)
遥感
计算机视觉
工程类
地理
系统工程
作者
Qiaoqiao Yang,Zihao Ni,Peng Ren
出处
期刊:Isprs Journal of Photogrammetry and Remote Sensing
日期:2022-02-25
卷期号:186: 190-200
被引量:31
标识
DOI:10.1016/j.isprsjprs.2022.02.001
摘要
Remote sensing image captioning models require large amounts of caption-labeled training data. Though image classification models are normally trained with sufficient training data, they cannot be straightforwardly applied to remote sensing image captioning, because the labels for classification and captioning arise from different task domains. Additionally, remote sensing images with caption labels are not as sufficient as images with class labels. Such limitations render difficulty to effective remote sensing image captioning. To address these limitations, we develop a meta captioning framework that conducts remote sensing image captioning with meta learning. The meta captioning framework extracts meta features from two support tasks, i.e., natural image classification and remote sensing image classification, and transfers the meta features to one target task, i.e., remote sensing image captioning. The two support tasks train classification models with big amounts of class-labeled data such that they extract meta features that comprehensively represent image visual features from the perspective of classification. The target task, equipped by the meta features, just requires a relatively small amount of caption-labeled training data for achieving effective remote sensing image captioning. Experimental evaluations on three public datasets validate that the meta captioning framework achieves state-of-the-art performance on remote sensing image captioning. We release the code for our work at: https://github.com/QiaoqiaoYang/MetaCaptioning.
科研通智能强力驱动
Strongly Powered by AbleSci AI