计算机科学
人工智能
网(多面体)
图像(数学)
上下文图像分类
模式识别(心理学)
数学
几何学
作者
Mengru Ma,Wenping Ma,Licheng Jiao,Xu Liu,Fang Liu,Lingling Li,Shuyuan Yang
标识
DOI:10.1109/tcsvt.2023.3322470
摘要
A growing number of earth observation satellites are able to simultaneously gather multimodal images of the same area due to the expanding availability and resolution of satellite remote sensing data. This paper proposes a novel multimodal balanced self-learning interaction network (MBSI-Net) for the classification task. It involves a dual-branch teacher-student network that enables knowledge interaction and transfer between the multimodalities. Firstly, in order to introduce statistical information in addition to local and global structural information, a texture feature equalization module (TFE-Module) is proposed. This can enhance the texture information of features through histogram equalization and further improve the representation ability of features. Secondly, to enable the student network to provide timely feedback questions, the paper proposes a feature fusion module (F 2 -Module) that models and enhances teacher features through the student network. This helps to raise the classification's accuracy by incorporating information from multimodal images. Finally, the paper proposes a loss function based on structural similarity analysis to ensure balanced self-learning between the student and the teacher networks. Taking the multispectral (MS) and the panchromatic (PAN) images of the same scene as examples, through experimental verification, the proposed method can achieve good results on multiple datasets compared with other methods. Therefore, it offers an effective method for classifying and fusing multimodal data.
科研通智能强力驱动
Strongly Powered by AbleSci AI