Imbalance knowledge-driven multi-modal network for land-cover semantic segmentation using aerial images and LiDAR point clouds

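Computer science; Segmentation; Point cloud; Benchmark (surveying); Feature (linguistics); Artificial intelligence; Modal verb; Lidar; Categorical variable; Pattern recognition (psychology); Machine learning; Remote sensing; Cartography; Geography; Linguistics; Philosophy; Chemistry; Polymer chemistry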
Authors
Yameng Wang, Yi Wan, Yongjun Zhang, Bin Zhang, Zhi Gao
Source
Journal: ISPRS Journal of Photogrammetry and Remote Sensing, Volume 202: 385-404, Citations: 1
Identifier
DOI:10.1016/j.isprsjprs.2023.06.014
Abstract

Despite the good results achieved in unimodal segmentation, the inherent limitations of any single data source make further breakthroughs in performance difficult. For that reason, multi-modal learning is increasingly being explored within the field of remote sensing. Existing multi-modal methods usually map high-dimensional features to low-dimensional spaces as a preprocessing step before feature extraction to address the non-negligible domain gap, which inevitably leads to information loss. To address this issue, we present the Imbalance Knowledge-Driven Multi-modal Network (IKD-Net), which extracts features directly from the heterogeneous multi-modal data of aerial images and LiDAR. IKD-Net mines imbalance information across modalities and uses the strong modality to drive the feature map refinement of the weaker ones from both global and categorical perspectives, by way of two plug-and-play modules: the Global Knowledge-Guided (GKG) and Class Knowledge-Guided (CKG) gated modules. The whole network is then optimized using a joint loss function. While developing IKD-Net, we also established a new dataset, the National Agriculture Imagery Program and 3D Elevation Program Combined dataset in California (N3C-California), which provides a dedicated benchmark for multi-modal joint segmentation tasks. In our experiments, IKD-Net outperformed the benchmarks and state-of-the-art methods on both the N3C-California and the small-scale ISPRS Vaihingen datasets. As of this paper’s submission, IKD-Net ranked first on the real-time leaderboard of the GRSS DFC 2018 challenge evaluation. Our code and the N3C-California dataset are available at https://github.com/wymqqq/IKDNet-pytorch.
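
The abstract does not give the internals of the GKG and CKG gated modules or the form of the joint loss, so the following is only a minimal sketch of the general idea of a gate, computed from a stronger modality, controlling how much of its signal is injected into a weaker modality's features. The function names `gated_refine` and `joint_loss`, the sigmoid gate, and the parameters `weight` and `bias` are all illustrative assumptions and are not taken from the paper or its repository.

```python
import math


def sigmoid(x):
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def gated_refine(strong, weak, weight=1.0, bias=0.0):
    """Refine weak-modality features with a gate derived from the strong modality.

    Hypothetical formula (an assumption, not the paper's GKG/CKG design):
        gate_i    = sigmoid(weight * strong_i + bias)
        refined_i = weak_i + gate_i * strong_i
    """
    if len(strong) != len(weak):
        raise ValueError("feature vectors must have the same length")
    refined = []
    for s, w in zip(strong, weak):
        gate = sigmoid(weight * s + bias)  # always strictly between 0 and 1
        refined.append(w + gate * s)       # residual update of the weak feature
    return refined


def joint_loss(losses, weights):
    """Weighted sum of per-branch losses (hypothetical weighting scheme)."""
    if len(losses) != len(weights):
        raise ValueError("need exactly one weight per loss term")
    return sum(l * w for l, w in zip(losses, weights))
```

In this sketch the weak features are never overwritten, only adjusted by a gated residual term, which is one common way to let one branch guide another without discarding the second branch's own information. A real implementation would operate on tensors and learn `weight` and `bias` during training.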
