Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation

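Keywords: Artificial intelligence, Computer science, Segmentation, Image segmentation, Pattern recognition (psychology)
Authors
Linshan Wu, Zhun Zhong, Jiayi Ma, Yunchao Wei, Hao Chen, Leyuan Fang, Shutao Li
Source
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence [IEEE Computer Society]
Volume (issue): 47 (8): 6290-6306 Citations: 8
Identifier
DOI: 10.1109/tpami.2025.3557047
Abstract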

Weakly-Supervised Semantic Segmentation (WSSS) aims to train segmentation models with weak labels and is receiving significant attention due to its low annotation cost. Existing approaches focus on generating pseudo labels for supervision while largely neglecting the inherent semantic correlation among different pseudo labels. We observe that pseudo-labeled pixels that are close to each other in the feature space are more likely to share the same class, and those closer to the distribution centers tend to have higher confidence. Motivated by this, we propose to model the underlying label distributions and employ cross-label constraints to generate more accurate pseudo labels. In this paper, we develop a unified WSSS framework named Adaptive Gaussian Mixtures Model, which leverages a Gaussian mixture model (GMM) to model the label distributions. Specifically, we calculate the feature distribution centers of pseudo-labeled pixels and build the GMM by measuring the distance between the centers and each pseudo-labeled pixel. Then, we introduce an Online Expectation-Maximization (OEM) algorithm and a novel maximization loss to optimize the GMM adaptively, aiming to learn more discriminative decision boundaries between different class-wise Gaussian mixtures. Based on the label distributions, we leverage the GMM to generate high-quality pseudo labels for more reliable supervision. Our framework can handle different forms of weak labels: image-level labels, points, scribbles, blocks, and bounding boxes. Extensive experiments on the PASCAL, COCO, Cityscapes, and ADE20K datasets demonstrate that our framework provides more reliable supervision and outperforms state-of-the-art methods under all settings.
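
The core idea described in the abstract (class-wise distribution centers in feature space, with each pixel scored by its distance to those centers) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `gmm_pseudo_labels`, the use of one isotropic Gaussian per class, equal mixture weights, and the `ignore_index` convention are all assumptions made for illustration, and the Online Expectation-Maximization step and maximization loss are omitted entirely.

```python
import numpy as np


def gmm_pseudo_labels(features, labels, num_classes, ignore_index=-1, eps=1e-6):
    """Hypothetical helper: soft-assign pixels to class-wise Gaussians.

    features: (N, D) array of per-pixel feature vectors.
    labels:   (N,) array of pseudo labels; ignore_index marks unlabeled pixels.
    Returns (predicted class, confidence, class centers).
    """
    n, d = features.shape
    centers = np.zeros((num_classes, d))
    variances = np.ones(num_classes)
    present = np.zeros(num_classes, dtype=bool)

    for c in range(num_classes):
        mask = labels == c
        if mask.any():
            present[c] = True
            # Distribution center: mean feature of pixels pseudo-labeled as c.
            centers[c] = features[mask].mean(axis=0)
            # Assumption: a single isotropic variance per class.
            variances[c] = ((features[mask] - centers[c]) ** 2).mean() + eps

    # Squared Euclidean distance from every pixel to every center: (N, K).
    sq_dist = ((features[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)

    # Log-density of an isotropic Gaussian, up to a constant shared by all classes.
    log_prob = -0.5 * sq_dist / variances - 0.5 * d * np.log(variances)
    # Classes without any labeled pixel cannot be predicted.
    log_prob[:, ~present] = -np.inf

    # Normalize to per-pixel class probabilities (softmax over classes).
    log_prob -= log_prob.max(axis=1, keepdims=True)
    prob = np.exp(log_prob)
    prob /= prob.sum(axis=1, keepdims=True)

    return prob.argmax(axis=1), prob.max(axis=1), centers


# Toy usage: six 2-D "pixel" features, two of them unlabeled (-1).
features = np.array([[0.0, 0.0], [0.2, 0.1], [0.1, -0.1],
                     [5.0, 5.0], [5.1, 4.9], [4.8, 5.2]])
labels = np.array([0, 0, -1, 1, 1, -1])
pred, conf, centers = gmm_pseudo_labels(features, labels, num_classes=2)
```

In this toy setup the unlabeled pixels receive the class of the nearest center, and pixels nearer a center get higher confidence, which mirrors the observation stated in the abstract. A practical pipeline would typically keep only predictions above a confidence threshold as pseudo labels; that threshold is likewise an assumption and not taken from the paper.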