计算机科学
人工智能
班级(哲学)
基础(拓扑)
帕斯卡(单位)
像素
分割
集合(抽象数据类型)
正投影
正交性
知识库
模式识别(心理学)
机器学习
数学
数学分析
程序设计语言
几何学
作者
Sun-Ao Liu,Yiheng Zhang,Zhaofan Qiu,Hongtao Xie,Yongdong Zhang,Ting Yao
标识
DOI:10.1109/cvpr52729.2023.01089
摘要
Generalized few-shot semantic segmentation (GFSS) distinguishes pixels of base and novel classes from the background simultaneously, conditioning on sufficient data of base classes and a few examples from novel class. A typical GFSS approach has two training phases: base class learning and novel class updating. Nevertheless, such a stand-alone updating process often compromises the well-learnt features and results in performance drop on base classes. In this paper, we propose a new idea of leveraging Projection onto Orthogonal Prototypes (POP), which updates features to identify novel classes without compromising base classes. POP builds a set of orthogonal prototypes, each of which represents a semantic class, and makes the prediction for each class separately based on the features projected onto its prototype. Technically, POP first learns prototypes on base data, and then extends the prototype set to novel classes. The orthogonal constraint of POP encourages the orthogonality between the learnt prototypes and thus mitigates the influence on base class features when generalizing to novel prototypes. Moreover, we capitalize on the residual of feature projection as the background representation to dynamically fit semantic shifting (i.e., background no longer includes the pixels of novel classes in updating phase). Extensive experiments on two benchmarks demonstrate that our POP achieves superior performances on novel classes without sacrificing much accuracy on base classes. Notably, POP outperforms the state-of-the-art fine-tuning by 3.93% overall mIoU on PASCAL-5 i in 5-shot scenario.
科研通智能强力驱动
Strongly Powered by AbleSci AI