计算机科学
人工智能
计算机视觉
一般化
分割
编码器
图像分割
领域(数学)
突出
发电机(电路理论)
领域(数学分析)
模式识别(心理学)
基于分割的对象分类
域适应
尺度空间分割
特征提取
图像(数学)
机器学习
适应(眼睛)
质量(理念)
特征(语言学)
先验概率
实体造型
可视化
监督学习
作者
Zhifei Wu,Shuying Zhao,Yunzhou Zhang,Yuxin Jin
标识
DOI:10.1109/tim.2025.3548183
摘要
Industrial defect segmentation plays a crucial role in ensuring quality and safety within manufacturing processes. Previous research in this field has focused on training models with limited available data to enable the segmentation of new classes. However, the limited and singular training data lead to limited feature extraction capabilities of the models, hindering their generalization to other industrial scenarios. Recently, the segment anything model (SAM), an interactive visual segmentation foundation model, demonstrates remarkable zero-shot generalization capabilities. However, its performance in the specific domain of industrial defect detection is relatively poor. Therefore, we propose an approach that leverages visual foundation model to achieve few-shot defect segmentation. This method extracts prior knowledge from example images, derives multitype robust prompts from the prior knowledge, and fine-tunes SAM with few-shot learning paradigm and low-rank adaptation (LoRA) strategy. Specifically, given a few support image-mask pairs, foreground and background prototypes are extracted, and their similarities with the query image are computed to obtain prototype prior probability maps. Subsequently, a prototype prompt generator specifically for defect data extracts salient information from these prior probability maps, producing robust prototype prompts. These prototype prompts are then used to fine-tune the LoRA layers in SAM’s image encoder and the lightweight mask decoder of SAM, enabling SAM to adapt to the few-shot defect segmentation task. Experimental results demonstrate that our proposed method outperforms state-of-the-art few-shot defect segmentation techniques and can be generalized to various industrial scenarios, exhibiting significant application value.
科研通智能强力驱动
Strongly Powered by AbleSci AI