VLCIM: A Vision-Language Cyclic Interaction Model for Industrial Defect Detection

计算机视觉 计算机科学 人工智能 机器视觉 自然语言处理
作者
Xiangkai Shen,Lei Li,Yushan Ma,Shaofeng Xu,Jinhai Liu,Zhiming Yang,Yan Shi
出处
期刊:IEEE Transactions on Instrumentation and Measurement [Institute of Electrical and Electronics Engineers]
卷期号:74: 1-13 被引量:41
标识
DOI:10.1109/tim.2025.3583364
摘要

Accurate defect detection is an important element in ensuring product quality and safe equipment operation. However, due to the lack of deep cross-modal interactions during vision feature extraction, existing methods often suffer from attention bias, which ultimately limits detection accuracy. To address this issue, this paper proposes a Vision-Language Cyclic Interaction Model (VLCIM), which progressively optimizes vision feature extraction by integrating domain prior knowledge and generic large model, effectively bridging the dual-domain barrier between “generic-specific” and “vision-language”. Specifically, progressive cyclic interaction learning is proposed for the first time, which integrates a recursive guidance module (RGM) and cross-modal interaction (CMI) strategy to realize bidirectional dynamic fusion and collaborative optimization of vision and language features. Furthermore, the proposed dual-view synergistic detection mechanism enhances discriminative decision responses, significantly improving the model’s boundary perception ability and decision-making accuracy in complex scenarios. VLCIM achieves high-precision defect detection by establishing a cyclic interaction mechanism between domain-specific language features and vision representations. Experimental results on three industrial datasets demonstrate that VLCIM achieves improvements of 5.9%, 5.6%, and 4.1% in mIoU over the state-of-the-art (SOTA) methods, indicating its validity and generalization in different scenarios.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
刚刚
刚刚
Orange应助百里烬言采纳,获得10
1秒前
3秒前
4秒前
烟花应助刘凯采纳,获得10
4秒前
小栗发布了新的文献求助10
5秒前
胡子发布了新的文献求助10
5秒前
7秒前
万能图书馆应助HOPE采纳,获得10
7秒前
李健的小迷弟应助阿秋采纳,获得10
7秒前
geenhui发布了新的文献求助10
8秒前
科研通AI6.1应助周周采纳,获得30
9秒前
10秒前
漂亮忆南发布了新的文献求助10
10秒前
红白刀发布了新的文献求助10
10秒前
10秒前
Ppao7ii发布了新的文献求助10
11秒前
xhc发布了新的文献求助10
11秒前
12秒前
吃货发布了新的文献求助10
14秒前
14秒前
CQ完成签到,获得积分10
15秒前
16秒前
wang完成签到,获得积分10
16秒前
很困发布了新的文献求助10
17秒前
tinna完成签到,获得积分10
17秒前
bai发布了新的文献求助10
17秒前
嘿嘿嘿完成签到,获得积分10
18秒前
Orange应助科研通管家采纳,获得10
19秒前
CipherSage应助科研通管家采纳,获得10
19秒前
Ppao7ii完成签到,获得积分10
19秒前
顾矜应助科研通管家采纳,获得10
19秒前
19秒前
cdercder应助科研通管家采纳,获得10
19秒前
小马甲应助科研通管家采纳,获得10
19秒前
香蕉觅云应助科研通管家采纳,获得10
19秒前
19秒前
大模型应助科研通管家采纳,获得10
19秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
Petrology and Plate Tectonics,2025 450
Burger's Medicinal Chemistry and Drug Discovery 400
New directions for experimental lessons in science teaching: Myth, Mystery, Necessity? by Emily K. da Silva Cunha Souto (Author), Flávia Lins Silva (Author) 333
Scientific experimentation in the classroom: Comparison between genetic-Socratic-exemplary teaching and workshop teaching by Ingrid Hofer (Author) 333
Programming for Chemical Engineers Using C, C++, and MATLAB 320
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6723184
求助须知:如何正确求助?哪些是违规求助? 8459093
关于积分的说明 18059109
捐赠科研通 5976533
什么是DOI,文献DOI怎么找? 2996942
邀请新用户注册赠送积分活动 1973167
关于科研通互助平台的介绍 1927559