稳健性(进化)
跳跃式监视
帕斯卡(单位)
融合
计算机科学
利用
传感器融合
人工智能
精确性和召回率
计算机视觉
数据建模
目标检测
光学(聚焦)
机器学习
深度学习
数据挖掘
模式识别(心理学)
相似性(几何)
图像融合
多模态
最小边界框
特征提取
作者
Huiyan Wang,Ruihao Peng,Ming Ying,Fashuai Li,Jiuyi Zhang,Xiaolan Li,Yan Tian,Guofeng Zhang
标识
DOI:10.1109/tii.2025.3606880
摘要
Detecting small defects in industrial biaxially oriented polypropylene (BOPP) films is challenging due to their limited visual cues, low contrast, and high similarity to background textures. Existing deep learning-based methods, which rely solely on visual data, often suffer from high miss rates and poor classification accuracy in such complex scenarios. To address these limitations, we propose the multimodal fusion-based film small defect detection (MFF-SDD) model, specifically designed for small defect detection in BOPP film production. Unlike conventional language-image models that neglect the bidirectional interaction between modalities, MFF-SDD introduces the bidirectional guidance multimodal cross-fusion module, which enhances visual–textual integration through mutual guidance and attention mechanisms, enabling more effective foreground focus and background suppression. Furthermore, the text and image cross-modality neighbors multiscale fusion module employs multiscale cross-modal fusion to preserve fine-grained details and exploit complementary features from both modalities, improving detection accuracy and reducing misclassification of small defects. We also present the film small defect (FSD) dataset, comprising 10 385 annotated bounding boxes across seven defect categories. Experimental results show that MFF-SDD surpasses state-of-the-art methods by 2.31% in average precision and 3.75% in recall on the FSD dataset and achieves leading performance on public benchmarks, such as PASCAL VOC and TinyPerson. These findings demonstrate the effectiveness and robustness of our approach in multimodal small defect detection. Our dataset and code will be made publicly available upon acceptance of this article for publication.
科研通智能强力驱动
Strongly Powered by AbleSci AI