变压器
计算机科学
嵌入
织物
人工智能
卷积(计算机科学)
鉴定(生物学)
纤维
编码器
计算机视觉
服装
模式识别(心理学)
材料科学
工程类
人工神经网络
电气工程
操作系统
历史
生物
复合材料
植物
电压
考古
作者
Luoli Xu,F. Li,Shan Chang
标识
DOI:10.1177/00405175231194797
摘要
Simple, fast and effective fiber identification can help consumers purchase their desired apparel and help the industry conduct large-scale textile testing. This paper presents a transformer architecture incorporating convolutions to recognize fibers in textile surface images, which meets the above requirements. Firstly, a convolution operation is performed on textile images to pick up overlapping patches as tokens and the linear projections in transformer encoders are replaced by depth-wise separable convolutions to extract the fiber representations. Secondly, the multi-head cross-attention module enables each label embedding to be compared with features at each spatial location to locate and pool the corresponding fiber characteristics. Finally, a simplified asymmetric loss is introduced to further purify the extracted fiber features. Experiments demonstrate that the proposed approach provides a significant improvement in fiber identification accuracy over both state-of-the-art multi-label classification frameworks and fiber identification architectures.
科研通智能强力驱动
Strongly Powered by AbleSci AI