规范化(社会学)
机器学习
模糊逻辑
计算机科学
人工智能
模态(人机交互)
一致性(知识库)
判别式
光学(聚焦)
匹配(统计)
特征向量
模式
生成语法
模式识别(心理学)
特征学习
特征(语言学)
对比度(视觉)
对抗制
自然语言处理
生成模型
语义学(计算机科学)
实证研究
特征提取
深度学习
贝叶斯概率
模糊集
像素
建设性的
灵活性(工程)
身份(音乐)
作者
Haojie Liu,Jianyang Gu,Zhiyong Li,Mingyu Wang,Q. M. Jonathan Wu,Wei Jiang
标识
DOI:10.1109/tsmc.2025.3604832
摘要
Visible–infrared person re-identification (VI-ReID) focuses on accurately matching individuals across different imaging modalities. Existing studies focus on generating modality-consistent images at the pixel level through the use of generative adversarial networks (GANs) to mitigate the impact of modality discrepancies. However, these methods face significant challenges in overcoming the limitation that synthesized samples from different modalities may suffer from semantic distortion. In this work, we propose an online one-stage style fuzzy normalization (SFN) method to generate modality-fuzzy features in the latent space while regularizing the model’s predictions. Specifically, SFN adaptively mixes the feature statistics of two random modality instances of the same identity in a single forward pass during training. In this process, to enhance the richness of modality interaction information, we design a novel causality balance loss, which enforces the generated fuzzy features to be independent of their initial modality while simultaneously encouraging them to align more closely with the other modality. Furthermore, we introduce an identity-aware consistency loss to regularize the predictions between the original and SFN-generated features to ensure semantic consistency. In contrast to prior work, SFN is a plug-and-play module that does not rely on any generative-based models, making it highly adaptable to various network architectures. Extensive experiments were performed on three public cross-modality datasets to ensure fair and reliable comparisons. The empirical results demonstrate the clear superiority of our method over previous state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI