计算机科学
光学(聚焦)
匹配(统计)
领域
过程(计算)
人机交互
服装
代表(政治)
人工智能
操作系统
历史
光学
法学
政治学
政治
考古
数学
物理
统计
作者
Jianghong Ma,Huiyue Sun,Dezhao Yang,Haijun Zhang
摘要
Fashion recommendation has become a prominent focus in the realm of online shopping, with various tasks being explored to enhance the customer experience. Recent research has particularly emphasized fashion recommendation based on body shapes, yet a critical aspect of incorporating multimodal data relevance has been overlooked. In this paper, we present the Contrastive Multimodal Cross-Attention Network, a novel approach specifically designed for fashion recommendation catering to diverse body shapes. By incorporating multimodal representation learning and leveraging contrastive learning techniques, our method effectively captures both inter- and intra-sample relationships, resulting in improved accuracy in fashion recommendations tailored to individual body types. Additionally, we propose a locality-aware cross-attention module to align and understand the local preferences between body shapes and clothing items, thus enhancing the matching process. Experimental results conducted on a diverse dataset demonstrate the state-of-the-art performance achieved by our approach, reinforcing its potential to significantly enhance the personalized online shopping experience for consumers with varying body shapes and preferences.
科研通智能强力驱动
Strongly Powered by AbleSci AI