Lv11
8 积分 2024-12-23 加入
Multimodal Foundation Models: From Specialists to General-Purpose Assistants
6小时前
已完结
From Structure to Synergy: A Survey of Vision-Language Perception Paradigm Evolution in Multimodal Large Language Models
6小时前
求助中
Bottom-up color-independent alignment learning for text–image person re-identification
3个月前
已完结
Attribute-Centric Cross-Modal Alignment for Weakly Supervised Text-Based Person Re-ID
3个月前
已完结
UCPM: Uncertainty-Guided Cross-Modal Retrieval with Partially Mismatched Pairs
4个月前
已完结
Cross-Modal Alignment Enhancement Network for Text-to-Image Person Re-identification
5个月前
已完结
RMGNet: The Progressive Relationship-Mining Graph Neural Network for Text-to-image Person Re-identification
5个月前
已完结
Towards unified bijective image-text generation for text-to-image person re-identification
5个月前
已完结
Mask-Aware Hierarchical Aggregation Transformer for Occluded Person Re-Identification
5个月前
已完结
Occluded Person Reidentification via a Universal Framework With Difference Consistency Guidance Learning
5个月前
已完结