Lv1
88 积分 2024-09-13 加入
Multi-scale dual-stream visual feature extraction and graph reasoning for visual question answering
2个月前
已完结
CKCR: Context-aware knowledge construction and retrieval for knowledge-based visual question answering
2个月前
已关闭
Question-guided attention and cross-modal alignment for knowledge-based visual question answering
2个月前
已关闭
Question-guided multigranular visual augmentation for knowledge-based visual question answering
2个月前
已关闭
Supporting vision-language model few-shot inference with confounder-pruned knowledge prompt
8个月前
已完结
Consistent prompt learning for vision-language models
8个月前
已关闭
A Slim Prompt-Averaged Consistency prompt learning for vision–language model
8个月前
已关闭
VIKCSE: Visual-knowledge enhanced contrastive learning with prompts for sentence embedding
8个月前
已完结
Global–local prompts guided image-text embedding, alignment and aggregation for multi-label zero-shot learning
8个月前
已关闭
Fine-grained multi-modal prompt learning for vision–language models
8个月前
已完结