Lv4
490 points, joined 2023-10-02
Fast, Accurate, and Lightweight Memory-Enhanced Embedding Learning Framework for Image-Text Retrieval
9 hours ago
Completed
IDseq: Decoupled and Sequentially Detecting and Grounding Multi-Modal Media Manipulation
9 hours ago
Awaiting confirmation
OTE: Exploring Accurate Scene Text Recognition Using One Token
9 hours ago
Awaiting confirmation
DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection
9 hours ago
Awaiting confirmation
TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model
9 hours ago
Completed
Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling
9 hours ago
Awaiting confirmation
DHVT: Dynamic Hybrid Vision Transformer for Small Dataset Recognition
9 hours ago
Awaiting confirmation
ReferSAM: Unleashing Segment Anything Model for Referring Image Segmentation
9 hours ago
Completed
High Fidelity Face Swapping via Facial Texture and Structure Consistency Mining
9 hours ago
Completed
A Detail-Aware Transformer to Generalizable Face Forgery Detection
9 hours ago
Completed