aneeshan95 / Sketch_LVM
Project page for the paper 'CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not'
☆59Updated last year
Related projects:
- ZSE-SBIR☆45Updated 10 months ago
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion☆143Updated 4 months ago
- Code and Dataset for FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context.☆17Updated last year
- Open Vocabulary Semantic Scene Sketch Understanding☆22Updated 2 months ago
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆37Updated 2 months ago
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning☆118Updated last year
- The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Pro…☆23Updated 7 months ago
- ICLR'24 Official Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization☆59Updated 7 months ago
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆47Updated last month
- Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)☆36Updated 10 months ago
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning☆36Updated last year
- Easy-to-use, efficient code for extracting OpenAI CLIP (Global/Grid) features from images and text.☆104Updated 2 years ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆36Updated 5 months ago
- PyTorch Implementation of TCN (T-PAMI2021)☆9Updated 2 years ago
- Official implementation of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"☆44Updated last month
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆91Updated 2 months ago
- [CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》☆148Updated last year
- [BMVC 2023] Zero-shot Composed Text-Image Retrieval☆42Updated last year
- Official implementation of Data-Free Sketch-Based Image Retrieval, CVPR 2023.☆24Updated last year
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆91Updated last year
- Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)☆50Updated 3 months ago
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆156Updated last year
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆55Updated 5 months ago
- Code for ECCV 2022 Workshop paper "See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval"☆17Updated last year
- [CVPR 2022] Visual Abductive Reasoning☆113Updated 2 years ago