billpsomas / icirLinks
This repository contains the official implementation code of NeurIPS 2025 paper: "Instance-Level Composed Image Retrieval".
☆49Updated last month
Alternatives and similar repositories for icir
Users that are interested in icir are comparing it to the libraries listed below
Sorting:
- This repo contains the official implementation of ICLR 2022 paper "It Takes Two to Tango: Mixup for Deep Metric Learning".☆36Updated last year
- ILIAS: Instance-Level Image retrieval At Scale☆34Updated 4 months ago
- [CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval☆35Updated 4 months ago
- ☆56Updated 5 months ago
- Official PyTorch implementation of the WACV 2025 Oral paper "Composed Image Retrieval for Training-FREE DOMain Conversion".☆46Updated 5 months ago
- AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024)☆34Updated last year
- Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆40Updated last year
- ☆12Updated last year
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆192Updated 2 years ago
- TIPS (ICLR'25): Text-Image Pretraining with Spatial Awareness☆117Updated 10 months ago
- AMES: Asymmetric and Memory-Efficient Similarity☆46Updated 5 months ago
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)☆136Updated 9 months ago
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation☆111Updated 10 months ago
- An open source implementation of CLIP (With TULIP Support)☆165Updated 8 months ago
- ☆17Updated 11 months ago
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation☆47Updated last year
- [CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.☆31Updated last year
- A large-scale benchmark for the evaluation of embeddings across a number of fine-grained and instance-level visual domains.☆17Updated last year
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆83Updated last year
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆83Updated 6 months ago
- Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".☆15Updated 11 months ago
- [CVPR 2024 Best paper award candidate] EGTR: Extracting Graph from Transformer for Scene Graph Generation☆140Updated last year
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆142Updated last month
- The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Pro…☆34Updated 2 years ago
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆18Updated 2 years ago
- [CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval☆28Updated 10 months ago
- [ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning…☆29Updated 11 months ago
- [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation☆64Updated 6 months ago
- FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)☆49Updated last year
- [TMLR 2025 J2C] TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models☆51Updated last month