ExplainableML/Vision_by_Language

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ExplainableML/Vision_by_Language)

ExplainableML / Vision_by_Language

[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"

☆89

Alternatives and similar repositories for Vision_by_Language

Users that are interested in Vision_by_Language are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

youngkyunJang / VDG
View on GitHub
Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024
☆21May 30, 2024Updated 2 years ago
Pter61 / osrcir
View on GitHub
Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]
☆72Jul 8, 2025Updated last year
miccunifi / CIRCO
View on GitHub
[ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset
☆87Aug 6, 2025Updated 11 months ago
navervision / lincir
View on GitHub
Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)
☆148Jan 5, 2026Updated 6 months ago
navervision / CompoDiff
View on GitHub
Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)
☆88Feb 2, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
iLearn-Lab / TOIS25-Awesome-Composed-Image-Retrieval
View on GitHub
Collection of Composed Image Retrieval (CIR) papers.
☆360Jun 8, 2026Updated last month
chunmeifeng / SPRC
View on GitHub
【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval
☆94Apr 16, 2024Updated 2 years ago
miccunifi / SEARLE
View on GitHub
[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion
☆198Jul 31, 2025Updated 11 months ago
ExplainableML / EgoCVR
View on GitHub
[ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval
☆41Apr 11, 2025Updated last year
google-research / composed_image_retrieval
View on GitHub
☆197May 9, 2026Updated 2 months ago
Pter61 / context-i2w
View on GitHub
Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]
☆54May 27, 2025Updated last year
OmkarThawakar / composed-video-retrieval
View on GitHub
Composed Video Retrieval
☆62May 2, 2024Updated 2 years ago
Chiangsonw / CaLa
View on GitHub
The official code of "CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval"
☆15Sep 19, 2024Updated last year
sotayang / SEIZE
View on GitHub
[ACM MM'2024] Official repository for "Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval"
☆43Jul 2, 2026Updated 2 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
sotayang / LDRE
View on GitHub
[SIGIR'2024 Best Paper Honorable Mention] Official repository for "LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Compose…
☆104Jul 2, 2026Updated 2 weeks ago
facebookresearch / genecis
View on GitHub
Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"
☆61Jun 12, 2023Updated 3 years ago
ABaldrati / CLIP4Cir
View on GitHub
[ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features
☆195Sep 5, 2023Updated 2 years ago
Cuberick-Orion / CIRR
View on GitHub
Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
☆136Oct 17, 2025Updated 9 months ago
Code-kunkun / ZS-CIR
View on GitHub
[BMVC 2023] Zero-shot Composed Text-Image Retrieval
☆55Nov 26, 2024Updated last year
naver / artemis
View on GitHub
Official code release for ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity (published at ICLR 2022)
☆52Feb 9, 2023Updated 3 years ago
lucas-ventura / CoVR
View on GitHub
Official PyTorch implementation of the paper "CoVR: Learning Composed Video Retrieval from Web Video Captions".
☆119Apr 21, 2026Updated 3 months ago
google-deepmind / magiclens
View on GitHub
[ICML'24 Oral] "MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions"
☆211Oct 28, 2024Updated last year
iLearn-Lab / SIGIR24-FTI4CIR
View on GitHub
Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval
☆27Apr 9, 2026Updated 3 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
suoych / KEDs
View on GitHub
Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)
☆20Nov 4, 2024Updated last year
vl2g / CSTBIR
View on GitHub
Official Code for Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
☆15Dec 27, 2023Updated 2 years ago
bladewaltz1 / PromptSwitch
View on GitHub
☆30Aug 14, 2023Updated 2 years ago
LorenzoGianassi / Land-Diffuser
View on GitHub
The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…
☆13Dec 23, 2023Updated 2 years ago
wangbohan97 / MPS
View on GitHub
☆13Jul 5, 2024Updated 2 years ago
Cuberick-Orion / Candidate-Reranking-CIR
View on GitHub
The official implementation for Candidate Set Re-ranking for Composed Image Retrieval (TMLR) 01/2024
☆20Feb 7, 2024Updated 2 years ago
dhg-wei / MCL
View on GitHub
(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
☆28Sep 27, 2024Updated last year
iLearn-Lab / SIGIR24-DQU-CIR
View on GitHub
[SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval
☆44Jul 14, 2024Updated 2 years ago
facebookresearch / diht
View on GitHub
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
☆141Dec 16, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
UCSB-AI / ComCLIP
View on GitHub
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
☆37Aug 18, 2024Updated last year
arijitray1993 / COLA
View on GitHub
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25May 14, 2026Updated 2 months ago
Monoxide-Chen / uncertainty_retrieval
View on GitHub
ICLR‘24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization
☆75Jan 30, 2024Updated 2 years ago
BUAADreamer / SPN4CIR
View on GitHub
[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
☆39Sep 9, 2025Updated 10 months ago
ozmig77 / dcnet
View on GitHub
☆16Mar 15, 2021Updated 5 years ago
fuxianghuang1 / Multimodal-Composite-Editing-and-Retrieval
View on GitHub
Multimodal-Composite-Editing-and-Retrieval-update
☆35Oct 13, 2025Updated 9 months ago
Paranioar / RCAR
View on GitHub
[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”
☆34Apr 11, 2024Updated 2 years ago