The official code of "CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval"
☆15Sep 19, 2024Updated last year
Alternatives and similar repositories for CaLa
Users that are interested in CaLa are comparing it to the libraries listed below.
- The official implementation for Candidate Set Re-ranking for Composed Image Retrieval (TMLR) 01/2024☆20Feb 7, 2024Updated 2 years ago
- ☆10Mar 31, 2025Updated last year
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 7 months ago
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆67Jul 8, 2025Updated 9 months ago
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆93Apr 16, 2024Updated last year
- ☆16Mar 15, 2021Updated 5 years ago
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆84Jul 4, 2024Updated last year
- The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Pro…☆34Feb 7, 2024Updated 2 years ago
- [CVPR 2025] This repository is intended to store the code and data for ASAP (Advancing Semantic Alignment Promotes Multi-Modal Manipulati…☆20Jun 18, 2025Updated 9 months ago
- ICLR'24 Official Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization☆75Jan 30, 2024Updated 2 years ago
- [ACM MM'2024] Official repository for "Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval"☆42Dec 23, 2024Updated last year
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Sep 27, 2024Updated last year
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- Collection of Composed Image Retrieval (CIR) papers.☆323Mar 27, 2026Updated 2 weeks ago
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)☆60May 26, 2024Updated last year
- Multimodal-Composite-Editing-and-Retrieval-update☆35Oct 13, 2025Updated 6 months ago
- The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Eva…☆204Updated this week
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆145Jan 5, 2026Updated 3 months ago
- ☆20Mar 5, 2025Updated last year
- The official code of "Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search"☆27Sep 15, 2025Updated 6 months ago
- AnyTrans: Translate AnyText in the Image with Large Scale Models (EMNLP2024 Findings)☆24Dec 11, 2024Updated last year
- This repo contains some useful metadata for the Fashion IQ challenge: https://sites.google.com/view/lingir/fashion-iq☆15Jun 28, 2019Updated 6 years ago
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024☆21May 30, 2024Updated last year
- This is a PyTorch implementation of MCLN proposed by our paper "Multi-branch Collaborative Learning Network for 3D Visual Grounding"(ECCV…☆26Oct 10, 2024Updated last year
- Official PyTorch Implementation of RITC☆21Oct 26, 2021Updated 4 years ago
- Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025☆18Jul 14, 2025Updated 9 months ago
- [ICML 2024] Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization☆23Dec 20, 2024Updated last year
- [IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering☆17Feb 16, 2026Updated last month
- [CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval☆28Mar 26, 2025Updated last year
- ☆16Jan 13, 2024Updated 2 years ago
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆194Sep 5, 2023Updated 2 years ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆54May 27, 2025Updated 10 months ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆29Dec 28, 2023Updated 2 years ago
- Composed Video Retrieval☆62May 2, 2024Updated last year
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆85Nov 12, 2024Updated last year
- Project page for End-to-end Recovery of Human Shape and Pose☆22Apr 4, 2022Updated 4 years ago
- CVPR 2024 Official Repository☆13Mar 27, 2024Updated 2 years ago
- [TMLR'24] This repository includes the official implementation of our paper "Unleashing the Power of Visual Prompting At the Pixel Level"☆42Apr 30, 2024Updated last year
- ☆165Mar 7, 2022Updated 4 years ago