The official code of "CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval"
☆15Sep 19, 2024Updated last year
Alternatives and similar repositories for CaLa
Users that are interested in CaLa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Mar 31, 2025Updated last year
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 9 months ago
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆94Apr 16, 2024Updated 2 years ago
- ☆16Mar 15, 2021Updated 5 years ago
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆88Jul 4, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Pro…☆34Feb 7, 2024Updated 2 years ago
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆44Jul 14, 2024Updated last year
- ☆12Apr 12, 2026Updated 2 months ago
- [ACM MM'2024] Official repository for "Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval"☆43Dec 23, 2024Updated last year
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Sep 27, 2024Updated last year
- The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"☆174Jul 23, 2025Updated 11 months ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- Collection of Composed Image Retrieval (CIR) papers.☆356Jun 8, 2026Updated 3 weeks ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆26Aug 1, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- We are very happy that our work has been accepted by ACM Multimedia 2024!🥰☆12Jan 8, 2025Updated last year
- Tensorflow Implementation on Paper [CVPR2020]Image Search with Text Feedback by Visiolinguistic Attention Learning☆63Sep 12, 2020Updated 5 years ago
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)☆63May 26, 2024Updated 2 years ago
- Multimodal-Composite-Editing-and-Retrieval-update☆35Oct 13, 2025Updated 8 months ago
- This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"☆26Aug 24, 2023Updated 2 years ago
- The official code of "Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search"☆32Sep 15, 2025Updated 9 months ago
- AnyTrans: Translate AnyText in the Image with Large Scale Models (EMNLP2024 Findings)☆25Dec 11, 2024Updated last year
- This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.☆17Sep 2, 2025Updated 10 months ago
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024☆21May 30, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆16Jan 6, 2025Updated last year
- Code for ACL 2023 paper: Exploring Better Text Image Translation with Multimodal Codebook☆21Apr 19, 2026Updated 2 months ago
- Official PyTorch Implementation of RITC☆21Oct 26, 2021Updated 4 years ago
- Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025☆21May 8, 2026Updated last month
- [CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…☆23Jun 6, 2025Updated last year
- [IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering☆17Feb 16, 2026Updated 4 months ago
- ☆19Jul 28, 2025Updated 11 months ago
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆195Sep 5, 2023Updated 2 years ago
- Implementing ONNX runtime for android to run Segment Anything Model 2☆12Aug 1, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 基于强化学习(RL)的冰壶游戏实例; 梯度下降的Sarsa(lambda) + 非均匀径向基特征表示☆22Jul 5, 2020Updated 5 years ago
- ☆24Nov 24, 2022Updated 3 years ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆31Dec 28, 2023Updated 2 years ago
- [ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning"☆34May 7, 2025Updated last year
- CVPR 2024 Official Repository☆13Mar 27, 2024Updated 2 years ago
- A tool for Android applications to edit and display rich text easily☆21Mar 16, 2021Updated 5 years ago
- ☆170Mar 7, 2022Updated 4 years ago