Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval"
☆33Feb 5, 2025Updated last year
Alternatives and similar repositories for ChatIR
Users that are interested in ChatIR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆35Mar 24, 2025Updated last year
- SIGIR paper Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback☆14Oct 17, 2022Updated 3 years ago
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆103Nov 20, 2025Updated 6 months ago
- [ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers☆19Apr 16, 2024Updated 2 years ago
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Sep 27, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for the CVPR 2020 article "Adversarial Vertex mixup: Toward Better Adversarially Robust Generalization"☆12Jul 13, 2020Updated 5 years ago
- [CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant☆181Jul 7, 2025Updated 10 months ago
- This software architecture document aims to provide a detailed overview of the architecture of a ride-sharing service, including the key …☆12May 11, 2025Updated last year
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated 11 months ago
- The official repository of MM-R5☆29Jun 22, 2025Updated 11 months ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- Visualize KITTI360 sequences on ROS with full tf support.☆10Apr 21, 2023Updated 3 years ago
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆62Jun 6, 2025Updated 11 months ago
- ☆12Feb 2, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 8 months ago
- a tiny project to test the effectiveness of video QA through RAG techniques and multimodal LLMs☆15Jun 2, 2024Updated last year
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆83May 12, 2026Updated last week
- Collect data sets and research papers in the field of 3D computer vision tasks with implemented repositories.☆23Jun 26, 2020Updated 5 years ago
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆35Aug 20, 2025Updated 9 months ago
- Ranking of fine-tuned HF models as base models.☆36Sep 17, 2025Updated 8 months ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆16Nov 20, 2025Updated 6 months ago
- ACMMM 2025☆17Dec 11, 2025Updated 5 months ago
- ☆12May 20, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Streamlit OpenAI app to chat with custom text documents of all kinds☆13Apr 11, 2026Updated last month
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆133May 12, 2026Updated last week
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- [NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale☆208Nov 13, 2023Updated 2 years ago
- Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models☆135Oct 17, 2025Updated 7 months ago
- [EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆26Mar 30, 2026Updated last month
- Implementation of PWOC-3D network for end-to-end stereo scene flow estimation☆13Oct 19, 2023Updated 2 years ago
- [Under Construction]☆15Mar 29, 2020Updated 6 years ago
- This repo is reproduction resources for linear alignment paper, still working☆18May 19, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Nov 28, 2022Updated 3 years ago
- [EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank P…☆14Mar 4, 2025Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25May 14, 2026Updated last week
- Mixture-of-Embeddings-Experts☆121Jul 21, 2020Updated 5 years ago
- ☆17Jan 30, 2024Updated 2 years ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆54May 27, 2025Updated 11 months ago
- This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (…☆28Aug 8, 2022Updated 3 years ago