Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval"
☆32Feb 5, 2025Updated last year
Alternatives and similar repositories for ChatIR
Users that are interested in ChatIR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆35Mar 24, 2025Updated last year
- [ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers☆19Apr 16, 2024Updated last year
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Sep 27, 2024Updated last year
- [NeurIPS 2019] Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries☆12Apr 15, 2022Updated 3 years ago
- Code for Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification (CVPR2025)☆40Nov 4, 2025Updated 4 months ago
- [CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant☆179Jul 7, 2025Updated 8 months ago
- This software architecture document aims to provide a detailed overview of the architecture of a ride-sharing service, including the key …☆12May 11, 2025Updated 10 months ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated 9 months ago
- Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".☆15Feb 26, 2025Updated last year
- The official repository of MM-R5☆29Jun 22, 2025Updated 9 months ago
- A Python Interface to Reproducibility Measures of System-Oriented IR Experiments☆11Dec 2, 2025Updated 3 months ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift☆38Jan 25, 2024Updated 2 years ago
- SeeSo(Eye-Tracking SDK) sample for iOS☆13Jan 5, 2024Updated 2 years ago
- ☆43Mar 6, 2024Updated 2 years ago
- a simple questionnaire Flask web app☆19May 1, 2023Updated 2 years ago
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆62Jun 6, 2025Updated 9 months ago
- ☆12Feb 2, 2024Updated 2 years ago
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated last year
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 6 months ago
- a tiny project to test the effectiveness of video QA through RAG techniques and multimodal LLMs☆15Jun 2, 2024Updated last year
- Collect data sets and research papers in the field of 3D computer vision tasks with implemented repositories.☆23Jun 26, 2020Updated 5 years ago
- ☆18Jan 11, 2024Updated 2 years ago
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆34Aug 20, 2025Updated 7 months ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆15Nov 20, 2025Updated 4 months ago
- Dataset from Tip of the Tongue Known-Item Retrieval (2021) paper.☆12Nov 4, 2021Updated 4 years ago
- Learning Python Data Visualization [video], published by Packt☆12Oct 28, 2022Updated 3 years ago
- [EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆22Mar 11, 2026Updated last week
- PySpark-based causal inference package.☆13Aug 20, 2021Updated 4 years ago
- ☆12May 20, 2019Updated 6 years ago
- Streamlit OpenAI app to chat with custom text documents of all kinds☆13Sep 26, 2024Updated last year
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆124Nov 26, 2025Updated 3 months ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated 11 months ago
- [NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale☆205Nov 13, 2023Updated 2 years ago
- Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models☆131Oct 17, 2025Updated 5 months ago
- [Under Construction]☆15Mar 29, 2020Updated 5 years ago
- This repo is reproduction resources for linear alignment paper, still working☆18May 19, 2024Updated last year
- Mixture-of-Embeddings-Experts☆121Jul 21, 2020Updated 5 years ago
- [EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank P…☆14Mar 4, 2025Updated last year