Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval"
☆32Feb 5, 2025Updated last year
Alternatives and similar repositories for ChatIR
Users that are interested in ChatIR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆34Mar 24, 2025Updated last year
- SIGIR paper Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback☆14Oct 17, 2022Updated 3 years ago
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆105Nov 20, 2025Updated 6 months ago
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Sep 27, 2024Updated last year
- [CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant☆183Jul 7, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated last year
- Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".☆14Feb 26, 2025Updated last year
- The official repository of MM-R5☆29Jun 22, 2025Updated 11 months ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- SeeSo(Eye-Tracking SDK) sample for iOS☆13Jan 5, 2024Updated 2 years ago
- This projects entails performing in-depth descriptive analysis and data visualization on United Kingdom Road Traffic and Accident dataset…☆11Jul 20, 2020Updated 5 years ago
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆62Jun 6, 2025Updated last year
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated last year
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- a tiny project to test the effectiveness of video QA through RAG techniques and multimodal LLMs☆15Jun 2, 2024Updated 2 years ago
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆37Aug 20, 2025Updated 9 months ago
- ☆19Jan 11, 2024Updated 2 years ago
- Ranking of fine-tuned HF models as base models.☆36Sep 17, 2025Updated 8 months ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆16Nov 20, 2025Updated 6 months ago
- Dataset from Tip of the Tongue Known-Item Retrieval (2021) paper.☆12Nov 4, 2021Updated 4 years ago
- Streamlit OpenAI app to chat with custom text documents of all kinds☆13Apr 11, 2026Updated 2 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆136May 23, 2026Updated 2 weeks ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale☆208Nov 13, 2023Updated 2 years ago
- Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models☆135Oct 17, 2025Updated 7 months ago
- [EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆25Mar 30, 2026Updated 2 months ago
- Implementation of PWOC-3D network for end-to-end stereo scene flow estimation☆13Oct 19, 2023Updated 2 years ago
- [Under Construction]☆15Mar 29, 2020Updated 6 years ago
- ☆11Nov 28, 2022Updated 3 years ago
- [EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank P…☆14Mar 4, 2025Updated last year
- Mixture-of-Embeddings-Experts☆121Jul 21, 2020Updated 5 years ago
- ☆17Jan 30, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (…☆28Aug 8, 2022Updated 3 years ago
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning☆36Jan 30, 2026Updated 4 months ago
- ☆15Dec 7, 2021Updated 4 years ago
- Official Implementation of Visual Abstraction: A Plug-and-Play Approach for Text-Visual Retrieval☆26Jul 14, 2025Updated 10 months ago
- ☆25May 13, 2024Updated 2 years ago
- ☆24May 8, 2024Updated 2 years ago
- 2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记☆24Oct 6, 2018Updated 7 years ago