Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval"
☆31Feb 5, 2025Updated last year
Alternatives and similar repositories for ChatIR
Users that are interested in ChatIR are comparing it to the libraries listed below
Sorting:
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆34Mar 24, 2025Updated 11 months ago
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆96Nov 20, 2025Updated 3 months ago
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Sep 27, 2024Updated last year
- [ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers☆18Apr 16, 2024Updated last year
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆60Jun 6, 2025Updated 8 months ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- Collect data sets and research papers in the field of 3D computer vision tasks with implemented repositories.☆23Jun 26, 2020Updated 5 years ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆56May 27, 2025Updated 9 months ago
- [CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant☆178Jul 7, 2025Updated 7 months ago
- Curated List of NLP tutorials☆30Feb 27, 2025Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Code for Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification (CVPR2025)☆38Nov 4, 2025Updated 3 months ago
- [NeurIPS 2023] The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" acce…☆27May 14, 2024Updated last year
- PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurlPS 2024.☆34Sep 26, 2024Updated last year
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 5 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆77Nov 7, 2025Updated 3 months ago
- Abnormal Activity Detection using Deep Learning LRCN is a model that combines CNN and RNN to identify abnormal behavior in videos. With r…☆10Sep 22, 2023Updated 2 years ago
- official code for "Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval"☆42Jul 4, 2025Updated 7 months ago
- [NeurIPS 2025] Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale perso…☆73Oct 20, 2025Updated 4 months ago
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift☆38Jan 25, 2024Updated 2 years ago
- ☆10Nov 6, 2018Updated 7 years ago
- Python GUI application that generates images based on user prompts using the StableDiffusionPipeline model from the diffusers module. The…☆14May 28, 2023Updated 2 years ago
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆92Apr 16, 2024Updated last year
- ☆14Jul 2, 2023Updated 2 years ago
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer☆37Oct 18, 2023Updated 2 years ago
- This project combines logistic regression, gradient boosting, and LSTMs to predict next-month returns.☆13Sep 25, 2019Updated 6 years ago
- Cell2location paper - Comprehensive mapping of tissue cell architecture via integrated single cell and spatial transcriptomics☆15Nov 26, 2022Updated 3 years ago
- The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"☆12Oct 20, 2024Updated last year
- Notes and code for Programming Massively Parallel Processors☆13Mar 29, 2025Updated 11 months ago
- CLV prediction with pareto-NBD model☆12Jul 1, 2016Updated 9 years ago
- how to build a sentence embedding application using BentoML☆14Mar 31, 2025Updated 11 months ago
- My Data Engineering project @ Insight Data Science☆10Jul 23, 2018Updated 7 years ago
- auto ticket reservation program (python)☆12Jan 28, 2020Updated 6 years ago
- Multi-Person Tracking in Tour Guide Robot☆10Aug 23, 2022Updated 3 years ago
- A digital twin of the city of Chicago along with automated sensors☆12Nov 14, 2019Updated 6 years ago
- ☆12Feb 2, 2024Updated 2 years ago
- Sample and Computation Redistribution for Efficient Face Detection☆16May 13, 2024Updated last year
- MXNet-Gluon model to Caffe (support SSD in gluoncv)☆10Jun 20, 2019Updated 6 years ago
- ☆12Nov 19, 2024Updated last year