levymsn/ChatIR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/levymsn/ChatIR)

levymsn / ChatIR

Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval"

☆32

Alternatives and similar repositories for ChatIR

Users that are interested in ChatIR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kevinliang888 / IVR-QA-baselines
View on GitHub
[ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers
☆20Apr 16, 2024Updated 2 years ago
Code-kunkun / LamRA
View on GitHub
[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
☆182Jul 7, 2025Updated last year
uvavision / DrillDown
View on GitHub
[NeurIPS 2019] Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries
☆12Apr 15, 2022Updated 4 years ago
XLearning-SCU / LLaVA-ReID
View on GitHub
Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification
☆107Nov 20, 2025Updated 8 months ago
Saehyung-Lee / DCC
View on GitHub
This repository is the official implementation of Dataset Condensation with Contrastive Signals (DCC), accepted at ICML 2022.
☆22Jun 8, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
dvirsamuel / PDM
View on GitHub
Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".
☆14Feb 26, 2025Updated last year
Owen-Liuyuxuan / kitti360_visualize
View on GitHub
Visualize KITTI360 sequences on ROS with full tf support.
☆10Apr 21, 2023Updated 3 years ago
GasolSun36 / MVP
View on GitHub
Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning
☆24Sep 9, 2024Updated last year
vlevit / q10r
View on GitHub
a simple questionnaire Flask web app
☆19May 1, 2023Updated 3 years ago
aayushkubb / nlp
View on GitHub
Curated List of NLP tutorials
☆30Feb 27, 2025Updated last year
Yansz / 3D-Computer-Vision-Research
View on GitHub
Collect data sets and research papers in the field of 3D computer vision tasks with implemented repositories.
☆23Jun 26, 2020Updated 6 years ago
BUAADreamer / SPN4CIR
View on GitHub
[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
☆39Sep 9, 2025Updated 10 months ago
XLearning-SCU / 2024-IJCV-LCNL
View on GitHub
☆12Feb 2, 2024Updated 2 years ago
QinYang79 / Awesome-Noisy-Correspondence
View on GitHub
This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…
☆86May 24, 2026Updated last month
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
QinYang79 / ICL
View on GitHub
Human-centered Interactive Learning via MLLMs for Text-to-Image Person Re-identification (CVPR 2025 Pytorch Code)
☆49Jul 19, 2025Updated last year
MPI-Lab / HAM
View on GitHub
Code for Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification (CVPR2025)
☆50Nov 4, 2025Updated 8 months ago
TianXiaoRui / Semantic-Mapping-for-ORB-SLAM
View on GitHub
☆12May 20, 2019Updated 7 years ago
uestc-xyh / ComqueryFormer
View on GitHub
☆11Nov 28, 2022Updated 3 years ago
antoyang / VidChapters
View on GitHub
[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
☆211Nov 13, 2023Updated 2 years ago
arijitray1993 / COLA
View on GitHub
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25May 14, 2026Updated 2 months ago
gabrielchua / embedding-adapter
View on GitHub
A lightweight open-source package to fine-tune embedding models.
☆22Feb 4, 2024Updated 2 years ago
dhk1349 / MERLIN_text_to_video_search
View on GitHub
[EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank P…
☆14Mar 4, 2025Updated last year
salmank255 / ROAD_Waymo_Baseline
View on GitHub
☆17Jan 30, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Pter61 / context-i2w
View on GitHub
Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]
☆54May 27, 2025Updated last year
appletea233 / Temporal-R1
View on GitHub
Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency
☆62Jun 6, 2025Updated last year
pengts / VW-LMM
View on GitHub
☆25May 13, 2024Updated 2 years ago
antoine77340 / Mixture-of-Embedding-Experts
View on GitHub
Mixture-of-Embeddings-Experts
☆122Jul 21, 2020Updated 6 years ago
DeployQL / awesome-multi-vector
View on GitHub
A list of multi-vector retrieval resources
☆19May 29, 2024Updated 2 years ago
clear-nus / MuMMI
View on GitHub
Multi-Modal Mutual Information (MuMMI) Training for Robust Self-Supervised Deep Reinforcement Learning
☆13Jun 28, 2022Updated 4 years ago
xianzhangzx / FINER-MLLM
View on GitHub
The implementation of FINER-MLLM, which is accepted by MM2024.
☆18Oct 8, 2024Updated last year
leolee99 / PAU
View on GitHub
[NeurIPS 2023] The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" acce…
☆28May 14, 2024Updated 2 years ago
hi-zhenyu / MvSCN
View on GitHub
☆18Feb 20, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ChocoWu / SeTok
View on GitHub
Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM
☆81Apr 19, 2025Updated last year
shubhrampandey / coronaVirus-dataViz
View on GitHub
Corona Virus Data Visuzalization Platform
☆17May 15, 2021Updated 5 years ago
XLearning-SCU / 2026-CVPR-BML
View on GitHub
[CVPR 2026] Pytorch Code for the paper "Bootstrapping Multi-view Learning for Test-time Noisy Correspondence"
☆15Jul 1, 2026Updated 3 weeks ago
TidbitsJS / Learn-RN
View on GitHub
☆15Jun 22, 2022Updated 4 years ago
AvishakeAdhikary / Text-To-Image-Generator
View on GitHub
Python GUI application that generates images based on user prompts using the StableDiffusionPipeline model from the diffusers module. The…
☆14May 28, 2023Updated 3 years ago
jyliu-98 / MoSketch
View on GitHub
[ICCV 2025] This repo is the official implementation of "Multi-Object Sketch Animation by Scene Decomposition and Motion Planning"
☆28Jul 30, 2025Updated 11 months ago
perfectyayra / Algo-Trading-Project
View on GitHub
☆10Nov 6, 2018Updated 7 years ago