hmchuong / CoLLM
[CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval
☆18 Updated last month
Alternatives and similar repositories for CoLLM:
Users who are interested in CoLLM are comparing it to the repositories listed below:
- An official repo for WACV 2025 paper "LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spa… ☆16 Updated 3 months ago
- Open-vocabulary Semantic Segmentation ☆34 Updated last year
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling. ☆19 Updated 2 months ago
- AMES: Asymmetric and Memory-Efficient Similarity ☆30 Updated 6 months ago
- Open-Vocabulary Panoptic Segmentation ☆23 Updated 8 months ago
- ☆9 Updated 2 months ago
- [CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models" ☆34 Updated 3 weeks ago
- ILIAS: Instance-Level Image retrieval At Scale ☆22 Updated last month
- ☆31 Updated last year
- ICLR'24 Official Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization ☆72 Updated last year
- ☆65 Updated last year
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation. ☆42 Updated 4 months ago
- VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation ☆26 Updated 7 months ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation ☆37 Updated last year
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval" ☆65 Updated 10 months ago
- ☆30 Updated 7 months ago
- ☆58 Updated last year
- Official PyTorch Implementation of Revisiting Self-Similarity: Structural Embedding for Image Retrieval, CVPR 2023 ☆67 Updated last year
- Learnable Pillar-based Re-ranking for Image-Text Retrieval. SIGIR'23 ☆20 Updated last year
- ☆12 Updated 4 months ago
- ☆16 Updated 6 months ago
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification ☆20 Updated 3 weeks ago
- [CVPR2025] Official implementation of RAM ☆14 Updated last month
- ☆27 Updated 3 years ago
- ☆21 Updated last year
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples ☆23 Updated 5 months ago
- ☆55 Updated 2 weeks ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning ☆30 Updated last year
- ☆28 Updated 3 months ago
- (NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models" ☆27 Updated last month