hmchuong / CoLLM
[CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval
☆12 · Updated 3 weeks ago
Alternatives and similar repositories for CoLLM:
Users who are interested in CoLLM are comparing it to the repositories listed below:
- Visual Place Recognition ☆14 · Updated 4 months ago
- This repo contains the code and data of "Graph Matching with Bi-level Noisy Correspondence". ☆20 · Updated last year
- Open-Vocabulary Panoptic Segmentation ☆23 · Updated 7 months ago
- The Official Implementation of CFCD. Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval ☆30 · Updated last year
- Official implementation of NeurIPS 2021 paper "Contextual Similarity Aggregation with Self-attention for Visual Re-ranking" ☆25 · Updated 3 years ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives" ☆37 · Updated 4 months ago
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling. ☆18 · Updated last month
- Visual geolocation using segment cross-attention ☆23 · Updated 2 years ago
- An official repo for WACV 2025 paper "LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spa… ☆16 · Updated 2 months ago
- Code for CVPR 2025 "MMRL: Multi-Modal Representation Learning for Vision-Language Models". ☆27 · Updated last week
- Learnable Pillar-based Re-ranking for Image-Text Retrieval. SIGIR'23 ☆19 · Updated last year
- Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features" ☆17 · Updated last month
- AMES: Asymmetric and Memory-Efficient Similarity ☆26 · Updated 5 months ago
- PyTorch implementation of the Dark Side Augmentation ☆8 · Updated 8 months ago
- Framework for computationally efficient training of universal image feature extraction models. ☆20 · Updated 8 months ago
- Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation" ☆27 · Updated last year
- Official PyTorch Implementation for "ProGEO: Generating Prompts through Image-Text Contrastive Learning For Visual Geo-localization, ICAN… ☆50 · Updated 7 months ago
- [CVPR 2025 Highlight] Official PyTorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models" ☆32 · Updated last week
- [CVPR 2025] Few-shot Recognition via Stage-Wise Retrieval-Augmented Finetuning ☆14 · Updated 2 weeks ago
- An official repo for the ECCV 2024 paper "Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching" ☆80 · Updated 2 months ago
- Resolution Asymmetric Metric Learning ☆16 · Updated last year
- Open-vocabulary Semantic Segmentation ☆34 · Updated last year
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model ☆49 · Updated 4 months ago