zzezze / NeighborRetrLinks
Official implementation of "NeighborRetr: Balancing Hub Centrality in Cross-Modal Retrieval (CVPR 2025)"
☆26Updated 3 months ago
Alternatives and similar repositories for NeighborRetr
Users that are interested in NeighborRetr are comparing it to the libraries listed below
Sorting:
- [CVPR2025] Synthetic Data is an Elegant GIFT for Continual Vision-Language Models☆16Updated 3 weeks ago
- ☆20Updated 2 months ago
- [CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval☆25Updated 3 months ago
- [ECCV 2024 Oral] Code for our paper "A Fair Ranking and New Model for Panoptic Scene Graph Generation"☆14Updated last month
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆33Updated last week
- ☆21Updated last year
- [ICLR 2025] Official Implementation of Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection☆42Updated last week
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆17Updated 3 weeks ago
- Code for the paper "Compositional Entailment Learning for Hyperbolic Vision-Language Models".☆75Updated last month
- [CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding☆23Updated 3 months ago
- Official Repository for ICML 2024 Paper "OT-CLIP: Understanding and Generalizing CLIP via Optimal Transport"☆16Updated last year
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆39Updated 4 months ago
- PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024)☆17Updated last year
- The official repository for ECCV2024 paper "PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery"☆22Updated 3 months ago
- ☆11Updated last year
- ☆42Updated 3 weeks ago
- [CVPR2025] Official implementation of RAM☆17Updated 3 months ago
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training☆26Updated 3 months ago
- [ECCV 2024] Official repository of "GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning".☆29Updated 7 months ago
- Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)☆38Updated 2 months ago
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆40Updated last year
- ☆14Updated 4 months ago
- cliptrase☆38Updated 10 months ago
- ☆27Updated 2 years ago
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning☆36Updated 3 months ago
- The source code for "UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All"☆42Updated last year
- Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".☆13Updated 4 months ago
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation".☆20Updated last year
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆20Updated 4 months ago
- Learnable Pillar-based Re-ranking for Image-Text Retrieval. SIGIR'23☆20Updated last year