ZhaoJingjing713 / HC-RefLoCoLinks
[NeurIPS 2024] A Large-Scale Human-Centric Benchmark for Referring Expression Comprehension in the LMM Era
☆10Updated last year
Alternatives and similar repositories for HC-RefLoCo
Users that are interested in HC-RefLoCo are comparing it to the libraries listed below
Sorting:
- ☆17Updated 2 weeks ago
- Collection of Composed Image Retrieval (CIR) papers.☆243Updated this week
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆85Updated last year
- A comprehensive survey of Composed Multi-modal Retrieval (CMR), including Composed Image Retrieval (CIR) and Composed Video Retrieval (CV…☆49Updated 3 weeks ago
- [AAAI'25, CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".☆111Updated 7 months ago
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024☆18Updated last year
- [ToMM2023] - AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval☆20Updated 11 months ago
- [CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant☆138Updated last month
- This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin…☆75Updated last year
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆54Updated 2 months ago
- ☆134Updated 6 months ago
- Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos☆42Updated last year
- [CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension☆54Updated last year
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'☆18Updated last year
- ☆8Updated 8 months ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆78Updated last year
- [CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval…☆18Updated 3 months ago
- SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation☆117Updated last year
- Code for paper "LLMs Can Evolve Continually on Modality for X-Modal Reasoning" NeurIPS2024☆37Updated 7 months ago
- Composed Video Retrieval☆58Updated last year
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"☆92Updated 8 months ago
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆121Updated 2 years ago
- ☆168Updated last year
- ☆94Updated last year
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆54Updated last month
- Paper Reading of IMCC groups.☆17Updated last month
- ☆37Updated 2 years ago
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"☆54Updated 11 months ago
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection☆38Updated 5 months ago
- ☆25Updated 11 months ago