[EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information
☆12Oct 11, 2024Updated last year
Alternatives and similar repositories for SURf
Users that are interested in SURf are comparing it to the libraries listed below
Sorting:
- ☆72Jul 28, 2025Updated 7 months ago
- Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models☆21May 29, 2025Updated 9 months ago
- Repository for the NeurIPS 2024 paper "SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up…☆26Dec 9, 2024Updated last year
- Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs☆32Sep 21, 2025Updated 5 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆40Nov 27, 2024Updated last year
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆91Nov 15, 2024Updated last year
- ☆40Jul 19, 2024Updated last year
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆12Dec 5, 2025Updated 2 months ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 5 months ago
- ☆11Aug 20, 2025Updated 6 months ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…☆47Aug 21, 2024Updated last year
- [NAACL 2025] Guiding Large Language Models in Code Execution with Fine-grained Multimodal Chain-of-Thought Reasoning☆12Feb 9, 2025Updated last year
- Code for paper "Rethinking Text-based Protein Understanding: Retrieval or LLM?"☆18Oct 7, 2025Updated 4 months ago
- ☆16Oct 13, 2025Updated 4 months ago
- Repository for AAAI 2024 paper "Manifold-based Verbalizer Space Re-embedding for Tuning-free Prompt-based Classification"☆10Feb 6, 2024Updated 2 years ago
- EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO☆20Jan 24, 2026Updated last month
- ☆18Mar 2, 2025Updated last year
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆48Feb 27, 2025Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago
- CVPR 2024 Official Repository☆12Mar 27, 2024Updated last year
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Nov 25, 2025Updated 3 months ago
- Tis is code for Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model (ACM MM 2024))☆12Aug 27, 2024Updated last year
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- ☆12Dec 20, 2024Updated last year
- PyTorch code for the Neurips 2021 paper: Fairness via Representation Neutralization☆10Oct 26, 2021Updated 4 years ago
- Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding"☆11Apr 11, 2025Updated 10 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆204Jul 17, 2025Updated 7 months ago
- ☆12Jul 16, 2025Updated 7 months ago
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆14Feb 5, 2024Updated 2 years ago
- Enhanced GPUstat-web☆10Oct 2, 2020Updated 5 years ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- ☆21Feb 13, 2026Updated 2 weeks ago
- Official code for 'One-Shot Object Localization in Medical Images based on Relative Position Regression'.☆12Sep 10, 2022Updated 3 years ago
- ☆13Apr 23, 2025Updated 10 months ago
- [NeurIPS 2023]Federated Learning with Bilateral Curation for Partially Class-Disjoint Data☆14Aug 1, 2025Updated 7 months ago
- An Official Implementation for the Paper 'Point Beyond Class: A Benchmark for Weakly Semi-Supervised Abnormality Localization in Chest X-…☆18Oct 20, 2022Updated 3 years ago
- [ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆22Oct 28, 2025Updated 4 months ago
- [Neurocomputing] Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation☆22Dec 21, 2025Updated 2 months ago