[EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information
☆12Oct 11, 2024Updated last year
Alternatives and similar repositories for SURf
Users that are interested in SURf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models☆21May 29, 2025Updated 9 months ago
- ☆73Jul 28, 2025Updated 7 months ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- ☆15Jul 8, 2024Updated last year
- Simple implementation of Retrieval-Augmented Generation System☆29Oct 24, 2024Updated last year
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆91Nov 15, 2024Updated last year
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆15Oct 2, 2025Updated 5 months ago
- 貔貅(PiXiu): 基于中文金融知识图谱的指令微调模型☆27Aug 8, 2023Updated 2 years ago
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆15Nov 4, 2024Updated last year
- ☆12Dec 20, 2024Updated last year
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆24Jan 6, 2026Updated 2 months ago
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'☆13Aug 2, 2024Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆24Sep 21, 2025Updated 6 months ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Sep 17, 2021Updated 4 years ago
- ☆15Jul 16, 2021Updated 4 years ago
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆13Jun 7, 2025Updated 9 months ago
- This is the package used to calculate the similarity index of the label graph pairs.☆13Nov 4, 2020Updated 5 years ago
- ☆14Sep 10, 2021Updated 4 years ago
- ☆19Jun 10, 2025Updated 9 months ago
- A collection of ready to use ollama models☆18Jan 22, 2024Updated 2 years ago
- Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs☆34Sep 21, 2025Updated 6 months ago
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Nov 25, 2025Updated 3 months ago
- Code for the ACL 2023 paper Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Sc…☆12May 19, 2023Updated 2 years ago
- ☆11Aug 20, 2025Updated 7 months ago
- Geometrical Face Features Extraction☆16Mar 30, 2013Updated 12 years ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆17Mar 2, 2026Updated 3 weeks ago
- Repository for AAAI 2024 paper "Manifold-based Verbalizer Space Re-embedding for Tuning-free Prompt-based Classification"☆10Feb 6, 2024Updated 2 years ago
- Repository for the NeurIPS 2024 paper "SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up…☆26Dec 9, 2024Updated last year
- [ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆22Oct 28, 2025Updated 4 months ago
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆40Nov 27, 2024Updated last year
- A collection of research papers related to Natural Language Reasoning☆11May 27, 2022Updated 3 years ago
- An autohotkey's script that makes your capslock more powerful☆13Aug 3, 2018Updated 7 years ago
- [SynthText Chinese] Improved code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural I…☆13Dec 8, 2022Updated 3 years ago
- This is the repository for our paper: Untying the Reversal Curse via Bidirectional Language Model Editing☆11May 25, 2025Updated 9 months ago
- Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation☆64Sep 28, 2025Updated 5 months ago
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆12Dec 5, 2025Updated 3 months ago
- A Survey of Multimodal Retrieval-Augmented Generation☆20Nov 3, 2025Updated 4 months ago
- MC-CoT implementation code☆22Jun 24, 2025Updated 9 months ago