This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and continuously update our survey, we maintain this repository of relevant references.
β94Jul 26, 2024Updated last year
Alternatives and similar repositories for LVLM-Hallucinations-Survey
Users that are interested in LVLM-Hallucinations-Survey are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).β1,003Sep 27, 2025Updated 6 months ago
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''β255Aug 21, 2025Updated 7 months ago
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluationβ162Jan 15, 2024Updated 2 years ago
- β15Oct 15, 2023Updated 2 years ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)β57Oct 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An automatic MLLM hallucination detection frameworkβ19Sep 26, 2023Updated 2 years ago
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Modelsβ155Apr 30, 2024Updated last year
- HallE-Control: Controlling Object Hallucination in LMMsβ32Apr 10, 2024Updated 2 years ago
- β28Apr 18, 2025Updated 11 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigationβ138Sep 11, 2025Updated 7 months ago
- β55Apr 1, 2024Updated 2 years ago
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decodingβ393Oct 7, 2024Updated last year
- Code for paper: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Modelsβ57Dec 18, 2024Updated last year
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Languageβ¦β14Dec 16, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20β29May 26, 2022Updated 3 years ago
- [NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"β22Dec 8, 2024Updated last year
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Modelsβ20Jul 17, 2024Updated last year
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Visβ¦β25Jul 21, 2024Updated last year
- [AAAI 26 Demo] Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal Pβ¦β64Jan 27, 2026Updated 2 months ago
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21β70Updated this week
- A Novel Approach for Effective Multi-View Clustering with Information-Theoretic Perspective is a paper accepted by NeurIPS 2023β10May 15, 2024Updated last year
- [ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visualβ¦β84Feb 22, 2025Updated last year
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attentionβ65Jul 16, 2024Updated last year
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- π curated list of awesome LMM hallucinations papers, methods & resources.β149Mar 23, 2024Updated 2 years ago
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?β43Nov 1, 2024Updated last year
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"β48Jun 3, 2025Updated 10 months ago
- up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resourcesβ290Feb 8, 2026Updated 2 months ago
- [EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answeringβ17Oct 31, 2024Updated last year
- Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimizationβ101Jan 30, 2024Updated 2 years ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigatingβ98Jan 29, 2024Updated 2 years ago
- The implementation of paper "EliMRec: Eliminating single-modal bias in multimedia recommendation", MM'22.β22Dec 7, 2023Updated 2 years ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Promptβ¦β45Dec 20, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- β13Feb 1, 2022Updated 4 years ago
- MUltiple SUV Thresholding (MUST)-segmenter is a semi-automated PET image segmentation tool that enables delineation of multiple lesions aβ¦β12Mar 18, 2026Updated 3 weeks ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)β86Jul 1, 2024Updated last year
- β102Dec 22, 2023Updated 2 years ago
- [NeurIPS2023] LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answeringβ13Jan 5, 2024Updated 2 years ago
- Chinese Vision-Language Understanding Evaluationβ23Dec 26, 2024Updated last year
- the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimoβ¦β21Apr 9, 2025Updated last year