This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and continuously update our survey, we maintain this repository of relevant references.
β96Jul 26, 2024Updated last year
Alternatives and similar repositories for LVLM-Hallucinations-Survey
Users that are interested in LVLM-Hallucinations-Survey are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).β1,022Sep 27, 2025Updated 8 months ago
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''β262Aug 21, 2025Updated 9 months ago
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluationβ169Jan 15, 2024Updated 2 years ago
- β16Oct 15, 2023Updated 2 years ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)β57Oct 28, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- An automatic MLLM hallucination detection frameworkβ19Sep 26, 2023Updated 2 years ago
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Modelsβ156Apr 30, 2024Updated 2 years ago
- HallE-Control: Controlling Object Hallucination in LMMsβ32Apr 10, 2024Updated 2 years ago
- β33Apr 18, 2025Updated last year
- β55Apr 1, 2024Updated 2 years ago
- Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025)β75May 2, 2025Updated last year
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigationβ145Sep 11, 2025Updated 9 months ago
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decodingβ406Oct 7, 2024Updated last year
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Languageβ¦β14Dec 16, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20β29May 26, 2022Updated 4 years ago
- β20Oct 21, 2022Updated 3 years ago
- Multi-level Attention Network for Retinal Vessel Segmentationβ10May 10, 2021Updated 5 years ago
- [NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"β22Dec 8, 2024Updated last year
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Modelsβ20Jul 17, 2024Updated last year
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21β70Apr 5, 2026Updated 2 months ago
- [AAAI 26 Demo] Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal Pβ¦β66Jan 27, 2026Updated 4 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Modelsβ87Oct 26, 2025Updated 7 months ago
- LLM hallucination paper listβ335Mar 11, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visualβ¦β83Feb 22, 2025Updated last year
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attentionβ68Jul 16, 2024Updated last year
- π curated list of awesome LMM hallucinations papers, methods & resources.β150Mar 23, 2024Updated 2 years ago
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?β43Nov 1, 2024Updated last year
- up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resourcesβ322Feb 8, 2026Updated 4 months ago
- Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimizationβ104Jan 30, 2024Updated 2 years ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigatingβ99Jan 29, 2024Updated 2 years ago
- The implementation of paper "EliMRec: Eliminating single-modal bias in multimedia recommendation", MM'22.β22Dec 7, 2023Updated 2 years ago
- β13Feb 1, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β13Jun 11, 2024Updated last year
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)β89Jul 1, 2024Updated last year
- β102Dec 22, 2023Updated 2 years ago
- [NeurIPS2023] LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answeringβ12Jan 5, 2024Updated 2 years ago
- Chinese Vision-Language Understanding Evaluationβ23Dec 26, 2024Updated last year
- the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimoβ¦β21Apr 9, 2025Updated last year
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuningβ93Apr 30, 2024Updated 2 years ago