This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and continuously update our survey, we maintain this repository of relevant references.
β95Jul 26, 2024Updated last year
Alternatives and similar repositories for LVLM-Hallucinations-Survey
Users that are interested in LVLM-Hallucinations-Survey are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).β1,013Sep 27, 2025Updated 7 months ago
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''β258Aug 21, 2025Updated 8 months ago
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluationβ165Jan 15, 2024Updated 2 years ago
- β15Oct 15, 2023Updated 2 years ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)β57Oct 28, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An automatic MLLM hallucination detection frameworkβ19Sep 26, 2023Updated 2 years ago
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Modelsβ156Apr 30, 2024Updated 2 years ago
- HallE-Control: Controlling Object Hallucination in LMMsβ32Apr 10, 2024Updated 2 years ago
- β32Apr 18, 2025Updated last year
- Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025)β73May 2, 2025Updated 11 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigationβ142Sep 11, 2025Updated 7 months ago
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decodingβ399Oct 7, 2024Updated last year
- Code for paper: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Modelsβ60Dec 18, 2024Updated last year
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Languageβ¦β14Dec 16, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20β29May 26, 2022Updated 3 years ago
- β20Oct 21, 2022Updated 3 years ago
- Multi-level Attention Network for Retinal Vessel Segmentationβ10May 10, 2021Updated 4 years ago
- [NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"β22Dec 8, 2024Updated last year
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Modelsβ20Jul 17, 2024Updated last year
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Visβ¦β25Jul 21, 2024Updated last year
- [AAAI 26 Demo] Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal Pβ¦β65Jan 27, 2026Updated 3 months ago
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21β70Apr 5, 2026Updated 3 weeks ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Modelsβ88Oct 26, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- LLM hallucination paper listβ334Mar 11, 2024Updated 2 years ago
- [ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visualβ¦β84Feb 22, 2025Updated last year
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attentionβ67Jul 16, 2024Updated last year
- π curated list of awesome LMM hallucinations papers, methods & resources.β149Mar 23, 2024Updated 2 years ago
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?β43Nov 1, 2024Updated last year
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"β50Jun 3, 2025Updated 10 months ago
- up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resourcesβ301Feb 8, 2026Updated 2 months ago
- [EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answeringβ18Oct 31, 2024Updated last year
- Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimizationβ103Jan 30, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"β102Nov 30, 2025Updated 5 months ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigatingβ99Jan 29, 2024Updated 2 years ago
- The implementation of paper "EliMRec: Eliminating single-modal bias in multimedia recommendation", MM'22.β22Dec 7, 2023Updated 2 years ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Promptβ¦β45Dec 20, 2024Updated last year
- β13Feb 1, 2022Updated 4 years ago
- MUltiple SUV Thresholding (MUST)-segmenter is a semi-automated PET image segmentation tool that enables delineation of multiple lesions aβ¦β12Mar 18, 2026Updated last month
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)β87Jul 1, 2024Updated last year