Sreyan88/VDGD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Sreyan88/VDGD)

Sreyan88 / VDGD

Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs

☆25

Alternatives and similar repositories for VDGD

Users that are interested in VDGD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lijm48 / IMCCD
View on GitHub
☆15Apr 27, 2025Updated last year
yejipark-m / ConVis
View on GitHub
[AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode…
☆25Sep 26, 2024Updated last year
ZhangqiJiang07 / middle_layers_indicating_hallucinations
View on GitHub
[CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att…
☆84Oct 9, 2025Updated 9 months ago
Lackel / AGLA
View on GitHub
[CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
☆68Jul 16, 2024Updated 2 years ago
jinghan1he / VHR
View on GitHub
[ACL 2025] Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence
☆21Jun 10, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
LALBJ / PAI
View on GitHub
[ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
☆171Nov 6, 2024Updated last year
Hongcheng-Gao / HAVEN
View on GitHub
Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".
☆25Oct 22, 2025Updated 9 months ago
pspdada / SENTINEL
View on GitHub
[ICCV 2025] Official repository of "Mitigating Object Hallucinations via Sentence-Level Early Intervention".
☆31Jul 2, 2026Updated 3 weeks ago
zifuwan / ONLY
View on GitHub
[ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models
☆51Jul 7, 2025Updated last year
mlrm-LEAD / mlrm-LEAD
View on GitHub
[CVPR 2026 Highlight] Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding
☆94Apr 9, 2026Updated 3 months ago
PostMindLab / ICD
View on GitHub
[ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding
☆18Nov 10, 2025Updated 8 months ago
MLRM-Halu / MLRM-Halu
View on GitHub
[NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models
☆82May 31, 2025Updated last year
GasolSun36 / MVP
View on GitHub
Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning
☆24Sep 9, 2024Updated last year
LijunZhang01 / Octopus
View on GitHub
☆33Apr 18, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
zhoujiahuan1991 / ICML2025-KA-Prompt
View on GitHub
☆19Jul 3, 2025Updated last year
ustc-hyin / ClearSight
View on GitHub
Code for paper: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models
☆61Dec 18, 2024Updated last year
xmed-lab / TAM
View on GitHub
[ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs
☆190Dec 14, 2025Updated 7 months ago
THU-BPM / ICT
View on GitHub
Official repo for ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models
☆28Mar 24, 2025Updated last year
whongzhong / MMHalSnowball
View on GitHub
Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20…
☆18Aug 12, 2024Updated last year
1zhou-Wang / MemVR
View on GitHub
[ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…
☆171Sep 25, 2025Updated 9 months ago
shengliu66 / VTI
View on GitHub
Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering
☆117Nov 23, 2024Updated last year
Ziwei-Zheng / VaLSe
View on GitHub
A library of visualization tools for the interpretability and hallucination analysis of large vision-language models (LVLMs).
☆42May 22, 2025Updated last year
seilk / VisAttnSink
View on GitHub
[ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models
☆116Feb 16, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TungChintao / SkiLa
View on GitHub
Official codes of "Sketch-in-Latents: Eliciting Unified Reasoning in MLLMs"
☆17Feb 15, 2026Updated 5 months ago
awesome-openreviewers / Awesome_Openreviewers-Authors
View on GitHub
Info on Openreview for top-tier CS conferences
☆20Nov 27, 2025Updated 7 months ago
zjunlp / Deco
View on GitHub
[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation
☆146Sep 11, 2025Updated 10 months ago
zhoujiahuan1991 / ICML2025-TCPA
View on GitHub
☆23May 8, 2025Updated last year
TianyunYoung / Hallucination-Attribution
View on GitHub
This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…
☆39Jul 14, 2025Updated last year
YuanLi95 / GATT-For-Aspect
View on GitHub
☆11Dec 14, 2022Updated 3 years ago
KejiaZhang-Robust / VAP
View on GitHub
[NeurIPS 2025] Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
☆38Sep 21, 2025Updated 10 months ago
HotanLee / DeFT
View on GitHub
The official implementation for paper: Vision-Language Models are Strong Noisy Label Detectors
☆19Mar 31, 2025Updated last year
DoubtedSteam / RoE
View on GitHub
The official implement of "Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models"
☆17Mar 24, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
GuangtaoLyu / awesome-hallucination-mllm
View on GitHub
MLLM hallucination, LVLM, LLM, Hallucination Mitigation, Training-free hallucination mitigation
☆30Jul 13, 2026Updated last week
hasanar1f / PAINT
View on GitHub
[CVPR 2025 Workshop] PAINT (Paying Attention to INformed Tokens) is a plug-and-play framework that intervenes in the self-attention of th…
☆20Updated this week
sangminwoo / RITUAL
View on GitHub
Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…
☆14Dec 16, 2024Updated last year
zeyangsha / De-Fake
View on GitHub
☆32Jun 12, 2025Updated last year
Cooperx521 / ScaleCap
View on GitHub
(ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’
☆60Jan 26, 2026Updated 5 months ago
mrwu-mac / ControlMLLM
View on GitHub
[NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'
☆210Jul 17, 2025Updated last year
om-ai-lab / ZoomEye
View on GitHub
[EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
☆91Nov 20, 2025Updated 8 months ago