ustc-hyin / ClearSight

Code for paper: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large Language Models

☆36

Alternatives and similar repositories for ClearSight

Users who are interested in ClearSight are comparing it to the repositories listed below.

saccharomycetes / mllms_know
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
☆293 Updated 7 months ago
1zhou-Wang / MemVR
[ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…
☆168 Updated last month
jungao1106 / ICoT
[CVPR'25] Interleaved-Modal Chain-of-Thought
☆92 Updated 2 weeks ago
mrwu-mac / ControlMLLM
[NeurIPS 2024] Repo for the paper 'ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'
☆197 Updated 4 months ago
shikiw / Awesome-MLLM-Hallucination
Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)
☆98 Updated last year
LALBJ / PAI
[ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
☆151 Updated last year
Go2Heart / EchoSight
[EMNLP 2024 Findings] The official PyTorch implementation of EchoSight: Advancing Visual-Language Models with Wiki Knowledge.
☆77 Updated 5 months ago
zjunlp / Deco
[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation
☆120 Updated 2 months ago
Sreyan88 / VDGD
Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs
☆22 Updated 6 months ago
lhanchao777 / LVLM-Hallucinations-Survey
This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin…
☆87 Updated last year
EIT-NLP / Layer_Select_Fuse_for_MLLM
[CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…
☆39 Updated 3 weeks ago
ZhangqiJiang07 / middle_layers_indicating_hallucinations
[CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att…
☆52 Updated last month
Ruiyang-061X / VL-Uncertainty
🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".
☆46 Updated 8 months ago
zhyang2226 / OPA-DPO
[CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key
☆86 Updated last month
diaoquesang / Code-in-Paper-Guide
🌟 A step-by-step guide to inserting code links in your paper
☆22 Updated 3 months ago
Wang-Xiaodong1899 / CVPR25-MLLM-Paper-List
🔥CVPR 2025 Multimodal Large Language Models Paper List
☆155 Updated 8 months ago
DAMO-NLP-SG / VCD
[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
☆339 Updated last year
minglllli / CLS-RL
[NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning
☆73 Updated 2 months ago
RupertLuo / VoCoT
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
☆76 Updated last year
seilk / VisAttnSink
[ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models
☆69 Updated 9 months ago
Ghy0501 / HiDe-LLaVA
[ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Languag…
☆39 Updated 2 months ago
xtong-zhang / Chain-of-Focus
☆51 Updated last week
Code-kunkun / LamRA
[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
☆172 Updated 4 months ago
whongzhong / MMHalSnowball
Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20…
☆12 Updated last year
The-Martyr / CausalMM
[ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
☆54 Updated 4 months ago
Hui-design / Open-LLaVA-Video-R1
[LLaVA-Video-R1]✨First Adaptation of R1 to LLaVA-Video (2025-03-18)
☆35 Updated 6 months ago
The-Martyr / Awesome-Multimodal-Reasoning
Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal Large Language Models
☆43 Updated 3 weeks ago
NishilBalar / Awesome-LVLM-Hallucination
Up-to-date curated list of state-of-the-art research, papers, and resources on hallucination in large vision-language models
☆212 Updated last month
Lackel / AGLA
[CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
☆50 Updated last year
chancharikmitra / CCoT
[CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"
☆141 Updated last year