THU-BPM/ICT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/THU-BPM/ICT)

THU-BPM / ICT

Official repo for ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models

☆28

Alternatives and similar repositories for ICT

Users that are interested in ICT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shengliu66 / VTI
View on GitHub
Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering
☆117Nov 23, 2024Updated last year
Lackel / AGLA
View on GitHub
[CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
☆68Jul 16, 2024Updated 2 years ago
Ziwei-Zheng / Nullu
View on GitHub
Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection
☆63Mar 13, 2025Updated last year
mengchuang123 / VASparse-github
View on GitHub
[CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification
☆50Mar 24, 2025Updated last year
ZhangqiJiang07 / middle_layers_indicating_hallucinations
View on GitHub
[CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att…
☆84Oct 9, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Sreyan88 / VDGD
View on GitHub
Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs
☆25May 7, 2025Updated last year
LDXDU / FedDiff
View on GitHub
This paper is currently under review by IEEE TCSVT, and the diffusion framework of the FedDiff algorithm part will be disclosed.
☆14Mar 8, 2024Updated 2 years ago
zifuwan / ONLY
View on GitHub
[ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models
☆51Jul 7, 2025Updated last year
DAMO-NLP-SG / VCD
View on GitHub
[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
☆411Oct 7, 2024Updated last year
OmriKaduri / vlm-interp
View on GitHub
Code for paper: "What’s in the Image? A Deep-Dive into the Vision of Vision Language Models" (CVPR 2025)
☆18May 1, 2025Updated last year
lijm48 / IMCCD
View on GitHub
☆15Apr 27, 2025Updated last year
YiCheng98 / IntegrativeDecoding
View on GitHub
Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"
☆33Apr 12, 2025Updated last year
Huenao / Debate-Augmented-RAG
View on GitHub
[ACL 2025] Removal of Hallucination on Hallucination: Debate-Augmented RAG
☆44Aug 4, 2025Updated 11 months ago
Linxi-ZHAO / MARINE
View on GitHub
☆19Jun 6, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
leorebensabath / TMRPlusPlus
View on GitHub
☆25Mar 18, 2025Updated last year
kigb / DropoutDecoding
View on GitHub
[NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"
☆22Dec 8, 2024Updated last year
bytedance / ParGo
View on GitHub
Official PyTorch Implementation of ParGo: Bridging Vision-Language with Partial and Global Views. (AAAI 2025)
☆16Jan 7, 2025Updated last year
yejipark-m / ConVis
View on GitHub
[AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode…
☆25Sep 26, 2024Updated last year
LzVv123456 / VISTA
View on GitHub
☆86Jul 28, 2025Updated 11 months ago
HotanLee / DeFT
View on GitHub
The official implementation for paper: Vision-Language Models are Strong Noisy Label Detectors
☆19Mar 31, 2025Updated last year
ByZ0e / Glance-Focus
View on GitHub
This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)
☆31Jun 28, 2024Updated 2 years ago
Hanhpt23 / OmniMod
View on GitHub
MCOUT: Multimodal Chain of Continuous Thought for Latent Reasoning
☆21Oct 4, 2025Updated 9 months ago
LijunZhang01 / Octopus
View on GitHub
☆33Apr 18, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
YUECHE77 / SPIN
View on GitHub
[EMNLP 2025 Main Conference] Mitigating Hallucinations in Vision-Language Models through Image-Guided Head Suppression
☆16Dec 26, 2025Updated 6 months ago
NishilBalar / Awesome-LVLM-Hallucination
View on GitHub
up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources
☆325Feb 8, 2026Updated 5 months ago
sangminwoo / RITUAL
View on GitHub
Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…
☆14Dec 16, 2024Updated last year
Lum1104 / EIBench
View on GitHub
(NeXD @ CVPR 2025) Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models
☆32Sep 30, 2025Updated 9 months ago
jiazhen-code / PhD
View on GitHub
[CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval…
☆32Apr 16, 2025Updated last year
sangminwoo / AvisC
View on GitHub
[ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…
☆25Jul 21, 2024Updated 2 years ago
TencentYoutuResearch / HighlightDetection-CLC
View on GitHub
Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"
☆18Mar 21, 2023Updated 3 years ago
ustc-hyin / ClearSight
View on GitHub
Code for paper: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models
☆60Dec 18, 2024Updated last year
fuyyyyy / SEPM
View on GitHub
[ICML'25 Spotlight] Catch Your Emotion: Sharpening Emotion Perception in Multimodal Large Language Models
☆57Jan 21, 2026Updated 6 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
hukcc / SHIELD
View on GitHub
[ICLR 2026🔥] SHIELD: Suppressing Hallucinations In LVLM Encoders via Bias and Vulnerability Defense
☆17Mar 24, 2026Updated 3 months ago
Murrol / GenMoStyle-code
View on GitHub
Official implementation of "Generative Human Motion Stylization in Latent Space", ICLR'24
☆39Sep 4, 2025Updated 10 months ago
tsunghan-wu / reverse_vlm
View on GitHub
🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…
☆58Jan 22, 2026Updated 6 months ago
THU-BPM / RAPL
View on GitHub
Code and data for EMNLP 2023 paper "RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction"
☆18Mar 6, 2024Updated 2 years ago
knightyxp / DGL
View on GitHub
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
☆49Oct 14, 2024Updated last year
TonyAlbertWan / Advanced-AI
View on GitHub
中国科学院大学研究生课程-高级人工智能
☆10Jan 8, 2022Updated 4 years ago
xiaoshutongly / clip-lora
View on GitHub
☆15Jun 6, 2023Updated 3 years ago