Ziwei-Zheng / VaLSeView external linksLinks
A library of visualization tools for the interpretability and hallucination analysis of large vision-language models (LVLMs).
☆41May 22, 2025Updated 8 months ago
Alternatives and similar repositories for VaLSe
Users that are interested in VaLSe are comparing it to the libraries listed below
Sorting:
- ☆13Feb 24, 2025Updated 11 months ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆33Jul 14, 2025Updated 7 months ago
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"☆108Dec 4, 2024Updated last year
- This is the repositoary for our paper published at ICML24.☆11Jun 11, 2025Updated 8 months ago
- ☆23Mar 18, 2025Updated 10 months ago
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆23Oct 22, 2025Updated 3 months ago
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs☆22May 7, 2025Updated 9 months ago
- Code release for VTW (AAAI 2025 Oral)☆64Nov 4, 2025Updated 3 months ago
- The official repo for "Where do Large Vision-Language Models Look at when Answering Questions?"☆56Jan 7, 2026Updated last month
- AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling☆39Dec 26, 2025Updated last month
- ☆71Jul 28, 2025Updated 6 months ago
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆25Apr 18, 2024Updated last year
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆57Oct 28, 2024Updated last year
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention☆66Aug 30, 2025Updated 5 months ago
- [ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models☆49Jul 7, 2025Updated 7 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- Official implementation of "Generative Human Motion Stylization in Latent Space", ICLR'24☆36Sep 4, 2025Updated 5 months ago
- ECSO (Make MLLM safe without neither training nor any external models!) (https://arxiv.org/abs/2403.09572)☆36Nov 2, 2024Updated last year
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?☆42Nov 1, 2024Updated last year
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆38Sep 10, 2024Updated last year
- [CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att…☆62Oct 9, 2025Updated 4 months ago
- Notes about courses Machine Learning 2025 Spring by Hung-yi Lee☆23Sep 22, 2025Updated 4 months ago
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆75May 31, 2025Updated 8 months ago
- (ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆58Jan 26, 2026Updated 3 weeks ago
- A large-scale training and benchmarking framework for rPPG.☆10Nov 26, 2024Updated last year
- Vision Transformer (ViT) models, with their attention mechanisms, revolutionized computer vision. By merging Class Activation Map (CAM) a…☆13Aug 14, 2023Updated 2 years ago
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆21Jan 21, 2026Updated 3 weeks ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification☆49Mar 24, 2025Updated 10 months ago
- 🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…☆52Jan 22, 2026Updated 3 weeks ago
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆396Aug 24, 2024Updated last year
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating☆97Jan 29, 2024Updated 2 years ago
- Official implementation of paper "Vision Graph Prompting via Semantic Low-Rank Decomposition", ICML 2025☆16Dec 25, 2025Updated last month
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Jan 25, 2025Updated last year
- ☆11Jun 3, 2024Updated last year
- [🔥ACM MM2025] EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation☆23Dec 30, 2025Updated last month
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated last year
- ☆14Mar 15, 2025Updated 11 months ago
- ☆12Mar 5, 2024Updated last year
- Deep learning approaches in detecting 14 different abnormalities via Chest X-Ray images☆11Jan 16, 2022Updated 4 years ago