A library of visualization tools for the interpretability and hallucination analysis of large vision-language models (LVLMs).
☆41May 22, 2025Updated 9 months ago
Alternatives and similar repositories for VaLSe
Users that are interested in VaLSe are comparing it to the libraries listed below
Sorting:
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection☆52Mar 13, 2025Updated 11 months ago
- ☆14Feb 24, 2025Updated last year
- This is the repositoary for our paper published at ICML24.☆11Jun 11, 2025Updated 8 months ago
- ☆24Mar 18, 2025Updated 11 months ago
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆24Oct 22, 2025Updated 4 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆104Nov 23, 2024Updated last year
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs☆22May 7, 2025Updated 10 months ago
- ☆23Jun 13, 2024Updated last year
- Code release for VTW (AAAI 2025 Oral)☆64Nov 4, 2025Updated 4 months ago
- The official repo for "Where do Large Vision-Language Models Look at when Answering Questions?"☆59Jan 7, 2026Updated 2 months ago
- AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling☆39Dec 26, 2025Updated 2 months ago
- ☆27Apr 18, 2025Updated 10 months ago
- ☆72Jul 28, 2025Updated 7 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆57Oct 28, 2024Updated last year
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆25Feb 16, 2026Updated 3 weeks ago
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention☆66Aug 30, 2025Updated 6 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- [ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models☆49Jul 7, 2025Updated 8 months ago
- ☆30Jul 4, 2024Updated last year
- Official implementation of "Generative Human Motion Stylization in Latent Space", ICLR'24☆37Sep 4, 2025Updated 6 months ago
- ECSO (Make MLLM safe without neither training nor any external models!) (https://arxiv.org/abs/2403.09572)☆35Nov 2, 2024Updated last year
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?☆42Nov 1, 2024Updated last year
- Notes about courses Machine Learning 2025 Spring by Hung-yi Lee☆25Sep 22, 2025Updated 5 months ago
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆46Sep 8, 2025Updated 6 months ago
- (ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆59Jan 26, 2026Updated last month
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 5 months ago
- A large-scale training and benchmarking framework for rPPG.☆10Nov 26, 2024Updated last year
- Vision Transformer (ViT) models, with their attention mechanisms, revolutionized computer vision. By merging Class Activation Map (CAM) a…☆13Aug 14, 2023Updated 2 years ago
- (AAAI 2026) OSVBench, a new benchmark for evaluating Large Language Models (LLMs) in generating complete specification code pertaining to…☆13May 13, 2025Updated 9 months ago
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆25Jan 21, 2026Updated last month
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆397Aug 24, 2024Updated last year
- 🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…☆54Jan 22, 2026Updated last month
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating☆97Jan 29, 2024Updated 2 years ago
- ☆20Nov 21, 2025Updated 3 months ago
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"☆24Jun 8, 2025Updated 9 months ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- ☆53Jan 2, 2025Updated last year
- ☆11Jun 3, 2024Updated last year
- dMel: Speech Tokenization Made Simple☆16May 13, 2025Updated 9 months ago