[ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models
☆49Jul 7, 2025Updated 8 months ago
Alternatives and similar repositories for ONLY
Users who are interested in ONLY are comparing it to the repositories listed below
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models☆24Apr 14, 2025Updated 10 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆172Sep 25, 2025Updated 5 months ago
- [AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models☆38Jan 27, 2026Updated last month
- ☆72Jul 28, 2025Updated 7 months ago
- IPO: Interpretable Prompt Optimization for Vision-Language Models (NeurIPS 2024)☆15Mar 4, 2025Updated last year
- Code for Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models☆27Oct 29, 2024Updated last year
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention☆61Jul 16, 2024Updated last year
- ☆20Apr 15, 2025Updated 10 months ago
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model☆37Jan 8, 2025Updated last year
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection☆52Mar 13, 2025Updated 11 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆136Sep 11, 2025Updated 5 months ago
- Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents☆28Nov 24, 2025Updated 3 months ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification☆49Mar 24, 2025Updated 11 months ago
- 🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…☆54Jan 22, 2026Updated last month
- Research works from Tencent AI Lab regarding self-evolving agents☆83Jan 30, 2026Updated last month
- [CVPR 2024] Code for HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation☆76Oct 11, 2024Updated last year
- Official repo for EscapeCraft (a 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025.☆36Jul 7, 2025Updated 8 months ago
- Training recipe for SpatialReasoner☆38Sep 21, 2025Updated 5 months ago
- ☆23Aug 20, 2024Updated last year
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models☆94Feb 16, 2025Updated last year
- Source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"☆66Apr 11, 2025Updated 10 months ago
- [CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models☆26Feb 13, 2025Updated last year
- [NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding☆46Sep 21, 2025Updated 5 months ago
- ☆76Feb 5, 2026Updated last month
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆397Aug 24, 2024Updated last year
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆40May 26, 2025Updated 9 months ago
- [CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att…☆64Oct 9, 2025Updated 5 months ago
- Notes on the Machine Learning 2025 Spring course by Hung-yi Lee☆25Sep 22, 2025Updated 5 months ago
- [ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs☆163Nov 6, 2024Updated last year
- [NeurIPS 2025] Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models☆68Nov 27, 2025Updated 3 months ago
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆46Sep 8, 2025Updated 6 months ago
- ☆44Jun 19, 2025Updated 8 months ago
- Visual Instruction Tuning for Qwen2 Base Model☆41Jun 29, 2024Updated last year
- [AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…☆44Apr 18, 2025Updated 10 months ago
- (ICLR 2026) Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing'☆59Jan 26, 2026Updated last month
- MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks (ICML 2023)☆11Aug 15, 2023Updated 2 years ago
- ☆13Jul 3, 2024Updated last year
- Instituto de Telecomunicações Deep Learning-based Point Cloud Codec☆11Jun 18, 2024Updated last year
- Gesture Recognition Based on ALTERA DE2-115 FPGA☆10Mar 18, 2014Updated 11 years ago