KejiaZhang-Robust / VAPLinks
Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
☆28Updated last month
Alternatives and similar repositories for VAP
Users that are interested in VAP are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models☆21Updated 5 months ago
- The official implement of "Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models"☆15Updated 5 months ago
- [CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att…☆38Updated 6 months ago
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs☆20Updated 4 months ago
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models☆48Updated 7 months ago
- ☆49Updated 9 months ago
- This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehens…☆73Updated 4 months ago
- Survey: https://arxiv.org/pdf/2507.20198☆139Updated last week
- [CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models☆74Updated 2 weeks ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification☆34Updated 5 months ago
- HoliTom: Holistic Token Merging for Fast Video Large Language Models☆43Updated last month
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models☆102Updated 11 months ago
- [CVPR 2025] PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models☆40Updated 2 months ago
- Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.☆53Updated 2 months ago
- Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆63Updated last month
- (ICCV 2025)This repository is the official implementation of AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detect…☆118Updated last month
- [ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models☆36Updated 2 months ago
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…☆121Updated last month
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆262Updated 5 months ago
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention☆45Updated last year
- Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".☆27Updated 2 months ago
- A curated list of Awesome Personalized Large Multimodal Models resources☆36Updated last month
- [ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration R…☆106Updated 2 months ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆153Updated 6 months ago
- 🚀 Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models☆33Updated last week
- Official implementation of MC-LLaVA.☆139Updated 3 weeks ago
- [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…☆64Updated last year
- More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆53Updated 3 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆105Updated last week
- EventHallusion: Diagnosing Event Hallucinations in Video LLMs☆31Updated last month