KejiaZhang-Robust / VAP
Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
☆24 Updated 2 months ago
Alternatives and similar repositories for VAP
Users that are interested in VAP are comparing it to the libraries listed below
- [CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att… ☆26 Updated 4 months ago
- The official implementation of "Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models" ☆15 Updated 3 months ago
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models ☆96 Updated 9 months ago
- List of diffusion related active submissions on OpenReview for ICLR 2025. ☆32 Updated 8 months ago
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models ☆19 Updated 3 months ago
- [CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models ☆60 Updated 3 weeks ago
- ☆48 Updated 7 months ago
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention ☆37 Updated last year
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs ☆17 Updated 2 months ago
- (ICCV 2025) This repository is the official implementation of AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detect… ☆81 Updated this week
- HoliTom: Holistic Token Merging for Fast Video Large Language Models ☆35 Updated last month
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification ☆34 Updated 3 months ago
- ☆89 Updated 3 months ago
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation ☆42 Updated 5 months ago
- This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehens… ☆73 Updated 2 months ago
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language… ☆12 Updated 7 months ago
- [CVPR-25🔥] Test-time Counterattacks (TTC) towards adversarial robustness of CLIP ☆28 Updated last month
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models ☆34 Updated 5 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆89 Updated 7 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models. ☆260 Updated 3 weeks ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs. ☆67 Updated 6 months ago
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm ☆72 Updated 4 months ago
- Elucidated Dataset Condensation (NeurIPS 2024) ☆21 Updated 9 months ago
- Code for the paper "AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin". ☆25 Updated last week
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models ☆50 Updated last year
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025 ☆20 Updated 3 months ago
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original … ☆129 Updated 8 months ago
- Official implementation of MC-LLaVA. ☆32 Updated last month
- This is a collection of awesome papers I have read (carefully or roughly) in the fields of security in diffusion models. Any suggestions … ☆30 Updated 8 months ago
- ✌ CLoG: Benchmarking Continual Learning of Image Generation Models ☆20 Updated last year