Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
☆32 Sep 21, 2025 Updated 5 months ago
Alternatives and similar repositories for VAP
Users who are interested in VAP are comparing it to the repositories listed below.
- ☆27 Apr 18, 2025 Updated 10 months ago
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models ☆24 Apr 14, 2025 Updated 10 months ago
- [NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models ☆71 Oct 10, 2025 Updated 4 months ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification ☆49 Mar 24, 2025 Updated 11 months ago
- ☆10 Apr 15, 2025 Updated 10 months ago
- 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly. ☆10 Dec 12, 2024 Updated last year
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language… ☆14 Dec 16, 2024 Updated last year
- [arXiv 2024] Is Oracle Pruning the True Oracle? ☆26 Jan 10, 2025 Updated last year
- Leetcode Practice in Python ☆12 Dec 12, 2024 Updated last year
- ☆12 Aug 20, 2025 Updated 6 months ago
- This repository collects and categorizes top vision-language papers based on their approaches and applications, with a special focus on t… ☆14 Apr 11, 2025 Updated 10 months ago
- IJCB 2023: Towards Generalizable Morph Attack Detection via Consistency Regularization ☆13 May 1, 2024 Updated last year
- SAM: Sharpness-Aware Minimization (PyTorch) ☆12 Feb 21, 2024 Updated 2 years ago
- Facial Attribute Recognition ☆11 Dec 6, 2024 Updated last year
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'. ☆15 Feb 12, 2024 Updated 2 years ago
- [TCSVT 2025] Niagara: Normal-Integrated Geometric Affine Field for Scene Reconstruction from a Single View ☆103 Dec 15, 2025 Updated 2 months ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis… ☆24 Jul 21, 2024 Updated last year
- [AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode… ☆26 Sep 26, 2024 Updated last year
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs ☆22 May 7, 2025 Updated 9 months ago
- Face Morphing Attack Detection Benchmark (IJCB 2022: Robust Ensemble Morph Detection with Domain Generalization) ☆20 Dec 18, 2024 Updated last year
- Exploring the fundamentals and advanced concepts of Large Language Models (LLMs) through practical implementations and collaborative lear… ☆23 Dec 24, 2024 Updated last year
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention ☆61 Jul 16, 2024 Updated last year
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning ☆24 Sep 9, 2024 Updated last year
- [CVPR 2025] Few-shot Recognition via Stage-Wise Retrieval-Augmented Finetuning ☆29 Jan 10, 2026 Updated last month
- ☆72 Jul 28, 2025 Updated 7 months ago
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding" ☆110 Dec 4, 2024 Updated last year
- OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models ☆56 Feb 1, 2026 Updated last month
- TensorFlow code for our ECCV'24 Workshop paper "LightAvatar: Efficient Head Avatar as Dynamic NeLF" ☆30 Nov 7, 2024 Updated last year
- [NeurIPS'22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective ☆37 Dec 15, 2022 Updated 3 years ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in… ☆172 Sep 25, 2025 Updated 5 months ago
- ☆18 Jun 10, 2025 Updated 8 months ago
- [CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models ☆101 Nov 22, 2025 Updated 3 months ago
- Code for paper: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models ☆52 Dec 18, 2024 Updated last year
- [ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs ☆163 Nov 6, 2024 Updated last year
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding ☆379 Oct 7, 2024 Updated last year
- Optimized MDNet for fast object tracking ☆10 Apr 10, 2019 Updated 6 years ago
- code for LSN ☆10 Oct 28, 2024 Updated last year
- [CVPR 2023] Discrete Point-wise Attack Is Not Enough: Generalized Manifold Adversarial Attack for Face Recognition ☆41 May 30, 2023 Updated 2 years ago
- On the Robustness of GUI Grounding Models Against Image Attacks ☆12 Apr 8, 2025 Updated 10 months ago