zhangce01 / DeGF
[ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models
☆15 Updated 2 months ago
Alternatives and similar repositories for DeGF:
Users who are interested in DeGF are comparing it to the repositories listed below
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation ☆32 Updated 2 months ago
- Official implementation of MC-LLaVA. ☆22 Updated 2 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt… ☆38 Updated 3 months ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives" ☆37 Updated 4 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key ☆45 Updated last week
- ☆59 Updated 3 weeks ago
- Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models ☆20 Updated 2 months ago
- ☆19 Updated last week
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models ☆82 Updated 6 months ago
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation". ☆31 Updated 3 weeks ago
- Official Repository of Personalized Visual Instruct Tuning ☆28 Updated last month
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language… ☆10 Updated 3 months ago
- official repo for paper "[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs" ☆14 Updated 3 months ago
- [CVPRW] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models" ☆24 Updated last week
- Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025) ☆27 Updated last week
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training ☆14 Updated 2 weeks ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models ☆71 Updated 10 months ago
- ☆11 Updated 5 months ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning ☆20 Updated 7 months ago
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention ☆28 Updated 8 months ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph. ☆18 Updated 2 months ago
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models ☆18 Updated last month
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training" ☆34 Updated last year
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? ☆27 Updated 5 months ago
- Collection of awesome Continual Test-Time Adaptation methods ☆16 Updated 10 months ago
- [CVPR 2025] RAP: Retrieval-Augmented Personalization ☆40 Updated last week
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model ☆25 Updated 3 months ago
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM). ☆42 Updated last week
- ☆24 Updated 5 months ago
- This is the official repo of the paper "Latent Guard: a Safety Framework for Text-to-image Generation" ☆48 Updated 5 months ago