[CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key
☆105Jan 9, 2026Updated last month
Alternatives and similar repositories for OPA-DPO
Users that are interested in OPA-DPO are comparing it to the libraries listed below
Sorting:
- [ICLR 2026] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs☆41May 20, 2025Updated 9 months ago
- [ICLR 2025] SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training☆39Apr 4, 2025Updated 11 months ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination☆19Jan 27, 2025Updated last year
- Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images☆18Jun 4, 2025Updated 9 months ago
- ☆12Mar 5, 2024Updated 2 years ago
- The official repos of "Knowledge Bridger: Towards Training-Free Missing Modality Completion"☆21Jun 30, 2025Updated 8 months ago
- code of IJCAI submission "Soft Hindsight Experience Replay"☆13Mar 23, 2020Updated 5 years ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆69May 31, 2024Updated last year
- [ICME 2025 Oral] Official implementation of "GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection"☆34Mar 23, 2025Updated 11 months ago
- Platform for training generalizable deep reinforcement learning agents☆13Oct 25, 2025Updated 4 months ago
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation☆157Jan 15, 2024Updated 2 years ago
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer☆16Nov 21, 2024Updated last year
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆28Dec 9, 2025Updated 2 months ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification☆49Mar 24, 2025Updated 11 months ago
- ☆18Jan 3, 2022Updated 4 years ago
- [ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"☆20Nov 25, 2024Updated last year
- ☆63Jan 30, 2026Updated last month
- ☆28Dec 29, 2023Updated 2 years ago
- Official PyTorch implementation of "Entity-Centric Reinforcement Learning for Object Manipulation from Pixels", Haramati et al., ICLR 202…☆30Feb 22, 2026Updated last week
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆91Apr 30, 2024Updated last year
- ☆81Feb 17, 2026Updated 2 weeks ago
- ☆27Apr 18, 2025Updated 10 months ago
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆29Nov 12, 2024Updated last year
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆33Jul 14, 2025Updated 7 months ago
- ☆25Aug 21, 2024Updated last year
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…☆24Jan 26, 2025Updated last year
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆379Oct 7, 2024Updated last year
- ☆36Jan 9, 2026Updated last month
- R1-like Video-LLM for Temporal Grounding☆133Jun 20, 2025Updated 8 months ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention☆66Aug 30, 2025Updated 6 months ago
- The dataset repo of "CLCIFAR: CIFAR-Derived Benchmark Datasets with Human Annotated Complementary Labels" paper☆16Aug 8, 2025Updated 6 months ago
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- Process Simulations Meet AI. Supercharge Your Process Engineering. Generate Infinite Data, Train Advanced Models, and Revolutionise Indus…☆11Oct 8, 2024Updated last year
- Code Implementation of "Unsupervised Recognition of Unknown Objects for Open-World Object Detection"☆31Oct 13, 2023Updated 2 years ago
- ☆26Sep 18, 2020Updated 5 years ago
- Code for paper: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models☆52Dec 18, 2024Updated last year
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆36Jan 24, 2026Updated last month
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…☆329Oct 14, 2025Updated 4 months ago