[EMNLP'25] A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.
☆50Aug 21, 2025Updated 7 months ago
Alternatives and similar repositories for Re-Align
Users that are interested in Re-Align are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [TMLR'25] AutoTrust, a groundbreaking benchmark designed to assess the trustworthiness of DriveVLMs. This work aims to enhance public saf…☆53Nov 20, 2025Updated 4 months ago
- Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing☆56Dec 17, 2024Updated last year
- [IROS'25] CoMamba: Real-time Cooperative Perception Unlocked with State Space Models☆28Sep 20, 2024Updated last year
- ☆18Apr 10, 2025Updated 11 months ago
- [IROS'25] COCMT☆12Aug 14, 2025Updated 7 months ago
- LLM Can Get "Brain Rot"☆161Jan 9, 2026Updated 2 months ago
- [CVPR2025] We present SleeperMark, a novel framework designed to embed resilient watermarks into T2I diffusion models☆38May 26, 2025Updated 9 months ago
- Official implementation of AirV2X: Unified Air-Ground\\Vehicle-to-Everything Collaboration☆55Nov 12, 2025Updated 4 months ago
- HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection☆13Jan 6, 2025Updated last year
- ☆55Mar 3, 2026Updated 3 weeks ago
- ☆17Nov 27, 2024Updated last year
- The Most Comprehensive Survey of Video Quality Assessment to Date.☆95Dec 24, 2024Updated last year
- 🏆 Official implementation of LangCoop: Collaborative Driving with Natural Language☆78Sep 12, 2025Updated 6 months ago
- [ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding☆15Nov 10, 2025Updated 4 months ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- [IEEE PCS 2022 best paper finalist] "FloLPIPS: A Bespoke Video Quality Metric for Frame Interpoation", Duolikun Danier, Fan Zhang, David …☆22Mar 9, 2024Updated 2 years ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- [ICLR 2026] DecAlign: Aligning Cross-Modal Semantics for Multimodal Foundation Models☆60Feb 5, 2026Updated last month
- ☆17Mar 11, 2026Updated 2 weeks ago
- This is the official implementation of UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving☆200Sep 6, 2025Updated 6 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆87Oct 26, 2025Updated 4 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆86Nov 10, 2024Updated last year
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆69May 31, 2024Updated last year
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models☆79Dec 4, 2024Updated last year
- PISCO: Precise Video Instance Insertion with Sparse Control☆53Feb 13, 2026Updated last month
- ☆10Oct 5, 2023Updated 2 years ago
- [MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501☆62Jul 26, 2024Updated last year
- Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20…☆15Aug 12, 2024Updated last year
- [SIGIR 2025] This is the code repo for our SIGIR'25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G…☆19Apr 22, 2025Updated 11 months ago
- Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod…☆20Sep 26, 2024Updated last year
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"☆101Nov 30, 2025Updated 3 months ago
- Simulator designed to generate diverse driving scenarios.☆44Feb 27, 2025Updated last year
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆138Sep 11, 2025Updated 6 months ago
- ☆19Mar 16, 2025Updated last year
- ☆35Nov 6, 2025Updated 4 months ago
- ☆55Dec 7, 2024Updated last year
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆26Oct 17, 2024Updated last year
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆53Oct 19, 2024Updated last year
- ☆21Jul 18, 2024Updated last year