[EMNLP'25] A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.
☆51 · Aug 21, 2025 · Updated 10 months ago
Alternatives and similar repositories for Re-Align
Users that are interested in Re-Align are comparing it to the libraries listed below.
- Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing ☆59 · Dec 17, 2024 · Updated last year
- [TIP2024] MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers ☆80 · Dec 6, 2024 · Updated last year
- ☆19 · Apr 10, 2025 · Updated last year
- [IROS'25] COCMT ☆12 · Aug 14, 2025 · Updated 10 months ago
- [CVPR2025] We present SleeperMark, a novel framework designed to embed resilient watermarks into T2I diffusion models ☆39 · May 26, 2025 · Updated last year
- HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection ☆13 · Jan 6, 2025 · Updated last year
- Official implementation of AirV2X: Unified Air-Ground Vehicle-to-Everything Collaboration ☆66 · Nov 12, 2025 · Updated 7 months ago
- [ICLR'25] Official Implementation of STAMP: Scalable Task And Model-agnostic Collaborative Perception ☆63 · Feb 4, 2025 · Updated last year
- ☆17 · Nov 27, 2024 · Updated last year
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference ☆10 · Dec 15, 2024 · Updated last year
- ☆16 · Aug 30, 2025 · Updated 10 months ago
- [ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding ☆17 · Nov 10, 2025 · Updated 7 months ago
- [IEEE PCS 2022 best paper finalist] "FloLPIPS: A Bespoke Video Quality Metric for Frame Interpolation", Duolikun Danier, Fan Zhang, David … ☆22 · Mar 9, 2024 · Updated 2 years ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning ☆24 · Sep 9, 2024 · Updated last year
- ☆19 · Mar 11, 2026 · Updated 3 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models ☆87 · Oct 26, 2025 · Updated 8 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models. ☆86 · Nov 10, 2024 · Updated last year
- [MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501 ☆60 · Jul 26, 2024 · Updated last year
- [SIGIR '25] This is the code repo for our SIGIR '25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G… ☆19 · Apr 22, 2025 · Updated last year
- Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod… ☆20 · Sep 26, 2024 · Updated last year
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" ☆105 · Nov 30, 2025 · Updated 7 months ago
- Simulator designed to generate diverse driving scenarios. ☆44 · Feb 27, 2025 · Updated last year
- [NAACL'25] Contains code and documentation for our VANE-Bench paper. ☆24 · Aug 19, 2025 · Updated 10 months ago
- PISCO: Precise Video Instance Insertion with Sparse Control ☆61 · Feb 13, 2026 · Updated 4 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆146 · Sep 11, 2025 · Updated 9 months ago
- ☆56 · Dec 7, 2024 · Updated last year
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." ☆53 · Oct 19, 2024 · Updated last year
- [CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval… ☆32 · Apr 16, 2025 · Updated last year
- Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs ☆36 · Sep 21, 2025 · Updated 9 months ago
- ☆21 · Jul 18, 2024 · Updated last year
- [ICLR 2020] Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma, "I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifie… ☆20 · Dec 30, 2021 · Updated 4 years ago
- VEFX-Bench: A Holistic Benchmark for Generic Video Editing and Visual Effects ☆214 · May 16, 2026 · Updated last month
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification ☆50 · Mar 24, 2025 · Updated last year
- [ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language… ☆11 · Apr 4, 2026 · Updated 3 months ago
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention ☆68 · Jul 16, 2024 · Updated last year
- Sotopia-RL: Reward Design for Social Intelligence ☆52 · Apr 1, 2026 · Updated 3 months ago
- PyTorch Implementation of Latent Space Anchoring (TPAMI 2023) ☆12 · Jul 20, 2025 · Updated 11 months ago
- [ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs ☆172 · Nov 6, 2024 · Updated last year
- We introduce a new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their… ☆22 · Jan 11, 2026 · Updated 5 months ago