[EMNLP'25] A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.
☆50 Aug 21, 2025 Updated 6 months ago
Alternatives and similar repositories for Re-Align
Users who are interested in Re-Align are comparing it to the repositories listed below.
- [TIP2024] MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers ☆71 Dec 6, 2024 Updated last year
- ☆18 Apr 10, 2025 Updated 10 months ago
- [IROS'25] COCMT ☆12 Aug 14, 2025 Updated 6 months ago
- [CVPR2025] We present SleeperMark, a novel framework designed to embed resilient watermarks into T2I diffusion models ☆37 May 26, 2025 Updated 9 months ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference ☆10 Dec 15, 2024 Updated last year
- HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection ☆13 Jan 6, 2025 Updated last year
- [SIGIR 2025] This is the code repo for our SIGIR'25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G… ☆18 Apr 22, 2025 Updated 10 months ago
- OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model. ☆898 May 13, 2025 Updated 9 months ago
- A comprehensive and critical synthesis of the emerging role of GenAI across the full autonomous driving stack ☆229 Sep 20, 2025 Updated 5 months ago
- ☆10 Oct 5, 2023 Updated 2 years ago
- ☆15 Aug 30, 2025 Updated 6 months ago
- 🏆 Official implementation of LangCoop: Collaborative Driving with Natural Language ☆77 Sep 12, 2025 Updated 5 months ago
- ☆46 Updated this week
- [ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding ☆15 Nov 10, 2025 Updated 3 months ago
- PISCO: Precise Video Instance Insertion with Sparse Control ☆48 Feb 13, 2026 Updated 2 weeks ago
- Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20… ☆15 Aug 12, 2024 Updated last year
- [ICLR'25] Official Implementation of STAMP: Scalable Task And Model-agnostic Collaborative Perception ☆56 Feb 4, 2025 Updated last year
- ☆17 Nov 27, 2024 Updated last year
- ☆21 Jul 18, 2024 Updated last year
- We introduce a new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their… ☆20 Jan 11, 2026 Updated last month
- Solution of Kaggle competition: Feedback Prize - Evaluating Student Writing ☆16 Mar 30, 2022 Updated 3 years ago
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models ☆77 Dec 4, 2024 Updated last year
- Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod… ☆20 Sep 26, 2024 Updated last year
- Python implementation of the partially observable hidden Markov model ☆21 Oct 11, 2019 Updated 6 years ago
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents" ☆57 Oct 14, 2025 Updated 4 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models. ☆86 Nov 10, 2024 Updated last year
- The Most Comprehensive Survey of Video Quality Assessment to Date. ☆95 Dec 24, 2024 Updated last year
- [ICLR 2020] Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma, "I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifie… ☆20 Dec 30, 2021 Updated 4 years ago
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" ☆95 Nov 30, 2025 Updated 3 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." ☆52 Oct 19, 2024 Updated last year
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models ☆86 Oct 26, 2025 Updated 4 months ago
- ☆46 Dec 30, 2024 Updated last year
- [ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs ☆164 Nov 6, 2024 Updated last year
- ☆55 Dec 7, 2024 Updated last year
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention ☆61 Jul 16, 2024 Updated last year
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration ☆26 Oct 17, 2024 Updated last year
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning ☆24 Sep 9, 2024 Updated last year
- [CVPR 2025] Few-shot Recognition via Stage-Wise Retrieval-Augmented Finetuning ☆29 Jan 10, 2026 Updated last month
- AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling ☆39 Dec 26, 2025 Updated 2 months ago