The-Martyr / CausalMM
[ICLR'25] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
☆9 Updated last week
Alternatives and similar repositories for CausalMM:
Users who are interested in CausalMM are comparing it to the repositories listed below
- ☆59 Updated 7 months ago
- Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal … ☆38 Updated 2 months ago
- A Self-Training Framework for Vision-Language Reasoning ☆61 Updated last week
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024) ☆41 Updated 3 months ago
- [Preprint] A Neural-Symbolic Self-Training Framework ☆102 Updated 6 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379) ☆33 Updated 9 months ago
- ☆24 Updated last year
- my commonly-used tools ☆48 Updated 3 weeks ago
- ☆38 Updated 7 months ago
- ☆40 Updated 2 months ago
- ☆16 Updated last year
- up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources ☆80 Updated this week
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models. ☆59 Updated 2 months ago
- ☆85 Updated 4 months ago
- A Survey on the Honesty of Large Language Models ☆51 Updated last month
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs) ☆77 Updated 2 months ago
- The official code repository for PRMBench. ☆60 Updated last week
- ☆12 Updated last month
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi… ☆97 Updated 3 months ago
- Official code repository for Interleaved Scene Graph. ☆15 Updated last month
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning ☆161 Updated 11 months ago
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability" ☆32 Updated 6 months ago
- This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin… ☆60 Updated 6 months ago
- DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception. (ICLR2025) ☆18 Updated last month
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge. ☆62 Updated 2 months ago
- [ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs ☆95 Updated 2 months ago
- M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆51 Updated last month
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating ☆88 Updated last year
- ☆14 Updated last year
- PyTorch Implementation of "Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Larg… ☆19 Updated last month