ZuyiZhou / Awesome-Interpretable-Cross-modal-ReasoningLinks
A Survey on Interpretable Cross-modal Reasoning
☆14Updated last year
Alternatives and similar repositories for Awesome-Interpretable-Cross-modal-Reasoning
Users that are interested in Awesome-Interpretable-Cross-modal-Reasoning are comparing it to the libraries listed below
Sorting:
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆16Updated 9 months ago
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models☆146Updated last year
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆76Updated 8 months ago
- ☆75Updated last year
- Official repository for the A-OKVQA dataset☆93Updated last year
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆20Updated 3 weeks ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆52Updated 8 months ago
- Authors's code for "Variational Causal Inference Network for Explanatory Visual Question Answering" and "Integrating Neural-Symbolic Reas…☆13Updated 2 weeks ago
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆211Updated last year
- ☆57Updated 8 months ago
- ☆44Updated last month
- A Self-Training Framework for Vision-Language Reasoning☆80Updated 5 months ago
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…☆289Updated 8 months ago
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆86Updated last year
- 😎 curated list of awesome LMM hallucinations papers, methods & resources.☆149Updated last year
- Visualizing the attention of vision-language models☆206Updated 4 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆63Updated 7 months ago
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆85Updated last year
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆33Updated 3 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆74Updated 5 months ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆19Updated 4 months ago
- ☆24Updated 5 months ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating☆95Updated last year
- the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimo…☆19Updated 3 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆76Updated last year
- ☆9Updated 2 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆68Updated last year
- AutoHallusion Codebase (EMNLP 2024)☆19Updated 7 months ago
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)☆45Updated last year
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"☆10Updated 10 months ago