shiqichen17 / AdaptVis
☆12Updated this week
Alternatives and similar repositories for AdaptVis:
Users that are interested in AdaptVis are comparing it to the libraries listed below
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆44Updated last year
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆58Updated 4 months ago
- ☆44Updated 5 months ago
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆32Updated 6 months ago
- ☆13Updated 9 months ago
- M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆56Updated 3 months ago
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆27Updated this week
- PyTorch implementation of StableMask (ICML'24)☆12Updated 9 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆46Updated 5 months ago
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆29Updated 4 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"☆61Updated this week
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…☆44Updated 8 months ago
- ☆59Updated 7 months ago
- code for arxiv paper: Understanding Multimodal LLMs: the Mechanistic Interpretability of Llava in Visual Question Answering☆16Updated 3 months ago
- The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark"☆50Updated last week
- ☆24Updated 5 months ago
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆13Updated 9 months ago
- Visual and Embodied Concepts evaluation benchmark☆21Updated last year
- A Survey on the Honesty of Large Language Models☆57Updated 4 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆72Updated 5 months ago
- ☆18Updated 5 months ago
- ☆10Updated 10 months ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆28Updated 9 months ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆32Updated 9 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH☆19Updated 4 months ago
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding☆49Updated 3 weeks ago
- A Self-Training Framework for Vision-Language Reasoning☆76Updated 3 months ago
- A Comprehensive Benchmark for Robust Multi-image Understanding☆10Updated 7 months ago
- ☆11Updated last year
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆60Updated 4 months ago