[NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.
☆129May 16, 2025Updated 9 months ago
Alternatives and similar repositories for MATH-V
Users that are interested in MATH-V are comparing it to the libraries listed below
Sorting:
- ☆13May 9, 2023Updated 2 years ago
- [ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?☆176Apr 28, 2025Updated 10 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆92Jun 28, 2024Updated last year
- MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts☆355Sep 29, 2025Updated 5 months ago
- [SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di…☆62Nov 7, 2024Updated last year
- (ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale☆49Jun 4, 2025Updated 8 months ago
- rmp data ranking☆13Nov 4, 2025Updated 3 months ago
- Official github repo of G-LLaVA☆148Feb 20, 2025Updated last year
- Nano Banana Studio: AI-Powered Marketing Asset Creator with Real-Time Brand Enhancement☆39Sep 10, 2025Updated 5 months ago
- ☆14Mar 11, 2024Updated last year
- VHTest☆15Oct 31, 2024Updated last year
- Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"☆47Feb 19, 2026Updated last week
- Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"☆33Oct 12, 2024Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆63May 15, 2025Updated 9 months ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆426Dec 22, 2024Updated last year
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆89Feb 17, 2025Updated last year
- ☆85Jan 25, 2025Updated last year
- This is the Repository for Geometry Problem Solving Method Evaluation☆26Oct 8, 2024Updated last year
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Jun 14, 2024Updated last year
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo☆34Aug 12, 2024Updated last year
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency☆136Aug 5, 2025Updated 6 months ago
- Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks☆3,845Updated this week
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆83Jun 20, 2023Updated 2 years ago
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback☆305Sep 11, 2024Updated last year
- ☆12Jul 4, 2024Updated last year
- Paper collections of multi-modal LLM for Math/STEM/Code.☆136Nov 17, 2025Updated 3 months ago
- ☆47Nov 8, 2024Updated last year
- ☆23Jul 5, 2024Updated last year
- [MathCoder, MathCoder-VL] Family of LLMs/LMMs for mathematical reasoning.☆335Oct 18, 2025Updated 4 months ago
- Official repo for StableLLAVA☆95Dec 22, 2023Updated 2 years ago
- [ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration R…☆111Jul 9, 2025Updated 7 months ago
- Official repository of MMDU dataset☆104Sep 29, 2024Updated last year
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆95Jan 7, 2025Updated last year
- ☆11Feb 28, 2024Updated 2 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- [CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".☆20Jun 16, 2025Updated 8 months ago
- POM: Occupancy map estimation for people detection☆10Aug 5, 2014Updated 11 years ago
- Improving word mover’s distance by leveraging self-attention matrix (Published in EMNLP 2023 Findings)☆10Jun 17, 2025Updated 8 months ago
- ☆15Jul 22, 2024Updated last year