InfiMM/Awesome-Multimodal-LLM-for-Math-STEM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/InfiMM/Awesome-Multimodal-LLM-for-Math-STEM)

InfiMM / Awesome-Multimodal-LLM-for-Math-STEM

Paper collections of multi-modal LLM for Math/STEM/Code.

☆144

Alternatives and similar repositories for Awesome-Multimodal-LLM-for-Math-STEM

Users that are interested in Awesome-Multimodal-LLM-for-Math-STEM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chengruogu0915 / GeoUni
View on GitHub
Repository for GeoUni, A Unified Model for Generating Geometry Diagrams, Problems and Problem Solutions.
☆23Jun 12, 2025Updated last year
pengshuai-rin / MultiMath
View on GitHub
MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
☆33Jan 22, 2025Updated last year
core-mm / core-mm
View on GitHub
☆17Feb 22, 2024Updated 2 years ago
InternScience / GeoX
View on GitHub
[ICLR'25] Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training
☆49Jan 25, 2025Updated last year
ZrrSkywalker / MAVIS
View on GitHub
[ICLR 2025] Mathematical Visual Instruction Tuning for Multi-modal Large Language Models
☆156Dec 5, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zezeze97 / DFE-GPS
View on GitHub
☆14Jul 15, 2025Updated last year
FanqingM / MM-Eureka-V0
View on GitHub
MM-Eureka V0 also called R1-Multimodal-Journey, Latest version is in MM-Eureka
☆325Jun 21, 2025Updated last year
HZQ950419 / Math-LLaVA
View on GitHub
Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
☆91Jun 28, 2024Updated 2 years ago
ycpNotFound / GeoGen
View on GitHub
A pipeline for the automatic construction of geometry problems along with step-by-step solutions.
☆17Aug 27, 2025Updated 10 months ago
ChengpengLi1003 / DotaMath
View on GitHub
☆30Dec 27, 2024Updated last year
allenai / DrawEduMath
View on GitHub
Can VLMs understand students' hand-drawn math work?
☆19Jan 20, 2026Updated 6 months ago
wwzhuang01 / Math-PUMA
View on GitHub
[AAAI 2025]Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning
☆45Apr 14, 2025Updated last year
majianz / dl4gps
View on GitHub
[ACL 2026 Main Conference] Paper list for the survey "A Survey of Deep Learning for Geometry Problem Solving"
☆36Sep 14, 2025Updated 10 months ago
Ucas-HaoranWei / Slow-Perception
View on GitHub
Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step
☆161Jul 28, 2025Updated 11 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
TideDra / lmm-r1
View on GitHub
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
☆847May 14, 2025Updated last year
mingliangzhang2018 / PGPS
View on GitHub
The implement of geometric solver PGPSNet
☆30Jul 8, 2026Updated 2 weeks ago
tongyx361 / Awesome-LLM4Math
View on GitHub
Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…
☆159Jul 12, 2024Updated 2 years ago
RUCAIBox / Virgo
View on GitHub
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆110May 27, 2025Updated last year
yunfeixie233 / ViGaL
View on GitHub
☆70Feb 4, 2026Updated 5 months ago
yyDing1 / ScaleQuest
View on GitHub
[ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…
☆69Oct 27, 2024Updated last year
VisuLogic-Benchmark / VisuLogic-Train
View on GitHub
☆21Jul 9, 2025Updated last year
neulab / VisualPuzzles
View on GitHub
☆18Nov 30, 2025Updated 7 months ago
njucckevin / MM-Self-Improve
View on GitHub
A Self-Training Framework for Vision-Language Reasoning
☆90Jan 23, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
KbsdJames / omni-math-rule
View on GitHub
The rule-based evaluation subset and code implementation of Omni-MATH
☆28Dec 23, 2024Updated last year
MDI-Benchmark / MDI-Benchmark
View on GitHub
☆14Dec 18, 2024Updated last year
Yushi-Hu / VisualSketchpad
View on GitHub
Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
☆287Aug 5, 2025Updated 11 months ago
Richar-Du / Virgo
View on GitHub
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆20May 27, 2025Updated last year
psunlpgroup / VisOnlyQA
View on GitHub
This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…
☆29Jul 9, 2025Updated last year
yejinc00 / PREMIR
View on GitHub
[EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"
☆15Aug 26, 2025Updated 10 months ago
ECNU-ICALK / SocraticMath
View on GitHub
[CIKM 2024] Boosting Large Language Models with Socratic Method for Conversational Mathematics Teaching
☆15Apr 2, 2026Updated 3 months ago
mathllm / MATH-V
View on GitHub
[NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.
☆139May 16, 2025Updated last year
intervention-training / int
View on GitHub
☆16Feb 4, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ZubinGou / math-evaluation-harness
View on GitHub
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
☆277Apr 26, 2024Updated 2 years ago
lupantech / InterGPS
View on GitHub
Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"
☆178Mar 29, 2025Updated last year
zzli2022 / Awesome-System2-Reasoning-LLM
View on GitHub
Latest Advances on System-2 Reasoning
☆1,352Jun 8, 2025Updated last year
ShadeCloak / ADORA
View on GitHub
☆47Apr 9, 2025Updated last year
SCNU203 / GeoQA-Plus
View on GitHub
☆20May 14, 2024Updated 2 years ago
YuyaoZhangQAQ / QCompiler
View on GitHub
This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.
☆17Oct 20, 2025Updated 9 months ago
RifleZhang / LLaVA-Reasoner-DPO
View on GitHub
☆116Jan 8, 2025Updated last year