shiwk24/MathCanvas

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shiwk24/MathCanvas)

shiwk24 / MathCanvas

This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning"

☆80

Alternatives and similar repositories for MathCanvas

Users that are interested in MathCanvas are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

multimodal-reasoning-lab / Bagel-Zebra-CoT
View on GitHub
https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT
☆137Jan 30, 2026Updated 5 months ago
ThinkMorph / ThinkMorph
View on GitHub
[ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
☆192May 1, 2026Updated 2 months ago
majianz / dl4gps
View on GitHub
[ACL 2026 Main Conference] Paper list for the survey "A Survey of Deep Learning for Geometry Problem Solving"
☆36Sep 14, 2025Updated 10 months ago
HKU-MMLab / Math-VR-CodePlot-CoT
View on GitHub
Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images
☆63Nov 4, 2025Updated 8 months ago
Candice-yu / GeoLaux
View on GitHub
A Benchmark for Evaluating MLLMs' Geometry Performance on Long-Step Problems Requiring Auxiliary Lines
☆38Apr 27, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
GAIR-NLP / thinking-with-generated-images
View on GitHub
Doodling our way to AGI ✏️ 🖼️ 🧠
☆128May 29, 2025Updated last year
We-Math / V-Thinker
View on GitHub
☆177Nov 26, 2025Updated 8 months ago
hwanyu112 / Latent-Sketchpad
View on GitHub
☆73Feb 1, 2026Updated 5 months ago
MathGenie / MathGenie
View on GitHub
☆14Mar 11, 2024Updated 2 years ago
ModalityDance / Omni-R1
View on GitHub
[ACL 2026 Findings] "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"
☆63May 26, 2026Updated 2 months ago
yejinc00 / PREMIR
View on GitHub
[EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"
☆15Aug 26, 2025Updated 11 months ago
FormalGeo / FormalGeo
View on GitHub
Formal representation and solving for Euclidean plane geometry problems.
☆43Jul 1, 2026Updated 3 weeks ago
jiyt17 / Prompt-A-Video
View on GitHub
[ICCV 2025] Prompt-A-Video
☆24Feb 2, 2025Updated last year
Ucas-HaoranWei / Slow-Perception
View on GitHub
Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step
☆163Jul 28, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
FYYDCC / IVT-LR
View on GitHub
Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”
☆18Jan 27, 2026Updated 6 months ago
InternScience / TrustGeoGen
View on GitHub
Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"
☆23Sep 1, 2025Updated 10 months ago
gogoczh / CoMT
View on GitHub
code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"
☆19Mar 10, 2025Updated last year
zhouyiks / CoLVA
View on GitHub
☆44Jul 9, 2025Updated last year
cheryyunl / ROVER
View on GitHub
Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation
☆27Dec 12, 2025Updated 7 months ago
hkust-nlp / mstar
View on GitHub
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆75Jul 13, 2025Updated last year
mingliangzhang2018 / PGDP
View on GitHub
The first end-to-end deep learning model for explicit plane geometry diagram parsing.
☆59Jun 3, 2026Updated last month
UMass-Embodied-AGI / Mirage
View on GitHub
[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens
☆294Aug 2, 2025Updated 11 months ago
Fr0zenCrane / UniCoT
View on GitHub
[ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
☆234May 31, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
thuml / Reasoning-Visual-World
View on GitHub
Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…
☆100Mar 9, 2026Updated 4 months ago
huaixuheqing / VPPO-RL
View on GitHub
[ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"
☆69Apr 3, 2026Updated 3 months ago
allenai / DrawEduMath
View on GitHub
Can VLMs understand students' hand-drawn math work?
☆19Jan 20, 2026Updated 6 months ago
JackLingjie / VisCodex
View on GitHub
Official Implementation for the paper "VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models"
☆23Aug 14, 2025Updated 11 months ago
THU-KEG / LongWriter-V
View on GitHub
[ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
☆24Mar 29, 2025Updated last year
Zhenwen-NLP / MathChat
View on GitHub
Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte…
☆22Jun 3, 2024Updated 2 years ago
OpenGVLab / GenExam
View on GitHub
[ICML 2026] GenExam: A Multidisciplinary Text-to-Image Exam
☆69May 26, 2026Updated 2 months ago
zhaochen0110 / Awesome_Think_With_Images
View on GitHub
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…
☆1,497Mar 9, 2026Updated 4 months ago
GAIR-NLP / Med
View on GitHub
[ICML 2026] What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-…
☆23May 15, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yayafengzi / ALToLLM
View on GitHub
ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation
☆30May 27, 2025Updated last year
ByteDance-Seed / Bagel
View on GitHub
Open-source unified multimodal model
☆6,124May 4, 2026Updated 2 months ago
ycpNotFound / GeoGen
View on GitHub
A pipeline for the automatic construction of geometry problems along with step-by-step solutions.
☆17Aug 27, 2025Updated 11 months ago
Yushi-Hu / VisualSketchpad
View on GitHub
Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
☆286Aug 5, 2025Updated 11 months ago
rongyaofang / prism-bench
View on GitHub
This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…
☆131Jan 29, 2026Updated 6 months ago
bebr2 / RACE
View on GitHub
Code for RACE.
☆15Nov 12, 2025Updated 8 months ago
rongyaofang / GoT
View on GitHub
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
☆317Sep 28, 2025Updated 10 months ago