HKU-MMLab / Math-VR-CodePlot-CoTLinks
Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images
☆53Updated 2 months ago
Alternatives and similar repositories for Math-VR-CodePlot-CoT
Users that are interested in Math-VR-CodePlot-CoT are comparing it to the libraries listed below
Sorting:
- ☆35Updated 2 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆91Updated 5 months ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆205Updated this week
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆94Updated last year
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT☆117Updated 3 months ago
- ☆80Updated 7 months ago
- Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos☆64Updated 4 months ago
- Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"☆79Updated last month
- ☆204Updated last month
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Updated last year
- Multimodal RewardBench☆60Updated 11 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆52Updated 6 months ago
- ☆96Updated 7 months ago
- ☆141Updated 3 months ago
- [MTI-LLM@NeurIPS 2025] Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."☆145Updated 6 months ago
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?☆86Updated 6 months ago
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆74Updated 4 months ago
- Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"☆78Updated 3 months ago
- ☆39Updated 8 months ago
- [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark☆138Updated 7 months ago
- The code repository of UniRL☆51Updated 8 months ago
- Official implement of MIA-DPO☆70Updated last year
- OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe☆141Updated last month
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Updated 6 months ago
- A collection of awesome think with videos papers.☆83Updated 2 months ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆42Updated 9 months ago
- (ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆58Updated last week
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆206Updated 3 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆179Updated 7 months ago
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models☆39Updated 7 months ago