Alpha-Innovator / GeoXLinks

[ICLR'25] Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training

☆46

Alternatives and similar repositories for GeoX

Users that are interested in GeoX are comparing it to the libraries listed below

Sorting:

pengshuai-rin / MultiMath
MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
☆31Updated 10 months ago
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆169Updated 6 months ago
pipilurj / G-LLaVA
Official github repo of G-LLaVA
☆148Updated 10 months ago
njucckevin / MM-Self-Improve
A Self-Training Framework for Vision-Language Reasoning
☆88Updated 10 months ago
HZQ950419 / Math-LLaVA
Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
☆92Updated last year
wwzhuang01 / Math-PUMA
[AAAI 2025]Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning
☆41Updated 8 months ago
DataArcTech / ChartMoE
[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding
☆94Updated 8 months ago
TideDra / VL-RLHF
A RLHF Infrastructure for Vision-Language Models
☆187Updated last year
NUS-TRAIL / NoisyRollout
[NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆101Updated 3 months ago
RifleZhang / LLaVA-Reasoner-DPO
☆106Updated 11 months ago
luka-group / mDPO
[EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.
☆83Updated last year
pipilurj / bootstrapped-preference-optimization-BPO
code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"
☆59Updated last year
InfiMM / Awesome-Multimodal-LLM-for-Math-STEM
Paper collections of multi-modal LLM for Math/STEM/Code.
☆131Updated last month
xinyan-cxy / MINT-CoT
[NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning
☆93Updated 3 months ago
ShadeCloak / ADORA
☆46Updated 8 months ago
mathllm / MATH-V
[NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.
☆126Updated 7 months ago
Kun-Xiang / AtomThink
Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"
☆57Updated last month
GAIR-NLP / thinking-with-generated-images
Doodling our way to AGI ✏️ 🖼️ 🧠
☆118Updated 6 months ago
bigai-nlco / LatentSeek
Official Repository of LatentSeek
☆70Updated 6 months ago
MikeWangWZHL / PAPO
Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"
☆106Updated last week
vlf-silkie / VLFeedback
☆100Updated last year
MME-Benchmarks / MME-CoT
MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency
☆135Updated 4 months ago
eternal8080 / MV-MATH
Description for MV-MATH
☆15Updated 5 months ago
findalexli / mllm-dpo
[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model
☆47Updated last year
OpenGVLab / V2PE
[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding
☆58Updated last year
Osilly / Awesome-Interleaving-Reasoning
Interleaving Reasoning: Next-Generation Reasoning Systems for AGI
☆220Updated 2 months ago
njucckevin / CapArena
An Arena-style Automated Evaluation Benchmark for Detailed Captioning
☆56Updated 6 months ago
xuyige / SoftCoT
ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…
☆71Updated 6 months ago
foundation-multimodal-models / CAL
[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
☆58Updated last year
QingyangZhang / EMPO
[NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method
☆87Updated 3 weeks ago