mathllm / MathCoder

[MathCoder, MathCoder-VL] Family of LLMs/LMMs for mathematical reasoning.

☆303

Alternatives and similar repositories for MathCoder

Users who are interested in MathCoder are comparing it to the repositories listed below.

TIGER-AI-Lab / MAmmoTH
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]
☆376 Updated 11 months ago
OpenBMB / OlympiadBench
[ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…
☆164 Updated 2 months ago
lupantech / MathVista
MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts
☆328 Updated 8 months ago
OpenBMB / Eurus
☆320 Updated 10 months ago
GAIR-NLP / MathPile
[NeurIPS D&B 2024] Generative AI for Math: MathPile
☆415 Updated 4 months ago
OFA-Sys / gsm8k-ScRel
Code and data for "Scaling Relationship on Learning Mathematical Reasoning with Large Language Models"
☆268 Updated 10 months ago
eddycmu / demystify-long-cot
☆309 Updated 2 months ago
kyegomez / Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OpenAI
☆112 Updated 2 weeks ago
tongyx361 / Awesome-LLM4Math
Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…
☆133 Updated last year
keirp / OpenWebMath
☆161 Updated last year
OpenBMB / UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
☆345 Updated last year
tianyi-lab / Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
☆360 Updated 11 months ago
voidism / DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆504 Updated 6 months ago
iiis-ai / cumulative-reasoning
Official implementation of TMLR paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)
☆297 Updated last week
open-compass / MathBench
[ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset
☆104 Updated 2 months ago
Re-Align / URIAL
☆311 Updated last year
da03 / Internalize_CoT_Step_by_Step
☆187 Updated 3 months ago
GAIR-NLP / ReAlign
Reformatted Alignment
☆113 Updated 10 months ago
da03 / implicit_chain_of_thought
☆135 Updated 8 months ago
TIGER-AI-Lab / CritiqueFineTuning
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]
☆169 Updated last month
ezelikman / STaR
Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)
☆206 Updated 2 years ago
MARIO-Math-Reasoning / Super_MARIO
☆337 Updated 2 months ago
TencentARC / LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
☆508 Updated last year
GAIR-NLP / ProX
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆255 Updated last month
QwenLM / AutoIF
☆301 Updated last year
ZubinGou / math-evaluation-harness
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
☆239 Updated last year
anchen1011 / FireAct
FireAct: Toward Language Agent Fine-tuning
☆281 Updated last year
Cohere-Labs-Community / parameter-efficient-moe
☆269 Updated last year
knoveleng / open-rs
Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"
☆248 Updated 2 months ago
SqueezeAILab / LLM2LLM
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
☆187 Updated last year