mathllm / MathCoder
[MathCoder, MathCoder-VL] Family of LLMs/LMMs for mathematical reasoning.
☆325 Updated this week
Alternatives and similar repositories for MathCoder
Users who are interested in MathCoder are comparing it to the libraries listed below.
- MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts ☆341 Updated 3 weeks ago
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024] ☆377 Updated last year
- A large-scale, fine-grained, diverse preference dataset (and models). ☆353 Updated last year
- [ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie… ☆171 Updated 4 months ago
- ☆312 Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆267 Updated last year
- ☆319 Updated last year
- [TMLR] Cumulative Reasoning With Large Language Models (https://arxiv.org/abs/2308.04371) ☆302 Updated 2 months ago
- [NeurIPS D&B 2024] Generative AI for Math: MathPile ☆417 Updated 6 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆258 Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement ☆190 Updated last year
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models" ☆520 Updated 9 months ago
- ☆321 Updated 4 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit… ☆143 Updated last year
- [NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities. ☆118 Updated 5 months ago
- ☆164 Updated last year
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022) ☆214 Updated 2 years ago
- SOTA Math Opensource LLM ☆333 Updated last year
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models" ☆209 Updated 2 years ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆263 Updated 3 months ago
- ☆342 Updated 4 months ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs. ☆436 Updated last year
- Generative Judge for Evaluating Alignment ☆247 Updated last year
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning ☆362 Updated last year
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025] ☆174 Updated 3 months ago
- Data and Code for Program of Thoughts [TMLR 2023] ☆287 Updated last year
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral] ☆354 Updated last year
- ☆146 Updated last week
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset ☆106 Updated 4 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. ☆219 Updated 2 months ago