mathllm / MathCoderLinks
[MathCoder, MathCoder-VL] Family of LLMs/LMMs for mathematical reasoning.
☆333Updated last month
Alternatives and similar repositories for MathCoder
Users that are interested in MathCoder are comparing it to the libraries listed below
Sorting:
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]☆376Updated last year
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆174Updated 5 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆267Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆145Updated last year
- ☆320Updated last year
- MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts☆346Updated 2 months ago
- ☆166Updated last year
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆109Updated 6 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆179Updated 4 months ago
- ☆139Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆263Updated 4 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆191Updated last year
- [NeurlPS D&B 2024] Generative AI for Math: MathPile☆418Updated 7 months ago
- A large-scale, fine-grained, diverse preference dataset (and models).☆355Updated last year
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆364Updated last year
- ☆327Updated 6 months ago
- [NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.☆123Updated 6 months ago
- ☆341Updated 5 months ago
- Reformatted Alignment☆113Updated last year
- Generative Judge for Evaluating Alignment☆248Updated last year
- FireAct: Toward Language Agent Fine-tuning☆286Updated 2 years ago
- ☆315Updated last year
- ☆313Updated last year
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆209Updated 2 years ago
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆524Updated 10 months ago
- [TMLR] Cumulative Reasoning With Large Language Models (https://arxiv.org/abs/2308.04371)☆303Updated 3 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆113Updated last month
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆218Updated 2 years ago
- Data and Code for Program of Thoughts [TMLR 2023]☆292Updated last year
- [ACL 2024] Progressive LLaMA with Block Expansion.☆513Updated last year