mathllm / MathCoderLinks
[MathCoder, MathCoder-VL] Family of LLMs/LMMs for mathematical reasoning.
☆300Updated last month
Alternatives and similar repositories for MathCoder
Users that are interested in MathCoder are comparing it to the libraries listed below
Sorting:
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]☆376Updated 10 months ago
- [NeurlPS D&B 2024] Generative AI for Math: MathPile☆414Updated 3 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆266Updated 10 months ago
- ☆319Updated 9 months ago
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆159Updated last month
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆132Updated last year
- MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts☆321Updated 7 months ago
- A large-scale, fine-grained, diverse preference dataset (and models).☆344Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆185Updated last year
- ☆159Updated last year
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆111Updated this week
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆357Updated 10 months ago
- ☆304Updated last month
- ☆337Updated last month
- Official implementation of paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)☆294Updated 10 months ago
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆209Updated last year
- [ACL 2024] Progressive LLaMA with Block Expansion.☆505Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆253Updated last week
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆103Updated last month
- ☆310Updated last year
- FireAct: Toward Language Agent Fine-tuning☆280Updated last year
- ☆135Updated 8 months ago
- ☆294Updated 11 months ago
- Reformatted Alignment☆113Updated 9 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆229Updated 8 months ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆180Updated 7 months ago
- ☆266Updated last year
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆503Updated 6 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆244Updated 2 months ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆427Updated last year