GAIR-NLP / MathPileLinks
[NeurlPS D&B 2024] Generative AI for Math: MathPile
☆417Updated 6 months ago
Alternatives and similar repositories for MathPile
Users that are interested in MathPile are comparing it to the libraries listed below
Sorting:
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]☆376Updated last year
- SOTA Math Opensource LLM☆333Updated last year
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆473Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆263Updated 2 months ago
- ☆320Updated last year
- ☆163Updated last year
- [MathCoder, MathCoder-VL] Family of LLMs/LMMs for mathematical reasoning.☆319Updated last month
- FireAct: Toward Language Agent Fine-tuning☆281Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆177Updated 2 years ago
- A large-scale, fine-grained, diverse preference dataset (and models).☆353Updated last year
- ☆312Updated last year
- [ACL 2024] Progressive LLaMA with Block Expansion.☆510Updated last year
- GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well a…☆347Updated last year
- ☆541Updated 10 months ago
- Reformatted Alignment☆113Updated last year
- ☆307Updated last year
- Evaluation suite for LLMs☆363Updated 2 months ago
- The official evaluation suite and dynamic data release for MixEval.☆249Updated 10 months ago
- Official repository for LongChat and LongEval☆533Updated last year
- [ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark☆389Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆570Updated 9 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆266Updated last year
- ☆83Updated last year
- ☆275Updated 2 years ago
- ☆341Updated 4 months ago
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆209Updated last year
- [TMLR] Cumulative Reasoning With Large Language Models (https://arxiv.org/abs/2308.04371)☆302Updated 2 months ago
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆348Updated last year
- Code for Quiet-STaR☆740Updated last year
- ☆763Updated last year