GAIR-NLP / MathPileLinks
[NeurlPS D&B 2024] Generative AI for Math: MathPile
☆419Updated 9 months ago
Alternatives and similar repositories for MathPile
Users that are interested in MathPile are comparing it to the libraries listed below
Sorting:
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]☆382Updated last year
- SOTA Math Opensource LLM☆333Updated 2 years ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆482Updated last year
- ☆167Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆264Updated 6 months ago
- ☆320Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated 2 years ago
- [MathCoder, MathCoder-VL] Family of LLMs/LMMs for mathematical reasoning.☆337Updated 3 months ago
- FireAct: Toward Language Agent Fine-tuning☆291Updated 2 years ago
- A large-scale, fine-grained, diverse preference dataset (and models).☆361Updated 2 years ago
- ☆313Updated last year
- ☆559Updated last year
- ☆321Updated last year
- [ACL 2024] Progressive LLaMA with Block Expansion.☆514Updated last year
- Reformatted Alignment☆111Updated last year
- [ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark☆389Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆254Updated last year
- This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reaso…☆367Updated 2 years ago
- Evaluation suite for LLMs☆378Updated 6 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆269Updated last year
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆209Updated 2 years ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆555Updated 2 years ago
- ☆340Updated 7 months ago
- GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well a…☆346Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆583Updated last year
- ☆278Updated 2 years ago
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".☆231Updated last year
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆228Updated 2 years ago
- A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks☆272Updated last year
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆370Updated last year