GAIR-NLP / MathPile
Generative AI for Math: MathPile
☆376Updated 2 weeks ago
Related projects: ⓘ
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)☆317Updated 3 weeks ago
- SOTA Math Opensource LLM☆296Updated 9 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆416Updated 6 months ago
- ☆268Updated this week
- FireAct: Toward Language Agent Fine-tuning☆242Updated 10 months ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆173Updated last year
- ☆419Updated 2 months ago
- ☆284Updated 3 months ago
- Family of LLMs for mathematical reasoning.☆208Updated 3 months ago
- ☆260Updated last month
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality s…☆398Updated this week
- This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reaso…☆329Updated 9 months ago
- Arena-Hard-Auto: An automatic LLM benchmark.☆421Updated 2 weeks ago
- Evaluation suite for LLMs☆291Updated 3 months ago
- [ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark☆342Updated 2 months ago
- ☆110Updated 4 months ago
- ☆179Updated this week
- Generative Representational Instruction Tuning☆527Updated 2 weeks ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆208Updated last week
- Official Pytorch Implementation for MathGLM☆315Updated 9 months ago
- The official evaluation suite and dynamic data release for MixEval.☆200Updated this week
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆219Updated last year
- ☆164Updated last week
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.☆608Updated last month
- ☆473Updated this week
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".☆208Updated 3 weeks ago
- Expert Specialized Fine-Tuning☆129Updated last month
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆970Updated 8 months ago
- A large-scale, fine-grained, diverse preference dataset (and models).☆299Updated 8 months ago
- [ACL 2024] Progressive LLaMA with Block Expansion.☆464Updated 4 months ago