mathllm / MathCoder
Family of LLMs for mathematical reasoning.
โ244Updated last month
Alternatives and similar repositories for MathCoder:
Users that are interested in MathCoder are comparing it to the libraries listed below
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied witโฆโ102Updated 6 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. ๐งฎโจโ145Updated 8 months ago
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scieโฆโ114Updated 6 months ago
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)โ355Updated 4 months ago
- โ302Updated 4 months ago
- โ297Updated this week
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Modelsโ233Updated 4 months ago
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Datasetโ92Updated 6 months ago
- Reformatted Alignmentโ113Updated 3 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"โ210Updated 3 months ago
- Mix of Minimal Optimal Sets (MMOS) of dataset has two advantages for two aspects, higher performance and lower construction costs on mathโฆโ72Updated 5 months ago
- โ81Updated 9 months ago
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)โ254Updated 9 months ago
- A large-scale, fine-grained, diverse preference dataset (and models).โ325Updated last year
- The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering datasetโ155Updated 8 months ago
- [NeurlPS D&B 2024] Generative AI for Math: MathPileโ401Updated 2 months ago
- โ134Updated 8 months ago
- SOTA Math Opensource LLMโ329Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasksโ177Updated last year
- Unofficial implementation of AlpaGasusโ90Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuningโ221Updated last year
- Data and Code for Program of Thoughts (TMLR 2023)โ256Updated 8 months ago
- โ121Updated last month
- โ246Updated 5 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMsโ237Updated last month
- Implementation of paper Data Engineering for Scaling Language Models to 128K Contextโ449Updated 10 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.โ133Updated last week
- An Analytical Evaluation Board of Multi-turn LLM Agentsโ270Updated 7 months ago
- โ113Updated 2 months ago
- Generative Judge for Evaluating Alignmentโ223Updated last year