MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
☆454 · Feb 1, 2024 · Updated 2 years ago
Alternatives and similar repositories for MetaMath
Users who are interested in MetaMath are comparing it to the repositories listed below
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ☆121 · Dec 10, 2024 · Updated last year
- Code and data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆270 · Sep 12, 2024 · Updated last year
- 800,000 step-level correctness labels on LLM solutions to MATH problems ☆2,102 · Jun 1, 2023 · Updated 2 years ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆273 · Apr 26, 2024 · Updated last year
- ☆30 · Dec 27, 2024 · Updated last year
- Repository of <FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models> ☆77 · Jan 8, 2026 · Updated 2 months ago
- SOTA open-source math LLM ☆335 · Dec 12, 2023 · Updated 2 years ago
- Recipes to train reward model for RLHF. ☆1,521 · Apr 24, 2025 · Updated 10 months ago
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit… ☆1,114 · Feb 22, 2024 · Updated 2 years ago
- The official repository of the Omni-MATH benchmark. ☆93 · Dec 22, 2024 · Updated last year
- [NeurIPS D&B 2024] Generative AI for Math: MathPile ☆419 · Apr 4, 2025 · Updated 11 months ago
- ☆1,111 · Jan 10, 2026 · Updated 2 months ago
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024] ☆383 · Aug 25, 2024 · Updated last year
- State-of-the-art bilingual open-sourced Math reasoning LLMs. ☆543 · Oct 22, 2024 · Updated last year
- ☆83 · Apr 18, 2024 · Updated last year
- ☆342 · Jun 5, 2025 · Updated 9 months ago
- The MATH Dataset (NeurIPS 2021) ☆1,321 · Sep 6, 2025 · Updated 6 months ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment" ☆75 · May 20, 2025 · Updated 10 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH ☆26 · Dec 23, 2024 · Updated last year
- [ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large … ☆24 · May 29, 2024 · Updated last year
- Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx… ☆156 · May 25, 2024 · Updated last year
- [ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (As Huggingface Daily Papers: … ☆90 · Nov 23, 2025 · Updated 3 months ago
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs" ☆392 · Jan 19, 2025 · Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR 2024] ☆591 · Dec 9, 2024 · Updated last year
- A framework for few-shot evaluation of autoregressive language models. ☆26 · Dec 21, 2023 · Updated 2 years ago
- ☆332 · May 31, 2025 · Updated 9 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆48 · Jan 17, 2024 · Updated 2 years ago
- RewardBench: the first evaluation tool for reward models. ☆704 · Feb 16, 2026 · Updated last month
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆124 · Sep 9, 2024 · Updated last year
- A project to improve skills of large language models ☆865 · Mar 13, 2026 · Updated last week
- Imagine building a whole operating system around just your notes. ☆80 · Feb 5, 2025 · Updated last year
- ☆160 · Nov 23, 2024 · Updated last year
- ☆51 · Oct 28, 2024 · Updated last year
- ☆321 · Sep 18, 2024 · Updated last year
- Final Fantasy XIV English notes ☆96 · May 25, 2024 · Updated last year
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning. ☆25 · Oct 7, 2025 · Updated 5 months ago
- ☆72 · Apr 2, 2024 · Updated last year
- [ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie… ☆187 · Jun 8, 2025 · Updated 9 months ago
- A drag-and-drop time slot selector based on Vue 3.0, for the ultimate time management experience ☆78 · Mar 2, 2024 · Updated 2 years ago