AIMO-CMU-MATH / CMU_MATH-AIMO
☆70Updated 7 months ago
Alternatives and similar repositories for CMU_MATH-AIMO:
Users that are interested in CMU_MATH-AIMO are comparing it to the libraries listed below
- Mix of Minimal Optimal Sets (MMOS) of dataset has two advantages for two aspects, higher performance and lower construction costs on math…☆72Updated 6 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆114Updated 7 months ago
- The official repository of the Omni-MATH benchmark.☆71Updated last month
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆94Updated 2 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆103Updated last week
- ☆82Updated 3 weeks ago
- ☆130Updated 2 months ago
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆126Updated 7 months ago
- ☆133Updated 2 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆173Updated 9 months ago
- MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems☆79Updated 6 months ago
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces…☆36Updated last month
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆39Updated 3 months ago
- Code implementation of synthetic continued pretraining☆88Updated last month
- Simple and efficient pytorch-native transformer training and inference (batched)☆68Updated 10 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆48Updated 2 weeks ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.☆194Updated 2 weeks ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆48Updated 2 months ago
- ☆33Updated 8 months ago
- ☆92Updated last month
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆154Updated 2 months ago
- Repo of paper "Free Process Rewards without Process Labels"☆123Updated last month
- ☆54Updated 3 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆95Updated 4 months ago