AIMO-CMU-MATH / CMU_MATH-AIMOLinks
β75Updated 10 months ago
Alternatives and similar repositories for CMU_MATH-AIMO
Users that are interested in CMU_MATH-AIMO are comparing it to the libraries listed below
Sorting:
- [NeurIPS'24] Official code for *π―DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*β106Updated 5 months ago
- Mix of Minimal Optimal Sets (MMOS) of dataset has two advantages for two aspects, higher performance and lower construction costs on mathβ¦β73Updated 10 months ago
- AIMO2 2nd place solutionβ57Updated last week
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scieβ¦β151Updated 10 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".β95Updated 2 months ago
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Procesβ¦β54Updated 4 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)β75Updated last year
- β141Updated 6 months ago
- Revisiting Mid-training in the Era of RL Scalingβ48Updated last month
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"β155Updated 2 weeks ago
- β153Updated last year
- A version of verl to support tool useβ172Updated this week
- "Improving Mathematical Reasoning with Process Supervision" by OPENAIβ107Updated 2 weeks ago
- The official repository of the Omni-MATH benchmark.β83Updated 5 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scalingβ104Updated 4 months ago
- Async pipelined version of Verlβ91Updated last month
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, andβ¦β58Updated 2 months ago
- β33Updated 8 months ago
- Critique-out-Loud Reward Modelsβ66Updated 7 months ago
- Official implementation of ACL 2025 Findings paper "Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Textβ¦β81Updated 3 weeks ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracyβ61Updated 5 months ago
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)β54Updated 9 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Modelsβ57Updated 3 months ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Modelsβ54Updated last month
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"β186Updated 3 months ago
- MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problemsβ86Updated 10 months ago
- [NeurIPS-2024] π Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623β84Updated 8 months ago
- β53Updated 3 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied witβ¦β129Updated 10 months ago
- β82Updated 4 months ago