cyzhh / MMOS
Mix of Minimal Optimal Sets (MMOS) is a dataset with two advantages for math reasoning: higher performance and lower construction costs.
☆73Updated 10 months ago
Alternatives and similar repositories for MMOS
Users interested in MMOS are comparing it to the repositories listed below.
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"☆155Updated 2 weeks ago
- Critique-out-Loud Reward Models☆66Updated 7 months ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆61Updated 5 months ago
- ☆75Updated 10 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆129Updated 10 months ago
- ☆67Updated last year
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆120Updated 8 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆106Updated 5 months ago
- ☆82Updated 4 months ago
- ☆47Updated 3 months ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆73Updated 2 weeks ago
- Code for ACL 2024 paper - Adversarial Preference Optimization (APO).☆54Updated last year
- [ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆151Updated 10 months ago
- Reference implementation for Token-level Direct Preference Optimization (TDPO)☆140Updated 3 months ago
- Explore what LLMs are really learning over SFT☆28Updated last year
- Official implementation of ACL 2025 Findings paper "Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Text…☆81Updated 3 weeks ago
- MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems☆86Updated 10 months ago
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆131Updated last year
- ☆33Updated 8 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆62Updated 10 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆220Updated last year
- ☆97Updated last year
- GenRM-CoT: Data release for verification rationales☆61Updated 7 months ago
- ☆29Updated 5 months ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆49Updated last year
- Collection of papers for scalable automated alignment.☆90Updated 7 months ago
- ☆48Updated 3 weeks ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆127Updated 10 months ago
- AIMO2 2nd place solution☆57Updated last week
- ☆141Updated 6 months ago