Mix of Minimal Optimal Sets (MMOS) of dataset has two advantages for two aspects, higher performance and lower construction costs on math reasoning.
☆74Jul 27, 2024Updated last year
Alternatives and similar repositories for MMOS
Users that are interested in MMOS are comparing it to the libraries listed below
Sorting:
- ☆30Dec 27, 2024Updated last year
- ☆18Apr 5, 2025Updated 10 months ago
- AI for Mathematics Paper List☆17Jan 14, 2025Updated last year
- This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.☆14May 2, 2024Updated last year
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆31Dec 6, 2023Updated 2 years ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆273Apr 26, 2024Updated last year
- [ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (As Huggingface Daily Papers: …☆90Nov 23, 2025Updated 3 months ago
- [NeurIPS 2024] MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems☆92Jul 24, 2024Updated last year
- ☆26Jul 16, 2025Updated 7 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆96Oct 30, 2024Updated last year
- ☆26Nov 1, 2021Updated 4 years ago
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year
- A project to improve skills of large language models☆843Updated this week
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆120Dec 10, 2024Updated last year
- ☆12Mar 27, 2024Updated last year
- ☆14Jul 17, 2025Updated 7 months ago
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…☆1,113Feb 22, 2024Updated 2 years ago
- ☆29May 8, 2024Updated last year
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆42Aug 7, 2025Updated 6 months ago
- [MathCoder, MathCoder-VL] Family of LLMs/LMMs for mathematical reasoning.☆335Oct 18, 2025Updated 4 months ago
- A unified benchmark for math reasoning☆89Jan 25, 2023Updated 3 years ago
- The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset☆160Apr 23, 2024Updated last year
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆184Jun 8, 2025Updated 8 months ago
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models☆33Jun 10, 2024Updated last year
- The official repository of the Omni-MATH benchmark.☆93Dec 22, 2024Updated last year
- ☆83Apr 18, 2024Updated last year
- NeqLIPS: a powerful Olympiad-level inequality prover☆39Sep 7, 2025Updated 5 months ago
- ML Benchmarks in Algebraic Combinatorics☆22Jan 15, 2026Updated last month
- ☆14Aug 15, 2024Updated last year
- ☆22Dec 8, 2025Updated 2 months ago
- HyperTree Proof Search for Neural Theorem Proving -- "La science est l'œuvre de l'esprit humain, qui est plutôt destiné à étudier qu'à co…☆40Aug 1, 2024Updated last year
- [SIGIR 2024] This is the official PyTorch implementation for the paper: "EulerFormer: Sequential User Behavior Modeling with Complex Vect…☆17Oct 5, 2024Updated last year
- ☆16Apr 11, 2022Updated 3 years ago
- K12高中数学试题数据集☆15Aug 16, 2023Updated 2 years ago
- ☆342Jun 5, 2025Updated 8 months ago
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆69Aug 18, 2023Updated 2 years ago
- ☆72Apr 2, 2024Updated last year
- Scaling Sparse Fine-Tuning to Large Language Models☆18Jan 31, 2024Updated 2 years ago
- Official code for paper: INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving☆39Dec 12, 2022Updated 3 years ago