bin123apple / MACMLinks
MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems
β87Updated 11 months ago
Alternatives and similar repositories for MACM
Users that are interested in MACM are comparing it to the libraries listed below
Sorting:
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learningβ99Updated last month
- [NeurIPS'24] Official code for *π―DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*β108Updated 6 months ago
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)β54Updated 10 months ago
- The official repository of the Omni-MATH benchmark.β84Updated 6 months ago
- RL Scaling and Test-Time Scaling (ICML'25)β106Updated 5 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correctionβ72Updated 3 months ago
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Studyβ51Updated 7 months ago
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"β161Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.β124Updated 3 months ago
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)β31Updated last year
- "Improving Mathematical Reasoning with Process Supervision" by OPENAIβ109Updated this week
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".β94Updated 3 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoningβ48Updated 7 months ago
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Datasetβ101Updated last month
- A version of verl to support tool useβ261Updated this week
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examplesβ95Updated 2 weeks ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learningβ222Updated last month
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied witβ¦β131Updated 11 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generationβ100Updated 4 months ago
- On Memorization of Large Language Models in Logical Reasoningβ67Updated 2 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"β159Updated 3 weeks ago
- β103Updated 6 months ago
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scieβ¦β155Updated 2 weeks ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluationsβ113Updated 2 months ago
- β62Updated 8 months ago
- β71Updated 7 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"β61Updated 2 months ago
- Code for Paper: Learning Adaptive Parallel Reasoning with Language Modelsβ107Updated 2 months ago
- Code & data for ICLR 2024 spotlight paper: π―MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Dataβ41Updated last year
- β62Updated last week