multimodal-art-projection / CriticLeanLinks
☆39Updated 3 weeks ago
Alternatives and similar repositories for CriticLean
Users that are interested in CriticLean are comparing it to the libraries listed below
Sorting:
- Solving Inequality Proofs with Large Language Models.☆38Updated this week
- ☆58Updated 3 weeks ago
- A repo for open research on building large reasoning models☆84Updated this week
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆102Updated 5 months ago
- MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning.☆106Updated this week
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆161Updated 2 weeks ago
- ☆41Updated 10 months ago
- Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆103Updated 2 weeks ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆82Updated 2 months ago
- ☆227Updated this week
- ☆85Updated 2 months ago
- Resources for the Enigmata Project.☆58Updated last month
- The official repository of the Omni-MATH benchmark.☆85Updated 7 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆67Updated 2 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆105Updated 3 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆99Updated 2 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆156Updated last month
- ☆322Updated last week
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆126Updated last week
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆117Updated 3 months ago
- 🚀 SWE-bench Goes Live!☆104Updated last week
- Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆96Updated last month
- Code for "Reasoning to Learn from Latent Thoughts"☆114Updated 4 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆109Updated 6 months ago
- Technical report of Kimina-Prover Preview.☆320Updated 3 weeks ago
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆160Updated 3 weeks ago
- CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings☆47Updated 6 months ago
- ☆39Updated last month
- Repo of paper "Free Process Rewards without Process Labels"☆160Updated 4 months ago
- repo for paper https://arxiv.org/abs/2504.13837☆180Updated last month