multimodal-art-projection / CriticLean
☆41Updated 3 weeks ago
Alternatives and similar repositories for CriticLean
Users that are interested in CriticLean are comparing it to the libraries listed below
- Solving Inequality Proofs with Large Language Models.☆42Updated this week
- ☆63Updated last month
- ☆41Updated 11 months ago
- ☆275Updated 3 weeks ago
- ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆108Updated last month
- A repo for open research on building large reasoning models☆92Updated this week
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆108Updated 3 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆102Updated 5 months ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆82Updated 3 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆167Updated last month
- The official repository of the Omni-MATH benchmark.☆87Updated 8 months ago
- 🚀 SWE-bench Goes Live!☆112Updated last month
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated 9 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆111Updated 7 months ago
- ☆91Updated 3 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆163Updated 2 months ago
- [COLM 2025] An Open Math Pre-training Dataset with 370B Tokens.☆98Updated 4 months ago
- Resources for the Enigmata Project.☆66Updated 2 weeks ago
- ☆126Updated 3 months ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆244Updated 2 months ago
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆122Updated last week
- ☆56Updated 2 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆112Updated 2 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆137Updated this week
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆69Updated 2 months ago
- ☆39Updated 2 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆116Updated 4 months ago
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆163Updated last month
- [ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆166Updated 2 months ago
- Technical report of Kimina-Prover Preview.☆323Updated last month