multimodal-art-projection / CriticLean
☆42 Updated 2 months ago
Alternatives and similar repositories for CriticLean
Users who are interested in CriticLean are comparing it to the repositories listed below.
- ☆68 Updated 2 months ago
- Solving Inequality Proofs with Large Language Models. ☆44 Updated 2 weeks ago
- ☆42 Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners ☆85 Updated 4 months ago
- ☆40 Updated 3 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆105 Updated 7 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling ☆176 Updated 2 months ago
- ☆62 Updated 4 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning ☆114 Updated 5 months ago
- A repo for open research on building large reasoning models ☆106 Updated this week
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆58 Updated 7 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning ☆122 Updated 3 weeks ago
- RL Scaling and Test-Time Scaling (ICML'25) ☆111 Updated 8 months ago
- Code for "Reasoning to Learn from Latent Thoughts" ☆119 Updated 6 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning ☆151 Updated 3 weeks ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization ☆47 Updated 2 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆71 Updated 4 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning ☆48 Updated 11 months ago
- ☆296 Updated 3 weeks ago
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models ☆130 Updated last month
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25] ☆174 Updated 4 months ago
- [ICLR 2025] This is the official implementation for the paper: "Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluat… ☆34 Updated 3 months ago
- ☆49 Updated 7 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆82 Updated 6 months ago
- SSRL: Self-Search Reinforcement Learning ☆145 Updated last month
- Tree Search for LLM Agent Reinforcement Learning ☆113 Updated last week
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25] ☆88 Updated 6 months ago
- ☆52 Updated 3 months ago
- The official repository of the Omni-MATH benchmark. ☆88 Updated 9 months ago
- Code for "Variational Reasoning for Language Models" ☆42 Updated last week