[ICLR2026] codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
☆813Feb 4, 2026Updated 4 months ago
Alternatives and similar repositories for R-Zero
Users who are interested in R-Zero are comparing it to the libraries listed below.
- codes for Efficient Test-Time Scaling via Self-Calibration☆20Sep 13, 2025Updated 9 months ago
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆258Feb 4, 2026Updated 4 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆196Mar 27, 2026Updated 2 months ago
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning☆1,083Apr 15, 2026Updated 2 months ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆51Mar 31, 2026Updated 2 months ago
- SSRL: Self-Search Reinforcement Learning☆208Aug 20, 2025Updated 9 months ago
- ☆12Apr 18, 2025Updated last year
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning☆36Oct 26, 2025Updated 7 months ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆26May 13, 2025Updated last year
- A version of verl to support diverse tool use [TMLR 2026]☆1,001Jun 8, 2026Updated last week
- Official Repository of Absolute Zero Reasoner☆1,868Aug 24, 2025Updated 9 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆227Nov 27, 2025Updated 6 months ago
- ☆21Dec 14, 2024Updated last year
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆149Apr 9, 2025Updated last year
- Democratizing Reinforcement Learning for LLMs☆5,608Updated this week
- [ICML'26] Agent0 Series: Self-Evolving Agents from Zero Data☆1,216Feb 17, 2026Updated 4 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,925Nov 13, 2025Updated 7 months ago
- [ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆218Apr 30, 2026Updated last month
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning☆69Oct 31, 2025Updated 7 months ago
- ☆64Mar 30, 2026Updated 2 months ago
- Self-Questioning Language Models☆56Mar 30, 2026Updated 2 months ago
- XmodelLM☆38Nov 19, 2024Updated last year
- [ICLR 2026] Learning to Reason without External Rewards☆410Jan 26, 2026Updated 4 months ago
- ☆1,420Sep 12, 2025Updated 9 months ago
- [CVPR'26] VisPlay: Self-Evolving Vision-Language Models☆62Feb 25, 2026Updated 3 months ago
- ☆31Sep 12, 2025Updated 9 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks☆268May 5, 2025Updated last year
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework☆21,969Updated this week
- ☆42Oct 28, 2025Updated 7 months ago
- Code for "Variational Reasoning for Language Models"☆60Sep 29, 2025Updated 8 months ago
- ☆26Feb 20, 2026Updated 3 months ago
- Towards a Unified View of Large Language Model Post-Training☆211Sep 8, 2025Updated 9 months ago
- Model souping for LLMs☆73Nov 18, 2025Updated 7 months ago
- Official Repo for Open-Reasoner-Zero☆2,097Jun 2, 2025Updated last year
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆576Sep 8, 2025Updated 9 months ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆699Mar 16, 2025Updated last year
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆452Mar 20, 2026Updated 2 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆81Dec 8, 2025Updated 6 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆396Nov 5, 2025Updated 7 months ago