[ICLR2026] codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
☆795 · Feb 4, 2026 · Updated 2 months ago
Alternatives and similar repositories for R-Zero
Users that are interested in R-Zero are comparing it to the libraries listed below.
- codes for Efficient Test-Time Scaling via Self-Calibration ☆19 · Sep 13, 2025 · Updated 7 months ago
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning" ☆259 · Feb 4, 2026 · Updated 2 months ago
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning ☆1,048 · Updated this week
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL). ☆49 · Mar 31, 2026 · Updated 2 weeks ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning ☆183 · Mar 27, 2026 · Updated 3 weeks ago
- SSRL: Self-Search Reinforcement Learning ☆207 · Aug 20, 2025 · Updated 7 months ago
- ☆12 · Apr 18, 2025 · Updated last year
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning ☆36 · Oct 26, 2025 · Updated 5 months ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models" ☆27 · May 13, 2025 · Updated 11 months ago
- A version of verl to support diverse tool use ☆949 · Mar 2, 2026 · Updated last month
- Official Repository of Absolute Zero Reasoner ☆1,844 · Aug 24, 2025 · Updated 7 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25] ☆224 · Nov 27, 2025 · Updated 4 months ago
- ☆21 · Dec 14, 2024 · Updated last year
- Agent0 Series: Self-Evolving Agents from Zero Data ☆1,159 · Feb 17, 2026 · Updated 2 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme ☆147 · Apr 9, 2025 · Updated last year
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆4,494 · Nov 13, 2025 · Updated 5 months ago
- Democratizing Reinforcement Learning for LLMs ☆5,439 · Updated this week
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning ☆68 · Oct 31, 2025 · Updated 5 months ago
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments ☆202 · Apr 7, 2026 · Updated last week
- ☆64 · Mar 30, 2026 · Updated 2 weeks ago
- ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement … ☆45 · Aug 6, 2025 · Updated 8 months ago
- Self-Questioning Language Models ☆56 · Mar 30, 2026 · Updated 2 weeks ago
- XmodelLM ☆38 · Nov 19, 2024 · Updated last year
- [ICLR 2026] Learning to Reason without External Rewards ☆405 · Jan 26, 2026 · Updated 2 months ago
- ☆1,406 · Sep 12, 2025 · Updated 7 months ago
- VisPlay: Self-Evolving Vision-Language Models ☆54 · Feb 25, 2026 · Updated last month
- ☆31 · Sep 12, 2025 · Updated 7 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks ☆266 · May 5, 2025 · Updated 11 months ago
- Towards a Unified View of Large Language Model Post-Training ☆209 · Sep 8, 2025 · Updated 7 months ago
- ☆38 · Oct 28, 2025 · Updated 5 months ago
- Code for "Variational Reasoning for Language Models" ☆59 · Sep 29, 2025 · Updated 6 months ago
- ☆26 · Feb 20, 2026 · Updated last month
- verl: Volcano Engine Reinforcement Learning for LLMs ☆20,603 · Apr 10, 2026 · Updated last week
- Model souping for LLMs ☆73 · Nov 18, 2025 · Updated 5 months ago
- Official Repo for Open-Reasoner-Zero ☆2,091 · Jun 2, 2025 · Updated 10 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL. ☆558 · Sep 8, 2025 · Updated 7 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance" ☆437 · Mar 20, 2026 · Updated 3 weeks ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆688 · Mar 16, 2025 · Updated last year
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution ☆78 · Dec 8, 2025 · Updated 4 months ago