Chengsong-Huang / R-ZeroLinks
codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
☆638Updated last week
Alternatives and similar repositories for R-Zero
Users that are interested in R-Zero are comparing it to the libraries listed below
Sorting:
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆422Updated last month
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆189Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆343Updated 3 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆453Updated this week
- A Scientific Multimodal Foundation Model☆579Updated last week
- OpenCUA: Open Foundations for Computer-Use Agents☆511Updated last week
- ☆821Updated 3 weeks ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆644Updated 2 weeks ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆712Updated 2 months ago
- ☆226Updated 3 weeks ago
- Scaling RL on advanced reasoning models☆607Updated 2 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆447Updated last month
- ☆218Updated 7 months ago
- ☆1,283Updated last month
- Tina: Tiny Reasoning Models via LoRA☆290Updated 2 weeks ago
- Code for the paper: "Learning to Reason without External Rewards"☆360Updated 3 months ago
- Self-Adapting Language Models☆805Updated 2 months ago
- 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.☆451Updated 2 weeks ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆265Updated 5 months ago
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆588Updated 3 months ago
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example☆361Updated this week
- Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"☆246Updated last month
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆549Updated 5 months ago
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents☆445Updated this week
- Dream 7B, a large diffusion language model