Chengsong-Huang / R-Zero
Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
☆714 · Updated 2 weeks ago
Alternatives and similar repositories for R-Zero
Users who are interested in R-Zero are comparing it to the libraries listed below.
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning" ☆246 · Updated last month
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen… ☆539 · Updated 3 months ago
- Latent Collaboration in Multi-Agent Systems ☆668 · Updated last week
- OpenTinker is an RL-as-a-Service infrastructure for foundation models ☆499 · Updated this week
- A Scientific Multimodal Foundation Model ☆625 · Updated 3 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆355 · Updated 6 months ago
- Official implementation of "Continuous Autoregressive Language Models" ☆676 · Updated last month
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov…