Chengsong-Huang / R-Zero
Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
☆736 · Updated last month
Alternatives and similar repositories for R-Zero
Users who are interested in R-Zero are comparing it to the repositories listed below.
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen… ☆560 · Updated 4 months ago
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning" ☆251 · Updated 2 months ago
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov… ☆588 · Updated last week
- Latent Collaboration in Multi-Agent Systems ☆723 · Updated this week
- OpenTinker is an RL-as-a-Service infrastructure for foundation models ☆599 · Updated this week
- Official implementation of "Continuous Autoregressive Language Models" ☆714 · Updated last month
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows. ☆604 · Updated last month
- A Scientific Multimodal Foundation Model ☆627 · Updated 3 months ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation. ☆799 · Updated last month
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents ☆543 · Updated 2 months ago
- Agent0 Series: Self-Evolving Agents from Zero Data ☆1,006 · Updated last month
- ☆379 · Updated 2 months ago
- Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research ☆519 · Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆358 · Updated 7 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL. ☆527 · Updated 4 months ago
- ☆862 · Updated 4 months ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression" ☆552 · Updated 2 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow. ☆871 · Updated 5 months ago
- ☆321 · Updated 4 months ago
- ☆1,272 · Updated 2 months ago
- Repo for "Adaptation of Agentic AI" ☆572 · Updated this week
- Scaling RL on advanced reasoning models ☆661 · Updated 3 months ago
- OpenCUA: Open Foundations for Computer-Use Agents ☆646 · Updated last week
- dLLM: Simple Diffusion Language Modeling ☆1,633 · Updated 2 weeks ago
- ☆491 · Updated last month
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example ☆400 · Updated 2 months ago
- ☆227 · Updated 11 months ago
- Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space" ☆301 · Updated last week
- A general memory system for agents, powered by deep-research ☆793 · Updated last month
- ☆1,383 · Updated 4 months ago