Chengsong-Huang / R-Zero
Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
☆670Updated 3 weeks ago
Alternatives and similar repositories for R-Zero
Users who are interested in R-Zero are comparing it to the repositories listed below:
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆233Updated last week
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆495Updated 2 months ago
- Official implementation of "Continuous Autoregressive Language Models"☆584Updated last week
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆492Updated last month
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆349Updated 5 months ago
- ☆317Updated 2 weeks ago
- A Scientific Multimodal Foundation Model☆607Updated last month
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"☆488Updated 2 weeks ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆490Updated 2 months ago
- ☆838Updated 2 months ago
- ☆1,184Updated this week
- ☆222Updated 8 months ago
- OpenCUA: Open Foundations for Computer-Use Agents☆569Updated last week
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆717Updated last month
- Code for the paper: "Learning to Reason without External Rewards"☆375Updated 4 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆785Updated 3 months ago
- ☆268Updated 2 months ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆242Updated last month
- 🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets☆775Updated 3 weeks ago
- Scaling RL on advanced reasoning models☆632Updated last month
- GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…☆272Updated last week
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example☆376Updated last month
- Tina: Tiny Reasoning Models via LoRA☆305Updated 2 months ago
- ☆1,339Updated 2 months ago
- Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"☆277Updated last week
- LightMem: Lightweight and Efficient Memory-Augmented Generation☆388Updated this week
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆282Updated last month
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆596Updated 5 months ago
- dLLM: Simple Diffusion Language Modeling☆950Updated this week
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆364Updated this week