zhengkid / Parallel-R1
The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"
☆237Updated 3 weeks ago
Alternatives and similar repositories for Parallel-R1
Users who are interested in Parallel-R1 are comparing it to the repositories listed below:
- Latent Collaboration in Multi-Agent Systems (LatentMAS)☆491Updated this week
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.☆289Updated last week
- Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆691Updated last month
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆511Updated 3 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆352Updated 5 months ago
- Demystifying Reinforcement Learning in Agentic Reasoning☆126Updated last month
- ☆226Updated 9 months ago
- LIMI: Less is More for Agency☆151Updated last month
- ☆342Updated last month
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆211Updated 2 months ago
- SSRL: Self-Search Reinforcement Learning☆157Updated 3 months ago
- Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research☆428Updated last week
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆187Updated 5 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆135Updated 3 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆161Updated last month
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆100Updated 3 months ago
- Next paradigm for LLM Agent. Unify plan and action through recursive code generation for adaptive, human-like decision-making.☆452Updated last week
- accompanying material for sleep-time compute paper☆118Updated 7 months ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆108Updated 2 weeks ago
- The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution☆154Updated this week
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆506Updated 2 weeks ago
- Data Synthesis for Deep Research Based on Semi-Structured Data☆182Updated last month
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks"☆254Updated 7 months ago
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆151Updated 3 weeks ago
- GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…☆289Updated last month
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆225Updated last week
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆289Updated last month
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents☆507Updated 2 weeks ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"☆521Updated last month
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆501Updated 3 months ago