zhengkid/Parallel-R1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhengkid/Parallel-R1)

zhengkid / Parallel-R1

The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"

☆260

Alternatives and similar repositories for Parallel-R1

Users that are interested in Parallel-R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhengkid / Parallel-Probe
View on GitHub
The offical repo for "Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing"
☆19Feb 3, 2026Updated 5 months ago
YujunZhou / EVOL-RL
View on GitHub
Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).
☆51Mar 31, 2026Updated 3 months ago
zhengkid / Parallel_Thinking_via_MoT
View on GitHub
Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"
☆29Nov 20, 2025Updated 8 months ago
MobileLLM / ParaThinker
View on GitHub
☆48Nov 1, 2025Updated 8 months ago
Chengsong-Huang / G-Zero
View on GitHub
☆25May 14, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zhengkid / AutoTTS
View on GitHub
The offical repo for "LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling"
☆170May 15, 2026Updated 2 months ago
bigai-nlco / Native-Parallel-Reasoner
View on GitHub
[ICML 2026] Reasoning in Parallelism via Self-Distilled RL
☆113Jun 28, 2026Updated 3 weeks ago
McGill-NLP / the-markovian-thinker
View on GitHub
Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"
☆350Mar 16, 2026Updated 4 months ago
Chengsong-Huang / R-Zero
View on GitHub
[ICLR2026] codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
☆823Feb 4, 2026Updated 5 months ago
Multiverse4FM / Multiverse
View on GitHub
☆88Jun 16, 2025Updated last year
WooooDyy / AgentGym-RL
View on GitHub
Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…
☆819Feb 15, 2026Updated 5 months ago
kyegomez / Reka-Torch
View on GitHub
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆28Updated this week
TsinghuaC3I / Unify-Post-Training
View on GitHub
Towards a Unified View of Large Language Model Post-Training
☆211Sep 8, 2025Updated 10 months ago
Zhiyuan-Zeng / RLVE
View on GitHub
[ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
☆225Apr 30, 2026Updated 2 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
microsoft / rStar
View on GitHub
☆1,422Sep 12, 2025Updated 10 months ago
MoonshotAI / checkpoint-engine
View on GitHub
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
☆980Jul 4, 2026Updated 3 weeks ago
sail-sg / variational-reasoning
View on GitHub
Code for "Variational Reasoning for Language Models"
☆60Sep 29, 2025Updated 9 months ago
sunblaze-ucb / Intuitor
View on GitHub
[ICLR 2026] Learning to Reason without External Rewards
☆418Jan 26, 2026Updated 5 months ago
yannqi / R-4B
View on GitHub
The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"
☆141Sep 4, 2025Updated 10 months ago
TIGER-AI-Lab / General-Reasoner
View on GitHub
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆228Nov 27, 2025Updated 7 months ago
yunfeixie233 / ViGaL
View on GitHub
☆70Feb 4, 2026Updated 5 months ago
NVlabs / DLER
View on GitHub
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning
☆17Nov 11, 2025Updated 8 months ago
ChengpengLi1003 / CoRT
View on GitHub
☆72Oct 23, 2025Updated 9 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
terminal-agent / reptile
View on GitHub
💻 Terminal-Agent with Human-in-the-Loop Learning
☆41Jan 16, 2026Updated 6 months ago
Infini-AI-Lab / Multiverse
View on GitHub
☆119Sep 13, 2025Updated 10 months ago
NVlabs / RLP
View on GitHub
[ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective
☆252Jan 26, 2026Updated 5 months ago
ypwang61 / One-Shot-RLVR
View on GitHub
[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example
☆444Mar 11, 2026Updated 4 months ago
evalops / cognitive-dissonance-dspy
View on GitHub
A multi-agent LLM system for detecting and resolving cognitive dissonance.
☆282Apr 25, 2026Updated 2 months ago
Kwai-Klear / RLEP
View on GitHub
RL with Experience Replay
☆59Jul 27, 2025Updated 11 months ago
sail-sg / feedback-conditional-policy
View on GitHub
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆65Jan 5, 2026Updated 6 months ago
GAIR-NLP / ToRL
View on GitHub
☆352May 24, 2025Updated last year
TIGER-AI-Lab / verl-tool
View on GitHub
A version of verl to support diverse tool use [TMLR 2026]
☆1,024Jul 15, 2026Updated last week
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
ulab-uiuc / Router-R1
View on GitHub
[NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
☆146Dec 30, 2025Updated 6 months ago
inclusionAI / ASearcher
View on GitHub
An Open-Source Large-Scale Reinforcement Learning Project for Search Agents
☆602Nov 26, 2025Updated 7 months ago
aiming-lab / Agent0
View on GitHub
[COLM'26 & ICML'26] Agent0 Series: Self-Evolving Agents from Zero Data
☆1,234Jul 10, 2026Updated 2 weeks ago
TsinghuaC3I / SSRL
View on GitHub
SSRL: Self-Search Reinforcement Learning
☆210Aug 20, 2025Updated 11 months ago
RUCAIBox / Passk_Training
View on GitHub
The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''
☆113Aug 15, 2025Updated 11 months ago
Parallel-Reasoning / APR
View on GitHub
[COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models
☆145Dec 17, 2025Updated 7 months ago
inclusionAI / PromptCoT
View on GitHub
A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…
☆132Jan 31, 2026Updated 5 months ago