The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"
☆259Feb 4, 2026Updated 5 months ago
Alternatives and similar repositories for Parallel-R1
Users that are interested in Parallel-R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆34Sep 19, 2025Updated 9 months ago
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML☆59Dec 12, 2025Updated 6 months ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆51Mar 31, 2026Updated 3 months ago
- [ICLR2026] codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆822Feb 4, 2026Updated 5 months ago
- Model souping for LLMs☆73Nov 18, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆32Mar 6, 2026Updated 3 months ago
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆280Apr 25, 2026Updated 2 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]☆64Apr 11, 2026Updated 2 months ago
- REverse-Engineered Reasoning for Open-Ended Generation☆98Sep 10, 2025Updated 9 months ago
- codes for Efficient Test-Time Scaling via Self-Calibration☆20Sep 13, 2025Updated 9 months ago
- ☆31Sep 12, 2025Updated 9 months ago
- [NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents☆58Nov 27, 2025Updated 7 months ago
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆33Jul 14, 2025Updated 11 months ago
- ☆72Oct 23, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Artifact evaluation of MobiSys25 SynCheck☆20Mar 24, 2025Updated last year
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆228Nov 27, 2025Updated 7 months ago
- ☆43Oct 28, 2025Updated 8 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆55Jul 23, 2025Updated 11 months ago
- Towards a Unified View of Large Language Model Post-Training☆211Sep 8, 2025Updated 9 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 5 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆141Dec 30, 2025Updated 6 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Jun 22, 2026Updated last week
- Official implementation of "sound distance estimation" WASPAA 23☆20Dec 31, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆121Feb 2, 2026Updated 5 months ago
- ☆22Mar 19, 2021Updated 5 years ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆42Feb 9, 2026Updated 4 months ago
- ☆12Apr 18, 2025Updated last year
- Recipes to train the self-rewarding reasoning LLMs.☆232Mar 2, 2025Updated last year
- [ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models☆165Jun 26, 2025Updated last year
- Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.☆43Jun 6, 2025Updated last year
- ☆47Nov 1, 2025Updated 8 months ago
- A Scientific Multimodal Foundation Model☆816May 19, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆41Nov 11, 2025Updated 7 months ago
- ☆28Oct 30, 2025Updated 8 months ago
- ☆22Jun 18, 2025Updated last year
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆68Jan 26, 2026Updated 5 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆50Apr 22, 2026Updated 2 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆966Jun 8, 2026Updated 3 weeks ago
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆397Mar 30, 2026Updated 3 months ago