The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"
☆258 · Feb 4, 2026 · Updated 4 months ago
Alternatives and similar repositories for Parallel-R1
Users who are interested in Parallel-R1 are comparing it to the libraries listed below.
- ☆34 · Sep 19, 2025 · Updated 8 months ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning ☆39 · Jan 16, 2026 · Updated 4 months ago
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML ☆59 · Dec 12, 2025 · Updated 6 months ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL). ☆51 · Mar 31, 2026 · Updated 2 months ago
- The official repo for "LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling" ☆165 · May 15, 2026 · Updated 3 weeks ago
- [ICLR2026] codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004) ☆813 · Feb 4, 2026 · Updated 4 months ago
- Model souping for LLMs ☆73 · Nov 18, 2025 · Updated 6 months ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization ☆31 · Mar 6, 2026 · Updated 3 months ago
- ☆14 · Oct 11, 2023 · Updated 2 years ago
- A multi-agent LLM system for detecting and resolving cognitive dissonance. ☆279 · Apr 25, 2026 · Updated last month
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26] ☆65 · Apr 11, 2026 · Updated 2 months ago
- codes for Efficient Test-Time Scaling via Self-Calibration ☆20 · Sep 13, 2025 · Updated 9 months ago
- ☆18 · Oct 3, 2024 · Updated last year
- PICABench: How Far Are We from Physically Realistic Image Editing? ☆38 · Nov 5, 2025 · Updated 7 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆17 · Feb 9, 2026 · Updated 4 months ago
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning ☆32 · Jul 14, 2025 · Updated 11 months ago
- ☆72 · Oct 23, 2025 · Updated 7 months ago
- Artifact evaluation of MobiSys25 SynCheck ☆20 · Mar 24, 2025 · Updated last year
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25] ☆227 · Nov 27, 2025 · Updated 6 months ago
- ☆42 · Oct 28, 2025 · Updated 7 months ago
- VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking ☆89 · Jan 21, 2026 · Updated 4 months ago
- Towards a Unified View of Large Language Model Post-Training ☆211 · Sep 8, 2025 · Updated 9 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning ☆54 · Jul 23, 2025 · Updated 10 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains ☆49 · Feb 4, 2026 · Updated 4 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning ☆135 · Dec 30, 2025 · Updated 5 months ago
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space” ☆18 · Jan 27, 2026 · Updated 4 months ago
- Official implementation of "sound distance estimation" WASPAA 23 ☆20 · Dec 31, 2023 · Updated 2 years ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning ☆120 · Feb 2, 2026 · Updated 4 months ago
- ☆22 · Mar 19, 2021 · Updated 5 years ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning ☆41 · Feb 9, 2026 · Updated 4 months ago
- ☆12 · Apr 18, 2025 · Updated last year
- Recipes to train the self-rewarding reasoning LLMs. ☆232 · Mar 2, 2025 · Updated last year
- [ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models ☆165 · Jun 26, 2025 · Updated 11 months ago
- Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains. ☆43 · Jun 6, 2025 · Updated last year
- ☆47 · Nov 1, 2025 · Updated 7 months ago
- A Scientific Multimodal Foundation Model ☆811 · May 19, 2026 · Updated 3 weeks ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning ☆41 · Nov 11, 2025 · Updated 7 months ago
- ☆21 · Jun 18, 2025 · Updated 11 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization ☆42 · Feb 7, 2026 · Updated 4 months ago