The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"
☆259Feb 4, 2026Updated 2 months ago
Alternatives and similar repositories for Parallel-R1
Users that are interested in Parallel-R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆34Sep 19, 2025Updated 7 months ago
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML☆59Dec 12, 2025Updated 4 months ago
- [ICLR2026] codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆800Feb 4, 2026Updated 2 months ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆50Mar 31, 2026Updated last month
- Model souping for LLMs☆73Nov 18, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆26Mar 6, 2026Updated last month
- ☆14Oct 11, 2023Updated 2 years ago
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆276Updated this week
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]☆64Apr 11, 2026Updated 3 weeks ago
- REverse-Engineered Reasoning for Open-Ended Generation☆95Sep 10, 2025Updated 7 months ago
- codes for Efficient Test-Time Scaling via Self-Calibration☆20Sep 13, 2025Updated 7 months ago
- ☆18Oct 3, 2024Updated last year
- ☆15Nov 18, 2025Updated 5 months ago
- 🚀 First survey on Attention Sink in Transformers — 180+ papers on utilization, interpretation, and mitigation.☆69Apr 16, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆31Sep 12, 2025Updated 7 months ago
- PICABench: How Far Are We from Physically Realistic Image Editing?☆36Nov 5, 2025Updated 5 months ago
- [NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents☆56Nov 27, 2025Updated 5 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 2 months ago
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆31Jul 14, 2025Updated 9 months ago
- ☆72Oct 23, 2025Updated 6 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆223Nov 27, 2025Updated 5 months ago
- Artifact evaluation of MobiSys25 SynCheck☆20Mar 24, 2025Updated last year
- ☆39Oct 28, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Towards a Unified View of Large Language Model Post-Training☆209Sep 8, 2025Updated 7 months ago
- VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking☆86Jan 21, 2026Updated 3 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆55Jul 23, 2025Updated 9 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 2 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆129Dec 30, 2025Updated 4 months ago
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated 3 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Apr 20, 2026Updated last week
- Official implementation of "sound distance estimation" WASPAA 23☆19Dec 31, 2023Updated 2 years ago
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs""☆33Oct 12, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆120Feb 2, 2026Updated 2 months ago
- QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on …☆13Mar 25, 2024Updated 2 years ago
- ☆21Mar 19, 2021Updated 5 years ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆39Feb 9, 2026Updated 2 months ago
- ☆12Apr 18, 2025Updated last year
- Recipes to train the self-rewarding reasoning LLMs.☆233Mar 2, 2025Updated last year
- [ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models☆162Jun 26, 2025Updated 10 months ago