[ICML 2026] Reasoning in Parallelism via Self-Distilled RL
☆114Jun 28, 2026Updated this week
Alternatives and similar repositories for Native-Parallel-Reasoner
Users that are interested in Native-Parallel-Reasoner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆23Apr 10, 2026Updated 2 months ago
- The offical repo for "LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling"☆167May 15, 2026Updated last month
- The training codes of Jasper-Token-Compression-600M☆20Nov 19, 2025Updated 7 months ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆68Jan 26, 2026Updated 5 months ago
- Training tiny models to prove hard theorems☆80Mar 5, 2026Updated 3 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official implementation of our paper "Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration".☆14Nov 18, 2024Updated last year
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"☆30Apr 23, 2026Updated 2 months ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆44Mar 31, 2025Updated last year
- ☆109Jun 5, 2026Updated 3 weeks ago
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning☆336Feb 5, 2026Updated 4 months ago
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling☆38Feb 25, 2026Updated 4 months ago
- Large language models designed for formal theorem proving through tool-integrated reasoning.☆34Aug 13, 2025Updated 10 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 10 months ago
- P1: Mastering Physics Olympiads with Reinforcement Learning☆87Dec 29, 2025Updated 6 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆62Mar 17, 2025Updated last year
- ☆56Jul 7, 2025Updated 11 months ago
- Official implementation of Browse-Master, a tool-augmented web-search agent.☆34Aug 22, 2025Updated 10 months ago
- Awesome Audio-Visual Intelligence, Survey of Audio-Visual Intelligence☆80May 8, 2026Updated last month
- Nex General Agentic Data Pipeline, an end-to-end pipeline for generating high-quality agentic training data.☆37Nov 19, 2025Updated 7 months ago
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆27Apr 4, 2026Updated 2 months ago
- Baidu Qianfan Deep Research☆35Jun 8, 2026Updated 3 weeks ago
- 一步步通关GPU编程☆50Jun 4, 2026Updated 3 weeks ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆22Oct 10, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multilingual and Multiculture Benchmark and LLM☆42May 18, 2026Updated last month
- Financial Services Interest Group☆53Jan 14, 2026Updated 5 months ago
- A Structured Output Benchmark whose 'ground-truth' is actually right☆19Dec 5, 2025Updated 6 months ago
- Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping…☆94Jan 29, 2026Updated 5 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆61Mar 2, 2026Updated 3 months ago
- ☆44Jun 9, 2026Updated 3 weeks ago
- ☆18Nov 25, 2023Updated 2 years ago
- Official Implementation of wd1☆31Sep 25, 2025Updated 9 months ago
- Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation☆30Jun 30, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆18Apr 2, 2025Updated last year
- ☆45Apr 28, 2026Updated 2 months ago
- ☆41Jan 10, 2026Updated 5 months ago
- ☆35Oct 23, 2025Updated 8 months ago
- Python library to add support for embedding natural code in Python with shared program state.☆30Jan 20, 2026Updated 5 months ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆51Mar 31, 2026Updated 3 months ago
- ☆47Sep 8, 2025Updated 9 months ago