[ICML 2026] Reasoning in Parallelism via Self-Distilled RL
☆112 · Feb 5, 2026 · Updated 4 months ago
Alternatives and similar repositories for Native-Parallel-Reasoner
Users that are interested in Native-Parallel-Reasoner are comparing it to the libraries listed below.
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts ☆23 · Apr 10, 2026 · Updated 2 months ago
- Programmable chat templates for LLM training and inference. ☆109 · Updated this week
- The training codes of Jasper-Token-Compression-600M ☆19 · Nov 19, 2025 · Updated 6 months ago
- Measuring how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours ☆350 · Jun 3, 2026 · Updated last week
- Training tiny models to prove hard theorems ☆77 · Mar 5, 2026 · Updated 3 months ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers" ☆44 · Mar 31, 2025 · Updated last year
- ☆102 · Updated this week
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning ☆335 · Feb 5, 2026 · Updated 4 months ago
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling ☆38 · Feb 25, 2026 · Updated 3 months ago
- [IEEE TNSRE] Mixture of Experts for EEG-Based Seizure Subtype Classification ☆12 · Aug 20, 2024 · Updated last year
- Large language models designed for formal theorem proving through tool-integrated reasoning. ☆34 · Aug 13, 2025 · Updated 9 months ago
- Evaluation kit for testing stateful agents ☆72 · Jun 3, 2026 · Updated last week
- UQ: Assessing Language Models on Unsolved Questions ☆30 · Aug 26, 2025 · Updated 9 months ago
- P1: Mastering Physics Olympiads with Reinforcement Learning ☆85 · Dec 29, 2025 · Updated 5 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best… ☆62 · Mar 17, 2025 · Updated last year
- Awesome Audio-Visual Intelligence, Survey of Audio-Visual Intelligence ☆77 · May 8, 2026 · Updated last month
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex… ☆27 · Apr 4, 2026 · Updated 2 months ago
- Baidu Qianfan Deep Research ☆29 · May 30, 2026 · Updated last week
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It. ☆21 · Mar 31, 2025 · Updated last year
- Master GPU programming step by step ☆45 · Jun 4, 2026 · Updated last week
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding ☆22 · Oct 10, 2024 · Updated last year
- A Structured Output Benchmark whose 'ground-truth' is actually right ☆19 · Dec 5, 2025 · Updated 6 months ago
- Multilingual and Multiculture Benchmark and LLM ☆40 · May 18, 2026 · Updated 3 weeks ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards ☆36 · Oct 3, 2025 · Updated 8 months ago
- Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping… ☆93 · Jan 29, 2026 · Updated 4 months ago
- ☆22 · Dec 3, 2025 · Updated 6 months ago
- UFT: Unifying Supervised and Reinforcement Fine-Tuning ☆30 · Jun 30, 2025 · Updated 11 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution ☆80 · Dec 8, 2025 · Updated 6 months ago
- Official Implementation of wd1 ☆30 · Sep 25, 2025 · Updated 8 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts ☆18 · Apr 2, 2025 · Updated last year
- ☆44 · Apr 28, 2026 · Updated last month
- Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language mo… ☆19 · Mar 19, 2025 · Updated last year
- ☆40 · Jan 10, 2026 · Updated 5 months ago
- ☆35 · Oct 23, 2025 · Updated 7 months ago
- Python library to add support for embedding natural code in Python with shared program state. ☆30 · Jan 20, 2026 · Updated 4 months ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning ☆40 · Jul 1, 2023 · Updated 2 years ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL). ☆51 · Mar 31, 2026 · Updated 2 months ago
- ☆47 · Sep 8, 2025 · Updated 9 months ago
- StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations. ☆64 · May 21, 2026 · Updated 3 weeks ago