Official Repository of Native Parallel Reasoner
☆103 Feb 5, 2026 Updated last month
Alternatives and similar repositories for Native-Parallel-Reasoner
Users who are interested in Native-Parallel-Reasoner compare it to the repositories listed below.
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts ☆21 Dec 22, 2025 Updated 3 months ago
- The training codes of Jasper-Token-Compression-600M ☆19 Nov 19, 2025 Updated 4 months ago
- Training tiny models to prove hard theorems ☆59 Mar 5, 2026 Updated 2 weeks ago
- Measuring how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours ☆229 Mar 10, 2026 Updated last week
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones ☆64 Jan 26, 2026 Updated last month
- [ICLR 2024 Spotlight] Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Communi… ☆11 Mar 29, 2024 Updated last year
- Infrastructure as Code for MCP access management ☆32 Updated this week
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers" ☆41 Mar 31, 2025 Updated 11 months ago
- ☆24 Jan 19, 2026 Updated 2 months ago
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex… ☆23 Mar 2, 2026 Updated 3 weeks ago
- ☆37 Dec 16, 2025 Updated 3 months ago
- Memento-Skills: Let Agents Design Agents ☆110 Updated this week
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling ☆36 Feb 25, 2026 Updated 3 weeks ago
- [IEEE TNSRE] Mixture of Experts for EEG-Based Seizure Subtype Classification ☆12 Aug 20, 2024 Updated last year
- P1: Mastering Physics Olympiads with Reinforcement Learning ☆79 Dec 29, 2025 Updated 2 months ago
- UQ: Assessing Language Models on Unsolved Questions ☆30 Aug 26, 2025 Updated 6 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best… ☆60 Mar 17, 2025 Updated last year
- ☆56 Jul 7, 2025 Updated 8 months ago
- Internal utility libraries for Pkl ☆16 Mar 10, 2026 Updated last week
- Nex General Agentic Data Pipeline, an end-to-end pipeline for generating high-quality agentic training data. ☆31 Nov 19, 2025 Updated 4 months ago
- ☆168 Dec 18, 2025 Updated 3 months ago
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It. ☆19 Mar 31, 2025 Updated 11 months ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding ☆22 Oct 10, 2024 Updated last year
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation ☆30 Feb 4, 2025 Updated last year
- ☆32 Mar 13, 2026 Updated last week
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards ☆37 Oct 3, 2025 Updated 5 months ago
- Financial Services Interest Group ☆48 Jan 14, 2026 Updated 2 months ago
- UFT: Unifying Supervised and Reinforcement Fine-Tuning ☆27 Jun 30, 2025 Updated 8 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning ☆50 Mar 2, 2026 Updated 3 weeks ago
- ☆21 Dec 3, 2025 Updated 3 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution ☆75 Dec 8, 2025 Updated 3 months ago
- Our solutions to Putnam 2025. ☆80 Jan 9, 2026 Updated 2 months ago
- Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language mo… ☆19 Mar 19, 2025 Updated last year
- ☆36 Jan 10, 2026 Updated 2 months ago
- StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations. ☆58 Jan 12, 2026 Updated 2 months ago
- Python library to add support for embedding natural code in Python with shared program state. ☆24 Jan 20, 2026 Updated 2 months ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning