guanyilin428 / Dynamic-Speculative-PlanningLinks
☆30Updated 3 weeks ago
Alternatives and similar repositories for Dynamic-Speculative-Planning
Users that are interested in Dynamic-Speculative-Planning are comparing it to the libraries listed below
Sorting:
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆130Updated last month
- ☆61Updated 3 months ago
- ☆54Updated 4 months ago
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆51Updated this week
- Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆22Updated 4 months ago
- Kinetics: Rethinking Test-Time Scaling Laws☆80Updated 3 months ago
- Lottery Ticket Adaptation☆40Updated 10 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆122Updated 3 weeks ago
- ☆105Updated last year
- This is the official implementation for paper "PENCIL: Long Thoughts with Short Memory".☆65Updated 5 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆35Updated last month
- ☆49Updated 8 months ago
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning☆45Updated last week
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆55Updated 3 months ago
- ☆96Updated 3 weeks ago
- Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More☆34Updated 4 months ago
- ☆40Updated 3 months ago
- ☆24Updated 6 months ago
- Reinforcing General Reasoning without Verifiers☆88Updated 3 months ago
- Bayes-Adaptive RL for LLM Reasoning☆40Updated 4 months ago
- A repository for research on medium sized language models.☆78Updated last year
- Code for "Reasoning to Learn from Latent Thoughts"☆120Updated 6 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆28Updated 6 months ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆119Updated 3 months ago
- ☆48Updated 5 months ago
- 🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation…☆79Updated last week
- [ICLR‘24 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"☆95Updated 3 months ago
- ☆32Updated 8 months ago
- Code for ICML 2024 paper☆31Updated 3 weeks ago
- Official repo of paper LM2☆45Updated 7 months ago