Kamichanw / CoS
[ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"
☆28 Updated 3 months ago
Alternatives and similar repositories for CoS
Users who are interested in CoS are comparing it to the repositories listed below.
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆86 Updated 7 months ago
- ☆26 Updated 3 weeks ago
- ☆18 Updated 3 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model ☆56 Updated this week
- Model merging is a highly efficient approach for long-to-short reasoning. ☆86 Updated 4 months ago
- ☆46 Updated 6 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆69 Updated 2 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations ☆133 Updated 5 months ago
- ☆43 Updated 2 weeks ago
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping ☆54 Updated 4 months ago
- 🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training ☆89 Updated 10 months ago
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs ☆182 Updated 3 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆82 Updated 4 months ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models ☆62 Updated 10 months ago
- Extrapolating RLVR to General Domains without Verifiers ☆171 Updated 2 months ago
- ☆67 Updated 3 months ago
- [ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs. ☆68 Updated 11 months ago
- The official repository of the Omni-MATH benchmark. ☆88 Updated 9 months ago
- ☆36 Updated 3 weeks ago
- ☆101 Updated 3 weeks ago
- A Unified Framework for High-Performance and Extensible LLM Steering ☆69 Updated last week
- ☆28 Updated 5 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training ☆37 Updated last month
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025) ☆152 Updated 3 weeks ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization ☆73 Updated 2 weeks ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style ☆63 Updated 2 months ago
- [ACL'25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models. ☆81 Updated 7 months ago
- ☆18 Updated 9 months ago
- ☆39 Updated 2 months ago
- The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation" ☆20 Updated 5 months ago