BaohaoLiao / RSD
[ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.
☆55 · May 2, 2025 · Updated 9 months ago
Alternatives and similar repositories for RSD
Users who are interested in RSD are comparing it to the repositories listed below.
- Make reasoning models scalable ☆47 · May 31, 2025 · Updated 8 months ago
- ☆32 · Oct 13, 2025 · Updated 4 months ago
- ThinK: Thinner Key Cache by Query-Driven Pruning ☆27 · Feb 11, 2025 · Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents ☆38 · Feb 4, 2026 · Updated last week
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25] ☆63 · Oct 2, 2025 · Updated 4 months ago
- Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination. ☆21 · Jul 18, 2025 · Updated 6 months ago
- SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs (ICML 2025) ☆32 · Nov 28, 2025 · Updated 2 months ago
- Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs" ☆20 · Jun 11, 2025 · Updated 8 months ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization" ☆21 · Nov 9, 2025 · Updated 3 months ago
- ☆21 · Dec 6, 2025 · Updated 2 months ago
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models ☆25 · Mar 26, 2025 · Updated 10 months ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs. ☆41 · Oct 31, 2025 · Updated 3 months ago
- LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification ☆73 · Jul 14, 2025 · Updated 7 months ago
- [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding ☆276 · Aug 31, 2024 · Updated last year
- Experimental sampler to make LLMs more creative ☆31 · Aug 2, 2023 · Updated 2 years ago
- [CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models ☆100 · Nov 22, 2025 · Updated 2 months ago
- ☆41 · Mar 26, 2025 · Updated 10 months ago
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding ☆29 · Jan 27, 2026 · Updated 2 weeks ago
- Introduction about AWESOME_ENTROPY+LRM_PAPERS ☆30 · Dec 16, 2025 · Updated 2 months ago
- ☆50 · Aug 21, 2025 · Updated 5 months ago
- The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance" ☆44 · Apr 21, 2024 · Updated last year
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression ☆132 · Apr 12, 2025 · Updated 10 months ago
- Fully automatic skin lesion segmentation using the Berkeley wavelet transform and UNet algorithm. ☆12 · Jun 1, 2021 · Updated 4 years ago
- 🎵 When AI tools vibe together on your PRs. Let CodeRabbit and Claude Code handle the repetitive feedback while you ship features. Built … ☆12 · Nov 24, 2025 · Updated 2 months ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference … ☆14 · Dec 12, 2024 · Updated last year
- Analytics tool that applies Natural Language Processing (NLP) and Machine Learning (ML), such as concept extraction, idea classification,… ☆10 · Dec 7, 2022 · Updated 3 years ago
- Official code repository for "CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion". ☆23 · Jan 27, 2026 · Updated 2 weeks ago
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks ☆11 · Jun 19, 2025 · Updated 7 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi… ☆23 · Oct 1, 2025 · Updated 4 months ago
- ☆13 · Feb 4, 2025 · Updated last year
- FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection ☆24 · Feb 10, 2026 · Updated last week
- Knowledge sharing of AWS (Amazon Web Services) Cloud ☆12 · Jun 7, 2021 · Updated 4 years ago
- Integrating neurosymbolic representations into LLMs for interpretability, steering, and running symbolic algorithms ☆14 · Feb 2, 2026 · Updated 2 weeks ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity ☆22 · Aug 28, 2025 · Updated 5 months ago
- General Use Timeseries Containers for Rust ☆11 · Dec 31, 2020 · Updated 5 years ago
- ☆16 · Feb 22, 2025 · Updated 11 months ago
- An experimental distributed map reduce system based on Google's MapReduce, written in Rust! ☆10 · Aug 3, 2022 · Updated 3 years ago
- [NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis ☆150 · Nov 6, 2025 · Updated 3 months ago
- An easy-to-use package for implementing SmoothQuant for LLMs ☆110 · Apr 7, 2025 · Updated 10 months ago