Lyn-Lucy/MSD

☆38

Alternatives and similar repositories for MSD

Users who are interested in MSD are comparing it to the repositories listed below.

KangJialiang / ViSpec
[NeurIPS 2025] Official Implementation of ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding.
☆65 · Jan 28, 2026 · Updated 5 months ago
killthefullmoon / MMSpec
MMSpec: Benchmarking Speculative Decoding for Vision-Language Models
☆41 · Jul 2, 2026 · Updated 3 weeks ago
lzhxmu / AccDiffusion_v2
Code release for AccDiffusionV2 (TPAMI)
☆34 · Nov 4, 2025 · Updated 8 months ago
lzhxmu / VTW
Code release for VTW (AAAI 2025 Oral)
☆68 · Nov 4, 2025 · Updated 8 months ago
zyxxmu / LBC
Pytorch implementation of our paper accepted by NeurIPS 2022 -- Learning Best Combination for Efficient N:M Sparsity
☆22 · Jan 13, 2023 · Updated 3 years ago
NonvolatileMemory / GliDe_with_a_CaPE_ICML_24
official code for GliDe with a CaPE
☆22 · Aug 13, 2024 · Updated last year
hyx1999 / SAM-Decoding
Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton
☆52 · May 12, 2026 · Updated 2 months ago
taishan1994 / llava-handbook
Some study notes on the official LLaVA code
☆29 · Oct 11, 2024 · Updated last year
smart-lty / ParallelSpeculativeDecoding
[ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length
☆170 · Dec 23, 2025 · Updated 7 months ago
Qualcomm-AI-research / pruning-vs-quantization
☆26 · Mar 1, 2024 · Updated 2 years ago
ASC-Competition / ASC24-LLM-inference-optimization
The dataset and baseline code for ASC23 LLM inference optimization challenge.
☆34 · Dec 20, 2023 · Updated 2 years ago
mit-han-lab / fastrl
[ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter
☆174 · Feb 27, 2026 · Updated 4 months ago
HArmonizedSS / HASS
Official Implementation of "Learning Harmonized Representations for Speculative Sampling" (HASS)
☆56 · Mar 14, 2025 · Updated last year
XMUDeepLIT / MemSyco-Bench
MemSyco-Bench: Benchmarking Sycophancy in Agent Memory
☆17 · Jul 7, 2026 · Updated 2 weeks ago
VILA-Lab / Elastic-Cache
[ICLR 2026 🔥] Official pytorch implementation for "Attention Is All You Need for KV Cache in Diffusion LLMs"
☆42 · Jul 13, 2026 · Updated last week
chrirocca / GPUNetBench
Collection of memory microbenchmarks to investigate NVIDIA GPUs Network on Chip architectures
☆15 · Apr 14, 2026 · Updated 3 months ago
zju-jiyicheng / LVSpec
[ACL 2026 Main] See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference of Video …
☆27 · Jul 4, 2026 · Updated 3 weeks ago
potato-kitty / ObjectAdd
The codes of our paper "ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion"
☆14 · Jun 29, 2025 · Updated last year
AMD-AGI / PARD
PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation (ICLR 26)
☆33 · Jun 10, 2026 · Updated last month
lzhxmu / AccDiffusion
Code release for AccDiffusion (ECCV 2024)
☆92 · Nov 19, 2024 · Updated last year
zju-jiyicheng / SpecVLM
[EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning
☆48 · Apr 16, 2026 · Updated 3 months ago
NJUNLP / MCSD
Multi-Candidate Speculative Decoding
☆41 · Apr 22, 2024 · Updated 2 years ago
hemingkx / SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
☆1,281 · Jun 27, 2026 · Updated 3 weeks ago
SCSLabISU / CDSGD
Consensus Based Distributed Stochastic Gradient Descent
☆11 · Jun 24, 2018 · Updated 8 years ago
icip-cas / AutoAlign
A toolkit for automated alignment research.
☆15 · Jul 3, 2026 · Updated 3 weeks ago
Danielement321 / FM2S
[MIR] Pytorch Implementation for FM2S, a denoising algorithm for fluorescence microscopy.
☆15 · Mar 13, 2026 · Updated 4 months ago
FDU-VTS / DRAC
Team FDVTS_DR's solutions for MICCAI2022 Diabetic Retinopathy Analysis Challenge (DRAC)
☆15 · Mar 5, 2024 · Updated 2 years ago
tatHi / maxmatch_dropout
☆10 · Sep 13, 2022 · Updated 3 years ago
jiaruzouu / TransformerCopilot
[NeurIPS 2025 Spotlight] Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning
☆21 · Nov 14, 2025 · Updated 8 months ago
QwenLM / ConsisEval
☆14 · Jul 5, 2024 · Updated 2 years ago
aiadvocates / roshambo
☆12 · Dec 9, 2022 · Updated 3 years ago
Sherry-97 / Deep-Reinforcement-Learning-Based-Effective-Coverage-Control-with-Connectivity-Constraints
☆10 · Mar 2, 2021 · Updated 5 years ago
cokeshao / Awesome-Multimodal-Token-Compression
[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198
☆371 · May 29, 2026 · Updated last month
yuxiaodongHRI / SOIT
SOIT: Segmenting Objects with Instance-Aware Transformers
☆14 · Jun 6, 2022 · Updated 4 years ago
hemingkx / Spec-Bench
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
☆401 · Apr 22, 2025 · Updated last year
Astuary / Spry
Code for "Thinking Forward: Memory-Efficient Federated Finetuning of Language Models" (NeurIPS 2024). Spry is a federated learning al…
☆13 · Oct 8, 2024 · Updated last year
hyc2026 / M3-Agent-Training
☆30 · Mar 30, 2026 · Updated 3 months ago
MINE-USTC / Xiangqi-R1
Code for the paper Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning
☆15 · Jul 23, 2025 · Updated last year
dqxiu / KAssess
☆14 · Oct 28, 2023 · Updated 2 years ago