uw-mad-dash/decoding-speculative-decoding

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/uw-mad-dash/decoding-speculative-decoding)

uw-mad-dash / decoding-speculative-decoding

☆16

Alternatives and similar repositories for decoding-speculative-decoding

Users that are interested in decoding-speculative-decoding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Jikai0Wang / OPT-Tree
View on GitHub
☆30May 24, 2025Updated last year
KaiLv69 / DuoDecoding
View on GitHub
DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting
☆19Mar 4, 2025Updated last year
iLearn-Lab / ACL25-PTQ1.61
View on GitHub
☆15Apr 6, 2026Updated 3 months ago
oujieww / ANPD
View on GitHub
☆11Feb 5, 2026Updated 5 months ago
goliaro / specinfer-ae
View on GitHub
☆28Mar 14, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
eBay / spec_dec
View on GitHub
☆18Updated this week
NJUNLP / MCSD
View on GitHub
Multi-Candidate Speculative Decoding
☆41Apr 22, 2024Updated 2 years ago
kyegomez / Mirasol
View on GitHub
Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"
☆26Jan 27, 2025Updated last year
ubicomplab / classification-to-clinical
View on GitHub
From-Classification-to-Clinical
☆13Apr 26, 2024Updated 2 years ago
lol0963332320 / ICLAB
View on GitHub
☆19Mar 23, 2023Updated 3 years ago
0x7o / RETRO-transformer
View on GitHub
Easy-to-use Retrieval-Enhanced Transformer implementation
☆10Sep 30, 2022Updated 3 years ago
ixaxaar / pytorch-dni
View on GitHub
Decoupled Neural Interfaces Using Synthetic Gradients - under develeopment
☆11Jun 27, 2025Updated last year
Supercomputing-System-AI-Lab / MiLo
View on GitHub
Code repo for efficient quantized MoE inference with mixture of low-rank compensators
☆39Apr 14, 2025Updated last year
sixws / Demo
View on GitHub
一个小游戏
☆14Aug 17, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
gabry1998 / Self-Supervised-Anomaly-Detection
View on GitHub
Thesis project about Visual Anomaly Detection based on Self Supervised Learning. The model identifies anomalies from information acquired…
☆10Apr 14, 2023Updated 3 years ago
BaiTheBest / SRDML
View on GitHub
GitHub Repository for KDD 2022 paper "Saliency-Regularized Deep Multi-Task Learning"
☆12Sep 26, 2023Updated 2 years ago
Raincleared-Song / sparse_gpu_operator
View on GitHub
GPU operators for sparse tensor operations
☆37Mar 11, 2024Updated 2 years ago
Egg-Hu / SMI
View on GitHub
[ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination
☆14Apr 29, 2025Updated last year
EternityYW / LLM_healthcare
View on GitHub
☆13Aug 3, 2024Updated last year
bayesianbrad / PyLFPPL
View on GitHub
A Low-level first-order probabilistic programming language, with in built translation constraints for automatic model checking. A flexibl…
☆14Apr 16, 2019Updated 7 years ago
mito-ds / snowflake-streamlit-bank-performance-demo
View on GitHub
A demo of the Mito Streamlit Spreadsheet
☆18Aug 3, 2023Updated 2 years ago
feifeibear / LLMSpeculativeSampling
View on GitHub
Fast inference from large lauguage models via speculative decoding
☆920Aug 22, 2024Updated last year
kcyu2014 / multi-model-forgetting
View on GitHub
ICML2019 Accepted Paper. Overcoming Multi-Model Forgetting
☆14Jun 5, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ACADLab / SA-DS
View on GitHub
☆15Jul 25, 2024Updated last year
DensoITLab / bitprune
View on GitHub
☆11Apr 5, 2023Updated 3 years ago
xszheng2020 / memorization
View on GitHub
An Empirical Study of Memorization in NLP (ACL 2022)
☆13Jun 22, 2022Updated 4 years ago
tech-srl / counting_dimensions
View on GitHub
demonstration for our ACL 2018 paper, "On the Practical Computational Power of Finite Precision RNNs for Language Recognition"
☆11May 26, 2019Updated 7 years ago
romsto / Speculative-Decoding
View on GitHub
Implementation of the paper Fast Inference from Transformers via Speculative Decoding, Leviathan et al. 2023.
☆112Dec 2, 2024Updated last year
sjijon / TeX-templates
View on GitHub
Various TeX templates, including slides and posters.
☆16May 19, 2022Updated 4 years ago
jzhangbs / DSM
View on GitHub
Learning Stereo Matchability in Disparity Regression Networks
☆17Aug 12, 2020Updated 5 years ago
hackersground-kr / hackers-ground
View on GitHub
해커그라운드 해커톤 2024
☆12Aug 26, 2024Updated last year
epfml / pam
View on GitHub
☆16Dec 9, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
lucidrains / speculative-decoding
View on GitHub
Explorations into some recent techniques surrounding speculative decoding
☆307Dec 22, 2024Updated last year
yh-yao / FedRule
View on GitHub
Official Code for FedRule: Federated Rule Recommendation System with Graph Neural Networks
☆14Sep 12, 2023Updated 2 years ago
eric-mitchell / concord
View on GitHub
☆14Nov 15, 2022Updated 3 years ago
aws-samples / awsame-distributed-ai-samples
View on GitHub
Cluster doctor skills
☆14May 23, 2026Updated last month
arijitx / Amazon-Satelite-Image-Labeling
View on GitHub
This is my CS 763 Computer Vision Course Project , Here we try to label Amazon Satelite Images. Here we try to implement the Show and Tel…
☆12May 10, 2018Updated 8 years ago
FasterDecoding / REST
View on GitHub
REST: Retrieval-Based Speculative Decoding, NAACL 2024
☆220Mar 5, 2026Updated 4 months ago
hellozhuo / msgc
View on GitHub
Source code of our TNNLS paper "Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution"
☆12Apr 14, 2023Updated 3 years ago