shreyansh26 / Speculative-SamplingView external linksLinks
Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by Deepmind
☆108Feb 29, 2024Updated last year
Alternatives and similar repositories for Speculative-Sampling
Users that are interested in Speculative-Sampling are comparing it to the libraries listed below
Sorting:
- Fast inference from large lauguage models via speculative decoding☆886Aug 22, 2024Updated last year
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- Keyformer proposes KV Cache reduction through key tokens identification and without the need for fine-tuning☆58Mar 26, 2024Updated last year
- ☆24Mar 15, 2022Updated 3 years ago
- 📰 Must-read papers and blogs on Speculative Decoding ⚡️☆1,121Jan 24, 2026Updated 3 weeks ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- ☆12Feb 3, 2026Updated last week
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,705Jun 25, 2024Updated last year
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 2 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 2 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago
- REST: Retrieval-Based Speculative Decoding, NAACL 2024☆214Sep 11, 2025Updated 5 months ago
- Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)☆70Updated this week
- Efficient Neural Architecture Search via Straight-Through Gradients☆13Nov 12, 2020Updated 5 years ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15May 25, 2022Updated 3 years ago
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Nov 16, 2025Updated 2 months ago
- ☆64Dec 3, 2024Updated last year
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024☆35Oct 31, 2025Updated 3 months ago
- ☆12Apr 1, 2024Updated last year
- ☆13Sep 1, 2023Updated 2 years ago
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago
- PyTorch implementation of Language model compression with weighted low-rank factorization☆13Jun 28, 2023Updated 2 years ago
- Source code of paper "Adapting pretrained speech model for Mandarin lyrics transcription and alignment"☆18Dec 14, 2023Updated 2 years ago
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)☆367Apr 22, 2025Updated 9 months ago
- Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**☆214Feb 13, 2025Updated last year
- ☆15Aug 22, 2025Updated 5 months ago
- ☆15Jun 4, 2024Updated last year
- MFA acoustic model training based on Opencpop☆15Sep 23, 2022Updated 3 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 6 months ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- Code for the ACL2022 paper "Synthetic Question Value Estimation for Domain Adaptation of Question Answering"☆17Mar 21, 2022Updated 3 years ago
- ☆25Jun 19, 2025Updated 7 months ago
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆22Jun 18, 2025Updated 7 months ago
- source code of EfficientTTS 2☆20Feb 18, 2024Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year