hemingkx / SpeculativeDecodingPapers
π° Must-read papers and blogs on Speculative Decoding β‘οΈ
β696Updated this week
Alternatives and similar repositories for SpeculativeDecodingPapers:
Users that are interested in SpeculativeDecodingPapers are comparing it to the libraries listed below
- Fast inference from large lauguage models via speculative decodingβ714Updated 8 months ago
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)β258Updated this week
- π° Must-read papers on KV Cache Compression (constantly updating π€).β376Updated last week
- [NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.