romsto / Speculative-Decoding
Implementation of the paper "Fast Inference from Transformers via Speculative Decoding" (Leviathan et al., 2023).
100 | Dec 2, 2024 | Updated last year

Alternatives and similar repositories for Speculative-Decoding

Users that are interested in Speculative-Decoding are comparing it to the libraries listed below
