kyegomez / Speculative-Decoding

My own implementation of "Fast Inference from Transformers via Speculative Decoding"
11 · Updated last year

Alternatives and similar repositories for Speculative-Decoding:

Users who are interested in Speculative-Decoding compare it to the libraries listed below.