kyegomez / Speculative-Decoding

My own implementation of "Fast Inference from Transformers via Speculative Decoding"
11Updated 11 months ago

Related projects

Alternatives and complementary repositories for Speculative-Decoding