dilab-zju / self-speculative-decoding

Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**
138Updated 5 months ago

Related projects

Alternatives and complementary repositories for self-speculative-decoding