dilab-zju / self-speculative-decoding

Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**
151Updated 7 months ago

Alternatives and similar repositories for self-speculative-decoding:

Users that are interested in self-speculative-decoding are comparing it to the libraries listed below