feifeibear / LLMSpeculativeSampling

Fast inference from large language models via speculative decoding
☆630 · Updated 4 months ago

Alternatives and similar repositories for LLMSpeculativeSampling:

Users who are interested in LLMSpeculativeSampling are comparing it to the libraries listed below.