feifeibear / LLMSpeculativeSampling

Fast inference from large language models via speculative decoding
☆657 · Updated 5 months ago

Alternatives and similar repositories for LLMSpeculativeSampling:

Users who are interested in LLMSpeculativeSampling are comparing it to the libraries listed below.