feifeibear / LLMSpeculativeSampling

Fast inference from large language models via speculative decoding
☆562 Updated 2 months ago

Related projects

Alternatives and complementary repositories for LLMSpeculativeSampling