zhihu / ZhiLight

A highly optimized LLM inference acceleration engine for Llama and its variants.
835 · Updated this week

Alternatives and similar repositories for ZhiLight:

Users who are interested in ZhiLight are comparing it to the libraries listed below.