harleyszhang / lite_llama

A light llama-like llm inference framework based on the triton kernel.
78Updated 3 weeks ago

Alternatives and similar repositories for lite_llama:

Users that are interested in lite_llama are comparing it to the libraries listed below