harleyszhang / lite_llama
A lightweight Llama-like LLM inference framework built on Triton kernels.
172 | Jan 5, 2026 | Updated 2 months ago

Alternatives and similar repositories for lite_llama

Users who are interested in lite_llama are comparing it to the libraries listed below.
