harleyszhang / lite_llama

A lightweight Llama-like LLM inference framework built on Triton kernels.
122 · Updated this week

Alternatives and similar repositories for lite_llama

Users who are interested in lite_llama are comparing it to the libraries listed below.
