xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism etc.
3,876Updated last week

Alternatives and similar repositories for Awesome-LLM-Inference:

Users that are interested in Awesome-LLM-Inference are comparing it to the libraries listed below