xlite-dev / Awesome-LLM-Inference

πŸ“–A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism, etc. πŸŽ‰πŸŽ‰
β˜†3,700Updated 3 weeks ago

Alternatives and similar repositories for Awesome-LLM-Inference:

Users that are interested in Awesome-LLM-Inference are comparing it to the libraries listed below