DefTruth / Awesome-LLM-Inference

📖 A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
☆3,221 Updated this week

Alternatives and similar repositories for Awesome-LLM-Inference:

Users who are interested in Awesome-LLM-Inference are comparing it to the libraries listed below.