DefTruth / Awesome-LLM-Inference

πŸ“–A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. πŸŽ‰πŸŽ‰
β˜†3,456Updated this week

Alternatives and similar repositories for Awesome-LLM-Inference:

Users that are interested in Awesome-LLM-Inference are comparing it to the libraries listed below