AmberLJC / LLMSys-PaperListLinks

Large Language Model (LLM) Systems Paper List

☆1,563

Alternatives and similar repositories for LLMSys-PaperList

Users that are interested in LLMSys-PaperList are comparing it to the libraries listed below

Sorting:

AmadeusChan / Awesome-LLM-System-Papers
☆609Updated 5 months ago
xlite-dev / Awesome-LLM-Inference
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
☆4,635Updated 2 months ago
byungsoo-oh / ml-systems-papers
Curated collection of papers in machine learning systems
☆433Updated 3 weeks ago
horseee / Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
☆1,885Updated 4 months ago
hemingkx / SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
☆988Updated this week
zhaochenyang20 / Awesome-ML-SYS-Tutorial
My learning notes/codes for ML SYS.
☆4,012Updated 3 weeks ago
HuangOwen / Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
☆1,694Updated 3 months ago
LLMServe / DistServe
Disaggregated serving system for Large Language Models (LLMs).
☆709Updated 6 months ago
AIoT-MLSys-Lab / Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
☆1,226Updated 4 months ago
MLSys-Learner-Resources / Awesome-MLSys-Blogger
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
☆297Updated 9 months ago
October2001 / Awesome-KV-Cache-Compression
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
☆584Updated last month
ServerlessLLM / ServerlessLLM
Serverless LLM Serving for Everyone.
☆573Updated last week
flashinfer-ai / flashinfer
FlashInfer: Kernel Library for LLM Serving
☆3,982Updated this week
sgl-project / sgl-learning-materials
Materials for learning SGLang
☆618Updated 3 weeks ago
Zefan-Cai / Awesome-LLM-KV-Cache
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
☆379Updated 7 months ago
volcengine / veScale
A PyTorch Native LLM Training Framework
☆879Updated last month
hahnyuan / LLM-Viewer
Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline mod…
☆568Updated last year
lambda7xx / awesome-AI-system
paper and its code for AI System
☆331Updated 2 months ago
microsoft / vidur
A large-scale simulation framework for LLM inference
☆462Updated 3 months ago
vllm-project / production-stack
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
☆1,884Updated last week
gpu-mode / awesomeMLSys
An ML Systems Onboarding list
☆917Updated 9 months ago
galeselee / Awesome_LLM_System-PaperList
Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap…
☆278Updated 7 months ago
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆2,232Updated last week
ByteDance-Seed / Triton-distributed
Distributed Compiler based on Triton for Parallel Systems
☆1,206Updated 2 weeks ago
SiriusNEO / Triton-Puzzles-Lite
Puzzles for learning Triton, play it with minimal environment configuration!
☆553Updated last month
Shenggan / awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
☆247Updated last year
MoE-Inf / awesome-moe-inference
Curated collection of papers in MoE model inference
☆290Updated last week
AlibabaPAI / llumnix
Efficient and easy multi-instance LLM serving
☆502Updated last month
feifeibear / LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
☆841Updated last year
PaddleJitLab / CUDATutorial
A self-learning tutorail for CUDA High Performance Programing.
☆758Updated 4 months ago