zhaochenyang20 / Awesome-ML-SYS-TutorialLinks

My learning notes/codes for ML SYS.

☆3,920

Alternatives and similar repositories for Awesome-ML-SYS-Tutorial

Users that are interested in Awesome-ML-SYS-Tutorial are comparing it to the libraries listed below

Sorting:

xlite-dev / Awesome-LLM-Inference
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
☆4,615Updated 2 months ago
kvcache-ai / Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
☆4,124Updated this week
inclusionAI / AReaL
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
☆2,853Updated this week
AmberLJC / LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
☆1,547Updated last week
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆2,170Updated last week
alibaba / ROLL
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
☆2,083Updated last week
Unakar / Logic-RL
Reproduce R1 Zero on Logic Puzzle
☆2,407Updated 7 months ago
harleyszhang / llm_note
LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.
☆829Updated last month
stanford-cs336 / spring2025-lectures
☆1,601Updated 2 weeks ago
flashinfer-ai / flashinfer
FlashInfer: Kernel Library for LLM Serving
☆3,952Updated this week
PaddleJitLab / CUDATutorial
A self-learning tutorail for CUDA High Performance Programing.
☆751Updated 3 months ago
BBuf / how-to-optim-algorithm-in-cuda
how to optimize some algorithm in cuda.
☆2,552Updated 2 weeks ago
alibaba / Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
☆1,395Updated this week
hemingkx / SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
☆979Updated last month
OpenRLHF / OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Asy…
☆8,180Updated 2 weeks ago
xlite-dev / LeetCUDA
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
☆8,063Updated last week
horseee / Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
☆1,874Updated 4 months ago
TsinghuaC3I / Awesome-RL-for-LRMs
A Survey of Reinforcement Learning for Large Reasoning Models
☆1,823Updated last week
AIoT-MLSys-Lab / Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
☆1,221Updated 4 months ago
Open-Reasoner-Zero / Open-Reasoner-Zero
Official Repo for Open-Reasoner-Zero
☆2,054Updated 4 months ago
feifeibear / LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
☆835Updated last year
hkust-nlp / simpleRL-reason
Simple RL training for reasoning
☆3,773Updated 2 months ago
lsdefine / simple_GRPO
A very simple GRPO implement for reproducing r1-like LLM thinking.
☆1,401Updated 2 months ago
volcengine / veScale
A PyTorch Native LLM Training Framework
☆875Updated last month
tile-ai / tilelang
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
☆3,658Updated this week
ModelTC / LightLLM
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalabili…
☆3,662Updated this week
hiyouga / EasyR1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
☆3,836Updated this week
vllm-project / vllm-ascend
Community maintained hardware plugin for vLLM on Ascend
☆1,230Updated this week
mbzuai-oryx / Awesome-LLM-Post-training
Awesome Reasoning LLM Tutorial/Survey/Guide
☆2,109Updated last week
openreasoner / openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
☆1,823Updated 9 months ago