HuaizhengZhang / AI-Infra-from-Zero-to-Hero
Awesome System for Machine Learning. AI System Papers and Industry Practice. System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. Llama3, Mistral, etc. Video Tutorials.
⭐ 3,318 · Updated 2 months ago
Alternatives and similar repositories for AI-Infra-from-Zero-to-Hero
Users that are interested in AI-Infra-from-Zero-to-Hero are comparing it to the libraries listed below
- My learning notes/codes for ML SYS. ⭐ 3,808 · Updated last week
- Large Language Model (LLM) Systems Paper List ⭐ 1,532 · Updated last week
- How to optimize some algorithms in CUDA. ⭐ 2,548 · Updated this week
- Material for gpu-mode lectures ⭐ 5,143 · Updated 2 weeks ago
- A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. ⭐ 4,598 · Updated last month
- A collection of compiler learning resources. ⭐ 2,546 · Updated 6 months ago
- Several simple examples for popular neural network toolkits calling custom CUDA operators. ⭐ 1,513 · Updated 4 years ago
- A curated list of awesome Distributed Deep Learning resources. ⭐ 430 · Updated last year
- This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several… ⭐ 1,160 · Updated 2 years ago
- The road to hack SysML and become a system expert ⭐ 498 · Updated last year
- ⭐ 616 · Updated last year
- LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA. ⭐ 7,874 · Updated 3 weeks ago
- ⭐ 609 · Updated 5 months ago
- A self-learning tutorial for CUDA High Performance Programming. ⭐ 750 · Updated 3 months ago
- DLRover: An Automatic Distributed Deep Learning System ⭐ 1,561 · Updated 2 weeks ago
- Dive into Deep Learning Compiler ⭐ 648 · Updated 3 years ago
- ⭐ 2,567 · Updated last year
- Sample codes for my CUDA programming book ⭐ 1,889 · Updated 7 months ago
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems) ⭐ 282 · Updated 9 months ago
- [TMLR 2024] Efficient Large Language Models: A Survey ⭐ 1,219 · Updated 3 months ago
- Curated collection of papers in machine learning systems ⭐ 420 · Updated last week
- Learning material for CMU10-714: Deep Learning System ⭐ 279 · Updated last year
- A list of awesome compiler projects and papers for tensor computation and deep learning. ⭐ 2,657 · Updated 11 months ago
- Learn CUDA Programming, published by Packt ⭐ 1,196 · Updated last year
- Tutorial code on how to build your own Deep Learning System in 2k Lines ⭐ 2,016 · Updated 7 years ago
- An ML Systems Onboarding list ⭐ 910 · Updated 8 months ago
- Advanced Topics on Systems for X ⭐ 279 · Updated last year
- how to learn PyTorch and OneFlow ⭐ 456 · Updated last year
- ⭐ 1,931 · Updated 2 years ago
- CS294; AI For Systems and Systems For AI ⭐ 225 · Updated 6 years ago