CalvinXKY/InfraTech

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CalvinXKY/InfraTech)

CalvinXKY / InfraTech

分享AI Infra知识&代码练习：PyTorch、vLLM/SGLang、slime/vime框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等

☆2,899

Alternatives and similar repositories for InfraTech

Users that are interested in InfraTech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xlite-dev / LeetCUDA
View on GitHub
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
☆11,533Updated this week
CalvinXKY / BasicCUDA
View on GitHub
A tutorial for CUDA&PyTorch
☆469Mar 23, 2026Updated 3 months ago
DD-DuDa / Cute-Learning
View on GitHub
Examples of CUDA implementations by Cutlass CuTe
☆279Jul 1, 2025Updated last year
slwang-ustc / nano-vllm-v1
View on GitHub
Nano vLLM with vLLM v1's request scheduling strategy and chunked prefill
☆92Jan 26, 2026Updated 5 months ago
caomaolufei / AIInfraGuide
View on GitHub
AI Infra 全栈从0入门学习资料：https://caomaolufei.github.io/AIInfraGuide/
☆1,241Updated this week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
zhaochenyang20 / Awesome-ML-SYS-Tutorial
View on GitHub
My learning notes for ML SYS.
☆6,710Updated this week
BBuf / AI-Infra-Auto-Driven-SKILLS
View on GitHub
☆670Updated this week
alexshuang / write-your-own-ai-compiler
View on GitHub
《自己动手写AI编译器》
☆40Oct 19, 2024Updated last year
GeeeekExplorer / nano-vllm
View on GitHub
Nano vLLM
☆14,485Apr 26, 2026Updated 2 months ago
BBuf / how-to-optim-algorithm-in-cuda
View on GitHub
how to optimize some algorithm in cuda.
☆3,137Jul 8, 2026Updated last week
vllm-project / vime
View on GitHub
An LLM post-training framework with vLLM for RL Scaling
☆363Updated this week
gpu-mode / lectures
View on GitHub
Material for gpu-mode lectures
☆6,313Jun 15, 2026Updated last month
Infrasys-AI / AISystem
View on GitHub
AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
☆17,201Sep 3, 2025Updated 10 months ago
xlite-dev / Awesome-LLM-Inference
View on GitHub
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
☆5,392Jun 23, 2026Updated 3 weeks ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Infrasys-AI / AIInfra
View on GitHub
AIInfra（AI 基础设施）指AI系统从底层芯片等硬件，到上层软件栈支持AI大模型训练和推理。
☆7,618Dec 22, 2025Updated 6 months ago
WingEdge777 / vitamin-cuda
View on GitHub
🍎 One kernel a day keeps high latency away. A hands-on CUDA learning path featuring a rich collection of kernels, from the basics to pea…
☆174Updated this week
dsl-learn / cutile-learn
View on GitHub
NVIDIA cuTile learn
☆168Dec 9, 2025Updated 7 months ago
RL-Align / RL-Kernel
View on GitHub
High-performance RL post-training infrastructure. Designed to achieve bitwise operator-level train-inference consistency across heterogen…
☆187Updated this week
Tencent / hpc-ops
View on GitHub
High Performance LLM Inference Operator Library
☆1,028Jul 2, 2026Updated last week
tile-ai / tilelang
View on GitHub
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
☆6,636Updated this week
Tongkaio / CUDA_Kernel_Samples
View on GitHub
CUDA 算子手撕与面试指南
☆1,036Aug 23, 2025Updated 10 months ago
flashinfer-ai / flashinfer
View on GitHub
FlashInfer: Kernel Library for LLM Serving
☆5,957Updated this week
LDLINGLINGLING / nano_vllm_note
View on GitHub
注释的nano_vllm仓库，并且完成了MiniCPM4的适配以及注册新模型的功能
☆198Aug 11, 2025Updated 11 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
openmlsys / openmlsys
View on GitHub
《Machine Learning Systems: Design and Implementation》 (V2 is launching soon）
☆4,820Mar 15, 2026Updated 3 months ago
kvcache-ai / Mooncake
View on GitHub
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
☆5,821Updated this week
tile-ai / tilelang-puzzles
View on GitHub
Learning TileLang with 10 puzzles!
☆337May 28, 2026Updated last month
Kubernetes-Learning-Playground / k8s-informer-practice
View on GitHub
基于golang对k8s-client-go中的informer机制的学习
☆13Mar 10, 2024Updated 2 years ago
RussWong / vLLM_SGLang_cuteDSL_tutorial
View on GitHub
☆46Apr 16, 2026Updated 2 months ago
HobbyBear / cdndemo
View on GitHub
这一生听过许多道理，但还是过不好这一生，这是因为缺少真正的动手实践，光听道理，缺少动手实践的过程，学习难免会让人觉得味同嚼蜡，所以我的分享都比较倾向于实践，在一次次动手实践的过程中感受知识原本纯真的模样。
☆14Updated this week
sgl-project / mini-sglang
View on GitHub
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
☆4,568May 17, 2026Updated last month
LLMServe / dLoRA-artifact
View on GitHub
☆32May 28, 2024Updated 2 years ago
verl-project / vexact
View on GitHub
verl Zero-Mismatch Dense/MoE HuggingFace Rollout
☆60Jul 2, 2026Updated last week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Allen-C-Guan / Pytorch-Inductor-Tutorial
View on GitHub
☆94Jun 26, 2026Updated 2 weeks ago
RiseAI-Sys / DAX
View on GitHub
High performance inference engine for diffusion models
☆107Sep 5, 2025Updated 10 months ago
jinbooooom / OriginDL
View on GitHub
Implement a Pytorch-like DL library in C++ from scratch, step by step
☆332Apr 15, 2026Updated 3 months ago
ysyisyourbrother / Galaxy-LM
View on GitHub
Work in progress LLM framework.
☆16Oct 31, 2024Updated last year
JJXiangJiaoJun / cutlass_gemv
View on GitHub
GEMV implementation with CUTLASS
☆21Aug 21, 2025Updated 10 months ago
leepoly / sm-profiler
View on GitHub
☆82Feb 5, 2026Updated 5 months ago
stas00 / ml-engineering
View on GitHub
Machine Learning Engineering Open Book
☆18,403Updated this week