This repository organizes materials, recordings, and schedules related to AI-infra learning meetings.
☆454Mar 1, 2026Updated 2 months ago
Alternatives and similar repositories for ai-infra-learning
Users that are interested in ai-infra-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ViTALiTy (HPCA'23) Code Repository☆23Mar 13, 2023Updated 3 years ago
- 📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉☆11,050May 17, 2026Updated last week
- ☆49Apr 15, 2024Updated 2 years ago
- 使用Go语言搭建一个简单的OSS对象存储服务器,依据书ISBN978-7-115-48055-2编写,修复书上一些bug,并提供构建的完整视频过程☆11Aug 12, 2021Updated 4 years ago
- YOLOv8 C++ DET、SEG、POSE TENSORRT 推理库,便于学习开发拓展与工作中实际部署☆18Aug 22, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- LLM training parallelisms (DP, FSDP, TP, PP) in pure C☆28Jan 27, 2026Updated 3 months ago
- Hands-On Practical MLIR Tutorial☆772Oct 20, 2023Updated 2 years ago
- how to optimize some algorithm in cuda.☆2,998Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆22Updated this week
- A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.☆4,255May 17, 2026Updated last week
- A self-learning tutorail for CUDA High Performance Programing.☆989Jan 14, 2026Updated 4 months ago
- Based on Nano-vLLM, a simple replication of vLLM with self-contained paged attention and flash attention implementation☆753Mar 16, 2026Updated 2 months ago
- Nano vLLM☆13,595Apr 26, 2026Updated last month
- Puzzles for learning Triton, play it with minimal environment configuration!☆703Mar 17, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。☆7,099Dec 22, 2025Updated 5 months ago
- ☆34Dec 10, 2025Updated 5 months ago
- [CVPR2026] BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers☆33Mar 17, 2026Updated 2 months ago
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆12Jun 27, 2024Updated last year
- learning how CUDA works☆389Mar 3, 2025Updated last year
- ☆115Apr 23, 2026Updated last month
- Residual vector quantization for KV cache compression in large language model☆12Oct 22, 2024Updated last year
- Multi-party Private Set Intersections & Threshold Set Intersections☆14Apr 2, 2021Updated 5 years ago
- hands on model tuning with TVM and profile it on a Mac M1, x86 CPU, and GTX-1080 GPU.☆51Jun 15, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Apr 28, 2026Updated 3 weeks ago
- a simple API to use CUPTI☆10Aug 19, 2025Updated 9 months ago
- 🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Mod…☆4,005Jul 25, 2025Updated 10 months ago
- Code for "An Introduction to Tensor Tiling in MLIR" tutorial given at EuroLLVM 2025☆23Jun 5, 2025Updated 11 months ago
- ☆12Jan 25, 2023Updated 3 years ago
- ☆128Sep 22, 2025Updated 8 months ago
- Code release for LightMamba accepted by DATE 2025☆23Oct 18, 2025Updated 7 months ago
- Automatic tree delineation from LiDAR point couds☆12Jun 27, 2018Updated 7 years ago
- MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23)☆22Apr 17, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- K8sSim:A Kubernetes cluster simualtor☆23Feb 9, 2023Updated 3 years ago
- UltraScale Playbook 中文版☆154Mar 15, 2025Updated last year
- ☆13May 30, 2024Updated last year
- compiler learning resources collect.☆2,736Updated this week
- This project is designed to simulate GPU information, making it easier to test scenarios where a GPU is not available.☆65Jan 9, 2026Updated 4 months ago
- 分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等☆2,337May 8, 2026Updated 2 weeks ago
- ☆49Apr 29, 2025Updated last year