This repository organizes materials, recordings, and schedules related to AI-infra learning meetings.
☆370Mar 1, 2026Updated 3 weeks ago
Alternatives and similar repositories for ai-infra-learning
Users that are interested in ai-infra-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ViTALiTy (HPCA'23) Code Repository☆23Mar 13, 2023Updated 3 years ago
- An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences☆32Mar 7, 2024Updated 2 years ago
- 📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉☆9,932Updated this week
- Kubernetes Learning☆12Aug 4, 2024Updated last year
- 使用Go语言搭建一个简单的OSS对象存储服务器,依据书ISBN978-7-115-48055-2编写,修复书上一些bug,并提供构建的完整视频过程☆11Aug 12, 2021Updated 4 years ago
- YOLOv8 C++ DET、SEG、POSE TENSORRT 推理库,便于学习开发拓展与工作中实际部署☆18Aug 22, 2023Updated 2 years ago
- LLM training parallelisms (DP, FSDP, TP, PP) in pure C☆26Jan 27, 2026Updated last month
- SGLang is a fast serving framework for large language models and vision language models.☆20Updated this week
- how to optimize some algorithm in cuda.☆2,872Mar 17, 2026Updated last week
- Code for NDSS '25 paper "Passive Inference Attacks on Split Learning via Adversarial Regularization"☆13Sep 16, 2024Updated last year
- Puzzles for learning Triton, play it with minimal environment configuration!☆647Mar 17, 2026Updated last week
- ☆33Dec 10, 2025Updated 3 months ago
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆12Jun 27, 2024Updated last year
- ☆96Jan 22, 2026Updated 2 months ago
- 🎉 An awesome & curated list of best LLMOps tools.☆215Mar 16, 2026Updated last week
- A 5-level pipelined MIPS CPU with branch prediction and great cache.☆19May 9, 2021Updated 4 years ago
- A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.☆3,765Mar 13, 2026Updated last week
- Multi-party Private Set Intersections & Threshold Set Intersections☆14Apr 2, 2021Updated 4 years ago
- Residual vector quantization for KV cache compression in large language model☆12Oct 22, 2024Updated last year
- A self-learning tutorail for CUDA High Performance Programing.☆915Jan 14, 2026Updated 2 months ago
- A lightweight, configurable, and real-time simulator designed to mimic the behavior of vLLM without the need for GPUs or running actual h…☆103Updated this week
- 电子科技大学2022级研究生课程《图论及其应用》,包含教材、课件、作业和复习时写的东西。☆43Nov 26, 2022Updated 3 years ago
- a simple API to use CUPTI☆10Aug 19, 2025Updated 7 months ago
- hands on model tuning with TVM and profile it on a Mac M1, x86 CPU, and GTX-1080 GPU.☆49Jun 15, 2023Updated 2 years ago
- ☆13Mar 15, 2026Updated last week
- MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23)☆22Apr 17, 2024Updated last year
- Code for "An Introduction to Tensor Tiling in MLIR" tutorial given at EuroLLVM 2025☆22Jun 5, 2025Updated 9 months ago
- ☆121Sep 22, 2025Updated 6 months ago
- AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。☆6,461Dec 22, 2025Updated 3 months ago
- ☆12Jan 25, 2023Updated 3 years ago
- This is a c++ implement of yolov5 and fire/smoke detect.☆32Dec 15, 2021Updated 4 years ago
- learning how CUDA works☆379Mar 3, 2025Updated last year
- K8sSim:A Kubernetes cluster simualtor☆22Feb 9, 2023Updated 3 years ago
- This project is designed to simulate GPU information, making it easier to test scenarios where a GPU is not available.☆65Jan 9, 2026Updated 2 months ago
- SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration (Full Paper Accepted in FPGA'24)☆36Mar 12, 2026Updated last week
- A practical way of learning Swizzle☆37Feb 3, 2025Updated last year
- Triton to TVM transpiler.☆23Oct 14, 2024Updated last year
- Vitis 部署加速器工作流介绍☆11Jan 10, 2025Updated last year
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆51Oct 21, 2023Updated 2 years ago