Free resource for the book AI Compiler Development Guide
☆49Dec 22, 2022Updated 3 years ago
Alternatives and similar repositories for AI_compiler_development_guide
Users that are interested in AI_compiler_development_guide are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Start AI Compiler☆46Feb 26, 2026Updated 3 weeks ago
- compiler learning resources collect.☆2,693Mar 19, 2025Updated last year
- An MLIR-based toy DL compiler for TVM Relay.☆61Oct 16, 2022Updated 3 years ago
- a simple general program language☆100Feb 2, 2026Updated last month
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell☆19Oct 16, 2015Updated 10 years ago
- 🐱 ncnn int8 模型量化评估☆14Oct 10, 2022Updated 3 years ago
- ☆150Mar 18, 2024Updated 2 years ago
- ☆11Nov 25, 2020Updated 5 years ago
- A practical way of learning Swizzle☆37Feb 3, 2025Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Mar 13, 2023Updated 3 years ago
- My study note for mlsys☆14Nov 4, 2024Updated last year
- A curated list of research papers, datasets, and tools for applying machine learning/Deep learning techniques to compilers and program op…☆124Sep 28, 2023Updated 2 years ago
- GEMM by WMMA (tensor core)☆15Jul 31, 2022Updated 3 years ago
- Tencent Distribution of TVM☆16Apr 7, 2023Updated 2 years ago
- Hands-On Practical MLIR Tutorial☆732Oct 20, 2023Updated 2 years ago
- tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)☆29Dec 12, 2023Updated 2 years ago
- play gemm with tvm☆92Jul 22, 2023Updated 2 years ago
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆81Aug 12, 2024Updated last year
- RISC-V CPU Labs in Chisel☆76Jan 31, 2026Updated last month
- Data Hiding in Image☆10Apr 9, 2020Updated 5 years ago
- Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?☆17Jan 6, 2021Updated 5 years ago
- Parallel Prefix Sum (Scan) with CUDA☆29Jun 22, 2024Updated last year
- Embedded Universal DSL: a good DSL for us, by us☆70Updated this week
- ☆48Mar 27, 2023Updated 2 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆18Feb 9, 2026Updated last month
- (Academia Sinica / Computer Vision / Deep Learning) Object Detection, Person Reid, Face Reid☆13Nov 21, 2022Updated 3 years ago
- A tiny Debugger : - )☆10Jan 24, 2021Updated 5 years ago
- 鉴定网络热门并行编程框架 - 性能测评(附小彭老师锐评)已评测:Taichi、SyCL、C++、OpenMP、TBB、Mojo☆40Aug 28, 2023Updated 2 years ago
- ☆19Apr 28, 2021Updated 4 years ago
- ☆13Jul 7, 2017Updated 8 years ago
- A Computational Graph Generator for AI Compiler Fuzzing☆16May 31, 2023Updated 2 years ago
- FFTE: A Fast Fourier Transform Package (Official tarballs are unpacked into master as commits)☆12Feb 17, 2024Updated 2 years ago
- row-major matmul optimization☆712Feb 24, 2026Updated last month
- Hands-On Practical MLIR Tutorial☆53Aug 21, 2025Updated 7 months ago
- hands on model tuning with TVM and profile it on a Mac M1, x86 CPU, and GTX-1080 GPU.☆49Jun 15, 2023Updated 2 years ago
- some knowleage about SystemC/TLM etc.☆29Mar 6, 2026Updated 2 weeks ago
- This repo contains the document of index and combined doc of solutions☆16May 25, 2025Updated 9 months ago
- ☆27Aug 9, 2025Updated 7 months ago
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year