AnHaechan / ai-compilers-study-material
A collection of study materials for AI compilers and systems.
☆44 Updated 3 weeks ago
Alternatives and similar repositories for ai-compilers-study-material
Users who are interested in ai-compilers-study-material are comparing it to the libraries listed below.
- An attempt at safe imperative GPU programming. ☆58 Updated 2 months ago
- High level synthesis language for hardware design ☆61 Updated this week
- SBLP 2025 MLIR Tutorial ☆62 Updated last month
- High-Performance SGEMM on CUDA devices ☆107 Updated 9 months ago
- Cuq: A MIR-to-Coq Framework Targeting PTX for Formal Semantics and Verified Translation of Rust GPU Kernels ☆82 Updated last week
- The Finite Field Assembly Programming Language ☆36 Updated 5 months ago
- Custom PTX Instruction Benchmark ☆131 Updated 8 months ago
- Website for CS 265 ☆30 Updated 10 months ago
- Tenstorrent MLIR compiler ☆199 Updated this week
- Tensor library with autograd using only Rust's standard library ☆70 Updated last year
- Library to interface Compilers and ML models for ML-Enabled Compiler Optimizations ☆18 Updated last week
- Super fast FP32 matrix multiplication on RDNA3 ☆76 Updated 7 months ago
- An interactive web-based tool for exploring intermediate representations of PyTorch and Triton models ☆51 Updated last month
- My submission for the GPUMODE/AMD fp8 mm challenge ☆29 Updated 4 months ago
- A massively parallel, optimal functional runtime in Rust ☆31 Updated last year
- Can I make an *optimizing* compiler under 1k lines of code? ☆63 Updated 8 months ago
- Tensor library & inference framework for machine learning ☆113 Updated 3 weeks ago
- LLM training in simple, raw C/CUDA ☆107 Updated last year
- A toy compiler for NumPy array expressions that uses e-graphs and MLIR ☆109 Updated 2 months ago
- Wave: Python Domain-Specific Language for High Performance Machine Learning ☆17 Updated last week
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings. ☆110 Updated 3 weeks ago
- ☆18 Updated 4 months ago
- Learning about CUDA by writing PTX code. ☆145 Updated last year
- Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks ☆90 Updated 3 weeks ago
- tiny code to access tenstorrent blackhole ☆60 Updated 5 months ago
- ☆76 Updated this week
- ☆56 Updated 4 months ago
- A pure, low-level tensor program representation enabling tensor program optimization via program rewriting. See the web demo at https://g… ☆70 Updated 5 months ago
- Tenstorrent console based hardware information program ☆54 Updated last week
- An experimental optimizing compiler for Bril using egglog ☆79 Updated this week