iree-org/wave

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/iree-org/wave)

iree-org / wave

Wave: Python Domain-Specific Language for High Performance Machine Learning

☆58

Alternatives and similar repositories for wave

Users that are interested in wave are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ROCm / hrx-system
View on GitHub
HRX: Hip Runtime Extended
☆18Updated this week
iree-org / aster
View on GitHub
ASTER 💫 : Assembly Tooling and Representations
☆32Jul 1, 2026Updated 2 weeks ago
ROCm / tritonBLAS
View on GitHub
A lightweight triton-based General Matrix Multiplication (GEMM) library.
☆65Jun 13, 2026Updated last month
seb-v / amd_challenge_solutions
View on GitHub
☆19Jun 6, 2025Updated last year
dafny-lang / xdsmith
View on GitHub
Fuzz testing for Dafny
☆13Jul 7, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ROCm / FlyDSL
View on GitHub
FlyDSL is the Python front‑end of the project: Flexible LaYout DSL.
☆237Updated this week
ROCm / iris
View on GitHub
AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming
☆193Updated this week
triton-lang / triton-ext
View on GitHub
A collection of out-of-tree extensions for the Triton language and compiler
☆30Updated this week
foundation-model-stack / vllm-triton-backend
View on GitHub
A Triton-only attention backend for vLLM
☆27Updated this week
meta-pytorch / BackendBench
View on GitHub
Ship correct and fast LLM kernels to PyTorch
☆151Jan 14, 2026Updated 6 months ago
cherichy / tilecute
View on GitHub
☆32Jul 2, 2025Updated last year
seb-v / fp32_sgemm_amd
View on GitHub
Super fast FP32 matrix multiplication on RDNA3
☆92Mar 30, 2025Updated last year
ezyang / cute-interactive
View on GitHub
Interactive version of the CuTe layout paper
☆57Apr 14, 2026Updated 3 months ago
ROCm / aotriton
View on GitHub
Ahead of Time (AOT) Triton Math Library
☆100Jul 13, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
YJMSTR / flash-linear-attention
View on GitHub
FLA but cuTile
☆27Apr 17, 2026Updated 3 months ago
triton-lang / Triton-to-tile-IR
View on GitHub
incubator repo for CUDA-TileIR backend
☆148Jul 10, 2026Updated last week
iree-org / iree-turbine
View on GitHub
IREE's PyTorch Frontend, based on Torch Dynamo.
☆109Jul 1, 2026Updated 2 weeks ago
openxla / shardy
View on GitHub
MLIR-based partitioning system
☆198Updated this week
ROCm / rocprof-compute-viewer
View on GitHub
☆61Updated this week
ROCm / rocWMMA
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆140Jul 13, 2026Updated last week
ROCm / rocMLIR
View on GitHub
☆183Updated this week
tile-ai / TileFoundry
View on GitHub
☆54Updated this week
libxsmm / tpp-mlir
View on GitHub
TPP experimentation on MLIR for linear algebra
☆155Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
facebookresearch / tensor-layouts
View on GitHub
A pure-Python implementation of the Nvidia CuTe layout algebra intended to be approachable and easy to learn.
☆231Jun 29, 2026Updated 3 weeks ago
alibaba / redfuser
View on GitHub
☆21Mar 17, 2026Updated 4 months ago
NVIDIA / cuda-tile
View on GitHub
CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-base…
☆999Jul 6, 2026Updated 2 weeks ago
Groverkss / mlir-tutor
View on GitHub
Exercises for Learning MLIR (Originally written for PPoPP 2026)
☆106Feb 5, 2026Updated 5 months ago
iree-org / fusilli
View on GitHub
C++ Graph API and JIT Engine powered by IREE
☆25Jul 1, 2026Updated 2 weeks ago
Infatoshi / KernelBench-Hard
View on GitHub
Surgical GPU kernel benchmark: 7 hard problems, frontier coding agents, roofline-graded against hardware peak.
☆19Jun 12, 2026Updated last month
Groverkss / tinytile
View on GitHub
Code for "An Introduction to Tensor Tiling in MLIR" tutorial given at EuroLLVM 2025
☆24Jun 5, 2025Updated last year
ROCm / ATOM
View on GitHub
AiTer Optimized Model
☆141Updated this week
llvm / eudsl
View on GitHub
Embedded Universal DSL: a good DSL for us, by us
☆76Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
facebookexperimental / triton
View on GitHub
Github mirror of trition-lang/triton repo.
☆178Updated this week
flagos-ai / libtriton_jit
View on GitHub
A Triton JIT runtime and ffi provider in C++
☆37Updated this week
patrick-toulme / justabyte
View on GitHub
Code snippets and reproductions from JustAByte
☆48Apr 6, 2026Updated 3 months ago
IITH-Compilers / ML-Compiler-Bridge
View on GitHub
Library to interface Compilers and ML models for ML-Enabled Compiler Optimizations
☆20Oct 19, 2025Updated 9 months ago
ROCm / rocmProfileData
View on GitHub
☆30Jun 16, 2026Updated last month
cucapra / latte21
View on GitHub
Languages, Tools, and Techniques for Accelerator Design
☆33Nov 2, 2021Updated 4 years ago
wafer-ai / kernel-arena
View on GitHub
Public benchmark results from Kernel Arena, a leaderboard for LLM-generated AI accelerator kernels.
☆20Mar 11, 2026Updated 4 months ago