mvvsmk / OptMLLinks

Welcome to OptML! This repository is designed for those new to MLIR and machine learning-based optimizations. As a compiler enthusiast, I wanted to create a platform for hobbyists like myself to experiment with and benchmark new optimizations on real ML models in an out-of-tree manner.

☆20

Alternatives and similar repositories for OptML

Users that are interested in OptML are comparing it to the libraries listed below

Sorting:

tenstorrent / tt-mlir
Tenstorrent MLIR compiler
☆141Updated this week
makslevental / mlir-python-extras
The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.
☆102Updated this week
mwillsey / cs265
Website for CS 265
☆29Updated 6 months ago
intel / graph-compiler
MLIR-based toolkit targeting intel heterogeneous hardware
☆43Updated 4 months ago
MrSidims / PytorchExplorer
An interactive web-based tool for exploring intermediate representations of PyTorch and Triton models
☆46Updated 2 weeks ago
CisMine / Guide-NVIDIA-Tools
NVIDIA tools guide
☆135Updated 5 months ago
tenstorrent / tt-forge
Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…
☆72Updated this week
openxla / shardy
MLIR-based partitioning system
☆97Updated this week
intel / mlir-extensions
Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.
☆134Updated last week
MPACT-ORG / mpact-compiler
Retargetable ML compilers for the twenty-first century!
☆13Updated 2 months ago
VimalWill / TinyCompiler
MLIR based Tiny Graph Compiler [dev-stage]
☆18Updated 7 months ago
tenstorrent / tt-isa-documentation
☆46Updated last week
kuterd / nv_isa_solver
Nvidia Instruction Set Specification Generator
☆278Updated 11 months ago
jmgorius / mlir-standalone-template
An out-of-tree MLIR dialect template.
☆102Updated 9 months ago
tenstorrent / tt-forge-fe
The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…
☆44Updated this week
seb-v / fp32_sgemm_amd
Super fast FP32 matrix multiplication on RDNA3
☆64Updated 2 months ago
tenstorrent / tensix-isa-simulator
☆29Updated 3 months ago
makslevental / mmlir
A minimal (really) out-of-tree MLIR example
☆44Updated 2 weeks ago
JanakiSubu / GPU_CUDA_100
100 days of CUDA Challenge
☆38Updated this week
CRobeck / instrument-amdgpu-kernels
LLVM/MLIR based compiler instrumentation of AMD GPU kernels
☆18Updated last month
libxsmm / tpp-mlir
TPP experimentation on MLIR for linear algebra
☆131Updated this week
0xD0GF00D / DocumentSASS
Unofficial description of the CUDA assembly (SASS) instruction sets.
☆104Updated 3 months ago
drkennetz / cuda_examples
Some CUDA example code with READMEs.
☆168Updated 3 months ago
iree-org / iree-turbine
IREE's PyTorch Frontend, based on Torch Dynamo.
☆87Updated this week
seb-v / amd_challenge_solutions
☆18Updated 3 weeks ago
wpmed92 / shaderpulse
A GLSL compiler targeting SPIR-V mlir
☆20Updated 8 months ago
nod-ai / iree-amd-aie
IREE plugin repository for the AMD AIE accelerator
☆98Updated this week
pytorch-labs / triton-cpu
An experimental CPU backend for Triton (https//github.com/openai/triton)
☆43Updated 3 months ago
arc-research-lab / Aries
ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee)
☆33Updated last week
opencompl / Quidditch
IREE compiler and runtime for Snitch
☆12Updated 2 months ago