microsoft/triton-shared

microsoft / triton-shared

Shared Middle-Layer for Triton Compilation

☆340

Alternatives and similar repositories for triton-shared

Users that are interested in triton-shared are comparing it to the libraries listed below.

Cambricon / triton-linalg
Development repository for the Triton-Linalg conversion
☆221 · Feb 7, 2025 · Updated last year
libxsmm / tpp-mlir
TPP experimentation on MLIR for linear algebra
☆155 · Updated this week
triton-lang / Triton-to-tile-IR
Incubator repo for CUDA-TileIR backend
☆148 · Jul 10, 2026 · Updated last week
llvm / torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
☆1,867 · Updated this week
flagos-ai / FlagGems
FlagGems is an operator library for large language models implemented in the Triton Language.
☆1,052 · Updated this week
KEKE046 / mlir-tutorial
Hands-On Practical MLIR Tutorial
☆811 · Oct 20, 2023 · Updated 2 years ago
bytedance / byteir
A model compilation solution for various hardware
☆473 · Aug 20, 2025 · Updated 11 months ago
ROCm / rocMLIR
☆183 · Updated this week
makslevental / mlir-python-extras
The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.
☆118 · Mar 4, 2026 · Updated 4 months ago
tfruan2000 / mlsys-study-note
My study note for mlsys
☆14 · Nov 4, 2024 · Updated last year
iree-org / iree
A retargetable MLIR-based machine learning compiler and runtime toolkit.
☆3,847 · Updated this week
buddy-compiler / buddy-mlir
An MLIR-based compiler framework that bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
☆743 · Updated this week
intel / intel-xpu-backend-for-triton
OpenAI Triton backend for Intel® GPUs
☆258 · Updated this week
TiledTensor / TiledCUDA
We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …
☆192 · Jan 28, 2025 · Updated last year
iree-org / iree-turbine
IREE's PyTorch Frontend, based on Torch Dynamo.
☆110 · Jul 1, 2026 · Updated 2 weeks ago
llvm / Polygeist
C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!
☆623 · Jun 19, 2025 · Updated last year
onnx / onnx-mlir
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
☆1,037 · Updated this week
ByteDance-Seed / Triton-distributed
Distributed Compiler based on Triton for Parallel Systems
☆1,493 · Jul 11, 2026 · Updated last week
triton-lang / triton-cpu
An experimental CPU backend for Triton
☆201 · Updated this week
openxla / shardy
MLIR-based partitioning system
☆197 · Updated this week
TiledTensor / TiledLower
TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.
☆13 · Nov 23, 2024 · Updated last year
Deep-Learning-Profiling-Tools / triton-viz
☆350 · Updated this week
tensorflow / mlir-hlo
☆421 · Feb 24, 2026 · Updated 4 months ago
alibaba / BladeDISC
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
☆932 · Dec 30, 2024 · Updated last year
serdes21 / flashtile
FlashTile is a CUDA Tile IR compiler that is compatible with NVIDIA's tileiras, targeting SM70 through SM121 NVIDIA GPUs.
☆61 · Feb 6, 2026 · Updated 5 months ago
LeiWang1999 / Stream-k.tvm
☆20 · Sep 28, 2024 · Updated last year
makslevental / nelli
A lightweight, Pythonic, frontend for MLIR
☆80 · Oct 21, 2023 · Updated 2 years ago
flagos-ai / libtriton_jit
A Triton JIT runtime and ffi provider in C++
☆37 · Updated this week
iml130 / mlir-emitc
Conversions to MLIR EmitC
☆134 · Dec 12, 2024 · Updated last year
triton-lang / kernels
☆115 · Mar 12, 2026 · Updated 4 months ago
sophgo / tpu-mlir
Machine learning compiler based on MLIR for Sophgo TPU.
☆948 · Jul 8, 2026 · Updated last week
meta-pytorch / tritonparse
TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels
☆211 · Updated this week
TiledTensor / TiledKernel
TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.
☆19 · May 12, 2024 · Updated 2 years ago
j2kun / mlir-tutorial
MLIR For Beginners tutorial
☆1,328 · Jul 18, 2025 · Updated last year
facebookexperimental / triton
GitHub mirror of the triton-lang/triton repo.
☆178 · Updated this week
thuml / depyf
depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.
☆815 · Oct 13, 2025 · Updated 9 months ago
KnowingNothing / MatmulTutorial
An Easy-to-understand TensorOp Matmul Tutorial
☆445 · Mar 5, 2026 · Updated 4 months ago
NVIDIA / cuda-tile
CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-base…
☆999 · Jul 6, 2026 · Updated 2 weeks ago
IBM / triton-dejavu
Framework to reduce autotune overhead to zero for well known deployments.
☆101 · Sep 19, 2025 · Updated 10 months ago