Shared Middle-Layer for Triton Compilation
☆335 Dec 5, 2025 Updated 6 months ago
Alternatives and similar repositories for triton-shared
Users that are interested in triton-shared are comparing it to the libraries listed below.
- Development repository for the Triton-Linalg conversion ☆219 Feb 7, 2025 Updated last year
- TPP experimentation on MLIR for linear algebra ☆150 Updated this week
- FlagGems is an operator library for large language models implemented in the Triton Language. ☆1,013 Jun 3, 2026 Updated last week
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem. ☆1,838 Updated this week
- Hands-On Practical MLIR Tutorial ☆787 Oct 20, 2023 Updated 2 years ago
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings. ☆118 Mar 4, 2026 Updated 3 months ago
- A model compilation solution for various hardware ☆470 Aug 20, 2025 Updated 9 months ago
- ☆182 Updated this week
- My study note for mlsys ☆14 Nov 4, 2024 Updated last year
- A retargetable MLIR-based machine learning compiler and runtime toolkit. ☆3,787 Updated this week
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures). ☆724 Updated this week
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure ☆1,026 Updated this week
- We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel … ☆193 Jan 28, 2025 Updated last year
- OpenAI Triton backend for Intel® GPUs ☆255 Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo. ☆109 Jun 3, 2026 Updated last week
- C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more! ☆616 Jun 19, 2025 Updated 11 months ago
- Distributed Compiler based on Triton for Parallel Systems ☆1,455 Apr 22, 2026 Updated last month
- An experimental CPU backend for Triton ☆197 May 26, 2026 Updated 2 weeks ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust. ☆13 Nov 23, 2024 Updated last year
- ☆332 Jun 3, 2026 Updated last week
- ☆423 Feb 24, 2026 Updated 3 months ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads. ☆928 Dec 30, 2024 Updated last year
- ☆20 Sep 28, 2024 Updated last year
- A lightweight, Pythonic, frontend for MLIR ☆80 Oct 21, 2023 Updated 2 years ago
- TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels ☆210 Jun 3, 2026 Updated last week
- Conversions to MLIR EmitC ☆135 Dec 12, 2024 Updated last year
- ☆113 Mar 12, 2026 Updated 2 months ago
- Machine learning compiler based on MLIR for Sophgo TPU. ☆929 Updated this week
- MLIR For Beginners tutorial ☆1,307 Jul 18, 2025 Updated 10 months ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure. ☆19 May 12, 2024 Updated 2 years ago
- A Easy-to-understand TensorOp Matmul Tutorial ☆440 Mar 5, 2026 Updated 3 months ago
- depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile. ☆809 Oct 13, 2025 Updated 7 months ago
- Framework to reduce autotune overhead to zero for well known deployments. ☆101 Sep 19, 2025 Updated 8 months ago
- Python interface for MLIR - the Multi-Level Intermediate Representation ☆271 Nov 28, 2024 Updated last year
- incubator repo for CUDA-TileIR backend ☆138 Apr 22, 2026 Updated last month
- A list of awesome compiler projects and papers for tensor computation and deep learning. ☆2,754 Oct 19, 2024 Updated last year
- Fast low-bit matmul kernels in Triton ☆467 May 15, 2026 Updated 3 weeks ago
- MLIR-based partitioning system ☆190 Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆153 Jun 1, 2026 Updated last week