MLIR-based partitioning system
☆167Mar 1, 2026Updated this week
Alternatives and similar repositories for shardy
Users that are interested in shardy are comparing it to the libraries listed below
Sorting:
- Backward compatible ML compute opset inspired by HLO/MHLO☆616Updated this week
- TPP experimentation on MLIR for linear algebra☆146Feb 24, 2026Updated last week
- Retargetable ML compilers for the twenty-first century!☆13Apr 22, 2025Updated 10 months ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆105Updated this week
- The TT-Forge ONNX is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their p…☆54Updated this week
- ☆422Jan 4, 2026Updated last month
- Shared Middle-Layer for Triton Compilation☆329Dec 5, 2025Updated 2 months ago
- Bridging polyhedral analysis tools to the MLIR framework☆119Sep 9, 2023Updated 2 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆148Updated this week
- A Python compiler design toolkit.☆494Updated this week
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM☆62Mar 21, 2025Updated 11 months ago
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆4,023Updated this week
- C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!☆605Jun 19, 2025Updated 8 months ago
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,754Updated this week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,614Updated this week
- Buda Compiler Backend for Tenstorrent devices☆30Apr 2, 2025Updated 11 months ago
- Library to interface Compilers and ML models for ML-Enabled Compiler Optimizations☆20Oct 19, 2025Updated 4 months ago
- Conversions to MLIR EmitC☆134Dec 12, 2024Updated last year
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.☆52Updated this week
- Repository for AI model benchmarking on TT-Buda☆15Feb 9, 2026Updated 3 weeks ago
- ☆45Updated this week
- Python interface for MLIR - the Multi-Level Intermediate Representation☆272Nov 28, 2024Updated last year
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆106Jun 28, 2025Updated 8 months ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆977Feb 24, 2026Updated last week
- PyTorch distributed training acceleration framework☆54Aug 13, 2025Updated 6 months ago
- Experiments and prototypes associated with IREE or MLIR☆56Aug 9, 2024Updated last year
- Example of applying CUDA graphs to LLaMA-v2☆12Aug 25, 2023Updated 2 years ago
- ☆16Updated this week
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆25Nov 29, 2024Updated last year
- Gallina to Bedrock2 compilation toolkit☆65Feb 24, 2026Updated last week
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆118Nov 5, 2025Updated 3 months ago
- Tenstorrent MLIR compiler☆249Updated this week
- A set of Python scripts that makes your experience on TPU better☆56Sep 18, 2025Updated 5 months ago
- Embedded Universal DSL: a good DSL for us, by us☆66Updated this week
- Distributed Compiler based on Triton for Parallel Systems☆1,371Feb 13, 2026Updated 2 weeks ago
- extensible collectives library in triton☆95Mar 31, 2025Updated 11 months ago
- ☆160Dec 27, 2024Updated last year
- TVM for Tenstorrent ASICs☆28Sep 8, 2025Updated 5 months ago
- An MLIR dialect to enable the efficient acceleration of ML model on CGRAs.☆65Oct 9, 2024Updated last year