MLIR-based partitioning system
☆174Mar 20, 2026Updated this week
Alternatives and similar repositories for shardy
Users that are interested in shardy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Backward compatible ML compute opset inspired by HLO/MHLO☆628Updated this week
- TPP experimentation on MLIR for linear algebra☆146Updated this week
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.☆54Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo.☆106Updated this week
- The TT-Forge ONNX is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their p…☆54Updated this week
- Shared Middle-Layer for Triton Compilation☆329Dec 5, 2025Updated 3 months ago
- ☆423Feb 24, 2026Updated 3 weeks ago
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆4,100Updated this week
- JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training☆68Mar 11, 2026Updated last week
- Buda Compiler Backend for Tenstorrent devices☆30Apr 2, 2025Updated 11 months ago
- Retargetable ML compilers for the twenty-first century!☆13Apr 22, 2025Updated 11 months ago
- Bridging polyhedral analysis tools to the MLIR framework☆119Sep 9, 2023Updated 2 years ago
- A Python compiler design toolkit.☆501Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆148Updated this week
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM☆62Mar 21, 2025Updated last year
- TVM for Tenstorrent ASICs☆28Sep 8, 2025Updated 6 months ago
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,770Mar 13, 2026Updated last week
- C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!☆606Jun 19, 2025Updated 9 months ago
- Conversions to MLIR EmitC☆135Dec 12, 2024Updated last year
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,661Updated this week
- extensible collectives library in triton☆97Mar 31, 2025Updated 11 months ago
- Tenstorrent MLIR compiler☆250Updated this week
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆106Jun 28, 2025Updated 8 months ago
- Experiments and prototypes associated with IREE or MLIR☆56Aug 9, 2024Updated last year
- PyTorch distributed training acceleration framework☆54Aug 13, 2025Updated 7 months ago
- ☆17Updated this week
- Python interface for MLIR - the Multi-Level Intermediate Representation☆272Nov 28, 2024Updated last year
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆986Updated this week
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆30Dec 21, 2024Updated last year
- Distributed Compiler based on Triton for Parallel Systems☆1,394Mar 11, 2026Updated last week
- ☆46Updated this week
- Convert StableHLO models into Apple Core ML format☆22Updated this week
- ☆163Dec 27, 2024Updated last year
- Close-to-metal programming for AMD NPUs☆85Updated this week
- HeteroCL-MLIR dialect for accelerator design☆42Sep 18, 2024Updated last year
- ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee)☆60Mar 8, 2026Updated 2 weeks ago
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆119Mar 4, 2026Updated 2 weeks ago
- Embedded Universal DSL: a good DSL for us, by us☆70Updated this week
- torchprime is a reference model implementation for PyTorch on TPU.☆46Mar 3, 2026Updated 3 weeks ago