heterogeneity-aware-lowering-and-optimization
☆257 · Jan 20, 2024 · Updated 2 years ago
Alternatives and similar repositories for heterogeneity-aware-lowering-and-optimization
Users who are interested in heterogeneity-aware-lowering-and-optimization are comparing it to the repositories listed below.
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description. ☆1,005 · Sep 19, 2024 · Updated last year
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads. ☆917 · Dec 30, 2024 · Updated last year
- The Tensor Algebra SuperOptimizer for Deep Learning ☆739 · Jan 26, 2023 · Updated 3 years ago
- ☆423 · Feb 24, 2026 · Updated last week
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem. ☆1,760 · Updated this week
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures). ☆696 · Updated this week
- row-major matmul optimization ☆707 · Feb 24, 2026 · Updated last week
- Machine learning compiler based on MLIR for Sophgo TPU. ☆872 · Feb 12, 2026 · Updated 3 weeks ago
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing (FPGA'19 Best Paper) ☆341 · Apr 20, 2024 · Updated last year
- Bridging polyhedral analysis tools to the MLIR framework ☆119 · Sep 9, 2023 · Updated 2 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com ☆38 · Dec 1, 2023 · Updated 2 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning ☆143 · Mar 31, 2023 · Updated 2 years ago
- Play with MLIR right in your browser ☆138 · May 25, 2023 · Updated 2 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator. ☆44 · Oct 25, 2021 · Updated 4 years ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure ☆981 · Updated this week
- A retargetable MLIR-based machine learning compiler and runtime toolkit. ☆3,621 · Updated this week
- A primitive library for neural network ☆1,367 · Nov 24, 2024 · Updated last year
- An MLIR-based toy DL compiler for TVM Relay. ☆61 · Oct 16, 2022 · Updated 3 years ago
- AKG (Auto Kernel Generator) is an optimizer for operators in Deep Learning Networks, which provides the ability to automatically fuse ops… ☆244 · Dec 13, 2025 · Updated 2 months ago
- MegCC is a deep learning model compiler with an ultra-lightweight runtime, high efficiency, and easy portability. ☆485 · Oct 23, 2024 · Updated last year
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,535 · Updated this week
- A performant and modular runtime for TensorFlow ☆753 · Sep 4, 2025 · Updated 6 months ago
- ☆17 · Jan 1, 2024 · Updated 2 years ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility. ☆957 · Apr 11, 2025 · Updated 10 months ago
- A list of awesome compiler projects and papers for tensor computation and deep learning. ☆2,733 · Oct 19, 2024 · Updated last year
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections ☆125 · Jun 23, 2022 · Updated 3 years ago
- ☆1,995 · Jul 29, 2023 · Updated 2 years ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels. ☆138 · Sep 25, 2023 · Updated 2 years ago
- A high-performance, extensible Python AOT compiler. ☆451 · Sep 26, 2023 · Updated 2 years ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :) ☆572 · Apr 20, 2023 · Updated 2 years ago
- ☆19 · May 11, 2024 · Updated last year
- A model compilation solution for various hardware ☆464 · Aug 20, 2025 · Updated 6 months ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators ☆121 · Oct 26, 2022 · Updated 3 years ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together ☆64 · May 22, 2018 · Updated 7 years ago
- High-efficiency floating-point neural network inference operators for mobile, server, and Web ☆2,267 · Updated this week
- A CPU tool for benchmarking the peak of floating points ☆579 · Feb 7, 2026 · Updated last month
- Shared Middle-Layer for Triton Compilation ☆331 · Dec 5, 2025 · Updated 3 months ago
- Adlik: Toolkit for Accelerating Deep Learning Inference ☆807 · Dec 27, 2023 · Updated 2 years ago
- Release of stream-specialization software/hardware stack. ☆120 · May 5, 2023 · Updated 2 years ago