alibaba / heterogeneity-aware-lowering-and-optimization
heterogeneity-aware-lowering-and-optimization
☆257Jan 20, 2024Updated 2 years ago
Alternatives and similar repositories for heterogeneity-aware-lowering-and-optimization
Users who are interested in heterogeneity-aware-lowering-and-optimization are comparing it to the libraries listed below.
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,006Sep 19, 2024Updated last year
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆916Dec 30, 2024Updated last year
- The Tensor Algebra SuperOptimizer for Deep Learning☆741Jan 26, 2023Updated 3 years ago
- ☆422Jan 4, 2026Updated last month
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,745Updated this week
- An MLIR-based compiler framework that bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).☆695Updated this week
- row-major matmul optimization☆701Aug 20, 2025Updated 5 months ago
- Machine learning compiler based on MLIR for Sophgo TPU.☆864Updated this week
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing (FPGA'19 Best Paper)☆341Apr 20, 2024Updated last year
- Bridging polyhedral analysis tools to the MLIR framework☆119Sep 9, 2023Updated 2 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Dec 1, 2023Updated 2 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆142Mar 31, 2023Updated 2 years ago
- Play with MLIR right in your browser☆138May 25, 2023Updated 2 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆44Oct 25, 2021Updated 4 years ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆973Feb 6, 2026Updated last week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,604Updated this week
- A primitive library for neural network☆1,368Nov 24, 2024Updated last year
- An MLIR-based toy DL compiler for TVM Relay.☆61Oct 16, 2022Updated 3 years ago
- AKG (Auto Kernel Generator) is an optimizer for operators in Deep Learning Networks, which provides the ability to automatically fuse ops…☆245Dec 13, 2025Updated 2 months ago
- MegCC is a deep learning model compiler with an ultra-lightweight runtime, high efficiency, and easy portability.☆488Oct 23, 2024Updated last year
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,530Updated this week
- A performant and modular runtime for TensorFlow☆753Sep 4, 2025Updated 5 months ago
- ☆17Jan 1, 2024Updated 2 years ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆956Apr 11, 2025Updated 10 months ago
- A list of awesome compiler projects and papers for tensor computation and deep learning.☆2,728Oct 19, 2024Updated last year
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆125Jun 23, 2022Updated 3 years ago
- ☆1,988Jul 29, 2023Updated 2 years ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆138Sep 25, 2023Updated 2 years ago
- A high-performance, extensible Python AOT compiler.☆451Sep 26, 2023Updated 2 years ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆567Apr 20, 2023Updated 2 years ago
- ☆19May 11, 2024Updated last year
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆121Oct 26, 2022Updated 3 years ago
- A model compilation solution for various hardware☆464Aug 20, 2025Updated 5 months ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64May 22, 2018Updated 7 years ago
- Shared Middle-Layer for Triton Compilation☆326Dec 5, 2025Updated 2 months ago
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,255Updated this week
- A CPU tool for benchmarking the peak of floating points☆576Feb 7, 2026Updated last week
- Adlik: Toolkit for Accelerating Deep Learning Inference☆809Dec 27, 2023Updated 2 years ago
- Release of stream-specialization software/hardware stack.☆120May 5, 2023Updated 2 years ago