alibaba/heterogeneity-aware-lowering-and-optimization

☆258

Alternatives and similar repositories for heterogeneity-aware-lowering-and-optimization

Users who are interested in heterogeneity-aware-lowering-and-optimization are comparing it to the repositories listed below.

microsoft / nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆1,002 · Sep 19, 2024 · Updated last year

tensorflow / mlir-hlo
☆421 · Feb 24, 2026 · Updated 4 months ago

alibaba / BladeDISC
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
☆932 · Dec 30, 2024 · Updated last year

llvm / torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
☆1,867 · Updated this week

jiazhihao / TASO
The Tensor Algebra SuperOptimizer for Deep Learning
☆743 · Jan 26, 2023 · Updated 3 years ago

buddy-compiler / buddy-mlir
An MLIR-based compiler framework that bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
☆743 · Updated this week

tpoisonooo / how-to-optimize-gemm
row-major matmul optimization
☆743 · May 14, 2026 · Updated 2 months ago

alibaba / redfuser
☆21 · Mar 17, 2026 · Updated 4 months ago

cornell-zhang / heterocl
HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing (FPGA'19 Best Paper)
☆338 · Apr 20, 2024 · Updated 2 years ago

kumasento / polymer
Bridging polyhedral analysis tools to the MLIR framework
☆119 · Sep 9, 2023 · Updated 2 years ago

uwsampl / SparseTIR
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆145 · Mar 31, 2023 · Updated 3 years ago

iree-org / iree
A retargetable MLIR-based machine learning compiler and runtime toolkit.
☆3,847 · Updated this week

OpenPPL / ppl.nn
A primitive library for neural networks
☆1,367 · Nov 24, 2024 · Updated last year

polymage-labs / mlirx
MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com
☆39 · Dec 1, 2023 · Updated 2 years ago

alibaba / ai-matrix
To make it easy to benchmark AI accelerators
☆193 · Dec 27, 2022 · Updated 3 years ago

sophgo / tpu-mlir
Machine learning compiler based on MLIR for Sophgo TPU.
☆948 · Jul 8, 2026 · Updated last week

onnx / onnx-mlir
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
☆1,037 · Updated this week

mindspore-ai / akg
AKG (Auto Kernel Generator) is an optimizer for operators in Deep Learning Networks, which provides the ability to automatically fuse ops…
☆255 · Updated this week

bytedance / byteir
A model compilation solution for various hardware
☆473 · Aug 20, 2025 · Updated 11 months ago

pytorch / FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
☆1,570 · Updated this week

wzh99 / relay-mlir
An MLIR-based toy DL compiler for TVM Relay.
☆62 · Oct 16, 2022 · Updated 3 years ago

tensorflow / runtime
A performant and modular runtime for TensorFlow
☆753 · Sep 4, 2025 · Updated 10 months ago

google / XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
☆2,399 · Updated this week

merrymercy / awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
☆2,766 · Oct 19, 2024 · Updated last year

MLIR-China / mlir-playground
Play with MLIR right in your browser
☆140 · May 25, 2023 · Updated 3 years ago

flame / how-to-optimize-gemm
☆2,020 · Jul 29, 2023 · Updated 2 years ago

Inaxo / MathLib
MathLib is a versatile C++ library that provides a wide range of mathematical algorithms and functions, including but not limited to tran…
☆10 · Jun 6, 2023 · Updated 3 years ago

vortexgpgpu / NVPTX-SPIRV-Translator
The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.
☆45 · Oct 25, 2021 · Updated 4 years ago

huawei-noah / bolt
Bolt is a deep learning library with high performance and heterogeneous flexibility.
☆958 · Apr 11, 2025 · Updated last year

apache / tvm
Open Machine Learning Compiler Framework
☆13,588 · Updated this week

MegEngine / MegCC
MegCC is a deep learning model compiler with an ultra-lightweight runtime, high efficiency, and easy portability.
☆482 · Oct 23, 2024 · Updated last year

iml130 / mlir-emitc
Conversions to MLIR EmitC
☆134 · Dec 12, 2024 · Updated last year

cloudcores / CuAssembler
An unofficial cuda assembler, for all generations of SASS, hopefully ：）
☆609 · Apr 20, 2023 · Updated 3 years ago

pku-liang / AMOS
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
☆125 · Oct 26, 2022 · Updated 3 years ago

PolyArch / dsa-framework
Release of stream-specialization software/hardware stack.
☆126 · May 5, 2023 · Updated 3 years ago

awslabs / lorien
☆42 · Sep 8, 2023 · Updated 2 years ago

pigirons / cpufp
A CPU tool for benchmarking the peak of floating points
☆586 · May 4, 2026 · Updated 2 months ago

mmperf / mmperf
MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.
☆138 · Sep 25, 2023 · Updated 2 years ago

thu-pacman / PET
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆126 · Jun 23, 2022 · Updated 4 years ago