iree-org / ireeLinks

A retargetable MLIR-based machine learning compiler and runtime toolkit.

☆3,241

Alternatives and similar repositories for iree

Users that are interested in iree are comparing it to the libraries listed below

Sorting:

llvm / torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
☆1,591Updated last week
openxla / xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
☆3,383Updated this week
onnx / onnx-mlir
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
☆887Updated this week
google / XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
☆2,072Updated last week
openxla / stablehlo
Backward compatible ML compute opset inspired by HLO/MHLO
☆510Updated last week
tensorflow / mlir
"Multi-Level Intermediate Representation" Compiler Infrastructure
☆1,752Updated 4 years ago
pytorch / FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
☆1,415Updated this week
merrymercy / awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
☆2,617Updated 9 months ago
pytorch / glow
Compiler for Neural Network hardware accelerators
☆3,310Updated last year
NVIDIA / cccl
CUDA Core Compute Libraries
☆1,805Updated last week
j2kun / mlir-tutorial
MLIR For Beginners tutorial
☆1,029Updated 2 weeks ago
NVIDIA / cutlass
CUDA Templates for Linear Algebra Subroutines
☆8,149Updated this week
google / ml-compiler-opt
Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.
☆709Updated this week
tensorflow / runtime
A performant and modular runtime for TensorFlow
☆758Updated 3 months ago
intel / llvm
Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
☆1,357Updated this week
zwang4 / awesome-machine-learning-in-compilers
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisati…
☆1,579Updated 2 months ago
alibaba / BladeDISC
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
☆886Updated 7 months ago
uxlfoundation / oneDNN
oneAPI Deep Neural Network Library (oneDNN)
☆3,856Updated this week
HazyResearch / ThunderKittens
Tile primitives for speedy kernels
☆2,541Updated this week
tensorflow / mlir-hlo
☆420Updated this week
microsoft / nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆992Updated 10 months ago
intel / neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R…
☆2,461Updated last week
dmlc / dlpack
common in-memory tensor structure
☆1,042Updated last month
mlcommons / inference
Reference implementations of MLPerf™ inference benchmarks
☆1,420Updated last week
jiazhihao / TASO
The Tensor Algebra SuperOptimizer for Deep Learning
☆726Updated 2 years ago
buddy-compiler / buddy-mlir
An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
☆611Updated this week
google / gemmlowp
Low-precision matrix multiplication
☆1,812Updated last year
tensor-compiler / taco
The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs
☆1,314Updated 3 months ago
flame / how-to-optimize-gemm
☆1,902Updated 2 years ago
NVIDIA / cub
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
☆1,765Updated last year