openxla / stablehlo

Backward-compatible ML compute opset inspired by HLO/MHLO

☆514

Alternatives and similar repositories for stablehlo

Users who are interested in stablehlo are comparing it to the libraries listed below.

tensorflow / mlir-hlo
☆420 Updated last week
openxla / community
Stores documents and resources used by the OpenXLA developer community
☆126 Updated last year
onnx / onnx-mlir
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
☆892 Updated last week
microsoft / triton-shared
Shared Middle-Layer for Triton Compilation
☆261 Updated this week
llvm / torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
☆1,595 Updated this week
hidet-org / hidet
An open-source efficient deep learning framework/compiler, written in python.
☆710 Updated last week
openxla / shardy
MLIR-based partitioning system
☆115 Updated this week
NVIDIA / Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
☆345 Updated this week
albanD / subclass_zoo
☆171 Updated last year
spcl / pymlir
Python interface for MLIR - the Multi-Level Intermediate Representation
☆263 Updated 8 months ago
NVIDIA / TensorRT-Incubator
Experimental projects related to TensorRT
☆108 Updated last week
triton-lang / triton-cpu
An experimental CPU backend for Triton
☆138 Updated 2 months ago
facebookresearch / HolisticTraceAnalysis
A library to analyze PyTorch traces.
☆400 Updated this week
google / aqt
☆323 Updated this week
intel / intel-xpu-backend-for-triton
OpenAI Triton backend for Intel® GPUs
☆197 Updated this week
jiazhihao / TASO
The Tensor Algebra SuperOptimizer for Deep Learning
☆726 Updated 2 years ago
iree-org / iree-turbine
IREE's PyTorch Frontend, based on Torch Dynamo.
☆94 Updated this week
intel / mlir-extensions
Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.
☆138 Updated this week
tlc-pack / relax
☆196 Updated 2 years ago
mmperf / mmperf
MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.
☆134 Updated last year
pytorch / torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,056 Updated last year
ROCm / composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
☆444 Updated this week
cloudcores / CuAssembler
An unofficial cuda assembler, for all generations of SASS, hopefully ：）
☆525 Updated 2 years ago
daadaada / turingas
Assembler for NVIDIA Volta and Turing GPUs
☆226 Updated 3 years ago
xdslproject / xdsl
A Python compiler design toolkit.
☆380 Updated this week
nod-ai / SHARK-ModelDev
Unified compiler/runtime for interfacing with PyTorch Dynamo.
☆101 Updated 3 weeks ago
bytedance / byteir
A model compilation solution for various hardware
☆439 Updated last week
jax-ml / jax-triton
jax-triton contains integrations between JAX and OpenAI Triton
☆411 Updated last month
google-research / sputnik
A library of GPU kernels for sparse matrix operations.
☆270 Updated 4 years ago
dmlc / dlpack
common in-memory tensor structure
☆1,042 Updated last month