openxla / stablehloLinks
Backward compatible ML compute opset inspired by HLO/MHLO
☆590Updated last week
Alternatives and similar repositories for stablehlo
Users that are interested in stablehlo are comparing it to the libraries listed below
Sorting:
- Stores documents and resources used by the OpenXLA developer community☆132Updated last year
- ☆423Updated last week
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆961Updated last week
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,723Updated this week
- Shared Middle-Layer for Triton Compilation☆323Updated last month
- An open-source efficient deep learning framework/compiler, written in python.☆739Updated 4 months ago
- MLIR-based partitioning system☆160Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆371Updated this week
- Experimental projects related to TensorRT☆117Updated this week
- ☆187Updated last year
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆711Updated this week
- An experimental CPU backend for Triton☆170Updated 2 months ago
- A library to analyze PyTorch traces.☆456Updated this week
- Python interface for MLIR - the Multi-Level Intermediate Representation☆273Updated last year
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆508Updated this week
- OpenAI Triton backend for Intel® GPUs☆223Updated this week
- ☆342Updated last week
- The Tensor Algebra SuperOptimizer for Deep Learning☆734Updated 2 years ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆566Updated 2 years ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆103Updated last week
- CUDA Kernel Benchmarking Library☆798Updated last week
- A Quirky Assortment of CuTe Kernels☆749Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆104Updated 3 weeks ago
- Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.☆746Updated this week
- BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.☆743Updated 5 months ago
- Assembler for NVIDIA Volta and Turing GPUs☆236Updated 4 years ago
- A model compilation solution for various hardware☆461Updated 4 months ago
- A library of GPU kernels for sparse matrix operations.☆281Updated 5 years ago
- CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-base…☆773Updated this week
- A Python compiler design toolkit.☆471Updated this week