spcl / daceml
A Data-Centric Compiler for Machine Learning
☆82Updated last year
Alternatives and similar repositories for daceml:
Users that are interested in daceml are comparing it to the libraries listed below
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆134Updated last year
- A lightweight, Pythonic, frontend for MLIR☆80Updated last year
- ☆87Updated 10 months ago
- Data-Centric MLIR dialect☆40Updated last year
- MLIR-based partitioning system☆62Updated this week
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆105Updated 2 months ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆72Updated 4 years ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆85Updated 2 years ago
- ☆47Updated 5 years ago
- A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores☆48Updated last year
- IREE's PyTorch Frontend, based on Torch Dynamo.☆71Updated this week
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆37Updated 6 months ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated last year
- ☆42Updated 4 years ago
- ☆75Updated 2 years ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆73Updated last year
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆130Updated this week
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆107Updated 2 years ago
- A library of GPU kernels for sparse matrix operations.☆255Updated 4 years ago
- TPP experimentation on MLIR for linear algebra☆119Updated this week
- Re-implementation of the TASO compiler using equality saturation☆123Updated 3 years ago
- ☆73Updated 3 years ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆38Updated 9 months ago
- Dissecting NVIDIA GPU Architecture☆88Updated 2 years ago
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆79Updated this week
- Optimize tensor program fast with Felix, a gradient descent autotuner.☆24Updated 9 months ago
- ☆48Updated 11 months ago
- ☆39Updated 4 years ago
- ☆30Updated 2 years ago
- ☆72Updated 2 months ago