spcl / dacemlLinks
A Data-Centric Compiler for Machine Learning
☆83Updated last year
Alternatives and similar repositories for daceml
Users that are interested in daceml are comparing it to the libraries listed below
Sorting:
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆138Updated 2 years ago
- Data-Centric MLIR dialect☆42Updated last year
- MLIR-based partitioning system☆86Updated this week
- A lightweight, Pythonic, frontend for MLIR☆81Updated last year
- ☆96Updated last year
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated last year
- A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores☆51Updated last year
- ☆50Updated last year
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆88Updated this week
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆38Updated 10 months ago
- Memory Optimizations for Deep Learning (ICML 2023)☆64Updated last year
- ☆51Updated 5 years ago
- SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar…☆74Updated 2 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆71Updated 4 years ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆88Updated 2 years ago
- Artifacts of EVT ASPLOS'24☆25Updated last year
- ☆99Updated this week
- ☆19Updated 3 weeks ago
- A GPU algorithm for sparse matrix-matrix multiplication☆70Updated 4 years ago
- development repository for the open earth compiler☆80Updated 4 years ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆153Updated this week
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆241Updated this week
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆99Updated last week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆43Updated 2 months ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆110Updated 6 months ago
- ☆146Updated 10 months ago
- ☆30Updated 2 years ago
- ☆23Updated 6 months ago
- A schedule language for large model training☆148Updated 11 months ago
- ☆44Updated 4 years ago