spcl / daceml
A Data-Centric Compiler for Machine Learning
☆82Updated last year
Alternatives and similar repositories for daceml:
Users that are interested in daceml are comparing it to the libraries listed below
- Data-Centric MLIR dialect☆40Updated last year
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆135Updated last year
- MLIR-based partitioning system☆73Updated this week
- A lightweight, Pythonic, frontend for MLIR☆80Updated last year
- GEMM and Winograd based convolutions using CUTLASS☆26Updated 4 years ago
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing. By pro…☆68Updated this week
- ☆91Updated 11 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆39Updated last week
- ☆73Updated 4 months ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆107Updated 3 months ago
- ☆75Updated 2 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆73Updated 4 years ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆77Updated last month
- SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar…☆70Updated 2 years ago
- ☆73Updated 3 years ago
- ☆17Updated 5 years ago
- ☆49Updated last year
- GVProf: A Value Profiler for GPU-based Clusters☆49Updated last year
- A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores☆50Updated last year
- An IR for efficiently simulating distributed ML computation.☆28Updated last year
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆86Updated 2 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated last year
- ☆48Updated 5 years ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆72Updated this week
- Benchmark PyTorch Custom Operators☆14Updated last year
- Code base for OOPSLA'24 paper: UniSparse: An Intermediate Language for General Sparse Format Customization☆30Updated 4 months ago
- ☆23Updated 3 months ago
- Re-implementation of the TASO compiler using equality saturation☆125Updated 3 years ago
- ☆24Updated 3 months ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆130Updated 3 years ago