spcl / dace
DaCe - Data Centric Parallel Programming
☆509Updated this week
Alternatives and similar repositories for dace:
Users that are interested in dace are comparing it to the libraries listed below
- Kernel Tuner☆311Updated last week
- A Data-Centric Compiler for Machine Learning☆82Updated last year
- NPBench - A Benchmarking Suite for High-Performance NumPy☆77Updated this week
- CUDA Kernel Benchmarking Library☆561Updated 3 months ago
- A code generator for array-based code on CPUs and GPUs☆598Updated this week
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆209Updated 2 months ago
- collection of benchmarks to measure basic GPU capabilities☆296Updated last week
- Pluto: An automatic polyhedral parallelizer and locality optimizer☆282Updated 9 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆198Updated 2 months ago
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated this week
- ☆233Updated last week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆130Updated this week
- A Python Compiler Design Toolkit☆312Updated this week
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆334Updated this week
- The Foundation for All Legate Libraries☆204Updated last week
- Rich editor for SDFGs with included profiling and debugging, static analysis, and interactive optimization.☆19Updated 3 weeks ago
- C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!☆511Updated 4 months ago
- Python SYCL bindings and SYCL-based Python Array API library☆109Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆349Updated this week
- RAJA Performance Suite☆118Updated this week
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing☆331Updated 10 months ago
- TPP experimentation on MLIR for linear algebra☆119Updated this week
- Data Parallel Extension for Numba☆79Updated 3 months ago
- ☆228Updated this week
- Advanced Profiling and Analytics for AMD Hardware☆140Updated this week
- Assembler for NVIDIA Volta and Turing GPUs☆212Updated 3 years ago
- GPUOcelot: A dynamic compilation framework for PTX☆166Updated last week
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆199Updated 4 months ago
- Rodinia benchmark☆170Updated last year
- This is the top-level repository for the Accel-Sim framework.☆343Updated this week