spcl / dace
DaCe - Data Centric Parallel Programming
☆558 · Updated this week
Alternatives and similar repositories for dace
Users who are interested in dace are comparing it to the libraries listed below.
- Kernel Tuner ☆370 · Updated this week
- STREAM, for lots of devices written in many programming models ☆350 · Updated last month
- ☆287 · Updated last month
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆209 · Updated last week
- Pluto: An automatic polyhedral parallelizer and locality optimizer ☆307 · Updated 2 months ago
- Unified Collective Communication Library ☆276 · Updated last week
- ☆183 · Updated last week
- NPBench - A Benchmarking Suite for High-Performance NumPy ☆89 · Updated 3 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆165 · Updated this week
- A code generator for array-based code on CPUs and GPUs ☆615 · Updated last week
- A Python compiler design toolkit. ☆433 · Updated this week
- Python SYCL bindings and SYCL-based Python Array API library ☆117 · Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆145 · Updated last week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform. ☆121 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆253 · Updated 2 weeks ago
- ☆267 · Updated this week
- Benchmark for measuring the performance of sparse and irregular memory access. ☆79 · Updated 2 months ago
- Assembler for NVIDIA Volta and Turing GPUs ☆231 · Updated 3 years ago
- This is a set of simple programs that can be used to explore the features of a parallel platform. ☆462 · Updated 2 months ago
- A light-weight MPI profiler. ☆101 · Updated 3 weeks ago
- collection of benchmarks to measure basic GPU capabilities ☆436 · Updated this week
- A Data-Centric Compiler for Machine Learning ☆85 · Updated last year
- Python interface for MLIR - the Multi-Level Intermediate Representation ☆268 · Updated 11 months ago
- Rodinia benchmark ☆189 · Updated 2 years ago
- GPUOcelot: A dynamic compilation framework for PTX ☆210 · Updated 8 months ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial ☆310 · Updated this week
- POC work on MLIR backend ☆60 · Updated last year
- MLIR Sample dialect ☆131 · Updated 8 months ago
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services ☆176 · Updated this week
- development repository for the open earth compiler ☆80 · Updated 4 years ago