spcl / daceLinks
DaCe - Data Centric Parallel Programming
☆544Updated this week
Alternatives and similar repositories for dace
Users that are interested in dace are comparing it to the libraries listed below
Sorting:
- Kernel Tuner☆355Updated last week
- Unified Collective Communication Library☆262Updated this week
- A Python compiler design toolkit.☆380Updated this week
- STREAM, for lots of devices written in many programming models☆346Updated 11 months ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆78Updated 2 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆209Updated 2 months ago
- ☆270Updated last month
- Pluto: An automatic polyhedral parallelizer and locality optimizer☆302Updated last month
- Advanced Profiling and Analytics for AMD Hardware☆161Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆138Updated last week
- ☆164Updated this week
- ☆249Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆247Updated this week
- Rich editor for SDFGs with included profiling and debugging, static analysis, and interactive optimization.☆20Updated 6 months ago
- A code generator for array-based code on CPUs and GPUs☆609Updated this week
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆287Updated last month
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆97Updated this week
- Python SYCL bindings and SYCL-based Python Array API library☆116Updated this week
- NPBench - A Benchmarking Suite for High-Performance NumPy☆86Updated 2 months ago
- Python interface for MLIR - the Multi-Level Intermediate Representation☆263Updated 8 months ago
- development repository for the open earth compiler☆80Updated 4 years ago
- Assembler for NVIDIA Volta and Turing GPUs☆226Updated 3 years ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆132Updated 5 years ago
- collection of benchmarks to measure basic GPU capabilities☆398Updated 5 months ago
- A Data-Centric Compiler for Machine Learning☆84Updated last year
- oneAPI Collective Communications Library (oneCCL)☆241Updated 3 weeks ago
- GPUOcelot: A dynamic compilation framework for PTX☆204Updated 5 months ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆62Updated last year
- RAJA Performance Suite☆119Updated this week
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81Updated 3 years ago