spcl / dace
DaCe - Data Centric Parallel Programming
☆559Updated this week
Alternatives and similar repositories for dace
Users who are interested in dace are comparing it to the libraries listed below.
- Kernel Tuner☆372Updated last week
- STREAM, for lots of devices written in many programming models☆351Updated 2 months ago
- Unified Collective Communication Library☆278Updated last week
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆211Updated 2 weeks ago
- ☆288Updated last month
- A code generator for array-based code on CPUs and GPUs☆616Updated last week
- Pluto: An automatic polyhedral parallelizer and locality optimizer☆308Updated 2 months ago
- A Data-Centric Compiler for Machine Learning☆85Updated last year
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆126Updated this week
- Benchmark for measuring the performance of sparse and irregular memory access.☆80Updated 2 months ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆134Updated 5 years ago
- A Python compiler design toolkit.☆444Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆145Updated this week
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆465Updated 2 months ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆829Updated last month
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆165Updated last week
- Rich editor for SDFGs with included profiling and debugging, static analysis, and interactive optimization.☆20Updated 2 weeks ago
- ☆267Updated last week
- Rodinia benchmark☆193Updated 2 years ago
- HPCToolkit performance tools: measurement and analysis components☆344Updated 8 months ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆321Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆254Updated last week
- Python interface for MLIR - the Multi-Level Intermediate Representation☆270Updated 11 months ago
- ☆185Updated last week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆362Updated this week
- Official HPCG benchmark source code☆328Updated last year
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services☆180Updated last week
- collection of benchmarks to measure basic GPU capabilities☆456Updated 3 weeks ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆427Updated 10 months ago
- Includes Python bindings to instrumentation and tracing technology (ITT) APIs for VTune☆25Updated last year