spcl / daceLinks
DaCe - Data Centric Parallel Programming
☆572Updated this week
Alternatives and similar repositories for dace
Users that are interested in dace are comparing it to the libraries listed below
Sorting:
- Kernel Tuner☆379Updated this week
- STREAM, for lots of devices written in many programming models☆354Updated 4 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆212Updated this week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆141Updated last week
- A code generator for array-based code on CPUs and GPUs☆621Updated last week
- Pluto: An automatic polyhedral parallelizer and locality optimizer☆316Updated 4 months ago
- Unified Collective Communication Library☆286Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆148Updated this week
- ☆298Updated 3 months ago
- ☆272Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆165Updated this week
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆471Updated 4 months ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆134Updated 5 years ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆82Updated 5 months ago
- A Data-Centric Compiler for Machine Learning☆85Updated last month
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆344Updated last month
- Python SYCL bindings and SYCL-based Python Array API library☆121Updated this week
- Assembler for NVIDIA Volta and Turing GPUs☆236Updated 4 years ago
- CUDA Kernel Benchmarking Library☆798Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆253Updated 2 weeks ago
- Official HPCG benchmark source code☆337Updated last year
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆859Updated 3 months ago
- Rodinia benchmark☆199Updated 2 years ago
- development repository for the open earth compiler☆81Updated 4 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆91Updated last month
- HPCToolkit performance tools: measurement and analysis components☆343Updated 11 months ago
- The Foundation for All Legate Libraries☆233Updated this week
- 🎃 GPU load-balancing library for regular and irregular computations.☆64Updated 4 months ago
- Online CUDA Occupancy Calculator☆81Updated 4 years ago
- TPP experimentation on MLIR for linear algebra☆142Updated last month