spcl / dace
DaCe - Data Centric Parallel Programming
☆525Updated this week
Alternatives and similar repositories for dace
Users that are interested in dace are comparing it to the libraries listed below
Sorting:
- A Data-Centric Compiler for Machine Learning☆83Updated last year
- Unified Collective Communication Library☆252Updated this week
- STREAM, for lots of devices written in many programming models☆335Updated 8 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆205Updated last week
- Rich editor for SDFGs with included profiling and debugging, static analysis, and interactive optimization.☆19Updated 3 months ago
- CUDA Kernel Benchmarking Library☆639Updated last week
- Kernel Tuner☆336Updated this week
- NPBench - A Benchmarking Suite for High-Performance NumPy☆81Updated this week
- A Python Compiler Design Toolkit☆343Updated this week
- RAJA Performance Suite☆117Updated 2 weeks ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆258Updated last month
- collection of benchmarks to measure basic GPU capabilities☆372Updated 3 months ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆703Updated 2 months ago
- Advanced Profiling and Analytics for AMD Hardware☆154Updated this week
- ☆246Updated 3 months ago
- A code generator for array-based code on CPUs and GPUs☆601Updated last week
- Assembler for NVIDIA Volta and Turing GPUs☆218Updated 3 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆134Updated this week
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆431Updated last month
- A light-weight MPI profiler.☆94Updated 9 months ago
- Data Parallel Extension for Numba☆81Updated 6 months ago
- Pluto: An automatic polyhedral parallelizer and locality optimizer☆288Updated last month
- ☆244Updated this week
- C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!☆541Updated 7 months ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆77Updated last week
- Python SYCL bindings and SYCL-based Python Array API library☆111Updated this week
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆131Updated 4 years ago
- Official HPCG benchmark source code☆319Updated 10 months ago
- RAJA Performance Portability Layer (C++)☆516Updated this week
- HPCToolkit performance tools: measurement and analysis components☆341Updated 2 months ago