spcl / dace
DaCe - Data Centric Parallel Programming
☆515Updated this week
Alternatives and similar repositories for dace:
Users that are interested in dace are comparing it to the libraries listed below
- A Data-Centric Compiler for Machine Learning☆82Updated last year
- Kernel Tuner☆325Updated this week
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆203Updated 3 months ago
- Unified Collective Communication Library☆237Updated this week
- CUDA Kernel Benchmarking Library☆593Updated last week
- Rich editor for SDFGs with included profiling and debugging, static analysis, and interactive optimization.☆19Updated last month
- Pluto: An automatic polyhedral parallelizer and locality optimizer☆283Updated last week
- A code generator for array-based code on CPUs and GPUs☆599Updated this week
- STREAM, for lots of devices written in many programming models☆330Updated 6 months ago
- collection of benchmarks to measure basic GPU capabilities☆308Updated last month
- ☆232Updated this week
- NPBench - A Benchmarking Suite for High-Performance NumPy☆80Updated this week
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆663Updated last month
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆247Updated this week
- The Foundation for All Legate Libraries☆206Updated this week
- Python interface for MLIR - the Multi-Level Intermediate Representation☆247Updated 3 months ago
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated this week
- A Python Compiler Design Toolkit☆322Updated this week
- ☆236Updated last month
- C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!☆524Updated 5 months ago
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆423Updated last week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆129Updated last week
- ☆524Updated last week
- Assembler for NVIDIA Volta and Turing GPUs☆214Updated 3 years ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆77Updated last month
- This is the top-level repository for the Accel-Sim framework.☆373Updated this week
- KvikIO - High Performance File IO☆195Updated this week
- A scalable High-Level Synthesis framework on MLIR☆252Updated 10 months ago
- Advanced Profiling and Analytics for AMD Hardware☆142Updated this week
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆130Updated 4 years ago