openhackathons-org / nways_multi_gpu
N-Ways to Multi-GPU Programming
☆18Updated last year
Alternatives and similar repositories for nways_multi_gpu:
Users that are interested in nways_multi_gpu are comparing it to the libraries listed below
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆51Updated 2 weeks ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆214Updated 3 months ago
- Repository with examples and exercises for OLCF and AMD's HIP training series☆15Updated last year
- ☆70Updated this week
- Reference implementations of MLPerf™ HPC training benchmarks☆46Updated 2 weeks ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆48Updated last month
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆61Updated this week
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆21Updated last year
- HPCG benchmark based on ROCm platform☆37Updated 2 weeks ago
- ☆17Updated 5 years ago
- Sources for the Oak Ridge Leadership Computing Facility User Documentation☆64Updated this week
- E4S for Spack☆31Updated last month
- Hands-on HPC I/O tutorial material☆14Updated 4 months ago
- RCCL Performance Benchmark Tests☆59Updated this week
- Utility for monitoring process, thread, OS and HW resources.☆16Updated 3 weeks ago
- Training examples for SYCL☆39Updated last month
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆30Updated 3 months ago
- ☆43Updated 4 years ago
- ALCF Computational Performance Workshop☆37Updated 2 years ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆31Updated 4 months ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆78Updated this week
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆55Updated this week
- Intermediate MPI lesson☆26Updated last year
- 🎃 GPU load-balancing library for regular and irregular computations.☆62Updated 9 months ago
- Advanced Profiling and Analytics for AMD Hardware☆141Updated this week
- CPE change log and release notes☆26Updated 6 months ago
- ☆36Updated 2 weeks ago
- RAJA Performance Suite☆119Updated this week
- ☆17Updated last year
- Scripts for building libraries with Cray's PE☆20Updated 3 years ago