olcf-tutorials / local_mpi_to_gpu
How to use node-local MPI rank IDs to manually map MPI ranks to GPUs
☆13 · Updated 4 years ago
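As a rough illustration of the idea in the description above, the sketch below picks a GPU index from a node-local rank. It is not taken from the repository. The function names `local_rank_from_env` and `gpu_for_local_rank` are hypothetical, and the round-robin (modulo) assignment is an assumed policy. The environment variables are the ones Open MPI, MVAPICH2, and Slurm are known to export for the node-local rank; other launchers may use different names.

```python
import os

# Environment variables that common launchers set to the node-local rank.
# Checked in this order; the ordering is an arbitrary choice for this sketch.
LOCAL_RANK_VARS = (
    "OMPI_COMM_WORLD_LOCAL_RANK",  # Open MPI
    "MV2_COMM_WORLD_LOCAL_RANK",   # MVAPICH2
    "SLURM_LOCALID",               # Slurm (srun)
)


def local_rank_from_env(env=None):
    """Return the node-local rank found in the environment (hypothetical helper)."""
    env = os.environ if env is None else env
    for name in LOCAL_RANK_VARS:
        value = env.get(name)
        if value is not None:
            return int(value)
    raise RuntimeError("no known node-local rank variable is set")


def gpu_for_local_rank(local_rank, gpus_per_node):
    """Map a node-local rank to a GPU index, round-robin (assumed policy)."""
    if gpus_per_node <= 0:
        raise ValueError("gpus_per_node must be positive")
    if local_rank < 0:
        raise ValueError("local_rank must be non-negative")
    # Ranks beyond the GPU count wrap around and share a device.
    return local_rank % gpus_per_node
```

In a compiled MPI code the same local rank could instead be obtained by splitting `MPI_COMM_WORLD` with `MPI_Comm_split_type` and `MPI_COMM_TYPE_SHARED`, then passing the resulting index to `cudaSetDevice`. For example, `gpu_for_local_rank(local_rank_from_env(), 6)` would select a device on a node that is assumed to have six GPUs.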
Alternatives and similar repositories for local_mpi_to_gpu
Users who are interested in local_mpi_to_gpu are comparing it to the repositories listed below:
- Intermediate MPI lesson ☆26 · Updated last year
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi… ☆27 · Updated 6 months ago
- Molecular dynamics proxy application based on Kokkos ☆32 · Updated 7 months ago
- OpenMP vs Offload ☆21 · Updated last year
- A compression benchmark suite ☆17 · Updated last year
- Logger for MPI communication ☆26 · Updated last year
- Comb is a communication performance benchmarking tool. ☆24 · Updated last year
- Benchmark implementation of CosmoFlow in TensorFlow Keras ☆21 · Updated last year
- Tensor Algebra Library Routines for Shared Memory Systems ☆38 · Updated last year
- Scripts for running various benchmarks on Isambard and other systems. ☆28 · Updated 3 years ago
- MiniMD Molecular Dynamics Mini-App ☆49 · Updated 6 months ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte… ☆23 · Updated 3 months ago
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core … ☆54 · Updated this week
- CPE change log and release notes ☆26 · Updated 5 months ago
- This tutorial demonstrates how to use CUDA-Aware MPI ☆38 · Updated last year
- Molecular dynamics proxy application based on Cabana ☆20 · Updated this week
- Very-Low Overhead Checkpointing System ☆55 · Updated last month
- Training examples for SYCL ☆39 · Updated 3 weeks ago
- A light-weight MPI profiler. ☆87 · Updated 6 months ago
- A benchmark suite for measuring HDF5 performance. ☆40 · Updated 6 months ago
- Distributed View Extension for Kokkos ☆44 · Updated 2 months ago
- ☆14 · Updated this week
- This aims to be a wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i… ☆22 · Updated 4 months ago
- Python bindings for data interoperability with Kokkos (View, DynRankView) ☆26 · Updated 5 months ago
- DPLASMA is a highly optimized, accelerator-aware, implementation of a dense linear algebra package for distributed heterogeneous systems… ☆12 · Updated 2 months ago
- MiniFE Finite Element Mini-Application ☆31 · Updated 9 months ago
- ALCF Computational Performance Workshop ☆37 · Updated 2 years ago
- ☆63 · Updated this week
- ☆30 · Updated 9 months ago