olcf-tutorials / local_mpi_to_gpuLinks
How to use node-local MPI rank IDs to manually map MPI ranks to GPUs
☆14Updated 5 years ago
Alternatives and similar repositories for local_mpi_to_gpu
Users that are interested in local_mpi_to_gpu are comparing it to the libraries listed below
Sorting:
- ☆145Updated last week
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆46Updated 2 years ago
- Pragmatic, Productive, and Portable Affinity for HPC☆51Updated 3 weeks ago
- Collective and Neighbor Collective Optimizations and Extensions☆13Updated last week
- CPU and GPU tutorial examples☆13Updated 10 months ago
- ☆11Updated 10 months ago
- Molecular dynamics proxy application based on Kokkos☆33Updated last year
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆76Updated 3 months ago
- A website covering major HPC technologies, designed to welcome contributions.☆78Updated last year
- Comb is a communication performance benchmarking tool.☆26Updated 2 years ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Updated last year
- This tutorial demonstrates how to use CUDA-Aware MPI☆39Updated 2 years ago
- ALCF Computational Performance Workshop☆38Updated 3 years ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆25Updated 8 months ago
- Scripts for running various benchmarks on Isambard and other systems.☆29Updated 4 years ago
- Intermediate MPI lesson☆27Updated 2 years ago
- MPI accelerator-integrated communication extensions☆39Updated 2 years ago
- Training examples for SYCL☆49Updated 2 months ago
- ☆12Updated 6 months ago
- Very-Low Overhead Checkpointing System☆59Updated 6 months ago
- A light-weight MPI profiler.☆105Updated 4 months ago
- A parallel programming training mini app simulating weather-like flows☆173Updated 5 months ago
- Logger for MPI communication☆27Updated 2 years ago
- Distributed View Extension for Kokkos☆49Updated last year
- OpenMP vs Offload☆23Updated 2 years ago
- ☆18Updated 2 years ago
- RAJA Performance Suite☆130Updated this week
- CPE change log and release notes☆26Updated last year
- An open collaborative repository for reproducible specifications of HPC benchmarks and cross site benchmarking environments☆65Updated this week
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Updated 6 months ago