NVIDIA / mpi-acxLinks
MPI accelerator-integrated communication extensions
☆33Updated 2 years ago
Alternatives and similar repositories for mpi-acx
Users that are interested in mpi-acx are comparing it to the libraries listed below
Sorting:
- Comb is a communication performance benchmarking tool.☆25Updated 2 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆41Updated last year
- HPCG benchmark based on ROCm platform☆37Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆56Updated last month
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time)☆46Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆54Updated last week
- RAJA Performance Suite☆117Updated this week
- ☆18Updated last year
- Department of Energy Standard Utility Library☆31Updated last week
- A unified framework across multiple programming platforms☆38Updated 11 months ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆119Updated this week
- Distributed View Extension for Kokkos☆46Updated 6 months ago
- Autonomic Performance Environment for eXascale (APEX)☆48Updated 2 weeks ago
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in…☆25Updated this week
- NAS Parallel Benchmarks for evaluating GPU and APIs☆25Updated last week
- Logger for MPI communication☆27Updated last year
- OpenMP vs Offload☆21Updated 2 years ago
- Very-Low Overhead Checkpointing System☆57Updated 4 months ago
- ROCm SPARSE marshalling library☆67Updated this week
- A tracing infrastructure for heterogeneous computing applications.☆33Updated last week
- ☆36Updated this week
- ROCm Systems Profiler☆18Updated this week
- Next generation library for iterative sparse solvers for ROCm platform☆81Updated this week
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆24Updated 6 months ago
- Advanced Profiling and Analytics for AMD Hardware☆156Updated this week
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Updated 2 months ago
- ☆95Updated this week
- CPE change log and release notes☆26Updated 8 months ago
- Next generation LAPACK implementation for ROCm platform☆101Updated this week
- Benchmarks☆17Updated last month