ICLDisco / dplasmaLinks
DPLASMA is a highly optimized, accelerator-aware, implementation of a dense linear algebra package for distributed heterogeneous systems. It is designed to deliver sustained performance for distributed systems where each node featuring multiple sockets of multicore processors, and if available, accelerators, using the PaRSEC runtime as a backen…
☆16Updated 7 months ago
Alternatives and similar repositories for dplasma
Users that are interested in dplasma are comparing it to the libraries listed below
Sorting:
- Molecular dynamics proxy application based on Kokkos☆33Updated last year
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 3 years ago
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆75Updated last month
- CPE change log and release notes☆26Updated last year
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆25Updated 6 months ago
- Python bindings for OpenSHMEM☆25Updated this week
- Partitioned Global Address Space (PGAS) library for distributed arrays☆108Updated this week
- ☆17Updated last week
- DBCSR: Distributed Block Compressed Sparse Row matrix library☆146Updated this week
- Pragmatic, Productive, and Portable Affinity for HPC☆49Updated last week
- Training examples for SYCL☆49Updated 3 weeks ago
- Wrapper interface for MPI☆97Updated 2 months ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆46Updated last year
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated last year
- Comb is a communication performance benchmarking tool.☆25Updated 2 years ago
- ☆135Updated last week
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆69Updated 3 months ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆131Updated last month
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆53Updated 4 months ago
- ☆34Updated 2 months ago
- The JUBE benchmarking environment provides a script based framework to easily create benchmark sets, run those sets on different computer…☆42Updated last year
- An MPI ABI compatibility layer☆34Updated 3 months ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆64Updated last month
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated last month
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆23Updated last year
- Distributed View Extension for Kokkos☆48Updated last year
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆41Updated 2 months ago
- Scripts for running various benchmarks on Isambard and other systems.☆29Updated 4 years ago
- An open collaborative repository for reproducible specifications of HPC benchmarks and cross site benchmarking environments☆61Updated last week
- A light-weight MPI profiler.☆102Updated 2 months ago