ICLDisco / dplasmaLinks
DPLASMA is a highly optimized, accelerator-aware, implementation of a dense linear algebra package for distributed heterogeneous systems. It is designed to deliver sustained performance for distributed systems where each node featuring multiple sockets of multicore processors, and if available, accelerators, using the PaRSEC runtime as a backen…
☆16Updated 9 months ago
Alternatives and similar repositories for dplasma
Users that are interested in dplasma are comparing it to the libraries listed below
Sorting:
- CPE change log and release notes☆26Updated last year
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆76Updated 3 months ago
- Molecular dynamics proxy application based on Kokkos☆33Updated last year
- Training examples for SYCL☆49Updated 2 months ago
- DBCSR: Distributed Block Compressed Sparse Row matrix library☆151Updated 2 weeks ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆131Updated 3 months ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Updated 6 months ago
- Partitioned Global Address Space (PGAS) library for distributed arrays☆110Updated this week
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆21Updated 3 years ago
- A C++based implementation of the TeaLeaf heat conduction mini-app. This implementation of TeaLeaf replicates the functionality of the ref…☆25Updated last year
- ☆19Updated 3 weeks ago
- Simplified Data Exchange for HPC Simulations☆237Updated last week
- Very-Low Overhead Checkpointing System☆59Updated 6 months ago
- Logger for MPI communication☆27Updated 2 years ago
- ☆35Updated 2 weeks ago
- ALCF Computational Performance Workshop☆38Updated 3 years ago
- Wrapper interface for MPI☆98Updated 4 months ago
- OpenACC* to OpenMP* API assisting migration tool☆39Updated last month
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆23Updated last year
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆69Updated 5 months ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆138Updated 3 weeks ago
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated last year
- Pragmatic, Productive, and Portable Affinity for HPC☆51Updated 3 weeks ago
- Python bindings for OpenSHMEM☆25Updated 3 weeks ago
- OpenMP vs Offload☆23Updated 2 years ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆59Updated last week
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆40Updated 3 weeks ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆37Updated last week
- A high-level Parallel I/O Library for structured grid applications☆22Updated this week
- Information about many aspects of high-performance computing. Wiki content moved to ~/docs.☆312Updated last month