ICLDisco / dplasmaLinks
DPLASMA is a highly optimized, accelerator-aware, implementation of a dense linear algebra package for distributed heterogeneous systems. It is designed to deliver sustained performance for distributed systems where each node featuring multiple sockets of multicore processors, and if available, accelerators, using the PaRSEC runtime as a backen…
☆15Updated 6 months ago
Alternatives and similar repositories for dplasma
Users that are interested in dplasma are comparing it to the libraries listed below
Sorting:
- DBCSR: Distributed Block Compressed Sparse Row matrix library☆146Updated this week
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆72Updated 2 weeks ago
- Molecular dynamics proxy application based on Kokkos☆33Updated last year
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 3 years ago
- Partitioned Global Address Space (PGAS) library for distributed arrays☆107Updated last week
- A high-level Parallel I/O Library for structured grid applications☆22Updated 3 weeks ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆69Updated 2 months ago
- Comb is a communication performance benchmarking tool.☆25Updated 2 years ago
- OpenACC* to OpenMP* API assisting migration tool☆38Updated last month
- CPE change log and release notes☆26Updated last year
- ☆17Updated this week
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆45Updated last year
- ☆128Updated last week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated 7 months ago
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆23Updated last year
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated last year
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆41Updated last month
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆54Updated 3 months ago
- Distributed View Extension for Kokkos☆48Updated 11 months ago
- OpenMP vs Offload☆22Updated 2 years ago
- ALCF Computational Performance Workshop☆38Updated 3 years ago
- ☆19Updated 2 months ago
- Scripts for running various benchmarks on Isambard and other systems.☆29Updated 4 years ago
- Lecture and hands-on material for Track 8- Machine Learning of Argonne Training Program on Extreme-Scale Computing☆45Updated 2 months ago
- Molecular dynamics proxy application based on Cabana☆21Updated 8 months ago
- RAJA Performance Suite☆125Updated this week
- Logger for MPI communication☆27Updated 2 years ago
- Wrapper interface for MPI☆97Updated last month
- An open collaborative repository for reproducible specifications of HPC benchmarks and cross site benchmarking environments☆60Updated last week
- A little library giving you a live monitoring of MPI programs.☆25Updated 3 years ago