ICLDisco / dplasma
DPLASMA is a highly optimized, accelerator-aware, implementation of a dense linear algebra package for distributed heterogeneous systems. It is designed to deliver sustained performance for distributed systems where each node featuring multiple sockets of multicore processors, and if available, accelerators, using the PaRSEC runtime as a backen…
☆12Updated last week
Alternatives and similar repositories for dplasma:
Users that are interested in dplasma are comparing it to the libraries listed below
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆56Updated last week
- Molecular dynamics proxy application based on Kokkos☆32Updated 8 months ago
- Distributed View Extension for Kokkos☆45Updated 3 months ago
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆22Updated 5 months ago
- CPE change log and release notes☆26Updated 6 months ago
- Training examples for SYCL☆39Updated 2 months ago
- DBCSR: Distributed Block Compressed Sparse Row matrix library☆142Updated last week
- ☆14Updated last week
- Partitioned Global Address Space (PGAS) library for distributed arrays☆101Updated this week
- OpenMP vs Offload☆21Updated last year
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆121Updated 2 months ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆110Updated 2 months ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆62Updated last week
- CS infrastructure components for HPC applications☆170Updated this week
- MiniMD Molecular Dynamics Mini-App☆50Updated 3 weeks ago
- ☆76Updated this week
- RAJA Performance Suite☆118Updated this week
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆39Updated 3 months ago
- Very-Low Overhead Checkpointing System☆57Updated 2 months ago
- A shared-memory FFT for the Kokkos ecosystem☆31Updated this week
- Molecular dynamics proxy application based on Cabana☆21Updated last month
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆107Updated last week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated 6 months ago
- Performance benchmarks and regression tests for the ExCALIBUR project☆25Updated this week
- A neutral particle transport mini-app to study performance of sweeps on unstructured, 3D tetrahedral meshes.☆18Updated 2 years ago
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated 8 months ago
- How to use node-local MPI rank IDs to manually map MPI ranks to GPUs☆13Updated 4 years ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆58Updated this week
- Autonomic Performance Environment for eXascale (APEX)☆44Updated last week
- Implementation of MPI that supports large counts☆48Updated 4 months ago