ICLDisco / dplasma
DPLASMA is a highly optimized, accelerator-aware, implementation of a dense linear algebra package for distributed heterogeneous systems. It is designed to deliver sustained performance for distributed systems where each node featuring multiple sockets of multicore processors, and if available, accelerators, using the PaRSEC runtime as a backen…
☆11Updated last week
Related projects ⓘ
Alternatives and complementary repositories for dplasma
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆50Updated this week
- Molecular dynamics proxy application based on Kokkos☆31Updated 3 months ago
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆21Updated 3 weeks ago
- Training examples for SYCL☆38Updated 2 weeks ago
- CPE change log and release notes☆26Updated 2 months ago
- Partitioned Global Address Space (PGAS) library for distributed arrays☆100Updated this week
- Distributed View Extension for Kokkos☆43Updated 2 months ago
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated 3 months ago
- MiniMD Molecular Dynamics Mini-App☆48Updated 3 months ago
- DBCSR: Distributed Block Compressed Sparse Row matrix library☆135Updated this week
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆57Updated last week
- RAJA Performance Suite☆110Updated this week
- OpenACC* to OpenMP* API assisting migration tool☆32Updated 2 weeks ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆34Updated last month
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆112Updated 2 months ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆54Updated this week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆100Updated last year
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆36Updated 4 months ago
- CS infrastructure components for HPC applications☆157Updated this week
- Experimental MPI Wrapper for Kokkos☆16Updated this week
- ☆14Updated last week
- ☆10Updated 3 months ago
- Autonomic Performance Environment for eXascale (APEX)☆38Updated last week
- ☆48Updated this week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆92Updated last week
- DLA-Future☆65Updated this week
- OpenMP vs Offload☆21Updated last year
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆48Updated 3 months ago
- ☆14Updated this week
- Comb is a communication performance benchmarking tool.☆23Updated last year