eth-cscs / DLA-Future
☆63Updated this week
Related projects:
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆104Updated this week
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA☆78Updated last month
- Autonomic Performance Environment for eXascale (APEX)☆38Updated this week
- Portable HPC Containers (C++)☆47Updated 2 weeks ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy…☆84Updated 2 months ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆109Updated 2 weeks ago
- Kokkos Remote Spaces implements distributed Kokkos Views and related APIs for distributed parallel programming.☆42Updated 2 weeks ago
- CS infrastructure components for HPC applications☆150Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆34Updated 8 months ago
- pika builds on C++ std::execution with fiber, CUDA, HIP, and MPI support.☆62Updated this week
- RAJA Performance Suite☆110Updated last week
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo…☆28Updated this week
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆21Updated last week
- Partitioned Global Address Space (PGAS) library for distributed arrays☆97Updated this week
- DARMA/vt => Virtual Transport☆35Updated this week
- Department of Energy Standard Utility Library☆29Updated 2 weeks ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆62Updated 2 months ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆47Updated last month
- This aims to be a wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆20Updated 2 months ago
- TTG: Template Task Graph C++ API☆18Updated last month
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆27Updated 2 months ago
- Repository for collecting, curating and maintaining up to date CMake scripts.☆9Updated 3 years ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆46Updated this week
- An OpenMP runtime implemented using HPX☆23Updated 2 years ago
- Performance-portable geometric search library☆177Updated this week
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time)☆38Updated this week
- Molecular dynamics proxy application based on Kokkos☆30Updated 2 months ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆39Updated 7 months ago
- Generate simple index ranges in C++ and CUDA C++☆38Updated last year
- Header-only C++20 wrapper for MPI 4.0.☆43Updated 10 months ago