eth-cscs / DLA-Future
DLA-Future
☆71Updated last week
Alternatives and similar repositories for DLA-Future:
Users that are interested in DLA-Future are comparing it to the libraries listed below
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆86Updated this week
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆107Updated 2 weeks ago
- Department of Energy Standard Utility Library☆31Updated last month
- Portable HPC Containers (C++)☆48Updated this week
- Distributed View Extension for Kokkos☆45Updated 4 months ago
- Autonomic Performance Environment for eXascale (APEX)☆46Updated last week
- Partitioned Global Address Space (PGAS) library for distributed arrays☆102Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆121Updated 3 months ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆112Updated 3 months ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆58Updated this week
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo…☆30Updated last week
- RAJA Performance Suite☆117Updated 2 weeks ago
- CS infrastructure components for HPC applications☆170Updated this week
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆40Updated last year
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆65Updated last month
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆29Updated 9 months ago
- Molecular dynamics proxy application based on Kokkos☆33Updated 9 months ago
- pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support.☆72Updated this week
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆78Updated last week
- DARMA/vt => Virtual Transport☆36Updated last week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆107Updated last week
- Shroud: generate Fortran and Python wrappers for C and C++ libraries☆90Updated last week
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆50Updated 2 weeks ago
- Next generation LAPACK implementation for ROCm platform☆99Updated last week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated 2 weeks ago
- An MPI ABI compatibility layer☆32Updated last month
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time)☆44Updated this week
- DBCSR: Distributed Block Compressed Sparse Row matrix library☆142Updated last week
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆21Updated 6 months ago
- A streamlined CMake build system foundation for developing HPC software☆269Updated last month