LLNL / RAJA
RAJA Performance Portability Layer (C++)
☆486Updated this week
Related projects ⓘ
Alternatives and complementary repositories for RAJA
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆310Updated this week
- A streamlined CMake build system foundation for developing HPC software☆260Updated last week
- Caliper is an instrumentation and performance profiling library☆350Updated last week
- Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem☆294Updated 2 months ago
- Next generation of ADIOS developed in the Exascale Computing Program☆272Updated this week
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms.☆393Updated 3 months ago
- Abstraction Library for Parallel Kernel Acceleration☆356Updated this week
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆411Updated this week
- The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information.☆206Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆112Updated 2 months ago
- RAJA Performance Suite☆110Updated this week
- Performance-portable geometric search library☆182Updated this week
- Simplified Data Exchange for HPC Simulations☆212Updated 2 weeks ago
- An application-focused API for memory management on NUMA & GPU architectures☆323Updated this week
- CS infrastructure components for HPC applications☆157Updated this week
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆106Updated last week
- A massively-parallel, block-sparse tensor framework written in C++☆256Updated this week
- A flyweight in situ visualization and analysis runtime for multi-physics HPC simulations☆195Updated this week
- STREAM, for lots of devices written in many programming models☆325Updated 2 months ago
- Distributed memory, MPI based SuperLU☆188Updated this week
- An implementation of BLAS using the SYCL open standard.☆259Updated last week
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template…☆353Updated 3 months ago
- Numerical linear algebra software package☆406Updated this week
- A C++17 message passing library based on MPI☆167Updated 9 months ago
- High-level C++ for Accelerator Clusters☆142Updated this week
- Performance-portable library for particle-based simulations☆212Updated 2 weeks ago
- Parallel solvers for sparse linear systems featuring multigrid methods.☆696Updated this week
- QUDA is a library for performing calculations in lattice QCD on GPUs.☆293Updated this week
- Run a parallel command inside a split tmux window☆136Updated 2 years ago
- Distributed multigrid linear solver library on GPU☆492Updated 2 months ago