pnnl / SHAD
Scalable High-performance Algorithms and Data-structures
☆128 · Updated last month
Alternatives and similar repositories for SHAD:
Users who are interested in SHAD are comparing it to the libraries listed below:
- Autonomic Performance Environment for eXascale (APEX) ☆44 · Updated this week
- Copy-hiding array abstraction to automatically migrate data between memory spaces ☆106 · Updated this week
- RAJA Performance Suite ☆118 · Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆120 · Updated last month
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA ☆83 · Updated this week
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri… ☆25 · Updated last month
- Caliper is an instrumentation and performance profiling library ☆369 · Updated 2 weeks ago
- YASK (Yet Another Stencil Kit): a domain-specific language and framework to create high-performance stencil code for implementing finite-d… ☆106 · Updated 7 months ago
- Partitioned Global Address Space (PGAS) library for distributed arrays ☆101 · Updated this week
- Distributed View Extension for Kokkos ☆45 · Updated 3 months ago
- Loop Kernel Analysis and Performance Modeling Toolkit ☆91 · Updated 6 months ago
- OpenSHMEM Application Programming Interface ☆53 · Updated 3 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆105 · Updated last year
- Intel Data Parallel C++ (and SYCL 2020) Tutorial ☆93 · Updated 3 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆71 · Updated 2 weeks ago
- Advanced Profiling and Analytics for AMD Hardware ☆140 · Updated this week
- DASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science ☆157 · Updated 3 years ago
- A streamlined CMake build system foundation for developing HPC software ☆270 · Updated last week
- The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information. ☆210 · Updated this week
- A unified framework across multiple programming platforms ☆36 · Updated 8 months ago
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template… ☆358 · Updated 7 months ago
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels ☆328 · Updated this week
- Global Memory and Threading runtime system ☆23 · Updated 9 months ago
- RAJA Performance Portability Layer (C++) ☆505 · Updated this week
- High-level C++ for Accelerator Clusters ☆145 · Updated last month
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte… ☆23 · Updated 3 months ago
- Tools to create performance and roofline plots from measured data ☆58 · Updated 10 years ago
- Next generation LAPACK implementation for ROCm platform ☆98 · Updated last week
- DLA-Future ☆69 · Updated this week
- Distributed ranges is a generalization of C++ ranges for distributed data structures. ☆49 · Updated last week