pnnl / SHAD
Scalable High-performance Algorithms and Data-structures
☆128Updated 2 months ago
Alternatives and similar repositories for SHAD:
Users that are interested in SHAD are comparing it to the libraries listed below
- Autonomic Performance Environment for eXascale (APEX)☆45Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆121Updated 2 months ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆107Updated this week
- RAJA Performance Suite☆118Updated this week
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆71Updated last week
- GraphBLAS Template Library (GBTL): C++ graph algorithms and primitives using semiring algebra as defined at graphblas.org☆133Updated last year
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆85Updated 2 weeks ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- Global Memory and Threading runtime system☆23Updated 10 months ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆25Updated 2 weeks ago
- Caliper is an instrumentation and performance profiling library☆371Updated this week
- Partitioned Global Address Space (PGAS) library for distributed arrays☆101Updated last week
- Barcelona OpenMP Task Suite is a collection of applications that allow to test OpenMP tasking implementations and compare its behaviour u…☆45Updated 5 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆107Updated last year
- A streamlined CMake build system foundation for developing HPC software☆269Updated 3 weeks ago
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 4 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆92Updated 2 weeks ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆49Updated this week
- High-level C++ for Accelerator Clusters☆146Updated last week
- DLA-Future☆70Updated last week
- LonestarGPU: Irregular algorithms parallelized for GPUs☆34Updated 5 years ago
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆106Updated 8 months ago
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆56Updated this week
- Reference implementation of the draft C++ GraphBLAS specification.☆30Updated last month
- Distributed View Extension for Kokkos☆45Updated 4 months ago
- OpenSHMEM Application Programming Interface☆54Updated 4 months ago
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian…☆26Updated 4 years ago
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo…☆30Updated this week
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆426Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆333Updated this week