pnnl / SHADLinks
Scalable High-performance Algorithms and Data-structures
☆131Updated 3 weeks ago
Alternatives and similar repositories for SHAD
Users that are interested in SHAD are comparing it to the libraries listed below
Sorting:
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆107Updated this week
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆77Updated 3 weeks ago
- Autonomic Performance Environment for eXascale (APEX)☆48Updated last month
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆130Updated last week
- Partitioned Global Address Space (PGAS) library for distributed arrays☆105Updated this week
- RAJA Performance Suite☆117Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆87Updated this week
- Global Memory and Threading runtime system☆24Updated last year
- The Berkeley Container Library☆124Updated last year
- DASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science☆159Updated 3 years ago
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆107Updated 11 months ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆27Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- A streamlined CMake build system foundation for developing HPC software☆268Updated 2 weeks ago
- OpenSHMEM Application Programming Interface☆57Updated 7 months ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆51Updated last week
- High-level C++ for Accelerator Clusters☆146Updated last week
- Caliper is an instrumentation and performance profiling library☆378Updated this week
- GraphBLAS Template Library (GBTL): C++ graph algorithms and primitives using semiring algebra as defined at graphblas.org☆133Updated 2 years ago
- GPI-2☆56Updated 11 months ago
- A unified framework across multiple programming platforms☆41Updated 3 weeks ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆93Updated 3 months ago
- DLA-Future☆75Updated last month
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆70Updated 2 months ago
- Barcelona OpenMP Task Suite is a collection of applications that allow to test OpenMP tasking implementations and compare its behaviour u…☆46Updated 5 years ago
- Reference implementation of the draft C++ GraphBLAS specification.☆33Updated 4 months ago
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆433Updated 3 weeks ago
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆344Updated this week
- The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information.☆216Updated last week
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template…☆360Updated 10 months ago