pnnl / SHAD
Scalable High-performance Algorithms and Data-structures
☆123Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for SHAD
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆106Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆113Updated 2 months ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆25Updated last week
- Autonomic Performance Environment for eXascale (APEX)☆38Updated 3 weeks ago
- RAJA Performance Suite☆110Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆80Updated this week
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆69Updated last week
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆47Updated last week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆92Updated 2 years ago
- OpenSHMEM Application Programming Interface☆51Updated last week
- DLA-Future☆65Updated this week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆102Updated last year
- Reference implementation of the draft C++ GraphBLAS specification.☆28Updated 9 months ago
- High-performance, GPU-aware communication library☆84Updated last month
- Partitioned Global Address Space (PGAS) library for distributed arrays☆101Updated this week
- Next generation LAPACK implementation for ROCm platform☆95Updated this week
- A streamlined CMake build system foundation for developing HPC software☆263Updated 3 weeks ago
- Caliper is an instrumentation and performance profiling library☆352Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆311Updated this week
- Global Memory and Threading runtime system☆23Updated 6 months ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆33Updated 5 years ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆100Updated this week
- GraphBLAS Template Library (GBTL): C++ graph algorithms and primitives using semiring algebra as defined at graphblas.org☆131Updated last year
- The Berkeley Container Library☆120Updated last year
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆104Updated 3 months ago
- An implementation of BLAS using the SYCL open standard.☆259Updated 3 weeks ago
- SYCL Open Source Specification☆116Updated last week
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 3 years ago
- Distributed View Extension for Kokkos☆43Updated 2 months ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆89Updated 2 months ago