ParRes / Kernels
This is a set of simple programs that can be used to explore the features of a parallel platform.
☆432 · Updated this week
Alternatives and similar repositories for Kernels
Users who are interested in Kernels are comparing it to the libraries listed below.
- HPCToolkit performance tools: measurement and analysis components ☆342 · Updated 3 months ago
- Information about many aspects of high-performance computing. Wiki content moved to ~/docs. ☆291 · Updated last month
- Caliper is an instrumentation and performance profiling library ☆375 · Updated 3 weeks ago
- RAJA Performance Portability Layer (C++) ☆519 · Updated this week
- STREAM, for lots of devices written in many programming models ☆339 · Updated 9 months ago
- The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information. ☆214 · Updated this week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆108 · Updated 2 years ago
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d… ☆107 · Updated 10 months ago
- A streamlined CMake build system foundation for developing HPC software ☆268 · Updated last month
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels ☆340 · Updated this week
- A light-weight MPI profiler. ☆95 · Updated 10 months ago
- RAJA Performance Suite ☆117 · Updated last week
- Partitioned Global Address Space (PGAS) library for distributed arrays ☆105 · Updated this week
- Integrated Performance Monitoring for High Performance Computing ☆88 · Updated 3 years ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆126 · Updated 3 weeks ago
- Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem ☆326 · Updated last month
- A massively-parallel, block-sparse tensor framework written in C++ ☆292 · Updated last week
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template… ☆359 · Updated 10 months ago
- ☆165 · Updated last month
- Data parallel C++ mathematical object library ☆163 · Updated last week
- An application-focused API for memory management on NUMA & GPU architectures ☆361 · Updated last week
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability… ☆102 · Updated 2 months ago
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms. ☆418 · Updated this week
- Next generation of ADIOS developed in the Exascale Computing Program ☆285 · Updated this week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆261 · Updated 4 months ago
- Abstraction Library for Parallel Kernel Acceleration ☆383 · Updated last week
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆206 · Updated 3 weeks ago
- Loop Kernel Analysis and Performance Modeling Toolkit ☆93 · Updated 2 months ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub… ☆58 · Updated last week
- Run a parallel command inside a split tmux window ☆146 · Updated 3 years ago