ParRes / Kernels
This is a set of simple programs that can be used to explore the features of a parallel platform.
☆433 Updated 3 weeks ago
Alternatives and similar repositories for Kernels
Users who are interested in Kernels are comparing it to the libraries listed below.
- Information about many aspects of high-performance computing. Wiki content moved to ~/docs. ☆291 Updated last month
- Caliper is an instrumentation and performance profiling library ☆377 Updated this week
- HPCToolkit performance tools: measurement and analysis components ☆342 Updated 4 months ago
- STREAM, for lots of devices written in many programming models ☆343 Updated 9 months ago
- RAJA Performance Suite ☆117 Updated this week
- A streamlined CMake build system foundation for developing HPC software ☆268 Updated last week
- RAJA Performance Portability Layer (C++) ☆524 Updated this week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆108 Updated 2 years ago
- Partitioned Global Address Space (PGAS) library for distributed arrays ☆105 Updated 2 weeks ago
- The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information. ☆216 Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels ☆342 Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆130 Updated last week
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas… ☆224 Updated this week
- Run a parallel command inside a split tmux window ☆148 Updated 3 years ago
- A light-weight MPI profiler. ☆95 Updated 11 months ago
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d… ☆107 Updated 11 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆206 Updated last month
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆261 Updated 5 months ago
- ☆166 Updated 2 weeks ago
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template… ☆360 Updated 10 months ago
- Abstraction Library for Parallel Kernel Acceleration ☆384 Updated this week
- Advanced Profiling and Analytics for AMD Hardware ☆156 Updated this week
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability… ☆102 Updated 3 months ago
- Loop Kernel Analysis and Performance Modeling Toolkit ☆93 Updated 3 months ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric … ☆70 Updated 2 months ago
- Next generation FFT implementation for ROCm ☆195 Updated this week
- A massively-parallel, block-sparse tensor framework written in C++ ☆293 Updated last week
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms. ☆422 Updated this week
- Tickets for the MPI Forum ☆69 Updated 3 years ago
- Integrated Performance Monitoring for High Performance Computing ☆89 Updated 3 years ago