ParRes / KernelsLinks
This is a set of simple programs that can be used to explore the features of a parallel platform.
☆434Updated last month
Alternatives and similar repositories for Kernels
Users that are interested in Kernels are comparing it to the libraries listed below
Sorting:
- Information about many aspects of high-performance computing. Wiki content moved to ~/docs.☆296Updated 2 months ago
- HPCToolkit performance tools: measurement and analysis components☆342Updated 4 months ago
- RAJA Performance Portability Layer (C++)☆526Updated last week
- STREAM, for lots of devices written in many programming models☆345Updated 10 months ago
- Caliper is an instrumentation and performance profiling library☆382Updated this week
- The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information.☆216Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆345Updated last week
- RAJA Performance Suite☆118Updated this week
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆108Updated 11 months ago
- A light-weight MPI profiler.☆95Updated 11 months ago
- Data parallel C++ mathematical object library☆162Updated this week
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆207Updated 2 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆109Updated 2 years ago
- A streamlined CMake build system foundation for developing HPC software☆272Updated this week
- Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem☆331Updated last month
- Partitioned Global Address Space (PGAS) library for distributed arrays☆104Updated 2 weeks ago
- Official HPCG benchmark source code☆323Updated last year
- A massively-parallel, block-sparse tensor framework written in C++☆297Updated 2 weeks ago
- Abstraction Library for Parallel Kernel Acceleration☆385Updated this week
- ☆167Updated last month
- Integrated Performance Monitoring for High Performance Computing☆89Updated 3 years ago
- An application-focused API for memory management on NUMA & GPU architectures☆368Updated this week
- Loop Kernel Analysis and Performance Modeling Toolkit☆94Updated 3 months ago
- Run a parallel command inside a split tmux window☆150Updated 3 years ago
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms.☆423Updated last week
- High-performance, GPU-aware communication library☆86Updated 6 months ago
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template…☆361Updated 11 months ago
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆103Updated 3 months ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 6 months ago
- Next generation FFT implementation for ROCm☆195Updated last week